Zero-Shot Learning

576 papers with code • 19 benchmarks • 29 datasets

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Benchmarks

Add a Result

These leaderboards are used to track progress in Zero-Shot Learning

Dataset	Best Model	Compare
MedConceptsQA	gpt-4-0125-preview	See all
CUB-200-2011	DUET	See all
SUN Attribute	SPOT (VAEGAN)	See all
AwA2	ZSL-KG	See all
Oxford 102 Flower	SPOT	See all
VOC-MLT	CLIP(ResNet-50)	See all
COCO-MLT	ResNet-50	See all
CUB-200 - 0-Shot Learning	zsl_ADA	See all
PASCAL Context	ZS3Net	See all
iVQA	FrozenBiLM	See all
SNIPS	ZSL-KG	See all
aPY - 0-Shot	ZSL-KG	See all
LSMDC	FrozenBiLM	See all
MSRVTT-QA	HiTeA	See all
MSVD-QA	HiTeA	See all
TVQA	FrozenBiLM	See all
MIT-States	CZSL	See all
ImageNet_CN	$M^2$-Encoder	See all
How2QA	SeViLA	See all

Show all 19 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Zero-Shot Learning models and implementations

mlfoundations/open_clip

3 papers

8,592

faceonlive/ai-research

3 papers

218

alibaba/EasyNLP

2 papers

1,957

sicara/easy-few-shot-learning

2 papers

923

Datasets

Subtasks

Multi-label zero-shot learning

GZSL Video Classification

Most implemented papers

Most implemented Social Latest No code

Learning Transferable Visual Models From Natural Language Supervision

openai/CLIP • • 26 Feb 2021

State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories.

Paper
Code

Language Models are Few-Shot Learners

openai/gpt-3 • NeurIPS 2020

By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do.

Paper
Code

LLaMA: Open and Efficient Foundation Language Models

facebookresearch/llama • • arXiv 2023

We introduce LLaMA, a collection of foundation language models ranging from 7B to 65B parameters.

Paper
Code

Prototypical Networks for Few-shot Learning

jakesnell/prototypical-networks • • NeurIPS 2017

We propose prototypical networks for the problem of few-shot classification, where a classifier must generalize to new classes not seen in the training set, given only a small number of examples of each new class.

Paper
Code

BioBERT: a pre-trained biomedical language representation model for biomedical text mining

dmis-lab/biobert • • 25 Jan 2019

Biomedical text mining is becoming increasingly important as the number of biomedical documents rapidly grows.

Paper
Code

Learning to Compare: Relation Network for Few-Shot Learning

floodsung/LearningToCompare_FSL • • CVPR 2018

Once trained, a RN is able to classify images of new classes by computing relation scores between query images and the few examples of each new class without further updating the network.

Paper
Code

Zero-Shot Learning -- A Comprehensive Evaluation of the Good, the Bad and the Ugly

sbharadwajj/embarrassingly-simple-zero-shot-learning • 3 Jul 2017

Due to the importance of zero-shot learning, i. e. classifying images where there is a lack of labeled training data, the number of proposed approaches has recently increased steadily.

Paper
Code