Browse State-of-the-Art
Datasets
Methods
More
Newsletter
RC2022
About
Trends
Portals
Libraries
Sign In
Subscribe to the PwC Newsletter
×
Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets.
Read previous issues
Join the community
×
You need to
log in
to edit.
You can
create a new account
if you don't have one.
Browse SoTA
> Reasoning
Reasoning
127 benchmarks • 68 tasks • 182 datasets • 4248 papers with code
Classification
Classification
326 benchmarks
3257 papers with code
Text Classification
180 benchmarks
1106 papers with code
Graph Classification
69 benchmarks
382 papers with code
Audio Classification
23 benchmarks
134 papers with code
Medical Image Classification
8 benchmarks
124 papers with code
See all 19 tasks
Question Answering
Question Answering
226 benchmarks
2919 papers with code
Open-Ended Question Answering
209 papers with code
Open-Domain Question Answering
15 benchmarks
199 papers with code
Conversational Question Answering
1 benchmark
62 papers with code
Answer Selection
6 benchmarks
47 papers with code
See all 19 tasks
Decision Making
Decision Making
1 benchmark
2078 papers with code
Imitation Learning
523 papers with code
Natural Language Inference
Natural Language Inference
43 benchmarks
735 papers with code
Answer Generation
2 benchmarks
57 papers with code
Visual Entailment
3 benchmarks
28 papers with code
Cross-Lingual Natural Language Inference
4 benchmarks
16 papers with code
Logical Reasoning
Navigate
411 papers with code
Logical Reasoning
19 benchmarks
184 papers with code
Novel Concepts
51 papers with code
Temporal Sequences
51 papers with code
StrategyQA
13 papers with code
See all 23 tasks
Multi-Label Classification
Multi-Label Classification
36 benchmarks
376 papers with code
Missing Labels
40 papers with code
Extreme Multi-Label Classification
29 papers with code
Hierarchical Multi-label Classification
19 benchmarks
15 papers with code
Medical Code Prediction
7 benchmarks
15 papers with code
General Reinforcement Learning
Offline RL
2 benchmarks
226 papers with code
Model-based Reinforcement Learning
195 papers with code
Conformal Prediction
151 papers with code
Text Simplification
11 benchmarks
119 papers with code
Music Source Separation
3 benchmarks
53 papers with code
Decision Making Under Uncertainty
45 papers with code
Audio Source Separation
8 benchmarks
44 papers with code
See all 9 tasks
Common Sense Reasoning
Common Sense Reasoning
37 benchmarks
257 papers with code
Physical Commonsense Reasoning
1 benchmark
6 papers with code
Riddle Sense
2 benchmarks
5 papers with code
Winowhy
4 papers with code
Anachronisms
3 papers with code
See all 16 tasks
Visual Reasoning
Visual Reasoning
19 benchmarks
215 papers with code
Visual Commonsense Reasoning
7 benchmarks
29 papers with code
Program Synthesis
Program Synthesis
10 benchmarks
139 papers with code
Type prediction
3 benchmarks
41 papers with code
Program Repair
3 benchmarks
34 papers with code
Value prediction
1 benchmark
16 papers with code
Enumerative Search
5 papers with code
See all 6 tasks
Mathematical Reasoning
Mathematical Reasoning
20 benchmarks
118 papers with code
Math Word Problem Solving
11 benchmarks
63 papers with code
Formal Logic
1 benchmark
11 papers with code
Geometry Problem Solving
8 papers with code
Abstract Algebra
1 benchmark
3 papers with code
See all 8 tasks
Video Question Answering
Video Question Answering
35 benchmarks
155 papers with code
Zero-Shot Video Question Answer
12 benchmarks
34 papers with code
Few-shot Video Question Answering
1 papers with code
Multi-Label Learning
Multi-Label Learning
1 benchmark
83 papers with code
Missing Labels
40 papers with code
Mathematical Proofs
Automated Theorem Proving
10 benchmarks
70 papers with code
Mathematical Proofs
10 benchmarks
17 papers with code
Arithmetic Reasoning
Arithmetic Reasoning
2 benchmarks
71 papers with code
Math Word Problem Solving
Math Word Problem Solving
11 benchmarks
63 papers with code
Mathematical Question Answering
Math Word Problem Solving
11 benchmarks
63 papers with code
Systematic Generalization
Systematic Generalization
62 papers with code
Program Repair
Program Repair
3 benchmarks
34 papers with code
Fault localization
15 papers with code
Variable misuse
9 papers with code
Exception type
2 papers with code
Function-docstring mismatch
1 papers with code
See all 7 tasks
Video-based Generative Performance Benchmarking
Video-based Generative Performance Benchmarking (Contextual Understanding)
1 benchmark
11 papers with code
Video-based Generative Performance Benchmarking (Consistency)
1 benchmark
10 papers with code
Video-based Generative Performance Benchmarking (Correctness of Information)
1 benchmark
10 papers with code
Video-based Generative Performance Benchmarking (Detail Orientation))
1 benchmark
10 papers with code
Video-based Generative Performance Benchmarking (Temporal Understanding)
1 benchmark
10 papers with code
Decision Making Under Uncertainty
Decision Making Under Uncertainty
45 papers with code
Uncertainty Visualization
3 papers with code
Multimodal Reasoning
Multimodal Reasoning
3 benchmarks
38 papers with code
Natural Language Visual Grounding
Natural Language Visual Grounding
16 papers with code
Generative Visual Question Answering
Video-based Generative Performance Benchmarking
6 benchmarks
15 papers with code
Discrete Choice Models
Discrete Choice Models
14 papers with code
Causal Identification
Causal Identification
12 papers with code
Odd One Out
Odd One Out
1 benchmark
10 papers with code
Geometry Problem Solving
Geometry Problem Solving
8 papers with code
Autonomous Navigation
Sequential Place Recognition
5 papers with code
Autonomous Flight (Dense Forest)
1 benchmark
1 papers with code
Autonomous Web Navigation
Abstract Argumentation
Abstract Argumentation
4 papers with code
Analogical Similarity
Analogical Similarity
1 benchmark
4 papers with code
Theory of Mind Modeling
Theory of Mind Modeling
4 papers with code
Anachronisms
Anachronisms
3 papers with code
Human Judgment Correlation
Human Judgment Correlation
2 benchmarks
3 papers with code
Human Judgment Classification
Human Judgment Classification
1 benchmark
2 papers with code
Identify Odd Metapor
Identify Odd Metapor
1 benchmark
2 papers with code
Commonsense Reasoning for RL
Commonsense Reasoning for RL
1 benchmark
1 papers with code
Pre-election ratings estimation
Pre-election ratings estimation
1 papers with code