Repository files navigation Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Adversarial data evaluation
Intermediate task evaluation
Paper
Form
Purpose
Task
Github
Dataset
Note
Inoue et al. 2020
Triple
Evaluation & Training
Derivation generation
URL
R4C
based on HotpotQA
Ho et al. 2020
Triple
Evaluation & Training
Evidence generation
URL
2WikiMultiHopQA
Wolfson et al. 2020
QDMR
Training
-
URL
Break it down
based on ten datasets (e.g., HotpotQA & DROP)
Tang et al. 2021
Sub-question
Evaluation
QA about sub-questions
URL
1000 samples
based on HotpotQA
Geva et al. 2021
Sub-question
Evaluation & Training
QA about sub-questions
URL
StrategyQA
implicit questions
Ho et al. 2022
Sub-question
Evaluation & Training
QA about sub-questions
URL
HieraDate
only for comparison about Date information
Trivedi et al. 2022
Sub-question
Evaluation & Training
QA about sub-questions
URL
MuSiQue
Dalvi et al. 2021
Entailment Tree
Evaluation & Training
tree generation
URL
EntailmentBank
based on ARC and WorldTree V2
Ribeiro et al. 2023
a graph
Evaluation & Training
graph generation
URL
STREET
based on ARC, SCONE, GSM8K, AQUA-RAT, and AR-LSAT
Language skills evaluation
Training on adversarial data
Altering the training process
Utilizing intermediate tasks
About
Reasoning Shortcuts in MRC
Topics
Resources
Stars
Watchers
Forks
You can’t perform that action at this time.