A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task
Chen, Danqi and Bolton, Jason and Manning, Christopher D.
Association for Computational Linguistics - 2016 via Local Bibsonomy
Hermann et al. (2015) created a dataset for testing reading comprehension by extracting the summarised bullet points that accompany CNN and Daily Mail news articles. All the entities in the text are anonymised, and one entity in each bullet point is replaced by an empty slot. The task is to fill that slot with the correct entity, based on the news article.
https://i.imgur.com/qeJATKq.png
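The construction described above can be illustrated with a minimal sketch. It assumes the entity list is already known and uses plain string matching; the function names (`anonymise`, `make_cloze`, `candidates`) and the `@entityN` / `@placeholder` markers are illustrative choices for this example, not the dataset's actual pipeline.

```python
import re


def anonymise(article, bullet, entities):
    """Replace every entity mention with an @entityN marker in both texts."""
    mapping = {name: f"@entity{i}" for i, name in enumerate(entities)}
    # Replace longer names first so a short name cannot split a longer one.
    for name in sorted(mapping, key=len, reverse=True):
        pattern = re.compile(re.escape(name))
        article = pattern.sub(mapping[name], article)
        bullet = pattern.sub(mapping[name], bullet)
    return article, bullet, mapping


def make_cloze(bullet, answer_marker):
    """Hide one entity in an anonymised bullet point to form the question."""
    # The lookahead stops "@entity1" from matching the start of "@entity10".
    pattern = re.escape(answer_marker) + r"(?!\d)"
    return re.sub(pattern, "@placeholder", bullet, count=1)


def candidates(article):
    """List the distinct entity markers in the article, in order of appearance."""
    return list(dict.fromkeys(re.findall(r"@entity\d+", article)))


# Toy data, invented for this example.
article = "Alice Smith met Bob Jones in Paris . Bob Jones praised the city ."
bullet = "Alice Smith visits Paris to meet Bob Jones"
entities = ["Alice Smith", "Bob Jones", "Paris"]

anon_article, anon_bullet, mapping = anonymise(article, bullet, entities)
question = make_cloze(anon_bullet, mapping["Bob Jones"])
options = candidates(anon_article)
```

Here the model would see `anon_article` and `question`, and would have to choose the hidden entity from `options`. Because the markers carry no meaning of their own, the answer has to come from the article rather than from world knowledge.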
The authors hand-reviewed 100 samples from the dataset and conclude that around 25% of the questions are difficult or impossible to answer even for a human, mostly due to the anonymisation process. They also present a simple classifier that achieves unexpectedly good results, and an attention-based neural network that beats all previous results by a substantial margin.
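To give a rough idea of how attention can be used to select an answer entity, here is a generic sketch rather than the paper's exact architecture. It assumes the question and each passage token have already been encoded as vectors, and it uses a plain dot product as the similarity score. The scoring function, the toy vectors and the names `attend` and `pick_entity` are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1 (numerically stable)."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def attend(question_vec, token_vecs):
    """Weight each passage token by its similarity to the question."""
    weights = softmax([dot(question_vec, t) for t in token_vecs])
    dim = len(question_vec)
    # The summary is the attention-weighted average of the token vectors.
    summary = [
        sum(w * t[d] for w, t in zip(weights, token_vecs)) for d in range(dim)
    ]
    return weights, summary


def pick_entity(summary, entity_vecs):
    """Return the candidate entity whose vector best matches the summary."""
    return max(entity_vecs, key=lambda name: dot(summary, entity_vecs[name]))


# Toy vectors, invented for this example.
question_vec = [1.0, 0.0]
token_vecs = [[2.0, 0.0], [0.0, 2.0], [0.0, 0.0]]
entity_vecs = {"@entity0": [1.0, 0.0], "@entity1": [0.0, 1.0]}

weights, summary = attend(question_vec, token_vecs)
answer = pick_entity(summary, entity_vecs)
```

In this toy setup the first token is the most similar to the question, so it receives the largest weight and pulls the summary towards `@entity0`. In a trained model the vectors and the scoring function would be learned from the data.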