Summaries from Empirical Methods on Natural Language Processing (EMNLP) on ShortScience.org

aclanthology.info
sci-hub
scholar.google.com

Neural Sequence-Labelling Models for Grammatical Error Correction
Yannakoudakis, Helen and Rei, Marek and Andersen, Øistein E. and Yuan, Zheng
Empirical Methods on Natural Language Processing (EMNLP) - 2017 via Local Bibsonomy
Keywords: dblp

[link] Summary by Marek Rei 7 years ago

Using error detection to improve error correction. A neural sequence labeling model is used to find correctness probabilities for every token, which are then used to rerank possible correction candidates. The process consistently improves the performance of different correction systems.

https://i.imgur.com/DMkotr6.png

aclanthology.info
sci-hub
scholar.google.com

Grasping the Finer Point: A Supervised Similarity Network for Metaphor Detection
Rei, Marek and Bulat, Luana and Kiela, Douwe and Shutova, Ekaterina
Empirical Methods on Natural Language Processing (EMNLP) - 2017 via Local Bibsonomy
Keywords: dblp

[link] Summary by Marek Rei 7 years ago

A specialised architecture for detecting metaphorical phrases. Uses a gating mechanism to condition one word based on the other, a neural version of weighted cosine similarity to make a prediction and hinge loss to optimise the model. Achieves high results on detecting metaphorical adjective-noun, verb-object and verb-subject phrases.

https://i.imgur.com/p3zyCcJ.png

aclanthology.info
sci-hub
scholar.google.com

Learning how to Active Learn: A Deep Reinforcement Learning Approach
Fang, Meng and Li, Yuan and Cohn, Trevor
Empirical Methods on Natural Language Processing (EMNLP) - 2017 via Local Bibsonomy
Keywords: dblp

[link] Summary by Marek Rei 7 years ago

Active learning (choosing which examples to annotate for training) is proposed as a reinforcement learning problem. The Q-learning network predicts for each sentence whether it should be annotated, and is trained based on the performance improvement from the main task. Evaluation is done on NER, with experiments on transferring the trained Q-learning function to other languages.

https://i.imgur.com/5rXm5vZ.png

arxiv.org
scholar.google.com

Sentence Simplification with Deep Reinforcement Learning
Zhang, Xingxing and Lapata, Mirella
arXiv e-Print archive - 2017 via Local Bibsonomy
Keywords: dblp

[link] Summary by Udibr 7 years ago

* Output can contain several sentences, that are considered as a single long sequence. 
* Seq2Seq+attention:
  * Oddly they use the formula used by Bahdanau attention weights to combine the weighted attention $c_t$ with the decoder output $h_t^T =  W_0 \tanh \left( U_h h_t^T + W_h c_t \right) $ while the attention weights are computed with softmax over dot product between encoder and decoder outputs $h_t^T \cdot h_i^S$
  * Glove 300
  * 2 layer LSTM 256
* RL model
  * Reward=Simplicity+Relevance+Fluency = $\lambda^s r^S + \lambda^R r^R + \lambda^F r^F$
    * $r^S = \beta \text{SARI}(X,\hat{Y},Y) + (1-\beta) \text{SARI}(X,Y,\hat{Y})$
    * $r^R$ cosine of output of RNN auto encoder run on input and a separate auto encoder run on output
    * $r^F$ perplexity of LM trained on output
  * Learning exactly as in [MIXER](https://arxiv.org/abs/1511.06732)
* Lexical Simplification model: they train a second model $P_{LS}$ which uses pre-trained attention weights and then use the weighted output of an encoder LSTM as the input to a softmax