Unsupervised Modeling of Twitter Conversations on ShortScience.org

www.aclweb.org
sci-hub
scholar.google.com

Unsupervised Modeling of Twitter Conversations
Ritter, Alan and Cherry, Colin and Dolan, Bill
The Association for Computational Linguistics HLT-NAACL - 2010 via Local Bibsonomy
Keywords: dblp

Summaries/Notes 1

[link] Summary by AcaWiki 9 years ago

This paper models dialog acts in Twitter conversations and presents a corpus of 1.3 million conversations. They provide a status diagram showing the likelihood of transitions between dialogue acts.

![](http://i.imgur.com/eTTVcXO.png)

### Methodology
Unsupervised LDA modelling of Twitter conversations, evaluated by held-out test conversations. Uses a conversation+topic model (segmenting post words into those that involve the topic of conversation, the dialogue act, or something else). Trained on 10,000 randomly sampled conversations (conversation length 3-6) from the corpus.

### Corpus
1.3 million conversations with each conversation containing between 2 and 243 posts. In summer 2009, they selected a random sample of Twitter users by gathering 20 randomly selected posts per minute, then queried to get all their posts. Followed any replies to collect conversations. Removed non-English conversations and non-reply posts.

Your comment:

Write your summary here (You can use $\LaTeX$ and markdown syntax):

Anon Private