Making Topics More Human(e)

Jordan Boyd-Graber
Assistant Professor, School of Information Studies and Institute for Advanced Computer Studies (UMIACS), University of Maryland

Imagine you need to get the gist of what’s going on in a large text dataset, such as all tweets that mention Obama, all e-mails sent within a company, or all newspaper articles published by The New York Times in the 1990s. Topic models, which automatically discover the themes that permeate a corpus, are a popular tool for finding out what’s being discussed. Topic models aren’t perfect, however: their errors hamper adoption of the model, performance in downstream computational tasks, and human understanding of the data. Fortunately, humans can easily diagnose and fix these errors. We present a statistically sound model that incorporates hints and suggestions from humans to iteratively refine topic models so that they better model large datasets. We also examine how topic models can be used to understand topic control in debates and discussions. We demonstrate a technique that identifies when speakers are “controlling” the topic of a conversation, which can reveal events such as participants in a debate not answering a question, pundits steering a conversation toward talking points, or a moderator exerting her influence on a discourse.

A continuously updated schedule of talks is also available on the Digital Dialogues page.

Unable to attend the events in person? Archived podcasts can be found on the MITH website, and you can follow our Digital Dialogues Twitter account @digdialog as well as the Twitter hashtag #mithdd to keep up with live tweets from our sessions. Viewers can watch the live stream as well.

All talks are free and open to the public. Attendees are welcome to bring their own lunches.

Contact: MITH (mith.umd.edu, mith@umd.edu, 301.405.8927).