Unsupervised aspect extraction from free-form conversations

Published in SIGKDD Workshop on Issues of Sentiment Discovery and Opinion Mining, 2017

Aspect-based sentiment analysis on forum data can produce a wealth of knowledge due to the massive and free-form nature of the discussions involved. Existing works in aspect extraction for sentiment analysis include: 1) simple frequency count of noun phrases relying on labelled datasets for learning supervised models, and 2) topic modelling trained on large unlabelled datasets requiring tuning of complex parameters. Our goal is to efficiently and effectively extract aspects (features and attributes) of certain entities (products or brands) from massive heterogenous collections of user-generated free-form conversations. We construct an aspect dictionary in three steps: 1) first we extract candidate aspects using simple lexico-syntactic patterns that capture the “aspect-of” relation between a noun phrase and a mention of an entity; 2) next, we filter the candidates by drawing on an automatically-compiled commonness blacklist, as well as a neighbourhood-based measure of aspecthood; and 3) lastly, we expand the dictionary to increase coverage using a variety of simple techniques. When compared to state-of-the-art methods for aspect extraction, our method is capable of efficiently constructing an extremely compact aspect dictionary (98% more compact) with comparable performance.

Recommended citation: En-Shiun Annie Lee, Richie Wenjie Zi, Afsaneh Fazly, Brandon Seibel, & Anderson de Andrade. (2017). "Unsupervised aspect extraction from free-form conversations." SIGKDD Workshop on Issues of Sentiment Discovery and Opinion Mining.
Download Paper | Download Bibtex