A Needle in a Haystack? Harnessing Onomatopoeia and User-specific Stylometrics for Authorship Attribution of Micro-messages


The world is facing a new era in which social media communication plays a fundamental role in people’s lives. Along with irrefutable benefits, several collateral drawbacks have risen, one being the wide spread of false information with malicious intents, what is now commonly called “Fake News”. The fight against this problem is not easy, especially when taking into account the nature of text messages involved on social media platforms (a sea of small messages and myriad users). In this work, we cope with the challenging problem of authorship attribution of small text messages posted on social media platforms. Differently from what has been done with longer texts, we rely upon data-driven approaches, exploiting recent advances of deep neural networks in the field of pattern recognition. By viewing small texts usually employed in social media as unidimensional signals, we devise modern deep-learning techniques tailored for this kind of data to find the author of these posts with promising results.

IEEE International Conference on Acoustics, Speech and Signal Processing
Antônio Theóphilo
Antônio Theóphilo
Ph.D. Student

I am a Ph.D. student at the Institute of Computing/University of Campinas (UNICAMP) in the fields of Artificial Intelligence and Natural Language Processing. My research interests include Artificial Intelligence, Natural Language Processing, and Information Security.