Machine Learning Models that Predict Mental Health Status on Twitter and Their Privacy Implications

Presented at ShmooCon XV (2019), Jan. 18, 2019, 4:30 p.m. (30 minutes).

Recent studies have shown that machine learning can be used to identify individuals with mental illnesses by analyzing their social media posts. These findings open up various possibilities in mental health research and early detection of mental illnesses. However, they also raise numerous privacy concerns. Our results show that machine learning can be used to make predictions even if the individuals do not actively talk about their mental illness on social media. In order to fully understand the implications of these findings, we need to analyze the features that make these predictions possible. We analyze bag of words, word clusters, part of speech n-grams and topic models to understand the machine learning model and to discover language patterns that differentiate individuals with mental illnesses from a control group. This analysis confirmed some of the known language patterns and uncovered several new patterns. We then discuss the possible applications of machine learning to identify mental illnesses, the feasibility of such applications, and associated privacy implications. We then suggest mitigating steps that can be taken by policymakers, social media platforms, and users.


Presenters:

  • Janith Weerasingh
    Janith Weerasinghe is a Doctoral Candidate at Drexel University working on machine learning and privacy. He has worked on identifying mental illnesses by analyzing the language use of individuals, and on understanding manipulation of social media curation algorithms. He recently spoke at HotPETS 2018.
  • Rachel Greenstadt
    Dr. Rachel Greenstadt (@ragreens) is an Associate Professor of Computer Science and Engineering at NYU (formerly Drexel University). She has a history of speaking at hacker conferences including DEF CON 14 and 26, ShmooCon 2009, 31C3, and 32C3.

Similar Presentations: