1 Department of Informatics and Mathematical Modeling, Technical University of Denmark2 Cognitive Systems, Department of Informatics and Mathematical Modeling, Technical University of Denmark3 Northeastern University4 Copenhagen Center for Health Technology, Center, Technical University of Denmark5 Harvard Medical School6 Northeastern University
Every second, the thoughts and feelings of millions of people across the world are recorded in the form of 140-character tweets using Twitter. However, despite the enormous potential presented by this remarkable data source, we still do not have an understanding of the Twitter population itself: Who are the Twitter users? How representative of the overall population are they? In this paper, we take the first steps towards answering these questions by analyzing data on a set of Twitter users representing over 1% of the U.S. population. We develop techniques that allow us to compare the Twitter population to the U.S. population along three axes (geography, gender, and race/ethnicity), and find that the Twitter population is a highly non-uniform sample of the population.
Proceedings of the Fifth International Aaai Conference on Weblogs and Social Media, 2011, p. 554-557
Main Research Area:
5th International AAAI Conference on Weblogs and Social Media (ICWSM 2011)International Conference on Weblogs and Social Media