Home
/
Authors
/
Michal Kosinski

Author

Michal Kosinski

Other affiliations: University of Cambridge

Bio: Michal Kosinski is an academic researcher from Stanford University. The author has contributed to research in topics: Personality & Big Five personality traits. The author has an hindex of 40, co-authored 80 publications receiving 9720 citations. Previous affiliations of Michal Kosinski include University of Cambridge.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Private traits and attributes are predictable from digital records of human behavior

[...]

Michal Kosinski¹, David Stillwell¹, Thore Graepel²•Institutions (2)

University of Cambridge¹, Microsoft²

09 Apr 2013-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is shown that easily accessible digital records of behavior, Facebook Likes, can be used to automatically and accurately predict a range of highly sensitive personal attributes including: sexual orientation, ethnicity, religious and political views, personality traits, intelligence, happiness, use of addictive substances, parental separation, age, and gender.

...read moreread less

Abstract: We show that easily accessible digital records of behavior, Facebook Likes, can be used to automatically and accurately predict a range of highly sensitive personal attributes including: sexual orientation, ethnicity, religious and political views, personality traits, intelligence, happiness, use of addictive substances, parental separation, age, and gender. The analysis presented is based on a dataset of over 58,000 volunteers who provided their Facebook Likes, detailed demographic profiles, and the results of several psychometric tests. The proposed model uses dimensionality reduction for preprocessing the Likes data, which are then entered into logistic/linear regression to predict individual psychodemographic profiles from Likes. The model correctly discriminates between homosexual and heterosexual men in 88% of cases, African Americans and Caucasian Americans in 95% of cases, and between Democrat and Republican in 85% of cases. For the personality trait “Openness,” prediction accuracy is close to the test–retest accuracy of a standard personality test. We give examples of associations between attributes and Likes and discuss implications for online personalization and privacy.

...read moreread less

2,232 citations

Journal Article•DOI•

Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach

[...]

H. Andrew Schwartz¹, Johannes C. Eichstaedt¹, Margaret L. Kern¹, Lukasz Dziurzynski¹, Stephanie M. Ramones¹, Megha Agrawal¹, Achal Shah¹, Michal Kosinski², David Stillwell², Martin E. P. Seligman¹, Lyle H. Ungar¹ - Show less +7 more•Institutions (2)

University of Pennsylvania¹, University of Cambridge²

25 Sep 2013-PLOS ONE

TL;DR: This represents the largest study, by an order of magnitude, of language and personality, and found striking variations in language with personality, gender, and age.

...read moreread less

Abstract: We analyzed 700 million words, phrases, and topic instances collected from the Facebook messages of 75,000 volunteers, who also took standard personality tests, and found striking variations in language with personality, gender, and age. In our open-vocabulary technique, the data itself drives a comprehensive exploration of language that distinguishes people, finding connections that are not captured with traditional closed-vocabulary word-category analyses. Our analyses shed new light on psychosocial processes yielding results that are face valid (e.g., subjects living in high elevations talk about the mountains), tie in with other research (e.g., neurotic people disproportionately use the phrase ‘sick of’ and the word ‘depressed’), suggest new hypotheses (e.g., an active life implies emotional stability), and give detailed insights (males use the possessive ‘my’ when mentioning their ‘wife’ or ‘girlfriend’ more often than females use ‘my’ with ‘husband’ or 'boyfriend’). To date, this represents the largest study, by an order of magnitude, of language and personality.

...read moreread less

1,435 citations

Journal Article•DOI•

Computer-based personality judgments are more accurate than those made by humans

[...]

Wu Youyou¹, Michal Kosinski², David Stillwell¹•Institutions (2)

University of Cambridge¹, Stanford University²

27 Jan 2015-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is shown that computers’ judgments of people’s personalities based on their digital footprints are more accurate and valid than judgments made by their close others or acquaintances, and that computer personality judgments have higher external validity when predicting life outcomes such as substance use, political attitudes, and physical health.

...read moreread less

Abstract: Judging others’ personalities is an essential skill in successful social living, as personality is a key driver behind people’s interactions, behaviors, and emotions. Although accurate personality judgments stem from social-cognitive skills, developments in machine learning show that computer models can also make valid judgments. This study compares the accuracy of human and computer-based personality judgments, using a sample of 86,220 volunteers who completed a 100-item personality questionnaire. We show that (i) computer predictions based on a generic digital footprint (Facebook Likes) are more accurate (r = 0.56) than those made by the participants’ Facebook friends using a personality questionnaire (r = 0.49); (ii) computer models show higher interjudge agreement; and (iii) computer personality judgments have higher external validity when predicting life outcomes such as substance use, political attitudes, and physical health; for some outcomes, they even outperform the self-rated personality scores. Computers outpacing humans in personality judgment presents significant opportunities and challenges in the areas of psychological assessment, marketing, and privacy.

...read moreread less

740 citations

Journal Article•DOI•

Facebook as a research tool for the social sciences: Opportunities, challenges, ethical considerations, and practical guidelines.

[...]

Michal Kosinski¹, Sandra Matz², Samuel D. Gosling³, Vesselin Popov², David Stillwell² - Show less +1 more•Institutions (3)

Stanford University¹, University of Cambridge², University of Texas at Austin³

01 Sep 2015-American Psychologist

TL;DR: This work demonstrates how to recruit participants using Facebook, incentivize them effectively, and maximize their engagement, and outlines the most important opportunities and challenges associated with using Facebook for research.

...read moreread less

Abstract: Facebook is rapidly gaining recognition as a powerful research tool for the social sciences. It constitutes a large and diverse pool of participants, who can be selectively recruited for both online and offline studies. Additionally, it facilitates data collection by storing detailed records of its users' demographic profiles, social interactions, and behaviors. With participants' consent, these data can be recorded retrospectively in a convenient, accurate, and inexpensive way. Based on our experience in designing, implementing, and maintaining multiple Facebook-based psychological studies that attracted over 10 million participants, we demonstrate how to recruit participants using Facebook, incentivize them effectively, and maximize their engagement. We also outline the most important opportunities and challenges associated with using Facebook for research, provide several practical guidelines on how to successfully implement studies on Facebook, and finally, discuss ethical considerations.

...read moreread less

709 citations

Journal Article•DOI•

Automatic personality assessment through social media language.

[...]

Gregory Park¹, H. Andrew Schwartz¹, Johannes C. Eichstaedt¹, Margaret L. Kern¹, Michal Kosinski², David Stillwell², Lyle H. Ungar¹, Martin E. P. Seligman¹ - Show less +4 more•Institutions (2)

University of Pennsylvania¹, University of Cambridge²

31 May 2015-Journal of Personality and Social Psychology

TL;DR: Results indicated that language-based assessments can constitute valid personality measures: they agreed with self-reports and informant reports of personality, added incremental validity over informant reports, adequately discriminated between traits, and were stable over 6-month intervals.

...read moreread less

Abstract: Language use is a psychologically rich, stable individual difference with well-established correlations to personality. We describe a method for assessing personality using an open-vocabulary analysis of language from social media. We compiled the written language from 66,732 Facebook users and their questionnaire-based self-reported Big Five personality traits, and then we built a predictive model of personality based on their language. We used this model to predict the 5 personality factors in a separate sample of 4,824 Facebook users, examining (a) convergence with self-reports of personality at the domain- and facet-level; (b) discriminant validity between predictions of distinct traits; (c) agreement with informant reports of personality; (d) patterns of correlations with external criteria (e.g., number of friends, political attitudes, impulsiveness); and (e) test-retest reliability over 6-month intervals. Results indicated that language-based assessments can constitute valid personality measures: they agreed with self-reports and informant reports of personality, added incremental validity over informant reports, adequately discriminated between traits, exhibited patterns of correlations with external criteria similar to those found with self-reported personality, and were stable over 6-month intervals. Analysis of predictive language can provide rich portraits of the mental life associated with traits. This approach can complement and extend traditional methods, providing researchers with an additional measure that can quickly and cheaply assess large groups of participants with minimal burden.

...read moreread less

528 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18

Collapse

Cited by

PDF

Open Access

More filters

Pattern Recognition and Machine Learning

[...]

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

...read moreread less

10,141 citations

How to Do Things With Words

[...]

Csr Young

01 Jan 2009

7,241 citations

Social Network Analysis

[...]

Tom A. B. Snijders

01 Jan 2012

3,692 citations