scispace - formally typeset
Search or ask a question
Proceedings ArticleDOI

Predicting Personality from Twitter

TL;DR: This paper presents a method by which a user's personality can be accurately predicted through the publicly available information on their Twitter profile, and the implications this has for social media design, interface design, and broader domains.
Abstract: Social media is a place where users present themselves to the world, revealing personal details and insights into their lives. We are beginning to understand how some of this information can be utilized to improve the users' experiences with interfaces and with one another. In this paper, we are interested in the personality of users. Personality has been shown to be relevant to many types of interactions, it has been shown to be useful in predicting job satisfaction, professional and romantic relationship success, and even preference for different interfaces. Until now, to accurately gauge users' personalities, they needed to take a personality test. This made it impractical to use personality analysis in many social media domains. In this paper, we present a method by which a user's personality can be accurately predicted through the publicly available information on their Twitter profile. We will describe the type of data collected, our methods of analysis, and the machine learning techniques that allow us to successfully predict personality. We then discuss the implications this has for social media design, interface design, and broader domains.

Content maybe subject to copyright    Report

Predicting Personality from Twitter
Jennifer Golbeck
, Cristina Robles
, Michon Edmondson
, and Karen Turner
University of Maryland;{jgolbeck,crobles,michonk8,kturner}@umd.edu
Abstract—Social media is a place where users present them-
selves to the world, revealing personal details and insights into
their lives. We are beginning to understand how some of this
information can be utilized to improve the users’ experiences with
interfaces and with one another. In this paper, we are interested
in the personality of users. Personality has been shown to be
relevant to many types of interactions; it has been shown to be
useful in predicting job satisfaction, professional and romantic
relationship success, and even preference for different interfaces.
Until now, to accurately gauge users’ personalities, they needed to
take a personality test. This made it impractical to use personality
analysis in many social media domains. In this paper, we present
a method by which a user’s personality can be accurately
predicted through the publicly available information on their
Twitter profile. We will describe the type of data collected, our
methods of analysis, and the machine learning techniques that
allow us to successfully predict personality. We then discuss the
implications this has for social media design, interface design,
and broader domains.
Index Terms—personality, social media
I. INTRODUCTION
Social networking on the web has grown dramatically over
the last decade. In January 2005, a survey of social networking
websites estimated that among all sites on the web there
were roughly 115 million members [14]. Just over ve years
later, Twitter alone has exceeded 200 million members. In the
process of creating social networking profiles, users reveal a
lot about themselves both in what they share and how they
say it. Through self-description, status updates, photos, and
interests, much of a user’s personality comes out through their
profile.
For decades, psychology researchers have worked to un-
derstand personality in a systematic way. After extensive
work to develop and validate a widely accepted personality
model, researchers have shown connections between general
personality traits and many types of behavior. Relationships
have been discovered between personality and psychological
disorders [42], job performance [4] and satisfaction [24], and
even romantic success [46].
This paper attempts to bridge the gap between social media
and personality research by using the information people
reveal in their online profiles. Our core research question asks
whether social media profiles can predict personality traits. If
so, then there is an opportunity to integrate the many results
on the implications of personality factors and behavior into the
users’ online experiences and to use social media profiles as
a source of information to better understand individuals. For
example, the friend suggestion system could be tailored to a
user based on whether they are more introverted or extraverted.
Previous work has shown that the information in users’
Facebook profiles is reflective of their actual personalities, not
an “idealized” version of themselves [3]. We expect Twitter
to have similar characteristics, and that plus a broad user base
of 200 million people makes it an ideal platform for study.
We administered the Big Five Personality Inventory to 279
subjects through a Twitter application. In the process, we
gathered their 2000 most recent public Twitter posts (tweets).
This was aggregated, quantified, and passed through a text
analysis tool to obtain a feature set. Using these statistics, we
were able to develop a model that can predict personality on
each of the five personality factors to within between 11% and
18% of the actual values.
The ability to predict personality has implications in many
areas. Existing research has shown connections between per-
sonality traits and success in both professional and personal re-
lationships. Social media tools that seek to support these rela-
tionships could benefit from personality insights. Additionally,
previous work on personality and interfaces showed that users
are more receptive to and have greater trust in interfaces and
information that is presented from the perspective of their own
personality features (i.e. introverts prefer messages presented
from an introvert’s perspective). If a user’s personality can be
predicted from their social media profile, online marketing and
applications can use this to personalize their message and its
presentation.
We begin by presenting background on the Big Five Per-
sonality index and related work on personality and social
media. We then present our experimental setup and methods
for analyzing and quantifying Twitter profile information. To
understand the relationship between personality and social
media profiles, we present results on correlations between each
profile feature and personality factor. Based on this, we de-
scribe the machine learning techniques used for classification
and show how we achieve large and significant improvements
over baseline classification on each personality factor. We
conclude with a discussion of the implications that this work
has for social media websites and for organizations that may
utilize social media to better understand the people with whom
they interact.
II. B
ACKGROUND AND RELATED WORK
A. The Big Five Personality Inventory
The “Big Five” model of personality dimensions has
emerged as one of the most well-researched and well-regarded
measures of personality structure in recent years. The models
five domains of personality, Openness, Conscientiousness,
extroversion, Ageeableness, and Neuroticism, were conceived
2011 IEEE International Conference on Privacy, Security, Risk, and Trust, and IEEE International Conference on Social Computing
978-0-7695-4578-3/11 $26.00 © 2011 IEEE
DOI
149

Fig. 1: A person has scores for each of the five personality
factors. Together, the five factors represent an individual’s
personality.
by Tupes and Christal [47] as the fundamental traits that
emerged from analyses of previous personality tests [29].
McCrae & Costa [28] and John [21] continued five-factor
model research and consistently found generality across age,
gender, and cultural lines [29]. Additional research has proved
that different tests, languages, and methods of analysis do
not alter the models validity [29], [10], [21], [27]. Such
extensive research has led to many psychologists to accept the
Big Five as the current definitive model of personality [43],
[34]. It should be noted that the models dependence on trait
terms indicates that the Big Five traits are based on a lexical
approach to personality measurement [43], [9], [10], [16]. The
Big Five traits are characterized by the following:
Openness to Experience: curious, intelligent, imaginative.
High scorers tend to be artistic and sophisticated in taste
and appreciate diverse views, ideas, and experiences.
Conscientiousness: responsible, organized, persevering.
Conscientious individuals are extremely reliable and tend
to be high achievers, hard workers, and planners.
extroversion: outgoing, amicable, assertive. Friendly and
energetic, extroverts draw inspiration from social situa-
tions.
Agreeableness: cooperative, helpful, nurturing. People
who score high in agreeableness are peace-keepers who
are generally optimistic and trusting of others.
Neuroticism: anxious, insecure, sensitive. Neurotics are
moody, tense, and easily tipped into experiencing negative
emotions.
B. Applications of the Big Five
Much work has been done with personality as it relates to
our lives and the choices we make. In terms of relationships
with others, many relationships have been identified. Personal-
ity type is linked to whom users choose to friend on Facebook.
[45] found that extraversion, agreeableness, and openness all
correlated with friendship selection. Personality features have
also been tied to many aspects of romantic relationships,
including partner choice, level of attachment and success
[8], [46]. In terms of interpersonal conflict, studies have
associated Big Five traits with coping responses, vengefulness,
and rumination [32],[5]. Social relationships aside, personality
also relates to preferences. Rentfrow and Gosling [39] is one
of many studies that found that personality is a factor that
relates to the music an individual prefers to listen to. Jost et
al. [23] also found that the personality type of an individual
was able to predict whether they would be more likely to
vote for McCain or Obama in 2008. Research has also found
personality differences between self-professed “dog people”
and “cat people” [37], [17]. Within the context of marketing
and advertising, Big Five personality traits have been shown to
accurately predict a consumers preference for national brands
or independent brands [48]. Studies like this show a promising
future for the integration of personality analysis and consumer
profiling.
Many studies have demonstrated the usefulness of person-
ality profiles within the professional context. Hodgkinson and
Ford [20] found that personality traits affect job performance
and satisfaction, and Barrick and Mount [4] correlated specific
traits with occupational choices and proficiency. Big Five
dimensions have proved valid predictors for team performance
[31], counterproductive behaviors [41], and entrepreneurial
status [49], among many other factors. [6] also revealed rela-
tionships between personality and behavior among managers,
and Barrick and Mount found recurring personality profiles
among both high-autonomy and low-autonomy positions in
the workforce [5].
In the space of Human-Computer Interaction, one of the
pioneering studies on the connection between personality and
interface preference was presented in [30]. Users listened to
audio readings of ve book reviews which were written from
the perspective of introverts vs. extroverts. Subjects were able
to identify the personality differences between the reviews and
showed an attraction to those which were closest to their own
personality type. When the personality type matched, subjects
were even more likely to buy the book being reviewed.
This work was extended into ideas of Graphical User
Interface design in [25]. Different GUIs were developed to
represent introverted vs. extroverted personality types. As in
[30], subjects could identify the personality differences and
preferred the interface that matched their own personality type.
C. Personality Research and Social Media
To the best of our knowledge, our work is among the first to
look at the relationship between profile information provided
in social networks and personality traits. However, there have
150

Fig. 2: Average scores on each personality trait shown with
standard deviation bars.
been a few previous studies on how personality relates to social
networking more generally.
It has been shown in [40] that extroversion and consci-
entiousness positively correlate with the perceived ease of
use of social media websites. extroversion was also shown
to have a positive correlation with perceived usefulness of
such sites. Not surprisingly, extroversion was also shown to
correlate with the size of a user’s social network in several
studies [2], [44], [45]. There have also been mixed results for
other personality traits. Work in [45] showed that individuals
with high agreeableness scores were selected more often as
friends and that people tended to choose friends with similar
agreeableness, extroversion, and openness scores. This was
not repeated in [44], but a correlation between openness and
number of friends.
III. D
ATA COLLECTION
We created a Twitter application with two functions. First, it
administered a 45-question version of the Big Five Personality
Inventory [22] to users. Subjects would take the test and for
each, we collected the most recent 2,000 tweets from the user
(or all tweets if they had less than 2,000).
We had fifty subjects who were recruited through posts on
Twitter, Facebook, and relevant mailing lists. Twitter does not
collect or release demographic information about its users and,
since we would have no general baseline for comparison, we
did not collect it for our subjects.
Average scores on the personality test are shown in figure
2 and in table I.
For each user, we began by collecting a simple set of
statistics about their accounts and their tweets. These included
the following:
Number of followers (people following the user)
Number of following (people the user follows)
Density of the social network
Number of “@mentions” - An @mention is when a user
mentions the name of another user by adding an @ to
the front of the username, as is convention on Twitter
Number of replies - Using the Twitter API, we could see
how many of the user’s tweets were direct replies to other
user’s tweets.
Number of hashtags - Hashtags (e.g. #cscw2012) are a
way of tagging a tweet to be part of a given topic or
event. They are also used in “games” where users come
up with tweets to go with a tag (e.g. #firstdraftmovielines
is used with altered first movie lines created by users).
Number of links
Words per tweet
For the number of @mentions, replies, hashtags, and links,
we used the raw numbers and the average per tweet.
Our primary analysis was a basic processing of the text of
the tweets. This was done by merging the collected tweets for
a given user into a single “document” and analyzing that.
Previous research has shown that linguistic features can be
used to predict personality traits [26], [36]. . Data collected in
[36] was used in both studies. They had three separate sources
of text, ranging from an average of 1,770 words to over 5,000
words per person.
There is potential to apply these linguistic analysis methods
to help predict personality by analyzing a person’s tweets.
However, the text samples used in earlier studies are much
larger than are available to us through any twitter posting.
Aggregating many tweets from a user gives more information,
but as a series of disconnected statements rather than a
coherent document as was used in other studies. Thus, it is
unclear if Twitter text will be as connected to personality
as was the case in other work. Tweets are much different
sources of text. Each one is limited to 140 characters, and
a compilation of tweets from a given user is more a stream
of disjointed thoughts than a coherent narrative as is found in
the text used in previous personality studies. Thus, it was not
entirely clear whether tweets would be a useful source of data
for this type of analysis.
There were an average of 1914 words per user, and the
distribution is shown in figure 3. The number of words ranged
from 50 to 5724. These came from an average of 142.2 tweets,
with one using having a maximum of 350 tweets and another
with a minimum of 4.
Following the methods used in [26], [36] as well as other
studies of social media behavior, such as [13], we utilized
two main tools to analyze the content of users’ tweets. The
first is that Linguistic Inquiry and Word Count (LIWC) tool
[35]. LIWC produces statistics on 81 different features of
text in five categories. These include Standard Counts (word
count, words longer than six letters, number of prepositions,
etc.), Psychological Processes (emotional, cognitive, sensory,
and social processes), Relativity (words about time, the past,
the future), Personal Concerns (such as occupation, financial
issues, health), and Other dimensions (counts of various types
of punctuation, swear words). We excluded the Standard
Counts and Other Dimension features to eliminate what is
likely to be noise on the type of text we have. The exceptions
are that we included word count, words per sentence, and
swear word counts since these reflect verbosity and tone of
151

Fig. 3: Number of words per user.
the user. For the other three categories, the values are given
as the percentage of words in the input that match words in a
given category. For example, it counts the number of “social”
words such as “talk”, “us”, and “friend”, or “anxiety” words
like “nervous”, “afraid”, and “tense”. Correlations between
these features and personality traits (e.g. anxiety words and
neuroticism scores) would not be surprising. This produced
79 text features.
In addition, we ran the text again the MRC Psycholinguistic
Database, a list of over 150,000 words with linguistic and
psycholinguistic features of each word. These include: Kucera-
Francis written frequency, number of categories, and num-
ber of samples; Brown verbal frequency; Familiarity rating;
Meaningfulness via Colorado norms and via Paivio Norms;
Concreteness; age of acquisition; Thorndike-Lorge written
frequency; and the number of letters, phonemes, and syllables.
We computed the average non-zero score for each feature over
all the words from each user.
In addition, we performed a word by word sentiment anal-
ysis of each user’s tweets. Using the General Inquirer dataset
[1], which provides a hand annotated dictionary that assigns
words sentiment values on a -1 to +1 scale, we computed a
score for each user that was the average sentiment score for
all words used in their list of tweets.
IV. P
ERSONALITY AND TWITTER BEHAVIOR
CORRELATIONS
We began by running a Pearson correlation analysis between
subjects’ personality scores and each of the features obtained
from analyzing their tweets and public account data. These are
shown in table II.There are a number of significant correlations
here, however none of them are strong enough to directly
predict any personality trait. Correlations that were statistically
significant for p<0.05 are bolded.
Many of the correlations make intuitive sense. For example,
conscientiousness is negatively correlated with words about
death (e.g. “bury”, “coffin”, “kill”) and with negative emotions
and sadness, suggesting conscientious people tend to talk less
about unhappy subjects. At the same time, the trait is positively
Fig. 4: Features used for predicting personality.
TABLE I: Average scores on each personality factor on a
normalized 0-1 scale
Agree. Consc. Extra. Neuro. Open.
Average 0.697 0.617 0.586 0.428 0.755
Stdev 0.162 0.176 0.190 0.224 0.147
correlated with the use of “you”, indicating the same people
tend to talk about or to others. Agreeable people also tend to
use “you” a lot, but are less likely to talk about achievements
and money.
However, there are not such intuitive explanations for other
correlations. For example, the number of parentheses used is
negatively correlated with both extraversion and openness. It
is unclear why this is the case, or if these are perhaps falsely
significant data points. However, since our focus in this paper
is on predicting personality rather than on focusing on any
particular correlation, we do not assign much weight to any
of these connections. A space of future work would be to
probe more deeply into these correlations over a larger data
set.
V. P
REDICTING PERSONALITY
To predict the score of a given personality feature, we
performed a regression analysis in Weka [18]. We used two
regression algorithms: Gaussian Process and ZeroR, each with
a 10-fold cross-validation with 10 iterations. Two algorithms
had similar performance over the personality features. Results
are shown in table III.
We found that Openness was the easiest to compute and
neuroticism was the most difficult, consistent with the results
152

TABLE II: Pearson correlation values between feature scores and personality scores. Significant correlations are shown in bold
for p<0.05. Only features that correlate significantly with at least one personality trait are shown.
Language Feature Examples Extro. Agree. Consc. Neuro. Open.
“You” (you, your, thou) 0.068 0.364 0.252 -0.212 -0.020
Articles (a, an, the) -0.039 -0.139 -0.071 -0.154 0.396
Auxiliary Verbs (am, will, have) 0.033 0.042 -0.284 0.017 0.045
Future Tense (will, gonna) 0.227 -0.100 -0.286 0.118 0.142
Negations (no, not, never) -0.020 0.048 -0.374 0.081 0.040
Quantifiers (few, many, much) -0.002 -0.057 -0.089 -0.051 0.238
Social Processes (mate, talk, they, child) 0.262 0.156 0.168 -0.141 0.084
Family (daughter, husband, aunt) 0.338 0.020 -0.126 0.096 0.215
Humans (adult, baby, boy) 0.204 -0.011 0.055 -0.113 0.251
Negative Emotions (hurt, ugly, nasty) 0.054 -0.111 -0.268 0.120 0.010
Sadness (crying, grief, sad) 0.154 -0.203 -0.253 0.230 -0.111
Cognitive Mechanisms (cause, know, ought) -0.008 -0.089 -0.244 0.025 0.140
Causation (because, effect, hence) 0.224 -0.258 -0.155 -0.004 0.264
Discrepancy (should, would, could) 0.227 -0.055 -0.292 0.187 0.103
Certainty (always, never) 0.112 -0.117 -0.069 -0.074 0.347
Perceptual Processes
Hearing (listen, hearing) 0.042 -0.041 0.014 0.335 -0.084
Feeling (feels, touch) 0.097 -0.127 -0.236 0.244 0.005
Biological Processes (eat, blood, pain) -0.066 0.206 0.005 0.057 -0.239
Body (cheek, hands, spit) 0.031 0.083 -0.079 0.122 -0.299
Health (clinic, flu, pill) -0.277 0.164 0.059 -0.012 -0.004
Ingestion (dish, eat, pizza) -0.105 0.247 0.013 -0.058 -0.202
Work (job, majors, xerox) 0.231 -0.096 0.330 -0.125 0.426
Achievement (earn, hero, win) -0.005 -0.240 -0.198 -0.070 0.008
Money (audit, cash, owe) -0.063 -0.259 0.099 -0.074 0.222
Religion (altar, church, mosque) -0.152 -0.151 -0.025 0.383 -0.073
Death (bury, coffin, kill) -0.001 0.064 -0.332 -0.054 0.120
Fillers (blah, imean, youknow) 0.099 -0.186 -0.272 0.080 0.120
Punctuation
Commas 0.148 0.080 -0.24 0.155 0.170
Colons -0.216 -0.153 0.322 -0.015 -0.142
Question Marks 0.263 -0.050 0.024 0.153 -0.114
Exclamation Marks -0.021 -0.025 0.260 0.317 -0.295
Parentheses -0.254 -0.048 -0.084 0.133 -0.302
Non-LIWC Features
GI Sentiment 0.177 -0.130 -0.084 -0.197 0.268
Number of Hashtags 0.066 -0.044 -0.030 -0.217 -0.268
Words per tweet 0.285 -0.065 -0.144 0.031 0.200
Links per tweet -0.061 -0.081 0.256 -0.054 0.064
153

Citations
More filters
Journal ArticleDOI
TL;DR: It is shown that easily accessible digital records of behavior, Facebook Likes, can be used to automatically and accurately predict a range of highly sensitive personal attributes including: sexual orientation, ethnicity, religious and political views, personality traits, intelligence, happiness, use of addictive substances, parental separation, age, and gender.
Abstract: We show that easily accessible digital records of behavior, Facebook Likes, can be used to automatically and accurately predict a range of highly sensitive personal attributes including: sexual orientation, ethnicity, religious and political views, personality traits, intelligence, happiness, use of addictive substances, parental separation, age, and gender. The analysis presented is based on a dataset of over 58,000 volunteers who provided their Facebook Likes, detailed demographic profiles, and the results of several psychometric tests. The proposed model uses dimensionality reduction for preprocessing the Likes data, which are then entered into logistic/linear regression to predict individual psychodemographic profiles from Likes. The model correctly discriminates between homosexual and heterosexual men in 88% of cases, African Americans and Caucasian Americans in 95% of cases, and between Democrat and Republican in 85% of cases. For the personality trait “Openness,” prediction accuracy is close to the test–retest accuracy of a standard personality test. We give examples of associations between attributes and Likes and discuss implications for online personalization and privacy.

2,232 citations

Journal ArticleDOI
25 Sep 2013-PLOS ONE
TL;DR: This represents the largest study, by an order of magnitude, of language and personality, and found striking variations in language with personality, gender, and age.
Abstract: We analyzed 700 million words, phrases, and topic instances collected from the Facebook messages of 75,000 volunteers, who also took standard personality tests, and found striking variations in language with personality, gender, and age. In our open-vocabulary technique, the data itself drives a comprehensive exploration of language that distinguishes people, finding connections that are not captured with traditional closed-vocabulary word-category analyses. Our analyses shed new light on psychosocial processes yielding results that are face valid (e.g., subjects living in high elevations talk about the mountains), tie in with other research (e.g., neurotic people disproportionately use the phrase ‘sick of’ and the word ‘depressed’), suggest new hypotheses (e.g., an active life implies emotional stability), and give detailed insights (males use the possessive ‘my’ when mentioning their ‘wife’ or ‘girlfriend’ more often than females use ‘my’ with ‘husband’ or 'boyfriend’). To date, this represents the largest study, by an order of magnitude, of language and personality.

1,435 citations


Cites background or methods from "Predicting Personality from Twitter..."

  • ...conditions of other studies (for examples, see [25,27,65]), and its component traits are most loosely related [85]....

    [...]

  • ...As a first pass at predicting personality from language in Facebook, Golbeck used LIWC features over a sample of 167 Facebook volunteers as well as profile information and found limited success of a regression model [65]....

    [...]

Journal ArticleDOI
TL;DR: It is concluded that previous work may have overestimated the degree of ideological segregation in social-media usage and liberals were more likely than conservatives to engage in cross-ideological dissemination.
Abstract: We estimated ideological preferences of 3.8 million Twitter users and, using a data set of nearly 150 million tweets concerning 12 political and nonpolitical issues, explored whether online communication resembles an “echo chamber” (as a result of selective exposure and ideological segregation) or a “national conversation.” We observed that information was exchanged primarily among individuals with similar ideological preferences in the case of political issues (e.g., 2012 presidential election, 2013 government shutdown) but not many other current events (e.g., 2013 Boston Marathon bombing, 2014 Super Bowl). Discussion of the Newtown shootings in 2012 reflected a dynamic process, beginning as a national conversation before transforming into a polarized exchange. With respect to both political and nonpolitical issues, liberals were more likely than conservatives to engage in cross-ideological dissemination; this is an important asymmetry with respect to the structure of communication that is consistent with psychological theory and research bearing on ideological differences in epistemic, existential, and relational motivation. Overall, we conclude that previous work may have overestimated the degree of ideological segregation in social-media usage.

940 citations

Journal ArticleDOI
TL;DR: A survey of technologies capable of dealing with human personality, and a conceptual model underlying the three main problems addressed in the literature, namely Automatic Personality Recognition, Automatic Personality Perception and Automatic Personality Synthesis.
Abstract: Personality is a psychological construct aimed at explaining the wide variety of human behaviors in terms of a few, stable and measurable individual characteristics. In this respect, any technology involving understanding, prediction and synthesis of human behavior is likely to benefit from Personality Computing approaches, i.e. from technologies capable of dealing with human personality. This paper is a survey of such technologies and it aims at providing not only a solid knowledge base about the state-of-the-art, but also a conceptual model underlying the three main problems addressed in the literature, namely Automatic Personality Recognition (inference of the true personality of an individual from behavioral evidence), Automatic Personality Perception (inference of personality others attribute to an individual based on her observable behavior) and Automatic Personality Synthesis (generation of artificial personalities via embodied agents). Furthermore, the article highlights the issues still open in the field and identifies potential application areas.

450 citations


Cites background or methods from "Predicting Personality from Twitter..."

  • ...A similar approach was used by the same authors to predict the personality of Twitter users [63]....

    [...]

  • ...Social media are one of the main channels through which people interact with others, an ideal means for self-disclosure and, therefore, an excellent ground for research on personality computing [51], [63], [64], [65], [66], [67], [68], [69] (see Table 4 for a synopsis of data, approaches and results)....

    [...]

  • ...Other [51] 167 167 Facebook profile info., egocentric R 0:12 0:10 0:10 0:11 0:10 Profiles networks, LIWC MAE MAE MAE MAE MAE [63] 279 2000 tweets LIWC, MRC, R 0:16 0:13 0:14 0:18 0:12 per subject profile info....

    [...]

  • ...MAE MAE MAE MAE MAE [64] 335 335 Twitter number of followers/ R 0:88 0:79 0:76 0:85 0:69 profiles followings, listed counts RMSE RMSE RMSE RMSE RMSE [65] 209 209 RenRen Profile info., usage C(2) 83:8 69:7 82:4 74:9 81:1 profiles statistics, emotional states C(3) 71:7 72:3 70:1 71:0 69:5 F F F F F [66] 156 473 posts on Some LIWC categories U average FriendFeed accuracy 63:1 [67] 10000 10;000 blog LIWC C(2) 80:0 posts ACC [68] 300 60;000 favorite visual patterns, R 0:19 0:17 0:22 0:12 0:17 pictures aesthetic preferences r r r r r The table reports, from left to right, the number of subjects involved in the experiments, number and type of behavioral samples, main cues, type of task and performance over different traits....

    [...]

  • ..., egocentric R 0:12 0:10 0:10 0:11 0:10 Profiles networks, LIWC MAE MAE MAE MAE MAE [63] 279 2000 tweets LIWC, MRC, R 0:16 0:13 0:14 0:18 0:12 per subject profile info....

    [...]

Journal ArticleDOI
TL;DR: In three field experiments that reached over 3.5 million individuals with psychologically tailored advertising, it is found that matching the content of persuasive appeals to individuals’ psychological characteristics significantly altered their behavior as measured by clicks and purchases.
Abstract: People are exposed to persuasive communication across many different contexts: Governments, companies, and political parties use persuasive appeals to encourage people to eat healthier, purchase a particular product, or vote for a specific candidate. Laboratory studies show that such persuasive appeals are more effective in influencing behavior when they are tailored to individuals’ unique psychological characteristics. However, the investigation of large-scale psychological persuasion in the real world has been hindered by the questionnaire-based nature of psychological assessment. Recent research, however, shows that people’s psychological characteristics can be accurately predicted from their digital footprints, such as their Facebook Likes or Tweets. Capitalizing on this form of psychological assessment from digital footprints, we test the effects of psychological persuasion on people’s actual behavior in an ecologically valid setting. In three field experiments that reached over 3.5 million individuals with psychologically tailored advertising, we find that matching the content of persuasive appeals to individuals’ psychological characteristics significantly altered their behavior as measured by clicks and purchases. Persuasive appeals that were matched to people’s extraversion or openness-to-experience level resulted in up to 40% more clicks and up to 50% more purchases than their mismatching or unpersonalized counterparts. Our findings suggest that the application of psychological targeting makes it possible to influence the behavior of large groups of people by tailoring persuasive appeals to the psychological needs of the target audiences. We discuss both the potential benefits of this method for helping individuals make better decisions and the potential pitfalls related to manipulation and privacy.

428 citations


Cites background from "Predicting Personality from Twitter..."

  • ...For example, people’s personality profiles have been predicted from personal websites (11), blogs (12), Twitter messages (13), Facebook profiles (10, 14–16), and Instagram pictures (17)....

    [...]

References
More filters
Journal ArticleDOI
TL;DR: This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.
Abstract: More than twelve years have elapsed since the first public release of WEKA. In that time, the software has been rewritten entirely from scratch, evolved substantially and now accompanies a text on data mining [35]. These days, WEKA enjoys widespread acceptance in both academia and business, has an active community, and has been downloaded more than 1.4 million times since being placed on Source-Forge in April 2000. This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.

19,603 citations


"Predicting Personality from Twitter..." refers background in this paper

  • ...To the best of our knowledge, our work is among the first to look at the relationship between profile information provided in social networks and personality traits....

    [...]

Journal ArticleDOI
TL;DR: In this article, the authors investigated the relation of the Big Five personality dimensions (extraversion, emotional stability, Agreeableness, Conscientiousness, and Openness to Experience) to three job performance criteria (job proficiency, training proficiency, and personnel data) for five occupational groups (professionals, police, managers, sales, and skilled/semi-skilled).
Abstract: This study investigated the relation of the “Big Five” personality dimensions (Extraversion, Emotional Stability, Agreeableness, Conscientiousness, and Openness to Experience) to three job performance criteria (job proficiency, training proficiency, and personnel data) for five occupational groups (professionals, police, managers, sales, and skilled/semi-skilled). Results indicated that one dimension of personality, Conscientiousness, showed consistent relations with all job performance criteria for all occupational groups. For the remaining personality dimensions, the estimated true score correlations varied by occupational group and criterion type. Extraversion was a valid predictor for two occupations involving social interaction, managers and sales (across criterion types). Also, both Openness to Experience and Extraversion were valid predictors of the training proficiency criterion (across occupations). Other personality dimensions were also found to be valid predictors for some occupations and some criterion types, but the magnitude of the estimated true score correlations was small (ρ < .10). Overall, the results illustrate the benefits of using the 5-factor model of personality to accumulate and communicate empirical findings. The findings have numerous implications for research and practice in personnel psychology, especially in the subfields of personnel selection, training and development, and performance appraisal.

8,018 citations


"Predicting Personality from Twitter..." refers background in this paper

  • ...Relationships have been discovered between personality and psychological disorders [42], job performance [4] and satisfaction [24], and even romantic success [46]....

    [...]

  • ...We conclude with a discussion of the implications that this work has for social media websites and for organizations that may utilize social media to better understand the people with whom they interact....

    [...]

Journal ArticleDOI
TL;DR: In this paper, the auteur discute un modele a cinq facteurs de la personnalite qu'il confronte a d'autres systemes de the personNalite and don't les correlats des dimensions sont analyses.
Abstract: L'auteur discute un modele a cinq facteurs de la personnalite qu'il confronte a d'autres systemes de la personnalite et dont les correlats des dimensions sont analyses ainsi que les problemes methodologiques

6,111 citations


"Predicting Personality from Twitter..." refers background in this paper

  • ...It should be noted that the models dependence on trait terms indicates that the Big Five traits are based on a lexical approach to personality measurement [43], [9], [10], [16]....

    [...]

  • ...Additional research has proved that different tests, languages, and methods of analysis do not alter the models validity [29], [10], [21], [27]....

    [...]

Journal ArticleDOI
TL;DR: It is argued that the five-factor model of personality should prove useful both for individual assessment and for the elucidation of a number of topics of interest to personality psychologists.
Abstract: The five-factor model of personality is a hierarchical organization of personality traits in terms of five basic dimensions: Extraversion, Agreeableness, Conscientiousness, Neuroticism, and Openness to Experience. Research using both natural language adjectives and theoretically based personality questionnaires supports the comprehensiveness of the model and its applicability across observers and cultures. This article summarizes the history of the model and its supporting evidence; discusses conceptions of the nature of the factors; and outlines an agenda for theorizing about the origins and operation of the factors. We argue that the model should prove useful both for individual assessment and for the elucidation of a number of topics of interest to personality psychologists.

5,838 citations


"Predicting Personality from Twitter..." refers background in this paper

  • ...We conclude with a discussion of the implications that this work has for social media websites and for organizations that may utilize social media to better understand the people with whom they interact....

    [...]

Book
01 Jan 1986
TL;DR: The Self-Concept in Organizational Psychology: Clarifying and Differentiating the Constructs 1 John Schaubroeck, You Jin Kim, and Ann Chunyan Peng 2. The Effect of Subconscious Goals on Organizational Behavior 39 Gary P. Latham and Edwin A. Locke 3. Combating Stress in Organizations 65 Nathan A. Bowling, Terry A. Beehr, and Simone Grebner 4. e-Learning at Work: Contributions of Past Research and Suggestions for the Future 89 Kenneth G. Brown, Steven D. Charlier, and Abigail
Abstract: About the Editors vii List of Contributors ix Editorial Foreword xi 1. The Self-Concept in Organizational Psychology: Clarifying and Differentiating the Constructs 1 John Schaubroeck, You Jin Kim, and Ann Chunyan Peng 2. The Effect of Subconscious Goals on Organizational Behavior 39 Gary P. Latham and Edwin A. Locke 3. Combating Stress in Organizations 65 Nathan A. Bowling, Terry A. Beehr, and Simone Grebner 4. e-Learning at Work: Contributions of Past Research and Suggestions for the Future 89 Kenneth G. Brown, Steven D. Charlier, and Abigail Pierotti 5. Human Dynamics and Enablers of Effective Lean Team Cultures and Climates 115 Desiree H. Van Dun and Celeste P.M. Wilderom 6. Personnel Selection and the Competitive Advantage of Firms 153 Robert E. Ployhart 7. The Processes of Team Staffing: A Review of Relevant Studies 197 Stephen J. Zaccaro and Gia A. DiRosa 8. Strategic HRM Moving Forward: What Can We Learn from Micro Perspectives? 231 David P. Lepak, Kaifeng Jiang, Kyongji Han, William G. Castellano, and Jia Hu Index 261 Contents of Previous Volumes 267

2,659 citations


"Predicting Personality from Twitter..." refers background in this paper

  • ...We conclude with a discussion of the implications that this work has for social media websites and for organizations that may utilize social media to better understand the people with whom they interact....

    [...]