Contextualization As A Means To Improve The Predictive Validity of Personality Models

See discussions, stats, and author profiles for this publication at: https://www.researchgate.
net/publication/331978516
Contextualization as a means to improve the predictive validity of personality

models
Article in Personality and Individual Differences · July 2019

DOI: 10.1016/j.paid.2019.03.007
CITATIONS READS
0 389
2 authors:
Victor Swift Jordan B Peterson

University of Toronto University of Toronto
5 PUBLICATIONS 0 CITATIONS 130 PUBLICATIONS 7,289 CITATIONS
SEE PROFILE SEE PROFILE
Some of the authors of this publication are also working on these related projects:
The psychological significance of the Biblical stories: http://bit.ly/2rMHp08 View project
All content following this page was uploaded by Victor Swift on 12 July 2019.
The user has requested enhancement of the downloaded file.

Personality and Individual Differences 144 (2019) 153–163
Contents lists available at ScienceDirect
Personality and Individual Differences

journal homepage: www.elsevier.com/locate/paid
Contextualization as a means to improve the predictive validity of T

personality models
⁎,1
Victor Swift , Jordan B. Peterson
University of Toronto, 27 King's College Circle, Toronto, Ontario M5S 1A1, Canada
A R T I C LE I N FO A B S T R A C T
Keywords: Research investigating performance-related outcomes has established that the predictive validity of Big Five
Contextualization models can be doubled, simply by contextualizing scale items. We herein test whether this contextualization
Big Five effect can be leveraged to universally improve the predictive validity of Big Five models by considering the
Predictive validity breadth, depth, and nature of contextualization effects. Study 1 (N = 320) compared the predictive validity of an
Conscientiousness
uncontextualized measure of Conscientiousness (C), with three measures of C contextualized to organizational,
Frame of Reference
academic, and romantic settings, respectively. Nine outcome variables served as indices of validity (three per
context). We hypothesized that contextualized scales would be superior in predicting all context-congruent
outcomes. Study 2 (N = 680) and Study 3 (N = 378) extended the results to consider alternative trait and
outcome measures (e.g., Implicit Association Task). Across all three studies, predictive validity was enhanced for
contextualized vs. uncontextualized measures only when outcomes were context-congruent. However, this en-
hancement varied greatly, with contextualized scales improving the prediction of some outcomes by as much as
32% (GPA), and as little as 1% (income). Our findings suggest that scale contextualization can be adopted by
psychometricians to easily improve the predictive validity of personality models. Guidelines for contextualiza-
tion are discussed.
The development of reliable Big Five personality measures has Frame of Reference (FOR) effect (Holtrop, Born, & de Vries, 2014).
ushered in an era of personality screening. Today, Big Five scores are Since its discovery (Schmit, Ryan, Stierwalt, & Powell, 1995) the
used to predict important social, political, organizational, inter- FOR effect has been exclusively investigated as a potential means to
personal, and individual outcomes (Ozer & Benet-Martínez, 2006). The improve the prediction of performance outcomes. Indeed, the FOR ef-
magnitude of these predictions, however, remain moderate to small, as fect has been reliably shown to emerge in the prediction of job per-
suggested by meta-analyses across disciplines (Barrick & Mount, 1991; formance (Bing et al., 2014; Farmer, 1999; Hunthausen, Truxillo,
Berry, Ones, & Sackett, 2007; Bogg & Roberts, 2004; Poropat, 2009). Bauer, & Hammer, 2003; Pace & Brannick, 2010), and academic per-
Thus, there is much to be gained from improving the predictive validity formance (Bing, Whanger, Davison, & VanHook, 2004; Holtrop, Born,
of Big Five measures. de Vries, & de Vries, 2014; Lievens, De Corte, & Schollaert, 2008; Liu,
One of the most promising approaches for improving the predictive 2014; Roenicke, 2013; Seiler, 2011; Schmit et al., 1995). However, it
validity of personality models is contextualization. Simply by asking remains unclear as to whether FOR effects emerge in the prediction of
participants to complete traditional personality scales with a particular non-performance outcomes.
context in mind, the predictive validity of the consequent personality Before contextualization can be considered a viable means to im-
models can be significantly improved (Bing, Davison, & Smothers, prove the general predictive validity of the Big Five, three features of
2014). For instance, scales comprised of personality items such as “do the FOR effect must be elucidated. Firstly, we must determine the
you finish what you start at work” can explain two times more variance breadth of FOR effects by establishing whether these effects emerge in
in job performance than scales that do not specify a context (for meta- the prediction of a variety of outcomes, across a variety of contexts.
analysis see Shaffer & Postlethwaite, 2012). The improvement of pre- Contextualization will prove to be most psychometrically useful if FOR
dictive validity by means of contextualization is referred to as the effects emerge broadly. Secondly, we must determine the depth of these
⁎
Corresponding author at: 100 St. George Street, Toronto, Ontario M5S 3G3, Canada.
E-mail address: victor.swift@mail.utoronto.ca (V. Swift).
1
Victor Swift was responsible for design, data collection, data preparation, data analysis, and report writing. Jordan Peterson was responsible for resources and
supervision. This study was not preregistered.
https://doi.org/10.1016/j.paid.2019.03.007
Received 28 February 2018; Received in revised form 5 March 2019; Accepted 7 March 2019
Available online 14 March 2019
0191-8869/ © 2019 Elsevier Ltd. All rights reserved.
V. Swift and J.B. Peterson Personality and Individual Differences 144 (2019) 153–163
effects by establishing whether contextualization results in meaningful 3. Nature of frame of reference effects
increases of predictive validity – as trivial increases due to technicalities
would disqualify the practical value of contextualization. Lastly, we According to Identity Theory (Stryker, 1986), when a traditional
must determine the nature of FOR effects by establishing the conditions trait measure is contextualized, the latent variable that is being mod-
that give rise to FOR effects. Only by knowing the necessary and suf- elled changes from a broad trait to a role-specific trait (Heller et al.,
ficient conditions for FOR effects can we maximize the psychometric 2009). This transformation has two consequences which contribute to
utility of contextualization. FOR effects. Firstly, contextualization reduces the ambiguity of per-
sonality items, as the imposed role serves as a standard reference frame.
1. Breadth of frame of reference effects By restricting the interpretation of items, this reference frame reduces
within-person inconsistency and between-person variability (Lievens
We hypothesize that FOR effects emerge in the prediction of a et al., 2008). In turn, this reduction in error increases predictive va-
variety of outcomes, not just performance-related outcomes. This ex- lidity. Secondly, contextualization adjusts the specificity of trait scales
pectation is based, in part, on current trends in contextualization re- to better match the specificity of context-specific outcomes (Ajzen,
search. Our review of the contextualization literature revealed that six 1987). By aggregating across contexts, general personality scales have
studies have considered non-performance outcomes (Ching et al., 2013; less predictive accuracy due to the lack of context-specific, but criterion
Heller, Ferris, Brown, & Watson, 2009; Holtrop, Born, de Vries, & de valid, variance (Paunonen & Ashton, 2001).
Vries, 2014; Slatcher & Vazire, 2009; Wood, 2007; Wood & Roberts, Simply specifying a role is not sufficient to produce FOR effects.
2006). In all of these studies, trait-outcome correlations were greater Crucially, the role that is specified must be “conceptually relevant” to
for contextualized, than for uncontextualized, trait measures. For in- the outcome being predicted (Farmer, 1999; Lievens et al., 2008). For
stance, the correlations between relationship-specific personality and instance, FOR effects emerge when predicting job-performance from
relationship satisfaction were consistently larger (ravg = 0.38) than the workplace-specific, but not school-specific, trait measures. Likewise,
correlations between uncontextualized personality and relationship FOR effects emerge when predicting academic performance from
satisfaction (ravg = 0.15; Slatcher & Vazire, 2009; Wood & Roberts, school-specific, but not workplace-specific, trait measures (Farmer,
2006). Likewise, the correlation between fraternity satisfaction and 1999; Lievens et al., 2008). Thus, FOR effects seem to be contingent on
fraternity-specific personality (r = 0.44) was found to be larger than manipulations of criterion relevance (Bing et al., 2004). It remains
the correlation between fraternity satisfaction and uncontextualized unclear, however, as to what it means for a criterion to be conceptually
personality (r = 0.18; Wood, 2007). relevant to a context, as any notion of relevance is inherently unin-
Simply comparing the magnitude of correlation coefficients does formative without allusion to some specified guidelines (Davie, 2005).
not confer whether one predictor is stronger than another, particularly We herein define criterion relevance as a congruency between the
when the predictors are highly correlated (Steiger, 1980), as is the case context of an outcome measure and the context of a trait measure.
in the existing literature. Moreover, the precise magnitude of correla-
tions is unlikely to generalize to other samples (Vispoel, Morris, & 4. Hypotheses
Kilinc, 2018). As such, it remains unclear as to whether con-
textualization does indeed improve predictions of non-performance In three studies, we investigate the utility of contextualization as a
measures. We herein address this issue by utilizing statistical proce- means to improve the predictive validity of personality models.
dures that enable the systematic comparison of correlation coefficients. Specifically, we seek to determine whether FOR effects are broad and
meaningful enough to justify the instrumental use of contextualization.
2. Depth of frame of reference effects Moreover, we seek to determine the necessary and sufficient conditions
for producing FOR effects in order to substantiate guidelines for con-
Predictive validity can be trivially improved by increasing the textualization. Accordingly, we posit three hypotheses:
equivalence of item content between predictor and outcome variables
H1. (Congruency Hypothesis) FOR effects emerge when there is congruency
(Nicholls, Licht, & Pearl, 1982). This poses a problem for con-
between the contexts of outcome and trait measures.
textualization research where items are deliberately transformed to be
more similar to outcome variables. Indeed, it is difficult to determine H2. (Breadth Hypothesis) FOR effects emerge across a variety of contexts
whether FOR effects are genuine or caused by increases in item and for a variety of outcomes.
equivalence in some performance research. For instance, Bing et al.
H3. (Depth Hypothesis) FOR effects are independent of item equivalence.
(2014) found that the substitution of items such as “I am usually able to
convince my friends what we ought to do together next” with “If a potential Our examination focuses on a single Big Five trait:
buyer says ‘no’ to me it means I need to work harder to get my point across” Conscientiousness (C) – defined as a tendency to be hardworking, re-
improved the prediction of supervisor-rated sales performance on scales strained, orderly, and rule abiding (Roberts, Jackson, Fayard, Edmonds,
comprised of such items as “This employee uses suggestive selling in a very & Meints, 2009). Of the Big Five traits, C remains the best predictor of
skillful fashion”. job performance (Schmitt, 2014) and academic performance (Poropat,
Item equivalence does not pose a problem for all FOR findings. 2009), and the most researched trait in the FOR context (Roenicke,
Specifically, when outcome variables are categorically distinct from the 2013). Thus, by focusing on C we are able to simultaneously draw from
trait being measured, contextualization is unlikely to affect item and contribute to FOR research that may have broad practical im-
equivalence. For instance, contextualizing trait conscientiousness plications for academic and industrial domains.
measures, which are comprised of items such as “I make plans and follow In Study 1, all three hypotheses are tested using a Big Five measure
through with them”, and “I tend to be organized”, improves the prediction contextualized to three contexts: workplace, school, and relationship. In
of job satisfaction, which is comprised of items such as “I feel fairly Study 2, we extend our results by testing whether FOR effects emerge
satisfied with my job”, and “I like my job better than the average worker when predicting non-performance outcomes from a novel con-
does”. However, adding the tag “at work” to conscientiousness items textualized measure of C. The results of this second study reflect the
does not seem to increase the semantic similarity with job satisfaction generalizability of non-performance FOR effects. In Study 3, we extend
items. In these circumstances, instead of increasing item equivalence, our results further by considering whether FOR effects emerge for other
contextualization increases the relevance of the predictor to the out- Big Five traits, and when predicting non-performance outcomes that are
come. We suspect that it is precisely this condition of relevance that not based on self-report. Across all studies, outcome variables are
underlies, and legitimizes, FOR effects. considered that have been unexplored in existing FOR research.
154
5. Study 1 (Carlos & Rodrigues, 2016). Agreement with items was indicated using
a 7-point Likert scale (1 = strongly disagree, 7 = strongly agree). While
To test whether FOR effects emerge when there is congruency be- self-rated performance is somewhat controversial as a substitute for
tween the contexts of outcome and trait measures (H1), we consider supervisor-rated performance, for our purposes (i.e., demonstrating
three categories of outcomes across three different contexts: workplace, relative predictive validity), this issue is irrelevant. Moreover, the focus
school, and relationships. Across contexts, C has been found to predict of this study is FOR effects beyond job performance, as FOR effects for
three categorical outcomes: performance, virtues, and satisfaction. performance are well established (see introduction).
Thus, we expect contextualized measures of C to be better predictors of
performance, virtues, and satisfaction only when context congruency is 5.1.2.2. Relationship performance. Relationship performance was
satisfied. For instance, we expect workplace-C to predict workplace assessed with a 9-item version of Sternberg's Triangular Love Scale
performance, workplace virtues, and workplace satisfaction, over and (Lemieux & Hale, 1999). The scale consisted of 3 components (3 items
above uncontextualized and other-contextualized measures of C. each): intimacy, passion, and commitment. Agreement with items was
Moreover, we expect that FOR effects do not emerge when the context indicated on a 5-point scale (1 = not at all, 5 = extremely).
of a trait measure is not congruent with the context of an outcome
measure. For instance, we expect that workplace-C will not predict 5.1.2.3. Academic performance. To assess academic performance,
school or relationship performance, virtues, and satisfaction, over and participants were asked to report their GPA for each level of
above uncontextualized and other-contextualized measures of C. schooling that they completed (High School, Community College,
Nine outcome measures are considered which represent perfor- University, and Graduate School). GPA was measured using a 13-
mance, virtues, and satisfaction, for workplace, school, and relationship point scale which was labeled with letter grades and their
contexts (3 categorical outcomes x 3 contexts). Because we are interested corresponding percentages and GPA values [1 = F (< 59%);
in changes in predictive validity for measures of C, the outcomes that 13 = A+ (99% to 100%, 4.0 GPA)]. A final academic performance
were selected are based on previous research into the categorical out- score was calculated by averaging GPA across levels of schooling.
comes associated with C. In terms of performance, Conscientious people
have been found to be better workers (Barrick & Mount, 1991), students 5.1.3. Satisfaction measures
(Poropat, 2009), and partners (Ahmetoglu, Swami, & Chamorro- 5.1.3.1. Workplace satisfaction. Workplace satisfaction was assessed
Premuzic, 2010). In part due to their dutifulness, Conscientious people using Hoppock's Job Satisfaction Measure (McNichols, Stahl, &
tend to be more virtuous, as demonstrated by reductions in workplace Manley, 1978), a 4-item scale with variable 7-point Likert responses
deviance (Berry et al., 2007), academic procrastination (Steel, 2007), (1 = low satisfaction, 7 = high satisfaction). This scale has been used in
and relationship infidelity (Schmitt, 2004). Finally, Conscientious previous research to demonstrate the relationship between C and Job
people tend to be more satisfied with life in general (Hayes & Joseph, Satisfaction (Judge, Heller, & Mount, 2002).
2003), and across domains, including workplace (Judge, Higgins,
Thoresen, & Barrick, 1999), school (Weber & Huebner, 2015), and re- 5.1.3.2. Relationship satisfaction. Relationship satisfaction was assessed
lationships (Schaffhuser, Allemand, & Martin, 2014). with the 7-item Relationship Assessment Scale (RAS; Hendrick, 1988).
We expect that all nine of these outcome measures are associated Respondents indicated agreement with items (e.g., “How well does your
with FOR effects (H2) – while specific effects depend on context con- partner meet your needs”) on a 5-point scale (1 = low, 5 = high).
gruency (H1).
5.1.3.3. Academic satisfaction. Academic satisfaction was assessed with
5.1. Study 1: Methods a variation of the 7-item RAS. The word “school” was substituted in
place of the word “partner” (e.g., “How well did your school meet your
5.1.1. Personality measures needs”). Agreement with items was indicated on a 5-point scale
Three contextualized measures of C were derived from the 20-item (1 = low, 5 = high). Participants were asked to respond with their
Conscientiousness sub-scale of the International Personality Item Pool most recent schooling experience in mind.
Representation of the Revised NEO Personality Inventory (IPIP-NEO;
Goldberg et al., 2006). The IPIP-NEO was used because it demonstrates 5.1.4. Virtue/vice measures
the highest internal consistency of any measure of C (Goldberg, 2017). 5.1.4.1. Workplace deviance. Organizational Deviance (OD) refers to
Contextualization consisted of specifying a unique setting for each harmful or counterproductive behaviors that undermine an
of the 20 items of C. The three contexts that were specified represented organization, including unlawful or dishonest actions. OD was
academic, romantic, and organizational settings, respectively. Each of assessed by a 7-item Workplace Deviance scale (Bennett & Robinson,
these three contextualized measures used the same 20 IPIP-NEO items 2000) that assessed agreement with items on a 7-point scale (1 = never,
in order to ensure an equal representation of C's facets. For instance, the 7 = daily).
IPIP-NEO item “I need a push to get started” was translated into the
academic item “I need a push to start school projects or homework”, the 5.1.4.2. Relationship infidelity. Relationship fidelity was assessed using
romantic item “I need a push to start things with my partner”, and the the relationship exclusivity sub-scale of the Sexy Seven Measure of
organizational item “I need a push to start projects at work” (see Sexuality (Schmitt & Buss, 2000). Participants indicated agreement
Appendix A). with 8 sexually connotative self-ascriptions (e.g., promiscuous,
To assess the other Big Five traits, along with uncontextualized C, an adulterous) using a 9-point scale (1 = extremely inaccurate,
unaltered version of the IPIP-NEO was administered (100 items total, 9 = extremely accurate).
20 items for C). On all personality items, participants indicated their
agreement with statements using a 5-point Likert scale (1 = strongly 5.1.4.3. Academic procrastination. Participants completed a short form
disagree, 5 = strongly agree). (5-item) version of the Academic Procrastination Form (Yockey, 2016).
Agreement with items (e.g., “I knew I should work on schoolwork, but I
5.1.2. Performance measures just didn't do it”), was indicated on a 4-point scale (1 = strongly agree,
5.1.2.1. Workplace performance. Workplace performance was assessed 4 = strongly disagree).
using self-report, which has been shown to reliably represent actual
workplace performance (Heidemeier, 2005). Participants completed 5.1.5. Procedure
the 12-item Task Performance subscale of the Job Performance scale 345 participants recruited through Amazon's Mechanical Turk
155
(MTurk) completed our survey, which was assembled and hosted on Table 1
Qualtrics. Twenty-five participants (7%) who failed at least one of the Scale parameters of all measures used in Study 1.
nine attention checks were excluded from the dataset before any ana- Alpha Mean SD
lyses were conducted, resulting in a final sample of 320 participants
(51% female; Mage = 40, SDage = 12). Conscientiousness
General 0.95 4.2 0.71
Participation was restricted to users who had completed > 1000
Workplace 0.94 4.5 0.56
surveys and demonstrated acceptable responding at least 99% of the School 0.98 3.9 1.04
time (the same restriction applied to subsequent studies). MTurk users Relationships 0.94 4.3 0.74
are representative of the general U.S population (Buhrmester, Kwang, & Satisfaction
Gosling, 2011) and have been repeatedly shown to produce reliable and Workplace 0.92 4.9 1.22
valid data (Mason & Suri, 2012; Paolacci & Chandler, 2014). School 0.95 3.5 1.08
After providing consent to participate, participants were notified Relationships 0.95 3.9 1.03
that the survey would consist of approximately 230 multiple choice Performance
questions that may appear repetitive, but which ultimately differ in Workplace 0.74 5.5 0.71
School (GPA) N/A 3.0 0.70
terms of context. The survey was divided according to contextual
Relationships 0.89 4.0 0.83
blocks, in order to ensure that participants maintained the appropriate
context in mind. The workplace block consisted of the Job Performance Virtues
Workplace 0.84 1.6 0.57
Scale, Job Satisfaction Scale, Workplace Deviance Scale, and the School 0.96 2.7 1.06
workplace-contextualized measure of C. The academic block consisted Relationships 0.91 2.2 1.02
of the Academic Performance measure, Academic Satisfaction scale,
Academic Procrastination scale, and the measure of C contextualized to
the classroom. The relationship block consisted of the Relationship increase the overlap between C items and outcome items.
Performance scale, Relationship Satisfaction scale, Infidelity scale, and Strong measurement invariance is classically represented by a
the relationship-specific measure of C. A non-contextual block was combination of configural (factor structure), metric (item weight), and
administered, consisting of the entire IPIP-NEO, and demographics scalar (average score) invariance (Beaujean, 2014). Configural in-
survey. These blocks were randomly presented, as were the constituents variance is suggested by good overall model fit and matching sig-
within the blocks. To ensure valid responses, nine unique attention- nificant loadings (Hirschfeld & von Brachel, 2014). Metric and scalar
checks were scattered throughout the survey and embedded within the invariance is suggested by minimal reduction in Comparative Fit Index
multiple-choice surveys (e.g., “To indicate focus choose the third option”). (CFI) between steps (ΔCFI < 0.01). Following the guidelines of Hu and
At the end of the survey, participants were debriefed and compensated Bentler (1998), overall goodness of fit for configural invariance was
with $3.00. assessed using the Standardized Root Mean Square Residual (SRMR)
index with a cutoff at 0.08 (Hu & Bentler, 1999).
Table 2 depicts the results of the MACS analysis. For all models,
5.2. Study 1: Results
SRMR at the configural step suggested good overall fit (SRMR ≤ 0.06).
Likewise, for all models change in CFI was < 0.01 between configural
The sample was predominantly Caucasian (87%) and 95% of the
and metric steps, and between metric and scalar steps. Together, these
respondents were currently employed (5% retired or disabled). One
results demonstrate that there was strong measurement invariance be-
participant who reported never being employed in an organization was
tween contextualized and uncontextualized measures, suggesting that
excluded from analyses involving organizational variables. Thirty-four
contextualization did not increase item overlap.
percent of participants were single or in a causal relationship. Nine
Intercorrelations between measures of C and outcome variables are
participants who reported never having a romantic relationship were
depicted in Table 3. The uncontextualized measure of C was most
excluded from analyses involving relationship variables. All partici-
strongly associated with workplace-C (r = 0.8), and comparably asso-
pants had completed a high-school degree, and 8% were currently
ciated with school-C (r = 0.6), and relationship-C (r = 0.6). All of the
students (3.5% part-time).
outcome measures (regardless of context) were significantly correlated
With the exception of the Job Performance measure, all scales ad-
with all of the measures of C. This pattern confirms prior research de-
ministered herein exhibited excellent internal consistency (see Table 1).
monstrating that C predicts performance, satisfaction, and virtues
Two items from the Job Performance measure were negatively corre-
across contexts. The significance of context-incongruent trait-outcome
lated with the overall scale (r1 = −0.08, r2 = −0.16), thereby under-
correlations indicates that the contextualized measures were re-
mining the reliability of this measure. These two items (#19 and #24 in
presentative of a general factor of C, in addition to context-specific
the original published scale) were removed from the scale before our
factors of C.
hypotheses were tested, increasing the internal consistency from poor
The congruency hypothesis (H1) and the breadth hypothesis (H2)
(α = 0.57) to acceptable (α = 0.74).
were tested using K-fold cross validation regression analysis. Whereas
To determine whether contextualization increased the overlap be-
traditional regression analyses provide an unverified estimate of pre-
tween traits and outcomes (H3: depth), measurement equivalence was
dictive power, K-fold regression analysis verifies the predictive power
tested using eight2 Means and Covariance Structures (MACS) – also
of models by fitting a model on a subset of data and subsequently
known as multi-group CFA (Beaujean, 2014). Each MAC structure re-
quantifying the test error on the remainder of the data (Yarkoni &
presented a two-factor model consisting of a latent C variable and a
Westfall, 2017). Free of overfitting, the resultant K-folds regression
latent C-related outcome variable (e.g., Job Satisfaction) and their
coefficient can be interpreted as a true estimate of prediction, unlike
covariance. Measurement equivalence was determined by comparing
correlation (Yarkoni & Westfall, 2017). As such, we employed K-folds
the goodness of fit for models in which latent C was based on un-
analysis to determine the relative predictive power of contextualized
contextualized items, with models in which latent C was based on
and uncontextualized measures.
context-congruent items. Strong equivalence of the uncontextualized
Three K-folds regressions were conducted for every outcome vari-
and contextualized models would suggest that contextualization did not
able, one for each contextualization condition – i.e., uncontextualized,
context-congruent, and context-incongruent. Uncontextualized regres-
2
One MACS model for each item-based outcome measure (i.e., GPA was sions predicted outcome variables (e.g., scored Job Satisfaction) from
excluded). scored C; context-congruent regressions predicted outcome variables
156
Table 2
Measurement invariance across Study 1 measures.
Steps Satisfaction Vice Performance
Work School Relation Work School Relation Work Relation
Configural 0.829(0.06) 0.883(0.05) 0.910(0.05) 0.815(0.06) 0.886(0.05) 0.871(0.06) 0.829(0.06) 0.837(0.06)

Metric 0.826(0.07) 0.879(0.07) 0.907(0.06) 0.813(0.07) 0.881(0.07) 0.868(0.07) 0.826(0.07) 0.835(0.07)
Scalar 0.823(0.07) 0.871(0.07) 0.894(0.07) 0.810(0.07) 0.873(0.07) 0.854(0.07) 0.823(0.07) 0.821(0.07)
Note. Values represent CFI (SRMR).
Table 3 models. Moreover, none of the context-incongruent measures demon-

Inter-correlations of trait scales and outcome measures for Study 1. strated greater predictive validity than uncontextualized measures.
R-C W-C S-C NEO-C
Although K-fold analysis provides greater confidence in effect size than
regular regression analysis, we opted to additionally test whether effect
S-C .58⁎ sizes were significantly different between groups using Hittner, May,
W-C .45⁎ .80⁎ and Silver's (2003) test for dependent overlapping correlations (Hitt-
R-C .52⁎ .39⁎ .62⁎
School
ner's test). All context-congruent coefficients were found to be sig-
Performance .19⁎ .11t .54⁎ .22⁎ nificantly larger than context-incongruent coefficients (p < .05). Ad-
Virtue .32⁎ .38⁎ .75⁎ .46⁎ ditionally, six of the nine outcome variables were significantly more
Satisfaction .24⁎ .22⁎ .60⁎ .31⁎ strongly predicted by context-congruent measures than un-
Workplace
contextualized measures. The three outstanding outcomes were Job
Performance .44⁎ .68⁎ .35⁎ .62⁎
Virtue .38⁎ .55⁎ .33⁎ .47⁎ Satisfaction (z = 1.30, p = .10), Relationship Performance (z = 1.30,
Satisfaction .27⁎ .35⁎ .25⁎ .31⁎ p = .10) and Relationship Vice (z = 1.32, p = .09).
Relationship
Performance .60⁎ .33⁎ .20⁎ .35⁎
6. Study 2
Virtue .35⁎ .24⁎ .14t .26⁎
Satisfaction .58⁎ .32⁎ .22⁎ .38⁎
Study 1 found support for our hypotheses using measures of C that
Note. R-C: Relationship-C, W-C: Workplace-C, S-C: School-C; NEO-C: Un-con- were contextualized at the item level. These measures simply add
textualized C. contextual information to existing Big Five personality items. In Study
⁎
p < .0001. 2, we test our hypotheses with a contextualized measure of C that was
t
p < .05. derived independently of the Big Five. In addition to substantiating the
generalizability of our FOR hypotheses, positive results in this study
from scored context-congruent C (e.g., workplace-C); and context-in- would suggest trait-related FOR effects are not scale dependent.
congruent regressions predicted the outcome variable from the re- The contextualized measure employed herein represented work-
maining context-incongruent C variables (e.g., School C and place C. Two context-congruent outcome measures were administered
Relationship C). Each K-fold regression split the data into 10 training that have previously been established as workplace-related correlates of
and testing folds. Cross validation was repeated 100 times for each C: Job Satisfaction and Income (Judge et al., 1999). To balance the
model in order to derive more stable parameters (see Yarkoni & design, we administered two context-incongruent outcome measures
Westfall, 2017). that have been reliably associated with C: Proactive Health Behaviors
Results of the K-fold regression analysis are depicted in Table 4. In (Takahashi, Edmonds, Jackson, & Roberts, 2013), and Political Con-
line with H1, we found predictions to be greater for all context-conservatism (Sibley, Osborne, & Duckitt, 2012). All outcome variables
gruent models, compared to uncontextualized and context-incongruent measured herein differ from those employed in Study 1 – this includes
Job Satisfaction, which is measured with an alternative scale in this
Table 4 study.
K-fold based standardized correlation coefficients for Study 1. We expect workplace C to be better than uncontextualized measures
only in the prediction of the Income and Job Satisfaction (H1
Outcome Congruent Incongruent General
Congruency). Although different contextual features influence these
Work variables, they are nevertheless the only two variables that imply a
Performance .69a,b .50 .65 shared context with the workplace trait measure. Incidentally, finding
Vice .58a,b .46 .53 FOR effects for Income and Job Satisfaction will provide support for H2
Satisfaction .39a .33 .35
(Breadth), as these represent additional outcome variables.
School
Performance .75a,b .44 .48
Vice .75a,b .42 .48
6.1. Study 2: Methods
Satisfaction .60a,b .30 .33
6.1.1. Measures
Relationship
Performance .40a
.33 .35 6.1.1.1. Uncontextualized measure of C. In order to ensure that
Vice .40a .32 .33 uncontexualized C was comprehensively measured, a 50-item
Satisfaction .59a,b .37 .42 measure of C was derived by combining three popular personality
measures: the IPIP-NEO (120-item version; Johnson, 2014), the BFAS
Note. Standardized r values for context-congruent (Congruent), context-incon-
(DeYoung, Quilty, & Peterson, 2007), and the HEXACO (Lee & Ashton,
gruent (Incongruent) and uncontextualized (General) measures of
2004). Responses were made using a 5-point Likert scale (1 = disagree
Conscientiousness.
a
Congruent coefficient is significantly larger than incongruent coefficient strongly, 5 = agree strongly). This composite measure exhibited excellent
(p < .05). internal consistency (α = 0.96, M = 3.8, SD = 0.6).
b
Congruent coefficient is significantly larger than uncontextualized coeffi-
cient (p < .05). 6.1.1.2. Big Five Competencies Grid (CG). Contextualized for the
157
Table 5
Pearson's Correlations between trait and outcome measures in Study 2.
Workplace C Aggregated C Job satisfaction Income Conservatism
⁎⁎
Aggregated C .63
Job satisfaction .35⁎⁎ .27⁎⁎
Income .13⁎⁎ .12⁎⁎ .33⁎⁎
Conservatism .07 .15⁎⁎ .08 .08
Proactive health .09⁎ .22⁎⁎ .11⁎⁎ .02 .05
⁎
p < .05.
⁎⁎
p < .01.
workplace at the instructional and item level, CG measures the Big Five analyses was composed of 664 participants (50.1% female; Mage = 36,
for the workplace context (Consiglio, Alessandri, Borgogni, & Piccolo, SD = 11).
2013). Rather than simply adding “at work” tags to traditional After reading a consent form that broadly disclosed the main com-
personality items, CG restructures traditional items to apply solely to ponents of the survey, participants were asked to complete a demo-
the workplace. The following CG item is representative of such graphic questionnaire including SES, political orientation, exercise
restructuring: “I respect deadlines even if it means working overtime”. frequency, and alcohol consumption questions. Participants then com-
Participants indicated how frequently they participate in a given pleted the 50-item measure of C, followed by the workplace measure of
workplace behavior using a 7-point Likert scale ranging from 1 C (CG) and the Job Satisfaction questionnaire. After being debriefed,
(never) to 7 (every time), with specific frequencies in between – for participants were compensated with $2.00.
instance, “occasionally–in about 30% of the chances when I could have”.
Workplace C is comprised of items from two subscales: Accomplishment
and Process Management (11 items total). Workplace C as measured by 6.2. Study 2: Results
CG exhibited good internal consistency in our sample (α = 0.89,
M = 5.29, SD = 0.88). All participants resided in the United States and the majority were
Caucasian (80.4%). With the exception of two participants, all partici-
6.1.1.3. Outcome variables. Job Satisfaction was measured using an 8- pants were high school graduates, and 34.1% of participants were
item scale that surveyed satisfaction in multiple domains including job students at the time of the survey (9% part-time students). The average
security, supervision, income, and co-workers. This scale has been used reported untaxed income for this sample was $40,000 (SD = $26,000).
in previous research to demonstrate the relationship between C and Approximately thirteen percent of participants were unemployed at the
intrinsic career success (Judge et al., 1999). Participants provided time of taking the survey and subsequently reported work-related in-
responses using a 5-point Likert scale (1 = dislike it very much, 5 = like it formation for their most-recent job. Fifty-two percent of participants
very much). This measure exhibited good internal consistency identified with a clear left-wing political ideology with 25% of parti-
(α = 0.84, M = 3.5, SD = 0.75). cipants identifying between left-wing and right-wing. Out of a possible
Annual untaxed income was measured using an 8-point scale ran- 5 points, the Job Satisfaction score followed a normal distribution with
ging from “Less than $20,000” to “More than $106,000”, in incremental a slight negative skew (M = 3.5, SD = 0.75). Proactive Health score (z-
intervals of $15,000. Considering the variable economic situations of score based) was fairly normally distributed, with a slight negative
online workers, unemployed participants were given the option to in- skew (M = −0.01, SD = 0.63).
dicate the prior year's income. Accordingly, participants were asked if Intercorrelations between the measures of C and outcome variables
their work-related details (i.e., job satisfaction, and income) re- are reported in Table 5. The correlation between uncontextualized C
presented their current, or recent, status. and the workplace measure of C was substantially smaller than the
Political orientation was calculated by averaging two 5-point items analogous correlation in Study 1 (r = 0.63 vs. r = 0.8). This finding is
that surveyed identification with right-wing and left-wing (reverse unsurprising, considering that the workplace measure employed herein
coded) ideals, respectively (1 = disagree strongly, 5 = agree strongly). A was derived independently from the Big Five. All outcomes significantly
score below 3 represented a left-wing leaning, a score of 3 represented correlated with uncontextualized C, confirming past research asso-
political neutrality, and a score above 3 represented a right-wing ciating C with Job Satisfaction, Income, Conservatism, and Proactive
leaning. A meta-analysis of 73 studies investigating the role of per- Health.
sonality in political orientation (Sibley et al., 2012) has demonstrated Again, the congruency hypothesis (H1) and the breadth hypothesis
that C is the best Big Five predictor of Conservatism (r = 0.10). The (H2) were tested using K-fold cross validation regression analysis. Two
internal consistency of this measure was excellent (α = 0.90, M = 2.49, K-folds regressions were conducted for every outcome variable, one for
SD = 1.2). each contextualization condition – i.e., uncontextualized C and context-
Proactive health was calculated by averaging two z-scored items congruent C. Uncontextualized regressions predicted outcome variables
that surveyed the frequency of exercise and alcohol consumption (re- (e.g., scored Job Satisfaction) from the uncontextualized measure of C;
verse coded), using a 7-point scale (1 = never, 7 = most days). context-congruent regressions predicted outcome variables from
Conscientious individuals are characterized by a tendency to partake in workplace-C. Each K-fold regression split the data into 10 training and
preventative health behaviors (Takahashi et al., 2013) – notably, less testing folds and cross validation was repeated 100 times. Results of this
frequent alcohol consumption (Cook, Young, Taylor, & Bedford, 1998), analysis are presented in Table 6.
and more frequent exercising (Conner & Abraham, 2001). In line with H1, contextualized predictive coefficients were only
larger for context-congruent outcome measures (i.e., Job Satisfaction
6.1.2. Procedure and Income). However, Hittner's test for dependent overlapping cor-
714 participants recruited through Amazon's Mechanical Turk relations revealed only Job Satisfaction to be significantly greater for
(MTurk) completed our survey. Fifty participants (7%) were excluded the congruent measure (z = 1.94, p = .02) – while Income coefficients
from analyses a-priori on the basis of negligent responding, as indicated were not significantly different (z = 0.15, p = .44).
by a combination of negligent response checks, unvarying responses,
and survey completion in under 10 min. Thus, the sample used for
158
Table 6 Job Satisfaction, were debriefed, and compensated with $4.00.

K-fold based standardized correlation coefficients for Study 2.
Outcome Congruent General 7.2. Measures
Job satisfaction .37a .32 7.2.1. Uncontextualized Big Five measure

Income .17 .14
As in Study 1, the Big Five personality traits were assessed with an
Conservatism .17 .20
Health .17 .26
unaltered version of the IPIP-NEO (100 items total, 20 items for C, 20
items for E).
Note. Standardized r values for context-congruent (Congruent) and un-
contextualized (General) measures of conscientiousness. 7.2.2. Item-contextualized measures
a
Congruent coefficient is significantly larger than uncontextualized coeffi- These measures were identical to the contextualized measures used
cient (p < .05). in Study 1 with the exception that E items were now also contextualized
(following the Study 1 procedure). For instance, the E item “I make
7. Study 3 friends easily”, was translated into the workplace item “I make friends
easily at work”, and the school item “I made friends easily at school”.
In order to extend our findings beyond self-report measures, and There were 20 items per contextualization condition (work or school)
account for their potentially confounding effects, Study 3 considered for both E and C (80 items total).
whether FOR effects emerge in non-self-report measures. Specifically,
this study tested whether FOR effects emerge in the prediction of trait- 7.2.3. Instruction-contextualized measures
relevant implicit attitudes, as measured by the Implicit Associations E and C items from the IPIP-NEO Domain questionnaire (Goldberg
Test (IAT). Additionally, Study 3 was designed to test whether FOR et al., 2006) were preceded with instructions that imposed a workplace
effect emerge for other personality traits – namely, Extraversion (E) – frame of reference (see Appendix A for details). This contextualization
and under other contextualization conditions – i.e., instruction-based approach has been used to successfully produce FOR effects elsewhere
contextualization. (Bing et al., 2004; Hunthausen et al., 2003). Participants indicated their
The outcome variables for this study were derived from self-report agreement using a 5-point Likert scale (1 = strongly disagree,
and IAT of Job Satisfaction. Although FOR effects were modest for Job 5 = strongly agree).
Satisfaction in both preceding studies, this outcome variable was
nevertheless favoured because it is a well-known correlate of both C 7.2.4. Job satisfaction
and E (Judge et al., 2002), that has been reliably measured through Self-reported Job Satisfaction was assessed with 9 job dissatisfaction
implicit association tests (Boyd, 2010; Boyd, 2017). (reverse coded) items from the Brayfield and Rothe (1951) Index.
Measuring dissatisfaction provided a contrast and extension to the Job
7.1. Procedure Satisfaction findings from Study's 1 and 2, which were based on scales
using positive items only. The scale included items such as “my job is
397 participants recruited through Amazon's Mechanical Turk pretty uninteresting”, which were rated using a 5-point Likert scale
(MTurk) completed our survey. Twenty participants (5%) were ex- (1 = strongly disagree, 5 = strongly agree). This measure has been shown
cluded from analyses a priori on the basis of negligent responding, as to significantly correlate with implicit measures of Job Satisfaction
indicated by negligent response checks. Thus, the final sample for (Boyd, 2017).
analysis was comprised of 378 participants (50.3% female; Mage = 38.8,
SD = 11.2). 7.2.5. Implicit job satisfaction (IAT-JS)
Participation was restricted to MTurk users who were using laptop A single-target IAT was programmed and administered through
or desktop computers. Before commencing the study, participants were Inquisit (2016). The test was modelled after two existing validated IAT
asked to minimize distractions and noise and confirm a disturbance-free measures of Job Satisfaction (Boyd, 2010; Boyd, 2017). In order to
environment. Participants who did not provide confirmation were di- maximize the relevance of the test, participants manually entered five
verted from the survey. keywords pertaining to their current workplace role. These keywords
After providing consent to complete a variety of questionnaires and served as the targets for association (positive or negative). A detailed
“word-matching” tasks, participants were informed that despite ap- explanation of this measure can be found in Appendix A.
parent repetition, the survey portions contained important contextual
differences. The survey began with a brief demographics questionnaire 7.3. Study 3: Results
(age, gender, nationality), followed by a block of workplace-con-
textualized personality items and a block of school-contextualized All participants had completed a high-school degree, (7.4%) were
items. Both contextualized blocks contained subsets of C and E items students at the time of the survey (3.4% part time). Thirty-three par-
which were randomized – the order of the subsets was also randomized. ticipants (8.7%) were unemployed at the time of the survey, and five of
These blocks were followed by a brief work-history questionnaire, and these participants had no employment experience. These participants
an instruction-contextualized measure of workplace-C and E. Next, were excluded from any analyses based on workplace experience (i.e.,
participants completed a brief academic history questionnaire, followed Job Satisfaction, IAT, workplace-C, and workplace-E).
by an uncontextualized measure of the Big Five personality traits. The Before analyses were conducted, IAT scores were excluded on the
uncontextualized measure was followed by a self-report measure of Job basis of keyword anomalies. Specifically, five participants provided
Satisfaction. The intentional position of this block minimized transfer nonsensical keywords (e.g., “asdf”), and 53 participants provided in-
effects from the workplace measures of personality. admissible entries – i.e., missing keywords or providing job evaluations
Following the survey portion of the study, participants were in- (e.g., fantastic) rather than job roles. Thus, all analyses involving IAT
formed of the ensuing “word-matching” task and asked to compose a scores utilized a restricted sample of 315 participants. IAT d scores –
brief open-ended essay describing their work. This essay exercise has which range from −1 (negative association) to 1 (positive association)
been used in other research to prime the target object (i.e., workplace) – were normally distributed around a mean of 0.16 (SD = 0.35), sug-
in implicit association paradigms (Boyd, 2017), as it appears to increase gesting a slight positivity bias in the sample. All self-report measures in
the relationship between implicit measures and behaviors (Perugini & this study exhibited excellent internal consistency, as indicated by
Prestwich, 2007). Finally, participants completed the IAT measure of Cronbach's alpha levels above 0.9 (see Table 7).
159
Table 7 Table 9
Scale parameters of all measures used in Study 3. K-fold based standardized correlation coefficients for Study 3.
Alpha Mean SD Model Congruent items Congruent Incongruent items General
instructions
Item-based Work C 0.94 4.39 0.58
Item-based School C 0.97 3.86 0.98 JS ~ E .47a,b .45a,b .29 .40
Item-based Work E 0.95 3.33 0.92 JS ~ C .54a,c .45a .35 .53
Item-based School E 0.97 3.06 1.10 IAT ~ E .24 .24 .21 .23
Instruction-based Work C 0.91 4.08 0.62 IAT ~ C .25 .21 .23 .24
Instruction-based Work E 0.94 3.36 0.83
Conscientiousness 0.95 4.10 0.72 Note. Standardized r values for item-level context-congruent (Congruent Items),
Extraversion 0.96 3.17 0.98 instruction-level context-congruent (Congruent Instructions), item-level con-
Job Satisfaction 0.91 3.76 0.96 text-congruent (Incongruent Items) and uncontextualized (General) trait mea-
sures. Models represent predictions of job satisfaction from extraversion
(JS ~ E) and conscientiousness (JS ~ C), and IAT measured job satisfaction
Correlations between all dependent and independent variables fol- from extraversion (IAT ~ E) and conscientiousness (IAT ~ C).
lowed the expected pattern (see Table 8). Uncontextualized C corre- a
Congruent coefficient is significantly larger than incongruent coefficient
lated strongly with item-based workplace-C (r = 0.78) and instruction- (p < .05).
b
based workplace-C (r = 0.79), which in turn correlated highly with one Congruent coefficient is significantly larger than uncontextualized coeffi-
another (r = 0.81). Likewise, uncontextualized E correlated strongly cient (p < .05).
c
with item-based workplace-E (r = 0.87) and instruction-based work- Item-level coefficient is significantly larger than instruction-level coeffi-
place-E (r = 0.85), which in turn correlated highly with one another cient (p < .05).
(r = 0.89). Self-rated Job Satisfaction correlated significantly with IAT
Job Satisfaction (r = 0.34, p < .001). 8. Discussion
Again, the congruency hypothesis (H1) was tested using K-fold cross
validation regression analysis. Sixteen K-folds regressions were con- Across three studies, we sought to determine whether FOR effects
ducted predicting either Job Satisfaction or IAT-JS from E or C for the 4 are broad and meaningful enough to justify the instrumental use of
contextualization conditions – i.e., uncontextualized, context-congruent contextualization. We hypothesized that FOR effects would emerge any
item based (workplace), context-congruent instruction based (work) time there is a congruency between the contexts of outcome and trait
and context incongruent C (school). Each K-fold regression split the measures (H1). This hypothesis found support across all three studies,
data into 10 training and testing folds, with cross validation repeated where trait-outcome correlations were found to be greater for all con-
100 times. text-congruent measures, compared to uncontextualized measures and
In line with H1, predictive coefficients were larger for all item-based context-incongruent measures. Because we considered a variety of
context-congruent measures. However, Hittner's test for dependent contexts (i.e., school, work, and relationships), and a wide variety of
overlapping correlations revealed only Job Satisfaction correlations to outcome measures, we have reason to believe that FOR effects are a
be significantly different (Table 9). Specifically, the correlation coeffi- broadly applicable phenomenon (H2). Moreover, this phenomenon
cient for item-based context-congruent E was significantly larger than appears to be independent of outcome-trait item equivalence (H3), as
the coefficient for uncontextualized E (z = 2.67, p = .004). The same uncontextualized and contextualized scales exhibit measurement in-
pattern was found for instruction-based context-congruent E (z = 2.07, variance (Study 1).
p = .019). While FOR effects can be broadly coerced by means of context-
Instruction-based context congruent measures did not consistently congruence, the utility of contextualization does not appear to be uni-
outperform uncontextualized measures, however, with such workplace- versal. That is to say, increases in effect size by means of con-
C variables being worse predictors of both IAT-SR and Job Satisfaction. textualization may be negligible, as we found in the predictions of four
Item-based context-congruent measures outperformed instruction- of the thirteen context-congruent outcomes we considered: Income,
based context-congruent measures on all predictions. Moreover, Relationship Performance, Relationship Vice, and IAT-JS. Moreover,
Hittner's test revealed this pattern to be significant in predictions of Job the magnitude of FOR effects appeared to be inconsistent for Job
Satisfaction from C (z = 3.27, p = .001). Satisfaction, which was measured across all three studies. This incon-
sistency suggests that FOR effects are not contingent on degree of
congruency alone. A likely source of this inconsistency appears to be
Table 8
Pearson's correlations between trait and outcome measures in Study 3.
CWork CSchool EWork ESchool Instruction CWork Instruction EWork C E JS
CSchool .43
EWork .36 .32
ESchool .25 .42 .60
Instruction CWork .81 .43 .30 .24
Instruction EWork .39 .35 .89 .61 .37
C .78 .52 .40 .32 .80 .47
E .33 .32 .86 .69 .30 .84 .38
JS .52 .32 .44 .23 .42 .43 .50 .37
IAT .19 .15⁎⁎ .16⁎⁎ .11a .11a .17⁎⁎ .18⁎⁎ .14⁎ .35
Note. Conscientiousness (C), Extraversion (E), work contextualized C (CWork), school contextualized C (CSchool), instruction-level work-contextualized C
(Instruction CWork), instruction-level work-contextualized E (Instruction EWork), self-reported job satisfaction (JS), IAT measured job satisfaction (IAT). Unless
otherwise indicated, all correlations are significant at p < .001.
a
ns.
⁎
p < .05.
⁎⁎
p < .01.
160
range restriction, as FOR effects appeared to be largest when context- C, which was based on a multi-dimensional self- report of work atti-
congruent trait measures had the largest range. This is best exhibited in tudes and tendencies. Thus, despite being structurally most similar to
Study 3 where the correlation between Job Satisfaction and personality the procrastination measure, the school-specific measure of C corre-
was significantly larger for workplace-E (min score = 1.2, max lated equally highly with GPA, presumably due to a similar overlap in
score = 5), but not workplace-C (min score = 2.7, max score = 5). demands and incentives.
Further research is needed to elucidate why FOR effects only emerge A major theoretical implication of our demand/incentive theory is
in context-congruent models. In line with Identity Theory (Stryker, that overlap in demands and incentives contributes to the overall cri-
1986), we posit that context-congruence aligns the demands and in- terion related validity of all uncontextualized measures. That is to say,
centives implied in trait and outcome measures, thereby facilitating uncontextualized measures inherently encode some information about
overlap between trait and outcome scores. For instance, if role-specific contextual and cross-contextual demands and incentives, and the de-
demands are low enough, and incentives high enough, a person who is gree to which an outcome variable jointly encodes these demands/in-
otherwise a slacker may be found to be more Conscientious (role-spe- centives contributes to the magnitude of association. Thus, C may be
cific trait), and subsequently a high-performance worker (outcome). reliably found to predict job satisfaction, job performance, and work-
Likewise, if role-specific demands are too high, or incentives too low, an place deviance, because uncontextualized measures of C inherently
otherwise hardworking person may be found to be less Conscientious in encode some information regarding workplace demands and incentives
the working environment (role-specific trait), and as an outcome, – while contextualized scales encode this information better.
deemed a poor worker (Maddox & Markman, 2010). By relying on Although we found broad support for our hypotheses, our findings
uncontextualized measures of C, we can neglect information that is were nevertheless subject to some limitations. Firstly, generalizations
crucial to role-specific performance – namely demands and incentives – from our data may be impeded by our decision to use the online MTurk
and thereby undermine predictions of role-related outcomes. platform to conduct this study. However, it is important to note that
Although it is reasonable to assume that a Conscientious student data provided by MTurk users does not appear to differ meaningfully
will perform well in the workplace, it is not clear that their from community samples in terms of validity, or in terms of personality
Conscientious behaviors will persist under changing demands and in- (McCredie & Morey, 2018), which were the most important sample
centives, compared to a generally Conscientious person. This may ex- characteristics for our purposes. Similarly, although we have found
plain why workplace-C is superior to traditional measures of C when evidence for broad contextualization effects for a number of diverse
predicting workplace performance, while academic-C exhibits this outcome measures, we did not consider task-based or observational
benefit only in the prediction of academic performance (Farmer, 1999; outcome measures. While previous research has demonstrated con-
Lievens et al., 2008; Seiler, 2011). Applied to our data, the demand/ textualization effects for these kinds of measures – both task-based
incentive interpretation of FOR effects suggests, for instance, that aca- (Holtrop, Born, de Vries, & de Vries, 2014) and observational (Bing
demic-C exclusively improved the prediction of school satisfaction be- et al., 2014) – further research is needed in order to test the applic-
cause effort (i.e., C) at school and satisfaction at school both depend on ability of our demand/incentive theory with these various measures in
the demands and incentives of the classroom. This explanation can be a variety of contexts.
tested in future experiments by manipulating demands and incentives Finally, although our congruence guideline is purported to apply to
and subsequently testing for FOR effects. any contextualized measures, herein we have only tested our guideline
If FOR effects arise when the demands and incentives implied in using measures of C and E. As such, it is possible that future research
trait and outcome measures are aligned, any factors that disturb this using other traits or constructs may find that FOR effects do not emerge
alignment may reduce FOR effects. This idea may explain the negligible under alternative conditions. Logically, however, our FOR guideline is
FOR effects we uncovered in the prediction of relationship perfor- not unique to C or E, as context-congruence is applicable to any con-
mance/vices. For instance, the social desirability of relationship fidelity struct that can be contextualized.
may have caused participants to downplay their relationship vices and
thereby withhold crucial information regarding relationship demands
and incentives that would otherwise be shared with the contextualized 9. Conclusions
trait measure. Likewise, implicit Job Satisfaction may have been asso-
ciated with negligible FOR effects because implicit attitudes are more The confirmation of our hypotheses has implications for any do-
strongly based on affective tendencies and cognitive biases (Rudman, mains that make use of personality measurement. In practical domains,
2004) than they are based on demands and incentives. Moreover, FOR for instance, clinicians who use personality scores to predict therapeutic
effects may be stronger following item-level contextualization than outcomes may generate more accurate predictions by contextualizing
instruction-level contextualization – as found in Study 3, and preceding trait measures to the clinical setting. Simply by specifying a context for
research (Lievens et al., 2008) – because contextualized items restrict a personality measure, the predictive validity of context-congruent
item interpretations (Bing et al., 2014) and thereby encode more role- outcomes can be significantly and meaningfully increased by up to
specific information. 30%. The likelihood of enhancing predictive validity may be increased
Likewise, our demand/incentive theory can be used to explain the by utilizing item-level (over instruction-level) contextualization, redu-
largest FOR effects, which should arise when demands and incentives cing range restriction and response biases, and maximizing overlap of
are most tightly aligned for predictor and outcome variables. In other demands and incentives encoded by predictor and outcome variables.
words, the more role-specific information an outcome variable encodes,
the more beneficial it is to include role-specific information in the
predictor variable. The largest FOR effects across all three studies were Declaration of conflicting interests
for the prediction of school performance (GPA) and school vice (pro-
crastination). For both of these variables, school-specific C explained The authors declared no potential conflicts of interest with respect
34% more variance than general C. Whereas GPA was based on a to the research, authorship, or publication of this article.
unidimensional self-report of performance evaluation history, procras-
tination scores were based on a multi-dimensional self-report of work
attitudes and tendencies. Nevertheless, both measures encode the same Acknowledgments
demands (e.g., parental expectations) and incentives (e.g., academic
success), and appear to do so with minimal error. These same demands Preparation of this manuscript was supported by Grant 533803 from
and incentives are clearly implicated in the school-specific measure of the Social Sciences and Humanities Research Council of Canada.
161
Appendix A 12. Make plans to finish work and stick to them

13. Am always prepared for projects at work
20 IPIP-NEO Conscientiousness Items Contextualized for 14. Need a push to start projects at work
Relationship, School, and Workplace Settings. 15. Don't pay attention during work hours
The following characteristics describe how people act in relationships. 16. Do just enough work to get by at work
Please indicate how much you personally agree with each statement. 17. Make a mess of things at work
18. Complete workplace tasks successfully
1. Finish what I start for my partner 19. Do things according to a plan at work
2. Leave things unfinished with my partner 20. Don't see things through at work
3. Need a push to start things with my partner
4. Make plans to help my partner and stick to them Instruction-Based Contextualization.
5. Do just enough work to get by with my partner “We would like to ask you some more questions about your experiences
6. Am always prepared for anniversaries in the workplace. The following are traits that may be expressed in the
7. Waste my time with my partner workplace. Please take a moment to think about what you are like in the
8. Am exacting in my relationship efforts workplace, and indicate how well each statement describes your workplace
9. Make a mess of things with my partner personality.”
10. Carry out my plans to help my partner
11. Complete partner mandated chores successfully 1. Carry out my plans
12. Do things according to a plan for my relationship 2. Finish what I start
13. Follow through with my plans for my partner 3. …
14. Don't pay attention when I am with my partner
15. Mess things up in my relationships Implicit Job Satisfaction (IAT-JS).
16. Find it difficult to start things for my partner A single-target IAT was programmed and administered through
17. Get favors for my partner done right away Inquisit (2016). The test was modelled after two existing validated IAT
18. Don't see things through with my partner measures of Job Satisfaction (Boyd, 2010; Boyd, 2017). In order to
19. Pay attention to details when my partner is talking maximize the relevance of the test, participants manually entered five
20. Shirk my relationship duties keywords pertaining to their current workplace role. These keywords
served as the targets for association (positive or negative).
The following characteristics describe how people act at school. Please The task consisted of five trials in which participants were required
indicate how much you personally agree with each statement. to quickly pair their target words (e.g., “data entry”), positive words
(e.g., “great”) or negative words (e.g., “miserable”) with the labels
1. Made a mess of things at school “good or work” or “bad or work” by pressing the “e” or “i” key as
2. Didn't see things through at school quickly as possible. Incorrect associations were demarcated with a red
3. Got my school work done right away “x”. On all trails, the stimuli were presented in the middle of the screen
4. Made plans to finish school work and stuck to them and the sorting labels (“good” or “bad”) were located in the top right
5. Wasted my time at school and left of the screen, respectively. On trial 1, participants were in-
6. Paid attention to details when I was doing school work troduced to the matching procedure using only the positive and nega-
7. Found it difficult to start school projects tive keywords. The subsequent trail pairs (2 and 3, and 4 and 5) were
8. Finished what I started at school counter balanced and began with a practice set (i.e., trials 2 and 3). In
9. Needed a push to start school projects or homework one trial pair, participants were required to correctly match target,
10. Did things according to a plan at school positive, and negative words with the labels “good or work” (left side)
11. Followed through with my plans at school and “bad” (right side). In the remaining trial pair, and participants had
12. Was always prepared for projects at school to match words with the labels “good” (left side) and “bad or work”
13. Did just enough work to get by at school (right side).
14. Completed academic tasks successfully
15. Didn't pay attention during school hours References
16. Shirked my academic duties
17. Left things unfinished at school Ahmetoglu, G., Swami, V., & Chamorro-Premuzic, T. (2010). The relationship between
18. Messed things up at school dimensions of love, personality, and relationship length. Archives of Sexual Behavior,
39, 1181–1190.
19. Carried out my plans at school Ajzen, I. (1987). Attitudes, traits and actions: Dispositional prediction of behavior in
20. Was exacting in my efforts at school personality and social psychology. In L. Berkowitz (Ed.). Advances in experimental and
social psychology (pp. 1–63). New York: Academic Press.
Barrick, M. R., & Mount, M. K. (1991). The Big Five personality dimensions and job
The following characteristics describe how people act at work. Please performance: A meta-analysis. Personnel Psychology, 44, 1–26.
indicate how much you personally agree with each statement. Beaujean, A. A. (2014). Latent variable modeling using R: A step-by-step guide. New York,
NY: Routledge/Taylor & Francis Group.
Bennett, R. J., & Robinson, S. L. (2000). Development of a measure of workplace de-
1. Waste my time at work viance. Journal of Applied Psychology, 85, 349–360.
2. Am exacting in my job at work Berry, C. M., Ones, D. S., & Sackett, P. R. (2007). Interpersonal deviance, organizational
3. Get my work done right away deviance, and their common correlates: A review and meta-analysis. Journal of
Applied Psychology, 92, 410–424.
4. Mess things up at work
Bing, M. N., Davison, H. K., & Smothers, J. (2014). Item-level frame-of-reference effects in
5. Leave things unfinished at work personality testing: An investigation of incremental validity in an organizational
6. Carry out my plans at work setting. International Journal of Selection and Assessment, 22, 165–178.
7. Find it difficult to start projects at work Bing, M. N., Whanger, J. C., Davison, H. K., & VanHook, J. B. (2004). Incremental validity
of the frame-of-reference effect in personality scale scores: A replication and exten-
8. Shirk my duties at work sion. Journal of Applied Psychology, 89, 150–157.
9. Pay attention to details when I am working Bogg, T., & Roberts, B. W. (2004). Conscientiousness and health-related behaviours: A
10. Follow through with my plans at work meta-analysis of the leading behavioural contributors to mortality. Psychological
Bulletin, 130, 887–919.
11. Finish what I start at work
162
Boyd, B. (2010). An implicit measure of job satisfaction (Unpublished Master's Thesis)New Liu, M. (2014). Contextualized extraversion and its change in cross-cultural adjustment
York, NY: Baruch College. (1564334). (Available from ProQuest Dissertations & Theses Global. (1614532098)).
Boyd, B. (2017). Personalizing an implicit measure of job satisfaction (Order No. 10265612). Maddox, W. T., & Markman, A. B. (2010). The motivation-cognition interface in learning
(Available from ProQuest Dissertations & Theses Global. (1898765668)). and decision making. Current Directions in Psychological Science, 19, 106–110.
Brayfield, A. H., & Rothe, H. F. (1951). An index of job satisfaction. Journal of Applied Mason, W., & Suri, S. (2012). Conducting behavioral research on Amazon's Mechanical
Psychology, 35, 307–311. Turk. Behavior Research Methods, 44, 1–23.
Buhrmester, M., Kwang, T., & Gosling, S. D. (2011). Amazon's Mechanical Turk: A new McCredie, M. N., & Morey (2018). Who are the Turkers? A characterization of MTurk
source of inexpensive, yet high-quality, data. Perspectives on Psychological Science, workers using the personality assessment inventory. Assessment.
6, 3–5. McNichols, C. W., Stahl, M. J., & Manley, T. R. (1978). A validation of Hoppock's job
Carlos, V. S., & Rodrigues, R. G. (2016). Development and validation of a self-reported satisfaction measure. Academy of Management Journal, 21, 737.
measure of job performance. Social Indicators Research, 126, 279–307. Nicholls, J. G., Licht, B. G., & Pearl, R. A. (1982). Some dangers of using personality
Ching, C. M., Church, A. T., Katigbak, M. S., Locke, K. D., de Vargas-Flores, J., Ibáñez- questionnaires to study personality. Psychological Bulletin, 92(3), 572–580.
Reyes, J., ... Roslan, N. A. (2013). Cross-cultural generalizability of the Personality Ozer, D. J., & Benet-Martínez, V. (2006). Personality and the prediction of consequential
and Role Identity Structural Model (PRISM): Implications for trait and cultural psy- outcomes. Annual Review of Psychology, 57, 401–421.
chology. Journal of Research in Personality, 47, 894–907. Pace, V. L., & Brannick, M. T. (2010). Improving prediction of work performance through
Conner, M., & Abraham, C. (2001). Conscientiousness and the theory of planned beha- frame-of-reference consistency: Empirical evidence using openness to experience.
vior: Toward a more complete model of the antecedents of intentions and behavior. International Journal of Selection and Assessment, 18, 230–235.
Personality and Social Psychology Bulletin, 27, 1547–1561. Paolacci, G., & Chandler, J. (2014). Inside the Turk: Understanding Mechanical Turk as a
Consiglio, C., Alessandri, G., Borgogni, L., & Piccolo, R. F. (2013). Framing work com- participant pool. Current Directions in Psychological Science, 23, 184–188.
petencies through personality traits: The Big Five competencies grid. European Paunonen, S. V., & Ashton, M. C. (2001). Big five factors and facets and the prediction of
Journal of Psychological Assessment, 29, 162–170. behavior. Journal of Personality and Social Psychology, 81(3), 524–539.
Cook, M., Young, A., Taylor, D., & Bedford, A. (1998). Personality correlates of alcohol Perugini, M., & Prestwich, A. (2007). The gatekeeper: Individual differences are keys in
consumption. Personality and Individual Differences, 24, 641–647. the chain from perception to behaviour. European Journal of Personality, 21, 303–317.
Davie, J. W. (2005). The problem of relevance (order no. MR09762). (Available from Poropat, A. E. (2009). A meta-analysis of the five-factor model of personality and aca-
ProQuest Dissertations & Theses Global. (304990472)). demic performance. Psychological Bulletin, 135, 322–338.
DeYoung, C. G., Quilty, L. C., & Peterson, J. B. (2007). Between facets and domains: 10 Roberts, B. W., Jackson, J. J., Fayard, J. V., Edmonds, G., & Meints, J. (2009).
aspects of the Big Five. Journal of Personality and Social Psychology, 93, 880–896. Conscientiousness. In M. Leary, & R. Hoyle (Eds.). Handbook of individual differences in
Farmer, S. J. (1999). Enhancing criterion-related validity by assessing context-specific traits: social behavior (pp. 369–381). New York, NY: Guilford Press.
Implications of conditional dispositions for personnel selection (9947493). (Available Roenicke, C. C. (2013). Extending the frame of reference effect beyond conscientiousness
from ProQuest Dissertations & Theses Global. (304597555)). (3564859). (Available from ProQuest Dissertations & Theses Global. (1410336008)).
Goldberg, L. R. (2017, February 22). Index of 274 labels for 463 IPIP scales. Retrieved Rudman, L. A. (2004). Sources of implicit attitudes. Current Directions in Psychological
from: http://ipip.ori.org/newIndexofScaleLabels.htm. Science, 13(2), 79–82.
Goldberg, L. R., Johnson, J. A., Eber, H. W., Hogan, R., Ashton, M. C., Cloninger, C. R., & Schaffhuser, K., Allemand, M., & Martin, M. (2014). Personality traits and relationship
Gough, H. C. (2006). The International Personality Item Pool and the future of public- satisfaction in intimate couples: Three perspectives on personality. European Journal
domain personality measures. Journal of Research in Personality, 40, 84–96. of Personality, 28, 120–133.
Hayes, N., & Joseph, S. (2003). Big 5 correlates of three measures of subjective well- Schmit, M. J., Ryan, A. M., Stierwalt, S. L., & Powell, A. B. (1995). Frame-of-reference
being. Personality and Individual Differences, 34, 723–727. effects on personality scale scores and criterion-related validity. Journal of Applied
Heidemeier, H. (2005). Self and supervisor ratings of job-performance: Meta-analyses and a Psychology, 80, 607–620.
process model of rater convergence. Universität Erlangen-Nürnberg. Retrieved from Schmitt, D. P. (2004). The Big Five related to risky sexual behaviour across 10 world
https://opus4.kobv.de/opus4fau/frontdoor/deliver/index/docId/143/file/ regions: Differential personality associations of sexual promiscuity and relationship
Heidemeier_Diss. infidelity. European Journal of Personality, 18, 301–319.
Heller, D., Ferris, D., Brown, D., & Watson, D. (2009). The influence of work personality Schmitt, D. P., & Buss, D. M. (2000). Sexual dimensions of person description: Beyond or
on job satisfaction: Incremental validity and mediation effects. Journal of Personality, subsumed by the Big Five? Journal of Research in Personality, 34, 141–177.
77(4), 1051–1084. Schmitt, N. (2014). Personality and cognitive ability as predictors of effective perfor-
Hendrick, S. S. (1988). A generic measure of relationship satisfaction. Journal of Marriage mance at work. Annual Review of Organizational Psychology and Organizational
and the Family, 50, 93–98. Behavior, 1, 45–65.
Hirschfeld, G., & von Brachel, R. (2014). Multiple-group confirmatory factor analysis in Seiler, S. N. (2011). The impact of item characteristics on contextualized personality assess-
R—A tutorial in measurement invariance with continuous and ordinal indicators. ment (3503898). (Available from ProQuest Dissertations & Theses Global.
Practical Assessment, Research & Evaluation, 19(7), 12. (1010625066)).
Hittner, J. B., May, K., & Silver, N. C. (2003). A Monte Carlo evaluation of tests for Shaffer, J. A., & Postlethwaite, B. E. (2012). A matter of context: A meta-analytic in-
comparing dependent correlations. The Journal of General Psychology, 130, 149–168. vestigation of the relative validity of contextualized and uncontextualized personality
Holtrop, D., Born, M. P., de Vries, A., & de Vries, R. E. (2014). A matter of context: A measures. Personnel Psychology, 65, 445–494.
comparison of two types of contextualized personality measures. Personality and Sibley, C. G., Osborne, D., & Duckitt, J. (2012). Personality and political orientation:
Individual Differences, 68, 234–240. Meta-analysis and test of a threat-constraint model. Journal of Research in Personality,
Holtrop, D., Born, M. P., & de Vries, R. E. (2014). Predicting performance with con- 46, 664–677.
textualized inventories, no frame-of-reference effect? International Journal of Selection Slatcher, R. B., & Vazire, S. (2009). Effects of global and contextualized personality on
and Assessment, 22, 219–223. relationship satisfaction. Journal of Research in Personality, 43, 624–633.
Hu, L., & Bentler, P. M. (1998). Fit indices in covariance structure modeling: Sensitivity to Steel, P. (2007). The nature of procrastination: A meta-analytic and theoretical review of
underparameterized model misspecification. Psychological Methods, 3(4), 424–453. quintessential self-regulatory failure. Psychological Bulletin, 133, 65–94.
Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure Steiger, J. H. (1980). Tests for comparing elements of a correlation matrix. Psychological
analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: Bulletin, 87, 245–251.
A Multidisciplinary Journal, 6(1), 1–55. Stryker, S. (1986). Identity theory: Developments and extensions. In K. Yardley, & T.
Hunthausen, J. M., Truxillo, D. M., Bauer, T. B., & Hammer, B. L. (2003). A field study of Honess (Eds.). Self and identity (pp. 89–104). New York: Wiley.
frame-of-reference effects on personality test validity. Journal of Applied Psychology, Takahashi, Y., Edmonds, G. W., Jackson, J. J., & Roberts, B. W. (2013). Longitudinal
88, 545–551. correlated changes in conscientiousness, preventative health-related behaviors, and
Inquisit 5 [Computer software] (2016). Retrieved from https://www.millisecond.com. self-perceived physical health. Journal of Personality, 81, 417–427.
Johnson, J. A. (2014). Measuring thirty facets of the Five-factor model with a 120-item Vispoel, W. P., Morris, C. A., & Kilinc, M. (2018). Using generalizability theory to dis-
public domain inventory: Development of the IPIP-NEO-120. Journal of Research in attenuate correlation coefficients for multiple sources of measurement error.
Personality, 51, 78–89. Multivariate Behavioral Research, 1–21.
Judge, T. A., Heller, D., & Mount, M. K. (2002). Five-factor model of personality and job Weber, M., & Huebner, E. S. (2015). Early adolescents' personality and life satisfaction: A
satisfaction: A meta-analysis. Journal of Applied Psychology, 87, 530–541. closer look at global vs. domain-specific satisfaction. Personality and Individual
Judge, T. A., Higgins, C. A., Thoresen, C. J., & Barrick, M. R. (1999). The Big Five per- Differences, 83, 31–36.
sonality traits, general mental ability, and career success across the life span. Wood, D. (2007). Using the PRISM to compare the explanatory value of general and role-
Personnel Psychology, 52, 621–652. contextualized trait ratings. Journal of Personality, 75, 1103–1126.
Lee, K., & Ashton, M. C. (2004). Psychometric properties of the HEXACO personality Wood, D., & Roberts, B. W. (2006). Cross-sectional and longitudinal tests of the
inventory. Multivariate Behavioral Research, 39, 329–358. Personality and Role Identity Structural Model (PRISM). Journal of Personality, 74,
Lemieux, R., & Hale, J. L. (1999). Intimacy, passion, and commitment in young romantic 779–809.
relationships: Successfully measuring the triangular theory of love. Psychological Yarkoni, T., & Westfall, J. (2017). Choosing prediction over explanation in psychology:
Reports, 85, 497–503. Lessons from machine learning. Perspectives on Psychological Science, 12(6),
Lievens, F., De Corte, W., & Schollaert, E. (2008). A closer look at the frame-of-reference 1100–1122.
effect in personality scale scores and validity. Journal of Applied Psychology, 93, Yockey, R. D. (2016). Validation of the short form of the academic procrastination scale.
268–279. Psychological Reports, 118, 171–179.
163
View publication stats

Contextualization As A Means To Improve The Predictive Validity of Personality Models

Загружено:

Сведения о документе

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

Contextualization As A Means To Improve The Predictive Validity of Personality Models

Загружено:

Авторское право:

Доступные форматы

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Contextualization as a means to improve the predictive validity of personality

Article in Personality and Individual Differences · July 2019

Victor Swift Jordan B Peterson

SEE PROFILE SEE PROFILE

The psychological significance of the Biblical stories: http://bit.ly/2rMHp08 View project

The user has requested enhancement of the downloaded file.

Contents lists available at ScienceDirect

Personality and Individual Diﬀerences

Contextualization as a means to improve the predictive validity of T

Work School Relation Work School Relation Work Relation

Conﬁgural 0.829(0.06) 0.883(0.05) 0.910(0.05) 0.815(0.06) 0.886(0.05) 0.871(0.06) 0.829(0.06) 0.837(0.06)

Note. Values represent CFI (SRMR).

Table 3 models. Moreover, none of the context-incongruent measures demon-

Table 6 Job Satisfaction, were debriefed, and compensated with $4.00.

Job satisfaction .37a .32 7.2.1. Uncontextualized Big Five measure

Appendix A 12. Make plans to ﬁnish work and stick to them

View publication stats

Вам также может понравиться