
Showing papers in "Educational and Psychological Measurement in 2002"


Journal ArticleDOI
TL;DR: A reliability generalization study of Spielberger's State-Trait Anxiety Inventory (STAI) reviewed and compared a total of 816 research articles that used the STAI between 1990 and 2000.
Abstract: A reliability generalization study for Spielberger’s State-Trait Anxiety Inventory (STAI) was conducted. A total of 816 research articles utilizing the STAI between 1990 and 2000 were reviewed and ...

855 citations


Journal ArticleDOI
TL;DR: This article reports the development of a 12-item Likert-type measure of collective efficacy in schools, designed to assess the extent to which a faculty believes in its conjoint capability.
Abstract: The present study reports on the development of a 12-item Likert-type measure of collective efficacy in schools. Designed to assess the extent to which a faculty believes in its conjoint capability...

273 citations


Journal ArticleDOI
TL;DR: A reliability generalization (RG) study was conducted for the Marlowe-Crowne Social Desirability Scale (MCSDS), the most commonly used tool designed to assess social desirability bias.
Abstract: A reliability generalization (RG) study was conducted for the Marlowe-Crowne Social Desirability Scale (MCSDS). The MCSDS is the most commonly used tool designed to assess social desirability bias ...

251 citations


Journal ArticleDOI
TL;DR: The authors trace the histories of a variety of effect size indices pertaining to relationship, group differences, and group overlap, considering multivariable as well as univariate indices.
Abstract: Depending on how one interprets what an effect size index is, it may be claimed that its history started around 1940, or about 100 years prior to that. An attempt is made in this article to trace histories of a variety of effect size indices. Effect size bases discussed pertain to (a) relationship, (b) group differences, and (c) group overlap. Multivariable as well as univariate indices are considered in reviewing the histories.

215 citations
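The group-difference and relationship families of indices that the article traces can be illustrated with a short sketch. The following is an illustration using standard textbook formulas and made-up data, not computations drawn from the article itself: Cohen's d (a group-difference index) and its conversion to a correlation-type index under the equal-group-size approximation.

```python
import math

def cohens_d(group1, group2):
    """Standardized mean difference using a pooled standard deviation."""
    n1, n2 = len(group1), len(group2)
    m1 = sum(group1) / n1
    m2 = sum(group2) / n2
    # Unbiased sample variances
    v1 = sum((x - m1) ** 2 for x in group1) / (n1 - 1)
    v2 = sum((x - m2) ** 2 for x in group2) / (n2 - 1)
    s_pooled = math.sqrt(((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2))
    return (m1 - m2) / s_pooled

def d_to_r(d):
    """Convert d to a correlation-type effect size (equal-n approximation)."""
    return d / math.sqrt(d ** 2 + 4)

d = cohens_d([3, 4, 5], [1, 2, 3])  # means 4 and 2, pooled SD 1, so d = 2.0
print(d, d_to_r(d))  # r is about .71 here
```

The conversion shows how the relationship and group-difference "bases" discussed in the article describe the same underlying effect on different scales.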


Journal ArticleDOI
TL;DR: The authors examined the validity of scores on the Student Adaptation to College Questionnaire (SACQ) in a sample of European university students; concurrent validity was established through significant correlations in the expected direction with alternative measures of student adjustment (academic motivation, loneliness, depression, and general adjustment to university).
Abstract: This study represents the first attempt to examine the validity of scores on the Student Adaptation to College Questionnaire (SACQ) in a sample of European university students. Concurrent validity was established through significant correlations in the expected direction with alternative measures of student adjustment (academic motivation, loneliness, depression, and general adjustment to university). Further concurrent validity evidence for selected subscales was provided through moderate associations with students’ engagement in social activities and their self-reported use of psychological services provided on campus. Findings regarding predictive validity, as assessed through correlations with student attrition and academic results, went in the expected direction but were somewhat less convincing. The latter results are explained in terms of differences between European and North American systems of higher education. With some reservations regarding the Academic Adjustment subscale, then, the SACQ seems...

205 citations


Journal ArticleDOI
TL;DR: Reliability generalization (RG) was used to study five versions of the Working Alliance Inventory (WAI), covering scores from 12 different scales; 67 internal consistency estimates, six interrater reliability estimates, and four study characteristics were analyzed.
Abstract: Reliability generalization (RG) was used to study five versions of the Working Alliance Inventory (WAI), including scores from 12 different scales. Sixty-seven internal consistency estimates, six interrater reliability estimates, and four study characteristics were analyzed. In general, reliability estimates of WAI scale scores appear to be robust. Mean reliability estimates ranged, in this sample of studies, from .79 to .97, with a modal estimate of .92. Variability in reliability estimates was, based on simple bivariate correlations, associated with client and therapist sample size for WAI total scores (observer version). Implications for measuring alliance using the WAI and conducting future RG studies on psychotherapy process measures are discussed.

174 citations


Journal ArticleDOI
TL;DR: The Myers-Briggs Type Indicator (MBTI) was submitted to a descriptive reliability generalization (RG) analysis to characterize the variability of measurement error in MBTI scores across administrations.
Abstract: The Myers-Briggs Type Indicator (MBTI) was submitted to a descriptive reliability generalization (RG) analysis to characterize the variability of measurement error in MBTI scores across administrations. In general, the MBTI and its scales yielded scores with strong internal consistency and test-retest reliability estimates, although variation was observed.

152 citations


Journal ArticleDOI
TL;DR: In this paper, the authors developed an abridged version of the Job Descriptive Index (AJDI) containing a total of 25 items and tested it on a national sample and a sample of university workers.
Abstract: The Job Descriptive Index is a popular measure of job satisfaction with five subscales containing 72 items. A national sample (n = 1,534) and a sample of university workers (n = 636) supported development of an abridged version of the Job Descriptive Index (AJDI) containing a total of 25 items. A systematic scale-reduction technique was employed with the first sample to decide which items to retain in each scale. The abridged subscales were then tested in the second sample. Results indicated that the relationships among the five abridged subscales and between the five abridged subscales and other measures were substantially preserved.

152 citations


Journal ArticleDOI
TL;DR: Reliability generalization (RG) is a measurement meta-analytic method used to explore the variability in score reliability estimates and to characterize the possible sources of this variance.
Abstract: Reliability generalization (RG) is a measurement meta-analytic method used to explore the variability in score reliability estimates and to characterize the possible sources of this variance. This article briefly summarizes some RG considerations. Included is a description of how reliability confidence intervals might be portrayed graphically. The article includes tabulations across various RG studies, including how frequently authors (a) report score reliabilities for their own data, (b) conduct reliability induction, or (c) do not even mention reliability.

140 citations
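The reliability estimates aggregated in RG studies like those above are most often coefficient alpha values. As a minimal sketch, using the standard Cronbach's alpha formula and illustrative data (the function name and data are not from any of the articles), alpha can be computed directly from an item-by-person score matrix:

```python
def cronbach_alpha(items):
    """Coefficient alpha for a list of item-score columns of equal length.

    alpha = k/(k-1) * (1 - sum(item variances) / variance of total scores)
    """
    k = len(items)
    n = len(items[0])

    def var(xs):  # sample variance (ddof = 1)
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

    totals = [sum(col[i] for col in items) for i in range(n)]
    return k / (k - 1) * (1 - sum(var(col) for col in items) / var(totals))

# Three perfectly parallel items: all inter-item covariance, so alpha = 1.0
print(cronbach_alpha([[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]]))
```

An RG study then treats such alphas, collected across published studies, as the outcome variable in a meta-analysis.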


Journal ArticleDOI
TL;DR: The Work Addiction Risk Test (WART) was designed to measure “workaholism.” This study examines the underlying dimensions of the WART and investigates the accuracy of WART scores.
Abstract: The Work Addiction Risk Test (WART) was designed to measure “workaholism.” The present study examines the underlying dimensions of the WART and investigated the accuracy of the WART scores to discr...

138 citations


Journal ArticleDOI
TL;DR: The behavior of item and person statistics obtained from two measurement frameworks, item response theory (IRT) and classical test theory (CTT), was examined using Monte Carlo techniques with simulated test data.
Abstract: Despite the well-known theoretical advantages of item response theory (IRT) over classical test theory (CTT), research examining their empirical properties has failed to reveal consistent, demonstrable differences. Using Monte Carlo techniques with simulated test data, this study examined the behavior of item and person statistics obtained from these two measurement frameworks. The findings suggest IRT- and CTT-based item difficulty and person ability estimates were highly comparable, invariant, and accurate in the test conditions simulated. However, whereas item discrimination estimates based on IRT were accurate across most of the experimental conditions, CTT-based item discrimination estimates proved accurate under some conditions only. Implications of the results of this study for psychometric item analysis and item selection are discussed.
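The CTT-based item statistics compared in this simulation are simple to compute. The following is an illustrative sketch using standard definitions and made-up 0/1 data (not the study's simulation code): difficulty as the proportion correct, and discrimination as the corrected item-total (point-biserial) correlation with the item removed from the total.

```python
def ctt_item_stats(responses):
    """Classical item statistics for a matrix of 0/1 responses.

    responses[p][i] is examinee p's score on item i. Returns, per item,
    (difficulty, corrected discrimination): difficulty is the proportion
    correct; discrimination is the correlation between the item and the
    rest-of-test score (item excluded from the total).
    """
    n = len(responses)
    k = len(responses[0])

    def pearson(xs, ys):
        mx = sum(xs) / n
        my = sum(ys) / n
        cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        sx = sum((x - mx) ** 2 for x in xs) ** 0.5
        sy = sum((y - my) ** 2 for y in ys) ** 0.5
        return cov / (sx * sy)

    stats = []
    for i in range(k):
        item = [row[i] for row in responses]
        rest = [sum(row) - row[i] for row in responses]
        stats.append((sum(item) / n, pearson(item, rest)))
    return stats

data = [[1, 1, 0], [1, 0, 0], [1, 1, 1], [0, 0, 0]]
print(ctt_item_stats(data))  # (difficulty, discrimination) per item
```

The IRT analogues (b and a parameters) require fitting a latent-trait model rather than a closed-form computation, which is part of why the two frameworks are compared empirically.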

Journal ArticleDOI
TL;DR: The authors examined the factorial validity, internal consistency, and predictive validity of scores from one measure of belonging to an after-school youth development program and found that belonging scores were positively related to actual program attendance over a 6-month period, self-reported attendance in the last week, and protective factors found in communities.
Abstract: Many youth development programs, including the Boys & Girls Clubs of America, feature belonging as a central piece in their theories of change. From a psychometric perspective, little is known about measures of belonging. This research examined the factorial validity, internal consistency, and predictive validity of scores from one measure of belonging to an after-school youth development program. Confirmatory factor analysis yielded a five-item measure from a calibration analysis that demonstrated “tight” cross validity in a cross-validation sample as well as factorial invariance between females and males. Internal consistency estimates for this five-item scale exceeded .90 in both samples. Belonging scores were positively related to actual program attendance over a 6-month period, self-reported attendance in the last week, and protective factors found in communities. Belonging scores were moderately and negatively related to community-based risk factors.

Journal ArticleDOI
TL;DR: This article examined the construct of competitiveness, which has been variously defined and operationalized by psychologists for more than 100 years, and its relation to other constructs, and suggested that researchers should carefully define competitiveness and choose measures that reflect their own definition to improve interpretation of the results.
Abstract: This study examined the construct of competitiveness, which has been variously defined and operationalized by psychologists for more than 100 years, and its relation to other constructs. Four hypotheses regarding multidimensionality and related constructs were proposed and tested by administering 10 different paper-and-pencil measures to 140 undergraduate students. Two factor analyses (principal axis with varimax rotation) provided evidence of two factors that were labeled Self-Aggrandizement and Interpersonal Success. The results suggest that researchers should carefully define competitiveness and choose measures that reflect their own definition to improve interpretation of the results.

Journal ArticleDOI
TL;DR: The authors investigated the underlying factor structure of the 30-item Emotional Intensity Scale (EIS) and constructed a 17-item reduced version (EIS-R).
Abstract: The purpose of this study was to investigate the underlying factor structure of the 30-item Emotional Intensity Scale (EIS) and to construct a reduced EIS (EIS-R). The psychometric characteristics of the scales were examined from three different samples: 204 employees of the University of Antwerp, a subset of 106 of the first sample who cooperated in a retest, and 510 men and women representative of the Belgian population. The original EIS does not seem to possess an adequate factor structure. Confirmatory factor analysis on a reduced scale of 17 items indicates that two factors underlie the EIS-R: a positive and a negative emotions factor. Furthermore, scores on the EIS-R were shown to have adequate reliability and validity.

Journal ArticleDOI
TL;DR: The fifth edition of the Publication Manual of the American Psychological Association (APA) draws on recommendations for improving statistical practices made by the APA Task Force on Statistical Inference (TFSI).
Abstract: The fifth edition of the Publication Manual of the American Psychological Association (APA) draws on recommendations for improving statistical practices made by the APA Task Force on Statistical Inference (TFSI). The manual now acknowledges the controversy over null hypothesis significance testing (NHST) and includes both a stronger recommendation to report effect sizes and a new recommendation to report confidence intervals. Drawing on interviews with some critics and other interested parties, the present review identifies a number of deficiencies in the new manual. These include lack of follow-through with appropriate explanations and examples of how to report statistics that are now recommended. At this stage, the discipline would be well served by a response to these criticisms and a debate over needed modifications.

Journal ArticleDOI
TL;DR: In this article, a reliability generalization study of the Geriatric Depression Scale (GDS) was conducted to further distill psychometric properties of the scores generated by this measure.
Abstract: Depression has proven to be a serious illness in older adults that often goes untreated because it is frequently misdiagnosed or is confused with other symptom patterns. One instrument that has been consistently cited in the literature as an effective indicator of depression in older adults is the Geriatric Depression Scale (GDS). The present study provided a reliability generalization (RG) study of the GDS in an effort to further distill psychometric properties of the scores generated by this measure. RG, a relatively new meta-analytic reliability procedure, was used to (a) identify the typical reliability of GDS scores across studies and (b) examine sources of measurement error across studies. Results from this investigation of 338 previously published research studies indicated that the average score reliability across studies was .8482 (SD = .0870) and that the number of items on the scale, scale SD, sample size, and participant population were the most important predictors of score reliability on thi...

Journal ArticleDOI
TL;DR: In this article, the authors compare and contrast the psychometric properties of four scales developed to measure hope and optimism, namely, the Revised Generalized Expectancy for Success Scale, the Life Orientation Test (LOT), the Hope Scale (HS), and the Hunter Opinions and Personal Expectations Scale.
Abstract: This study was designed to compare and contrast the psychometric properties of four scales developed to measure hope and optimism, namely, the Revised Generalized Expectancy for Success Scale, the Life Orientation Test (LOT), the Hope Scale (HS), and the Hunter Opinions and Personal Expectations Scale. The definitions on which the measures are based are compared along with their reported reliability and construct validity. Three hundred and forty-seven undergraduate students completed the scales along with measures of trait negative affect (TNA) and trait positive affect (TPA), task- and emotion-oriented coping, and perceived stress. All scales had adequate internal consistency, and there was strong evidence of convergent validity. Regarding factor structure replicability, the LOT was marginally superior. The possibility of the scales’ redundancy because of contamination by TNA or TPA depended on the criterion construct. It is argued that the LOT and HS are the scales of choice when assessing hope and/or optimism.

Journal ArticleDOI
TL;DR: This article extends prior critiques of the Learning Style Inventory (LSI) by conducting a reliability generalization study across studies and versions of the test, finding that internal consistency and test-retest reliabilities for LSI scores fluctuate considerably and contribute to deleterious cumulative measurement error.
Abstract: The Learning Style Inventory (LSI) is a commonly employed measure of learning styles based on Kolb’s Experiential Learning Model. Nevertheless, the psychometric soundness of LSI scores has historically been critiqued. The present article extends this critique by conducting a reliability generalization study across studies and versions of the test. Results indicated that internal consistency and test-retest reliabilities for LSI scores fluctuate considerably and contribute to deleterious cumulative measurement error. Reliability variation was predictable by test version and several study features.

Journal ArticleDOI
TL;DR: The authors discuss procedures for constructing individual and simultaneous confidence intervals on contrasts on parameters of a number of fixed-effects ANOVA models, including multivariate analysis of variance (MANOVA) models for the analysis of repeated measures data.
Abstract: Although confidence interval procedures for analysis of variance (ANOVA) have been available for some time, they are not well known and are often difficult to implement with statistical packages. This article discusses procedures for constructing individual and simultaneous confidence intervals on contrasts on parameters of a number of fixed-effects ANOVA models, including multivariate analysis of variance (MANOVA) models for the analysis of repeated measures data. Examples show how these procedures can be implemented with accessible software. Confidence interval inference on parameters of random-effects models is also discussed.
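The individual contrast intervals discussed above follow a standard form: estimate the contrast from group means, then add and subtract a critical value times its standard error. The following is an illustrative sketch for a one-way fixed-effects design with made-up data (function name and data are assumptions, not the article's procedures); the critical value is supplied by the caller, e.g. from a t table or `scipy.stats.t.ppf`:

```python
import math

def contrast_ci(groups, coefs, t_crit):
    """Confidence interval for the contrast sum(c_i * mu_i) in a one-way
    fixed-effects ANOVA. t_crit is t_{1 - alpha/2} with N - k degrees of
    freedom (from a t table, or scipy.stats.t.ppf if available)."""
    means = [sum(g) / len(g) for g in groups]
    # Pooled within-group mean square (MSE) and its degrees of freedom
    ss_within = sum(sum((x - m) ** 2 for x in g) for g, m in zip(groups, means))
    df_error = sum(len(g) for g in groups) - len(groups)
    mse = ss_within / df_error
    est = sum(c * m for c, m in zip(coefs, means))
    se = math.sqrt(mse * sum(c * c / len(g) for c, g in zip(coefs, groups)))
    return est - t_crit * se, est + t_crit * se

# Does group 1 differ from the average of groups 2 and 3?
groups = [[8, 9, 10], [5, 6, 7], [4, 5, 6]]
lo, hi = contrast_ci(groups, [1, -0.5, -0.5], t_crit=2.447)  # t_{.975}, df = 6
print(round(lo, 3), round(hi, 3))
```

Simultaneous intervals replace the per-contrast t critical value with a larger one (e.g. Scheffé or Bonferroni), which is where the procedures the article surveys diverge.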

Journal ArticleDOI
TL;DR: The authors examined score reliability for the Life Satisfaction Index (LSI) and found no relationships between score reliability and sample characteristics (sample size, number of items, mean age, standard deviation of age, proportion female, and mean LSI score), and no significant differences in score reliability by language of administration or sample type.
Abstract: The purpose of the present study was to examine score reliability for a measure of life satisfaction (Life Satisfaction Index [LSI]). This reliability generalization comprised a search of 157 journal articles, which resulted in the inclusion of a total of 34 samples. Results revealed an average reliability of .79 (SD = .10, median = .79). Bivariate correlational analyses revealed no relationships between score reliability and various sample characteristics, including sample size, number of items, mean age, standard deviation of age, proportion female, mean LSI score, and standard deviation of LSI scores. No significant differences in score reliability were found by language of administration or sample type. These analyses provide evidence for adequate reliability of LSI scores across a variety of sample characteristics; however, they must be interpreted with caution, given the small sample size. In addition, this study documents the poor reporting of psychometric properties in the LSI literature.

Journal ArticleDOI
TL;DR: For every 4-year college in the United States listed in the 1998 College Handbook of the College Board, the percentages of students graduating within 6 years of entering and of students having high school grade point averages (GPAs) of at least 3.00 were recorded.
Abstract: For every 4-year college in the United States listed in the 1998 College Handbook of the College Board, the percentages of students graduating within 6 years of entering and of students having high school grade point averages (GPAs) of at least 3.00 were recorded. The authors also obtained the College Board Scholastic Assessment Test I (SAT I) Verbal and Math and the American College Test (ACT) scores at the 25th and 75th percentiles of the distributions of scores of the enrolled freshmen. The SAT I Verbal and Math and the ACT scores at the 25th and 75th percentiles proved to be good predictors of the percentage of students graduating from the same institution that admitted them as freshmen (rs ranging from .62 to .73), as did the percentage of freshmen having high school GPAs of 3.00 or higher (r = .49). The correlations of the group percentages and means with the criterion were considerably higher than the predictive-validity coefficients of the SAT I and ACT scores for individual graduation as reported...

Journal ArticleDOI
Frank Baugh
TL;DR: In this article, the authors emphasize that measurement issues must be explicitly considered even in studies that focus on substantive questions, and they discuss the dynamics associated with insufficient attention being paid to score reliabilities in substantive studies.
Abstract: The present article emphasizes that measurement issues must be explicitly considered even in studies that focus on substantive questions. First, dynamics associated with insufficient attention being paid to score reliabilities in substantive studies are discussed. Next, reasons to adjust effect size indices for score unreliability are presented. Finally, some procedures for adjusting effect sizes for score reliability are briefly reviewed.
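The best-known procedure in this family is the classical correction for attenuation: divide an observed correlation by the square root of the product of the two score reliabilities. The sketch below uses that standard formula with illustrative numbers; it is one example of the kind of adjustment the article reviews, not necessarily the specific procedures it presents.

```python
import math

def disattenuate_r(r_xy, rel_x, rel_y=1.0):
    """Classical correction for attenuation: estimate the correlation that
    would be observed with perfectly reliable scores. Leave rel_y at 1.0
    to correct for unreliability in X only."""
    return r_xy / math.sqrt(rel_x * rel_y)

def disattenuate_d(d, rel_y):
    """Adjust a standardized mean difference for unreliability in the
    outcome scores."""
    return d / math.sqrt(rel_y)

# Observed r = .30 with score reliabilities of .75 on each measure
print(disattenuate_r(0.30, 0.75, 0.75))  # about .40
print(disattenuate_d(0.50, 0.64))        # about .625
```

The example makes the article's point concrete: with modest reliabilities, the observed effect can understate the underlying effect by a substantial margin.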

Journal ArticleDOI
TL;DR: In this article, the authors explore the variability in reliability scores on a commonly used career scale, the Career Decision-Making Self-Efficacy Scale (CDMSE), and employ reliability generalization to identify typical score reliability, variability of score reliability and variables explaining this variability.
Abstract: The purpose of the present study was to explore the variability in reliability scores on a commonly used career scale, the Career Decision-Making Self-Efficacy Scale (CDMSE). Reliability generalization was employed to identify typical score reliability, variability of score reliability, and variables explaining this variability. Forty-nine pieces of work were examined, and the results revealed that 41% of them reported score reliability of their own data. Of the five subscales, Problem Solving showed the lowest score reliability. In addition, higher score reliability was associated with age, sample racial/ethnic demographics, and standard deviation of total mean score.

Journal ArticleDOI
TL;DR: The authors offer 15- and 8-item shortened versions of the Automatic Thoughts Questionnaire (ATQ), a measure of cognitions associated with depression; a single factor was found to underlie both reduced versions, with scores on this factor yielding strong estimates of internal consistency and nomological validity.
Abstract: Measures of depression are increasingly being used as outcomes or predictors by organizational and consumer psychologists. One such measure is the Automatic Thoughts Questionnaire (ATQ). However, questions about the 30-item ATQ’s factor structure and its length for use in survey research remain. The authors offer 15- and 8-item shortened versions of the ATQ. Two samples (n = 434 and n = 419) were used to derive the reduced versions. A single factor was found to underlie both reduced versions, with scores on this factor yielding strong estimates of internal consistency and nomological validity. Two more cross-validation samples (n = 163 and n = 91) also showed support for the 15- and 8-item versions. Overall, results suggest that these reduced-item versions of the ATQ are useful alternatives to measuring cognitions associated with depression.

Journal ArticleDOI
TL;DR: This study investigated the equivalence of scores from computerized and paper-and-pencil versions of a reading placement test and found that both forms of the computerized versions produced higher vocabulary scores than the paper-and-pencil format; one form also had higher comprehension and total scores on the computerized version.
Abstract: This study investigated the equivalence of scores from computerized and paper-and-pencil versions of a reading placement test. Concerns about score equivalence on the computerized versions were warranted because of the speeded nature of the paper-and-pencil version and differences in text delivery and response modes. The results indicated that both forms of the computerized versions produced higher vocabulary scores than the paper-and-pencil format and one form also had higher comprehension and total scores on the computerized version. These difficulty differences, especially for the vocabulary scores, appeared related to the differences in response speed associated with use of a mouse to record responses in contrast to a pencil and answer sheet. Scale scores for the computerized versions had similar predictive power for course placement as paper-and-pencil scores. However, because these results were based on students from only seven institutions, additional studies are needed to investigate the comparabi...

Journal ArticleDOI
TL;DR: This paper employed an automated grader to evaluate essays, both holistically and with the rating of traits (content, organization, style, mechanics, and creativity) for Web-based student essays.
Abstract: This study employed an automated grader to evaluate essays, both holistically and with the rating of traits (content, organization, style, mechanics, and creativity) for Web-based student essays ser...

Journal ArticleDOI
TL;DR: A reliability generalization was conducted on Zuckerman’s Sensation Seeking Scale, Form V (SSS-V); 244 empirical articles on the SSS-V, spanning a 20-year period, were reviewed.
Abstract: A reliability generalization (RG) was conducted on Zuckerman’s Sensation Seeking Scale, Form V (SSS-V). Two hundred and forty-four empirical articles on the SSS-V were reviewed spanning a 20-year p...

Journal ArticleDOI
TL;DR: The authors investigated the relationships of test mode (paper and pencil vs. computerized with editorial control vs. computerized without editorial control) and computer familiarity with test performance on the Graduate Record Exam (GRE).
Abstract: Ideally, test performance is unrelated to the mode in which the test is administered. This study investigated the relationships of test mode (paper and pencil vs. computerized with editorial control and computerized without editorial control) and computer familiarity (lower, moderate, and higher) with test performance on the Graduate Record Exam (GRE). The GRE was administered to 222 undergraduates stratified by gender and randomly assigned to the three test mode groups. With self-reported grade point average as a covariate in a MANCOVA, the authors found that examinees in the paper-and-pencil group outperformed the computerized-without-editorial-control group on all subtests. The computerized-with-editorial-control group outperformed the computerized-without-editorial-control group on the Analytical subtest only. The authors also found a significant main effect for computer familiarity on the Analytical and Quantitative subtests. A significant interaction between computer familiarity and test mode o...

Journal ArticleDOI
TL;DR: The Learning and Study Strategies Inventory (LASSI) is used in hundreds of universities and high schools each year; this study investigated the reliability, structure, and criterion-related validity of LASSI scores.
Abstract: The Learning and Study Strategies Inventory (LASSI) is used in hundreds of universities and high schools each year. This study investigated the reliability, structure, and criterion-related validity of LASSI scores. Data were provided by 502 university students. Results suggest that the LASSI may not measure the postulated 10 scales typically used to report results.

Journal ArticleDOI
TL;DR: The Patterns of Adaptive Learning Survey (PALS) was developed to assess a trichotomous achievement goal structure with the following subscales: Task Goal Orientation, Performance-Approach Goal Orientation, and Performance-Avoid Goal Orientation.
Abstract: The Patterns of Adaptive Learning Survey (PALS) was developed to assess a trichotomous achievement goal structure, which included the following subscales: Task Goal Orientation, Performance-Approach Goal Orientation, and Performance-Avoid Goal Orientation. The use of the PALS in making inferences about these goal orientations was originally validated with a middle school sample of students. In this study, the authors computed Cronbach’s alphas and employed confirmatory factor analytic procedures to provide statistical evidence of the reliability and validity of inferences based on scores from the PALS at the fourth-grade and college levels.