scispace - formally typeset
Search or ask a question
Author

Colleen A. McHorney

Bio: Colleen A. McHorney is an academic researcher from Merck & Co.. The author has contributed to research in topics: Item response theory & Population. The author has an hindex of 46, co-authored 94 publications receiving 25191 citations. Previous affiliations of Colleen A. McHorney include Medical College of Wisconsin & University of Wisconsin-Madison.


Papers
More filters
Journal ArticleDOI
TL;DR: In this article, cross-sectional data from the Medical Outcomes Study (MOS) were analyzed to test the validity of the MOS 36-Item Short-Form Health Survey (SF-36) scales as measures of physical and mental health constructs.
Abstract: Cross-sectional data from the Medical Outcomes Study (MOS) were analyzed to test the validity of the MOS 36-Item Short-Form Health Survey (SF-36) scales as measures of physical and mental health constructs. Results from traditional psychometric and clinical tests of validity were compared. Principal

8,553 citations

Journal ArticleDOI
TL;DR: Findings support the use of the SF-36 survey across the diverse populations studied and identify population groups in which use of standardized health status measures may or may not be problematic.
Abstract: The widespread use of standardized health surveys is predicated on the largely untested assumption that scales constructed from those surveys will satisfy minimum psychometric requirements across diverse population groups. Data from the Medical Outcomes Study (MOS) were used to evaluate data completeness and quality, test scaling assumptions, and estimate internal-consistency reliability for the eight scales constructed from the MOS SF-36 Health Survey. Analyses were conducted among 3,445 patients and were replicated across 24 subgroups differing in sociodemographic characteristics, diagnosis, and disease severity. For each scale, item-completion rates were high across all groups (88% to 95%), but tended to be somewhat lower among the elderly, those with less than a high school education, and those in poverty. On average, surveys were complete enough to compute scales scores for more than 96% of the sample. Across patient groups, all scales passed tests for item-internal consistency (97% passed) and item-discriminant validity (92% passed). Reliability coefficients ranged from a low of 0.65 to a high of 0.94 across scales (median = 0.85) and varied somewhat across patient subgroups. Floor effects were negligible except for the two role disability scales. Noteworthy ceiling effects were observed for both role disability scales and the social functioning scale. These findings support the use of the SF-36 survey across the diverse populations studied and identify population groups in which use of standardized health status measures may or may not be problematic.

4,490 citations

Journal Article
TL;DR: Results suggest that the two summary measures may be useful in most studies and that their empiric validity, relative to the best SF-36 scale, will depend on the application.
Abstract: Physical component summary (PCS) and mental component summary (MCS) measures make it possible to reduce the number of statistical comparisons and thereby the role of chance in testing hypotheses about health outcomes. To test their usefulness relative to a profile of eight scores, results were compared across 16 tests involving patients (N = 1,440) participating in the Medical Outcomes Study. Comparisons were made between groups known to differ at a point in time or to change over time in terms of age, diagnosis, severity of disease, comorbid conditions, acute symptoms, self-reported changes in health, and recovery from clinical depression. The relative validity (RV) of each measure was estimated by a comparison of statistical results with those for the best scales in the same tests. Differences in RV among scales from the Medical Outcomes Study 36-Item Short-Form Health Survey (SF-36) were consistent with those in previous studies. One or both of the summary measures were significant for 14 of 15 differences detected in multivariate analyses of profiles and detected differences missed by the profile in one test. Relative validity coefficients ranged from .20 to .94 (median, .79) for PCS in tests involving physical criteria and from .93 to 1.45 (median, 1.02) for MCS in tests involving mental criteria. The MCS was superior to the best SF-36 scale in three of four tests involving mental health. Results suggest that the two summary measures may be useful in most studies and that their empiric validity, relative to the best SF-36 scale, will depend on the application. Surveys offering the option of analyzing both a profile and psychometrically based summary measures have an advantage over those that do not.

1,818 citations

Journal ArticleDOI
TL;DR: The most problematic feature of the five surveys was their lack of precision for individual-patient applications, and across all scales, reliability standards for individual assessment and monitoring were not satisfied, and the 95% Cls were very wide.
Abstract: Interest has increased in recent years in incorporating health status measures into clinical practice for use at the individual-patient level. We propose six measurement standards for individual-patient applications: (1) practical features, (2) breadth of health measured, (3) depth of health measured, (4) precision for cross-sectional assessment, (5) precision for longitudinal monitoring and (6) validity. We evaluate five health status surveys (Functional Status Questionnaire, Dartmouth COOP Poster Charts, Nottingham Health Profile, Duke Health Profile, and SF-36 Health Survey) that have been proposed for use in clinical practice. We conducted an analytical literature review to evaluate the six measurement standards for individual-patient applications across the five surveys. The most problematic feature of the five surveys was their lack of precision for individual-patient applications. Across all scales, reliability standards for individual assessment and monitoring were not satisfied, and the 95% Cls were very wide. There was little evidence of the validity of the five surveys for screening, diagnosing, or monitoring individual patients. The health status surveys examined in this paper may not be suitable for monitoring the health and treatment status of individual patients. Clinical usefulness of existing measures might be demonstrated as clinical experience is broadened. At this time, however, it seems that new instruments, or adaptation of existing measures and scaling methods, are needed for individual-patient assessment and monitoring.

1,792 citations

Journal ArticleDOI
TL;DR: Variations in RP across methods and concepts were linked to differences in the coarseness of measurement scales, reliability, and content (including the effects of chart illustrations).
Abstract: This study estimated the validity and relative precision (RP) of four methods (MOS long- and short-form scales, global items, and COOP Poster Charts) in measuring six general health concepts. The authors also tested whether and how precisely each method discriminated relatively well adult patients (N = 638) from those with only severe chronic medical (N = 168) and only psychiatric conditions (N = 163), as clinically defined. For comparisons between the well group and both medical and psychiatric groups, RP estimates favored long-form over short-form, multi-item scales, and favored multi-item scales over single-item global measures and poster charts. In relation to long forms, short-form multi-item scales achieved a median RP of .93; RP estimates for global items and poster charts were .81 and .67, respectively. Variations in RP across methods and concepts were linked to differences in the coarseness of measurement scales, reliability, and content (including the effects of chart illustrations). These variations in RP have implications for the interpretation of scores, the statistical power of comparisons between clinical groups, and the size of confidence intervals around individual patient scores.

764 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: A 36-item short-form survey designed for use in clinical practice and research, health policy evaluations, and general population surveys to survey health status in the Medical Outcomes Study is constructed.
Abstract: A 36-item short-form (SF-36) was constructed to survey health status in the Medical Outcomes Study. The SF-36 was designed for use in clinical practice and research, health policy evaluations, and general population surveys. The SF-36 includes one multi-item scale that assesses eight health concepts: 1) limitations in physical activities because of health problems; 2) limitations in social activities because of physical or emotional problems; 3) limitations in usual role activities because of physical health problems; 4) bodily pain; 5) general mental health (psychological distress and well-being); 6) limitations in usual role activities because of emotional problems; 7) vitality (energy and fatigue); and 8) general health perceptions. The survey was constructed for self-administration by persons 14 years of age and older, and for administration by a trained interviewer in person or by telephone. The history of the development of the SF-36, the origin of specific items, and the logic underlying their selection are summarized. The content and features of the SF-36 are compared with the 20-item Medical Outcomes Study short-form.

33,857 citations

Book
23 Sep 2019
TL;DR: The Cochrane Handbook for Systematic Reviews of Interventions is the official document that describes in detail the process of preparing and maintaining Cochrane systematic reviews on the effects of healthcare interventions.
Abstract: The Cochrane Handbook for Systematic Reviews of Interventions is the official document that describes in detail the process of preparing and maintaining Cochrane systematic reviews on the effects of healthcare interventions.

21,235 citations

Journal ArticleDOI
TL;DR: Twenty cross-sectional and longitudinal tests of empirical validity previously published for the 36-item short-form scales and summary measures were replicated for the 12-item Physical Component Summary and the12-item Mental Component Summary, including comparisons between patient groups known to differ or to change in terms of the presence and seriousness of physical and mental conditions.
Abstract: Regression methods were used to select and score 12 items from the Medical Outcomes Study 36-Item Short-Form Health Survey (SF-36) to reproduce the Physical Component Summary and Mental Component Summary scales in the general US population (n=2,333). The resulting 12-item short-form (SF-12) achieved multiple R squares of 0.911 and 0.918 in predictions of the SF-36 Physical Component Summary and SF-36 Mental Component Summary scores, respectively. Scoring algorithms from the general population used to score 12-item versions of the two components (Physical Components Summary and Mental Component Summary) achieved R squares of 0.905 with the SF-36 Physical Component Summary and 0.938 with SF-36 Mental Component Summary when cross-validated in the Medical Outcomes Study. Test-retest (2-week)correlations of 0.89 and 0.76 were observed for the 12-item Physical Component Summary and the 12-item Mental Component Summary, respectively, in the general US population (n=232). Twenty cross-sectional and longitudinal tests of empirical validity previously published for the 36-item short-form scales and summary measures were replicated for the 12-item Physical Component Summary and the 12-item Mental Component Summary, including comparisons between patient groups known to differ or to change in terms of the presence and seriousness of physical and mental conditions, acute symptoms, age and aging, self-reported 1-year changes in health, and recovery for depression. In 14 validity tests involving physical criteria, relative validity estimates for the 12-item Physical Component Summary ranged from 0.43 to 0.93 (median=0.67) in comparison with the best 36-item short-form scale. Relative validity estimates for the 12-item Mental Component Summary in 6 tests involving mental criteria ranged from 0.60 to 107 (median=0.97) in relation to the best 36-item short-form scale. Average scores for the 2 summary measures, and those for most scales in the 8-scale profile based on the 12-item short-form, closely mirrored those for the 36-item short-form, although standard errors were nearly always larger for the 12-item short-form.

14,793 citations

Journal ArticleDOI
TL;DR: In this article, cross-sectional data from the Medical Outcomes Study (MOS) were analyzed to test the validity of the MOS 36-Item Short-Form Health Survey (SF-36) scales as measures of physical and mental health constructs.
Abstract: Cross-sectional data from the Medical Outcomes Study (MOS) were analyzed to test the validity of the MOS 36-Item Short-Form Health Survey (SF-36) scales as measures of physical and mental health constructs. Results from traditional psychometric and clinical tests of validity were compared. Principal

8,553 citations

Journal ArticleDOI
TL;DR: The criteria can be used in systematic reviews of health status questionnaires, to detect shortcomings and gaps in knowledge of measurement properties, and to design validation studies.

7,439 citations