scispace - formally typeset
Journal ArticleDOI

Validity of Observer-Based Aggregate Scoring Systems as Descriptors of Elbow Pain, Function, and Disability*

Reads0
Chats0
TLDR
It is believed that outcomes should be expressed as raw scores rather than as categorical rankings, and scores derived from patient-completed functional questionnaires correlated more closely with perceived functional loss than did those determined with aggregate elbow-scoring systems.
Abstract
Current elbow-scoring systems are based on the observer-derived assessment of a variety of clinical and functional criteria, which are scored separately and then aggregated. The aggregate score then is assigned a categorical ranking that ranges from excellent to poor. The developers of different elbow-scoring systems have chosen different outcome criteria, assigned different weights to each criterion, and accorded different ranges of values to each categorical ranking. Five different elbow-scoring systems (the Mayo elbow-performance index and the systems of Broberg and Morrey, Ewald et al., The Hospital for Special Surgery, and Pritchard) were used to evaluate the same group of patients. The validity of the scoring systems was determined with use of visual-analog scales for the assessment of pain and function, patient and physician-derived ratings of the severity of impairment of the elbow, and two functional questionnaires completed by the patient (the Disabilities of the Arm, Shoulder and Hand questionnaire and the Modified American Shoulder and Elbow Surgeons patient self-evaluation form). The study sample consisted of sixty-nine patients who had sought treatment at one of two tertiary referral clinics because of problems related to the elbow. Pearson product-moment correlation coefficients were used to compare the raw aggregate scores, and kappa statistics were used to determine the level of agreement among the categorical rankings (excellent, good, fair, and poor). Examination of the five scoring systems revealed a remarkable lack of concordance with regard to the aspects of elbow function that were assessed. Good correlation was observed when the systems were compared on the basis of raw scores (Pearson product-moment correlation coefficients, 0.79 to 0.90), but only slight-to-moderate correlation was noted when the systems were compared on the basis of categorical rankings (quadratic weighted kappa coefficients, 0.18 to 0.49). Validity testing showed the system of Ewald et al. and the Mayo elbow-performance index to be the most discriminating, the system of Pritchard to be the least discriminating, and the system of The Hospital for Special Surgery and the system of Broberg and Morrey to be intermediate. The scores determined with the elbow-scoring systems demonstrated only moderate correlation with the score for function on the visual analog scale (Pearson product-moment correlation coefficients, 0.44 to 0.66), whereas those derived from the functional questionnaires completed by the patient demonstrated moderate-to-good correlation with the score for function (Pearson product-moment correlation coefficients, 0.72 and 0.80).

read more

Citations
More filters
Journal ArticleDOI

Reliable change and minimum important difference (MID) proportions facilitated group responsiveness comparisons using individual threshold criteria.

TL;DR: The reliable change and MID proportions have an intuitive interpretation and facilitate quantitative responsiveness comparisons among outcome measures based on individual patient criteria and indicate differences among the outcome measures used in this study.
Journal ArticleDOI

The disabilities of the arm, shoulder and hand (DASH) outcome questionnaire: reliability and validity of the Swedish version evaluated in 176 patients.

TL;DR: The Swedish version of the DASH is a reliable and valid instrument that can provide a standardized measure of patient-centered outcomes in upper-extremity musculoskeletal conditions.
References
More filters
Journal ArticleDOI

The measurement of observer agreement for categorical data

TL;DR: A general statistical methodology for the analysis of multivariate categorical data arising from observer reliability studies is presented and tests for interobserver bias are presented in terms of first-order marginal homogeneity and measures of interob server agreement are developed as generalized kappa-type statistics.
Journal ArticleDOI

Intraclass correlations: uses in assessing rater reliability.

TL;DR: In this article, the authors present guidelines for choosing among six different forms of the intraclass correlation for reliability studies in which n target are rated by k judges, and the confidence intervals for each of the forms are reviewed.
Journal ArticleDOI

The Design and Analysis of Clinical Experiments.

P. Armitage, +1 more
- 01 Dec 1987 - 
Journal ArticleDOI

Development of an upper extremity outcome measure: The DASH (disabilities of the arm, shoulder, and head)

TL;DR: The goal is to produce a brief, self-administered measure of symptoms and functional status, with a focus on physical function, to be used by clinicians in daily practice and as a research tool.
Journal ArticleDOI

Graphic representation of pain.

TL;DR: Most patients could readily use visual analogue and graphic rating scales despite having no previous experience and use of these scales is the best available method for measuring pain or pain relief.
Related Papers (5)