Interpreting regression models in clinical outcome
studies

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Measuring the outcome of an intervention is central to the practice of evidence based medicine, and most research papers evaluating patient outcomes now incorporate some form of patient-based metric, such as questionnaires or performance tests. Once an outcome has been defined, researchers typically want to know if any other factors can influence the result. This is typically assessed with regression analysis. Regression analysis 1 determines the relationship of an independent variable (such as bone mineral density) on a dependent variable (such as ageing) with the statistical assumption that all other variables remain fixed. The calculation of the relationship results in a theoretical straight line, and the correlation co-efficient (r) measures how closely the observed data are to the theoretical straight line that we have calculated. In such a linear model, we can judge how well the line fits the data (‘goodness of fit’) by calculating the coefficient of determination (or square of the regression line, R2). R2 is a measure of the percentage of total variation in the dependant variable that is accounted for by the independent variable. An R2 of 1.0 indicates that the data perfectly fit the linear model. Any R2 value less than 1.0 indicates that at least some variability in the data cannot be accounted for by the model (e.g., an R2 of 0.5 indicates that 50% of the variability in the outcome data cannot be explained by the model). Given these statistical tools, we can use the regression equation to predict the value of the dependent variable based on the known value of independent variable. Since many variables may contribute to the outcome (dependent variable), further statistical analysis can be achieved with multiple regression analysis. These models are essentially the same as simple regression analysis, except that the multiple regression analysis equation describes the interrelationship of many variables and allows us to evaluate the joint effect of these variables on the outcome variable in question. Poitras et al 2 report an interesting study this month that aims to predict length of stay and early clinical function following joint arthroplasty. Multiple linear regression analyses produced an equation based on the timed-up-and-go test, which was associated with length of stay. In addition, models based on the pre-operative WOMAC function sub-score produced the best model for describing early post-operative function (as calculated by the Older American Resources and Services ALD score). As such the authors were able to conclude that the outcomes assessments (timed-up-and-go and WOMAC) were predictive of outcome, and further modelling identified thresholds of the outcome assessment scores that related to better and worse outcomes. How should we interpret these findings? The authors quite correctly suggest that models such as these could be of value in discharge planning and resource utilisation by targeting the patients that most need intervention and rehabilitation. The reported R2 for the models, however, was 0.18. Bearing in mind that R2, the coefficient of determination, measures the percentage of the variation in the dependent variable that is explained by variation in the independent variable, 3 taking the compliment (100 – R2) we see that 82% of the variation in the outcome parameter assessed is unexplained by the model. The principal problem is that the variance in the population studied can strongly influence R2 magnitude. Therefore, there is no guarantee that a high coefficient of determination is indicative of ‘goodness of fit’. Similarly there is no guarantee that a small R2 indicates a weak relationship, given that the statistic is largely influenced by variation in the independent variable. 4 Therefore, there is no rule for interpreting the strength of R2 in its application to clinical relevance. Useful high values of R2 can be obtained with clinical data sets, 5 however, a low R2 can still provide a useful clinical model with respect to data trends, but may be low in precision. In this study there is an association between the performance tests and length of stay; and, using the equations, we can indeed predict one from the other. The accuracy of this prediction though, needs to be borne in mind when using it as a clinical tool. Furthermore, it is not rational to compare R2 across different samples, which given clinical populations, are likely to differ significantly in the variance of the independent and dependent variables. 6 In controlled environments, such as biomechanical tests on cadaveric bones, the variance across predictive measurements is likely to be low, and therefore R2 values can be expected to lie in the 0.8 range. 7 In clinical studies, however, R2 values vary widely depending on the nature of the analysis. For example, when comparing radiographic parameters or associating surgical technical factors, values of R2 are reported in the 0.2 to 0.4 range. 8,9 Whereas, comparing data between separate (but intrinsically similar) outcome assessment questionnaires can yield higher values in excess of 0.7. 10 As such, further validation of the Poitras study 2 using new datasets and, ideally, confirmatory analysis of the findings using a much larger sample size, would be required before their regression model could be recommended for use clinically. This does not devalue the appropriateness – or indeed ‘worthiness’ – of reporting these findings in the literature, as the important clinical tools typically start as ideas in small datasets. As with all research papers, the reader requires a basic understanding of methodology to evaluate how relevant the results are to wider practice.

Related collections

Most cited references 5

Record: found
Abstract: found
Article: not found

Predicting early clinical function after hip or knee arthroplasty

S. Poitras, K. S. Wood, J. Savard … (2015)

Objectives Patient function after arthroplasty should ideally quickly improve. It is not known which peri-operative function assessments predict length of stay (LOS) and short-term functional recovery. The objective of this study was to identify peri-operative functions assessments predictive of hospital LOS and short-term function after hospital discharge in hip or knee arthroplasty patients. Methods In total, 108 patients were assessed peri-operatively with the timed-up-and-go (TUG), Iowa level of assistance scale, post-operative quality of recovery scale, readiness for hospital discharge scale, and the Western Ontario and McMaster Osteoarthritis Index (WOMAC). The older Americans resources and services activities of daily living (ADL) questionnaire (OARS) was used to assess function two weeks after discharge. Results Following multiple regressions, the pre- and post-operative day two TUG was significantly associated with LOS and OARS score, while the pre-operative WOMAC function subscale was associated with the OARS score. Pre-operatively, a cut-off TUG time of 11.7 seconds for LOS and 10.3 seconds for short-term recovery yielded the highest sensitivity and specificity, while a cut-off WOMAC function score of 48.5/100 yielded the highest sensitivity and specificity. Post-operatively, a cut-off day two TUG time of 31.5 seconds for LOS and 30.9 seconds for short-term function yielded the highest sensitivity and specificity. Conclusions The pre- and post-operative day two TUG can indicate hospital LOS and short-term functional capacities, while the pre-operative WOMAC function subscale can indicate short-term functional capacities. Cite this article: Bone Joint Res 2015;4:145–151.

0 comments Cited 17 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Reproducibility and side differences of mechanical tests for determining the structural strength of the proximal femur.

V. Kuhn, H Boehm, Felix Eckstein … (2004)

In this experimental study, we evaluated the reproducibility error of mechanical strength tests of the proximal femur when simulating a fall on the trochanter. Based on side differences in femoral failure loads in 55 pairs of femora, we estimated the upper limit of the precision error to be 15% for the side impact test, whereas the intersubject variability was >40%. Mechanical tests are commonly used as the gold standard for determining one of the main functions of bones, that is, to provide mechanical strength. However, it is unknown what magnitude of error is associated with these tests. Here we investigate the precision error and side difference of a side impact test of the proximal femur. BMC was measured using DXA in 54 pairs of femora from donors 79.0 +/- 10.6 years of age. Bones were tested to failure, simulating a fall on the greater trochanter. Failure loads were 3951 +/- 1659N (CV% = 42%) on the right and 3900 +/- 1652N (CV% = 42%) on the left (no significant side difference). The average random difference of femoral BMC was 7 +/- 7% and that of femoral failure loads was 17 +/- 12%. The correlation between BMC and failure load was 79% (r2), but the association between side differences in failure load with those in BMC was only 4%. When confining the analysis to pairs with less than 5% differences in BMC (n = 31), side differences in failure loads were 15 +/- 13%. When correcting failure loads for side differences of BMC, the difference was 16 +/- 15% These results suggest that the upper limit of the precision error for femoral strength tests is approximately 15% in a side impact configuration. Given the large intersubject variability of failure loads, this test provides an efficient tool for determining the structural strength of the proximal femur in a fall.

0 comments Cited 10 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

The validity of a novel radiological method for measuring femoral stem version on anteroposterior radiographs of the hip after total hip arthroplasty.

M. Weber, P Lechler, F von Kunow … (2015)

Femoral stem version has a major influence on impingement and early post-operative stability after total hip arthroplasty (THA). The main objective of this study was to evaluate the validity of a novel radiological method for measuring stem version. Anteroposterior (AP) radiographs and three-dimensional CT scans were obtained for 115 patients (female/male 63/72, mean age 62.5 years (50 to 75)) who had undergone minimally invasive, cementless THA. Stem version was calculated from the AP hip radiograph by rotation-based change in the projected prosthetic neck-shaft (NSA*) angle using the mathematical formula ST = arcos [tan (NSA*) / tan (135)]. We used two independent observers who repeated the analysis after a six-week interval. Radiological measurements were compared with 3D-CT measurements by an independent, blinded external institute. We found a mean difference of 1.2° (sd 6.2) between radiological and 3D-CT measurements of stem version. The correlation between the mean radiological and 3D-CT stem torsion was r = 0.88 (p < 0.001). The intra- (intraclass correlation coefficient ≥ 0.94) and inter-observer agreement (mean concordance correlation coefficient = 0.87) for the radiological measurements were excellent. We found that femoral tilt was associated with the mean radiological measurement error (r = 0.22, p = 0.02). The projected neck-shaft angle is a reliable method for measuring stem version on AP radiographs of the hip after a THA. However, a highly standardised radiological technique is required for its precise measurement.

0 comments Cited 6 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

D. F. Hamilton: Role: Research Fellow, Department of Trauma and Orthopaedics

M. Ghert: Role: Associate Professor, Department of Surgery Deputy Editor,

A. H. R. W. Simpson: Role: Editor-in-Chief

Journal

Journal ID (nlm-ta): Bone Joint Res

Journal ID (iso-abbrev): Bone Joint Res

Title: Bone & Joint Research

Publisher: British Editorial Society of Bone and Joint Surgery

ISSN (Print): 2046-3758

ISSN (Electronic): 2046-3758

Publication date Collection: September 2015

Publication date (Electronic): 01 September 2015

Volume: 4

Issue: 9

Pages: 152-153

Affiliations

[1 ]University of Edinburgh, FU413 Chancellor’s Building, 49 Little France Crescent, Edinburgh, EH16 4SB, UK.

[2 ]The Bone and Joint Journal, 22 Buckingham Street, London, WC2N 6ET, UK.

[3 ]The Bone and Joint Journal, 22 Buckingham Street, London, WC2N 6ET, UK.

Author notes

Correspondence should be sent to Professor A.H. R. W. Simpson; e-mail: e.vodden@ 123456boneandjoint.org.uk

Article

Publisher ID: 2000571

DOI: 10.1302/2046-3758.49.2000571

PMC ID: 4678365

PubMed ID: 26392591

SO-VID: 0e28945b-59a2-473b-b2db-ad11a1716b64

History

Date received : 03 September 2015

Date accepted : 03 September 2015

Custom metadata

odf2nlm-version 1.0

price $2.00

principal-institution The British Editorial Society of Bone and Joint Surgery, London, United Kingdom

article-type Editorial

Keywords: regression,outcome,studies

Data availability:

Keywords: regression, outcome, studies

Comments

Comment on this article

scite_

Cited by 25

See all cited by

Most referenced authors 40

See all reference authors

Interpreting regression models in clinical outcome studies

Read this article at

Abstract

Related collections

Italian Studies

Most cited references 5

Predicting early clinical function after hip or knee arthroplasty

Reproducibility and side differences of mechanical tests for determining the structural strength of the proximal femur.

The validity of a novel radiological method for measuring femoral stem version on anteroposterior radiographs of the hip after total hip arthroplasty.

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Categories

Custom metadata

Comments

Comment on this article

Similar content 51

Cited by 25

Most referenced authors 40