Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Misinterpretation and abuse of statistical tests, confidence intervals, and statistical power have been decried for decades, yet remain rampant. A key problem is that there are no interpretations of these concepts that are at once simple, intuitive, correct, and foolproof. Instead, correct use and interpretation of these statistics requires an attention to detail which seems to tax the patience of working scientists. This high cognitive demand has led to an epidemic of shortcut definitions and interpretations that are simply wrong, sometimes disastrously so—and yet these misinterpretations dominate much of the scientific literature. In light of this problem, we provide definitions and a discussion of basic statistics that are more general and critical than typically found in traditional introductory expositions. Our goal is to provide a resource for instructors, researchers, and consumers of statistics whose knowledge of statistical theory and technique may be limited but who wish to avoid and spot misinterpretations. We emphasize how violation of often unstated analysis protocols (such as selecting analyses for presentation based on the P values they produce) can lead to small P values even if the declared test hypothesis is correct, and can lead to large P values even if that hypothesis is incorrect. We then provide an explanatory list of 25 misinterpretations of P values, confidence intervals, and power. We conclude with guidelines for improving statistical interpretation and reporting.

Related collections

Most cited references 111

Record: found
Abstract: not found
Article: not found

The Abuse of Power

John Hoenig, Dennis M Heisey (2001)

0 comments Cited 344 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Toward evidence-based medical statistics. 1: The P value fallacy.

Steven N Goodman (1999)

An important problem exists in the interpretation of modern medical research data: Biological understanding and previous research play little formal role in the interpretation of quantitative results. This phenomenon is manifest in the discussion sections of research articles and ultimately can affect the reliability of conclusions. The standard statistical approach has created this situation by promoting the illusion that conclusions can be produced with certain "error rates," without consideration of information from outside the experiment. This statistical approach, the key components of which are P values and hypothesis tests, is widely perceived as a mathematically coherent approach to inference. There is little appreciation in the medical community that the methodology is an amalgam of incompatible elements, whose utility for scientific inference has been the subject of intense debate among statisticians for almost 70 years. This article introduces some of the key elements of that debate and traces the appeal and adverse impact of this methodology to the P value fallacy, the mistaken idea that a single number can capture both the long-run outcomes of an experiment and the evidential meaning of a single result. This argument is made as a prelude to the suggestion that another measure of evidence should be used--the Bayes factor, which properly separates issues of long-run behavior from evidential strength and allows the integration of background knowledge with statistical findings.

0 comments Cited 229 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Sifting the evidence-what's wrong with significance tests?

J. A. Sterne, G Davey Smith (2001)

0 comments Cited 226 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Sander Greenland: lesdomes@ucla.edu

Stephen J. Senn: stephen.senn@lih.lu

John B. Carlin: john.carlin@mcri.edu.au

Charles Poole: cpoole@unc.edu

Steven N. Goodman: steve.goodman@stanford.edu

Douglas G. Altman: doug.altman@csm.ox.ac.uk

Journal

Journal ID (nlm-ta): Eur J Epidemiol

Journal ID (iso-abbrev): Eur. J. Epidemiol

Title: European Journal of Epidemiology

Publisher: Springer Netherlands (Dordrecht )

ISSN (Print): 0393-2990

ISSN (Electronic): 1573-7284

Publication date (Electronic): 21 May 2016

Publication date PMC-release: 21 May 2016

Publication date (Print): 2016

Volume: 31

Pages: 337-350

Affiliations

[ ]Department of Epidemiology and Department of Statistics, University of California, Los Angeles, CA USA

[ ]Competence Center for Methodology and Statistics, Luxembourg Institute of Health, Strassen, Luxembourg

[ ]RTI Health Solutions, Research Triangle Institute, Research Triangle Park, NC USA

[ ]Clinical Epidemiology and Biostatistics Unit, Murdoch Children’s Research Institute, School of Population Health, University of Melbourne, Melbourne, VIC Australia

[ ]Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC USA

[ ]Meta-Research Innovation Center, Departments of Medicine and of Health Research and Policy, Stanford University School of Medicine, Stanford, CA USA

[ ]Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK

Article

Publisher ID: 149

DOI: 10.1007/s10654-016-0149-3

PMC ID: 4877414

PubMed ID: 27209009

SO-VID: 12657c37-7667-4264-ad0a-59f9ecb8cc52

License:

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

History

Date received : 9 April 2016

Date accepted : 9 April 2016

Funding

Funded by: FundRef http://dx.doi.org/10.13039/501100004963, Seventh Framework Programme;

Award ID: 602552

Custom metadata

ScienceOpen disciplines: Public health

Keywords: confidence intervals,hypothesis testing,null testing,p value,power,significance tests,statistical testing

Data availability:

ScienceOpen disciplines: Public health

Keywords: confidence intervals, hypothesis testing, null testing, p value, power, significance tests, statistical testing

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Read this article at

Abstract

Related collections

Open Research, Open Science, Open Scholarship

Most cited references 111

The Abuse of Power

Toward evidence-based medical statistics. 1: The P value fallacy.

Sifting the evidence-what's wrong with significance tests?

Author and article information

Contributors

Journal

Affiliations

Article

History

Funding

Categories

Custom metadata

Comments

Comment on this article

Similar content 27

Cited by 628

Most referenced authors 1,028