Affiliation:
1. Department of Economics, Virginia Tech, Blacksburg, VA 24061, USA
Abstract
The paper makes the case that the current discussions on replicability and the abuse of significance testing have overlooked a more general contributor to the untrustworthiness of published empirical evidence: the uninformed and recipe-like implementation of statistical modeling and inference. It is argued that this contributes to the untrustworthiness problem in several different ways, including [a] statistical misspecification, [b] unwarranted evidential interpretations of frequentist inference results, and [c] questionable modeling strategies that rely on curve-fitting. What is more, the alternative proposals to replace or modify frequentist testing, including [i] replacing p-values with observed confidence intervals and effect sizes, and [ii] redefining statistical significance, will not address the untrustworthiness-of-evidence problem, since they are equally vulnerable to [a]–[c]. The paper calls for distinguishing unduly data-dependent ‘statistical results’, such as a point estimate, a p-value, or accept/reject H0, from ‘evidence for or against inferential claims’. The post-data severity (SEV) evaluation of the accept/reject H0 results converts them into evidence for or against germane inferential claims. These claims can be used to address or elucidate several foundational issues, including (i) statistical vs. substantive significance, (ii) the large n problem, and (iii) the replicability of evidence. The SEV perspective also sheds light on the impertinence of the proposed alternatives [i] and [ii], and oppugns the alleged arbitrariness of framing H0 and H1, which is often exploited to undermine the credibility of frequentist testing.
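To make the post-data severity (SEV) evaluation mentioned in the abstract concrete, the sketch below shows its textbook form for a one-sided Normal-mean test, H0: mu &lt;= mu0 vs. H1: mu &gt; mu0, with sigma treated as known. This is an illustrative sketch, not code from the paper; the function name `severity`, the known-sigma assumption, and all example numbers are assumptions made for the example.

```python
# Minimal sketch of a post-data severity (SEV) evaluation for a
# one-sided Normal-mean test (known sigma). Assumption-laden example,
# not the paper's implementation.
from scipy.stats import norm

def severity(xbar, mu1, sigma, n):
    """SEV(mu > mu1) after rejecting H0: the probability that the test
    statistic would be no larger than the one observed if mu = mu1,
    i.e. how severely the claim 'mu > mu1' has passed the test."""
    return norm.cdf((xbar - mu1) / (sigma / n**0.5))

# Example: n = 100, sigma = 1, observed mean xbar = 0.2, so a test of
# mu0 = 0 rejects H0 (d(x0) = 2.0, p ~ 0.023). Severity then grades
# which discrepancies from mu0 the rejection warrants as evidence:
for mu1 in (0.0, 0.1, 0.2, 0.3):
    print(f"SEV(mu > {mu1}) = {severity(0.2, mu1, 1.0, 100):.3f}")
# Prints 0.977, 0.841, 0.500, 0.159: the same 'reject H0' result
# licenses 'mu > 0.1' with high severity but not 'mu > 0.3'.
```

Under these assumptions, the example illustrates how SEV separates statistical from substantive significance: a rejection is converted into evidence only for those inferential claims that have passed the test with high severity.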