
94% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
GENERAL COMMENTARY article
Front. Immunol., 30 June 2023
Sec. Cancer Immunity and Immunotherapy
Volume 14 - 2023 | https://doi.org/10.3389/fimmu.2023.913339
This article is a commentary on:
A Novel Signature of 23 Immunity-Related Gene Pairs Is Prognostic of Cutaneous Melanoma
A commentary on
A novel signature of 23 immunity-related gene pairs is prognostic of cutaneous melanoma
by Xue Y-N, Xue Y-N, Wang Z-C, Mo Y-Z, Wang P-Y and Tan W-Q (2020) Front. Immunol. 11:576914. doi: 10.3389/fimmu.2020.576914
This paper is a commentary on the article “A Novel Signature of 23 Immunity Related Gene Pairs Is Prognostic of Cutaneous Melanoma” (Ya-Nan Xue et al., 2020) (1). The authors aimed to identify an immune-related signature for predicting prognosis in cutaneous melanoma (CM). But there are some statistical errors in the statistical inference process, which mainly reflect in the correlation between interval estimation and hypothesis testing showing in TABLE 2 presenting Univariate Cox and Multivariate Cox analysis of clinicopathological factors and risk signatures. Through the following discussion of this published paper, we can be more familiar with the correlation between interval estimation and hypothesis testing, and avoid errors in statistics.
It’s clear that there are some statistical errors in TABLE 2. For example, in the TCGA dataset, the 95% confidence intervals (CI) of the hazard ratios (HRs) of age and stage in Multivariate Cox analysis of clinicopathological factors and risk signatures are 0.98 - 1.04 and 0.83 - 1.43, respectively, but p-value = 0.010 and 0.008, respectively; Moreover, in the GSE59455 validation dataset the 95% CI of the HR of risk Score in Multivariate Cox analysis of clinicopathological factors and risk signatures is 0.94 - 2.34, but p-value = 0.042. It is known that the range of this 95% CI includes 1, so this test should not be statistically significant (p-value > 0.05). Then, our task group had attempted to reproduce the statistical analyses. However, the results demonstrate the validity of our original vision that there are some numerical results in the original paper and the 95% CI for HR which includes HR = 1, certainly couldn’t has a corresponding p-value< 0.05. (the data have not yet been published). Next, to confirm the validity of our conclusions, we explain the concepts of interval estimation and hypothesis testing, as well as the relationship between them in detail later in this commentary.
Interval estimation involves an interval that measures the accuracy of a point estimate. The point estimate of parameter is calculated to get a specific value, which is easy to calculate and use. However, the accuracy of the calculated point estimation cannot be explained by the point estimation itself, and it needs to be reflected by the corresponding distribution. In reality, we usually use an interval to cover the known point estimation, and we can also say that we can get the range of the value of the point estimation, it should be a range, as defined below:
Let θ be a parameter of the population, x1, x 2,…, xn be the sample, obtain the statistics θL (x1, x2,…, xn), θU (x1, x2,…, xn), make θL < θU; get the sample observations, then θ is estimated in the interval [θL, θU]. Because of the randomness of the sample, the probability of the cover parameter θ is uncertain, and we usually need to give an artificial probability here. Of course, we certainly want the probability P of the cover θ to be as large as possible, but this will inevitably lead to an increase in the length of the interval, where the probability of the interval [θL, θU] covering θ is given to 1-α, then P θ (θL ≤ θ ≤ θU)≥1-α, then the confidence level of the random interval [θL, θU] is called the confidence level of θ of 1-α, θL and θU are called θ’s (bilateral) lower and upper confidence limits, respectively.
The hypothesis testing is a process of testing the validity of hypotheses. It is a statistical inference method used to judge whether the differences between samples and population are caused by sampling errors or by the essential differences between samples. There are many methods of hypothesis testing. As one of the most commonly used hypothesis testing methods, the basic principle of significance test is to make some assumptions about a parameter of the population based on some non-sample information, and then through the statistical reasoning of sampling information, make a judgment on whether this hypothesis should be rejected. For example, gene sets with an FDR-adjusted probability (p) value< 0.05 were considered statistically significant in this published article. That said, when p value was greater than 0.05, the null hypothesis should be rejected.
It is worth mentioning that there is a very close relationship between the CI of normal population parameters and the test of hypothesis testing. The similarity between the pivotal quantity and test statistics is not accidental, but there is a one-to-one correspondence between the two statistical inference methods (2). The two problems can be transformed into each other. The derivation is as follows:
In Two-side test:
Null hypothesis (H0): μ = μ0 Alternative hypothesis(H1): μ ≠ μ0
The test statistic: ; Rejection region W: ; Acceptance region , that is, , It can eventually be rewritten as , which is the 1-α CI for the population parameter μ.
Consequently, Under the same condition, the 1-α CI problem in interval estimation corresponds to the hypothesis test problem with the significance level α, and the result is the same. Therefore, the range of this 95% CI includes 1, which has been equated with P > 0.05, so this test should not be statistically significant.
The p value and CI are the two most widely used statistical inference tools. The American Statistical Association issued a warning statement on the use of the p value due to the misuse and abuse of the p value (3). There are some mistakes in understanding the p value and CI which caused the wrong use of the p value (4). Furthermore, the researchers may have never considered the relationship between the p value and CI. Therefore, this paper expects to introduce the definition of p value and CI through the analysis of that previously published paper mentioned above, and we also hope to analyze the relationship and difference between the p value and CI, to help researchers understand the p value and CI correctly.
Undoubtedly, the authors of that previously published article have failed in understanding the relationship between CI and p value. Only when we understand them correctly can we accurately use them to explain the research results and significance of the article, which is crucial to the quality of the article.
All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.
This work was supported by Beijing Municipal Science & Technology Commission Program (Nos. Z171100001017203 and D181100000218005).
RS would like to acknowledge the companionship of his beloved dogs Huang Tao and Hei Dou, and the generous support and invaluable help of his huckleberry friend Zhenkai Luo during the writing of this manuscript.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
1. Xue YN, Xue YN, Wang ZC, Mo YZ, Wang PY, Tan WQ. A novel signature of 23 immunity-related gene pairs is prognostic of cutaneous melanoma. Front Immunol (2020) 11:576914. doi: 10.3389/fimmu.2020.576914
2. Altman DG, Bland JM. How to obtain the confidence interval from a p value. BMJ (2011) 343:d2090. doi: 10.1136/bmj.d2090
3. Wasserstein RL, Lazar NA. The ASA statement on p-values: context, process, and purpose. Am Statistician (2016) 70(2):129–33. doi: 10.1080/00031305.2016.1154108
Keywords: signature, immunity, gene pairs, prognostic, cutaneous melanoma
Citation: Shen R and Zou T (2023) Commentary: A novel signature of 23 immunity related gene pairs is prognostic of cutaneous melanoma. Front. Immunol. 14:913339. doi: 10.3389/fimmu.2023.913339
Received: 05 April 2022; Accepted: 23 June 2023;
Published: 30 June 2023.
Edited by:
Enrique Hernandez-Lemus, National Institute of Genomic Medicine (INMEGEN), MexicoCopyright © 2023 Shen and Zou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tong Zou, em91dG9uZzIwMDFAMTYzLmNvbQ==
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.