The final, formatted version of the article will be published soon.
PERSPECTIVE article
Front. Plant Sci.
Sec. Plant Breeding
Volume 15 - 2024 |
doi: 10.3389/fpls.2024.1480463
Using the Pearson's correlation coefficient as the sole metric to measure the accuracy of quantitative trait prediction: Is it sufficient?
Provisionally accepted- Beijing Academy of Agricultural and Forestry Sciences, Beijing, China
How to evaluate the accuracy of quantitative trait prediction is crucial to choose the best model among several possible choices in plant breeding. Pearson's correlation coefficient (PCC), serving as a metric for quantifying the strength of the linear association between two variables, is widely used to evaluate the accuracy of the quantitative trait prediction models, and generally performs well in most circumstances. However, PCC may not always offer a comprehensive view of predictive accuracy, especially in cases involving nonlinear relationships or complex dependencies in machine learningbased methods. It has been found that many papers on quantitative trait prediction solely use PCC as a single metric to evaluate the accuracy of their models, which is insufficient and limited from a formal perspective. This study addresses this crucial issue by presenting a typical example and conducting a comparative analysis of PCC and nine other evaluation metrics using four traditional methods and four machine learning-based methods, thereby contributing to the improvement of practical applicability and reliability of plant quantitative trait prediction models. It is recommended to employ PCC in conjunction with other evaluation metrics in a targeted manner based on specific application scenarios to reduce the likelihood of drawing misleading conclusions.
Keywords: genomic selection, quantitative trait prediction, Pearson's correlation coefficient, Evaluation metric, Regression prediction
Received: 14 Aug 2024; Accepted: 19 Nov 2024.
Copyright: © 2024 Pan, Liu, Han, Zhang, Zhao, Li and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Kaiyi Wang, Beijing Academy of Agricultural and Forestry Sciences, Beijing, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.