Cancer remains a top threat to human health and society. Novel large-scale approaches based on the electronic health record (EHR), and genomic, transcriptomic, epigenomic, and metagenomics (omics techniques) have allowed the expansion of our current knowledge regarding the etiology of cancer, general or specific to different cancer types. With large omics data that have been generated at individual omics, the plea for multi-omics integrative analysis is also urgent to permit a more complete understanding of the interplay of the genes, microorganisms, and metabolites as well as other risk factors (from EHR and different omics) on cancer. However, these modern high throughput techniques are associated with significant limitations such as the high-dimensional data size relative to the available sample size. Most of these methods are not applicable to non-linear and multi-view datasets.
Markedly, the number of genes and genomic variants can be more than tens of thousands, while the sample size in cancer study is mostly in hundreds or less. Statistical methods appropriate for high-dimensional data have been widely studied, for example, the penalized regression. Recently, the breakthrough of machine-learning methods featured by the deep learning leads to a new direction for high-dimensional data analysis, as well as cancer research. The incorporation of various statistical machine learning approaches into genomic analyses is a rather recent area of study. Since large-scale microarray data presents significant challenges for the statistical data analysis, in addition to the classical approaches, there is a need for an advanced method.
This Research Topic will focus on the development and application of novel statistical and machine-learning methods, such as the kernel-based methods, for high-dimensional clinical and (multi-)omics data in cancer-related research.
We welcome the submission of Original Research, Reviews, Mini-Reviews, Perspectives, and Opinions covering, but not limited to, the following topics:
• Integrative analysis of clinical and (multi-)omics cancer data;
• Prediction in cancer diagnosis, survival/progression, and drug responses;
• Incorporation of statistical, kernel-based, and deep learning methods into high-dimensional data analysis.
Cancer remains a top threat to human health and society. Novel large-scale approaches based on the electronic health record (EHR), and genomic, transcriptomic, epigenomic, and metagenomics (omics techniques) have allowed the expansion of our current knowledge regarding the etiology of cancer, general or specific to different cancer types. With large omics data that have been generated at individual omics, the plea for multi-omics integrative analysis is also urgent to permit a more complete understanding of the interplay of the genes, microorganisms, and metabolites as well as other risk factors (from EHR and different omics) on cancer. However, these modern high throughput techniques are associated with significant limitations such as the high-dimensional data size relative to the available sample size. Most of these methods are not applicable to non-linear and multi-view datasets.
Markedly, the number of genes and genomic variants can be more than tens of thousands, while the sample size in cancer study is mostly in hundreds or less. Statistical methods appropriate for high-dimensional data have been widely studied, for example, the penalized regression. Recently, the breakthrough of machine-learning methods featured by the deep learning leads to a new direction for high-dimensional data analysis, as well as cancer research. The incorporation of various statistical machine learning approaches into genomic analyses is a rather recent area of study. Since large-scale microarray data presents significant challenges for the statistical data analysis, in addition to the classical approaches, there is a need for an advanced method.
This Research Topic will focus on the development and application of novel statistical and machine-learning methods, such as the kernel-based methods, for high-dimensional clinical and (multi-)omics data in cancer-related research.
We welcome the submission of Original Research, Reviews, Mini-Reviews, Perspectives, and Opinions covering, but not limited to, the following topics:
• Integrative analysis of clinical and (multi-)omics cancer data;
• Prediction in cancer diagnosis, survival/progression, and drug responses;
• Incorporation of statistical, kernel-based, and deep learning methods into high-dimensional data analysis.