AUTHOR=Winder Zachary , Sudduth Tiffany L. , Fardo David , Cheng Qiang , Goldstein Larry B. , Nelson Peter T. , Schmitt Frederick A. , Jicha Gregory A. , Wilcock Donna M. TITLE=Hierarchical Clustering Analyses of Plasma Proteins in Subjects With Cardiovascular Risk Factors Identify Informative Subsets Based on Differential Levels of Angiogenic and Inflammatory Biomarkers JOURNAL=Frontiers in Neuroscience VOLUME=14 YEAR=2020 URL=https://www.frontiersin.org/journals/neuroscience/articles/10.3389/fnins.2020.00084 DOI=10.3389/fnins.2020.00084 ISSN=1662-453X ABSTRACT=
Agglomerative hierarchical clustering analysis (HCA) is a commonly used unsupervised machine learning approach for identifying informative natural clusters of observations. HCA is performed by calculating a pairwise dissimilarity matrix and then clustering similar observations until all observations are grouped within a cluster. Verifying the empirical clusters produced by HCA is complex and not well studied in biomedical applications. Here, we demonstrate the comparability of a novel HCA technique with one that was used in previous biomedical applications while applying both techniques to plasma angiogenic (FGF, FLT, PIGF, Tie-2, VEGF, VEGF-D) and inflammatory (MMP1, MMP3, MMP9, IL8, TNFα) protein data to identify informative subsets of individuals. Study subjects were diagnosed with mild cognitive impairment due to cerebrovascular disease (MCI-CVD). Through comparison of the two HCA techniques, we were able to identify subsets of individuals, based on differences in VEGF (