AUTHOR=Magnusson Rasmus, Rundquist Olof, Kim Min Jung, Hellberg Sandra, Na Chan Hyun, Benson Mikael, Gomez-Cabrero David, Kockum Ingrid, Tegnér Jesper N., Piehl Fredrik, Jagodic Maja, Mellergård Johan, Altafini Claudio, Ernerudh Jan, Jenmalm Maria C., Nestor Colm E., Kim Min-Sik, Gustafsson Mika TITLE=RNA-sequencing and mass-spectrometry proteomic time-series analysis of T-cell differentiation identified multiple splice variants models that predicted validated protein biomarkers in inflammatory diseases JOURNAL=Frontiers in Molecular Biosciences VOLUME=9 YEAR=2022 URL=https://www.frontiersin.org/journals/molecular-biosciences/articles/10.3389/fmolb.2022.916128 DOI=10.3389/fmolb.2022.916128 ISSN=2296-889X ABSTRACT=
Profiling of mRNA expression is an important method for identifying biomarkers, but it is complicated by the limited correlation between mRNA expression and protein abundance. We hypothesised that this correlation could be improved by mathematical models that account for splice variants and for the time delay in protein translation. We characterised time-series of primary human naïve CD4+ T cells during early T helper type 1 differentiation with RNA-sequencing and mass-spectrometry proteomics. We performed computational time-series analysis in this system and in two other key human and murine immune cell types. Linear mathematical mixed time-delayed splice variant models were used to predict protein abundances, and the models were validated using out-of-sample predictions. Lastly, we re-analysed RNA-seq datasets to evaluate biomarker discovery in five T-cell-associated diseases, further validating the findings for multiple sclerosis (MS) and asthma. The new models significantly outperformed models that did not include multiple splice variants and time delays, as shown in cross-validation tests. Our mathematical models yielded more differentially expressed proteins between patients and controls in all five diseases. Moreover, analysis of these proteins in asthma and MS supported their relevance. One marker, sCD27, was validated in MS using two independent cohorts for evaluating response to treatment and disease prognosis. In summary, our splice variant and time delay models substantially improved the prediction of protein abundance from mRNA expression in three different immune cell types. The models provided valuable biomarker candidates, which were further validated in MS and asthma.