AUTHOR=Zrimec Jan , Buric Filip , Kokina Mariia , Garcia Victor , Zelezniak Aleksej TITLE=Learning the Regulatory Code of Gene Expression JOURNAL=Frontiers in Molecular Biosciences VOLUME=8 YEAR=2021 URL=https://www.frontiersin.org/journals/molecular-biosciences/articles/10.3389/fmolb.2021.673363 DOI=10.3389/fmolb.2021.673363 ISSN=2296-889X ABSTRACT=
Data-driven machine learning is the method of choice for predicting molecular phenotypes from nucleotide sequence, modeling gene expression events including protein-DNA binding, chromatin states as well as mRNA and protein levels. Deep neural networks automatically learn informative sequence representations and interpreting them enables us to improve our understanding of the regulatory code governing gene expression. Here, we review the latest developments that apply shallow or deep learning to quantify molecular phenotypes and decode the