Human endogenous retroviruses (HERVs) make up 8% of the human genome. HERVs are biologically active elements related to multiple diseases. HERV-K, a subfamily of HERVs, has been associated with certain types of cancer and suggested as an immunologic target in some tumors. The expression levels of HERV-K in breast cancer (BCa) have been studied as biomarkers and immunologic therapeutic targets. However, HERV-K has multiple copies in the human genome, and few studies determined the transcriptional profile of HERV-K copies across the human genome for BCa.
Ninety-one HERV-K indexes with entire proviral sequences were used as the reference database. Nine raw sequencing datasets with 243 BCa and 137 control samples were mapped to this database by Salmon software. The differential proviral expression across several groups was analyzed by DESeq2 software.
First, the clustering of each dataset demonstrated that these 91 HERV-K proviruses could well cluster the BCa and control samples when the normal controls were normal cells or healthy donor tissues. Second, several common HERV-K proviruses that are closely related with BCa risk were significantly differentially expressed (
The expression profiling of these 91 HERV-K proviruses can be used as biomarkers to distinguish individuals with BCa and healthy controls. Some proviruses, especially 17p13.1, were strongly associated with BCa risk. The results suggest that HERV-K expression profiles may be appropriate biomarkers and targets for BCa.