Open Access
Pan-cancer analysis of non-coding recurrent mutations and their possible involvement in cancer pathogenesis
Author(s) -
Chie Kikutake,
Minako Yoshihara,
Mikita Suyama
Publication year - 2021
Publication title -
nar cancer
Language(s) - English
Resource type - Journals
ISSN - 2632-8674
DOI - 10.1093/narcan/zcab008
Subject(s) - biology , genetics , coding region , genome , gene , computational biology , epigenomics , cancer genome sequencing , germline mutation , mutation , whole genome sequencing , dna methylation , gene expression
Abstract Cancer-related mutations have been mainly identified in protein-coding regions. Recent studies have demonstrated that mutations in non-coding regions of the genome could also be a risk factor for cancer. However, the non-coding regions comprise 98% of the total length of the human genome and contain a huge number of mutations, making it difficult to interpret their impacts on pathogenesis of cancer. To comprehensively identify cancer-related non-coding mutations, we focused on recurrent mutations in non-coding regions using somatic mutation data from COSMIC and whole-genome sequencing data from The Cancer Genome Atlas (TCGA). We identified 21 574 recurrent mutations in non-coding regions that were shared by at least two different samples from both COSMIC and TCGA databases. Among them, 580 candidate cancer-related non-coding recurrent mutations were identified based on epigenomic and chromatin structure datasets. One of such mutation was located in RREB1 binding site that is thought to interact with TEAD1 promoter. Our results suggest that mutations may disrupt the binding of RREB1 to the candidate enhancer region and increase TEAD1 expression levels. Our findings demonstrate that non-coding recurrent mutations and coding mutations may contribute to the pathogenesis of cancer.