A 3-mRNA-based prognostic signature of survival in oral squamous cell carcinoma
Author(s) -
Ruoyan Cao,
Qiqi Wu,
Qiulan Li,
Mianfeng Yao,
Hongbo Zhou
Publication year - 2019
Publication title -
peerj
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.927
H-Index - 70
ISSN - 2167-8359
DOI - 10.7717/peerj.7360
Subject(s) - nomogram , proportional hazards model , oncology , receiver operating characteristic , medicine , cohort , kegg , survival analysis , head and neck squamous cell carcinoma , univariate , lasso (programming language) , cancer , multivariate statistics , head and neck cancer , biology , gene expression , gene , transcriptome , genetics , machine learning , computer science , world wide web
Background Oral squamous cell carcinoma (OSCC) is the most common type of head and neck squamous cell carcinoma with an unsatisfactory prognosis. The aim of this study was to identify potential prognostic mRNA biomarkers of OSCC based on analysis of The Cancer Genome Atlas (TCGA). Methods Expression profiles and clinical data of OSCC patients were collected from TCGA database. Univariate Cox analysis and the least absolute shrinkage and selection operator Cox (LASSO Cox) regression were used to primarily screen prognostic biomarkers. Then multivariate Cox analysis was performed to build a prognostic model based on the selected prognostic mRNAs. Nomograms were generated to predict the individual’s overall survival at 3 and 5 years. The model performance was assessed by the time-dependent receiver operating characteristic (ROC) curve and calibration plot in both training cohort and validation cohort ( GSE41613 from NCBI GEO databases). In addition, machine learning was used to assess the importance of risk factors of OSCC. Finally, in order to explore the potential mechanisms of OSCC, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis was completed. Results Three mRNAs ( CLEC3B , C6 and CLCN1 ) were finally identified as a prognostic biomarker pattern. The risk score was imputed as: (−0.38602 × expression level of CLEC3B ) + (−0.20632 × expression level of CLCN1 ) + (0.31541 × expression level of C6 ). In the TCGA training cohort, the area under the curve (AUC) was 0.705 and 0.711 for 3- and 5-year survival, respectively. In the validation cohort, AUC was 0.718 and 0.717 for 3- and 5-year survival. A satisfactory agreement between predictive values and observation values was demonstrated by the calibration curve in the probabilities of 3- and 5- year survival in both cohorts. Furthermore, machine learning identified the 3-mRNA signature as the most important risk factor to survival of OSCC. Neuroactive ligand-receptor interaction was most enriched mostly in KEGG pathway analysis. Conclusion A 3-mRNA signature ( CLEC3B , C6 and CLCN1 ) successfully predicted the survival of OSCC patients in both training and test cohort. In addition, this signature was an independent and the most important risk factor of OSCC.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom