z-logo
open-access-imgOpen Access
Salivary metabolomics with machine learning for colorectal cancer detection
Author(s) -
Kuwabara Hiroshi,
Katsumata Kenji,
Iwabuchi Atsuhiro,
Udo Ryutaro,
Tago Tomoya,
Kasahara Kenta,
Mazaki Junichi,
Enomoto Masanobu,
Ishizaki Tetsuo,
Soya Ryoko,
Kaneko Miku,
Ota Sana,
Enomoto Ayame,
Soga Tomoyoshi,
Tomita Masaru,
Sunamura Makoto,
Tsuchida Akihiko,
Sugimoto Masahiro,
Nagakawa Yuichi
Publication year - 2022
Publication title -
cancer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.035
H-Index - 141
eISSN - 1349-7006
pISSN - 1347-9032
DOI - 10.1111/cas.15472
Subject(s) - saliva , colorectal cancer , metabolomics , receiver operating characteristic , medicine , confidence interval , metabolite , area under the curve , gastroenterology , chromatography , cancer , chemistry
As the worldwide prevalence of colorectal cancer (CRC) increases, it is vital to reduce its morbidity and mortality through early detection. Saliva‐based tests are an ideal noninvasive tool for CRC detection. Here, we explored and validated salivary biomarkers to distinguish patients with CRC from those with adenoma (AD) and healthy controls (HC). Saliva samples were collected from patients with CRC, AD, and HC. Untargeted salivary hydrophilic metabolite profiling was conducted using capillary electrophoresis–mass spectrometry and liquid chromatography–mass spectrometry. An alternative decision tree (ADTree)‐based machine learning (ML) method was used to assess the discrimination abilities of the quantified metabolites. A total of 2602 unstimulated saliva samples were collected from subjects with CRC ( n  = 235), AD ( n  = 50), and HC ( n  = 2317). Data were randomly divided into training ( n  = 1301) and validation datasets ( n  = 1301). The clustering analysis showed a clear consistency of aberrant metabolites between the two groups. The ADTree model was optimized through cross‐validation (CV) using the training dataset, and the developed model was validated using the validation dataset. The model discriminating CRC + AD from HC showed area under the receiver‐operating characteristic curves (AUC) of 0.860 (95% confidence interval [CI]: 0.828‐0.891) for CV and 0.870 (95% CI: 0.837‐0.903) for the validation dataset. The other model discriminating CRC from AD + HC showed an AUC of 0.879 (95% CI: 0.851‐0.907) and 0.870 (95% CI: 0.838‐0.902), respectively. Salivary metabolomics combined with ML demonstrated high accuracy and versatility in detecting CRC.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here