z-logo
open-access-imgOpen Access
HeLa Nucleic Acid Contamination in The Cancer Genome Atlas Leads to the Misidentification of Human Papillomavirus 18
Author(s) -
Paul G. Cantalupo,
Joshua P. Katz,
James M. Pipas
Publication year - 2015
Publication title -
journal of virology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.617
H-Index - 292
eISSN - 1070-6321
pISSN - 0022-538X
DOI - 10.1128/jvi.03365-14
Subject(s) - biology , nucleic acid , human papillomavirus , hela , genome , contamination , atlas (anatomy) , genetics , virology , papillomaviridae , human genome , computational biology , cancer , cervical cancer , gene , cell culture , ecology , medicine , paleontology
We searched The Cancer Genome Atlas (TCGA) database for viruses by comparing non-human reads present in transcriptome sequencing (RNA-Seq) and whole-exome sequencing (WXS) data to viral sequence databases. Human papillomavirus 18 (HPV18) is an etiologic agent of cervical cancer, and as expected, we found robust expression of HPV18 genes in cervical cancer samples. In agreement with previous studies, we also found HPV18 transcripts in non-cervical cancer samples, including those from the colon, rectum, and normal kidney. However, in each of these cases, HPV18 gene expression was low, and single-nucleotide variants and positions of genomic alignments matched the integrated portion of HPV18 present in HeLa cells. Chimeric reads that match a known virus-cell junction of HPV18 integrated in HeLa cells were also present in some samples. We hypothesize that HPV18 sequences in these non-cervical samples are due to nucleic acid contamination from HeLa cells. This finding highlights the problems that contamination presents in computational virus detection pipelines.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom