Identifying disease‐associated copy number variations by a doubly penalized regression model
Author(s) -
Cheng Yichen,
Dai James Y.,
Wang Xiaoyu,
Kooperberg Charles
Publication year - 2018
Publication title -
Biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.298
H-Index - 130
eISSN - 1541-0420
pISSN - 0006-341X
DOI - 10.1111/biom.12920
Subject(s) - regression, statistics, copy number variation, disease, regression analysis, mathematics, computer science, biology, medicine, genetics, genome, pathology, gene
Summary Copy number variation (CNV) of DNA plays an important role in the development of many diseases. However, because CNVs are irregular and sparse, studying the association between CNVs and a disease outcome or trait can be challenging. To date, few methods have been proposed in the literature for this problem. Most researchers currently rely on an ad hoc two‐stage procedure: first identifying CNVs in each individual genome, then performing an association test using the identified CNVs. This can lose information and, as a result, reduce the power to identify disease‐associated CNVs. In this article, we describe a new method that combines the two steps into a single coherent model to identify common CNVs across patients that are associated with certain diseases. We use a double penalty model to capture the CNVs’ association with both the intensities and the disease trait. We validate its performance on simulated datasets and in a data example on platinum resistance and CNVs in the ovarian cancer genome.
