A mixed-model approach for powerful testing of genetic associations with cancer risk incorporating tumor characteristics | Zendy

Haoyu Zhang | Zendy; Ni Zhao | Zendy; Thomas U. Ahearn | Zendy; William Wheeler | Zendy; Montserrat GarcíaClosas | Zendy; Nilanjan Chatterjee | Zendy

Open Access

A mixed-model approach for powerful testing of genetic associations with cancer risk incorporating tumor characteristics

Author(s) -

Haoyu Zhang,

Ni Zhao,

Thomas U. Ahearn,

William Wheeler,

Montserrat GarcíaClosas,

Nilanjan Chatterjee

Publication year - 2020

Publication title -

biostatistics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.493

H-Index - 82

eISSN - 1468-4357

pISSN - 1465-4644

DOI - 10.1093/biostatistics/kxz065

Subject(s) - polytomous rasch model , computer science , multiple comparisons problem , missing data , odds ratio , degrees of freedom (physics and chemistry) , maximization , computational biology , statistics , biology , machine learning , mathematics , mathematical optimization , physics , quantum mechanics , item response theory , psychometrics

Cancers are routinely classified into subtypes according to various features, including histopathological characteristics and molecular markers. Previous genome-wide association studies have reported heterogeneous associations between loci and cancer subtypes. However, it is not evident what is the optimal modeling strategy for handling correlated tumor features, missing data, and increased degrees-of-freedom in the underlying tests of associations. We propose to test for genetic associations using a mixed-effect two-stage polytomous model score test (MTOP). In the first stage, a standard polytomous model is used to specify all possible subtypes defined by the cross-classification of the tumor characteristics. In the second stage, the subtype-specific case-control odds ratios are specified using a more parsimonious model based on the case-control odds ratio for a baseline subtype, and the case-case parameters associated with tumor markers. Further, to reduce the degrees-of-freedom, we specify case-case parameters for additional exploratory markers using a random-effect model. We use the Expectation-Maximization algorithm to account for missing data on tumor markers. Through simulations across a range of realistic scenarios and data from the Polish Breast Cancer Study (PBCS), we show MTOP outperforms alternative methods for identifying heterogeneous associations between risk loci and tumor subtypes. The proposed methods have been implemented in a user-friendly and high-speed R statistical package called TOP (https://github.com/andrewhaoyu/TOP).

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore