Open Access
Improved therapy‐success prediction with GSS estimated from clinical HIV‐1 sequences
Author(s) -
Pironti Alejandro,
Pfeifer Nico,
Kaiser Rolf,
Walter Hauke,
Lengauer Thomas
Publication year - 2014
Publication title -
journal of the international aids society
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.724
H-Index - 62
ISSN - 1758-2652
DOI - 10.7448/ias.17.4.19743
Subject(s) - human immunodeficiency virus (hiv) , medicine , drug resistance , hiv drug resistance , interpretation (philosophy) , drug , set (abstract data type) , test (biology) , test set , machine learning , computational biology , antiretroviral therapy , family medicine , genetics , pharmacology , viral load , computer science , biology , programming language , paleontology
Introduction Rules‐based HIV‐1 drug‐resistance interpretation (DRI) systems disregard many amino‐acid positions of the drug's target protein. The aims of this study are (1) the development of a drug‐resistance interpretation system that is based on HIV‐1 sequences from clinical practice rather than hard‐to‐get phenotypes, and (2) the assessment of the benefit of taking all available amino‐acid positions into account for DRI. Materials and Methods A dataset containing 34,934 therapy‐naïve and 30,520 drug‐exposed HIV‐1 pol sequences with treatment history was extracted from the EuResist database and the Los Alamos National Laboratory database. 2,550 therapy‐change‐episode baseline sequences (TCEB) were assigned to test set A. Test set B contains 1,084 TCEB from the HIVdb TCE repository. Sequences from patients absent in the test sets were used to train three linear support vector machines to produce scores that predict drug exposure pertaining to each of 20 antiretrovirals: the first one uses the full amino‐acid sequences (DE full ), the second one only considers IAS drug‐resistance positions (DE onlyIAS ), and the third one disregards IAS drug‐resistance positions (DE noIAS ). For performance comparison, test sets A and B were evaluated with DE full , DE noIAS , DE onlyIAS , geno2pheno [resistance] , HIVdb, ANRS, HIV‐GRADE, and REGA. Clinically‐validated cut‐offs were used to convert the continuous output of the first four methods into susceptible‐intermediate‐resistant (SIR) predictions. With each method, a genetic susceptibility score (GSS) was calculated for each therapy episode in each test set by converting the SIR prediction for its compounds to integer: S=2, I=1, and R=0. The GSS were used to predict therapy success as defined by the EuResist standard datum definition. Statistical significance was assessed using a Wilcoxon signed‐rank test. Results A comparison of the therapy‐success prediction performances among the different interpretation systems for test set A can be found in Table 1, while those for test set B are found in Figure 1. Therapy‐success prediction of first‐line therapies with DE noIAS performed better than DE onlyIAS (p<10–16). Conclusions Therapy success prediction benefits from the consideration of all available mutations. The increase in performance was largest in first‐line therapies with transmitted drug‐resistance mutations.