Inviting an author to review:
Find an author and click ‘Invite to review selected article’ near their name.
Search for authorsSearch for similar articles
7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Robust supervised and unsupervised statistical learning for HIV type 1 coreceptor usage analysis.

      AIDS Research and Human Retroviruses
      CD4 Lymphocyte Count, CD4-CD8 Ratio, Cluster Analysis, HIV Infections, virology, HIV-1, classification, genetics, physiology, Humans, Models, Statistical, Receptors, HIV, analysis, Viral Load, Virus Attachment, env Gene Products, Human Immunodeficiency Virus

      Read this article at

      ScienceOpenPublisherPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Human immunodeficiency virus type 1 (HIV-1) isolates differ in their use of coreceptors to enter target cells. This has important implications for both viral pathogenicity and susceptibility to entry inhibitors, recently approved or under development. Predicting HIV-1 coreceptor usage on the basis of sequence information is a challenging task, due to the high variability of the envelope. The associations of the whole HIV-1 envelope genetic features (subtype, mutations, insertions-deletions, physicochemical properties) and clinical markers (viral RNA load, CD8(+), CD4(+) T cell counts) with viral tropism were investigated, using a set of 2896 (659 after filter, 593 patients) sequence-tropism pairs available at the Los Alamos HIV database. Bootstrapped hierarchical clustering was used to assess mutational covariation. Univariate and multivariate analysis was performed to assess the relative importance of different features. Different machine learning (logistic regression, support vector machines, decision trees, rule bases, instance based reasoning) and feature selection (filter and embedded) methods, along with loss functions (accuracy, AUC of ROC curves, sensitivity, specificity, f-measure), were applied and compared for the classification of X4 variants. Extra-sample error estimation was assessed via multiple cross-validation and adjustments for multiple testing. A high-performing, compact, and interpretable logistic regression model was derived to infer HIV-1 coreceptor tropism for a given patient [accuracy = 92.76 (SD 3.07); AUC = 0.93 (SD 0.04)].

          Related collections

          Author and article information

          Comments

          Comment on this article