PlumX Metrics
Embed PlumX Metrics

Abundance estimation and differential testing on strain level in metagenomics data

Bioinformatics, ISSN: 1460-2059, Vol: 33, Issue: 14, Page: i124-i132
2017
  • 29
    Citations
  • 0
    Usage
  • 97
    Captures
  • 2
    Mentions
  • 0
    Social Media
Metric Options:   Counts1 Year3 Year

Metrics Details

Most Recent Blog

July 19, 2017

Today’s digest is a long one! There is high diversity of studies, with a high abundance of gut microbiome studies, a few publications on fecal

Most Recent News

DCATS: differential composition analysis for flexible single-cell experimental designs

Abstract Differential composition analysis — the identification of cell types that have statistically significant changes in abundance between multiple experimental conditions — is one of the

Conference Paper Description

Motivation: Current metagenomics approaches allow analyzing the composition of microbial communities at high resolution. Important changes to the composition are known to even occur on strain level and to go hand in hand with changes in disease or ecological state. However, specific challenges arise for strain level analysis due to highly similar genome sequences present. Only a limited number of tools approach taxa abundance estimation beyond species level and there is a strong need for dedicated tools for strain resolution and differential abundance testing. Methods: We present DiTASiC (Differential Taxa Abundance including Similarity Correction) as a novel approach for quantification and differential assessment of individual taxa in metagenomics samples. We introduce a generalized linear model for the resolution of shared read counts which cause a significant bias on strain level. Further, we capture abundance estimation uncertainties, which play a crucial role in differential abundance analysis. A novel statistical framework is built, which integrates the abundance variance and infers abundance distributions for differential testing sensitive to strain level. Results: As a result, we obtain highly accurate abundance estimates down to sub-strain level and enable fine-grained resolution of strain clusters. We demonstrate the relevance of read ambiguity resolution and integration of abundance uncertainties for differential analysis. Accurate detections of even small changes are achieved and false-positives are significantly reduced. Superior performance is shown on latest benchmark sets of various complexities and in comparison to existing methods.

Bibliographic Details

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know