NIF LinkOut Portal

Options
Only Pubmed Central
Include Pubmed Central
Sections
Title
Abstract
Introduction
Methods
Results
Supplement
Appendix
Contributions
Background
Commentary
Funding
Limitations
Caption
FILTERS

A novel abundance-based algorithm for binning metagenomic sequences using l-tuples.

Authors:
Wu YW, Ye Y
Affiliation:
Journal:
Journal of computational biology : a journal of computational molecular cell biology

Abstract

Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. Among the computational tools recently developed for metagenomic sequence analysis, binning tools attempt to classify the sequences in a metagenomic dataset into different bins (i.e., species), based on various DNA composition patterns (e.g., the tetramer frequencies) of various genomes. Composition-based binning methods, however, cannot be used to classify very short fragments, because of the substantial variation of DNA composition patterns within a single genome. We developed a novel approach (AbundanceBin) for metagenomics binning by utilizing the different abundances of species living in the same environment. AbundanceBin is an application of the Lander-Waterman model to metagenomics, which is based on the l-tuple content of the reads. AbundanceBin achieved accurate, unsupervised, clustering of metagenomic sequences into different bins, such that the reads classified in a bin belong to species of identical or very similar abundances in the sample. In addition, AbundanceBin gave accurate estimations of species abundances, as well as their genome sizes-two important parameters for characterizing a microbial community. We also show that AbundanceBin performed well when the sequence lengths are very short (e.g., 75 bp) or have sequencing errors. By combining AbundanceBin and a composition-based method (MetaCluster), we can achieve even higher binning accuracy. Supplementary Material is available at www.liebertonline.com/cmb .

  1. Welcome

    Welcome to NIF. Explore available research resources: data, tools and materials, from across the web

  2. Community Resources

    Search for resources specially selected for NIF community

  3. More Resources

    Search across hundreds of additional biomedical databases

  4. Literature

    Search Pub Med abstracts and full text from PubMed Central

  5. Insert your Query

    Enter your search terms here and hit return. Search results for the selected tab will be returned.

  6. Join the Community

    Click here to login or register and join this community.

  7. Categories

    Narrow your search by selecting a category. For additional help in searching, view our tutorials.

  8. Query Info

    Displays the total number of search results. Provides additional information on search terms, e.g., automated query expansions, and any included categories or facets. Expansions, filters and facets can be removed by clicking on the X. Clicking on the + restores them.

  9. Search Results

    Displays individual records and a brief description. Click on the icons below each record to explore additional display options.

X