Estimation of viral richness from shotgun metagenomes using a frequency count approachReport as inadecuate

Estimation of viral richness from shotgun metagenomes using a frequency count approach - Download this document for free, or read online. Document in PDF available to download.


, 1:5

First Online: 04 February 2013Received: 27 July 2012Accepted: 21 November 2012


BackgroundViruses are important drivers of ecosystem functions, yet little is known about the vast majority of viruses. Viral shotgun metagenomics enables the investigation of broad ecological questions in phage communities. One ecological characteristic is species richness, which is the number of different species in a community. Viruses do not have a phylogenetic marker analogous to the bacterial 16S rRNA gene with which to estimate richness, and so contig spectra are employed to measure the number of virus taxa in a given community. A contig spectrum is generated from a viral shotgun metagenome by assembling the random sequence reads into groups of sequences that overlap contigs and counting the number of sequences that group within each contig. Current tools available to analyze contig spectra to estimate phage richness are limited by relying on rank-abundance data.

ResultsWe present statistical estimates of virus richness from contig spectra. The program CatchAll was used to analyze contig spectra in terms of frequency count data rather than rank-abundance, thus enabling formal statistical analyses. Also, the influence of potentially spurious low-frequency counts on richness estimates was minimized by two methods, empirical and statistical. The results show greater estimates of viral richness than previous calculations in nearly all environments analyzed, including swine feces and reclaimed fresh water.

ConclusionsCatchAll yielded consistent estimates of richness across viral metagenomes from the same or similar environments. Additionally, analysis of pooled viral metagenomes from different environments via mixed contig spectra resulted in greater richness estimates than those of the component metagenomes. Using CatchAll to analyze contig spectra will improve estimations of richness from viral shotgun metagenomes, particularly from large datasets, by providing statistical measures of richness.

KeywordsPhage Metagenomics Virome Ecology Richness CatchAll Singleton AbbreviationsPHACCsPhage Communities from Contig Spectrum.

Electronic supplementary materialThe online version of this article doi:10.1186-2049-2618-1-5 contains supplementary material, which is available to authorized users.

Download fulltext PDF

Author: Heather K Allen - John Bunge - James A Foster - Darrell O Bayles - Thaddeus B Stanton


Related documents