The JCVI Standard Operating Procedure for Annotating Prokaryotic Metagenomic Shotgun Sequencing Data

David Tanenbaum, Johannes Goll, Sean Murphy, Prateek Kumar, Nikhat Zafar, Mathangi Thiagarajan, Ramana Madupu, Tanja Davidsen, Leonid Kagan, Saul Kravitz, Douglas B. Rusch, Shibu Yooseph

Abstract


The JCVI metagenomics analysis pipeline provides for the efficient and consistent annotation of shotgun metagenomics sequencing data for sampling communities of prokaryotic organisms.  The process can be equally applied to individual sequence reads from traditional Sanger capillary electrophoresis sequences, newer technologies such as 454 pyrosequencing, or sequence assemblies derived from one or more of these data types.  It includes the analysis of both coding and non-coding genes, whether full-length or, as is often the case for shotgun metagenomics, fragmentary.  The system is designed to provide the best-supported conservative functional annotation based on a combination of trusted homology-based scientific evidence and computational assertions and an annotation value hierarchy established through extensive manual curation.  The functional annotation attributes assigned by this system include gene name, gene symbol, GO terms [1], EC numbers [2], and JCVI functional role categories [3].

doi:10.4056/sigs.651139


Keywords


J. Craig Venter Institute, prokaryotic shotgun metagenomics, environmental sequencing, functional annotation, Global Ocean Sampling, Sargasso Sea

Full Text: HTML PDF

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.

This article doi:10.4056/sigs.651139 has been cited by 5 other articles:

Community ecology of hot spring cyanobacterial mats: predominant populations and their functional potential
Klatt et al.
ISME J 5(8) 1262.
10.1038/ismej.2011.73

TheViral MetaGenome Annotation Pipeline (VMGAP):an automated tool for the functional annotation of viral Metagenomic shotgun sequencing data
Lorenzi et al.
Stand. Genomic Sci. 4(3) 418.
10.4056/sigs.1694706

Efficient de novo assembly of single-cell bacterial genomes from short-read data sets
Chitsaz et al.
Nat Biotechnol 29(10) 915.
10.1038/nbt.1966

The State of Standards in Genomic Sciences
Garrity
Stand. Genomic Sci. 5(3) 262.
10.4056/sigs.2515706

Influence of nutrients and currents on the genomic composition of microbes across an upwelling mosaic
Allen et al.
ISME J () .
10.1038/ismej.2011.201




Acknowledgements

We would like to gratefully acknowledge the support of many members of the Genomic Standards Consortium, the broader genomic science community, and those who have indicated their willingness to serve as editors, reviewers and contributors.

Funding for SIGS is provided by a grant from the Office of the Vice President for Research and Graduate Studies at Michigan State University, the Michigan State University Foundation, and the US Department of Energy Biological and Environmental Research DE-FG02-08ER64707.

Standards in Genomic Sciences is indexed in:

Sponsors of the Genomic Standards Consortium: