BMC Bioinformatics. 2012 Aug 01;13:187. doi: 10.1186/1471-2105-13-187.

ParsEval: parallel comparison and analysis of gene structure annotations.

Daniel S Standage, Volker P Brendel

Affiliations

  1. Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, Iowa 50011, USA.

PMID: 22852583 PMCID: PMC3439248 DOI: 10.1186/1471-2105-13-187

Abstract

BACKGROUND: Accurate gene structure annotation is a fundamental but somewhat elusive goal of genome projects, as witnessed by the fact that (model) genomes typically undergo several cycles of re-annotation. In many cases, it is not only different versions of annotations that need to be compared but also different sources of annotation of the same genome, derived from distinct gene prediction workflows. Such comparisons are of interest to annotation providers, prediction software developers, and end-users, who all need to assess what is common and what is different among distinct annotation sources. We developed ParsEval, a software application for pairwise comparison of sets of gene structure annotations. ParsEval calculates several statistics that highlight the similarities and differences between the two sets of annotations provided. These statistics are presented in an aggregate summary report, with additional details provided as individual reports specific to non-overlapping, gene-model-centric genomic loci. Genome-browser-style graphics embedded in these reports help visualize the genomic context of the annotations. Output from ParsEval is easy both to read and to parse, enabling systematic identification of problematic gene models for subsequent focused analysis.

RESULTS: ParsEval is capable of analyzing annotations for large eukaryotic genomes on typical desktop or laptop hardware. In comparison to existing methods, ParsEval exhibits a considerable performance improvement, both in terms of runtime and memory consumption. Reports from ParsEval can provide relevant biological insights into the gene structure annotations being compared.

CONCLUSIONS: Implemented in C, ParsEval provides the quickest and most feature-rich solution for genome annotation comparison to date. The source code is freely available (under an ISC license) at http://parseval.sourceforge.net/.
