Display options
Share it on

BMC Bioinformatics. 2008 Dec 16;9:542. doi: 10.1186/1471-2105-9-542.

Corra: Computational framework and tools for LC-MS discovery and targeted mass spectrometry-based proteomics.

BMC bioinformatics

Mi-Youn Brusniak, Bernd Bodenmiller, David Campbell, Kelly Cooke, James Eddes, Andrew Garbutt, Hollis Lau, Simon Letarte, Lukas N Mueller, Vagisha Sharma, Olga Vitek, Ning Zhang, Ruedi Aebersold, Julian D Watts

Affiliations

  1. Institute for Systems Biology, 1441 North 34th Street, Seattle, WA 98103, USA. [email protected]

PMID: 19087345 PMCID: PMC2651178 DOI: 10.1186/1471-2105-9-542

Abstract

BACKGROUND: Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics.

RESULTS: We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e.g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling.

CONCLUSION: The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field.

References

  1. Mol Cell Proteomics. 2008 Nov;7(11):2138-50 - PubMed
  2. Mol Syst Biol. 2005;1:2005.0017 - PubMed
  3. Bioinformatics. 2006 Aug 1;22(15):1902-9 - PubMed
  4. Bioinformatics. 2006 May 1;22(9):1096-102 - PubMed
  5. Genome Biol. 2006;7(8):R73 - PubMed
  6. Anal Chem. 2005 Aug 1;77(15):294A-302A - PubMed
  7. Nat Methods. 2007 Mar;4(3):231-7 - PubMed
  8. Mol Cell Proteomics. 2005 Sep;4(9):1328-40 - PubMed
  9. Mol Cell Proteomics. 2004 Dec;3(12):1154-69 - PubMed
  10. Nat Biotechnol. 1999 Oct;17(10):994-9 - PubMed
  11. Mol Cell Proteomics. 2002 May;1(5):376-86 - PubMed
  12. Ann Clin Biochem. 1998 Jan;35 ( Pt 1):62-7 - PubMed
  13. Genome Biol. 2005;6(1):R9 - PubMed
  14. Stat Appl Genet Mol Biol. 2004;3:Article3 - PubMed
  15. Anal Chem. 2002 Oct 15;74(20):5383-92 - PubMed
  16. Anal Chem. 2006 Feb 1;78(3):779-87 - PubMed
  17. Proteomics. 2007 Oct;7(19):3470-80 - PubMed
  18. Nature. 2003 Mar 13;422(6928):198-207 - PubMed
  19. J Proteome Res. 2007 Jul;6(7):2685-94 - PubMed
  20. Mol Biosyst. 2007 Apr;3(4):275-86 - PubMed
  21. Nat Biotechnol. 2004 Nov;22(11):1459-66 - PubMed
  22. Bioinformatics. 2007 Jun 1;23(11):1394-400 - PubMed
  23. Anal Chem. 2007 Aug 1;79(15):5826-37 - PubMed
  24. J Proteome Res. 2006 Feb;5(2):414-21 - PubMed
  25. Mol Cell Proteomics. 2006 Oct;5(10):1927-41 - PubMed
  26. J Proteome Res. 2005 Jul-Aug;4(4):1143-54 - PubMed
  27. Bioinformatics. 2006 Mar 1;22(5):634-6 - PubMed
  28. J Proteome Res. 2008 Jan;7(1):51-61 - PubMed

Substances

MeSH terms

Publication Types

Grant support