Display options
Share it on

Algorithms Mol Biol. 2010 Nov 08;5:35. doi: 10.1186/1748-7188-5-35.

Estimating the evidence of selection and the reliability of inference in unigenic evolution.

Algorithms for molecular biology : AMB

Andrew D Fernandes, Benjamin P Kleinstiver, David R Edgell, Lindi M Wahl, Gregory B Gloor

Affiliations

  1. Department of Biochemistry, The University of Western Ontario, N6A 5C1 Canada. [email protected].

PMID: 21059250 PMCID: PMC2994857 DOI: 10.1186/1748-7188-5-35

Abstract

BACKGROUND: Unigenic evolution is a large-scale mutagenesis experiment used to identify residues that are potentially important for protein function. Both currently-used methods for the analysis of unigenic evolution data analyze 'windows' of contiguous sites, a strategy that increases statistical power but incorrectly assumes that functionally-critical sites are contiguous. In addition, both methods require the questionable assumption of asymptotically-large sample size due to the presumption of approximate normality.

RESULTS: We develop a novel approach, termed the Evidence of Selection (EoS), removing the assumption that functionally important sites are adjacent in sequence and and explicitly modelling the effects of limited sample-size. Precise statistical derivations show that the EoS score can be easily interpreted as an expected log-odds-ratio between two competing hypotheses, namely, the hypothetical presence or absence of functional selection for a given site. Using the EoS score, we then develop selection criteria by which functionally-important yet non-adjacent sites can be identified. An approximate power analysis is also developed to estimate the reliability of inference given the data. We validate and demonstrate the the practical utility of our method by analysis of the homing endonuclease I-Bmol, comparing our predictions with the results of existing methods.

CONCLUSIONS: Our method is able to assess both the evidence of selection at individual amino acid sites and estimate the reliability of those inferences. Experimental validation with I-Bmol proves its utility to identify functionally-important residues of poorly characterized proteins, demonstrating increased sensitivity over previous methods without loss of specificity. With the ability to guide the selection of precise experimental mutagenesis conditions, our method helps make unigenic analysis a more broadly applicable technique with which to probe protein function.

AVAILABILITY: Software to compute, plot, and summarize EoS data is available as an open-source package called 'unigenic' for the 'R' programming language at http://www.fernandes.org/txp/article/13/an-analytical-framework-for-unigenic-evolution.

References

  1. Curr Biol. 2003 May 27;13(11):973-8 - PubMed
  2. Gene. 2003 Apr 24;309(1):49-56 - PubMed
  3. Genes Dev. 1999 Nov 1;13(21):2863-74 - PubMed
  4. J Mol Evol. 1993 Feb;36(2):182-98 - PubMed
  5. Genetics. 1995 Dec;141(4):1263-74 - PubMed
  6. J Mol Biol. 2004 Nov 5;343(5):1231-41 - PubMed
  7. Proc R Soc Lond A Math Phys Sci. 1946;186(1007):453-61 - PubMed
  8. Proc Natl Acad Sci U S A. 2001 Jul 3;98(14):7898-903 - PubMed
  9. PCR Methods Appl. 1994 Jun;3(6):S136-40 - PubMed
  10. Philos Trans R Soc Lond B Biol Sci. 1996 Nov 29;351(1347):1529-38 - PubMed
  11. Mol Biol Evol. 1994 Sep;11(5):725-36 - PubMed
  12. Cell Mol Life Sci. 2005 Oct;62(19-20):2161-72 - PubMed
  13. Bioinformatics. 2007 Jul 1;23(13):i319-27 - PubMed
  14. Nucleic Acids Res. 1993 Feb 11;21(3):777-8 - PubMed
  15. Nucleic Acids Res. 2001 Dec 1;29(23):4793-9 - PubMed
  16. J Mol Biol. 2007 Jan 26;365(4):1143-62 - PubMed
  17. J Mol Biol. 2002 Dec 13;324(5):933-51 - PubMed
  18. Nucleic Acids Res. 2010 Apr;38(7):2411-27 - PubMed
  19. Methods Enzymol. 1994;244:19-61 - PubMed
  20. BMC Bioinformatics. 2006 Mar 17;7:150 - PubMed
  21. Biochemistry. 1993 Jul 6;32(26):6523-30 - PubMed

Publication Types