Display options
Share it on

ACM Int Conf Bioinform Comput Biol (2010). 2010 Aug;2010:43-52. doi: 10.1145/1854776.1854788.

Genome-wide compatible SNP intervals and their properties.

The 2010 ACM International Conference on Bioinformatics and Computational Biology : ACM-BCB 2010 : Niagara Falls, New York, U.S.A., August 2-4, 2010. ACM International Conference on Bioinformatics and Computational Biology (1st : 2010 :...

Jeremy Wang, Fernando Pardo-Manual de Villena, Kyle J Moore, Wei Wang, Qi Zhang, Leonard McMillan

Affiliations

  1. Dept. of Computer Science, University of North Carolina, Chapel Hill, NC 27599, USA.
  2. Dept. of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA.

PMID: 29152612 PMCID: PMC5690570 DOI: 10.1145/1854776.1854788

Abstract

Intraspecific genomes can be subdivided into blocks with limited diversity. Understanding the distribution and structure of these blocks will help to unravel many biological problems including the identification of genes associated with complex diseases, finding the ancestral origins of a given population, and localizing regions of historical recombination, gene conversion, and homoplasy. We present methods for partitioning a genome into blocks for which there are no apparent recombinations, thus providing parsimonious sets of compatible genome intervals based on the four-gamete test. Our contribution is a thorough analysis of the problem of dividing a genome into compatible intervals, in terms of its computational complexity, and by providing an achievable lower-bound on the minimal number of intervals required to cover an entire data set. In general, such minimal interval partitions are not unique. However, we identify properties that are common to every possible solution. We also define the notion of an interval set that achieves the interval lower-bound, yet maximizes interval overlap. We demonstrate algorithms for partitioning both haplotype data from inbred mice as well as outbred heterozygous genotype data using extensions of the standard four-gamete test. These methods allow our algorithms to be applied to a wide range of genomic data sets.

References

  1. Trends Genet. 2005 Jun;21(6):318-22 - PubMed
  2. Proc Natl Acad Sci U S A. 2005 Jan 4;102(1):158-62 - PubMed
  3. Proc Natl Acad Sci U S A. 2002 May 28;99(11):7335-9 - PubMed
  4. Bioinformatics. 2003 Apr 12;19(6):780-1 - PubMed
  5. J Comput Biol. 2003;10(1):13-9 - PubMed
  6. PLoS Biol. 2006 Nov;4(12):e395 - PubMed
  7. J Comput Biol. 2003;10(3-4):323-40 - PubMed
  8. Nature. 2005 Oct 27;437(7063):1299-320 - PubMed
  9. Genetics. 2005 Jan;169(1):441-53 - PubMed
  10. J Comput Biol. 2010 Mar;17(3):383-99 - PubMed
  11. Nat Genet. 2007 Sep;39(9):1151-5 - PubMed
  12. Bioinformatics. 2005 Jun;21 Suppl 1:i413-22 - PubMed
  13. Nat Genet. 2007 Sep;39(9):1056-7 - PubMed
  14. Pac Symp Biocomput. 2009;:415-26 - PubMed
  15. Genetics. 1985 Sep;111(1):147-64 - PubMed
  16. J Comput Biol. 2007 Dec;14(10):1273-86 - PubMed
  17. BMC Bioinformatics. 2006 Feb 09;7:61 - PubMed
  18. Science. 2007 Jul 20;317(5836):338-42 - PubMed
  19. Science. 2002 Jun 21;296(5576):2225-9 - PubMed
  20. J Comput Biol. 2006 Mar;13(2):522-53 - PubMed
  21. Am J Hum Genet. 2002 Dec;71(6):1386-94 - PubMed
  22. Nat Genet. 2004 Nov;36(11):1133-7 - PubMed
  23. Nat Genet. 2007 Sep;39(9):1100-7 - PubMed
  24. Genome Res. 2004 May;14(5):908-16 - PubMed
  25. Pac Symp Biocomput. 2009;:150-61 - PubMed
  26. Genetics. 2004 Jan;166(1):537-45 - PubMed
  27. Bioinformatics. 2008 Oct 1;24(19):2215-21 - PubMed
  28. Nature. 2007 Aug 30;448(7157):1050-3 - PubMed
  29. Genetics. 2009 Feb;181(2):747-53 - PubMed
  30. BMC Bioinformatics. 2006 Oct 16;7:454 - PubMed
  31. Am J Hum Genet. 2002 Nov;71(5):1227-34 - PubMed
  32. Nature. 2001 May 10;411(6834):199-204 - PubMed
  33. J Bioinform Comput Biol. 2003 Apr;1(1):1-20 - PubMed
  34. Genome Res. 2007 Jan;17(1):96-107 - PubMed
  35. Bioinformatics. 2007 Jul 15;23(14):1851-3 - PubMed
  36. Nat Genet. 2001 Oct;29(2):229-32 - PubMed
  37. Science. 2009 Aug 7;325(5941):737-40 - PubMed
  38. Genetics. 2002 Apr;160(4):1609-18 - PubMed
  39. Science. 2001 Nov 23;294(5547):1719-23 - PubMed
  40. PLoS Genet. 2006 Jul;2(7):e121 - PubMed

Publication Types

Grant support