Display options
Share it on

Genome Biol. 2008;9(12):R170. doi: 10.1186/gb-2008-9-12-r170. Epub 2008 Dec 05.

FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease.

Genome biology

Rong Chen, Alex A Morgan, Joel Dudley, Tarangini Deshpande, Li Li, Keiichi Kodama, Annie P Chiang, Atul J Butte

Affiliations

  1. Stanford Center for Biomedical Informatics Research, 251 Cmpus Drive, Stanford, CA 94305, USA. [email protected]

PMID: 19061490 PMCID: PMC2646274 DOI: 10.1186/gb-2008-9-12-r170

Abstract

BACKGROUND: Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibus to systematically prioritize candidate SNPs from GWASs.

RESULTS: We analyzed all human microarray studies from the Gene Expression Omnibus, and calculated the observed frequency of differential expression, which we called differential expression ratio, for every human gene. Analysis conducted in a comprehensive list of curated disease genes revealed a positive association between differential expression ratio values and the likelihood of harboring disease-associated variants. By considering highly differentially expressed genes, we were able to rediscover disease genes with 79% specificity and 37% sensitivity. We successfully distinguished true disease genes from false positives in multiple GWASs for multiple diseases. We then derived a list of functionally interpolating SNPs (fitSNPs) to analyze the top seven loci of Wellcome Trust Case Control Consortium type 1 diabetes mellitus GWASs, rediscovered all type 1 diabetes mellitus genes, and predicted a novel gene (KIAA1109) for an unexplained locus 4q27. We suggest that fitSNPs would work equally well for both Mendelian and complex diseases (being more effective for cancer) and proposed candidate genes to sequence for their association with 597 syndromes with unknown molecular basis.

CONCLUSIONS: Our study demonstrates that highly differentially expressed genes are more likely to harbor disease-associated DNA variants. FitSNPs can serve as an effective tool to systematically prioritize candidate SNPs from GWASs.

References

  1. Nat Methods. 2007 Nov;4(11):879 - PubMed
  2. Nat Genet. 2008 Mar;40(3):310-5 - PubMed
  3. Diabetes Res Clin Pract. 2008 Feb;79(2):284-90 - PubMed
  4. Nat Genet. 2004 May;36(5):431-2 - PubMed
  5. Horm Metab Res. 2008 Oct;40(10):722-6 - PubMed
  6. Clin Genet. 2007 Jan;71(1):1-11 - PubMed
  7. Nature. 2008 Mar 27;452(7186):423-8 - PubMed
  8. Diabetologia. 2008 May;51(5):821-6 - PubMed
  9. Diabetologia. 2008 Jun;51(6):971-7 - PubMed
  10. Diabetes. 2008 Aug;57(8):2220-5 - PubMed
  11. Bioinformatics. 2007 Jan 15;23(2):215-21 - PubMed
  12. Nature. 2007 Dec 6;450(7171):887-92 - PubMed
  13. Arthritis Rheum. 2007 Nov;56(11):3793-804 - PubMed
  14. Nat Genet. 2008 Sep;40(9):1092-7 - PubMed
  15. J Clin Endocrinol Metab. 2008 Jan;93(1):310-4 - PubMed
  16. Diabetes. 2007 Feb;56(2):513-7 - PubMed
  17. Folia Biol (Praha). 2007;53(5):173-5 - PubMed
  18. Nat Genet. 2008 Sep;40(9):1098-102 - PubMed
  19. Nucleic Acids Res. 2001 Jan 1;29(1):308-11 - PubMed
  20. Science. 2007 Jun 1;316(5829):1341-5 - PubMed
  21. Mol Syst Biol. 2008;4:189 - PubMed
  22. Curr Biol. 2008 Jun 24;18(12):883-9 - PubMed
  23. Diabetes. 2008 Jul;57(7):1992-6 - PubMed
  24. Nucleic Acids Res. 2004 Jun 04;32(10):3108-14 - PubMed
  25. Nature. 2008 Mar 27;452(7186):429-35 - PubMed
  26. Science. 2008 Apr 18;320(5874):325-7 - PubMed
  27. Int J Med Inform. 2005 Mar;74(2-4):289-98 - PubMed
  28. Diabetes. 2006 Jan;55(1):128-35 - PubMed
  29. Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W758-61 - PubMed
  30. Proc Natl Acad Sci U S A. 2004 Apr 20;101(16):6062-7 - PubMed
  31. Clin Immunol. 2008 Nov;129(2):195-201 - PubMed
  32. Am J Hum Genet. 2007 Dec;81(6):1284-8 - PubMed
  33. Diabetes. 2008 Oct;57(10):2834-42 - PubMed
  34. Am J Hum Genet. 2007 Jan;80(1):91-104 - PubMed
  35. Nature. 2005 Oct 27;437(7063):1299-320 - PubMed
  36. Diabetologia. 2007 Dec;50(12):2461-6 - PubMed
  37. Eur J Pharmacol. 1991 May 30;198(1):7-14 - PubMed
  38. Diabetes. 2008 Mar;57(3):791-5 - PubMed
  39. Diabetes. 2008 Apr;57(4):1093-100 - PubMed
  40. Bioinformatics. 2007 May 1;23(9):1132-40 - PubMed
  41. Diabetes. 2002 Aug;51(8):2581-6 - PubMed
  42. Diabetes. 2008 Nov;57(11):3161-5 - PubMed
  43. Science. 2007 Jun 1;316(5829):1331-6 - PubMed
  44. Diabetes Res Clin Pract. 2003 May;60(2):139-41 - PubMed
  45. Diabetes. 1998 Nov;47(11):1806-8 - PubMed
  46. Genome Res. 2002 Jun;12(6):996-1006 - PubMed
  47. J Clin Endocrinol Metab. 2005 May;90(5):3054-9 - PubMed
  48. Int J Biol Sci. 2007 Oct 25;3(7):420-7 - PubMed
  49. Diabetes. 2007 Dec;56(12):3105-11 - PubMed
  50. J Mol Med (Berl). 2006 Dec;84(12):1005-14 - PubMed
  51. Circ Res. 2007 Aug 3;101(3):e11-30 - PubMed
  52. BMC Bioinformatics. 2005 Mar 14;6:55 - PubMed
  53. Nature. 2007 Jun 7;447(7145):661-78 - PubMed
  54. J Hum Genet. 2006;51(7):629-33 - PubMed
  55. Nucleic Acids Res. 2007 Jan;35(Database issue):D760-5 - PubMed
  56. Diabetes Metab. 2008 Jun;34(3):273-8 - PubMed
  57. Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W285-92 - PubMed
  58. PLoS Comput Biol. 2008 Mar 28;4(3):e1000043 - PubMed
  59. Proc Natl Acad Sci U S A. 2007 Mar 13;104(11):4530-5 - PubMed
  60. Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W377-84 - PubMed
  61. BMC Med Genet. 2008 Jul 03;9:59 - PubMed
  62. Nucleic Acids Res. 2005 Mar 14;33(5):1544-52 - PubMed
  63. Proc Natl Acad Sci U S A. 2001 Apr 24;98(9):5116-21 - PubMed
  64. Diabetes. 2008 Aug;57(8):2226-33 - PubMed
  65. J Clin Endocrinol Metab. 2007 Sep;92(9):3733-7 - PubMed
  66. Diabetes. 2004 Dec;53(12):3337-41 - PubMed
  67. Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W717-23 - PubMed
  68. Nature. 2007 Feb 22;445(7130):881-5 - PubMed
  69. Hum Mutat. 2003 Jun;21(6):577-81 - PubMed
  70. J Lipid Res. 2006 Dec;47(12):2601-13 - PubMed
  71. Diabetes. 2008 Apr;57(4):1048-56 - PubMed
  72. J Hum Genet. 2008;53(2):174-180 - PubMed
  73. J Clin Endocrinol Metab. 2008 Jan;93(1):304-9 - PubMed
  74. Nat Genet. 2000 Oct;26(2):163-75 - PubMed
  75. BMC Genet. 2005 Aug 22;6:45 - PubMed
  76. Diabetes Res Clin Pract. 2004 Dec;66 Suppl 1:S63-7 - PubMed
  77. Rheumatology (Oxford). 2004 Aug;43(8):973-9 - PubMed
  78. Diabetes. 2007 Dec;56(12):3112-7 - PubMed
  79. Diabetes Metab Res Rev. 2008 Feb;24(2):137-40 - PubMed
  80. Nat Genet. 2007 Jul;39(7):827-9 - PubMed
  81. Nat Genet. 2007 Jul;39(7):857-64 - PubMed
  82. Diabetes. 2004 Nov;53(11):3002-6 - PubMed
  83. Hum Genet. 2008 Aug;124(1):101-4 - PubMed
  84. J Med Genet. 2006 Aug;43(8):691-8 - PubMed
  85. Genome Res. 2008 May;18(5):706-16 - PubMed

MeSH terms

Publication Types

Grant support