Display options
Share it on

BMC Med Genomics. 2017 May 24;10:26. doi: 10.1186/s12920-017-0265-2.

A disease similarity matrix based on the uniqueness of shared genes.

BMC medical genomics

Matthew B Carson, Cong Liu, Yao Lu, Caiyan Jia, Hui Lu

Affiliations

  1. Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 680 N Lake Shore Dr, Suite 1400, Chicago, IL, 60611, USA.
  2. Department of Bioengineering, University of Illinois at Chicago, 851 S Morgan St, Chicago, IL, 60607, USA.
  3. Center for Biomedical Informatics, Shanghai Children's Hospital, 24 W Beijing Rd, Suite 1400, Shanghai, 200000, China.
  4. Department of Computer Science, Beijing Jiaotong University, No.3 Shangyuancun, Haidian District, Beijing, 100044, China.
  5. Department of Bioengineering, University of Illinois at Chicago, 851 S Morgan St, Chicago, IL, 60607, USA. [email protected].
  6. Center for Biomedical Informatics, Shanghai Children's Hospital, 24 W Beijing Rd, Suite 1400, Shanghai, 200000, China. [email protected].
  7. SJTU-Yale Joint Center for Biostatistics, Department of Bioinformatics and Biostatistics, Shanghai Jiaotong University, 800 Dongchuan Road, Shanghai, 200000, China. [email protected].

PMID: 28589854 PMCID: PMC5461528 DOI: 10.1186/s12920-017-0265-2

Abstract

BACKGROUND: Complex diseases involve many genes, and these genes are often associated with several different illnesses. Disease similarity measurement can be based on shared genotype or phenotype. Quantifying relationships between genes can reveal previously unknown connections and form a reference base for therapy development and drug repurposing.

METHODS: Here we introduce a method to measure disease similarity that incorporates the uniqueness of shared genes. For each disease pair, we calculated the uniqueness score and constructed disease similarity matrices using OMIM and Disease Ontology annotation.

RESULTS: Using the Disease Ontology-based matrix, we identified several interesting connections between cancer and other disease and conditions such as malaria, along with studies to support our findings. We also found several high scoring pairwise relationships for which there was little or no literature support, highlighting potentially interesting connections warranting additional study.

CONCLUSIONS: We developed a co-occurrence matrix based on gene uniqueness to examine the relationships between diseases from OMIM and DORIF data. Our similarity matrix can be used to identify potential disease relationships and to motivate further studies investigating the causal mechanisms in diseases.

Keywords: Clustering; Disease-disease similarity; Disease-related genes

References

  1. Genome Res. 2008 Apr;18(4):644-52 - PubMed
  2. PLoS Comput Biol. 2010 Feb 05;6(2):e1000662 - PubMed
  3. Anticancer Res. 2010 Apr;30(4):1371-3 - PubMed
  4. Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90 - PubMed
  5. Nucleic Acids Res. 2012 Jan;40(Database issue):D940-6 - PubMed
  6. BMJ Open. 2012 Jun 11;2(3):null - PubMed
  7. J Theor Biol. 2014 Dec 7;362:3-8 - PubMed
  8. Clin Psychol Rev. 2004 Jul;24(3):315-38 - PubMed
  9. Int J Cancer. 2015 Apr 1;136(7):1646-54 - PubMed
  10. Am J Hum Genet. 2011 Jun 10;88(6):755-766 - PubMed
  11. J Biomed Inform. 2012 Apr;45(2):363-71 - PubMed
  12. Genes Dev. 2011 Apr 1;25(7):717-29 - PubMed
  13. Nucleic Acids Res. 2005 Jan 1;33(Database issue):D514-7 - PubMed
  14. BMC Syst Biol. 2016 Jan 11;10 Suppl 1:9 - PubMed
  15. Science. 2012 Jan 6;335(6064):28, 30-2 - PubMed
  16. AMIA Jt Summits Transl Sci Proc. 2012;2012:42-51 - PubMed
  17. CA Cancer J Clin. 2006 Sep-Oct;56(5):254-81; quiz 313-4 - PubMed
  18. BMC Genomics. 2009 Jul 07;10 Suppl 1:S6 - PubMed
  19. J Am Med Inform Assoc. 2012 Mar-Apr;19(2):295-305 - PubMed
  20. Pharm Res. 2008 Sep;25(9):2097-116 - PubMed
  21. Mol Cancer. 2003 Jan 06;2:4 - PubMed
  22. Rom J Intern Med. 2012 Jan-Mar;50(1):71-81 - PubMed
  23. Gene Regul Syst Bio. 2010 Mar 24;4:19-34 - PubMed
  24. Environ Health. 2015 Aug 15;14:65 - PubMed

MeSH terms

Publication Types