BMC Med Genomics. 2017 May 24;10:26. doi: 10.1186/s12920-017-0265-2.
A disease similarity matrix based on the uniqueness of shared genes.
BMC medical genomics
Matthew B Carson, Cong Liu, Yao Lu, Caiyan Jia, Hui Lu
Affiliations
Affiliations
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 680 N Lake Shore Dr, Suite 1400, Chicago, IL, 60611, USA.
- Department of Bioengineering, University of Illinois at Chicago, 851 S Morgan St, Chicago, IL, 60607, USA.
- Center for Biomedical Informatics, Shanghai Children's Hospital, 24 W Beijing Rd, Suite 1400, Shanghai, 200000, China.
- Department of Computer Science, Beijing Jiaotong University, No.3 Shangyuancun, Haidian District, Beijing, 100044, China.
- Department of Bioengineering, University of Illinois at Chicago, 851 S Morgan St, Chicago, IL, 60607, USA. [email protected].
- Center for Biomedical Informatics, Shanghai Children's Hospital, 24 W Beijing Rd, Suite 1400, Shanghai, 200000, China. [email protected].
- SJTU-Yale Joint Center for Biostatistics, Department of Bioinformatics and Biostatistics, Shanghai Jiaotong University, 800 Dongchuan Road, Shanghai, 200000, China. [email protected].
PMID: 28589854
PMCID: PMC5461528 DOI: 10.1186/s12920-017-0265-2
Abstract
BACKGROUND: Complex diseases involve many genes, and these genes are often associated with several different illnesses. Disease similarity measurement can be based on shared genotype or phenotype. Quantifying relationships between genes can reveal previously unknown connections and form a reference base for therapy development and drug repurposing.
METHODS: Here we introduce a method to measure disease similarity that incorporates the uniqueness of shared genes. For each disease pair, we calculated the uniqueness score and constructed disease similarity matrices using OMIM and Disease Ontology annotation.
RESULTS: Using the Disease Ontology-based matrix, we identified several interesting connections between cancer and other disease and conditions such as malaria, along with studies to support our findings. We also found several high scoring pairwise relationships for which there was little or no literature support, highlighting potentially interesting connections warranting additional study.
CONCLUSIONS: We developed a co-occurrence matrix based on gene uniqueness to examine the relationships between diseases from OMIM and DORIF data. Our similarity matrix can be used to identify potential disease relationships and to motivate further studies investigating the causal mechanisms in diseases.
Keywords: Clustering; Disease-disease similarity; Disease-related genes
References
- Genome Res. 2008 Apr;18(4):644-52 - PubMed
- PLoS Comput Biol. 2010 Feb 05;6(2):e1000662 - PubMed
- Anticancer Res. 2010 Apr;30(4):1371-3 - PubMed
- Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90 - PubMed
- Nucleic Acids Res. 2012 Jan;40(Database issue):D940-6 - PubMed
- BMJ Open. 2012 Jun 11;2(3):null - PubMed
- J Theor Biol. 2014 Dec 7;362:3-8 - PubMed
- Clin Psychol Rev. 2004 Jul;24(3):315-38 - PubMed
- Int J Cancer. 2015 Apr 1;136(7):1646-54 - PubMed
- Am J Hum Genet. 2011 Jun 10;88(6):755-766 - PubMed
- J Biomed Inform. 2012 Apr;45(2):363-71 - PubMed
- Genes Dev. 2011 Apr 1;25(7):717-29 - PubMed
- Nucleic Acids Res. 2005 Jan 1;33(Database issue):D514-7 - PubMed
- BMC Syst Biol. 2016 Jan 11;10 Suppl 1:9 - PubMed
- Science. 2012 Jan 6;335(6064):28, 30-2 - PubMed
- AMIA Jt Summits Transl Sci Proc. 2012;2012:42-51 - PubMed
- CA Cancer J Clin. 2006 Sep-Oct;56(5):254-81; quiz 313-4 - PubMed
- BMC Genomics. 2009 Jul 07;10 Suppl 1:S6 - PubMed
- J Am Med Inform Assoc. 2012 Mar-Apr;19(2):295-305 - PubMed
- Pharm Res. 2008 Sep;25(9):2097-116 - PubMed
- Mol Cancer. 2003 Jan 06;2:4 - PubMed
- Rom J Intern Med. 2012 Jan-Mar;50(1):71-81 - PubMed
- Gene Regul Syst Bio. 2010 Mar 24;4:19-34 - PubMed
- Environ Health. 2015 Aug 15;14:65 - PubMed
MeSH terms
Publication Types