Display options
Share it on

BMC Genomics. 2015;16:S5. doi: 10.1186/1471-2164-16-S6-S5. Epub 2015 Jun 01.

Detection of gene annotations and protein-protein interaction associated disorders through transitive relationships between integrated annotations.

BMC genomics

Marco Masseroli, Arif Canakoglu, Massimiliano Quigliatti

PMID: 26046679 PMCID: PMC4460591 DOI: 10.1186/1471-2164-16-S6-S5

Abstract

BACKGROUND: Increasingly high amounts of heterogeneous and valuable controlled biomolecular annotations are available, but far from exhaustive and scattered in many databases. Several annotation integration and prediction approaches have been proposed, but these issues are still unsolved. We previously created a Genomic and Proteomic Knowledge Base (GPKB) that efficiently integrates many distributed biomolecular annotation and interaction data of several organisms, including 32,956,102 gene annotations, 273,522,470 protein annotations and 277,095 protein-protein interactions (PPIs).

RESULTS: By comprehensively leveraging transitive relationships defined by the numerous association data integrated in GPKB, we developed a software procedure that effectively detects and supplement consistent biomolecular annotations not present in the integrated sources. According to some defined logic rules, it does so only when the semantic type of data and of their relationships, as well as the cardinality of the relationships, allow identifying molecular biology compliant annotations. Thanks to controlled consistency and quality enforced on data integrated in GPKB, and to the procedures used to avoid error propagation during their automatic processing, we could reliably identify many annotations, which we integrated in GPKB. They comprise 3,144 gene to pathway and 21,942 gene to biological function annotations of many organisms, and 1,027 candidate associations between 317 genetic disorders and 782 human PPIs. Overall estimated recall and precision of our approach were 90.56 % and 96.61 %, respectively. Co-functional evaluation of genes with known function showed high functional similarity between genes with new detected and known annotation to the same pathway; considering also the new detected gene functional annotations enhanced such functional similarity, which resembled the one existing between genes known to be annotated to the same pathway. Strong evidence was also found in the literature for the candidate associations detected between Cystic fibrosis disorder and the PPIs between the CFTR_HUMAN, DERL1_HUMAN, RNF5_HUMAN, AHSA1_HUMAN and GOPC_HUMAN proteins, and between the CHIP_HUMAN and HSP7C_HUMAN proteins.

CONCLUSIONS: Although identified gene annotations and PPI-genetic disorder candidate associations require biological validation, our approach intrinsically provides their in silico evidence based on available data. Public availability within the GPKB (http://www.bioinformatics.deib.polimi.it/GPKB/) of all identified and integrated annotations offers a valuable resource fostering new biomedical-molecular knowledge discoveries.

References

  1. BMC Bioinformatics. 2007;8:284 - PubMed
  2. BMC Bioinformatics. 2014;15 Suppl 1:S3 - PubMed
  3. BMC Bioinformatics. 2007;8:170 - PubMed
  4. Bioinformatics. 2013 Feb 1;29(3):355-64 - PubMed
  5. Proc Natl Acad Sci U S A. 1998 Mar 31;95(7):3879-84 - PubMed
  6. PLoS Comput Biol. 2009 Dec;5(12):e1000605 - PubMed
  7. J Cheminform. 2011 May 16;3(1):19 - PubMed
  8. Bioinformatics. 2005 Aug 15;21(16):3416-21 - PubMed
  9. Bioinformatics. 2002 Dec;18(12):1641-9 - PubMed
  10. FEBS Lett. 2009 Jun 5;583(11):1759-65 - PubMed
  11. PLoS Comput Biol. 2012 May;8(5):e1002533 - PubMed
  12. Mol Syst Biol. 2007;3:88 - PubMed
  13. Bioinformatics. 2005 Sep 15;21(18):3587-95 - PubMed
  14. J Integr Bioinform. 2010;7(3). doi: 10.2390/biecoll-jib-2010-119 - PubMed
  15. Mol Biol Cell. 2010 Mar 15;21(6):871-84 - PubMed
  16. Genome Res. 2003 May;13(5):896-904 - PubMed
  17. Cell. 2006 Nov 17;127(4):803-15 - PubMed
  18. Cell. 2006 Aug 11;126(3):571-82 - PubMed
  19. Perspect Biol Med. 1986 Autumn;30(1):7-18 - PubMed
  20. Biochim Biophys Acta. 1988 Sep 7;950(3):282-95 - PubMed
  21. J Cell Sci. 2011 Sep 15;124(Pt 18):3074-83 - PubMed

MeSH terms

Publication Types