Sci Data. 2017 May 23;4:170073. doi: 10.1038/sdata.2017.73.

Unique identifiers for small molecules enable rigorous labeling of their atoms.

Scientific Data

Hesam Dashti, William M Westler, John L Markley, Hamid R Eghbalnia

Affiliations

  1. National Magnetic Resonance Facility at Madison, Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA.

PMID: 28534867 PMCID: PMC5441290 DOI: 10.1038/sdata.2017.73

Abstract

Rigorous characterization of small organic molecules in terms of their structural and biological properties is vital to biomedical research. The three-dimensional structure of a molecule, its 'photo ID', is inefficient for searching and matching tasks. Instead, identifiers play a key role in accessing compound data. Unique and reproducible molecule and atom identifiers are required to ensure the correct cross-referencing of properties associated with compounds archived in databases. The best approach to this requirement is the International Chemical Identifier (InChI). However, the current implementation of InChI fails to provide a complete standard for atom nomenclature, and incorrect use of the InChI standard has resulted in the proliferation of non-unique identifiers. We propose a methodology and associated software tools, named ALATIS, that overcomes these shortcomings. ALATIS is an adaptation of InChI, which operates fully within the InChI convention to provide unique and reproducible molecule and all atom identifiers. ALATIS includes an InChI extension for unique atom labeling of symmetric molecules. ALATIS forms the basis for improving reproducibility and unifying cross-referencing across databases.
