Methods Mol Biol. 2017;1611:135-145. doi: 10.1007/978-1-4939-7015-5_11.
Methods in molecular biology (Clifton, N.J.)
Minoru Kanehisa
PMID: 28451977
KEGG is an integrated database resource for linking sequences to biological functions, from the molecular level to higher levels. Knowledge of molecular functions is stored in the KO (KEGG Orthology) database, while cellular- and organism-level functions are represented in the PATHWAY and MODULE databases. Genes in complete genomes, which are stored in the GENES database, are assigned KO identifiers by an internal annotation procedure. Because all KEGG pathways and modules are represented as networks of KO nodes, these assignments enable the reconstruction of KEGG pathways and modules for interpreting higher-level functions. Here we present knowledge-based prediction methods for the functional characterization of amino acid sequences using the KEGG resource. Specifically, we show how the tools available at the KEGG website, including BlastKOALA and KEGG Mapper, can be used for enzyme annotation and metabolic reconstruction.
Keywords: EC number; Genome annotation; KEGG Mapper; KEGG module; KEGG pathway map; Pathway analysis
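The idea that KO assignments allow modules to be reconstructed can be illustrated with a minimal Python sketch. It is not the KEGG Mapper implementation and does not call any KEGG service. The gene names, the KO labels (`KO_A` to `KO_E`), the module layout (a list of steps, each satisfied by any one of several alternative KOs), and the function names are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set

# Hypothetical gene -> KO assignments, standing in for the output of an
# annotation step. The KO labels are placeholders, not real KEGG identifiers.
gene_to_ko: Dict[str, str] = {
    "gene_001": "KO_A",
    "gene_002": "KO_B",
    "gene_003": "KO_D",
}

# Hypothetical module: an ordered list of steps. Each step is a set of
# alternative KOs, and any one of them is assumed to satisfy the step.
toy_module: List[Set[str]] = [
    {"KO_A"},
    {"KO_B", "KO_C"},
    {"KO_E"},
]


def module_coverage(assignments: Dict[str, str], module: List[Set[str]]) -> List[bool]:
    """Return, for each step, whether at least one of its KOs is present."""
    present = set(assignments.values())
    return [bool(step & present) for step in module]


def is_complete(assignments: Dict[str, str], module: List[Set[str]]) -> bool:
    """A module counts as complete here only if every step is covered."""
    return all(module_coverage(assignments, module))


coverage = module_coverage(gene_to_ko, toy_module)
print(coverage)                             # per-step coverage flags
print(is_complete(gene_to_ko, toy_module))  # overall completeness
```

In this toy case the first two steps are covered and the third is not, so the module is reported as incomplete. The per-step flags show where the gap lies, which is the kind of information a reconstruction is meant to surface.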