Genomics Proteomics Bioinformatics. 2010 Dec;8(4):268-73. doi: 10.1016/S1672-0229(10)60029-0.
SNPTransformer: a lightweight toolkit for genome-wide association studies.
Genomics, proteomics & bioinformatics
Changzheng Dong
PMID: 21382596
PMCID: PMC5054149 DOI: 10.1016/S1672-0229(10)60029-0
Abstract
High-throughput genotyping chips have produced huge datasets for genome-wide association studies (GWAS) that have contributed greatly to discovering susceptibility genes for complex diseases. There are two strategies for performing data analysis for GWAS. One strategy is to use open-source or commercial packages that are designed for GWAS. The other is to take advantage of classic genetic programs with specific functions, such as linkage disequilibrium mapping, haplotype inference and transmission disequilibrium tests. However, most classic programs that are available are not suitable for analyzing chip data directly and require custom-made input, which results in the inconvenience of converting raw genotyping files into various data formats. We developed a powerful, user-friendly, lightweight program named SNPTransformer for GWAS that includes five major modules (Transformer, Operator, Previewer, Coder and Simulator). The toolkit not only works for transforming the genotyping files into ten input formats for use with classic genetics packages, but also carries out useful functions such as relational operations on IDs, previewing data files, recoding data formats and simulating marker files, among other functions. It bridges upstream raw genotyping data with downstream genetic programs, and can act as an in-hand toolkit for human geneticists, especially for non-programmers. SNPTransformer is freely available at http://snptransformer.sourceforge.net.
Copyright © 2010 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.
References
- BMC Bioinformatics. 2006 Feb 09;7:60 - PubMed
- Nucleic Acids Res. 2007 Jan;35(Database issue):D663-7 - PubMed
- Nat Genet. 2005 May;37(5):549-54 - PubMed
- Bioinformatics. 2010 Feb 15;26(4):580-1 - PubMed
- Nature. 2005 Oct 27;437(7063):1299-320 - PubMed
- Bioinformatics. 2005 Jan 15;21(2):263-5 - PubMed
- Bioinformatics. 2004 Aug 4;20 Suppl 1:i137-44 - PubMed
- Am J Hum Genet. 2007 Sep;81(3):559-75 - PubMed
- Bioinformatics. 2007 May 15;23(10):1294-6 - PubMed
- J Clin Invest. 2008 May;118(5):1590-605 - PubMed
- Science. 2004 Oct 22;306(5696):636-40 - PubMed
- Nat Genet. 2007 Jul;39(7):906-13 - PubMed
- Hum Hered. 2007;64(1):45-51 - PubMed
- Am J Hum Genet. 2003 Nov;73(5):1162-9 - PubMed
- Bioinformatics. 2000 Feb;16(2):182-3 - PubMed
- Bioinformatics. 2010 Jan 15;26(2):270-2 - PubMed
- Science. 2008 Jan 25;319(5862):395 - PubMed
- Hum Hered. 2008;66(2):87-98 - PubMed
- Bioinformatics. 2009 Jan 15;25(2):284-5 - PubMed
- Nat Methods. 2004 Nov;1(2):109-11 - PubMed
- Nature. 2005 Sep 15;437(7057):376-80 - PubMed
- Bioinformatics. 2010 Feb 15;26(4):560-4 - PubMed
- BMC Res Notes. 2009 Oct 23;2:214 - PubMed
- Bioinformatics. 2007 Mar 1;23(5):644-5 - PubMed
- Nat Genet. 2005 Nov;37(11):1217-23 - PubMed
- Bioinformatics. 2003 Feb 12;19(3):376-82 - PubMed
- Nature. 2007 Oct 18;449(7164):851-61 - PubMed
MeSH terms
Publication Types