Display options
Share it on

PLoS One. 2014 Jul 22;9(7):e102642. doi: 10.1371/journal.pone.0102642. eCollection 2014.

Bioinformatics on the cloud computing platform Azure.

PloS one

Hugh P Shanahan, Anne M Owen, Andrew P Harrison

Affiliations

  1. Department of Computer Science, Royal Holloway, University of London, Egham, Surrey, United Kingdom.
  2. Department of Mathematical Sciences, University of Essex, Wivenhoe Park, Colchester, United Kingdom.
  3. Department of Mathematical Sciences, University of Essex, Wivenhoe Park, Colchester, United Kingdom; Department of Biological Sciences, University of Essex, Wivenhoe Park, Colchester, United Kingdom.

PMID: 25050811 PMCID: PMC4106841 DOI: 10.1371/journal.pone.0102642

Abstract

We discuss the applicability of the Microsoft cloud computing platform, Azure, for bioinformatics. We focus on the usability of the resource rather than its performance. We provide an example of how R can be used on Azure to analyse a large amount of microarray expression data deposited at the public database ArrayExpress. We provide a walk through to demonstrate explicitly how Azure can be used to perform these analyses in Appendix S1 and we offer a comparison with a local computation. We note that the use of the Platform as a Service (PaaS) offering of Azure can represent a steep learning curve for bioinformatics developers who will usually have a Linux and scripting language background. On the other hand, the presence of an additional set of libraries makes it easier to deploy software in a parallel (scalable) fashion and explicitly manage such a production run with only a few hundred lines of code, most of which can be incorporated from a template. We propose that this environment is best suited for running stable bioinformatics software by users not involved with its development.

References

  1. Bioinformatics. 2012 Sep 1;28(17):2267-9 - PubMed
  2. Bioinformatics. 2010 Aug 1;26(15):1903-4 - PubMed
  3. Proc IEEE/ACM Int Conf Grid Computing. 2009 Oct 13;2009:34-41 - PubMed
  4. BMC Syst Biol. 2010 Dec 03;4:168 - PubMed
  5. BMC Bioinformatics. 2010 May 18;11:259 - PubMed
  6. Bioinformatics. 2011 Apr 15;27(8):1045-51 - PubMed
  7. Genome Biol. 2010;11(5):207 - PubMed
  8. AMIA Annu Symp Proc. 2011;2011:364-73 - PubMed
  9. BMC Bioinformatics. 2010 Dec 21;11 Suppl 12:S1 - PubMed
  10. Nucleic Acids Res. 2012 Jan;40(Database issue):D1082-8 - PubMed
  11. Nucleic Acids Res. 2007 Jan;35(Database issue):D760-5 - PubMed
  12. J Proteome Res. 2009 Jun;8(6):3148-53 - PubMed
  13. Bioinformatics. 2012 Jan 15;28(2):294-5 - PubMed
  14. PLoS One. 2011;6(10):e26624 - PubMed
  15. BMC Bioinformatics. 2010 Dec 21;11 Suppl 12:S3 - PubMed
  16. Nucleic Acids Res. 2009 Jan;37(Database issue):D868-72 - PubMed
  17. Brief Bioinform. 2009 May;10(3):259-77 - PubMed
  18. Nucleic Acids Res. 2012 Apr;40(8):3307-15 - PubMed
  19. Nucleic Acids Res. 2011 Jan;39(Database issue):D427-34 - PubMed
  20. BMC Bioinformatics. 2012 May 04;13:77 - PubMed
  21. Curr Protoc Bioinformatics. 2012 Jun;Chapter 11:Unit11.9 - PubMed
  22. PLoS Comput Biol. 2011 Aug;7(8):e1002147 - PubMed

MeSH terms

Publication Types