Display options
Share it on

F1000Res. 2017 Mar 07;6:227. doi: 10.12688/f1000research.11022.3. eCollection 2017.

Picopore: A tool for reducing the storage size of Oxford Nanopore Technologies datasets without loss of functionality.

F1000Research

Scott Gigante

Affiliations

  1. Walter & Eliza Hall Institute of Medical Research, Parkville, Victoria, 3121, Australia.

PMID: 28413619 PMCID: PMC5365225 DOI: 10.12688/f1000research.11022.3

Abstract

Oxford Nanopore Technologies' (ONT's) MinION and PromethION long-read sequencing technologies are emerging as genuine alternatives to established Next-Generation Sequencing technologies. A combination of the highly redundant file format and a rapid increase in data generation have created a significant problem both for immediate data storage on MinION-capable laptops, and for long-term storage on lab data servers. We developed Picopore, a software suite offering three methods of compression. Picopore's lossless and deep lossless methods provide a 25% and 44% average reduction in size, respectively, without removing any data from the files. Picopore's raw method provides an 88% average reduction in size, while retaining biologically relevant data for the end-user. All methods have the capacity to run in real-time in parallel to a sequencing run, reducing demand for both immediate and long-term storage space.

Keywords: Compression; DNA Sequencing; Data Storage; Genome Informatics; Nanopore Sequencing

Conflict of interest statement

Competing interests: No competing interests were disclosed.

References

  1. Bioinformatics. 2014 Dec 1;30(23):3399-401 - PubMed
  2. Genome Biol. 2016 Nov 25;17 (1):239 - PubMed
  3. F1000Res. 2015 Oct 15;4:1075 - PubMed
  4. Nat Biotechnol. 2012 Apr 10;30(4):295-6 - PubMed
  5. Nature. 2016 Feb 11;530(7589):228-232 - PubMed
  6. Nat Methods. 2015 Aug;12(8):733-5 - PubMed

Publication Types