Display options
Share it on

Nat Microbiol. 2022 Jan;7(1):108-119. doi: 10.1038/s41564-021-01019-2. Epub 2021 Dec 14.

Synthetic DNA spike-ins (SDSIs) enable sample tracking and detection of inter-sample contamination in SARS-CoV-2 sequencing workflows.

Nature microbiology

Kim A Lagerborg, Erica Normandin, Matthew R Bauer, Gordon Adams, Katherine Figueroa, Christine Loreth, Adrianne Gladden-Young, Bennett M Shaw, Leah R Pearlman, Daniel Berenzy, Hannah B Dewey, Susan Kales, Sabrina T Dobbins, Erica S Shenoy, David Hooper, Virginia M Pierce, Kimon C Zachary, Daniel J Park, Bronwyn L MacInnis, Ryan Tewhey, Jacob E Lemieux, Pardis C Sabeti, Steven K Reilly, Katherine J Siddle

Affiliations

  1. Broad Institute of Harvard and MIT, Cambridge, MA, USA.
  2. Harvard Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA.
  3. Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
  4. Division of Infectious Diseases, Massachusetts General Hospital, Boston, MA, USA.
  5. The Jackson Laboratory, Bar Harbor, ME, USA.
  6. Department of Pathology, Massachusetts General Hospital, Boston, MA, USA.
  7. Pediatric Infectious Disease Unit, MassGeneral Hospital for Children, Boston, MA, USA.
  8. Department of Pathology, Harvard Medical School, Boston, MA, USA.
  9. Department of Medicine, Harvard Medical School, Boston, MA, USA.
  10. Infection Control Unit, Massachusetts General Hospital, Boston, MA, USA.
  11. Department of Immunology and Infectious Diseases, Harvard T. H. Chan School of Public Health, Harvard University, Boston, MA, USA.
  12. Massachusetts Consortium on Pathogen Readiness, Boston, MA, USA.
  13. Graduate School of Biomedical Sciences and Engineering, University of Maine, Orono, ME, USA.
  14. Graduate School of Biomedical Sciences, Tufts University School of Medicine, Boston, MA, USA.
  15. Howard Hughes Medical Institute, Chevy Chase, MD, USA.
  16. Broad Institute of Harvard and MIT, Cambridge, MA, USA. [email protected].
  17. Department of Genetics, Yale School of Medicine, New Haven, CT, USA. [email protected].

PMID: 34907347 DOI: 10.1038/s41564-021-01019-2

Abstract

The global spread and continued evolution of SARS-CoV-2 has driven an unprecedented surge in viral genomic surveillance. Amplicon-based sequencing methods provide a sensitive, low-cost and rapid approach but suffer a high potential for contamination, which can undermine laboratory processes and results. This challenge will increase with the expanding global production of sequences across a variety of laboratories for epidemiological and clinical interpretation, as well as for genomic surveillance of emerging diseases in future outbreaks. We present SDSI + AmpSeq, an approach that uses 96 synthetic DNA spike-ins (SDSIs) to track samples and detect inter-sample contamination throughout the sequencing workflow. We apply SDSIs to the ARTIC Consortium's amplicon design, demonstrate their utility and efficiency in a real-time investigation of a suspected hospital cluster of SARS-CoV-2 cases and validate them across 6,676 diagnostic samples at multiple laboratories. We establish that SDSI + AmpSeq provides increased confidence in genomic data by detecting and correcting for relatively common, yet previously unobserved modes of error, including spillover and sample swaps, without impacting genome recovery.

© 2021. The Author(s), under exclusive licence to Springer Nature Limited.

References

  1. Washington, N. L. et al. Emergence and rapid transmission of SARS-CoV-2 B.1.1.7 in the United States. Cell 184, 2587–2594.e7 (2021). - PubMed
  2. Walensky, R. P., Walke, H. T. & Fauci, A. S. SARS-CoV-2 variants of concern in the United States—challenges and opportunities. JAMA 325, 1037–1038 (2021). - PubMed
  3. Wang, P. et al. Antibody resistance of SARS-CoV-2 variants B.1.351 and B.1.1.7. Nature 593, 130–135 (2021). - PubMed
  4. Focosi, D., Tuccori, M., Baj, A. & Maggi, F. SARS-CoV-2 variants: a synopsis of in vitro efficacy data of convalescent plasma, currently marketed vaccines, and monoclonal antibodies. Viruses 13, 7, https://doi.org/10.3390/v13071211 (2021). - PubMed
  5. Wang, P. et al. Increased resistance of SARS-CoV-2 variant P.1 to antibody neutralization. Cell Host Microbe 29, 747–751.e4 (2021). - PubMed
  6. Naveca, F. et al. SARS-CoV-2 Reinfection by the New Variant of Concern (VOC) P.1 in Amazonas, Brazil https://virological.org/t/sars-cov-2-reinfection-by-the-new-variant-of-concern-voc-p-1-in-amazonas-brazil/596 (2021). - PubMed
  7. Genomic Sequencing of SARS-CoV-2: A Guide to Implementation for Maximum Impact on Public Health https://www.who.int/publications/i/item/9789240018440 (WHO, 2021). - PubMed
  8. COVID-19 Genomics U. K. (COG-UK) [email protected]. An integrated national scale SARS-CoV-2 genomic surveillance network. Lancet Microbe 1, e99–e100 (2020). - PubMed
  9. Chiara, M. et al. Next generation sequencing of SARS-CoV-2 genomes: challenges, applications and opportunities. Brief. Bioinform. https://doi.org/10.1093/bib/bbaa297 (2020). - PubMed
  10. Charre, C. et al. Evaluation of NGS-based approaches for SARS-CoV-2 whole genome characterisation. Virus Evol. 6, veaa075 (2020). - PubMed
  11. Rausch, J. W., Capoferri, A. A., Katusiime, M. G., Patro, S. C. & Kearney, M. F. Low genetic diversity may be an Achilles heel of SARS-CoV-2. Proc. Natl Acad. Sci. USA 117, 24614–24616 (2020). - PubMed
  12. Endo, A. Centre for the Mathematical Modelling of Infectious Diseases COVID-19 Working Group, Abbott, S., Kucharski, A. J. & Funk, S. Estimating the overdispersion in COVID-19 transmission using outbreak sizes outside China.Wellcome Open Res. 5, 67, https://doi.org/10.12688/wellcomeopenres.15842.3 (2020). - PubMed
  13. Lagerborg, K. A., Watrous, J. D., Cheng, S. & Jain, M. High-throughput measure of bioactive lipids using non-targeted mass spectrometry. Methods Mol. Biol. 1862, 17–35 (2019). - PubMed
  14. Boja, E. S. & Rodriguez, H. Mass spectrometry-based targeted quantitative proteomics: achieving sensitive and reproducible detection of proteins. Proteomics 12, 1093–1110 (2012). - PubMed
  15. Chen, K. et al. The overlooked fact: fundamental need for spike-in control for virtually all genome-wide analyses. Mol. Cell. Biol. 36, 662–667 (2016). - PubMed
  16. Illumina: COVIDSeq Test https://www.illumina.com/products/by-type/ivd-products/covidseq.html (2020). - PubMed
  17. Jiang, L. et al. Synthetic spike-in standards for RNA-seq experiments. Genome Res. 21, 1543–1551 (2011). - PubMed
  18. Quail, M. A. et al. SASI-Seq: sample assurance spike-ins, and highly differentiating 384 barcoding for Illumina sequencing. BMC Genomics 15, 110, https://doi.org/10.1186/1471-2164-15-110 (2014). - PubMed
  19. Dilucca, M., Forcelloni, S., Georgakilas, A. G., Giansanti, A., & Pavlopoulou, A. Codon Usage and Phenotypic Divergences of SARS-CoV-2 Genes. Viruses 12, 5 (2020). - PubMed
  20. Potapov, V. & Ong, J. L. Examining sources of error in PCR by single-molecule sequencing. PLoS ONE 12, e0169774 (2017). - PubMed
  21. Lemieux, J. E. et al. Phylogenetic analysis of SARS-CoV-2 in Boston highlights the impact of superspreading events. Science 371, 6529, https://doi.org/10.1126/science.abe3261 (2021). - PubMed
  22. So, A. P. et al. A robust targeted sequencing approach for low input and variable quality DNA from clinical samples. NPJ Genom. Med. 3, 2, https://doi.org/10.1038/s41525-017-0041-4 (2018). - PubMed
  23. Grubaugh, N. D. et al. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar. Genome Biol. 20, 8, https://doi.org/10.1186/s13059-018-1618-7 (2019). - PubMed
  24. Pipelines R&D, et al. COVID-19 ARTIC v3 Illumina Library Construction and Sequencing Protocol v5 https://doi.org/10.17504/protocols.io.bibtkann (2020). - PubMed
  25. Lam, C. et al. SARS-CoV-2 Genome Sequencing Methods Differ in Their Abilities To Detect Variants from Low-Viral-Load Samples. J. Clin. Microbiol. 59, 11, https://doi.org/10.1128/JCM.01046-21 (2021). - PubMed
  26. Quick, J. et al. Real-time, portable genome sequencing for Ebola surveillance. Nature 530, 228–232 (2016). - PubMed
  27. Metsky, H. C. et al. Zika virus evolution and spread in the Americas. Nature 546, 411–415 (2017). - PubMed
  28. Gohl, D. M. et al. A rapid, cost-effective tailed amplicon method for sequencing SARS-CoV-2. BMC Genomics 21, 863, https://doi.org/10.1186/s12864-020-07283-6 (2020). - PubMed
  29. Itokawa, K., Sekizuka, T., Hashino, M., Tanaka, R. & Kuroda, M. Disentangling primer interactions improves SARS-CoV-2 genome sequencing by multiplex tiling PCR. PLoS ONE 15, e0239403 (2020). - PubMed
  30. Tyson, J. R. et al. Improvements to the ARTIC multiplex PCR method for SARS-CoV-2 genome sequencing using nanopore. Preprint at bioRxiv https://doi.org/10.1101/2020.09.04.283077 (2020). - PubMed
  31. VarSkip: VarSkip Multiplex PCR Designs for SARS-CoV-2 Sequencing https://github.com/nebiolabs/VarSkip (2021). - PubMed
  32. ARTIC: nanopore protocol for nCoV2019 novel coronavirus. https://github.com/artic-network/artic-ncov2019 (2020). - PubMed
  33. Matranga, C. B. et al. Enhanced methods for unbiased deep sequencing of Lassa and Ebola RNA viruses from clinical and biological samples. Genome Biol. 15, 519 (2014). - PubMed
  34. Elbe, S. & Buckland-Merrett, G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob. Chall. 1, 33–46 (2017). - PubMed

Publication Types

Grant support