Display options
Share it on

Front Oncol. 2020 Mar 13;10:186. doi: 10.3389/fonc.2020.00186. eCollection 2020.

Humanizing Big Data: Recognizing the Human Aspect of Big Data.

Frontiers in oncology

Kathy Helzlsouer, Daoud Meerzaman, Stephen Taplin, Barbara K Dunn

Affiliations

  1. Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD, United States.
  2. Center for Biomedical Informatics and Information Technology, National Cancer Institute, Bethesda, MD, United States.
  3. Center for Global Health, National Cancer Institute, Bethesda, MD, United States.
  4. Division of Cancer Prevention, National Cancer Institute, Bethesda, MD, United States.

PMID: 32231993 PMCID: PMC7082327 DOI: 10.3389/fonc.2020.00186

Abstract

The term "big data" refers broadly to large volumes of data, often gathered from several sources, that are then analyzed, for example, for predictive analytics. Combining and mining genetic data from varied sources including clinical genetic testing, for example, electronic health records, what might be termed as "recreational" genetic testing such as ancestry testing, as well as research studies, provide one type of "big data." Challenges and cautions in analyzing big data include recognizing the lack of systematic collection of the source data, the variety of assay technologies used, the potential variation in classification and interpretation of genetic variants. While advanced technologies such as microarrays and, more recently, next-generation sequencing, that enable testing an individual's DNA for thousands of genes and variants simultaneously are briefly discussed, attention is focused more closely on challenges to analysis of the massive data generated by these genomic technologies. The main theme of this review is to evaluate challenges associated with big data in general and specifically to bring the sophisticated technology of genetic/genomic testing down to the individual level, keeping in mind the human aspect of the data source and considering where the impact of the data will be translated and applied. Considerations in this "humanizing" process include providing adequate counseling and consent for genetic testing in all settings, as well as understanding the strengths and limitations of assays and their interpretation.

Copyright © 2020 Helzlsouer, Meerzaman, Taplin and Dunn.

Keywords: big data; cancer risk prediction; clinical genetics/genomics; data sharing; direct-to-consumer testing; precision medicine; predictive analytics

References

  1. Pharmacogenomics. 2019 Jun;20(8):609-620 - PubMed
  2. BMC Genomics. 2019 Nov 15;20(1):868 - PubMed
  3. N Engl J Med. 2015 Jan 29;372(5):397-9 - PubMed
  4. Ann Epidemiol. 2017 May;27(5):297-301 - PubMed
  5. Adv Exp Med Biol. 2019;1192:3-15 - PubMed
  6. Lancet. 2010 May 1;375(9725):1525-35 - PubMed
  7. J Med Internet Res. 2013 Aug 21;15(8):e186 - PubMed
  8. N Engl J Med. 2018 Oct 04;379(14):1353-1362 - PubMed
  9. IEEE J Biomed Health Inform. 2015 Jul;19(4):1193-208 - PubMed
  10. Semin Oncol. 2016 Feb;43(1):36-48 - PubMed
  11. Int J Mol Sci. 2017 Feb 15;18(2): - PubMed
  12. J Epidemiol Community Health. 2017 Nov;71(11):1113-1117 - PubMed
  13. BMC Genomics. 2019 Nov 4;20(1):805 - PubMed
  14. Curr Protoc Hum Genet. 2014 Apr 24;81:9.15.1-8 - PubMed
  15. Nat Rev Genet. 2010 Jan;11(1):31-46 - PubMed
  16. Biopreserv Biobank. 2017 Oct;15(5):469-474 - PubMed
  17. Am J Hum Genet. 2019 Feb 7;104(2):275-286 - PubMed
  18. Br Med Bull. 2017 Sep 1;123(1):35-45 - PubMed
  19. Sci Data. 2016 Mar 15;3:160018 - PubMed
  20. J Genet Couns. 2004 Apr;13(2):83-114 - PubMed
  21. JAMA. 2013 Apr 3;309(13):1351-2 - PubMed
  22. Cancer Res. 2019 Oct 15;79(20):5140-5145 - PubMed
  23. Front Med (Lausanne). 2019 Apr 17;6:66 - PubMed
  24. Mol Genet Genomic Med. 2019 May;7(5):e630 - PubMed
  25. Yale J Biol Med. 2013 Sep 20;86(3):359-65 - PubMed

Publication Types