Display options
Share it on

Comput Geosci. 2017 Sep;164-170. doi: 10.1016/j.cageo.2017.06.007. Epub 2017 Jun 10.

A relevancy algorithm for curating earth science data around phenomenon.

Computers & geosciences

Manil Maskey, Rahul Ramachandran, Xiang Li, Amanda Weigel, Kaylin Bugbee, Patrick Gatlin, J J Miller

Affiliations

  1. NASA, Marshall Space Flight Center, Huntsville, AL, USA.
  2. University of Alabama in Huntsville, Huntsville, AL, USA.

PMID: 32440033 PMCID: PMC7241668 DOI: 10.1016/j.cageo.2017.06.007

Abstract

Earth science data are being collected for various science needs and applications, processed using different algorithms at multiple resolutions and coverages, and then archived at different archiving centers for distribution and stewardship causing difficulty in data discovery. Curation, which typically occurs in museums, art galleries, and libraries, is traditionally defined as the process of collecting and organizing information around a common subject matter or a topic of interest. Curating data sets around topics or areas of interest addresses some of the data discovery needs in the field of Earth science, especially for unanticipated users of data. This paper describes a methodology to automate search and selection of data around specific phenomena. Different components of the methodology including the assumptions, the process, and the relevancy ranking algorithm are described. The paper makes two unique contributions to improving data search and discovery capabilities. First, the paper describes a novel methodology developed for automatically curating data around a topic using Earth science metadata records. Second, the methodology has been implemented as a stand-alone web service that is utilized to augment search and usability of data in a variety of tools.

Keywords: Data curation; Earth science phenomena; Information retrieval; Relevancy algorithm; Search paradigms

Publication Types