| Literature DB >> 34158935 |
Anirudh Prabhu1, Shaunna M Morrison2, Ahmed Eleish1, Hao Zhong1, Fang Huang3, Joshua J Golden4, Samuel N Perry5, Daniel R Hummer6, Jolyon Ralph7, Simone E Runyon8, Kathleen Fontaine1, Sergey Krivovichev9, Robert T Downs4, Robert M Hazen2, Peter Fox1.
Abstract
Minerals contain important clues to understanding the complex geologic history of Earth and other planetary bodies. Therefore, geologists have been collecting mineral samples and compiling data about these samples for centuries. These data have been used to better understand the movement of continental plates, the oxidation of Earth's atmosphere and the water regime of ancient martian landscapes. Datasets found at 'RRUFF.info/Evolution' and 'mindat.org' have documented a wealth of mineral occurrences around the world. One of the main goals in geoinformatics has been to facilitate discovery by creating and merging datasets from various scientific fields and using statistical methods and visualization tools to inspire and test hypotheses applicable to modelling Earth's past environments. To help achieve this goal, we have compiled physical, chemical and geological properties of minerals and linked them to the above-mentioned mineral occurrence datasets. As a part of the Deep Time Data Infrastructure, funded by the W.M. Keck Foundation, with significant support from the Deep Carbon Observatory (DCO) and the A.P. Sloan Foundation, GEMI ('Global Earth Mineral Inventory') was developed from the need of researchers to have all of the required mineral data visible in a single portal, connected by a robust, yet easy to understand schema. Our data legacy integrates these resources into a digestible format for exploration and analysis and has allowed researchers to gain valuable insights from mineralogical data. GEMI can be considered a network, with every node representing some feature of the datasets, for example, a node can represent geological parameters like colour, hardness or lustre. Exploring subnetworks gives the researcher a specific view of the data required for the task at hand. GEMI is accessible through the DCO Data Portal (https://dx.deepcarbon.net/11121/6200-6954-6634-8243-CC). We describe our efforts in compiling GEMI, the Data Policies for usage and sharing, and the evaluation metrics for this data legacy.Entities:
Keywords: data legacy; information model; information system; mineral inventory; minerals; online interface
Year: 2020 PMID: 34158935 PMCID: PMC8216291 DOI: 10.1002/gdj3.106
Source DB: PubMed Journal: Geosci Data J ISSN: 2049-6060 Impact factor: 1.778
FIGURE 1(a) Mineral Evolution Database (MED) locality page and mineral list for Sarfartoq Carbonatite Complex, Sarfartoq Region, Isortoq Fjord (Søndre Isortoq), Qeqqata, Greenland, Denmark (https://rruff.info/mineral_list/locality.php?mindat_id=123391). This webpage contains information on the oldest known and youngest known mineral age at this locality (in millions of years), the locality's Mindat ID and URL, latitude and longitude and the minerals attributed to this locality along with their age classification and associated legend. Dolomite is the only directly dated mineral at this locality and is therefore coloured in green, per the legend scheme. The age of dolomite is assigned to all other minerals at this locality with the exception of calcite. Calcite is assigned the age range of all calcite ages of sublocalities (‘child localities’). (b) Expanded mineral data table of Figure 1a. This expansion gives access to information on ages, structure types, chemical composition and the number of localities each mineral has in MED
FIGURE 2(a) An example of the Handbook of Mineralogy PDFs from which mineralogical information was extracted with automated information extraction (credit: The Handbook of Mineralogy via the RRUFF Project, http://rruff.info/doclib/hom/calcite.pdf). (b), Calcite mineral sample (Credit: The RRUFF Project database, http://rruff.info/calcite/display=default/R040070)
FIGURE 3GEMI information model. The information model consists of 2 main parts, the geological properties centred around minerals and the location properties centred around the mineral locality. This is the latest version of the information model as of the publication of this paper
FIGURE 4(a) and (b) A screenshot of the user interface is shown in the figure above. A list of results returned on that particular browser appears at the centre and extends down and to the right. On the left-hand side, there is a list of facets, corresponding to properties of the data type to which the results correspond. A user can constrain the criteria that control the query by selecting values within the facets. A feature of the browser is that once a facet value is selected, the result set and all values available in other facets are refreshed after the query is rerun. The user can make selections in several facets at once and can also use the search box above the result set to perform a free-text search. Other actions such as exporting the result set are implemented based on specific requirements for each browser and are highly customizable