Dinesh Kumar Barupal, Oliver Fiehn.
Abstract
BACKGROUND: Blood chemicals are routinely measured in clinical or preclinical research studies to diagnose diseases, assess risks in epidemiological research, or use metabolomic phenotyping in response to treatments. A vast volume of blood-related literature is available via the PubMed database for data mining.
Year: 2019 PMID: 31557052 PMCID: PMC6794490 DOI: 10.1289/EHP4713
Source DB: PubMed Journal: Environ Health Perspect ISSN: 0091-6765 Impact factor: 9.031
Coverage of blood-related structures in different databases and sources relevant for exposome research.
| Source category | Source name and description | Blood chemicals | Application areas |
|---|---|---|---|
| Literature data | PubChem, PubMed | 49,542 | Blood exposome |
| Literature data | PubMed Abstract | 37,070 | Blood exposome |
| Literature data | PMC: Blood Metabolomics | 4,036 | Blood exposome |
| Metabolite databases | Metabolomics Workbench | 11,194 | Metabolism, general |
| Metabolite databases | MassBank of North America (MoNA) | 7,238 | Metabolomics |
| Metabolite databases | Human Metabolome Database | 7,039 | Human metabolism |
| Metabolite databases | LipidMaps database | 3,243 | Lipid metabolism |
| Ontology | National Institutes of Health (NIH), NCBI: Medical Subject Headings | 39,285 | Biological relevance |
| Ontology | Chemical Entities of Biological Interest (ChEBI) | 15,646 | Biological relevance |
| Pathway databases | Kyoto Encyclopedia of Genes and Genomes (KEGG) | 11,440 | Biochemical pathways |
| Pathway databases | NIH, NCBI: Gene (human) | 9,190 | Precision medicine |
| Pathway databases | BioCyc | 5,257 | Biochemical pathways |
| Pathway databases | NIH, NCBI: Structure (Protein) | 3,878 | Precision medicine |
| Pathway databases | NIH, NCBI: BioSystems Database | 3,813 | Biochemical pathways |
| Pathway databases | NIH: Online Mendelian Inheritance in Man | 2,731 | Genetic disorders |
| Government databases | Japan Chemical Substance Dictionary (NIKKAJI) | 30,871 | Biomonitoring |
| Government databases | U.S. Food and Drug Administration (FDA): Structured Product Labeling | 17,362 | Biomonitoring |
| Government databases | European Chemicals Agency (ECHA) | 12,368 | Biomonitoring |
| Government databases | U.S. National Institute of Standards and Technology: Mass Spectrometry Data Center | 10,480 | Biomonitoring |
| Government databases | U.S. Environmental Protection Agency (EPA): Substance Registry Services | 678 | Biomonitoring |
| Government databases | U.S. FDA: Food Additive database | 1,207 | Biomonitoring |
| Government databases | U.S. FDA: Center for Food Safety and Applied Nutrition | 83 | Biomonitoring |
| Pharmacology | NIH, National Library of Medicine (NLM): DailyMed | 4,483 | Drugs |
| Pharmacology | U.S. Department of Agriculture (USDA): Dr. Duke's Phytochemical and Ethnobotanical Database | 4,135 | Food biomarkers |
| Pharmacology | World Health Organization (WHO): Anatomical Therapeutic Chemical Classification System | 3,754 | Drugs |
| Pharmacology | Logical Observation Identifiers Names and Codes | 1,812 | Clinical assays |
| Pharmacology | U.S. FDA: Endocrine Disruptor Knowledge Base | 821 | Endocrine disrupters |
| Toxicological databases | U.S. EPA: Distributed Structure-Searchable Toxicity (DSSTOX) | 21,427 | Exposome: toxicants |
| Toxicological databases | Comparative Toxicogenomics Database | 9,878 | Exposome: toxicants |
| Toxicological databases | NIH: Toxicology in the 21st Century | 6,899 | Exposome: toxicants |
| Toxicological databases | U.S. EPA: Toxic Substances Control Act | 6,515 | Exposome: toxicants |
| Toxicological databases | NIH, NLM: Chemical Carcinogenesis Research Information System | 4,607 | Exposome: toxicants |
| Toxicological databases | NIH, NLM: Hazardous Substances Data Bank | 4,512 | Exposome: toxicants |
| Toxicological databases | NIH, NLM: Information on Hazardous Chemicals and Occupational Diseases | 2,669 | Exposome: occupational |
| Toxicological databases | U.S. EPA: Pesticides | 1,851 | Exposome: toxicants |
| Toxicological databases | The Organization for Economic Co-operation and Development: Existing Chemicals Database | 1,690 | Exposome: daily |
| Toxicological databases | NIH, NLM: Household Products Database | 1,601 | Exposome: daily |
| Toxicological databases | International Labor Organization (ILO): International Chemical Safety Cards (ICSC) | 1,311 | Exposome: occupational |
| Toxicological databases | New Jersey Right to Know: Hazardous Substance List | 1,271 | Exposome: toxicants |
| Toxicological databases | California Office of Environmental Health Hazard Assessment | 1,013 | Exposome: toxicants |
| Toxicological databases | U.S. Centers for Disease Control and Prevention (CDC), National Institute for Occupational Safety and Health (NIOSH) | 828 | Exposome: occupational |
| Toxicological databases | California Proposition 65: Safe Drinking Water and Toxic Enforcement Act of 1986 | 787 | Exposome: toxicants |
| Toxicological databases | U.S. CDC, Agency for Toxic Substances and Disease Registry | 746 | Exposome: toxicants |
| Toxicological databases | WHO: International Agency for Research on Cancer (IARC) Monographs | 580 | Carcinogens |
| Toxicological databases | U.S. EPA: Integrated Risk Information System | 447 | Carcinogens |
| Toxicological databases | USDA: Pesticide Data Program | 340 | Exposome: toxicants |
| Toxicological databases | WHO: Joint Food and Agriculture Organization (FAO)/WHO Expert Committee on Food Additives | 259 | Food additives |
| BioAssay databases | NIH: Molecular Libraries and Imaging | 18,748 | Pharmaceuticals |
| BioAssay databases | NIH, National Cancer Institute (NCI): Developmental Therapeutics Program | 9,896 | Pharmaceuticals |
| BioAssay databases | NIH, National Institute of Allergy and Infectious Diseases (NIAID): screening program | 7,508 | Pharmaceuticals |
| BioAssay databases | NIH, National Center for Advancing Translational Sciences (NCATS): Chemical Genomics Center | 8,788 | Pharmaceuticals |
| BioAssay databases | NIH, Common Fund: Molecular Libraries and Imaging program | 5,152 | Pharmaceuticals |
| BioAssay databases | The Broad Institute of Massachusetts Institute of Technology (MIT) and Harvard | 5,154 | Pharmaceuticals |
Note: Descriptions and web addresses for these sources and databases are provided in Table S6. PubChem CIDs from each database and source were cross-referenced against the master list of PubChem CIDs in the Blood Exposome Database.
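The cross-referencing described in the note amounts to intersecting each source's set of PubChem CIDs with the master list and counting the matches. The following is a minimal sketch of that idea, not the authors' pipeline: the function name `coverage_counts`, the source labels, and the toy CID values are all hypothetical and chosen only for illustration.

```python
def coverage_counts(master_cids, source_cids):
    """Count, per source, how many of its CIDs also appear in the master list.

    master_cids: iterable of integer CIDs (the master list).
    source_cids: dict mapping a source name to an iterable of integer CIDs.
    """
    master = set(master_cids)
    return {name: len(master & set(cids)) for name, cids in source_cids.items()}


# Hypothetical toy identifiers, not real Blood Exposome Database content.
master_list = [1, 2, 3, 4, 5]
sources = {
    "source_a": [2, 3, 99],
    "source_b": [5],
    "source_c": [100, 101],
}

counts = coverage_counts(master_list, sources)
for name, n in sorted(counts.items(), key=lambda item: -item[1]):
    print(f"{name}: {n}")
```

Converting to sets first means duplicate CIDs within a source are counted once, which matches the idea of counting distinct structures per source.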
Figure 1. Overview schema for constructing the Blood Exposome Database. Three NCBI-hosted databases were used as inputs for the workflow that yielded 42,000 two-dimensional structures for blood specimens.
Figure 2. Overlap analysis of the origin of 41,474 achiral blood chemicals. PubChem-to-PubMed mapping provided the most comprehensive overview of the blood-related compounds.
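An overlap analysis of this kind can be thought of as splitting the union of the input sets into mutually exclusive regions, one per combination of sources, as in a Venn diagram. The sketch below illustrates that calculation under stated assumptions: the function `exclusive_overlaps`, the source labels, and the toy identifiers are hypothetical, and this is not presented as the method used to produce the figure.

```python
from itertools import combinations


def exclusive_overlaps(sets_by_name):
    """Map each combination of sources to the items found in exactly those sources."""
    names = sorted(sets_by_name)
    regions = {}
    for r in range(1, len(names) + 1):
        for combo in combinations(names, r):
            # Items present in every source of this combination...
            inside = set.intersection(*(sets_by_name[n] for n in combo))
            # ...minus items that also occur in any source outside it.
            others = [sets_by_name[n] for n in names if n not in combo]
            outside = set().union(*others)
            regions[combo] = inside - outside
    return regions


# Hypothetical toy identifiers standing in for chemical IDs from three inputs.
toy_sources = {
    "source_a": {1, 2, 3, 4},
    "source_b": {3, 4, 5},
    "source_c": {4, 6},
}

regions = exclusive_overlaps(toy_sources)
for combo, items in regions.items():
    print(" & ".join(combo), len(items))
```

Because the regions are mutually exclusive, their sizes sum to the size of the union, which gives a quick consistency check on the totals.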
Figure 3. Distribution of lipophilicity (A), molecular weight (B), and publication count (C) in the Blood Exposome Database. The y-axis shows the frequency of chemicals. Xlogp is a unitless measurement for lipophilicity, in which negative values indicate more polar compounds.