| Literature DB >> 34178042 |
Serena Dato1, Paolina Crocco1, Nicola Rambaldi Migliore2, Francesco Lescai2.
Abstract
BACKGROUND: Aging is a complex phenotype influenced by a combination of genetic and environmental factors. Although many studies addressed its cellular and physiological age-related changes, the molecular causes of aging remain undetermined. Considering the biological complexity and heterogeneity of the aging process, it is now clear that full understanding of mechanisms underlying aging can only be achieved through the integration of different data types and sources, and with new computational methods capable to achieve such integration. RECENT ADVANCES: In this review, we show that an omics vision of the age-dependent changes occurring as the individual ages can provide researchers with new opportunities to understand the mechanisms of aging. Combining results from single-cell analysis with systems biology tools would allow building interaction networks and investigate how these networks are perturbed during aging and disease. The development of high-throughput technologies such as next-generation sequencing, proteomics, metabolomics, able to investigate different biological markers and to monitor them simultaneously during the aging process with high accuracy and specificity, represents a unique opportunity offered to biogerontologists today. CRITICAL ISSUES: Although the capacity to produce big data drastically increased over the years, integration, interpretation and sharing of high-throughput data remain major challenges. In this paper we present a survey of the emerging omics approaches in aging research and provide a large collection of datasets and databases as a useful resource for the scientific community to identify causes of aging. We discuss their peculiarities, emphasizing the need for the development of methods focused on the integration of different data types. FUTURE DIRECTIONS: We critically review the contribution of bioinformatics into the omics of aging research, and we propose a few recommendations to boost collaborations and produce new insights. We believe that significant advancements can be achieved by following major developments in bioinformatics, investing in diversity, data sharing and community-driven portable bioinformatics methods. We also argue in favor of more engagement and participation, and we highlight the benefits of new collaborations along these lines. This review aims at being a useful resource for many researchers in the field, and a call for new partnerships in aging research.Entities:
Keywords: aging; bioinformatics; databases; metabolomics; proteomics; regulation; systems biology; translational genomics
Year: 2021 PMID: 34178042 PMCID: PMC8225294 DOI: 10.3389/fgene.2021.689824
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Relevant cohort studies in aging research.
| Study/Cohort | Aim of the research | Main publication | Population (N samples*) | Start date |
| The Long Life Family Study | Family based, longitudinal study of healthy aging and longevity launched in 2005 aimed at the identification of markers in blood able to predict survival, better physical function, disease-free aging, dementia, and cardiovascular disease. | Caucasian and American of European ancestry (535 families, 1.499 individuals in the Proband Generation, 2.594 individuals in the Offspring Generation and 830 Spouse Controls) | Started in 2005 | |
| The Healthy Ageing in Neighborhoods of Diversity across the Life Span (HANDLS) study | Prospective study, aimed at investigating the influence of age, race, socioeconomic status on health and on major age-related diseases or conditions (Diabetes, Cerebrovascular Disease, Age-Associated Decline, Cardiovascular Disease) | African American and whites in Baltimore 3720 participants | Started in 2009 | |
| The NIA’s Baltimore Longitudinal Study of Aging | Comprehensive and longest running longitudinal examination of human aging in the world. The aim was the understanding about normal versus pathological aging as well as age-related diseases and conditions. | Different ethnicities** 3200 participants (1300 actively followed) | Start date: 1958 | |
| Framingham Heart Study | Long-term cardiovascular cohort study of residents of the city of Framingham, Massachusetts, aimed at understanding epidemiology of coronary heart disease (CHD) | Different ethnicities** Six groups of individuals: Original Cohort (5209), Offspring Cohort (5124), Third Generation Cohort (4095), New Offspring Spouse Cohort (103), Omni Generation 1 Cohort (506), and Omni Generation 2 Cohort (410). | Started in 1948 | |
| Lifestyle Interventions and Independence for Elders (LIFE) study | Study of the effect of physical activity in reducing the risk of major mobility disability. | Different ethnicities** 1635 participants | Started in 2010 | |
| Genetics of Healthy Aging (GEHA) | EU-funded program on the identification of genes involved in healthy aging and longevity. | Individuals of European ancestry from 10 recruitment areas all over the Europe 2535 90+ sibpairs (5319 non-agenarians) 2548 controls (50–75 years) | Started in 2004 | |
| IDEAL (Integrated research on Developmental determinants of Aging and Longevity) | EU-funded project on development, epigenetics and longevity. The project was focused on gathering insights into the role of early life conditions affecting late-life health, disease and aging. | Individuals of European ancestry from all over the Europe 8,000 long-lived individuals (≥85 years of age) | Started in 2011 | |
| Canadian Longitudinal Study on Aging (CLSA) | Large, national, long-term study aimed at understanding the impact of biological, medical, psycho-social, lifestyle and economic factors on the development of disease and disability as people age. | Canadians 50,000 individuals aged between 45 and 85 years followed up for 20 year at least | Started in 2003 | |
| Nagahama Prospective Cohort for Comprehensive Human Bioscience (the Nagahama Study) | Longitudinal cohort study of the residents in Nagahama City (south-central Japan) | Japanese 9,764 participants at baseline aged between 34 and 80 years | Started in 2013 |
FIGURE 1Omics aging databases. General overview of the main aging research databases described in the review: we have annotated each database with the omics data type it provides. G stands for genomics; T for transcriptomics; P for proteomics; M for metabolomics; E for epigenomics; Ph for pharmacogenomics.
Aging research databases.
| Database | Brief description | Data type and size | Omics data | Project status | References | Links |
| AgeFactDB | Database for the collection and integration of age-related data | 16,599 aging factors (16,450 genes, 91 compounds, 58 others) and 9,611 observations (8,159 aging phenotypes, 1,452 homology analyses) | Genomics | Stopped | ||
| MINDMAP | Integrated database for research in aging, mental wellness, and urban environment. It integrates 10 longitudinal cohort studies across cities in Europe, the US, and Canada to investigate mental wellbeing in older age as well as age-related factors and phenotypes | Aging factors, aging-related phenotypes, and aging-related risk factors in 2,664,115 participants | Genomics, Epigenomics, Transcriptomics, Proteomics, Metabolomics | Ongoing | ||
| GiSAO.db | Database of genes involved in age-related biological processes. It also contains orthologs between | Data of genes involved in senescence, apoptosis, and oxidative stress (gene expression data, annotation data, experimental data, ortholog data) for a total of 338 between experiments and arrays performed on all species involved | Genomics, Transcriptomics | Stopped | ||
| NeuroMuscleDB | Database of muscle-related genes at different stages of development and aging | Information of about 1,102 genes, 6,030 mRNAs, and 5,687 proteins that participate in muscle development in | Genomics, Transcriptomics, Proteomics | Ongoing | ||
| MetaboAgeDB | Database of human aging-related metabolites | 408 annotated aging-related metabolites and more than 1,515 aging-related variations occurring in healthy individuals | Metabolomics | Ongoing | ||
| Human Aging Genomic Resources (HAGR) | Collection of databases and tools for studying the genetics of aging | It integrates 10 between databases and tools related to genomics, 1 related to drugs, 1 to animal longevity, and 1 to aging changes | Genomics, Epigenomics, Transcriptomics, Proteomics, Pharmacogenomics | Ongoing | ||
| HAGR - GenAge | Database of genes related to aging in model organisms and in humans | 2,202 genes related to longevity and/or aging in model organisms, and 307 aging-related genes in humans (both directly related to aging in humans and the best candidates from model organisms) | Genomics | Ongoing | {Tacutu:2018fua} | |
| HAGR – GenDR | Database of genes associated with dietary restriction (DR) in model organisms and in mammals | 214 DR-associated genes in model organisms and 173 differentially expressed genes due to DR in mammals | Genomics, Transcriptomics | Ongoing | ||
| HAGR – LongevityMap | Database of genes, genetic variants, and loci associated with longevity in humans | 550 entries (275 reported as significant findings), 884 genes, and 3,144 variants from a total of 270 large and small-scale association studies on longevity in humans | Genomics | Ongoing | ||
| HAGR – CellAge | Database of genes associated with cell senescence | 1,259 cell senescence-associated gene expression changes from 279 gene manipulation experiments for 164 distinct cell lines and 3 distinct senescence types | Transcriptomics | Ongoing | ||
| HAGR – Aging-related Disease Genes | Dataset of genes involved in age-related diseases | 769 aging-related disease genes | Genomics | Stopped | ||
| HAGR – DrugAge | Database of drugs and compounds associated to extended longevity in model organisms | 567 distinct compounds across 1,823 lifespan (increasing or decreasing) assays on 30 unique species | Pharmacogenomics | Ongoing | ||
| HAGR – AnAge | Database of longevity records in animals | 4,244 entries (4,219 species and 25 taxa) with 3,275 longevity records, and 1,981 aging process observations. Life history traits for 3,275 species and metabolism data for 707 species | Genomics | Ongoing | ( | |
| HAGR - Digital Ageing Atlas | Database consisting in a collection of human age-related data covering different biological levels. It also contains data on | 3,784 molecular changes (3,071 in humans, 713 in mice), 343 physiological changes, 17 psychological changes, and 95 pathological changes in humans. A total of 2,599 genes involved for humans and 675 for mice | Genomics, Trancriptomics, Proteomics, Metabolomics | Ongoing | ||
| Aging Atlas | Database of age-related changes and pathologies in humans and model organisms | 3,274 aging-related human and mouse genes. RNA-seq data of genome-wide transcriptomic changes related to aging (more than 18,000 differentially expressed genes potentially related to aging). Single-cell RNA-seq data from 14 types of aged tissues from rats, monkeys, and humans. ChiP-seq data of specific aging-related loci regulated by histone modifications and transcription factors. Protein–protein interaction data related to aging. Compounds related to aging | Genomics, Epigenomics, Transcriptomics, Proteomics, Pharmacogenomics | Ongoing | ||
| Japanese Multi Omics Reference Panel (jMorp) | Database of metabolome and proteome data in plasma obtained from volunteers in Tohoku Medical Megabank Organization. It also integrates other multi-omics data collected from volunteers mainly from Japan | A Japanese reference genome and genomic data from 8,380 Japanese individuals. Cell-type specific transcriptomes based on 100 Japanese individuals. Peptides of 256 abundant proteins in 501 volunteers. 45 metabolites detected in 25,783 individuals | Genomics, Transcriptomics, Proteomics, Metabolomics | Ongoing |
FIGURE 2Data integration in aging research. A schematic representation of the process of data integration from public databases and other sources in aging and age-related diseases. The main data sources are represented in the “input” panel, and we represent the key methods described in the manuscript under “data extraction and integration.” We have represented the expected answers in the output, in terms of risk profiles and predictive tools for population stratification and prevention.
Suggested tutorials about bioinformatics methods and tools for omics analyses.
| Topic | Description | Links |
| Tucker Decomposition | This is an interesting blog curated by a company named “Integrated Knowledge Solutions.” We found this 3-parts tutorial well explained and accessible, and therefore a good starting point for those wishing to approach tensors and Tucker decomposition using R | Part 1: |
| Tucker Decomposition | R-bloggers is a famous blog for those who use R, where people contribute with their expertise and release tutorials on different topics. Here, Alexej Gossmann nicely explains the basic concepts of tensors, and how to perform Tucker decomposition | Understanding tensors: |
| Support Vector Machines | scikit-learn is a reknown python framework to carry out machine learning, with accessible and reusable tools built of famous libraries. It is entirely open source, and also has a series of user guides and tutorials on different topics, SVMs among others. | |
| Nextflow | The best way to approach Nextflow is to look at the extensive material produced or cataloged by the nf-core community. On their website they list a large number of resources, and also have a series of short “bytesize” webinars covering all the basics. | Tutorials page: |
| Deep Learning | Tensorflow is an open source library for computation, used for DL applications because well equipped to handle multidimensional data arrays (i.e., tensors), and exploit different computing architectures (particularly, GPUs). Its website has a series of very useful tutorials, which we suggest as a starting point to approach this environment. | |
| Deep Learning | If you use R, you will find particularly useful the resource that RStudio has put together on the topic: a large number of tutorials is available for both beginners and advanced R users | |
| Deep Learning | If you prefer a more in depth overview, with conceptual information, Manning Publications offer a video course freely available, which covers a wide range of topics for DL with R |