| Literature DB >> 29081689 |
Alexandra J Mayhew1, David Meyre1,2.
Abstract
The goal of this review article is to provide a conceptual based summary of how heritability estimates for complex traits such as obesity are determined and to explore the future directions of research in the heritability field. The target audience are researchers who use heritability data rather than those conducting heritability studies. The article provides an introduction to key concepts critical to understanding heritability studies including: i) definitions of heritability: broad sense versus narrow sense heritability; ii) how data for heritability studies are collected: twin, adoption, family and population-based studies; and iii) analytical techniques: path analysis, structural equations and mixed or regressive models of complex segregation analysis. For each section, a discussion of how the different definitions and methodologies influence heritability estimates is provided. The general limitations of heritability studies are discussed including the issue of "missing heritability" in which heritability estimates are significantly higher than the variance explained by known genetic variants. Potential causes of missing heritability include restriction of many genetic association studies to single nucleotide polymorphisms, gene by gene interactions, epigenetics, and gene by environment interactions. Innovative strategies of accounting for missing heritability including modeling techniques and improved software are discussed.Entities:
Keywords: Complex traits; Family studies; Heritability; Missing heritability; Path analysis; Twin studies
Year: 2017 PMID: 29081689 PMCID: PMC5635617 DOI: 10.2174/1389202918666170307161450
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Comparison of statistical methods used to assess heritability estimates for human complex traits.
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|
| Monozygotic and dizygotic twins | Falconers formula is used to provide a broad sense heritability estimate. This formula assumes that the higher degree of correlation in MZ twins compared to DZ twins is attributable entirely to genetics. Falconers formula: | Conceptually simple | Assumes gene-environment correlations and interaction are minimal | |||
| Monozygotic twins raised separately | Adoption studies are used to estimate broad sense heritability. The formula assumes that all phenotypic correlation is due only to genetic factors as there is no shared environment and the unique environmental influences should be equal. | Considered the best estimate of heritability due to most number of underlying assumptions being true | Difficult to find a sufficient sample size | |||
| Family dyads with a known degree of relatedness | An estimate of narrow sense heritability is calculated by comparing the agreement of the phenotype in family pairs by the expected correlation based on genetic relatedness using the formula: | Compared to twin studies, easier to find a sufficient sample size | All relatives must have the same relationship | |||
| Twin or family data | Estimates narrow sense heritability by creating linear regression models and maximizing the goodness-of-fit function between observed and predicted covariance matrices. | Allows the use of multi-variable data so family members with different relationships can be included | Like classical family study designs, the assumptions of the underlying genetic relationships between individuals may not be true. | |||
| Data from unrelated individuals | Genome wide association studies look at the association using linear or logistic regression between individual single nucleotide polymorphisms and the genotype of interest in a large number of people. Narrow sense heritability can be calculated by summing the variance in the phenotype explained by each individual SNP. However, this method is not commonly used to estimate heritability due to the number of SNPs not yet identified for any given phenotype. | Can use unrelated individuals | To date, the sample size requirements to detect gene by gene interactions and gene by environment interactions have been a limiting factor. | |||
| Data from unrelated individuals | Genome-wide complex trait analyses includes all SNPs simultaneously in a mixed linear regression model which provides a higher estimate of heritability compared to genome-wide association studies. | Includes more SNPs than genome-wide association studies and therefore provides estimates of heritability that are closer to those derived from classic twin and family designs. | Same limitations as for genome-wide association studies. | |||