| Literature DB >> 25593593 |
Timothy D Read1, Ruth C Massey2.
Abstract
Genome-wide association studies (GWASs) have become an increasingly important approach for eukaryotic geneticists, facilitating the identification of hundreds of genetic polymorphisms that are responsible for inherited diseases. Despite the relative simplicity of bacterial genomes, the application of GWASs to identify polymorphisms responsible for important bacterial phenotypes has only recently been made possible through advances in genome sequencing technologies. Bacterial GWASs are now about to come of age thanks to the availability of massive datasets, and because of the potential to bridge genomics and traditional genetic approaches that is provided by improving validation strategies. A small number of pioneering GWASs in bacteria have been published in the past 2 years, examining from 75 to more than 3,000 strains. The experimental designs have been diverse, taking advantage of different processes in bacteria for generating variation. Analysis of data from bacterial GWASs can, to some extent, be performed using software developed for eukaryotic systems, but there are important differences in genome evolution that must be considered. The greatest experimental advantage of bacterial GWASs is the potential to perform downstream validation of causality and dissection of mechanism. We review the recent advances and remaining challenges in this field and propose strategies to improve the validation of bacterial GWASs.Entities:
Year: 2014 PMID: 25593593 PMCID: PMC4295408 DOI: 10.1186/s13073-014-0109-z
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Figure 1Growth in the cumulative number of next-generation sequencing runs in public databases for three important bacterial pathogens, , and . The data were acquired by querying the National Center for Biotechnology Information Short Read Archive database and excluding datasets linked to RNA-seq experiments.
Early bacterial genome-wide association studies based on whole-genome shotgun data
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| |||||
|
| 192 | Host adaptation | Vitamin B5 biosynthesis is important | 30 bp ‘word’ searching [ | [ |
|
| 123 | Antibiotic resistance | 39 novel resistance-associated loci | PhyC [ | [ |
|
| 75 | Antibiotic resistance | Novel associated single nucleotide polymorphism in | ROADTRIPS [ | [ |
|
| 90 | Virulence | 121 novel associated loci | PLINK [ | [ |
|
| 3,701 | Antibiotic resistance | Multiple novel associated loci | PLINK [ | [ |
Similarities and differences between bacterial and eukaryotic genome-wide association study approaches
|
|
|
|
|---|---|---|
| Ploidy | Haploid | Diploid |
| Genetic re-assortment | Infrequent short gene conversion and horizontal gene transfer events | Homologous recombination and chromosome segregation linked to reproduction |
| Accessory (non-core) genes | Variable numbers in different species | Rare |
| Linkage disequilibrium | Variable across the genome and between species | Variable across the genome |
| Population structure | Asexual, generally highly structured, except for relatively rare homologous recombination events | Sexual, variable allele frequencies in subpopulations owing to non-random mating, ancestral divergence, drift |
| Confounders in genome-wide association studies | Population structure | Population structure |
| How to move from association to causality | Genetic reconstruction of mutations in laboratory strains, transposon mutant screens | Forward genetics in animal models or cultured tissue systems; linkage to known genetic diseases; large monogenic association studies |
| Current burden of proof for causality | Molecular Koch’s Postulates | Combined genetic and experimental evidence |
Figure 2Natural mechanisms for introducing homoplasious mutations into the genomes of bacterial populations. Homoplasious mutations are necessary for association studies. The figure represents three mechanisms for forming an identical genetic variant (red star) on different lineages of a hypothetical phylogenetic tree of bacterial strains: (a) horizontal gene transfer, (b) recombination and (c) recurrent mutation.
Figure 3Schematic representation of genomics/genome-wide association studies and traditional microbial genetics strategies. The top-down approach [6] (genomics/genome-wide association studies (GWASs) typically begins with a pool of sequenced genomes and attempts to discover genes that are associated with a particular phenotype. Bottom-up approaches (molecular genetics) perform experiments to prove or disprove hypotheses about the function of particular genes or regions of the genome. These approaches can be integrated when knowledge gained from molecular genetics is used to validate unbiased GWASs- and genomics-based experiments.