| Literature DB >> 28228978 |
Lavanya Rishishwar1, Lu Wang2, Evan A Clayton3, Leonardo Mariño-Ramírez4, John F McDonald3, I King Jordan1.
Abstract
Recent technological developments-in genomics, bioinformatics and high-throughput experimental techniques-are providing opportunities to study ongoing human transposable element (TE) activity at an unprecedented level of detail. It is now possible to characterize genome-wide collections of TE insertion sites for multiple human individuals, within and between populations, and for a variety of tissue types. Comparison of TE insertion site profiles between individuals captures the germline activity of TEs and reveals insertion site variants that segregate as polymorphisms among human populations, whereas comparison among tissue types ascertains somatic TE activity that generates cellular heterogeneity. In this review, we provide an overview of these new technologies and explore their implications for population and clinical genetic studies of human TEs. We cover both recent published results on human TE insertion activity as well as the prospects for future TE studies related to human evolution and health.Entities:
Keywords: bioinformatics; disease; genetics; genomics; health; human; natural selection; polymorphisms; transposable elements; transposition
Year: 2017 PMID: 28228978 PMCID: PMC5305044 DOI: 10.1080/2159256X.2017.1280116
Source DB: PubMed Journal: Mob Genet Elements ISSN: 2159-2543
Large scale genome sequencing initiatives. Projects are sorted in descending order by the number of participants.
| Project Name | PMID | # Participants | Description |
|---|---|---|---|
| Million Veteran Program (MVP) | 26441289 | 1,000,000 | Planned sequencing of 1 million US. Veterans (genotyping, whole genome and exome); current enrollment at 500k |
| SHGP | 26583887 | 100,000 | Catalog of whole genome sequences of 100k Saudis |
| TOPMed | N/A | 62,000 | Sequencing of 62k individual genomes along with a variety of data for precision medicine initiative |
| UK10K | 26367797 | 10,000 | Sequencing of ∼10k individuals from UK to inspect the effect of rare and low-frequency variants to human traits |
| Human Longevity | Awaiting | 10,000 | Deep sequencing of 10k human genomes; Data donated to Precision FDA |
| Iceland Genome Project | 25807286 | 2,636 | Catalog of whole genome sequences of 2,636 Icelanders |
| 1000 Genomes Project | 26432245 | 2,504 | International whole genome project that sampled 2,504 healthy individuals from 26 populations |
| EGDP | 27654910 | 483 | Catalog of whole genome sequences of 483 genomes from 148 diverse population |
| SGDP | 27654912 | 300 | Catalog of whole genome sequences of 300 genomes from 142 diverse population |
| GoNL | 24974849 | 250 | Catalog of whole genome sequences of 250 Dutch parent-offspring families |
| Australian Aboriginals | 27654914 | 108 | Catalog of whole genome sequences of 108 Aboriginal Australians |
Computational approaches for genome-wide detection of TE insertions. Methods are sorted in order by their year of publication.
| Tool Name | PMID | Year | Comments |
|---|---|---|---|
| VariationHunter | 19447966 | 2009 | Originally developed for SV detection, later refined for TE calling |
| HYDRA-SV | 20308636 | 2010 | General purpose SV tool; reported on mouse genome |
| TE-Locate | 24832231 | 2012 | Reported on 1001 Arabidopsis genomes project |
| Tea | 22745252 | 2012 | Specialized TE caller for cancer WGS data |
| ngs_te_mapper | 22347367 | 2012 | Requires TSDs; reported for Drosophila melanogaster |
| RetroSeq | 23233656 | 2013 | Tested on 1KGP and mouse strains |
| ReloaTE | 23576519 | 2013 | Requires TSDs; designed for rice genomes |
| Mobster | 25348035 | 2014 | Tested on 1KGP; reliable predictor for Human genome |
| Tangram | 25228379 | 2014 | Used in Phase II of 1KGP; no longer maintained |
| TEMP | 24753423 | 2014 | Reported on 1KGP and Drosophila genomes |
| T-lex2 | 25510498 | 2014 | Reported on 1KGP and Drosophila genomes |
| TE-Tracker | 25408240 | 2014 | Reported on Arabidopsis genome and simulated human genome |
| TIGRA | 24307552 | 2014 | A breakpoint assembler and not a structural variant caller |
| TranspoSeq | 24823667 | 2014 | Specialized TE caller for cancer WGS data |
| TraFiC | 25082706 | 2014 | Specialized TE caller for cancer WGS data |
| MELT | 26432246 | 2015 | Used in Phase III of 1KGP; reported to work on Human, Chimp and dog. |
| ITIS | 25887332 | 2015 | Reported on |
| Jitterbug | 26459856 | 2015 | Reported on 1KGP and Arabidopsis genome |
| MetaSV | 25861968 | 2015 | General purpose SV tool; reported on simulated genome |
| DD_DETECTION | 26508759 | 2016 | Database free dispersed duplication detection approach |
| GRIPper | — | — | Detects non-reference gene copy insertion |
Figure 1.Schematic of the high-throughput bioinformatics (A) and experimental (B) approaches to human TE insertion discovery.
High-throughput experimental approaches for TE insertion detection. Next-generation sequence based methods are presented separately from methods that used tiling arrays or Sanger sequencing. Methods are sorted in descending order by their year of publication.
| Next-generation sequence based | Tiling arrays/Sanger based | ||||
|---|---|---|---|---|---|
| Method | PMID | Year | Method | PMID | Year |
| L1-Seq | 20488934 | 2010 | TIP-Chip | 20602999 | 2010 |
| Transposon-Seq | 20603005 | 2010 | Fosmid-based | 20602998 | 2010 |
| ME-Scan | 20591181 | 2010 | AIP | 22495107 | 2012 |
| RC-Seq | 22037309 | 2011 | |||