| Literature DB >> 25628861 |
Birgit C Schlick-Steiner1, Wolfgang Arthofer1, Karl Moder2, Florian M Steiner1.
Abstract
Today, the comparative analysis of DNA molecules mainly uses information inferred from nucleotide substitutions. Insertion/deletion (INDEL) mutations, in contrast, are largely considered uninformative and discarded, due to our lacking knowledge on their evolution. However, including rather than discarding INDELs would be relevant to any research area in ecology and evolution that uses molecular data. As a practical approach to better understanding INDEL evolution in general, we propose the study of recent INDEL (reINDEL) mutations - mutations where both ancestral and derived state are seen in the sample. The precondition for reINDEL identification is knowledge about the pedigree of the individuals sampled. Sound reINDEL knowledge will allow the improved modeling needed for including INDELs in the downstream analysis of molecular data. Both microsatellites, currently still the predominant marker system in the analysis of populations, and sequences generated by next-generation sequencing, a promising and rapidly developing range of technologies, offer the opportunity for reINDEL identification. However, a 2013 sample of animal microsatellite studies contained unexpectedly few reINDELs identified. As most likely explanation, we hypothesize that reINDELs are underreported rather than absent and that this underreporting stems from common reINDEL unawareness. If our hypothesis applies, increased reINDEL awareness should allow gathering data rapidly. We recommend the routine reporting of either the absence or presence of reINDELs together with standardized key information on the nature of mutations when they are detected and the use of the keyword "reINDEL" to increase visibility in both instances of successful and unsuccessful search.Entities:
Keywords: INDEL; microsatellite; models of evolution; molecular marker; next-generation sequencing; pedigree
Year: 2014 PMID: 25628861 PMCID: PMC4298431 DOI: 10.1002/ece3.1330
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Eusocial insects are among study systems that facilitate generating large population-genetic or population-genomic data sets for individuals with known pedigree and that are thus ideal for studying reINDELs. Here, one of the 1,000–10,000 workers (Steiner et al. 2004) of a colony of the ant Lasius austriacus carries a worker sister at the pupal stage, all these workers stemming from the same mother and father (Steiner et al. 2007). Photograph copyright B.C. Schlick-Steiner & F.M. Steiner.
Characteristics (as of October 2014) of microsatellites versus two selected next-generation sequencing (NGS) approaches in studying populations, RADseq (Baird et al. 2008) and whole-genome resequencing (Huang et al. 2009). All sequencing information is based on the assumption that Illumina (http://www.illumina.com/) technology is used, except the sequencing for developing microsatellite loci, for which the use of Roche 454 (http://www.454.com/) technology is assumed. Where feasible, we classified using a five-step scale of very low, low, intermediate, high, and very high, for each characteristic calibrated relatively across the three techniques. Secondary bioinformatics: sequence alignment and variant calling; tertiary bioinformatics: further downstream steps of sequence annotation and interpretation (Wright et al. 2011; primary bioinformatics, that is, base calling, usually is performed by the software of the sequencing machine)
| NGS-based population genomics | |||
|---|---|---|---|
| Characteristics | Microsatellite-based population genetics | RADseq | Whole-genome resequencing |
| Establishment of resources | |||
| Type of establishment required | Development of loci | n/a | De-novo whole-genome sequencing |
| DNA quality / amount needed | High / ca. hundred ng | Very high / several | |
| Problems due to DNA contaminants | Intermediate | Very high | |
| Sequencing effort | 0.01–0.05× coverage | 30–100× coverage | |
| Need of secondary / tertiary bioinformatics expertise | Intermediate / low | Very high / very high | |
| Total time / expenses | Few weeks / intermediate | Few months to year / high to very high | |
| Application | |||
| DNA quality / amount needed | Very low / several ng per locus | High / 250 ng | High / several hundred ng |
| Problems due to DNA contaminants | Very low | Very high | Very high |
| Need of secondary / tertiary bioinformatics expertise | Low / depends on research question | High / depends on research question | High / depends on research question |
| Total time / expenses per individual | Few days per locus / very low | Weeks / intermediate | Weeks / very high |
| Numbers of individuals that can be analyzed | Very high | Intermediate to high | Low |
| Information accessed | |||
| Genome coverage | Very low | Low | Very high |
| Evenness of distribution of loci over genome | Unpredictable | High | Very high |
| Number of research questions that can be tackled | Intermediate | High | Very high |
| Appropriateness for basic / complex research questions | Very high / low | Intermediate / intermediate | Very low / very high |
Figure B1The annual numbers of publications on microsatellites and population genomics published 1995–2013 included in Web of Science. The results of regression analyses comparing the R² of linear (lin) and exponential (exp) functions are added. See Appendix 1 for the database query protocols used and for the regression analysis results. Years are given as relative years as used in the regression analyses: 2 = 1995, 20 = 2013.
| Linear regressions | Exponential regressions | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Research approach | Intercept | Slope | d | a | b | ||||
| Microsatellites | 736.2 | 216.9 | 0.97 | <.0001 | −30593.9 | 31294.8 | 0.006 | 0.92 | <.0001 |
| Population genomics | −27.9 | 4.6 | 0.67 | <.0001 | −1.0 | 0.4 | 0.279 | 0.97 | <.0001 |
| Ref | Incl | Scor | Class | Loc? | Pos? | Typ? | Dir? | Siz? | Sex? | No? | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Aquaculture 388:14–23 | 0 | ||||||||||||
| Aquaculture 400:77–84 | 0 | ||||||||||||
| Aquaculture 404:139–149 | 0 | ||||||||||||
| Aquaculture 404:95–104 | 0 | ||||||||||||
| Behav Ecol. 24:1022–1029 | 0 | ||||||||||||
| Behav Ecol. 24:1128–1137 | 0 | ||||||||||||
| Behav Ecol. 24:1306–1311 | 0 | ||||||||||||
| Behav Ecol. 24:1356–1362 | 1 | no | Aves | 2320 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Behav Ecol. 24:29–38 | 0 | ||||||||||||
| Behav Ecol. 24:540–546 | 1 | aut | Actinopterygii | 572 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Behav Ecol. 24:949–954 | 0 | ||||||||||||
| Behav Ecol Sociobiol. 67:113–122 | 0 | ||||||||||||
| Behav Ecol Sociobiol. 67:243–255 | 0 | ||||||||||||
| Behav Ecol Sociobiol. 67:399–408 | 0 | ||||||||||||
| Behav Ecol Sociobiol. 67:621–627 | 1 | aut + man | Insecta | 9116 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Behav Ecol Sociobiol. 67:727–735 | 0 | ||||||||||||
| Biol Invasions 15:1331–1342 | 0 | ||||||||||||
| Biol Invasions 15:199–212 | 0 | ||||||||||||
| Biol Invasions 15:2281–2297 | 0 | ||||||||||||
| Conserv Genet. 14:1019–1028 | 0 | ||||||||||||
| Conserv Genet. 14:1029–1042 | 1 | man | Reptilia | 6110 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Conserv Genet. 14:1099–1110 | 0 | ||||||||||||
| Conserv Genet. 14:171–183 | 0 | ||||||||||||
| Conserv Genet. 14:21–30 | 0 | ||||||||||||
| Conserv Genet. 14:559–571 | 0 | ||||||||||||
| Conserv Genet. 14:601–613 | 0 | ||||||||||||
| Conserv Genet. 14:625–636 | 0 | ||||||||||||
| Conserv Genet. 14:65–77 | 0 | ||||||||||||
| Conserv Genet. 14:875–883 | 0 | ||||||||||||
| Conserv Genet. 14:953–962 | 0 | ||||||||||||
| Conserv Genet Res. 5:181–183 | 0 | ||||||||||||
| Conserv Genet Res. 5:199–201 | 0 | ||||||||||||
| Conserv Genet Res. 5:507–510 | 0 | ||||||||||||
| Conserv Genet Res. 5:555–560 | 0 | ||||||||||||
| Conserv Genet Res. 5:667–669 | 0 | ||||||||||||
| Conserv Genet Res. 5:749–753 | 0 | ||||||||||||
| Conserv Genet Res. 5:863–866 | 0 | ||||||||||||
| Ecol Evol. 3:1569–1579 | 0 | ||||||||||||
| Ecol Evol. 3:1765–1779 | 0 | ||||||||||||
| Ecol Evol. 3:2933–2946 | 1 | aut | Amphibia | 93,140 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Ecol Evol. 3:3152–3165 | 0 | ||||||||||||
| Ecol Evol. 3:3379–3387 | 1 | no | Anthozoa | 1140 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Ecol Evol. 3:474–481 | 0 | ||||||||||||
| Ecol Evol. 3:694–705 | 1 | aut | Aves | 32,788 | 17 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | n/a |
| Ecol Evol. 3:80–88 | 0 | ||||||||||||
| Evolution 67:1169–1180 | 0 | ||||||||||||
| Evolution 67:2299–2308 | 0 | ||||||||||||
| Evolution 67:2561–2576 | 0 | ||||||||||||
| Evolution 67:2701–2713 | 0 | ||||||||||||
| Evolution 67:646–660 | 0 | ||||||||||||
| Evol Appl. 6:165–179 | 0 | ||||||||||||
| Evol Appl. 6:34–45 | 0 | ||||||||||||
| Evol Appl. 6:524–534 | 0 | ||||||||||||
| Heredity 110:111–122 | 0 | ||||||||||||
| Heredity 110:355–362 | 0 | ||||||||||||
| Heredity 110:439–448 | 0 | ||||||||||||
| Heredity 110:560–569 | 0 | ||||||||||||
| Heredity 111:321–329 | 0 | ||||||||||||
| Heredity 111:338–344 | 0 | ||||||||||||
| Hydrobiologia 700:33–45 | 0 | ||||||||||||
| Hydrobiologia 714:61–70 | 0 | ||||||||||||
| Hydrobiologia 715:113–123 | 0 | ||||||||||||
| Hydrobiologia 715:37–50 | 0 | ||||||||||||
| Insect Soc. 60:135–145 | 0 | ||||||||||||
| Insect Soc. 60:203–211 | 1 | no | Insecta | 1260 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Insect Soc. 60:231–241 | 1 | aut | Insecta | 720 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Insect Soc. 60:337–344 | 0 | ||||||||||||
| J Evol Biol. 26:108–117 | 0 | ||||||||||||
| J Evol Biol. 26:1330–1340 | 0 | ||||||||||||
| J Evol Biol. 26:1727–1737 | 0 | ||||||||||||
| J Evol Biol. 26:889–899 | 0 | ||||||||||||
| J Hered. 104:127–133 | 0 | ||||||||||||
| J Hered. 104:182–191 | 1 | no | Mammalia | 7776 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| J Hered. 104:217–222 | 1 | aut | Actinopterygii | 264 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 1 |
| J Hered. 104:301–311 | 0 | ||||||||||||
| J Hered. 104:371–379 | 1 | no | Chondrichthyes | 2400 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| J Hered. 104:386–398 | 0 | ||||||||||||
| J Hered. 104:465–475 | 0 | ||||||||||||
| J Hered. 104:532–546 | 0 | ||||||||||||
| J Hered. 104:692–703 | 0 | ||||||||||||
| J Hered. 104:92–104 | 0 | ||||||||||||
| Mol Ecol. 22:1158–1170 | 0 | ||||||||||||
| Mol Ecol. 22:1282–1294 | 0 | ||||||||||||
| Mol Ecol. 22:1447–1462 | 1 | no | Insecta | 46,560 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Mol Ecol. 22:1546–1557 | 0 | ||||||||||||
| Mol Ecol. 22:1640–1649 | 0 | ||||||||||||
| Mol Ecol. 22:1998–2010 | 1 | aut + man | Insecta | 1040 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Mol Ecol. 22:2787–2796 | 1 | aut + man | Mammalia | 816 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Mol Ecol. 22:3391–3402 | 0 | ||||||||||||
| Mol Ecol. 22:3721–3736 | 0 | ||||||||||||
| Mol Ecol. 22:3916–3932 | 0 | ||||||||||||
| Mol Ecol. 22:4499–4515 | 0 | ||||||||||||
| Mol Ecol. 22:4549–4561 | 0 | ||||||||||||
| Mol Ecol. 22:5001–5015 | 0 | ||||||||||||
| Mol Ecol. 22:5027–5039 | 1 | aut | Aves | 11,426 | 0 | 0 | n/a | n/a | n/a | n/a | n/a | n/a | 0 |
| Mol Ecol. 22:5430–5440 | 0 | ||||||||||||
| Mol Ecol. 22:74–86 | 0 | ||||||||||||
| Total | 16 | 217,448 | 17 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |