| Literature DB >> 33265816 |
Abstract
This article discusses how entropy/information methods are well-suited to analyzing and forecasting the four processes of innovation, transmission, movement, and adaptation, which are the common basis to ecology and evolution. Macroecologists study assemblages of differing species, whereas micro-evolutionary biologists study variants of heritable information within species, such as DNA and epigenetic modifications. These two different modes of variation are both driven by the same four basic processes, but approaches to these processes sometimes differ considerably. For example, macroecology often documents patterns without modeling underlying processes, with some notable exceptions. On the other hand, evolutionary biologists have a long history of deriving and testing mathematical genetic forecasts, previously focusing on entropies such as heterozygosity. Macroecology calls this Gini-Simpson, and has borrowed the genetic predictions, but sometimes this measure has shortcomings. Therefore it is important to note that predictive equations have now been derived for molecular diversity based on Shannon entropy and mutual information. As a result, we can now forecast all major types of entropy/information, creating a general predictive approach for the four basic processes in ecology and evolution. Additionally, the use of these methods will allow seamless integration with other studies such as the physical environment, and may even extend to assisting with evolutionary algorithms.Entities:
Keywords: Shannon; adaptation; artificial intelligence; diversity-profile; entropy; evolutionary algorithms; gene-expression; linkage-disequilibrium; mutual information; selection
Year: 2018 PMID: 33265816 PMCID: PMC7512290 DOI: 10.3390/e20100727
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Ecological and evolutionary information or entropy, for values q = 0, 1, 2. (a) Measurement and (b) forecasting from underlying processes. Full equations are found in the supplement of a previous review [5].
| Entropy | ECOLOGY: Variant Species in an Assemblage | EVOLUTION: Variant Molecules (Genes) within Species |
|---|---|---|
|
| ||
| Used, but has very wide confidence limits, even with modern corrections [ | ||
| The most common frequency-sensitive measure [ | Rarely used until recently [ | |
|
| Some use [ | The most common measure (Heterozygosity, Nucleotide diversity, STRUCTURE, AMOVA, |
|
| ||
| No forecasts from underlying processes; some from curve-fitting [ | Some forecasts, with underlying transmission and innovation only [ | |
|
| Forecasts are available to be transferred from Molecular Ecology [ | Forecasting ability now close to matching that for |
|
| Some forecasts transferred from Molecular Ecology, but only with underlying transmission and innovation, no adaptation [ | Extensive ability to forecast under a wide range of conditions for all underlying processes: Innovation, Transmission, Movement, and Adaptation. Forecasts are often based on gas diffusion theory, e.g., Fokker–Planck Equation (see summaries in textbooks [ |
Figure 1Confidence limits for values for two hypothetical localities, one locality shown as a pair of solid lines, the other locality shown as a pair of dashed lines (the mean curves would be between the two confidence limits, but are omitted for clarity). The circled areas in each of the three panels show cases where discrimination between the assemblages of species or genes at the two localities is more clearly identified by (a) q = 0, (b) q = 1, or (c) q = 2, respectively.
Types of forecasts available for q = 1 (Shannon) entropy/information, showing how they can be used for the common processes: Innovation, Transmission, Movement, and Adaptation. Although much of this modeling has been done for molecular variants, it has often been, or could be, applied to variant species in ecological assemblages, as described in Vellend (2016) [1] and text of Section 3. For forecasts with other values of q, see Table 1b.
| UNDERLYING PROCESSES | Space and Time Scales | |||
|---|---|---|---|---|
| α Within-Locality | β between-Locality | |||
| Finite Size, at Equilibrium | Dynamic: Non-Equilibrium | Finite Size, at Equilibrium | Dynamic: Non-Equilibrium | |
|
| Innovation mechanisms—SNP, IAM and SMM—are defined and described further in the text, including the relationships between forecasts for molecules within one species (described in this table) and forecasts for species in assemblages | |||
| SNP [ | SNP, IAM, SMM [ | SNP [ | SNP [ | |
| - | - | SNP [ | SNP [ | |
| [ | [ | Not Yet | Not Yet | |
| ‘Balancing’ selection that maintains more than one variant [ | ‘Directional’ selection that favors a single variant ([ | Not Yet | Not Yet | |
Processes common to all systems of evolution, and their likely timescales.
| System | Common Processes for Information | |||
|---|---|---|---|---|
| Innovation | Transmission | Adaptation | Movement | |
| Prebiotic (may be continuing slowly in current physical environment) | Many years? [ | Seconds, or longer, rate depends upon type of interactions [ | Speed would depend upon relative rates of innovation and competitive interactions [ | Probably occurs, at least involuntarily in currents, etc. |
| Biomolecules—acting individually | Seconds, or longer | Seconds, or longer, rate depends upon type of interactions [ | Seconds, or longer | Seconds, or longer |
| Biomolecules—as basis of biological evolution | Generations [ | Generations [ | Generations [ | Generations [ |
| Neural networks and Behavioral responses driven by neurons | Seconds | Seconds | Seconds (or longer with a group of individuals [ | Seconds, or longer |
| Species | Usually 1000’s of generations [ | Usually 1000’s of generations [ | Usually 1000’s of generations [ | Usually 1000’s of generations [ |
| Algorithms and machines | Seconds to Hours [ | Seconds to Hours [ | Seconds to Hours [ | Seconds to Hours e.g., Self-driving cars, Mars rovers, Computer viruses |
Figure 2Similarities of DNA and Quantum Computing. In the DNA in the upper panel, if association between individual SNPs is random (‘linkage equilibrium’), then the proportion of a particular DNA sequence (‘haplotype’) is the product of the proportions at each SNP in the population, over m nucleotide positions. Similarly, for the parallel quantum ‘qbits’ in the lower panel, each will have a probability of being zero or 1, depending upon the input of energy to that part of the quantum computer (which affects the complex amplitude, whose square is the probability). Like the DNA sequence, the expected outcome in a quantum computer would be characterized by the product of the m probabilities, P.