| Literature DB >> 33119583 |
András Szilágyi1,2,3, Viktor Péter Kovács1, Eörs Szathmáry1,2,3, Mauro Santos1,4.
Abstract
Chromosomes are likely to have assembled from unlinked genes in early evolution. Genetic linkage reduces the assortment load and intragenomic conflict in reproducing protocell models to the extent that chromosomes can go to fixation even if chromosomes suffer from a replicative disadvantage, relative to unlinked genes, proportional to their length. Here we numerically show that chromosomes spread within protocells even if recurrent deleterious mutations affecting replicating genes (as ribozymes) are considered. Dosage effect selects for optimal genomic composition within protocells that carries over to the genic composition of emerging chromosomes. Lacking an accurate segregation mechanism, protocells continue to benefit from the stochastic corrector principle (group selection of early replicators), but now at the chromosome level. A remarkable feature of this process is the appearance of multigene families (in optimal genic proportions) on chromosomes. An added benefit of chromosome formation is an increase in the selectively maintainable genome size (number of different genes), primarily due to the marked reduction of the assortment load. The establishment of chromosomes is under strong positive selection in protocells harboring unlinked genes. The error threshold of replication is raised to higher genome size by linkage due to the fact that deleterious mutations affecting protocells metabolism (hence fitness) show antagonistic (diminishing return) epistasis. This result strengthens the established benefit conferred by chromosomes on protocells allowing for the fixation of highly specific and efficient enzymes.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33119583 PMCID: PMC7665907 DOI: 10.1371/journal.pgen.1009155
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Number of different genes in chromosomes (and their ratio relative to the total) of different lengths, sum over the whole population.
Values are an average over 200,000 time steps starting at t = 2∙106. Parameters are: D = 3, S = 21, μ = 10−3, νlinkage = νbreak = νrecomb = 0.01.
| Number of genes in chromosomes | Frequency | Number of gene type A (ratio) | Number of gene type B (ratio) | Number of gene type C (ratio) |
|---|---|---|---|---|
| 1 | 0.0617 | 977 (29.96%) | 1010 (30.97%) | 1274 (39.07%) |
| 2 | 0.0796 | 1456 (34.63%) | 1510 (35.91%) | 1239 (29.46%) |
| 3 | ||||
| 4 | 0.0612 | 1113 (34.39%) | 991 (30.62%) | 1132 (34.98%) |
| 5 | 0.0415 | 798 (36.42%) | 764 (34.87%) | 629 (28.71%) |
| 6 | ||||
| 7 | 0.0057 | 92 (30.77%) | 109 (36.45%) | 98 (32.78%) |
| 8 | 0.0166 | 260 (29.65%) | 316 (36.03%) | 301 (34.32%) |
| 9 | ||||
| 10 | 0.0108 | 181 (31.75%) | 203 (35.61%) | 186 (32.63%) |
Fig 1Semi-log plot of the frequency of different templates with no chromosome breakage or recombination.
Frequencies are normalized on gene count (i.e., a chromosome with 3 genes counts as three when measuring the frequency), parameter values indicated at the top of the figure. Chromosomes consisting of 3∙n (n is positive integer) genes are plotted as thick lines.
Fig 2Frequency of different templates with chromosome break and recombination.
Frequencies are normalized on gene count, parameter values indicated at the top of the figure (standard parameter set). Chromosomes consisting of 3·n (n positive integer) genes are plotted as thick lines. (Chromosomes with a frequency less than 1% are not shown).
Fig 3The equilibrium fitness of the population as a function of the split size (S) at two different numbers of essential genes D = 3 and D = 5.
Average of 10 independent runs. Relevant parameters as in Fig 2 (μ = 10−3, νlinkage = νbreak = νrecomb = 0.01).
Fig 4Average number of genes in chromosomes as a function of gene number (D) and split size (S) with break and recombination.
Parameters are μ = 10−3νlinkage = νbreak = νrecomb = 0.01. The area enclosed in black lines shows the viable region without chromosomatization.