| Literature DB >> 29764366 |
Yoann Anselmetti1,2, Wandrille Duchemin2,3, Eric Tannier2,3, Cedric Chauve4, Sèverine Bérard5.
Abstract
BACKGROUND: Genomes rearrangements carry valuable information for phylogenetic inference or the elucidation of molecular mechanisms of adaptation. However, the detection of genome rearrangements is often hampered by current deficiencies in data and methods: Genomes obtained from short sequence reads have generally very fragmented assemblies, and comparing multiple gene orders generally leads to computationally intractable algorithmic questions.Entities:
Keywords: Comparative genomics; Mosquito genomics; Scaffolding
Mesh:
Year: 2018 PMID: 29764366 PMCID: PMC5954271 DOI: 10.1186/s12864-018-4466-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Input and output of ADSEQ. (Left) Input data: (1) a species tree with extant genomes (A, B and C) containing observed adjacencies (black link) and scaffolding gene adjacencies with a prior score (blue link); each grey box represents a gene. (2) reconciled gene trees representing evolutionary histories of gene families annotated by evolutionary events. (Right) Typical output: gene order across ancestral and extant genomes including new extant gene adjacencies with a posterior score (green link) between genes located at fragments extremities in the initial genome assemblies
Fig. 2Precision and recall statistics for scaffolding adjacencies on three artificially fragmented genomes (A.alb: A. albimanus, A.ara: A. arabiensis and A.dir: A. dirus). Left graph: results with 50% of reads. Right graph: results with all reads. The different methods results are plotted with the precision on the Y axis and the recall on the X axis. For ADSEQ and AD, results for three different adjacency support thresholds (0.1, 0.5 and 0.8) before genome linearization are plotted and represented with a color gradient. Note: A True Positive (TP) adjacency requires the proper orientation of both genes
Fig. 3A. species trees (left: X phylogeny, right: WG phylogeny) with rearrangements per adjacency as branch lengths (× 10−3). The pie chart for a given species represents the adjacency degree of the genes of this species: orange represents genes with no adjacency, light blue genes with adjacency degree of 1 and green genes with adjacency degree of 2. Moreover, the diameter of each chart is proportional to the number of genes in the corresponding species
Fig. 4Left: Number of segments in extant and ancestral genomes, according to three runs of ADSEQ in three different conditions. In the first run, we turn off the scaffolding mode on the X phylogeny, that is, it only constructs ancestral segments. The first column “XNS ext” thus describes the initial assembly, and “XNS anc” the assembly of ancestral genomes when reconstructed without extant scaffolding. In the second and third runs, the scaffolding mode was turned on, and run with the X phylogeny (“X ext” and “X anc”) and the WG phylogeny (“WG ext” and “WG anc”). Right: Number of rearrangements over all branches of the X phylogeny, with and without the scaffolding mode
Numbers of inferred rearrangements and duplications in the X chromosome and in the autosomes, according to the phylogeny (X or WG) used as a parameter of ADSEQ
| Event | X phylogeny | WG phylogeny | ||
|---|---|---|---|---|
| X chr. | Autosomes | X chr. | Autosomes | |
| Duplications | 604 | 5857 | 606 | 5553 |
| Rearrangements | 415 | 2949 | 416 | 2760 |