| Literature DB >> 28402423 |
Wandrille Duchemin1,2, Yoann Anselmetti2,3, Murray Patterson2,4, Yann Ponty5,6, Sèverine Bérard3,7, Cedric Chauve8, Celine Scornavacca3, Vincent Daubin2, Eric Tannier1,2.
Abstract
Availability: http://pbil.univ-lyon1.fr/software/DeCoSTAR (Last accessed April 24, 2017).Entities:
Keywords: evolution; gene fusion/fission; gene order; protein domain; rearrangements; reconciliation; software
Mesh:
Year: 2017 PMID: 28402423 PMCID: PMC5441342 DOI: 10.1093/gbe/evx069
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FA species tree (light blue), three reconciled gene trees (black, red, and green) (losses are orange squares; duplications are green squares) and a set of extant and ancestral adjacencies linking genes (white). (A) An adjacency is inherited by both sister species after a speciation occurs. (B) An adjacency between the red and green gene is transferred, and so are both extremities of this adjacency. (C) The red gene undergoes gene loss and thus both adjacencies it was a part of disappear. (D) The adjacency between the red and black genes disappears due to an adjacency break on the branch leading to the leaf. (E) An adjacency is gained between the black gene and the newly acquired red gene.
Description of the Propagation Rules under Different Situations
Description of the Score between Leaves
note.—The two parameters Fadj and SPI used in the scaffolding modes, respectively, account for the position of the genes in their contig and the repartition of poorly assembled genomes in the species tree; both parameters are described with more details in the supplementary material, Supplementary Material online. The scaffolding mode and score given option can be used simultaneously as they affect a different set of adjacencies: respectively, the adjacencies absent from the input and the adjacencies given in the input.
FDensity of the distribution of the degree of all genes inferred by DeCoSTAR on the 18 Anopheles data set. The degree of an extant or ancestral gene is the sum of the weights of all adjacencies containing this gene. For extant genomes in the input (black line), this value can only be 0, 1, or 2. For genomes in the output, extant (red line) or ancestral (green line), all values are possible because adjacencies have scores between 0 and 1, and a gene can belong to an arbitrary number of adjacencies. The difference between the black and red lines are due to the scaffolding: genes with 0 or 1 neighbor are linked to other genes as an output of DeCoSTAR. In ancestral genomes, some genes have degree three or slightly more.
FA schematic representation of the results obtained for the fusion–fission data set, following the schema described in figure 1, and with adjacency gain marked by an upper G. Family A and B are represented as reconciled gene trees, respectively, in red and green. The presence of an adjacency denotes the fusion of A and B to form the family C.