| Literature DB >> 29899658 |
Daniel S Carvalho1,2, James C Schnable1,2, Ana Maria R Almeida3.
Abstract
The study of gene family evolution has benefited from the use of phylogenetic tools, which can greatly inform studies of both relationships within gene families and functional divergence. Here, we propose the use of a network-based approach that in combination with phylogenetic methods can provide additional support for models of gene family evolution. We dissect the contributions of each method to the improved understanding of relationships and functions within the well-characterized family of AGAMOUS floral development genes. The results obtained with the two methods largely agreed with one another. In particular, we show how network approaches can provide improved interpretations of branches with low support in a conventional gene tree. The network approach used here may also better reflect known and suspected patterns of functional divergence relative to phylogenetic methods. Overall, we believe that the combined use of phylogenetic and network tools provide a more robust assessment of gene family evolution.Entities:
Keywords: functional divergence; gene families; network analysis; phylogenetics
Year: 2018 PMID: 29899658 PMCID: PMC5993073 DOI: 10.1177/1176934318764683
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
List of species and sequence identifiers used in this study.
| Gene name | Species name | Clade name | Accession or gene ID |
|---|---|---|---|
|
|
| Ginkgoaceae | GU563899 |
|
|
| Pinaceae | X79280.1 |
|
|
| Pinaceae | AF023615 |
|
|
| Taxaceae | JF519754 |
|
|
| Taxodiaceae | HM177453 |
|
|
| Aristolochiaceae | AY464101 |
|
|
| Chloranthaceae | AY464099 |
|
|
| Lauraceae | DQ398021 |
|
|
| Lauraceae | DQ398022 |
|
|
| Magnoliaceae | JQ326236 |
|
|
| Magnoliaceae | AB050653 |
|
|
| Magnoliaceae | JQ326219 |
|
|
| Arecaceae | AY739698 |
|
|
| Arecaceae | AY739699 |
|
|
| Arecaceae | XP_010912706.1 |
|
|
| Poaceae | Bradi2g06330.1 |
|
|
| Poaceae | Bradi4g40350.1 |
|
|
| Poaceae | Bradi2g25090.1 |
|
|
| Poaceae | L37528 |
|
|
| Poaceae | AF151693 |
|
|
| Poaceae | FJ750944 |
|
|
| Poaceae | AB232157 |
|
|
| Poaceae | Sb03g002525 |
|
|
| Poaceae | Sb08g006460 or Sobic.008G072900.1 |
|
|
| Poaceae | Sb09g006360 or Sobic.009G075500.3 |
|
|
| Poaceae | GRMZM2G160687-T03 |
|
|
| Poaceae | GRMZM2G359952-T01 |
|
|
| Poaceae | GRMZM2G052890-T01 |
|
|
| Triuridaceae | GQ214163 |
|
|
| Triuridaceae | GQ214164 |
|
|
| Orchidaceae | AIU94767.1 or KF914206.1 |
|
|
| Orchidaceae | AIU94768.1 |
|
|
| Hypoxidaceae | AIU94766.1 or KF914205.1 |
|
|
| Hypoxidaceae | AIU94771.1 |
|
|
| Asparagaceae | AB175825.1 |
|
|
| Asparagaceae | BAD18011.1 |
|
|
| Berberidaceae | AY464106 |
|
|
| Eupteleaceae | GU357452 |
|
|
| Eupteleaceae | GU357453 |
|
|
| Lardizabalaceae | AY464107 |
|
|
| Lardizabalaceae | AY627635 |
|
|
| Lardizabalaceae | AY627629 |
|
|
| Lardizabalaceae | JQ806406 |
|
|
| Lardizabalaceae | JQ806407 |
|
|
| Papaveraceae | DQ088996 |
|
|
| Papaveraceae | DQ088997 |
|
|
| Papaveraceae | DQ088998 |
|
|
| Ranunculaceae | Aquca-136-00009.1 |
|
|
| Ranunculaceae | Aquca-022-00039.1 |
|
|
| Ranunculaceae | Aquca-136-00010.1 |
|
|
| Ranunculaceae | JN887118 |
|
|
| Ranunculaceae | AY867879 |
|
|
| Sabiaceae | AY464105 |
|
|
| Brassicaceae | 946287 |
|
|
| Brassicaceae | 486333 |
|
|
| Brassicaceae | 321962 |
|
|
| Brassicaceae | 489841 |
|
|
| Brassicaceae | AT3G58780 |
|
|
| Brassicaceae | AT2G42830 |
|
|
| Brassicaceae | AT4G09960 or NM_001203767.1 |
|
|
| Brassicaceae | Brara.K01743.1 |
|
|
| Brassicaceae | Brara.G01817.1 |
|
|
| Brassicaceae | Brara.E00310.1 |
|
|
| Brassicaceae | Brara.C02624.1 |
|
|
| Brassicaceae | Carubv10005558m |
|
|
| Brassicaceae | Carubv10019520m |
|
|
| Brassicaceae | Carubv10025002m |
|
|
| Brassicaceae | Carubv10003771 |
|
|
| Brassicaceae | Thhalv10006196 |
|
|
| Brassicaceae | Thhalv10017047 |
|
|
| Brassicaceae | Thhalv10028938 |
|
|
| Caricaceae | Evm. TU supercontig_50.73 |
|
|
| Fabaceae | Medtr8g087860.1 |
|
|
| Fabaceae | JX308825 |
|
|
| Fabaceae | Medtr 3g 005530.1 |
|
|
| Malvaceae | Gorai.N017200.1 |
|
|
| Malvaceae | Gorai.011G035500.1 |
|
|
| Malvaceae | Gorai012G042600.1 |
|
|
| Malvaceae | Gorai.009G265100.1 |
|
|
| Malvaceae | Gorai.009G288000.1 |
|
|
| Malvaceae | Thecc1E6029596t1 |
|
|
| Malvaceae | Thecc1EG001841t1 |
|
|
| Malvaceae | Thecc1EG036541t1 |
|
|
| Phrymaceae | Mgv1a012605 or Migut.M00986.1 |
|
|
| Phrymaceae | Mgv1a013047m or Migut.C01334.1 |
|
|
| Salicaceae | Potri.011G075800.1 |
|
|
| Salicaceae | Potri.013G104900.1 |
|
|
| Salicaceae | Potri.019G077200.1 |
|
|
| Solanaceae | L26295.1 |
|
|
| Solanaceae | AY098735 |
|
|
| Solanaceae | NM_001247265.2 |
|
|
| Vitaceae | GSVIVG01000802001 |
|
|
| Vitaceae | GSVIVT01021303001 |
Genes retrieved from NCBI (National Center for Biotechnology Information) (genes with * were retrieved from Phytozome) (long).
Summary of symbols.
| Symbol | Denomination | Description |
|---|---|---|
|
| Identity threshold | Threshold value used to build a network, based on similarity values ranging from 0% to 100%. Pairs of sequences that have an identity value greater than or equal to |
| Adjacency matrix at | Adjacency matrix obtained at a certain value of | |
|
| Element of the adjacency matrix | Element of the adjacency matrix and represents the presence (1) or absence (0) of an edge between sequences |
|
| Neighborhood matrix | Matrix composed of elements representing the least number of edges necessary to connect a pair of sequences |
|
| Element of the neighborhood matrix | Element of the neighborhood matrix and represents the least number of edges connecting sequences |
| Δ | Increments of | Value incremented to |
| Network distance between 2 networks | Represents the network distance |
Figure 1.Phylogenetic tree of the AGAMOUS family genes. Main functional groups are highlighted in black boxes along the tree.
Figure 2.(A) Network distance graph based on the δ(σ, σ + Δσ) distance. The values for the analyzed networks obtained at 51%, 67%, and 75% are marked. (B) Network obtained at 67% identity. Nodes are colored based on the community they belong to (C1-C5), as result of the modularity algorithm (see methods). The sequences that do not belong to any community are represented as gray nodes. Network obtained at 67% identity, colored based on (C) gene function and (D) species phylogenetic placement.
Figure 3.Schematic diagram of results based on phylogenetic (left) and network (right) analyses. Potential contributions of each approach, as well as benefits steaming from the combination of both methods are described below the diagrams.