| Literature DB >> 18973668 |
Anja Rautenberg1, Dmitry Filatov, Bodil Svennblad, Nahid Heidari, Bengt Oxelman.
Abstract
BACKGROUND: Increasing evidence from DNA sequence data has revealed that phylogenies based on different genes may drastically differ from each other. This may be due to either inter- or intralineage processes, or to methodological or stochastic errors. Here we investigate a spectacular case where two parts of the same gene (SlX1/Y1) show conflicting phylogenies within Silene (Caryophyllaceae). SlX1 and SlY1 are sex-linked genes on the sex chromosomes of dioecious members of Silene sect. Elisanthe.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18973668 PMCID: PMC2636791 DOI: 10.1186/1471-2148-8-299
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Possible outcomes of incomplete paralogue sampling. Possible outcomes of incomplete paralogue sampling (bold lines) in relation to different relative ages of gene duplications. a) Recent gene duplications. The copies (inparalogues) are monophyletic within terminal taxa (e.g. organisms). The relationships among these remain unchanged, regardless of which copies are included in the study. b) An early gene duplication has created two ancient outparalogues. If different paralogues are sequenced for different taxa, the ingroup will not become monophyletic. c) A gene duplication has created two recent outparalogues. Organismal phylogenetic inference will give erroneous results, if some of the paralogues remain undetected. A through D represent ingroup taxa, numbers indicate sequence copies.
Vouchers used for sequencing of SlX1/SlY1/SlXY1 genes
| Taxon | Group | DNA type | Voucher (or reference) | Accession |
|---|---|---|---|---|
| [EMBL: | ||||
| [EMBL: | ||||
| B, C | XY, mRNA | Filatov & Charlesworth 2002[ | [EMBL: | |
| [EMBL: | ||||
| B, dE | X | Filatov & Charlesworth 2002[ | [EMBL: | |
| B, dE | Y | Filatov & Charlesworth 2002[ | [EMBL: | |
| L | XY | Filatov & Charlesworth 2002[ | [EMBL: | |
| B, dE | X, ♂plant | Delichère & & al. 1999[ | [EMBL: | |
| B, dE | X | Filatov & Charlesworth 2002[ | [EMBL: | |
| B, dE | Y | Filatov & Charlesworth 2002[ | [EMBL: | |
| B, dE | Y, ♂ plant | Delichère & al. 1999[ | [EMBL: | |
| B, E | XY, mRNA | Nicolas & al. 2004[ | [EMBL: | |
| [EMBL: | ||||
| [EMBL: | ||||
| B, Be | XY | Filatov & Charlesworth 2002[ | [EMBL: | |
| [EMBL: |
Groups are according to Oxelman & Lidén [31], and Oxelman & al. [13,33], where B = Silene subgenus Behenantha (Otth) Endl., Be = Silene sect. Behenantha Otth, C = Silene sect. Conoimorpha Otth, E = Silene sect. Elisanthe (Fenzl ex Endl.) Ledeb., dE = dioecious Silene sect. Elisanthe, L = Lychnis L., S = Silene subgenus Silene. Note that Be, C, and E are nested within B. "DNA type" indicates type of sequence (XY = SlXY1, from autosomes, X = SlX1 from X chromosome, Y = SlY1, from Y chromosome, ♀ = female plant, ♂ = male plant, mRNA = only exons). Specimens in bold were sequenced in the present study. For these accessions voucher information (collector, number and herbarium abbreviations, according to Holmgren & al. [61]) is given. More voucher details can be found in Sileneae database [62]. For other accessions references to original publications are given. Accession numbers in GenBank/EMBL. Superscripts A, B, C correspond to sequence names in the phylogenies.
Number and length of XY1 sequences
| Taxon | ||||||
|---|---|---|---|---|---|---|
| Number of copies ( | 2 | 1 | 1 | 3 | 1 | 3 |
| Length (bp) | 4517/4534 | 6280 | 6036 | 3004/3009/2979 + 1463/1459/1459 | 6738 | 2150 + 1890/3894/4618 |
| Covered area (exon numbers) | 2–15 | 1–15 | 1 (intron)-15 | 3 (intron)-6 (intron) + 10–14 (intron) | 1–14 | 1 (intron)-14 |
| Included PCR fragments | 14 | 8 | 9 | 3 | 9 | 17 |
| P | 0.9965 | 0.9959 | 0.9980 | < 0.95 | 0.9980 | 0.9924 |
| Distance between copies (± S E) | 0.001 ± 0.001 | 0.004 ± 0.001 | 0.024 ± 0.002 | |||
| Mean distances to other taxa ± SE | 0.045 ± 0.004–0.125 ± 0.008 | 0.045 ± 0.004–0.128 ± 0.007 | 0.064 ± 0.004–0.123± 0.009 (distance to the other vulgaris sequence = 0.032 ± 0.002) |
Number and length of XY1 sequence copies found. Indicated are also which parts of the region that are covered in which taxa. The "n" row indicates number of PCR fragments included in the probabilistic calculations of sequence numbers. The "p" row indicates the posterior probabilities that we did find all sequence copies (see Methods for formula). The divergence estimates are means from pairwise Tamura-Nei distances ± standard error (SE) from sequences within an individual, and mean distances to sequences from other individuals.
Figure 2Included parts of XY1. Thick grey lines indicate parts of the XY1 region included in the long (upper line) and reduced (lower line) alignments. Dotted lines indicate parts that were excluded. Recombination breakpoints suggested from the GARD analysis are indicated with arrows. The black boxes indicate exons, thin lines introns. The PCR primer positions are indicated below by arrows (sequence-specific primers not included). The intron/exon figure is redrawn from Atanassov & al. [29], and the lengths are based on S. latifolia Y1 sequences.
Figure 3Results from the GARD analysis. a) Support for the suggested positions of breakpoints in GARD analysis using the HKY85 nucleotide substitution model and beta-gamma rate distribution with 5 rate classes on the reduced XY1 matrix (4045 bp), Neighbor-Joining trees for the 5' (b) and 3' non-recombinant partitions (c).
Breakpoint locations
| 5' part | 3' part | Entire alignment | |
|---|---|---|---|
| Positions in reduced alignment | 1–1821 | 2385–4045 | 1–4045 |
| Corresponding positions in full alignment | 1–2718 | 3927–6416 | 1–6416 |
| Number of base/indel characters | 2718/148 | 2490/150 | 6416/361 |
| Substitution model | GTR + Γ | GTR + Γ | GTR + Γ |
Locations of breakpoints between non-recombinant parts of the reduced 4045-bp alignment suggested by GARD, and corresponding positions in the longer alignment. Substitution models were suggested by MrModeltest AIC.
Figure 4Phylogenies based on different parts of the XY1 region. Bayesian consensus chronograms for the 5' (a) and 3' parts (b) of the 6416 bp XY1 alignment. The partitions in a) and b) represent non-recombinant parts that were suggested by GARD from a reduced 9-sequence 4045 bp alignment of the XY1 gene. Values below branches are median ages in million years. Numbers above branches are posterior probabilities (PP)/maximum parsimony bootstrap (MPB) support values for the same alignment but with additional indel characters (values of PP < 0.70 and MPB < 60% are not shown). Hyphens (-) indicate nodes with high posterior probabilities that were not present in the MPB trees. Note that the MBP trees are unrooted, and there is thus only a single internal branch connecting the clades where the BEAST root (*) appears. Horizontal bars represent 95% HPD age intervals. Roman numbers are used to label nodes. Note how the positions of the groups sect. Conoimorpha (C) and S. vulgaris (V) change in the different partitions.
XY1 primers
| primer name | primer sequence |
|---|---|
| A-7F | GGAGGCAAGAAAGCATTGAG |
| Ai-1353F | GATCACATTTAGGCCAGT |
| B-13F | CGCCAACGTCTTTATCTCTCA |
| C-21R | TGGGTTTCACGACTTCAACA |
| 1-4F | AACGATAATACATCCCGGTGAG |
| Di-2283R | CACAATAGAGAAGCCCAAAGTT |
| 2-13F | GTTGCAACTCATACTGACAGTCC |
| 2-7R | GGAGCTCCCTAATCCTGTTT |
| 3-20F | TCTCGTCCAGATTTGGTGTG |
| 4-21R | AGCGGTTCAGAAGAGCACAT |
| 5-11F | ATAAGTCAGTTGTTTTGTGGAGCATC |
| 5-52R | ATGCCTCGAGGTCCAATAGA |
| 6-26F | AAGAGCTGGATTGACGCCAGTGAC |
| 7-4R | TGAAGATCAGCATTGTGAGCTTTCTC |
| 9-12F | TGCTGAAGATGGCTTGCTAA |
| 9-19R | AAGCCATCTTCAGCAGCACT |
| M-5F | GGAAACAGAGAGCGGAGGTA |
| 11-28R | CAGCAGAGCTTGAACAGTCATCT |
| O-22R | CAGCTCAGCCAAAACTTCCT |
Sequences for XY1 primers used for PCR (many also for sequencing, sequence specific primers used for sequencing are available on request from the first author). Specific primers are not included.
Sample sizes needed for 95% probability to find all sequence copies
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | |
|---|---|---|---|---|---|---|---|---|---|
| 5 | 8 | 11 | 14 | 17 | 20 | 23 | 26 | 28 |
Sample sizes (n) needed for P (x = x) > 0.95 calculated from a discrete uniform prior of x on {1, 2, . . ., 10} where xcorrespond to the observed number of sequence variants and n to the number of sampled PCR fragments.