| Literature DB >> 21619600 |
Andrea Zuccolo1, John E Bowers, James C Estill, Zhiyong Xiong, Meizhong Luo, Aswathy Sebastian, José Luis Goicoechea, Kristi Collura, Yeisoo Yu, Yuannian Jiao, Jill Duarte, Haibao Tang, Saravanaraj Ayyampalayam, Steve Rounsley, Dave Kudrna, Andrew H Paterson, J Chris Pires, Andre Chanderbali, Douglas E Soltis, Srikar Chamala, Brad Barbazuk, Pamela S Soltis, Victor A Albert, Hong Ma, Dina Mandoli, Jody Banks, John E Carlson, Jeffrey Tomkins, Claude W dePamphilis, Rod A Wing, Jim Leebens-Mack.
Abstract
BACKGROUND: Recent phylogenetic analyses have identified Amborella trichopoda, an understory tree species endemic to the forests of New Caledonia, as sister to a clade including all other known flowering plant species. The Amborella genome is a unique reference for understanding the evolution of angiosperm genomes because it can serve as an outgroup to root comparative analyses. A physical map, BAC end sequences and sample shotgun sequences provide a first view of the 870 Mbp Amborella genome.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21619600 PMCID: PMC3219971 DOI: 10.1186/gb-2011-12-5-r48
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Frequencies of BAC end sequences and Sanger shot gun sequences matching sequences in Repbase
| Type | Absolute number in BESs | Percentage BESs | Percentage repeats in BESs | Absolute number in SGSs | Percentage SGSs | Percentage repeats in SGSs |
|---|---|---|---|---|---|---|
| hAT | 642 (1,671) | 0.92 (2.41) | 6.84 (4.61) | 20 (41) | 0.74 (1.52) | 5.73 (2.94) |
| MuDR | 343 (724) | 0.49 (1.04) | 3.65 (2.00) | 7 (30) | 0.26 (1.11) | 2.00 (2.15) |
| CACTA | 27 (75) | 0.04 (0.11) | 0.29 (0.21) | 0 (4) | 0 (0.15) | 0 (0.29) |
| Helitrons | 12 (69) | 0.02 (0.10) | 0.13 (0.19) | 0 (3) | 0 (0.11) | 0 (0.22) |
| Other | 108 (595) | 0.15 (0.86) | 1.15 (1.64) | 1 (24) | 0.04 (0.89) | 0.29 (1.72) |
| Total | 1,132 (3,134) | 1.63 (4.51) | 12.06 (8.64) | 28 (102) | 1.04 (3.78) | 8.02 (7.31) |
| LTR Ty1- | 2,162 (9,578) | 3.11 (13.79) | 23.02 (26.42) | 64 (314) | 2.37 (11.65) | 18.34 (22.51) |
| LTR Ty3- | 2,431 (8,395) | 3.50 (12.09) | 25.89 (23.15) | 129 (377) | 4.78 (13.98) | 36.96 (27.03) |
| LTR not classified | 720 (2,868) | 1.04 (4.13) | 7.67 (7.91) | 51 (139) | 1.89 (5.16) | 14.61 (0.96) |
| LINEs | 1,876 (8,055) | 2.70 (11.60) | 19.98 (22.22) | 55 (294) | 2.04 (10.91) | 15.76 (21.08) |
| SINEs | 11 (183) | 0.02 (0.26) | 0.12 (0.50) | 0 (4) | 0 (0.15) | 0 (0.29) |
| Retro not classified | 1,058 (4,046) | 1.52 (5.82) | 11.27 (11.16) | 23 (165) | 0.85 (6.12) | 6.59 (11.83) |
| Total | 8,258 (33,125) | 11.89 (47.69) | 87.94 (91.36) | 321 (1,293) | 11.91 (47.96) | 91.98 (92.69) |
| 9,390 (36,259) | 13.52 (52.20) | 100 (100) | 349 (1,395) | 12.95 (51.74) | 100 (100) |
Results in parentheses include Internal BlastN searches. Repbase v.15.08 was used [38]. SINE, short interspersed element; SGS, Sanger shot gun sequence.
Putatively high-copy MITEs identified in the BESs and Sanger shot gun sequences using MUST pipeline
| Length | Inverted repeat length | BES hits | Copy number estimate | SGS hits | Copy number estimate | AT% | |
|---|---|---|---|---|---|---|---|
| MITE_1 | 358 | 26 | 542 | ~17,000 | 18 | ~17,200 | 68.80 |
| MITE_2 | 190 | 19 | 140 | ~3,300 | 8 | ~3,100 | 68.70 |
| MITE_3 | 516 | 47 | 394 | ~17,900 | 8 | ~11,300 | 75.20 |
Copy number estimates based on procedure of Hawkins et al. [50]. SGS, Sanger shot gun sequence.
Figure 1Maximum likelihood trees for reverse transcriptase genes classified as Copia-type and Gypsy-type LTR and LINE elements. (a) Copia-type; (b) Gypsy-type LTRs; (c) Gypsy-type LINEs. The maximum likelihood trees show rate heterogeneity and no recent expansive radiations (that is, short terminal branches). Reverse transcriptase sequences were mined from BAC end sequence set.
Simple sequence repeats identified in BESs and Sanger shot gun sequences
| Repeat | ||||
|---|---|---|---|---|
| Mono | 149.66 | 152.89 | 72.74 | 50.79 |
| Di | 225.03 | 211.00 | 77.89 | 63.94 |
| Tri | 72.49 | 78.96 | 110.01 | 144.06 |
| Tetra | 89.88 | 90.70 | 100.67 | 102.25 |
| Penta | 74.85 | 89.73 | 64.54 | 56.00 |
| Total | 611.92 | 623.28 | 425.85 | 417.04 |
aValues are presence per million base pairs. RS, Random sheared
Figure 2K-mer analyses of Sanger shotgun sequences reveal low frequencies of short repeats in the .
Statistics for cDNA sequences included in multi-library transcriptome assembly of 246,196 unigenes with lengths greater than 100 bp
| Tissue - library name | Sequencing method | Number of reads | Unscreened reads | Total passing bases (MB) |
|---|---|---|---|---|
| Apical meristem - Atr12 | 454 FLX Titanium | 794,746 | 688,305 | 201.90 |
| Male flowers - Atr15 | 454 FLX Titanium | 277,023 | 255,213 | 73.49 |
| Old leaves - Atr14 | 454 FLX Titanium | 280,097 | 260,563 | 73.49 |
| Old stem - Atr13 | 454 FLX Titanium | 259,431 | 238,156 | 68.70 |
| Pre-meiotic female flower buds - Atr10 | 454 FLX GS | 895,000 | 812,325 | 176.97 |
| Pre-meiotic female flower bud - Atr02 | Sanger | 13,263 | 13,141 | 7.17 |
| Pre-meiotic male flower bud - Atr01 | Sanger | 25,343 | 25,006 | 14.17 |
| Root - Atr11 | 454 FLX GS | 324,070 | 300,275 | 64.88 |
| Stem - Atr16 | 454 FLX Titanium | 410,098 | 388,436 | 120.03 |
Assemblies and raw data can be downloaded from the Ancestral Angiosperm Genome Project website [25]. A BLAST portal for the assembly is also available at the project website.
Figure 3Variation in rates of structural evolution evident in parsimony mapping of losses of synteny with 29 gene blocks inferred for the last common ancestor of all extant flowering plant lineages.
Figure 4Hybridization of three BAC clones in the minimum tiling paths for contigs 1003 and 431 to mitotic squashes (2Results for contig 1003; (f-j) results for contig 431. Panels (a) and (f) show all three BAC-FISH probes merged; (e,j) DAPI staining; (b,c,d) show each of three BACs (red, green, white) for contig 1003; (g,h,i) show each of three BACs (red, green, white) for contig 431.
Figure 5LASTZ dot plots comparing BAC contig 1003 syntenic regions in the grape and rice genomes. (a) Grape genome; (b) rice genome.
Figure 6LASTZ dot plots comparing BAC contig 431 syntenic regions in the grape and rice genomes. (a) Grape genome; (b) rice genome.
Figure 7Gene trees for auxin-independent growth promoter (Auxin-independent growth promoter (AXI1); (b) ceramidase; (c) plant uncoupling mitochondrial protein 1 [PUMP1] gene families. The gene trees show divergence of genes on Amborella contig 431 diverging from lineages leading to Vitis γ homeologs mapping to syntenic blocks on chromosomes 6, 8 and 13 (shown in red). Genes sampled from major angiosperm lineages are highlighted.