| Literature DB >> 33693604 |
John T Burley1,2, James R Kellner1,2, Stephen P Hubbell3, Brant C Faircloth4.
Abstract
The lack of genomic resources for tropical canopy trees is impeding several research avenues in tropical forest biology. We present genome assemblies for two Neotropical hardwood species, Jacaranda copaia and Handroanthus (formerly Tabebuia) guayacan, that are model systems for research on tropical tree demography and flowering phenology. For each species, we combined Illumina short-read data with in vitro proximity-ligation (Chicago) libraries to generate an assembly. For Jacaranda copaia, we obtained 104X physical coverage and produced an assembly with N50/N90 scaffold lengths of 1.020/0.277 Mbp. For H. guayacan, we obtained 129X coverage and produced an assembly with N50/N90 scaffold lengths of 0.795/0.165 Mbp. J. copaia and H. guayacan assemblies contained 95.8% and 87.9% of benchmarking orthologs, although they constituted only 77.1% and 66.7% of the estimated genome sizes of 799 and 512 Mbp, respectively. These differences were potentially due to high repetitive sequence content (>59.31% and 45.59%) and high heterozygosity (0.5% and 0.8%) in each species. Finally, we compared each new assembly to a previously sequenced genome for Handroanthus impetiginosus using whole-genome alignment. This analysis indicated extensive gene duplication in H. impetiginosus since its divergence from H. guayacan.Entities:
Keywords: zzm321990 Handroanthus (Tabebuia) guayacan; zzm321990 Jacaranda copaiazzm321990 ; Chicago library; dovetail; genome assembly; tropical tree
Mesh:
Year: 2021 PMID: 33693604 PMCID: PMC8034707 DOI: 10.1093/g3journal/jkab010
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Geographic distributions of herbarium records for species sequenced in this study (J. copaia and H. guayacan) as well as the previously sequenced H. impetiginosus (Silva-Junior ). Data obtained from gbif.org.
Figure 2Flowers of (A) H. guayacan (HAGU) and (B) J. copaia (JACO). While in bloom, individual tree crowns are highly conspicuous in (C) aerial and (D) satellite imagery of a lowland tropical forest in Central Panama. Image credits: (A and B) Steven Paton, Smithsonian Tropical Research Institute (STRI); (C) Jonathan Dandois, STRI.
Basic genome assembly metrics and quality statistics
| Species |
|
| ||
|---|---|---|---|---|
| Assembly method | Meraculous | Dovetail HiRise assembly | Meraculous | Dovetail HiRise assembly |
|
| 613.86 Mbp | 616.19 Mbp | 336.39 Mbp | 339.77 Mbp |
|
| 0.057 Mbp | 1.020 Mbp | 0.017 Mbp | 0.795 Mbp |
|
| 0.010 Mbp | 0.277 Mbp | 0.004 Mbp | 0.165 Mbp |
|
| 2,974 scaffolds | 184 scaffolds | 5,519 scaffolds | 113 scaffolds |
|
| 12,202 scaffolds | 624 scaffolds | 21,878 scaffolds | 452 scaffolds |
|
| 506,509 bp | 5,337,482 bp | 148,601 bp | 4,933,827 bp |
|
| 29,490 | 5,869 | 38,393 | 3,857 |
|
| 29,490 | 5,869 | 38,393 | 3,856 |
|
| 12,474 | 1,086 | 10,322 | 825 |
|
| 0.90 | 0.98 | 0.69 | 0.98 |
|
| 53,440 | 53,041 | 70,867 | 70,146 |
|
| 36.27 Kbp | 36.74 Kbp | 12.25 Kbp | 12.47 Kbp |
|
| — | 32.99% | — | 33.24% |
|
| 23,950 | 47,236 | 32,474 | 66,288 |
|
| 0.44% | 0.81% | 1.32% | 2.30% |
Genome completeness assessed using 2,121 eudicot Benchmarking Universal Single-Copy Orthologs (BUSCOs)
| Species: |
|
|
| ||
|---|---|---|---|---|---|
| Assembly: | Meraculous | HiRise | Meraculous | HiRise | Himp0.1 |
|
| 1,957 | 2,032 | 1,754 | 1,864 | 1,847 |
|
| 1,844 | 1,911 | 1,676 | 1,775 | 1,574 |
|
| 113 | 121 | 78 | 89 | 273 |
|
| 82 | 34 | 179 | 99 | 103 |
|
| 82 | 55 | 188 | 158 | 171 |
|
|
|
|
|
|
|
Predicted repetitive sequence content in assemblies
|
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|---|
| Number of elements | Length (Mb) | Fraction of assembly (%) | Number of elements | Length (Mb) | Fraction of assembly (%) | Number of elements | Length (Mb) | Fraction of assembly (%) | |
|
| 619,981 | 356.7 | 57.89 | 451,277 | 149.8 | 44.08 | 634,428 | 231.8 | 45.06 |
|
| 3,759 | 3.3 | 0.54 | 3,081 | 2.1 | 0.63 | 3,974 | 3.3 | 0.66 |
|
| 105,619 | 153.6 | 24.93 | 43,361 | 36.9 | 10.85 | 52,006 | 56.7 | 11.26 |
|
| 27,586 | 17.2 | 2.97 | 9,425 | 5.2 | 1.53 | 18,240 | 10.1 | 2.01 |
|
| 483,017 | 182.6 | 29.63 | 395,410 | 105.6 | 31.07 | 560,208 | 161.7 | 32.13 |
|
| 151,890 | 9.0 | 1.47 | 125,684 | 5.1 | 1.51 | 182,721 | 7.8 | 1.54 |
|
| 26,279 | 1.3 | 0.21 | 23,734 | 1.1 | 0.33 | 33,613 | 1.6 | 0.33 |
|
|
|
|
|
|
|
|
|
|
|
Genome alignment summary statistics
| Species comparison |
|
|
| |||
|---|---|---|---|---|---|---|
| Reference | Query | Reference | Query | Reference | Query | |
| HAGU | HIMP | JACO | HIMP | JACO | HAGU | |
|
| — | — | — | — | — | — |
|
| 3,857 | 13,204 | 5,869 | 13,204 | 5,869 | 3,857 |
|
| 3,655 (94.76%) | 12,398 (93.90%) | 2,172 (37.01%) | 8,150 (61.72%) | 2,257 (38.46%) | 1,937 (50.22%) |
|
| 202 (5.24%) | 806 (6.10%) | 3,697 (62.99%) | 5,054 (38.28%) | 3,612 (61.54%) | 1,920 (49.78%) |
|
| — | — | — | — | — | — |
|
| 339.77 | 503.29 | 616.19 | 503.29 | 616.19 | 339.77 |
|
| 266.8 (78.52%) | 374.8 (74.46%) | 21.8 (3.54%) | 32.8 (6.52%) | 23.8 (3.86%) | 26.8 (7.87%) |
|
| 72.9 (21.48%) | 128.5 (25.54%) | 594.4 (96.46%) | 470.5 (93.48%) | 592.4 (96.14%) | 313.0 (92.13%) |
|
| — | — | — | — | — | — |
|
| 149,208 | 149,208 | 44,485 | 44,485 | 68,689 | 68,689 |
|
| 232.8 | 232.7 | 20.2 | 20.1 | 22.2 | 22.2 |
|
| 1561 | 1560 | 454 | 453 | 324 | 323 |
|
| 90.34 | 90.34 | 87.27 | 87.27 | 88.73 | 88.73 |
|
| 447,736 | 447,736 | 91,126 | 91,126 | 174,083 | 174,083 |
|
| 479.4 | 479.1 | 40.8 | 40.7 | 38.3 | 38.2 |
|
| 1071 | 1070 | 448 | 447 | 220 | 220 |
|
| 89.94 | 89.94 | 91.87 | 91.87 | 92.64 | 92.64 |
Figure 3Depiction of relationships among the three currently available Bignoniaceae genomes and a high-quality assembly of the European olive tree (Olea europeae) derived from relationships of Jacaranda and Handroanthus resolved in Grose and Olmstead (2007a) and Olmstead . Node values depict whole genome alignment summary metrics: total length of 1-to-1 alignments (black) and average percentage identify of 1:1 alignments (gray).
Figure 4Shared and unique duplicated BUSCOs in genome assemblies for JACO, HAGU, and HAIM.