| Literature DB >> 32922818 |
Shi-Pin Chen1,2, Wei-Hong Sun1,2, Yuan-Fang Xiong1,2, Yu-Ting Jiang1,2, Xue-Die Liu1,2, Xing-Yu Liao1,2, Di-Yang Zhang1,2, Shu-Zhen Jiang1,2, Yu Li1,2, Bin Liu1,2, Liang Ma2, Xia Yu2, Li He2, Bao Liu2, Jin-Lin Feng2, Li-Zhen Feng2, Zhi-Wen Wang3, Shuang-Quan Zou1,2, Si-Ren Lan1,2, Zhong-Jian Liu2,4,5.
Abstract
Lauraceae includes the genus Phoebe, and the family is linked to the evolution of magnoliids. We sequenced the genome of Phoebe bournei Nanmu. The assembled genome size was 989.19 Mb, with a contig N50 value of 2.05 Mb. A total of 28,198 protein-coding genes were annotated in P. bournei. Whole-genome duplication (WGD) analysis showed that Lauraceae has experienced two WGD events; the older WGD event occurred just before the divergence of Lauraceae and Magnoliales, and the more recent WGD was shared by all lineages of Lauraceae. The phylogenetic tree showed that magnoliids form a sister clade to monocots and eudicots. We also identified 63 MADS-box genes, including AGL12-like genes that may be related to the regulation of P. bournei roots and FIN219-like genes encoding GH3 proteins, which are involved in photomorphogenesis. SAUR50-like genes involved in light signal-mediated pedicel or stem development were also identified. Four ATMYB46- and three PtrEPSP-homologous genes related to lignin biosynthesis were identified. These genes may be associated with the formation of straight trunks in P. bournei. Overall, the P. bournei reference genome provides insight into the origin, evolution, and diversification of Phoebe and other magnoliids.Entities:
Keywords: Genome; Genome evolution
Year: 2020 PMID: 32922818 PMCID: PMC7459323 DOI: 10.1038/s41438-020-00368-z
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Fig. 1Comparison of phylogenetic trees constructed by different methods.
a Phylogenetic tree based on the Bayesian method. b Phylogenetic tree based on the ASTRAL (left) and concatenated (right) amino acid sequences. c Phylogenetic tree based on the ASTRAL (left) and concatenated (right) nucleotide sequences
Fig. 2The expansion and contraction of gene families.
The green numbers are the numbers of expanded gene families, and the red numbers are the numbers of contracted gene families. The blue portions of the pie charts represent the gene families whose copy numbers are constant. The orange portions of the pie charts represent the proportion of 11,968 gene families found in the most recent common ancestor (MRCA) that have expanded or contracted during recent differentiation
Fig. 3WGD analysis.
a Ks distribution in P. bournei and other species. Peaks in the intraspecies Ks distribution indicate whole-genome polyploidization events, and peaks in the interspecies Ks distribution indicate speciation events. b Collinear point diagram of the P. bournei genome. There are multiple 1:3 collinear regions in the P. bournei genome, among which one extra green copy region provides evidence of recent WGD, and two extra red copy regions provide evidence of ancient WGD events. c The phylogeny of magnoliids with WGD events
MADS-box genes in P. bournei, C. kanehirae, and Arabidopsis thaliana
| Category | |||
|---|---|---|---|
| Type II (total) | 33 | 33 | 37 |
| MIKCc | 37 | 27 | 31 |
| MIKC* | 6 | 6 | 6 |
| Type I (total) | 53 | 30 | 27 |
| Mα | 23 | 23 | 22 |
| Mβ | 18 | 4 | 3 |
| Mγ | 12 | 3 | 2 |
| Total | 86 | 63 | 64 |
Fig. 4Phylogenetic analysis of MADS-box genes from P. bournei, C. kanehirae, and A. thaliana.
The gene ID numbers begin with “Pb” to represent the gene IDs of P. bournei, “Ck” to represent the gene IDs of C. kanehirae, and “AT” to represent the gene IDs of A. thaliana
Fig. 5Analysis of P. bournei GH3 proteins.
a Phylogenetic relationships of P. bournei, C. kanehirae, and A. thaliana GH3 proteins. Bootstrap values are indicated at each node. The gene ID numbers begin with “Pb“ to represent the gene IDs of P. bournei, “Ck” to represent the gene IDs of C. kanehirae, and “AT” to represent the gene IDs of A. thaliana. In the parentheses, the number on the left is the number of GH3 homologs in the genome of P. bournei, and that on the right is the number of GH3 homologs in the genome of C. kanehirae. b Expression patterns of GH3 in the cortex, heartwood, and leaf. “Co” represents the cortex, “He” represents heartwood. “Le” represents the leaf. The accession numbers of the GH3 genes of A. thaliana are shown in Supplementary Table 13
Fig. 6Phylogenetic relationships of P. bournei, C. kanehirae, and A. thaliana SAUR proteins.
The gene ID numbers begin with “Pb” to represent the gene IDs of P. bournei, “Ck” to represent the gene IDs of C. kanehirae, and the “AT” to represent the gene IDs of A. thaliana. SAUR50-like genes of P. bournei are indicated in red
Fig. 7Identification of the homologous genes of MYB46 and EPSP.
a Phylogenetic relationship of PbMYB46-homologous genes with anther secondary wall- or lignin-associated MYBs. b Phylogenetic relationships of PbEPSP-homologous genes with EPSP-homologous genes in different plants. c Expression patterns of MYB46 and EPSP in the cortex, heartwood, and leaves. “He” represents heartwood, “Co” represents the cortex, and “Le” represents leaves. The accession numbers of MYB46 and EPSP in different species are shown in Supplementary Table 13