Literature DB >> 32004401

A high-quality walnut genome assembly reveals extensive gene expression divergences after whole-genome duplication.

Junpei Zhang1, Wenting Zhang2, Feiyang Ji1, Jie Qiu2, Xiaobo Song1, Dechao Bu3, Gang Pan4, Qingguo Ma1, Jiaxin Chen2, Ruimin Huang1, Yingying Chang1, Dong Pei1.   

Abstract

Entities:  

Keywords:  Juglans; WGD; genome

Mesh:

Year:  2020        PMID: 32004401      PMCID: PMC7415773          DOI: 10.1111/pbi.13350

Source DB:  PubMed          Journal:  Plant Biotechnol J        ISSN: 1467-7644            Impact factor:   9.803


× No keyword cloud information.
The common walnut Juglans regia and its related species in the genus Juglans are important economic trees, which have been widely grown for nut and wood productions in many counties. In the Juglans genus, there are ~ 21 diploid species all with 2n = 32 chromosomes, which can be divided into four major sections based on phylogenetic analyses and fruit morphology – Cardiocaryon (e.g. J. cathayensis and J. mandshurica), Juglans (e.g. J. sigillata and the most popular one J. regia), Trachycaryon (e.g. J. cinerea) and Rhysocaryon (e.g. J. hindsii, J. nigra and J. microcarpa) (Aradhya et al., 2007). For walnuts, the draft genome sequences of several Juglans species had been generated from second‐generation sequencing platform (J. regia Chandler N50: 465Kb, Martınez‐Garcıa et al., 2016; J. regia Chandler N50: 640Kb, Stevens et al., 2018). Very recently, the J. microcarpa × J. regia hybrid was sequenced using single molecule sequencing technology, and the new methods for haplotype phasing facilitated the genome assembling of the two species (J. regia Serr N50: 2.90 Mb, Zhu et al., 2019). In this study, we aimed to generate a high‐quality reference‐level de novo assembly and gene annotations of J. regia for genetic and genomic studies in walnut. To search for J. regia with low heterozygous rate, we selected 7 diverse J. regia accessions for Illumina sequencing to estimate the whole‐genome‐level heterozygosity. The accession Zhongmucha‐1 (an ancient tree from Tibet, China; N29°04.889’ E92°44.432’; 3277 ASL) with the lowest (0.385%) heterozygosity was used to generate high‐quality genome sequences. The resulting genome assemblies of Zhongmucha‐1 contain a total of 353 contigs, with the contig N50 size of 3.34 Mb. Combined with Hi‐C data generated in this work and the linkage and physical maps reported previously (Luo et al., 2015; Zhu et al., 2019), the contigs were anchored and ordered, generating chromosome‐level sequences of 540 Mb. We found ~ 95% of the RNA‐seq reads and 97.25% of the Illumina sequencing reads could be aligned onto the final assembly. The completeness of the genome assembly was then evaluated using BUSCO data sets (Simão et al., 2016), and ~ 94% of the core eukaryotic genes were able to be retrieved. To annotate the walnut genome for protein‐coding genes, we used a hybrid gene prediction protocol with combinations of ab initio gene predictions and homologs sequence searching, which was further integrated with RNA‐seq data from 9 representative tissues in J. regia (Figure 1a).
Figure 1

(a) Sequence assembly flow chart for Juglans regia ‘Zhongmucha‐1’. (b) Phylogenetic tree of angiosperm species on the basis of orthologs of single‐copy gene families. Yellow rectangle region represents fagales species. (c) Self‐alignment of walnut genomes. The circus map shows, from outside to inside, ideograms of the sixteen chromosomes, density of TE, density of genes and syntenic blocks. (d) Collinear relationship at the chromosome level between Morella rubra and Juglans regia sub‐genomes. (e) The summary of gene losses in two sub‐genomes of J. regia through comparisons with M. rubra. ‘+’ and ‘‐’ indicate the gene retention and gene loss, respectively. (f) A local comparison for genes in Morella rubra and their orthologs in J. regia sub‐genomes. The black boxes indicate genes, the grey arc ligature indicates gene lost in J. regia subB, the yellow arc ligatures indicate genes lost in J. regia subA, and the green arc ligatures indicate genes conserved in J. regia subA and subB. (g) Expression correlation of sub‐genome paired genes. Frequency distributions are shown for Spearman correlation coefficient calculated from expressions of paired genes in nine tissues. Among them, (h) the detailed expression profiles in nine tissues are shown for JreCHS and JreSRG1. (i) Heatmap and cluster of gene pairs between sub‐genomes based on their expressions in fruit immature. (j) Frequency distributions of Ka/Ks values for four clusters. (k) A summarized model proposed for the phylogeny of Juglans. The divergence time of M. rubra and J. regia was estimated ~ 31 Mya. The walnut genome had undergone a whole‐genome duplication event from 2n = 16 to 2n = 32 chromosomes around 24 million years ago. The rediploidization event was likely to occur soon after WGD for the Juglans genus. J. regia and its landrace J. sigillata arose as a hybrid between the American and the Asian lineages around 3.45 million years ago.

(a) Sequence assembly flow chart for Juglans regia ‘Zhongmucha‐1’. (b) Phylogenetic tree of angiosperm species on the basis of orthologs of single‐copy gene families. Yellow rectangle region represents fagales species. (c) Self‐alignment of walnut genomes. The circus map shows, from outside to inside, ideograms of the sixteen chromosomes, density of TE, density of genes and syntenic blocks. (d) Collinear relationship at the chromosome level between Morella rubra and Juglans regia sub‐genomes. (e) The summary of gene losses in two sub‐genomes of J. regia through comparisons with M. rubra. ‘+’ and ‘‐’ indicate the gene retention and gene loss, respectively. (f) A local comparison for genes in Morella rubra and their orthologs in J. regia sub‐genomes. The black boxes indicate genes, the grey arc ligature indicates gene lost in J. regia subB, the yellow arc ligatures indicate genes lost in J. regia subA, and the green arc ligatures indicate genes conserved in J. regia subA and subB. (g) Expression correlation of sub‐genome paired genes. Frequency distributions are shown for Spearman correlation coefficient calculated from expressions of paired genes in nine tissues. Among them, (h) the detailed expression profiles in nine tissues are shown for JreCHS and JreSRG1. (i) Heatmap and cluster of gene pairs between sub‐genomes based on their expressions in fruit immature. (j) Frequency distributions of Ka/Ks values for four clusters. (k) A summarized model proposed for the phylogeny of Juglans. The divergence time of M. rubra and J. regia was estimated ~ 31 Mya. The walnut genome had undergone a whole‐genome duplication event from 2n = 16 to 2n = 32 chromosomes around 24 million years ago. The rediploidization event was likely to occur soon after WGD for the Juglans genus. J. regia and its landrace J. sigillata arose as a hybrid between the American and the Asian lineages around 3.45 million years ago. Based on the collinear relationship with our J. regia genome, the scaffolds from other five Juglans species (J. cathayensis, J. mandshurica, J. sigillata, J. hindsii and J. nigra) were anchored onto chromosomes followed with ordering and orientation, generating 16 pseudochromosomes for each Juglans species. Taken together, coupled with the high‐quality assembly of J. microcarpa (Zhu et al., 2019), the chromosome‐level sequences are now available for totally seven species from the three major sections in Juglans. Through aligning the assemblies of the six related species with that of J. regia, we identified highly confident variants within coding regions, including 2 270 791 single nucleotide polymorphisms (SNPs), 235 764 small indels of ≤ 20 bp and 52 363 structural variants (SVs). The genome assemblies of all Juglans and the variation data have been deposited at http://xhhuanglab.cn/data/juglans.html, and the assembly of Juglans regia can also be downloaded from BIGD under Bioproject number PRJCA002070. Using single‐copy genes, we constructed a phylogenetic tree for the seven Juglans species, their close relatives with reported genome sequences and the core eudicots genomes, along with Oryza sativa from monocots as outgroup (Figure 1b). Self‐alignment of the walnut genome sequences based on the 39 432 gene models identified 11 446 paralogous gene groups with 555 synteny blocks, which indicated that the gene pairs were actually due to one‐to‐one chromosome pairs on the walnut genome (dividing into two sub‐genomes – subA and subB in this work; Figure 1c). A total of 11 938 (between red bayberry and walnut subA) and 7978 (between red bayberry and walnut subB) one‐to‐one orthologous gene pair blocks were found and used to visualize the detailed orthologous chromosome‐to‐chromosome relationships (Figure 1d). There were 9457 cases with both copies retained in J. regia, 7530 cases with singletons retained in subA and 3204 retained in subB, meaning more than half of the duplicated gene pairs had lost one copy in J. regia after the WGD (Figure 1e). According to the estimation of the WGD time (~24.48 million years ago), coding genes were lost at a rate range from 0.56% to 1.62% per million years for each chromosome. We further focused on 6981 one‐to‐one collinear gene pairs between sub‐genomes to compare the expression patterns in each pair. The correlation coefficient of the expression levels of the two copies in nine tissues was calculated for each gene pair (Figure 1g), and ~22% pairs showed significant correlations (850 pairs with P < 0.05). The other ~78% pairs showed weak or no expression pattern correlations between copies in subA and subB, suggesting the duplicates of each other begin to diverge in expression levels and patterns. As a typical example for co‐expressed gene pairs (Figure 1h), the two copies JreCHS and JreCHS (a key enzyme involved in the biosynthesis of flavonoids) showed closely correlated gene expressions, which may indicate that both copies were responsible for the biosynthesis of the flavonoids with less functional divergence. Another contrasting example is JreSRG1 (a gene involved in plant senescence; Figure 1h), of which the subA copy were normally expressed in immature fruit and somatic embryo while the subB copy had no transcripts in these tissues, with numerous changes in promoter regions between the two copies (sequence identity = 45%). The abundant gene copies exhibiting diverse expression patterns were probably due to the sequence divergence in the promoter and other regulation regions (e.g. UTRs and enhancers) between sub‐genomes. We then performed hierarchical clustering for these gene pairs based on their expression patterns in each tissue. For the tissue ‘immature fruit’ (Figure 1i), gene pairs could be classified into four distinctive clusters. Frequency distributions of the Ka/Ks ratio in the four clusters showed the clusterD (high expressions in both copies) had the lowest Ka/Ks ratio peak value (ClusterA: 0.2582; ClusterB: 0.2257; ClusterC: 0.2243; ClusterD: 0.1736), which suggested genes in this cluster tended to be conserved (Figure 1j). Along with J. microcarpa (Zhu et al., 2019), now we totally have 7 well‐assembled Juglans genomes. Through the estimation of the peak Ks value of whole‐genome orthologous genes between J. regia and each of the related species, J. regia speciated with J. sigillata roughly 0.84 million years ago and with other related species ~2.64 million years ago, much later than the WGD event (24.48 million years). Moreover, based on the Ks values, we can estimate the divergence time between M. rubra and J. regia was about 31 million years ago close to previous estimation (Jia et al., 2019), and the ancestry of the J. regia and J. sigillata genome and the origin of J. regia and J. sigillata was dated to ~0.82 Mya. Recently, Zhang et al, 2019 discovered that J. regia (and its landrace J. sigillata) arose as a hybrid between the American and the Asian lineages around 3.45 million year ago. Based on these clues, now we proposed a summarized model for the phylogeny of Juglans, as displayed in Figure 1k.

Author contributions

D.P. conceived and designed the experiments. J.Z, F.J., X.S., G.P., Q.M. and Y.C performed the experiments. W.Z., F.J., J.Q., D.B., R.H. and J.C. analysed the genome data. W.Z. and D.P. wrote the manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  6 in total

1.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.

Authors:  Felipe A Simão; Robert M Waterhouse; Panagiotis Ioannidis; Evgenia V Kriventseva; Evgeny M Zdobnov
Journal:  Bioinformatics       Date:  2015-06-09       Impact factor: 6.937

2.  The walnut (Juglans regia) genome sequence reveals diversity in genes coding for the biosynthesis of non-structural polyphenols.

Authors:  Pedro J Martínez-García; Marc W Crepeau; Daniela Puiu; Daniel Gonzalez-Ibeas; Jeanne Whalen; Kristian A Stevens; Robin Paul; Timothy S Butterfield; Monica T Britton; Russell L Reagan; Sandeep Chakraborty; Sriema L Walawage; Hans A Vasquez-Gross; Charis Cardeno; Randi A Famula; Kevin Pratt; Sowmya Kuruganti; Mallikarjuna K Aradhya; Charles A Leslie; Abhaya M Dandekar; Steven L Salzberg; Jill L Wegrzyn; Charles H Langley; David B Neale
Journal:  Plant J       Date:  2016-07-18       Impact factor: 6.417

3.  Genomic Variation Among and Within Six Juglans Species.

Authors:  Kristian A Stevens; Keith Woeste; Sandeep Chakraborty; Marc W Crepeau; Charles A Leslie; Pedro J Martínez-García; Daniela Puiu; Jeanne Romero-Severson; Mark Coggeshall; Abhaya M Dandekar; Daniel Kluepfel; David B Neale; Steven L Salzberg; Charles H Langley
Journal:  G3 (Bethesda)       Date:  2018-07-02       Impact factor: 3.154

4.  Sequencing a Juglans regia × J. microcarpa hybrid yields high-quality genome assemblies of parental species.

Authors:  Tingting Zhu; Le Wang; Frank M You; Juan C Rodriguez; Karin R Deal; Limin Chen; Jie Li; Sandeep Chakraborty; Bipin Balan; Cai-Zhong Jiang; Patrick J Brown; Charles A Leslie; Mallikarjuna K Aradhya; Abhaya M Dandekar; Patrick E McGuire; Daniel Kluepfel; Jan Dvorak; Ming-Cheng Luo
Journal:  Hortic Res       Date:  2019-03-25       Impact factor: 6.793

5.  Synteny analysis in Rosids with a walnut physical map reveals slow genome evolution in long-lived woody perennials.

Authors:  Ming-Cheng Luo; Frank M You; Pingchuan Li; Ji-Rui Wang; Tingting Zhu; Abhaya M Dandekar; Charles A Leslie; Mallikarjuna Aradhya; Patrick E McGuire; Jan Dvorak
Journal:  BMC Genomics       Date:  2015-09-17       Impact factor: 3.969

6.  The red bayberry genome and genetic basis of sex determination.

Authors:  Hui-Min Jia; Hui-Juan Jia; Qing-Le Cai; Yan Wang; Hai-Bo Zhao; Wei-Fei Yang; Guo-Yun Wang; Ying-Hui Li; Dong-Liang Zhan; Yu-Tong Shen; Qing-Feng Niu; Le Chang; Jie Qiu; Lan Zhao; Han-Bing Xie; Wan-Yi Fu; Jing Jin; Xiong-Wei Li; Yun Jiao; Chao-Chao Zhou; Ting Tu; Chun-Yan Chai; Jin-Long Gao; Long-Jiang Fan; Eric van de Weg; Jun-Yi Wang; Zhong-Shan Gao
Journal:  Plant Biotechnol J       Date:  2018-08-10       Impact factor: 9.803

  6 in total
  8 in total

1.  Metabolome and Transcriptome Profiling Unveil the Mechanisms of Polyphenol Synthesis in the Developing Endopleura of Walnut (Juglans regia L.).

Authors:  Ruimin Huang; Ye Zhou; Feng Jin; Junpei Zhang; Feiyang Ji; Yongchao Bai; Dong Pei
Journal:  Int J Mol Sci       Date:  2022-06-14       Impact factor: 6.208

2.  The Manchurian Walnut Genome: Insights into Juglone and Lipid Biosynthesis.

Authors:  Xiang Li; Kewei Cai; Qinhui Zhang; Xiaona Pei; Song Chen; Luping Jiang; Zhiming Han; Minghui Zhao; Yan Li; Xinxin Zhang; Yuxi Li; Shikai Zhang; Su Chen; Guanzheng Qu; Mulualem Tigabu; Vincent L Chiang; Ronald Sederoff; Xiyang Zhao
Journal:  Gigascience       Date:  2022-06-28       Impact factor: 7.658

3.  Genome-Wide Identification and Characterization of bHLH Transcription Factors Related to Anthocyanin Biosynthesis in Red Walnut (Juglans regia L.).

Authors:  Wei Zhao; Yonghui Liu; Lin Li; Haijun Meng; Ying Yang; Zhaobin Dong; Lei Wang; Guoliang Wu
Journal:  Front Genet       Date:  2021-02-24       Impact factor: 4.599

4.  Dead-End Hybridization in Walnut Trees Revealed by Large-Scale Genomic Sequence Data.

Authors:  Wei-Ping Zhang; Lei Cao; Xin-Rui Lin; Ya-Mei Ding; Yu Liang; Da-Yong Zhang; Er-Li Pang; Susanne S Renner; Wei-Ning Bai
Journal:  Mol Biol Evol       Date:  2022-01-07       Impact factor: 16.240

5.  A genome variation map provides insights into the genetics of walnut adaptation and agronomic traits.

Authors:  Feiyang Ji; Qingguo Ma; Wenting Zhang; Jie Liu; Yu Feng; Peng Zhao; Xiaobo Song; Jiaxin Chen; Junpei Zhang; Xin Wei; Ye Zhou; Yingying Chang; Pu Zhang; Xuehui Huang; Jie Qiu; Dong Pei
Journal:  Genome Biol       Date:  2021-10-27       Impact factor: 13.583

6.  Chromosome-scale assembly reveals asymmetric paleo-subgenome evolution and targets for the acceleration of fungal resistance breeding in the nut crop, pecan.

Authors:  Lihong Xiao; Mengjun Yu; Ying Zhang; Jie Hu; Rui Zhang; Jianhua Wang; Haobing Guo; He Zhang; Xinyu Guo; Tianquan Deng; Saibin Lv; Xuan Li; Jianqin Huang; Guangyi Fan
Journal:  Plant Commun       Date:  2021-09-24

7.  Characterization of walnut JrWOX11 and its overexpression provide insights into adventitious root formation and development and abiotic stress tolerance.

Authors:  Yingying Chang; Xiaobo Song; Mingjun Li; Qixiang Zhang; Pu Zhang; Xiashuo Lei; Dong Pei
Journal:  Front Plant Sci       Date:  2022-09-06       Impact factor: 6.627

8.  Soil Chemical and Microbiological Properties Are Changed by Long-Term Chemical Fertilizers That Limit Ecosystem Functioning.

Authors:  Yong-Chao Bai; Ying-Ying Chang; Muzammil Hussain; Bin Lu; Jun-Pei Zhang; Xiao-Bo Song; Xia-Shuo Lei; Dong Pei
Journal:  Microorganisms       Date:  2020-05-08
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.