| Literature DB >> 32658273 |
Nadia Kamal1, Terry Mun2, Dugald Reid2, Jie-Shun Lin2, Turgut Yigit Akyol3, Niels Sandal2, Torben Asp2, Hideki Hirakawa4, Jens Stougaard2, Klaus F X Mayer1,5, Shusei Sato3, Stig Uggerhøj Andersen2.
Abstract
Lotus japonicus is a herbaceous perennial legume that has been used extensively as a genetically tractable model system for deciphering the molecular genetics of symbiotic nitrogen fixation. Our aim is to improve the L. japonicus reference genome sequence, which has so far been based on Sanger and Illumina sequencing reads from the L. japonicus accession MG-20 and contained a large fraction of unanchored contigs. Here, we use long PacBio reads from L. japonicus Gifu combined with Hi-C data and new high-density genetic maps to generate a high-quality chromosome-scale reference genome assembly for L. japonicus. The assembly comprises 554 megabases of which 549 were assigned to six pseudomolecules that appear complete with telomeric repeats at their extremes and large centromeric regions with low gene density. The new L. japonicus Gifu reference genome and associated expression data represent valuable resources for legume functional and comparative genomics. Here, we provide a first example by showing that the symbiotic islands recently described in Medicago truncatula do not appear to be conserved in L. japonicus.Entities:
Keywords: comparative genomics; expression atlas; legume; rhizobium; symbiosis
Mesh:
Year: 2020 PMID: 32658273 PMCID: PMC7508351 DOI: 10.1093/dnares/dsaa015
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 3Symbiotic island gene expression. (A) Log(nodule/root) expression ratios for genes in Medicago A17 symbiotic islands and their best Lotus Gifu BLAST matches. n: gene count. r: Pearson correlation coefficient for the Lotus Gifu and Medicago A17 log(nodule/root) ratios. (B–C) Histograms of Pearson correlation coefficients for symbiotic islands. The Pearson correlation coefficient for each island is an average of the coefficients resulting from pairwise comparisons of the gene expression profiles of all genes residing within that island. n: symbiotic island count. (B) All genes in Medicago A17 symbiotic islands with a putative Lotus Gifu homolog with expression data. Multiple copies of the same Lotus Gifu gene are allowed. (C) Only one Lotus Gifu copy and one corresponding Medicago A17 gene is included in the analysis and it is further required that each Lotus Gifu island contains at least three members. Lotus: expression data from Lotus Gifu. Medicago: expression data from Medicago A17.
Assembly and genetic map statistics
| Dataset | Contig count | Total length (bp) | N50 (bp) | L50 |
|---|---|---|---|---|
| Assembly | 1,686 | 554,078,227 | 807,552 | 187 |
| Containing ≥1 SNP | 1,538 (91.2%) | 551,215,263 (99.5%) | 823,414 | 185 |
| Exclusively Gifu × | 105 (6.3%) | 3,270,218 (0.6%) | 35,963 | 27 |
| Exclusively Gifu × MG-20 | 124 (7.4%) | 5,010,333 (0.9%) | 51,370 | 29 |
| Contains SNPs from both | 1,309 (77.6%) | 542,934,712 (97.9%) | 835,713 | 180 |
| Does not contain any SNPs | 148 (8.8%) | 2,862,964 (0.5%) | 23,531 | 46 |
N50: at least 50% of the total length is contained within contigs of size N50 or longer. L50: at least 50% of the total length is contained within L50 number of contigs.
Figure 1(A) Circos diagram displaying heatmaps of the numbers of genes and ncRNAs (100 Mb bins) and bases covered by repetitive elements (10 Mb bins) in the Lotus Gifu genome. (B) BUSCO version 4 scores of the Lotus Gifu assembly (98.2%), the HC gene set (94%), the high- and LC gene set (96.1%) and of only the longest transcript of each gene (91.8%) from the joint HC and LC gene set. Lineage used: embryophyta_odb10. (C) Distribution of repetitive elements in the Lotus Gifu genome.
Genome annotation statistics
| Lotus Gifu version 1.2 HC + LC | Lotus Gifu version 1.2 HC | Medicago A17 version 4 | Medicago A17 version 4 HC | Medicago A17 version 5 |
| |
|---|---|---|---|---|---|---|
| Number of genes | 30,243 | 21,778 | 50,444 | 31,451 | 51,316 | 52,872 |
| Number of coding genes | 29,554 | 21,778 | 50,444 | 31,451 | 44,623 | 52,872 |
| Number of mRNAs | 49,868 | 37,994 | 57,585 | 38,175 | 44,623 | 86,256 |
| Number of exons | 306,545 | 264,198 | 267,394 | 397,385 | 189,379 | 560,910 |
| Number of CDSs | 262,442 | 236,845 | 257,792 | 376,276 | 174,461 | 516,059 |
| Average CDS lengths (bp) | 1,216.2 | 1,385.4 | 1,038.4 | 1,272.4 | 1,017.7 | 1,350.7 |
| Average exon lengths (bp) | 417.54 | 373.77 | 282.58 | 261.92 | 360.25 | 312.48 |
| Average intron lengths (bp) | 527.12 | 513.71 | 444.41 | 438.41 | 476.57 | 519.19 |
| Average transcripts per gene | 1.65 | 1.74 | 1.14 | 1.54 | 1 | 1.63 |
| Average exons per transcript | 6.15 | 6.95 | 4.64 | 6.75 | 4.19 | 6.5 |
| Average CDS exons per transcript | 5.23 | 6.23 | 4.48 | 6.39 | 3.91 | 5.98 |
HC, high-confidence gene models; LC, low-confidence gene models.
Figure 2Expression profiles of known symbiosis genes. Expression values from the Lotus Gifu RNA-seq expression atlas are shown for the indicated genes. A full description of the conditions included is shown in Supplementary Table S1. The heatmap was generated from https://lotus.au.dk/expat/ using the normalize by row function.
Conservation of symbiotic islands between Lotus and Medicago
| Island type | NRU | NRD | NRN | NDA | NDD | NDN |
|---|---|---|---|---|---|---|
| Islands | 270 | 89 | 84 | 49 | 211 | 57 |
| Mt genes | 2559 | 712 | 628 | 377 | 1680 | 429 |
| Mt genes with Lj hits | 1040 | 550 | 516 | 298 | 506 | 322 |
| Corresponding Lj genes | 770 | 358 | 456 | 261 | 396 | 275 |
| Lj genes with micro-synteny | 446 | 228 | 320 | 190 | 215 | 166 |
| Mt lncRNAs | 302 | 40 | 17 | 25 | 228 | 18 |
| Mt lncRNAs with Lj hit | 47 | 17 | 9 | 13 | 31 | 6 |
| Mt lncRNAs with Lj hit in micro-syntenic region | 17 | 3 | 5 | 9 | 8 | 5 |
| Conservation rate (%) | 40.6% | 77.2% | 82.2% | 79.0% | 30.1% | 75.1% |
| Duplication rate | 1.35 | 1.54 | 1.13 | 1.14 | 1.28 | 1.17 |
| Ratio of genes with micro-synteny | 57.9% | 63.7% | 70.2% | 72.8% | 54.3% | 60.4% |
| Islands with Lj hits in more than half of the genes | 101 (37%) | 80 (90%) | 82 (98%) | 44 (90%) | 36 (17%) | 49 (86%) |
Mt, Medicago A17; Lj, Lotus Gifu; NRU, nodule versus root upregulated; NRD, nodule versus root downregulated; NRN, nodule versus root not regulated; NDA, nodule development apical zone; NDD, nodule development differentiation zone; NDN, nodule development not regulated.