| Literature DB >> 28616075 |
Etienne Jourdier1, Lyam Baudry2,3, Martial Marbouty2,3, Frédérique Bidard1, Dante Poggi-Parodi1, Yoan Vicq1, Romain Koszul2,3, Antoine Margeot1.
Abstract
BACKGROUND: The presence of low complexity and repeated regions in genomes often results in difficulties to assemble sequencing data into full chromosomes. However, the availability of full genome scaffolds is essential to several investigations, regarding for instance the evolution of entire clades, the analysis of chromosome rearrangements, and is pivotal to sexual crossing studies. In non-conventional but industrially relevant model organisms, such as the ascomycete Trichoderma reesei, a complete genome assembly is seldom available.Entities:
Keywords: Centromere; Chromosomal contact; Chromosome conformation capture; GRAAL; Genome assembly; Hi-C; Karyotype; Translocation; Trichoderma reesei
Year: 2017 PMID: 28616075 PMCID: PMC5469131 DOI: 10.1186/s13068-017-0837-6
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Fig. 1T. reesei scaffold reassembly in seven chromosomes. T. reesei scaffolds from the JGI reference genome have been reassembled using chromosomal conformation capture (3C) sequencing data. a Contact matrix resulting from GRAAL reassembly. Red bars indicate the boundaries of the seven chromosome; centromere positions are represented by blue dots. b Order and orientation of the reassembled scaffolds in the seven chromosomes. Orientation uncertainties are noted with a question mark. Scaffolds 1, 2, 5, and 28 that were misassembled in the reference genome are shown in green, blue, orange, and red, respectively. Centromere positions are represented by blue dots
Size (bp) , number of genes, and gene density (nb of genes per kb) of T. reesei QM6a chromosomes
| Genetic element | Size | Number of genes | Gene density |
|---|---|---|---|
| Chromosome I | 6,647,935 | 1817 | 0.27 |
| Chromosome II | 5,980,447 | 1701 | 0.28 |
| Chromosome III | 5,112,650 | 1336 | 0.26 |
| Chromosome IV | 4,337,413 | 1162 | 0.27 |
| Chromosome V | 3,979,336 | 1092 | 0.27 |
| Chromosome VI | 3,567,305 | 983 | 0.28 |
| Chromosome VII | 3,660,386 | 1022 | 0.28 |
| Unassembled scaffolds | 163,868 | 16 | |
| Total | 33,449,340 | 9129 |
Gene annotation was based on the JGI Filtered Models set of genes
T. reesei QM6a centromeres
| Chr | Location on chr (Mb) | Between scaffolds | Scaffolds involved | Size (kb) | %AT | Nb of genes (gene IDs) |
|---|---|---|---|---|---|---|
| Scaffolds with centromere signature reassembled in chromosomes | ||||||
| chr I | 3.12 | 21(−) and 4(+) | 55 | 34 | 77.9 | 4 (112,674, 112,675, 112,676, 112,677) |
| chr II | 1.93 | 10(−) and 8(+) | 66 + 59 | 30 | 70.0 | 3 (71,146, 43,199, 42,942) |
| chr III | 1.71 | 40(+) and 2a(−) | Unknown | |||
| chr IV | 1.48 | 17(+) and 20(−) | 56 | 32 | 74.2 | 2 (112,678, 112,679) |
| chr V | 1.12 | 18(+) and 28a(−) | 60 + 61 | 32 | 77.0 | 1 (112,683) |
| chr VI | 1.67 | 37(+) and 39(+) | 51 | 43 | 76.3 | 2 (112,649, 73,103) |
| chr VII | 1.73 | 16(−) and 3(−) | 52 | 41 | 76.7 | 1 (112,651) |
| Other scaffolds with centromere signature but not reassembled | ||||||
| 57 | 26 | 76.6 | 0 | |||
| 58 | 21 | 78.7 | 3 (112,680, 112,681, 112,682) | |||
| 65 | 13 | 81.7 | 1 (112,689) | |||
Chromosomal contact data were used to identify the location of the centromeres on the chromosomes. Centromeres were all identified in small scaffolds, not in the middle of well-assembled scaffolds
Translocation and large deletion events found in GRAAL reassembly of T. reesei Rut-C30 with respect to QM6a
| Translocation | Location on scaffolds (this study) | Location on scaffolds [ | Mapping on QM6a chromosomes |
|---|---|---|---|
| n° 1 | scaffold_2: 556 ± 22 kb | scaffold_2: 546,703 bp | chr III: 3,166,447 |
| scaffold_4: 1,197 ± 25 kb | scaffold_4: 1,204,862 bp | chr I: 4,342,096 | |
| n° 2 | scaffold_4: 750 ± 27 kb | scaffold_4: 748,277 bp | chr I: 3,885,511 |
| scaffold_22: 138 ± 31 kb | scaffold_22: 139,515 bp | chr VI: 3,165,364 | |
| n° 3 | scaffold_22: 138 ± 31 kb | scaffold_22: 139,476 bp | chr VI: 3,165,325 |
| scaffold_48: 0 ± 35 kb | scaffold_48: 1667 bp | chr I: 5,018,020 |
Newly acquired 3C-seq data of T. reesei Rut-C30 strain were used to reassemble the reference genome. Comparison with QM6a reassembly allowed the identification of three chromosomal translocations and one large deletion. The position of these rearrangements is consistent with former work [14, 15]
Fig. 2Chromosome maps of T. reesei QM6a and Rut-C30 strains. Chromosome maps of T. reesei QM6a (a) and Rut-C30 and NG14 (b) strains were identified by reassembly of the JGI reference genome using 3C sequencing data for each strain. For Rut-C30 map, the colors of chromosome fragments are consistent with their colors in QM6a map to clearly show chromosomal rearrangements. Some emblematic genes were chosen along the sequence to be used as location markers (list available in Additional file 6). The Rut-C30 85 kb deletion event on chr. VI is shown by the lack of pks1 gene. Centromere locations are shown by restricted width. c Possible scenario (among others) from QM6a to Rut-C30. Translocations are numbered according to Table 3
Translocation breakpoints of various T. reesei strains genomes
| Translocation breakpoints | Location on QM6a scaffolds [ | Mapping on QM6a chromosomes | |
|---|---|---|---|
| QM9414 & QM9123 | n°1 | scaffold_4: 1,190,139 | chr I: 4,327,373 |
| scaffold_14: 118,472 | chr II: 4,693,330 | ||
| n°2 | scaffold_9: 787,779 | chr VI: 2,237,971 | |
| scaffold_27: 140,159 | chr II: 5,788,998 | ||
| CBS 999.97 (1-1, | Resulting in D-segment | scaffold_36: 54,323 | chr II: 5,441,472 |
| Resulting in L-segment | scaffold_33: 33,249 | chr IV: 4,304,165 | |
| QM9978 | n°1 | scaffold_1: 96,633 | chr V: 1,604,851 |
| scaffold_16: 631,551 | chr VII: 1,076,804 |
Translocation breakpoints were mapped on the superscaffolds generated by GRAAL
Fig. 3Genealogy and likely chromosome structure of various T. reesei strains. Translocation breakpoints (Table 4) and QM6a chromosome assembly (Fig. 2) were used to infer the likely chromosome structure of various T. reesei strains. Chromosome fragment colors and marker genes are consistent to QM6a map (Fig. 2: chromosome maps of T. reesei QM6a and Rut-C30 strains). a Genealogy of T. reesei strains. b Likely chromosome map of QM9414 and QM9123. c Likely chromosome map of QM9978. d Likely chromosome map of CBS999.97 (1-1, re)
Statistics on chromosome fragments
| Strain | Chromosome | Fragment size (kb) | Nb of genes |
|---|---|---|---|
| CBS 999.97 (1-1, | chr II => chr IV (D-segment) | 539 | 167 genes |
| chr IV => chr II (L-segment) | 33 | 11 genes | |
| QM9414 & QM9123 | chr I => chr II | 2321 | 634 genes |
| chr II => chr I | 1096 | 322 genes | |
| chr II => chr VI | 192 | 63 genes | |
| chr VI => chr I | 1329 | 369 genes | |
| QM9978 | chr V => chr VII | 2374 | 644 genes |
| chr VII => chr V | 1077 | 290 genes | |
| Rut-C30 | chr I => chr III | 1133 | 309 genes |
| chr I => chr VI | 1630 | 442 genes | |
| chr III => chr III | 1976 | 485 genes | |
| chr VI => chr I | 402 | 114 genes |
For each of the breakpoint described in Tables 3 and 4, the size and number of genes of the resulting chromosome fragment (from the breakpoint to the telomere or to the next breakpoint) were calculated. The only dispensable fragment is the L-segment described in CBS999.97 (1-1, re) [10]
Fig. 4Possible chromosome structures in F1 progeny resulting from a crossing between a MAT1-1 female fertile QM6a strain and Rut-C30 strain. Using the chromosome structure of QM6a and Rut-C30 strains, we enumerated the possible chromosome structures in F1 progeny (only chromosomes I, III, and VI are shown here with colors consistent to Fig. 3c). For each possible structure, the fragmental diploidy or aneuploidy is shown. Since the chromosome fragments contain essential genes, segmental aneuploidy results in inviable progeny
Analyses of possible chromosome structures for different crossing experiments
| Crossing experiment | Nb ≠ chr | Total possible structures | Non-viable | Viable | Possible viable structures different from parental ones |
|---|---|---|---|---|---|
| CBS999.97 (1-1, | 2 | 22 = 4 | 1 | 3 (75%) | 1 structure with chr II fragment (D-segment) diploidy |
| CBS999.97 (1-1, | 4 | 24 = 16 | 12 | 4 (25%) | 1 structure with chr II fragment diploidy |
| 1 structure with chr II fragment diploidy and chr VI fragment diploidy | |||||
| CBS999.97 (1-1, | 5 | 25 = 32 | 23 | 9 (28%) | 1 haploid with QM6a structure, |
| 1 crossed-haploid, | |||||
| 4 structures with 1 chr fragment diploidy, | |||||
| 1 structure with 2 chr fragment diploidy | |||||
| QM6a ( | 0 | 1 | 0 | 1 (100%) | None |
| QM6a ( | 3 | 23 = 8 | 6 | 2 (25%) | None |
| QM6a ( | 3 | 23 = 8 | 5 | 3 (38%) | 1 structure with chr I fragment diploidy |
The first three cases have already been experimentally described [10]. The next 3, involving a MAT1-1 female fertile (ff) QM6a strain, have not yet been described. We assumed that crossing-over were possible but not in translocated parts