| Literature DB >> 28367986 |
Beatriz Navarro-Domínguez1, Francisco J Ruiz-Ruano1, Josefa Cabrero1, José María Corral2,3, María Dolores López-León1, Timothy F Sharbel2,4, Juan Pedro M Camacho1.
Abstract
For many years, parasitic B chromosomes have been considered genetically inert elements. Here we show the presence of ten protein-coding genes in the B chromosome of the grasshopper Eyprepocnemis plorans. Four of these genes (CIP2A, GTPB6, KIF20A, and MTG1) were complete in the B chromosome whereas the six remaining (CKAP2, CAP-G, HYI, MYCB2, SLIT and TOP2A) were truncated. Five of these genes (CIP2A, CKAP2, CAP-G, KIF20A, and MYCB2) were significantly up-regulated in B-carrying individuals, as expected if they were actively transcribed from the B chromosome. This conclusion is supported by three truncated genes (CKAP2, CAP-G and MYCB2) which showed up-regulation only in the regions being present in the B chromosome. Our results indicate that B chromosomes are not so silenced as was hitherto believed. Interestingly, the five active genes in the B chromosome code for functions related with cell division, which is the main arena where B chromosome destiny is played. This suggests that B chromosome evolutionary success can lie on its gene content.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28367986 PMCID: PMC5377258 DOI: 10.1038/srep45200
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Identification of protein-coding genes located in B chromosomes of the grasshopper E. plorans, using the number of mapped reads that map to the CDSs found in the de novo assembled transcriptome, in the 0B (X axis) and 4B (Y axis) libraries of genomic DNA.
Red triangles represent those sequences that code for known proteins, blue diamonds represent those sequences that code for transposable elements and grey circles represent non-annotated CDSs. The CDSs corresponding to the 10 genes found in the B chromosome are labelled.
Results of mapping gDNA reads from 0B and 4B males on transcriptome CDSs.
| Transcriptome CDS | Length | gDNA reads mapped | Log2(4B/0B) | Annotation | |
|---|---|---|---|---|---|
| 0B | 4B | ||||
| comp59256_c3_seq2|m.36041 | 468 | 3 | 211 | 6.14 | TE |
| comp61215_c1_seq1|m.53202 | 519 | 3 | 145 | 5.59 | |
| comp61215_c0_seq1|m.53195 | 906 | 13 | 372 | 4.84 | |
| comp60327_c0_seq1|m.44236 | 966 | 10 | 222 | 4.47 | |
| comp61215_c0_seq1|m.53194 | 1338 | 11 | 231 | 4.39 | |
| comp62628_c0_seq2|m.69037 | 3012 | 40 | 758 | 4.24 | |
| comp61215_c5_seq1|m.53215 | 369 | 7 | 113 | 4.01 | |
| comp59256_c1_seq2|m.36039 | 441 | 19 | 250 | 3.72 | NA |
| comp61215_c2_seq1|m.53203 | 684 | 4 | 46 | 3.52 | |
| comp62255_c0_seq1|m.65104 | 1767 | 17 | 192 | 3.50 | |
| comp59256_c1_seq1|m.36038 | 324 | 10 | 111 | 3.47 | NA |
| comp59256_c0_seq1|m.36037 | 351 | 6 | 58 | 3.27 | NA |
| comp59183_c0_seq1|m.35620 | 2367 | 28 | 218 | 2.96 | |
| comp43869_c0_seq1|m.5004 | 1098 | 215 | 1492 | 2.79 | TE |
| comp40101_c0_seq1|m.3160 | 411 | 87 | 506 | 2.54 | NA |
| comp61379_c0_seq1|m.55345 | 1653 | 15 | 76 | 2.34 | |
| comp57756_c0_seq1|m.28314 | 357 | 73 | 367 | 2.33 | TE |
| comp57756_c0_seq1|m.28313 | 972 | 286 | 1415 | 2.31 | TE |
| comp40101_c0_seq1|m.3162 | 315 | 85 | 380 | 2.16 | NA |
| comp62313_c1_seq11|m.65571 | 351 | 44 | 180 | 2.03 | NA |
| comp61143_c1_seq6|m.51252 | 837 | 25 | 102 | 2.03 | TE |
| comp62628_c0_seq8|m.69051 | 1728 | 181 | 692 | 1.93 | |
| comp62313_c1_seq1|m.65560 | 498 | 324 | 1174 | 1.86 | NA |
| comp62453_c1_seq3|m.66898 | 4269 | 89 | 310 | 1.80 | |
| comp52884_c0_seq1|m.14126 | 651 | 235 | 780 | 1.73 | NA |
| comp57756_c1_seq1|m.28315 | 1374 | 605 | 1969 | 1.70 | TE |
| comp62575_c1_seq4|m.68249 | 2322 | 104 | 336 | 1.69 | |
| comp46268_c0_seq2|m.7460 | 339 | 320 | 1017 | 1.67 | NA |
| comp62313_c1_seq11|m.65572 | 312 | 75 | 235 | 1.65 | NA |
Only the 29 contigs with 40 or more reads mapped and log2(4B/0B) ≥ 1.58 are shown, 13 of which corresponded to protein-coding genes, 6 to transposable elements (TE) and 10 failed to be annotated (NA).
Estimation of the number of gene copies in the B24 chromosome of E. plorans.
| Gene | Acc.No. | B-activity | ||||
|---|---|---|---|---|---|---|
| KX034164 | 0.0127 | 0.1161 | 21 | 5 | Yes | |
| KX034167 | 0.0099 | 0.054 | 12 | 4 | No | |
| KX034169 | 0.0484 | 0.2057 | 10 | 2 | Yes | |
| KX034170 | 0.0123 | 0.3313 | 61 | 12 | No | |
| KX034165 | 0.0139 | 0.1582 | 26 | 8 | Yes | |
| KX034166 | 0.0137 | 0.2856 | 47 | 12 | Yes | |
| KX034168 | 0.0073 | 0.0814 | 25 | 4 | No | |
| KX034171 | 0.0103 | 0.3235 | 71 | 23 | Yes | |
| KX034172 | 0.0117 | 0.0608 | 12 | 3 | No | |
| KY211688 | 0.0218 | 0.1088 | 11 | 2 | No |
The first four genes were found complete in the B-carrying genome, and the latter six were truncated. Acc.No. = Accession number (GenBank). Abundances in the 0B and 4B genomes (A0 and A4) are multiplied by 106. N4 = Number of copies in the 4B genome. N = Number of copies per B chromosome.
Figure 2Coverage for the CIP2A transcript in the gDNA (0B and 4B) and RNA (0B and 1B) Illumina reads (A), and qPCR on gDNA (B) and cDNA (C). Note that coverage was higher in the 4B library across the complete sequence length, including the full CDS (delimited by the dotted vertical lines), the 5′ UTR (from the 5′ end to the first dotted line) and 3′ UTR (from the second dotted line to the 3′ end). Likewise, note the higher coverage for this transcript in the B-carrying RNA library. The shaded zone in A marks the region amplified by qPCR. qPCR on gDNA (B) revealed that genomic copy number for the CIP2A gene increases with B chromosome number, following a dose-dependent pattern, thus supporting its presence in the B chromosome. qPCR on cDNA (C) revealed that CIP2A is expressed in all tissues and sexes analyzed, also following a dose-depending pattern and suggesting the active transcription of B chromosome gene copies. RQ = Relative quantity. NREQ = Normalized relative expression quantity. P = P-value and Pb = Sequential Bonferroni P-value for Spearman rank correlation (B) and Kruskal-Wallis (C) analyses.
Figure 3Coverage for the MYCB2 transcript in the gDNA (0B and 4B) and RNA (0B and 1B) Illumina reads (A), and qPCR on gDNA (B) and cDNA (C). Note that less than half of MYCB2 CDS (between the dotted vertical lines) showed high coverage in the 4B library, specifically from position 8961 to the 3′ end (A), suggesting that this B chromosome gene is truncated. Two regions were selected for qPCR amplification of this gene, one within the region being apparently absent in the B chromosome (shaded zone 1) and the other within the region being present in it (shaded zone 2). qPCR on gDNA with zone 1 primers showed that copy number for this gene region was independent on the number of B chromosomes (B1). However, qPCR on gDNA with zone 2 primers showed that abundance of this MYCB2 gene region increased with B chromosome number in a dose-dependent pattern (B2). Likewise, qPCR on cDNA showed that MYCB2 expression was independent of B chromosome number when probed with zone 1 primers (C1) but it increased in a dosage-dependent pattern with zone 2 primers (C2), suggesting the active transcription of B chromosome truncated gene copies. RQ = Relative quantity. NREQ = Normalized relative expression quantity. P = P-value and Pb = Sequential Bonferroni P-value for Spearman rank correlation (B) and Kruskal-Wallis (C) analyses.
KOG classification of genes located on the B chromosome.
| Gene | Hit | E-Value | Description | Mult/Single | Class | Class Description |
|---|---|---|---|---|---|---|
| KOG2025 | 7.00E-31 | Chromosome condensation complex Condensin I, subunit G | Multiple | B/D | Chromatin structure and dynamics/Cell cycle control and mitosis | |
| KOG0161 | 4.00E-10 | Myosin class II heavy chain | Single | Z | Cytoskeleton | |
| KOG0410 | 2.00E-85 | Predicted GTP binding protein | Single | R | General function prediction only | |
| KOG4518 | 2.00E-75 | Hydroxypyruvate isomerase | Single | G | Carbohydrate transport and metabolism | |
| KOG0247 | 3.00E-84 | Kinesin-like protein | Single | Z | Cytoskeleton | |
| KOG2485 | 4.00E-86 | Conserved ATP/GTP binding protein | Single | R | General function prediction only | |
| KOG1428 | 7.00E-84 | Neuronal presynaptic protein Highwire/PAM/RPM-1 | Single | T | Signal transduction mechanisms | |
| KOG4237 | 0.00 | Extracellular matrix protein slit, contains leucine-rich and EGF-like repeats | Multiple | W | Extracellular structures/Signal transduction mechanism | |
| KOG0355 | 0.00 | DNA topoisomerase type II | Single | B | Chromatin structure and dynamics |
CKAP2 did not achieve any significant hits in KOG database.
Biological samples used for each experiment in the current work.
| Experiment | Technique | Sex | Body part | Bs | N |
|---|---|---|---|---|---|
| B chromosome gene content | Illumina WGS | Male | Hind leg | 0 | 1 |
| 4 | 1 | ||||
| qPCR (DNA) | Male | Body | 0 | 2 | |
| 1 | 6 | ||||
| 2 | 5 | ||||
| 3 | 1 | ||||
| B chromosome expression | Illumina RNA-seq | Female | Full body | 0 | 1 |
| 1 | 1 | ||||
| qPCR (cDNA) | Male | Body/Testes | 0 | 2 | |
| 1 | 7 | ||||
| 2 | 8 | ||||
| 3 | 6 | ||||
| Female | Body/Ovary | 0 | 4 | ||
| 1 | 8 | ||||
| 2 | 8 | ||||
| 3 | 1 |
Bs = number of B chromosomes; N = number of individuals.
Sequence of all primers used for qPCR experiments in this work.
| Gene | Forward primer (5′-3′) | Reverse primer (5′-3′) |
|---|---|---|
| TGGCGCTGGTACTGAGTATG | GATCCACCTGAAGAGCTTGG | |
| CAAAATGGCGTGCTGAAAG | CGTCTTTTGATTTAATAGTGGAATTTG | |
| TCTTCGATGTTTTGGCCTTC | TGGTCATCATTTGCCAGAGA | |
| GAGGTATGGAACACGCACAA | AGTGGCACGTTTCGTCTTCT | |
| CAACAGCGCCTGTCACTAAA | GCTGAGGTGTCTGCTCACAA | |
| CACTGGAGGACGCGATGT | CCGGAGTGAGATCAAAAGACC | |
| TGTCCGGACGAGTTGACA | CGGAACAGAATGTGGATTGA | |
| CCACATCCAGATTGCACAAG | ACTCCAAACCAATCCAACCA | |
| CAGGGCACAAATGAAAATCC | TTGCTGCTTCTCTTCATCCA | |
| AGCTCCAGTAGGTGCAAAGG | GGCCTGCTTCAACATCTCT | |
| ACCCGTCACATACACAACGA | CCATCACCATTGCTTGTACG | |
| GCAAGGAAGAAGAGGAAGCA | CCAGTGCCATAACCCAGAAC | |
| AACATGCTGCATTGCGACT | CACTTGAAGTCGTGGTCGTG | |
| AACCTGTCGGAAAGAGCAAA | TTCGCACAATGTCAACATCC | |
| GGAACACTCGGCGTACCA | GCAGAGTCCTTCCCACCA | |
| GCCTGTCAAGGGCAAGAA | TCCAACTTCGGAGCAACC |