| Literature DB >> 29096008 |
Octavio Manuel Palacios-Gimenez1,2, Vanessa Bellini Bardella1, Bernardo Lemos2, Diogo Cavalcanti Cabral-de-Mello1.
Abstract
Satellite DNA (satDNA) is an abundant class of non-coding repetitive DNA that is preferentially found as tandemly repeated arrays in gene-poor heterochromatin but is also present in gene-rich euchromatin. Here, we used DNA- and RNA-seq from Gryllus assimilis to address the content and transcriptional patterns of satDNAs. We also mapped RNA-seq libraries for other Gryllus species against the satDNAs found in G. assimilis and G. bimaculatus genomes to investigate their evolutionary conservation and transcriptional profiles in Gryllus. Through DNA-seq read clustering analysis using RepeatExplorer, dotplots analysis and fluorescence in situ hybridization mapping, we found that ∼4% of the G. assimilis genome is represented by 11 well-defined A + T-rich satDNA families. These are mainly located in heterochromatic areas, with some repeats able to form high-order repeat structures. By in silico transcriptional analysis we identified satDNAs that are conserved in Gryllus but differentially transcribed. The data regarding satDNA presence in G. assimilis genome were discussed in an evolutionary context, with transcriptional data enabling comparisons between sexes and across tissues when possible. We discuss hypotheses for the conservation and transcription of satDNAs in Gryllus, which might result from their role in sexual differentiation at the chromatin level, heterochromatin formation and centromeric function.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29096008 PMCID: PMC5909420 DOI: 10.1093/dnares/dsx044
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Main features of the satDNAs isolated from G. assimilis genome
| Repeat family | Monomer length (bp) | AT % | Genome proportion% | Nucleotide divergence (±SE) % | Reads/ Contigs | Total of monomers into clusters | Max. number of tandem arrays per contigs | Total repeat family length (kb) |
|---|---|---|---|---|---|---|---|---|
| Gas1 | 11 | 54.5 | 1.35 | 8.5 (±2.9) | 222.81 | 3, 138 | 216 | 34.518 |
| Gas2 | 205 | 64.1 | 1.03 | 15.5 (±0.9) | 95.35 | 112 | 4 | 22.96 |
| Gas3 | 187 | 59.4 | 0.246 | 9.3 (±1) | 90.24 | 21 | 2 | 3.927 |
| Gas4 | 73 | 43.8 | 0.242 | 30.5 (±2.9) | 696.71 | 51 | 25 | 3.723 |
| Gas5 | 165 | 63.6 | 0.212 | 16.6 (±1.2) | 22.08 | 71 | 5 | 11.715 |
| Gas6 | ||||||||
| Gas6-1 | 199 | 38.7 | 0.151 | 7 (±1) | 1218.2 | 8 | 3 | 1.592 |
| Gas6-2 | 200 | 35.2 | 0.182 | 21.7 (±1.7) | 915.625 | 6 | 3 | 1.224 |
| Gas7 | 19 | 73.7 | 0.150 | 6.4 (±1.5) | 377.62 | 157 | 31 | 2.983 |
| Gas8 | ||||||||
| Gas8-1 | 179 | 65.9 | 0.144 | 2.8 (±1.3) | 724.75 | 2 | 2 | 0.358 |
| Gas8-2 | 181 | 62.4 | 0.127 | 5.4 (±0.8) | 73 | 12 | 2 | 2.172 |
| Gas9 | 82 | 46.3 | 0.049 | 15.7 (±2.1) | 14.25 | 15 | 3 | 1.29 |
| Gas10 | 161 | 68.9 | 0.019 | 13.2 (±1.6) | 21.4 | 8 | 2 | 1.288 |
| Gas11 | 10 | 50 | 0.014 | 3.9 (±1.1) | 279 | 45 | 35 | 0.45 |
SE, standard deviation.
Figure 1Chromosomal location of nine satDNAs in mitotic chromosomes of embryos of G. assimilis by FISH. The satDNA names are shown in the images. Asterisks indicate chromosomes with signals in both termini.
Figure 2Chromosomal location of the Gas6 (a and b) and Gas8 (c and d) satDNAs subfamilies in mitotic chromosomes of embryos of G. assimilis by FISH. The satDNA names are shown in the images. Note the specific chromosomal localization on the pair 1 of the Gas6 subfamilies (a and b) contrasting with the scattered clusters for Gas8 subfamilies (c and d). The insets in (a) and (b) shows the pair 1 with overlapped hybridization signals for the satDNAs and 18 S rDNA. Asterisks indicate chromosomes with signals in both termini.
Figure 3Differential expression of satDNAs between male and female in distinct body parts in the G. assimilis genome (a), between different tissues in G. firmus female (b) and whole animal samples in G. rubens (c). The quantification method applied is FPKM (fragments per kilo-base of transcript per million mapped reads, the expression value obtained after normalization of read counts by both transcript length and number of mapped reads in each RNA-seq library). FBLWFFM, fat body from long winged female with functional flight muscle; FBSWFIF, fat body from short winged female incapable flight; LWFFM, long winged female with functional flight muscle; LWFHFM, long winged female with histolyzed flight muscle.
Number of raw reads from each tissue sequencing library that align to each G. assimilis satDNAs studied and their proportion with respect to the total number of reads (i.e. number of raw reads that align to a satDNA divided by the total number of raw reads in the sequencing library) in the gDNA and in each of the four different tissue transcriptomes obtained by Illumina sequencing
| SatDNA | Genome proportion | Transcriptome | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Male head | Female head | Testis | Ovary | ||||||||||
| Reads | Proportion | FPKM | Reads | Proportion | FPKM | Reads | Proportion | FPKM | Reads | Proportion | FPKM | ||
| Gas1 | 0.0135 | 0 | 0 | 0 | 0 | 0 | 0 | 714 | 5.3E–06 | 2.7E + 07 | 28 | 2.1E–07 | 1.9E + 07 |
| Gas3 | 0.00246 | 62 | 4.3E–07 | 3.2E + 06 | 134 | 8.6E–07 | 3.1E + 06 | 106 | 7.9E–07 | 392,449 | 0 | 0 | 0 |
| Gas5 | 0.00212 | 26 | 1.8E–07 | 3.1E + 06 | 70 | 4.5E–07 | 2.3E + 06 | 318 | 2.4E–06 | 11.8E + 06 | 32 | 2.4E–07 | 4.7E + 06 |
| Gas8-2 | 0.00127 | 0 | 0 | 0 | 0 | 0 | 0 | 66 | 4.9E–07 | 385,392 | 0 | 60 | 0 |
| Total reads | 143,521,774 | 156,505,578 | 133,834,504 | 135,002,056 | |||||||||
The quantification scaling method applied is FPKM (fragments per kilo-base of transcript per million mapped reads, the expression value obtained after normalization of read counts by both transcript length and number of mapped reads in each RNA-seq library).
Table showing the number of raw reads from each tissue sequencing library from G. firmus female that align to each of the G. assimilis (Gas) and G. bimaculatus (GB) satDNAs studied and their proportion with respect to the total number of reads (i.e. number of raw reads that align to a satDNA divided by the total number of raw reads in the sequencing library) in each of the four different tissue transcriptomes obtained by Illumina sequencing
| SatDNA | Transcriptome | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FBLWFFM | FBSWFIF | LWFFM | LWFHFM | |||||||||
| Reads | Proportion | FPKM | Reads | Proportion | FPKM | Reads | Proportion | FPKM | Reads | Proportion | FPKM | |
| Gas3 | 6,200 | 1.09E–04 | 5.7E + 06 | 4,690 | 1.2E–04 | 5.82E + 06 | 453 | 1.8E–05 | 7.7E + 06 | 451 | 2.4E–05 | 6.2E + 06 |
| Gas5 | 86 | 1.5E–06 | 117, 117 | 68 | 1.7E–06 | 134,350 | 0 | 0 | 0 | 22 | 1.2E–06 | 443,217 |
| Gas8-2 | 33 | 5.8E–07 | 546, 893 | 12 | 3.0E–07 | 475,428 | 0 | 0 | 0 | 0 | 0 | 0 |
| Gas11 | 197 | 3.5E–06 | 1.23E + 06 | 113 | 2.8E–06 | 949,536 | 0 | 0 | 0 | 0 | 0 | 0 |
| GBH535 | 0 | 0 | 0 | 17 | 4.2E–07 | 91,237.1 | 0 | 0 | 0 | 0 | 0 | 0 |
| Total reads | 56,930,863 | 40,216,469 | 25,270,610 | 18,443,958 | ||||||||
The quantification scaling method applied is FPKM (fragments per kilo-base of transcript per million mapped reads, the expression value obtained after normalization of read counts by both transcript length and number of mapped reads in each RNA-seq library). FBLWFFM, fat body from long winged female with functional flight muscle; FBSWFIF, fat body from short winged female incapable flight; LWFFM, long winged female with functional flight muscle; LWFHFM, long winged female with histolyzed flight muscle.
Table showing the number of raw reads from whole animal samples sequencing library from G. rubens that align to each of the G. assimilis (Gas) and G. bimaculatus (GB) satDNAs studied and their proportion with respect to the total number of reads (i.e. number of raw reads that align to a satDNA divided by the total number of raw reads in the sequencing library) in the transcriptome obtained by Illumina sequencing
| satDNA | Reads | Proportion | FPKM |
|---|---|---|---|
| Gas3 | 14,144 | 1.8E–5 | 4.1E + 06 |
| Gas5 | 1,206 | 1.5E–6 | 476,399.7 |
| Gas6–1 | 46 | 5.8E–08 | 11,844.3 |
| Gas8–2 | 142 | 1.8E–07 | 44,825.5 |
| GBH535 | 4,348 | 5.5E–06 | 656,004.45 |
| Total reads | 792,280,896 |
The quantification scaling method applied is FPKM (fragments per kilo-base of transcript per million mapped reads, the expression value obtained after normalization of read counts by both transcript length and number of mapped reads in the RNA-seq library).