Literature DB >> 29018237

High-throughput analysis unveils a highly shared satellite DNA library among three species of fish genus Astyanax.

Duílio M Z de A Silva1, Ricardo Utsunomia2, Francisco J Ruiz-Ruano3, Sandro Natal Daniel4, Fábio Porto-Foresti4, Diogo Teruo Hashimoto5, Claudio Oliveira2, Juan Pedro M Camacho3, Fausto Foresti2.   

Abstract

The high-throughput analysis of satellite DNA (satDNA) content, by means of Illumina sequencing, unveiled 45 satDNA families in the genome of Astyanax paranae, with repeat unit length (RUL) ranging from 6 to 365 bp and marked predominance of short satellites (median length = 59 bp). The analysis of chromosomal location of 35 satDNAs in A. paranae, A. fasciatus and A. bockmanni revealed that most satellites are shared between the three species and show highly similar patterns of chromosome distribution. The high similarity in satellite DNA content between these species is most likely due to their recent common descent. Among the few differences found, the ApaSat44-21 satellite was present only on the B chromosome of A. paranae, but not on the A or B chromosomes of the two other species. Likewise, the ApaSat20-18 satellite was B-specific in A. paranae but was however present on A and B chromosomes of A. fasciatus and A. bockmanni. The isochromosome nature of B chromosomes in these species was evidenced by the symmetric location of many satDNAs on both B chromosome arms, and the lower symmetry observed in the A. fasciatus BfMa chromosome suggests that it is older than those analyzed in A. paranae and A. bockmanni.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 29018237      PMCID: PMC5635008          DOI: 10.1038/s41598-017-12939-7

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Satellite DNA consists of arrays of tandemly repeated units of two or more nucleotides usually clustered in the heterochromatic regions of chromosomes, but also found on euchromatic regions[1,2]. The birth of a satDNA implies the de novo duplication of a DNA sequence in a specific genomic site, subsequent amplification and dissemination throughout the genome, and local massive amplification yielding clusters being cytologically visible[2]. According to the library hypothesis[3], related organisms share a common library of satellite DNA sequences and, in each species, certain variants may be amplified, generating differential collections of visible satellites in closely related species. It implies that the appearance of a “new” satDNA usually represents the amplification of one of the satellites already present at low level in the library. Moreover, Fry and Salser[3] suggested that the acquisition of a biological function might be responsible for the maintenance of a satellite sequence in the library over long evolutionary periods. In fact, some satDNAs play important genomic roles, such as telomere and centromere formation and function[4,5], and new functions have recently been attributed to some satellite DNAs[1]. On the other hand, other satellites remain as genomic “junk” if they do not have an immediate use but may occasionally acquire one (for review see Garrido-Ramos[6]). Most current satDNAs were uncovered through the ladder pattern yielded by restriction enzyme digestion followed by gel electrophoresis, although this technique shows limitations[7,8]. The advent of Next Generation Sequencing (NGS) and the emergence of bioinformatics tools, such as RepeatExplorer[9], have greatly facilitated satDNA discovery, and a new toolkit based on the former software has uncovered unsuspected levels of intragenomic diversity in satDNA content[2]. For instance, these latter authors unveiled 62 satDNA families in the genome of the migratory locust, a species where the conventional restriction-electrophoresis protocol had failed to find any satDNA. This impelled these authors to suggest the term satellitome for the whole collection of satDNA families found in a single genome, and they suggested that the high-throughput analysis of the satellitome might illuminate many aspects of satDNA evolution[2]. Satellite DNA has been profusely studied in plants and animals[7,10-12]. In contrast, satDNA has scarcely been reported in fish, with only a few exceptions[13-19]. In all cases, no more than seven satellite DNAs were described for a given species[19]. Among the Neotropical ichthyofauna, the genus Astyanax (Baird & Girard) is one of the most species-rich group. It is currently composed of 244 species[20] characterized by wide genome plasticity, with diploid numbers ranging from 2n = 36 in A. shubarti to 2n = 50 in several species[21,22] and the occurrence of variation in karyotype formulas, B chromosome presence, differential distribution of repetitive DNA and heterochromatin and hybrid cytotypes[23]. The high number of species makes this genus an excellent material for testing the satDNA library hypothesis[3]. Several repetitive DNAs have been physically mapped in Astyanax, including ribosomal DNA, U snRNA genes, histone genes, transposable elements and microsatellites[24-26]. However, only one satDNA has been described in this genus, namely the As51 satDNA found in A. scabripinnis by digestion with the KpnI restriction enzyme[15]. Fluorescent in situ hybridization (FISH) mapping showed its location in non-centromeric heterochromatin, i.e., close to telomeric regions of the long arm of some acrocentric chromosomes, in the nucleolus organizer region and in the interstitial heterochromatin of chromosome 24[15]. In the B chromosome of this species, the As51 satDNA was found to be largely symmetrically on both arms, which suggested its isochromosome nature[15]. Later, several studies have reported the presence of the As51 satDNA on A chromosomes of other Astyanax species, but it was only detected in the B chromosome of A. fasciatus, in addition to those of A. scabripinnis (for a review, see Silva et al.[23]). Here, we perform a high-throughput analysis of satDNA content in Astyanax paranae, by means of Illumina sequencing of 0B and 1B genomes, using RepeatExplorer[9] and the satMiner toolkit recently developed by Ruiz-Ruano et al.[2]. This uncovered the presence of 45 satDNA families, 35 of which were PCR amplified on genomic DNA (gDNA) from this species to generate DNA probes for each satDNA. We then performed FISH analysis in A. paranae and two other species also carrying B chromosomes (A. fasciatus and A. bockmanni). These results have greatly increased the current knowledge on satellite DNA in Astyanax and provided new insights on B chromosome evolution.

Results

The satellitome in A. paranae

After nine iterations of the satMiner toolkit protocol (until no additional satDNA was uncovered), we found 45 different satDNA families (67 variants), with repeat unit lengths (RUL) ranging between 6 and 365 bp, and 59 bp median value (Table 1). Length distribution was thus clearly biased due to a predominance of short satellites, as more than half (33) showed RUL shorter than 100 bp (Supplementary Fig. S1). The A + T content of the consensus satDNA sequences varied between 30.3% and 75.1% among families, with 55.8% median value, indicating a slight bias towards A + T rich satellites. The Shapiro-Wilks test showed that A + T content was the only satellitome feature fitting a normal distribution (W = 0.978, P = 0.55), the remaining variables (RUL, abundance and divergence) being far from normality (P < 0.05 in all cases). For this reason, we used non-parametric tests for subsequent analysis.
Table 1

Main characteristics of the 45 satDNA families found in the genome of A. paranae by RepeatExplorer and satMiner analyses.

SFSatDNA familyRULA+T (%)VDivergence (%)Abundance (%)Log2 (1B/0B)
0B1B0B1B
ApaSat01-515154.985.695.536.3064.697−0.42
1ApaSat02-23623664.8114.9915.160.4480.371−0.27
ApaSat03-919151.654.964.90.230.185−0.32
1ApaSat04-23323363.1117.43170.1660.138−0.27
ApaSat05-232352.226.616.470.1650.076−1.11
ApaSat06-868646.414.954.620.0840.1570.91
ApaSat07-6-tel650.0197.860.0670.0950.51
ApaSat08-353562.936.156.330.0660.043−0.60
ApaSat09-212175.0214.8814.660.060.053−0.18
ApaSat10-17917963.114.023.950.0530.05−0.10
ApaSat11-222250.0111.36100.0530.048−0.13
ApaSat12-696960.913.53.610.0520.05−0.06
ApaSat13-222256.5310.69.590.0490.007−2.86
ApaSat14-18418463.0211.7811.690.0450.0540.25
ApaSat15-515154.926.416.340.0430.037−0.21
ApaSat16-545448.129.49.260.0420.0460.14
ApaSat17-36536551.812.242.180.040.0460.20
ApaSat18-585846.626.957.650.0340.033−0.04
ApaSat19-777766.215.154.880.0340.0430.32
ApaSat20-181850.0112.048.080.0330.1812.43
ApaSat21-686848.511.71.650.0230.009−1.28
ApaSat22-636356.5116.4816.190.0210.0260.32
ApaSat23-373743.213.753.950.020.006−1.68
ApaSat24-787856.415.735.770.0190.013−0.49
ApaSat25-272751.9110.9310.330.0180.0210.27
ApaSat26-19519565.118.249.670.0170.012−0.51
ApaSat27-17817839.917.217.660.0160.014−0.17
ApaSat28-525255.8116.1916.110.0150.0170.16
ApaSat29-525267.3113.1513.110.0150.014−0.10
ApaSat30-505064.0115.4514.470.0150.014−0.04
ApaSat31-16516563.019.349.620.0140.011−0.31
ApaSat32-858558.816.8616.120.0130.001−4.63
ApaSat33-11211264.314.833.970.0130.0230.85
ApaSat34-595937.311.481.440.0120.1043.07
ApaSat35-373748.614.634.820.0120.009−0.42
ApaSat36-212157.116.326.870.0110.01−0.24
ApaSat37-383857.925.014.760.0110.01−0.06
ApaSat38-10710741.1117.9415.10.0110.0180.79
ApaSat39-323250.017.3212.10.010.001−4.00
ApaSat40-18918965.619.637.70.0090.0271.53
ApaSat41-333330.3110.079.50.0080.0110.41
ApaSat42-909066.714.764.670.0060.006−0.06
ApaSat43-616168.9110.157.670.0060.0151.37
ApaSat44-212138.1125.925.430.0020.0414.34
ApaSat45-11311375.112.781.490.0010.0072.41
Total678.396.853

SF = superfamily, RUL = repeat unit length, V = number of variants. In each family, length and A + T content are given for the most abundant variant. Divergence per family is expressed as percentage of Kimura divergence.

Main characteristics of the 45 satDNA families found in the genome of A. paranae by RepeatExplorer and satMiner analyses. SF = superfamily, RUL = repeat unit length, V = number of variants. In each family, length and A + T content are given for the most abundant variant. Divergence per family is expressed as percentage of Kimura divergence. Spearman rank correlation analysis showed that RUL in the 45 satellite DNA families showed a positive correlation with the A + T content (rS = 0.34, t = 2.34, P = 0.024), indicating that longer satellites tend to be richer in A + T. However, these two parameters failed to show a significant correlation with abundance or divergence in the 0B and 1B genomes (P > 0.05 in all cases). Sequence comparison between repeat unit sequences of the 45 satDNA families detected homology only between ApaSat02-236 and ApaSat04-233 (78.8%), with a single variant each, and these two families were grouped into superfamily 1 (SF1). Remarkably, these two satDNAs show closely similar RUL, suggesting that the high divergence showed by both families (15% and 17%, respectively) was mostly due to nucleotide substitutions, whereas indels were small and rare (Supplementary Fig. S2). We tried to infer bioinformatically the presence of satDNA families in A. paranae B chromosomes, through their possible changes in abundance between 0B and 1B gDNA libraries. This revealed that 18 satDNA families showed positive values for log2 A1/A0, suggesting that they might be abundant in the B chromosome (Table 1). The remaining 27 satDNAs, however, showed negative values for this parameter, thus suggesting that their abundance might be lower in B than A chromosomes. Bearing in mind that the length of the metacentric B chromosome in A. paranae (BpM) represents approximately 8% of the total length of the haploid A chromosome set (Silva et al. unpublished), we estimate that, in case of complete absence of a given satDNA in the B chromosome, the maximum decrease in satDNA abundance, expected in a 1B genome, would be about 4%. However, 22 of these 27 satDNAs showed abundance decrease surpassing this threshold, suggesting that satDNA absence in the B chromosome does not explain the observed differences in satDNA abundance between the 0B and 1B individuals. We thus believe that these differences are most likely due to between-individual differences in satDNA abundance in A and/or B chromosomes. In fact, only 8 out of the 18 satDNAs showing higher abundance in the 1B genome were actually visualized by FISH on the B chromosome (see below). All these results indicate the convenience of separately sequencing several B-carrying and B-lacking individuals to lessen the effect of A chromosome variation.

Chromosome distribution of 35 satellite DNA families in A. paranae

We designed primers for PCR amplification of all 45 satDNAs found, but only 35 of them worked successfully on A. paranae gDNA. Given that this represents a huge collection of satDNAs for FISH analysis, we did not try to redesign additional primers for the 10 satDNAs whose PCR amplification had failed in the first try. Out of the 35 satellite families analyzed by FISH, 30 showed conspicuous clusters in at least one chromosome (c pattern) and 5 failed to yield a FISH signal, so we consider that they are non-clustered at cytological level (nc pattern) (Fig. 1). Clustered satDNAs were found in both heterochromatic and euchromatic regions (Supplementary Fig. S3). RUL values of clustered (median = 77.5) and non-clustered (median = 21) satellites showed similarly high variances (6448.3 and 1423.5, respectively) (Levene’s test: F = 2.5, df = 1, 33, P = 0.124) and the Mann-Whitney test showed that clustered satellites show significantly longer repeat units (U = 26, P = 0.021) (Fig. 2).
Figure 1

Examples of the chromosome distribution patterns of the SatDNAs found in A. paranae: non-clustered (a) and clustered (b). Each cell is shown with satDNA FISH (red) merged with DAPI (upper panel) and with satDNA FISH (lower panel). Note that the ApaSat36-21 does not show any FISH signal (non-clustered pattern). Bar = 10 μm.

Figure 2

Comparison of repeat unit length (RUL) between the satDNAs found in A. paranae according to their chromosome distribution pattern.

Examples of the chromosome distribution patterns of the SatDNAs found in A. paranae: non-clustered (a) and clustered (b). Each cell is shown with satDNA FISH (red) merged with DAPI (upper panel) and with satDNA FISH (lower panel). Note that the ApaSat36-21 does not show any FISH signal (non-clustered pattern). Bar = 10 μm. Comparison of repeat unit length (RUL) between the satDNAs found in A. paranae according to their chromosome distribution pattern.

Comparison of satellite distribution in two other Astyanax species

FISH analysis of the same 35 satellites in A. fasciatus and A. bockmanni showed the presence of clusters for 21 and 27 families, respectively (Table 2). We found generally good consistency between species for distribution patterns on A chromosomes, with 17 satellite families showing the same pattern in the three species and 10 showing the same pattern in two species. In this latter case, the absence of clusters in the third species could be due to satellite absence or else to its presence in a non-clustered pattern. These two possibilities could be tested by a detailed satellitome analysis in A. fasciatus and A. bockmanni. For this reason, we limited our observations to scoring how many satellites, out of the 35 analyzed in A. paranae, were clustered (c pattern) in A. fasciatus and A. bockmanni, as inferred from FISH analysis. In fact, 26 out of the 30 satDNA families which showed FISH signals on A. paranae chromosomes (c pattern) also showed signals on one or the two other Astyanax species. The four remaining satellites showed FISH signals in A. paranae but not in the two other species. This does not necessarily mean that these satellites are absent from the satDNA library in these two species, as they may be present but in a non-clustered pattern. The reverse situation was found for ApaSat20-18 and ApaSat36-21, as they were not visualized by FISH on A. paranae A chromosomes even though they were bioinformatically detected in the 0B genome (nc pattern), but the former satellite showed conspicuous FISH signals on A chromosomes of A. fasciatus and A. bockmanni (Fig. 3) whereas the latter showed them on those of A. bockmanni only (Table 2 and Supplementary Fig. S4).
Table 2

Chromosomal distribution patterns for 35 satDNA families found in A. paranae, defined by the presence of FISH signals on A chromosomes, as well as their distribution in A. fasciatus and A. bockmanni.

SFSatDNA familyChromosomal DistributionCluster presence
A. paranaeA. fasciatusA. bockmanniA. paranaeA. fasciatusA. bockmanni
ApaSat01-51ccc111
1ApaSat02-236ccc111
ApaSat03-91ccc111
1ApaSat04-233ccc111
ApaSat05-23cc101
ApaSat06-86ccc111
ApaSat07-6-telttt
ApaSat08-35cc110
ApaSat13-22ccc111
ApaSat14-184ccc111
ApaSat15-51ccc111
ApaSat17-365ccc111
ApaSat18-58ccc111
ApaSat19-77ccc111
ApaSat20-18nccc
ApaSat22-63c100
ApaSat23-37ccc111
ApaSat24-78ccc111
ApaSat26-195cc101
ApaSat27-178cc101
ApaSat28-52ccc111
ApaSat30-50ccc111
ApaSat31-165cc101
ApaSat32-85cc101
ApaSat33-112ccc111
ApaSat34-59cc110
ApaSat35-37c100
ApaSat36-21ncc
ApaSat37-62nc
ApaSat38-107nc
ApaSat39-32cc101
ApaSat40-189c100
ApaSat42-90c100
ApaSat43-61ccc111
ApaSat44-21nc
ApaSat45-113cc101
Total301924

c = clustered, t = telomeric, nc = non-clustered, 0 = cluster absence and 1 = cluster presence.

Figure 3

Mitotic metaphase cells of A. paranae (a), A. fasciatus (b) and A. bockmanni (c) showing the chromosome distribution of the ApaSat20-18 satDNA. The FISH signals are shown in red and are merged with DAPI in the upper panel. Bar = 10 μm.

Chromosomal distribution patterns for 35 satDNA families found in A. paranae, defined by the presence of FISH signals on A chromosomes, as well as their distribution in A. fasciatus and A. bockmanni. c = clustered, t = telomeric, nc = non-clustered, 0 = cluster absence and 1 = cluster presence. Mitotic metaphase cells of A. paranae (a), A. fasciatus (b) and A. bockmanni (c) showing the chromosome distribution of the ApaSat20-18 satDNA. The FISH signals are shown in red and are merged with DAPI in the upper panel. Bar = 10 μm. Both members of the SF1 superfamily (i.e., ApaSat02-236 and ApaSat04-233) showed the same chromosomal location on the pericentromeric region of many A chromosomes in the three species analyzed (Fig. 4 and Supplementary Fig. S5).
Figure 4

Mitotic metaphase cells of A. paranae (a), A. fasciatus (b) and A. bockmanni (c) showing the chromosome distribution of the ApaSat02-236 satDNA. The FISH signals are shown in red and are merged with DAPI in the upper panel. Bar = 10 μm.

Mitotic metaphase cells of A. paranae (a), A. fasciatus (b) and A. bockmanni (c) showing the chromosome distribution of the ApaSat02-236 satDNA. The FISH signals are shown in red and are merged with DAPI in the upper panel. Bar = 10 μm. Two satDNAs (ApaSat20-18 and ApaSat44-21) showed conspicuous clusters on B chromosomes but did not show a FISH signal on A chromosomes (Table 1), thus appearing to be B-specific in the A. paranae genome. However, they were detected bioinformatically in the 0B genome, although at very low abundance (0.033 and 0.002%, respectively) (Table 1). Both satDNAs show short RUL, but only ApaSat44-21 appears to be exclusive to the A. paranae 1B genome, as it was not detected in A. fasciatus or A. bockmanni chromosomes (Supplementary Fig. S6), whereas ApaSat20-18 is clustered in the A and B chromosomes of A. fasciatus and A. bockmanni (Supplementary Fig. 3a), indicating that this satellite might have originated in one of these two species and was transferred between species, perhaps with the B chromosome. It would thus be interesting to perform a detailed analysis of sequence variation for this satellite in A and B chromosomes of the three species to investigate if it arose in A or B chromosomes. Satellite distribution on B chromosomes showed a very high tendency toward symmetric location with respect to the centromere for those satellites showing non-centromeric location (Fig. 5). Assigning the value 1 to a symmetric pattern and 0 to a non-symmetric one, we calculated a symmetry index (SI) for each B chromosome as the average symmetry for all non-centromeric satellites found on it (Table 3). This showed that BpM and BbM were highly symmetric (SI = 1) and BfMa was poorly symmetric (SI = 0.45).
Figure 5

Distribution of satDNAs on the B chromosomes of A. paranae (BpM), A. fasciatus (BfMa) and A. bockmanni (BbM). The red numbers indicate catalog number for each satDNA. DAPI-stained chromosomes and satDNA hybridization patterns are displayed side-by-side for each B-variant. Note that all satDNAs showing signals on BpM and BbM chromosomes show symmetric distribution, whereas only five of them are symmetric in the BfMa chromosome.

Table 3

Chromosomal location, presence of clusters and symmetry for satDNA families in the B chromosomes of A. paranae (BpM), A. fasciatus (BfMa) and A. bockmanni (BbM).

SFSatDNA familyLocationCluster presenceSI
BpMBfMaBbMBpMBfMaBbMBpMBfMaBbM
ApaSat01-51q, ip, t; q, t01101
1ApaSat02-236pepe110
1ApaSat04-233pepe011
ApaSat05-23p, t; q, tp, t11010
ApaSat06-86pe; p, i, t; q, i, tpe; p, i; q, ip, t; q, t111111
ApaSat07-6-telttt
ApaSat08-35p, t; q, t1001
ApaSat13-22p, tq, t01101
ApaSat14-184p, t; q, tp, t; q, dp, t; q, t111111
ApaSat15-51p, t; q, t1001
ApaSat17-365p, i, t; q, i, t0101
ApaSat18-58p.i; q, ipe; p, i; q, i11011
ApaSat19-77p, i; q, iq, ip, t; q, t111101
ApaSat20-18p, t; q, tq, ip, i, t; q, i, t111101
ApaSat33-112p, t; q, tp, i; q, ip, i, t; q, i, t111111
ApaSat34-59p, t; q, t1001
ApaSat40-189pe; p, i, t; q, i, t1001
ApaSat42-90p, i, t; q, i, tp, i11010
ApaSat44-21pe; p, i; q, i1001
Total1414910.451

p = short arm, q = long arm, d = distal, i = interstitial, pe = pericentromeric, t = terminal. In the Cluster presence columns: 0 = cluster absence and 1 = cluster presence and in the SI columns: 0 = non-symmetric and 1 = symmetric.

Distribution of satDNAs on the B chromosomes of A. paranae (BpM), A. fasciatus (BfMa) and A. bockmanni (BbM). The red numbers indicate catalog number for each satDNA. DAPI-stained chromosomes and satDNA hybridization patterns are displayed side-by-side for each B-variant. Note that all satDNAs showing signals on BpM and BbM chromosomes show symmetric distribution, whereas only five of them are symmetric in the BfMa chromosome. Chromosomal location, presence of clusters and symmetry for satDNA families in the B chromosomes of A. paranae (BpM), A. fasciatus (BfMa) and A. bockmanni (BbM). p = short arm, q = long arm, d = distal, i = interstitial, pe = pericentromeric, t = terminal. In the Cluster presence columns: 0 = cluster absence and 1 = cluster presence and in the SI columns: 0 = non-symmetric and 1 = symmetric. Finally, out of the 30 satellite DNA families which showed a clustered pattern at cytological level, i.e. visible by FISH, in A. paranae, 26 were shared by the A chromosomes of one or the two other Astyanax species analysed here. In the case of B chromosomes, a total of 18 satellites were visualized on the B chromosomes of these three species, and only 12 were shared by Bs in two or three species, in high resemblance with the case of A chromosomes (Fisher exact test: P = 0.1007).

Discussion

We found 45 satDNA families in the genome of A. paranae by means of Next Generation Sequencing and bioinformatic analysis using a low-cost approach. This represents a huge leap in the knowledge of satDNA library[3] in Astyanax, a genus where full genome sequencing in A. mexicanus actually gave no information on satellite DNA[27]. The present results are especially valuable bearing in mind that 26 out of the 30 satDNA families showing FISH signals on A. paranae chromosomes (apart from ApaSat07-6-tel) also showed signals on A. fasciatus and/or A. bockmanni. The absence of FISH signals for a satDNA does not necessarily mean its genomic absence, as it can be in a non-clustered pattern, at cytological level, thus being invisible by FISH. Remarkable examples of this are ApaSat20-18 and ApaSat36-21, which gave no FISH signals in A. paranae A chromosomes but their presence in this species genome is granted by NGS and bioinformatic analyses. In fact, one or both satellites were conspicuously clustered in A. fasciatus and A. bockmanni (see Table 2), demonstrating that a same satDNA family can change its chromosomal distribution pattern during evolution in different species[2]. Taken together, these results suggest that the three species share most satDNAs. This might be due to a slow rate of satDNA turnover in this genus, but a recent molecular phylogeny performed on more than 70 Astyanax species has shown that the three species analysed here share a same clade, with genetic distances lower than 2%, thus suggesting their recent diversification[28]. Therefore, the high proportion of shared satellite DNA families between these three species might also be due to their short period of independent evolution. Satellite DNA analysis in species belonging to other clades will provide valuable information on turnover rate of satellite DNA in this genus. Up to now, only one satDNA was described in Astyanax, using restriction enzymes, i.e., the As51 satDNA[15]. This is actually the most abundant satDNA in the A. paranae genome, for which reason it is named here ApaSat01-51, being 14 times more abundant in the 0B genome than the second satDNA in abundance (ApaSat02-236). Our present results have thus revealed the power of NGS sequencing combined with the bioinformatic methodology provided by the RepeatExplorer and SatMiner tools, as they allow access to a new level of satellites showing very low abundance. In fact, we have been able to visualize FISH clusters for ApaSat45-113, which represents only 0.001% of the 0B genome. Our results also show that the use of the SatMiner toolkit adds extra power for the high-throughput analysis of satellite DNA since a single RepeatExplorer run in A. paranae uncovered only six satDNA families with 0.0836% minimum abundance. Therefore, the use of SatMiner implied uncovering 39 additional satDNAs and detecting satellites with abundance almost 60 times lower, as shown by Ruiz-Ruano et al.[2]. The high-throughput analysis of the satellitome provides ample information on many aspects of a multitude of different satDNA families within the same genome. It is presumable that this kind of analysis will help to reveal general tendencies of this kind of repetitive DNA, and whether they differ among groups of organisms. For instance, the RUL for the 45 satDNA families bioinformatically found in A. paranae (median = 59 bp) was rather shorter than the figures reported in grasshopper species (158 bp in Locusta migratoria[2] and 97 bp in Eumigus monticola[29]). Satellitome analysis in other species will clarify whether there is actually a tendency for length differences between different kinds of organisms. Likewise, in E. monticola, Ruiz-Ruano et al.[29] found that satDNAs clustered on the B chromosome were significantly shorter than those located only on the A chromosomes. In A. paranae, however, no significant length difference was found between satellites visualized by FISH on the B chromosome and those non visualized on it (U = 148.5, P = 0.669). In A. paranae, longer satellites tend to be richer in A + T, in consistency with previous observations by Ruiz-Ruano et al.[2]. However, whereas long satDNAs are less divergent than short ones in the migratory locust genome[2], we have not found such an association in A. paranae, perhaps because the latter species actually carries few long satellites. The second and fourth most abundant satDNAs in the A. paranae genome, ApaSat02-236 and ApaSat04-233 (both showing homology thus constituting the SF1 superfamily), might play a role in centromeric function in these three species as they are localized on the pericentromeric region of many chromosomes. It is usually assumed that the most abundant satDNA plays a centromeric role[30]. However, in E. monticola, the only satDNA located in the pericentromeric region of all chromosomes is EmoSat08-41, which is the eighth satDNA in order of decreasing abundance[29]. Likewise, in Astyanax, the only satDNAs showing pericentromeric location in most chromosomes were those belonging to SF1, and ApaSat04-233 was present on all chromosomes in A. fasciatus (see Fig. 4b), but they were only the second and fourth most abundant satDNAs, respectively. We found two patterns of chromosomal distribution for the A. paranae satDNAs. These were (i) the non-clustered (nc) pattern, referred to satDNAs being invisible by FISH but still abundant in the genome, as indicated by the bioinformatics analysis of Illumina reads and (ii) the clustered (c) pattern, for those satDNAs forming a countable number of large clusters being visible by FISH. Remarkably, although many satellites exhibited the same pattern of distribution in the three species analysed, some satellites nonetheless exhibited distinct patterns between species, especially ApaSat20-18 and ApaSat36-21 which show a non-clustered pattern on the A chromosomes of A. paranae but they are conspicuously clustered on those of A. fasciatus and A. bockmanni (Fig. 3) or else only on those of A. bockmanni (Supplementary Fig. S4). These different patterns of distribution indicate that each satDNA follows its own evolutionary pathway in different species; that is, a satDNA can remain non-clustered in one species while it can become clustered in another species, thus following the pattern of emergence and establishment of satDNA proposed by Ruiz-Ruano et al.[2]. It is interesting to note that non-clustered satDNAs tend to show shorter RUL than clustered ones, as this is consistent with the general idea that microsatellites and minisatellites are scattered throughout the genome, whereas long satellites are clustered. However, this observation is only partially correct in A. paranae since at least one non-clustered satellite (ApaSat38-107) was longer than 100 bp which is an usual minimum threshold for long satellites, and several clustered satellites showed RULs about only 20 bp, thus overlapping with the common definition of minisatellites (see Table 2). Our present results thus support the conclusion by Ruiz-Ruano et al.[2] that it is not justified to classify satellites according to RUL since satellites of any length may be clustered or not in the genome, a fact also observed in A. paranae. Due to the large number of chromosomes in the karyotype of the Astyanax species analyzed here, with many A chromosomes showing similar morphology, it was difficult to perform a detailed analysis of satDNA distribution on them. Therefore, the between-species comparison was more superficial for A chromosomes than for B chromosomes. In the case of A chromosomes, we limited the comparison to whether each satellite is clustered on one or more chromosomes of the A complement in each species, whereas for the B chromosomes, which can be easily identified in each species, it was possible to accurately locate each satellite (see Fig. 5). The set of satDNAs located on the BpM chromosome was distributed throughout its whole length, including regions where it had not yet been possible to identify any type of repetitive sequence[23,31]. The presence of repetitive sequences along the whole BpM chromosome length is consistent with its C-positive banding pattern (Supplementary Fig. S3). Two satellite DNAs showed conspicuous FISH signals on the B chromosome of A. paranae (BpM), i.e., ApaSat20-18 and ApaSat44-21, are apparently B-specific in this species, as they failed to show FISH signals on A chromosomes of this species (Fig. 3 and Supplementary Fig. S6). Remarkably, a search for dimers or multimers of each of these satDNAs in the 0B reads showed that they are not tandemly repeated in the A chromosomes, suggesting that they are actually B-specific in this species. One of these satDNAs (ApaSat44-21) appears to be exclusive to the B chromosome of A. paranae (Supplementary Fig. S6), and thus it might have arisen by duplication and amplification in the BpM chromosome, consistent with the enrichment of repetitive DNA occurring in B chromosomes after their origin[32]. The other satDNA (ApaSat20-18), however, is shared with the A and B chromosomes of A. fasciatus and A. bockmanni (Fig. 3), thus it could be a good marker for investigating B chromosome origin in these species through detailed sequence analysis. In several species of Astyanax, it has been shown that the most frequent morphology for the B chromosomes is metacentric and, in some cases, it has been proven that they are isochromosomes[15,31]. The symmetrical distribution of satellites on the B chromosomes analyzed here is a clear indication that they are, in fact, isochromosomes, i.e., that they derived from an acrocentric chromosome which, through incorrect division of the centromere, gave rise to a metacentric B chromosome with two identical arms, each corresponding to a chromatid, thus showing perfect symmetry. Therefore, a newly emerged iso-B-chromosome should show a very high index of symmetry (1), whereas old ones could show lower symmetry indexes as a result of sequence changes being proportional to their age. The low symmetry index shown by BfMa in A. fasciatus might suggest that it did not arise as an isochromosome. However, the symmetric pattern for five satellites, shown by this B chromosome raises the possibility that it also begun as an isochromosome and that some satellites were gained or lost from a single arm. Since B chromosomes are dispensable, these indels most likely were neutral and their number was proportional to age. On this basis, the BfMa chromosome in A. fasciatus would be older than BfMa in A. paranae and BbM in A. bockmanni. The presence of B chromosomes showing similar morphology and size in several Astyanax species led Moreira-Filho et al. to suggest their possible common origin[33]. Recent results by chromosome painting would be consistent with this hypothesis as seven types of B chromosomes shared anonymous repetitive DNA sequences in A. paranae, A. fasciatus and A. bockmanni[23], although the physical mapping of four non-anonymous repetitive DNA families, by the same authors, failed to support the common origin hypothesis, except invoking a complex series of gains and/or losses of several kinds of repetitive DNA families[23]. However, they also compared the DNA sequence of the ITS regions of 45 S ribosomal DNA obtained by PCR amplification on microdissected BpM and BfMa chromosomes and also on genomic DNA from B-lacking individuals of A. paranae and A. fasciatus, and concluded that these two B chromosomes might have had a common origin through hybridization[23]. Our present comparative analysis of the distribution of satellite DNA on A chromosomes and three types of B chromosomes (BpM in A. paranae, BfMa in A. fasciatus and BbM in A. bockmanni) has revealed that A and B chromosomes share about similar proportions of satellite families between species. This result is highly consistent with the common descent of A and B chromosomes in these species, including the possibility that these B chromosomes descended from a B chromosome already present in a common ancestor of these three species. The finding of more B variants in A. fasciatus than in the two other species[23] might be an indication that B chromosomes are older in this species. Likewise, the lower symmetry in satDNA location shown by BfMa, suggests that it might be an isochromosome-derived B chromosome with a longer evolutionary pathway of satDNA gains, losses or rearrangements than BpM and BbM ones, which showed perfect symmetry. However, the former observations would also be consistent with the independent and recurrent origin of B chromosomes in different species, provided that centromere misdivision is frequent in this group. Summing up, our present results add new insights on the different hypotheses previously suggested about B chromosome origin in Astyanax: (1) B chromosomes found in different species of this genus might have derived from a same B chromosome arisen in an ancestor species, but they might have evolved at different rates between species, i.e. faster in A. fasciatus. (2) B chromosomes could have moved between species through interspecific hybridization, as suggested in bees[34] and Astyanax[23]. However, the recent common ancestry of the three species analyzed here[28] makes it difficult to distinguish B origin through hybridization from common descent with diversification. (3) Our present results raise the possibility that B chromosome origin is recurrent in these species, so that they might have arisen independently in the three species, thus explaining why B chromosomes in A. fasciatus appear to be older than those in A. paranae and A. bockmanni. The fact that B chromosomes in these three species share a remarkable collection of satDNAs indicates that the origin of these B chromosomes is complex and cannot be elucidated by a marker as dynamic as satellite DNA. Future research should be focused on unveiling the content of these B chromosomes in protein-coding genes, as this kind of genes has recently been uncovered in B chromosomes of several organisms[35-39]. It is expected that gene content will provide a more clarifying test to the hypothesis of the common origin of B chromosomes in these species, as it predicts similar content for protein-coding genes in B chromosomes from different species.

Materials and Methods

We analyzed individuals of A. paranae, A. fasciatus and A. bockmanni. The sample information is summarized in the Table 4. Previous research showed the presence of B chromosomes in all these locations[23,31,40]. The samples were collected in accordance with the Brazilian environmental protection legislation (collection permission MMA/IBAMA/SISBIO-number 3245), and the procedures for sampling, maintenance and analysis of the samples were performed in compliance with international guidelines for the care and use of animals followed by the Brazilian College of Animal Experimentation (COBEA) and was approved (protocol 405) by the Bioscience Institute/UNESP Ethics Committee on the Use of Animals (CEUA).
Table 4

Collection sites, number of specimens, diploid number (2n) and B chromosome features of the Astyanax individuals analysed.

SpeciesWaterwayCoordinatesSpecimens2nB typeB name
A. paranaeCascatinha river22°53′30″S 48°28′36″W2050Large MBpM
A. fasciatusÁgua da Madalena stream22°59′23″S 48°25′31″W446Large MBfMa
A. bockmanniAlambari river22°27′6″S 49°14′25″W750Large MBbM

M = metacentric, BpM = metacentric B in A. paranae, BfMa = metacentric B in A. fasciatus, BbM = metacentric B in A. bockmanni.

Collection sites, number of specimens, diploid number (2n) and B chromosome features of the Astyanax individuals analysed. M = metacentric, BpM = metacentric B in A. paranae, BfMa = metacentric B in A. fasciatus, BbM = metacentric B in A. bockmanni. After analysis, specimens were deposited at the fish collection of the Laboratório de Biologia e Genética de Peixes (LBP) at UNESP, Botucatu, São Paulo, Brazil, under the vouchers LBP19572 (A. paranae – Cascatinha River) and LBP19574 (A. fasciatus – Água da Madalena Stream). The specimens from the Alambari River (A. bockmanni) were deposited at the fish collection of the Laboratório de Genética de Peixes at UNESP, Bauru, São Paulo, Brazil. Mitotic chromosomes were obtained from tissue cell suspensions of the anterior kidney according to Foresti et al.[41]. B chromosomes were identified by C-banding performed following the protocol described by Sumner et al.[42]. Chromosome morphology was classified, according to Levan et al.[43], as metacentric (m), submetacentric (sm), subtelocentric (st), and acrocentric (a). Karyotypes were arranged according to chromosome morphology and size. We extracted total genomic DNA from liver using a tissue kit (Macherey-Nagel), including a step for RNA removal with RNAse A (Invitrogen). Genomic DNA sequencing was performed on total DNA extracted from one individual carrying 1B and another lacking it, by the Life Sciences Core Facility (LaCTAD) of Universidade Estadual de Campinas (UNICAMP), using the Illumina HiSeq. 2000 platform (2 × 101 bp paired-end) (Illumina, San Diego, CA), which yielded 14 and 34 Gbp of reads for the 1B and 0B libraries, respectively. We deposited the 0B and 1B genomic libraries in the Sequence Read Archive (SRA) database with accession numbers SRR5461470 and SRR5461471, respectively. To perform a high-throughput analysis of satellite DNA in the A. paranae genome, we first performed a standard RepeatExplorer[9] clustering on 2 × 200,000 reads combined from the 0B and 1B genomic libraries. In this first analysis, we only detected 6 satDNAs. We then followed the protocol suggested by Ruiz-Ruano et al.[2] using the satMiner toolkit, in order to detect as many satDNAs as possible in the A. paranae genome. Briefly, the protocol consists of quality trimming with Trimmomatic[44] and then clustering a selection of 2 × 200,000 reads with RepeatExplorer, duplicating the number of reads for each new run. Then, we searched for tandem repeated structures in those assembled contigs of clusters with a typical satDNA structure (i.e., spherical or ring-shaped) with Geneious R8.1 software. We then used DeconSeq software[45] to filter out reads showing homology with the previously identified clusters. Then, using a sample of the remaining reads, we performed a new RepeatExplorer run. We performed a homology search between all repeat unit sequences found and grouped them as same sequence variant, same family and same superfamily if identity was higher than 95%, 80% and 50%, respectively. We determined the abundance and divergence for each variant by means of RepeatMasker software[46], with the Cross_match search engine, using 10 million reads for each genome. Abundance in the 0B and 1B genomes (A0 and A1, respectively) was calculated as the proportion of reads mapped for a given satDNA with respect to total mapped reads. We assigned catalog numbers to satDNA families in order of decreasing abundance in the 0B genome, following Ruiz-Ruano et al.[2]. We named each satDNA family following the criterion suggested by Ruiz-Ruano et al.[2]. The assembled sequences were deposited in GenBank with accession numbers MF044753-MF044818. We searched for homology with other repetitive sequences in RepBase[47]. We calculated the log2 of the quotient between 1B and 0B abundance values (log2 A1/A0) to detect changes in satDNA abundance that might be due to abundance differences in the A and B chromosomes. We thoroughly investigated the possible presence in the B-lacking genome of satDNA families that appeared to be B-specific. For this purpose, we first selected pairs of reads in each library separately showing homology with a specific satDNA by using BLAT[48] implemented in a custom script (https://github.com/fjruizruano/ngs-protocols/blob/master/mapping_blat_gs.py). Then, we performed a RepeatExplorer clustering with 2 × 2500 of the selected reads. In this case, we used a custom database for annotating the sequences of all assembled satDNAs. We designed primers in opposite orientation to amplify the 45 satDNAs identified bioinformatically (Supplementary Table S1) as in Ruiz-Ruano et al.[2]. To generate FISH probes, we performed PCR including digoxigenin-11-dUTP (Roche Applied Science) in the reaction to label the PCR product. For FISH experiments, we used the procedure described in Pinkel et al.[49], using high stringency conditions, and the signals were detected with anti-digoxigenin-rhodamine (Roche Applied Science). We analyzed a minimum of five metaphases for each hybridization experiment. The two cytogenetic preparations of A. paranae used for FISH, were not from the same individuals used for DNA sequencing. The FISH experiments on A. fasciatus were carried out using material from two samples, whereas we used material from only one sample for FISH on A. bockmanni. These individuals had only one B chromosome by cell and we confirmed that we used individuals with the same type of B chromosome for A. paranae and A. fasciatus by C-banding. Statistical analysis was performed by the Shapiro-Wilks test to ascertain whether variables fitted a normal distribution, Levene’s test to test homoscedasticity, the Fisher exact test, and the non-parametric Mann-Whitney and Spearman rank correlation tests, by means of the Statistica 6.0 software.
  42 in total

1.  Origin of B chromosomes in the genus Astyanax (Characiformes, Characidae) and the limits of chromosome painting.

Authors:  Duílio M Z de A Silva; Sandro Natal Daniel; Juan Pedro M Camacho; Ricardo Utsunomia; Francisco J Ruiz-Ruano; Manolo Penitente; José Carlos Pansonato-Alves; Diogo Teruo Hashimoto; Claudio Oliveira; Fábio Porto-Foresti; Fausto Foresti
Journal:  Mol Genet Genomics       Date:  2016-03-16       Impact factor: 3.291

2.  Satellite DNA content illuminates the ancestry of a supernumerary (B) chromosome.

Authors:  Francisco J Ruiz-Ruano; Josefa Cabrero; María Dolores López-León; Juan Pedro M Camacho
Journal:  Chromosoma       Date:  2016-08-13       Impact factor: 4.316

3.  Nucleotide sequences of HS-alpha satellite DNA from kangaroo rat Dipodomys ordii and characterization of similar sequences in other rodents.

Authors:  K Fry; W Salser
Journal:  Cell       Date:  1977-12       Impact factor: 41.582

4.  Chromosome mapping of H1 histone and 5S rRNA gene clusters in three species of Astyanax (Teleostei, Characiformes).

Authors:  D T Hashimoto; M A Ferguson-Smith; W Rens; F Foresti; F Porto-Foresti
Journal:  Cytogenet Genome Res       Date:  2011-01-19       Impact factor: 1.636

5.  Organization and molecular cytogenetics of a satellite DNA family from Hoplias malabaricus (Pisces, Erythrinidae).

Authors:  T Haaf; M Schmid; C Steinlein; P M Galetti; H F Willard
Journal:  Chromosome Res       Date:  1993-05       Impact factor: 5.239

6.  Polymorphic nature of nucleolus organizer regions in fishes.

Authors:  F Foresti; L F Almeida Toledo; S A Toledo
Journal:  Cytogenet Cell Genet       Date:  1981

7.  High-throughput analysis of the satellitome illuminates satellite DNA evolution.

Authors:  Francisco J Ruiz-Ruano; María Dolores López-León; Josefa Cabrero; Juan Pedro M Camacho
Journal:  Sci Rep       Date:  2016-07-07       Impact factor: 4.379

8.  Protein-coding genes in B chromosomes of the grasshopper Eyprepocnemis plorans.

Authors:  Beatriz Navarro-Domínguez; Francisco J Ruiz-Ruano; Josefa Cabrero; José María Corral; María Dolores López-León; Timothy F Sharbel; Juan Pedro M Camacho
Journal:  Sci Rep       Date:  2017-04-03       Impact factor: 4.379

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  Uncovering the Ancestry of B Chromosomes in Moenkhausia sanctaefilomenae (Teleostei, Characidae).

Authors:  Ricardo Utsunomia; Duílio Mazzoni Zerbinato de Andrade Silva; Francisco J Ruiz-Ruano; Cristian Araya-Jaime; José Carlos Pansonato-Alves; Priscilla Cardim Scacchetti; Diogo Teruo Hashimoto; Claudio Oliveira; Vladmir A Trifonov; Fábio Porto-Foresti; Juan Pedro M Camacho; Fausto Foresti
Journal:  PLoS One       Date:  2016-03-02       Impact factor: 3.240

View more
  16 in total

1.  Satellite DNA content of B chromosomes in the characid fish Characidium gomesi supports their origin from sex chromosomes.

Authors:  Érica A Serrano-Freitas; Duílio M Z A Silva; Francisco J Ruiz-Ruano; Ricardo Utsunomia; Cristian Araya-Jaime; Claudio Oliveira; Juan Pedro M Camacho; Fausto Foresti
Journal:  Mol Genet Genomics       Date:  2019-10-17       Impact factor: 3.291

2.  Meiotic behavior, transmission and active genes of B chromosomes in the cichlid Astatotilapia latifasciata: new clues about nature, evolution and maintenance of accessory elements.

Authors:  Adauto Lima Cardoso; Natália Bortholazzi Venturelli; Irene da Cruz; Fábio Malta de Sá Patroni; Diogo de Moraes; Rogério Antonio de Oliveira; Ricardo Benavente; Cesar Martins
Journal:  Mol Genet Genomics       Date:  2022-06-15       Impact factor: 3.291

3.  Satellitome analysis illuminates the evolution of ZW sex chromosomes of Triportheidae fishes (Teleostei: Characiformes).

Authors:  Rafael Kretschmer; Caio Augusto Gomes Goes; Ricardo Utsunomia; Marcelo de Bello Cioffi; Luiz Antônio Carlos Bertollo; Tariq Ezaz; Fábio Porto-Foresti; Gustavo Akira Toma
Journal:  Chromosoma       Date:  2022-01-31       Impact factor: 4.316

4.  B chromosomes of multiple species have intense evolutionary dynamics and accumulated genes related to important biological processes.

Authors:  Syed F Ahmad; Maryam Jehangir; Adauto L Cardoso; Ivan R Wolf; Vladimir P Margarido; Diogo C Cabral-de-Mello; Rachel O'Neill; Guilherme T Valente; Cesar Martins
Journal:  BMC Genomics       Date:  2020-09-23       Impact factor: 3.969

5.  Differential expression of miRNAs in the presence of B chromosome in the cichlid fish Astatotilapia latifasciata.

Authors:  Jordana Inácio Nascimento-Oliveira; Bruno Evaristo Almeida Fantinatti; Ivan Rodrigo Wolf; Adauto Lima Cardoso; Erica Ramos; Nathalie Rieder; Rogerio de Oliveira; Cesar Martins
Journal:  BMC Genomics       Date:  2021-05-12       Impact factor: 3.969

Review 6.  Vertebrate Genome Evolution in the Light of Fish Cytogenomics and rDNAomics.

Authors:  Radka Symonová; W Mike Howell
Journal:  Genes (Basel)       Date:  2018-02-14       Impact factor: 4.096

7.  Satellitome landscape analysis of Megaleporinus macrocephalus (Teleostei, Anostomidae) reveals intense accumulation of satellite sequences on the heteromorphic sex chromosome.

Authors:  Ricardo Utsunomia; Duílio Mazzoni Zerbinato de Andrade Silva; Francisco J Ruiz-Ruano; Caio Augusto Gomes Goes; Silvana Melo; Lucas Peres Ramos; Claudio Oliveira; Fábio Porto-Foresti; Fausto Foresti; Diogo Teruo Hashimoto
Journal:  Sci Rep       Date:  2019-04-10       Impact factor: 4.379

8.  A Long-Term Conserved Satellite DNA That Remains Unexpanded in Several Genomes of Characiformes Fish Is Actively Transcribed.

Authors:  Rodrigo Zeni Dos Santos; Rodrigo Milan Calegari; Duílio Mazzoni Zerbinato de Andrade Silva; Francisco J Ruiz-Ruano; Silvana Melo; Claudio Oliveira; Fausto Foresti; Marcela Uliano-Silva; Fábio Porto-Foresti; Ricardo Utsunomia
Journal:  Genome Biol Evol       Date:  2021-02-03       Impact factor: 3.416

9.  Satellite DNAs Unveil Clues about the Ancestry and Composition of B Chromosomes in Three Grasshopper Species.

Authors:  Diogo Milani; Vanessa B Bardella; Ana B S M Ferretti; Octavio M Palacios-Gimenez; Adriana de S Melo; Rita C Moura; Vilma Loreto; Hojun Song; Diogo C Cabral-de-Mello
Journal:  Genes (Basel)       Date:  2018-10-26       Impact factor: 4.096

10.  Eight Million Years of Satellite DNA Evolution in Grasshoppers of the Genus Schistocerca Illuminate the Ins and Outs of the Library Hypothesis.

Authors:  Octavio M Palacios-Gimenez; Diogo Milani; Hojun Song; Dardo A Marti; Maria D López-León; Francisco J Ruiz-Ruano; Juan Pedro M Camacho; Diogo C Cabral-de-Mello
Journal:  Genome Biol Evol       Date:  2020-03-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.