Literature DB >> 30971780

Satellitome landscape analysis of Megaleporinus macrocephalus (Teleostei, Anostomidae) reveals intense accumulation of satellite sequences on the heteromorphic sex chromosome.

Ricardo Utsunomia1,2, Duílio Mazzoni Zerbinato de Andrade Silva3, Francisco J Ruiz-Ruano4, Caio Augusto Gomes Goes5, Silvana Melo3, Lucas Peres Ramos3, Claudio Oliveira3, Fábio Porto-Foresti5, Fausto Foresti3, Diogo Teruo Hashimoto6.   

Abstract

The accumulation of repetitive DNA sequences on the sex-limited W or Y chromosomes is a well-known process that is likely triggered by the suppression of recombination between the sex chromosomes, which leads to major differences in their sizes and genetic content. Here, we report an analysis conducted on the satellitome of Megaleporinus macrocephalus that focuses specifically on the satDNAs that have been shown to have higher abundances in females and are putatively located on the W chromosome in this species. We characterized 164 satellite families in M. macrocephalus, which is, by far, the most satellite-rich species discovered to date. Subsequently, we mapped 30 satellites, 22 of which were located on the W chromosome, and 14 were shown to exist only on the W chromosome. Finally, we report two simple, quick and reliable methods that can be used for sex identification in M. macrocephalus individuals using fin clips or scales, which could be applicable to future studies conducted in the field of aquaculture.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30971780      PMCID: PMC6458115          DOI: 10.1038/s41598-019-42383-8

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Eukaryotic genomes have a large number of repetitive DNA sequences that include transposable elements (TEs), multigene families and satellite DNAs (satDNAs)[1]. These sequences are highly dynamic at the chromosomal and nucleotide levels and several mechanisms that contribute to their expansion/contraction are known to shape their evolution[2]. Satellite DNAs are noncoding head-to-tail tandemly repeated sequences that constitute long arrays that are preferentially distributed within pericentromeric or subtelomeric heterochromatic areas; however, several examples of short arrays that are dispersed within euchromatin have already been reported[3-7]. In general, groups of related species share several satDNA families that usually evolve independently within each lineage, according to the so-called library hypothesis[8]. The use of next generation sequencing (NGS) data and bioinformatics to identify repetitive sequences provides a unique opportunity to characterize large collections of satellite DNAs in nonmodel species and, most importantly, compare satellitomes or specific satDNAs from distinct species[9-11]. In this context, the analysis of repeat diversity and genomic abundance in different species becomes possible and also allows for the isolation and characterization of some satDNAs that have accumulated within specific genomic regions, such as the B or sex chromosomes[9,12]. Heteromorphic sex chromosomes usually arise from a loss of recombination on sex-limited W or Y chromosomes, which leads to degenerative processes such as the pseudogenization and invasion of repetitive DNA sequences[13]. Unlike mammals and birds, fishes exhibit a rapid turnover in their sex chromosomes that is directly related to the numerous sex chromosome systems described for their group[14]. Although knowledge regarding the molecular mechanisms underlying sex determination in fishes is still limited, the potential role of the accumulation of DNA repeats during sex chromosome differentiation has been well documented[13,15]. Thus, an initial mass characterization of accumulated repeats in the sex chromosomes of a particular species is necessary for better understanding of the evolution of these chromosomes, as well as to provide support for the complete assembly of repeat-rich genomic regions[2,16]. Sex-specific markers are important noninvasive tools that can be used to assess the sex of individuals at any stage of life. Over the past years, distinct sex markers have been successfully identified for several aquatic species using a number of approaches, including amplified fragment length polymorphism (AFLP)[17,18], random amplified polymorphic DNA (RAPD)[19,20], and, more recently, restriction-associated DNA sequencing (RADseq) and its variations (such as ddRADseq, 2b-RAD, and GBS)[21-23]. Such markers are important for revealing sex chromosome systems and their evolution in primitive vertebrate species, as well as for providing insights into topics relevant to practical aquaculture, such as precocious sex identification, especially in species lacking morphological sexual dimorphism. They are also necessary for the identification of the genetic sex of sex-reversed individuals after hormonal and/or temperature-based treatments[14,24]. Megaleporinus is a fish genus within Anostomidae that is composed by 16 species, based on morphological, molecular and cytogenetic data[25]. Interestingly, representatives of this genus share an ancestral, highly differentiated ZZ/ZW sex chromosome system[25] and comprise the only fish genus relevant to Brazilian aquaculture that has heteromorphic sex chromosomes. Although this sex chromosome system was described more than 30 years ago[26], our knowledge of the molecular structures and sequence components of the Z and W chromosomes is essentially restricted to the identification of the conserved presence of a dispersed repetitive element known as LespeI in the W chromosome in several Megaleporinus species[27-29]. The species M. macrocephalus, which is popularly known as ‘piauçu’, is a valuable characiform fish that has been intensively cultivated in Brazil, with production rates reaching 3,800 tons per year[30]. Similarly, to other Brazilian native fish species, females exhibit higher growth rates than males[31], which has led to an interest in producing superfemale (WW) specimens and, consequently, all-female broods. Thus, expanding the knowledge of the composition of the sex chromosomes of M. macrocephalus is the first step that is necessary to maximize the production of this species in the future. Additionally, the development of noninvasive tools that can be used for sexing M. macrocephalus individuals is also important to avoid sex bias in the selected population; for example, obtaining chromosomes from each specimen would be laborious and demanding, at least in juvenile member of the species. In this context, we characterized the satellitome of M. macrocephalus by integrating genomic and chromosomal data, with a special focus on those satellites that exhibit higher abundances within the female genomic library and, therefore, are putatively located on the W chromosome; the aim of this is to provide a starting point for a detailed and comprehensive analysis of the sex chromosome system. We then mapped the same satDNAs in Megaleporinus obtusidens, a species that diverged from M. macrocephalus approximately 9.27 mya[25], in order to investigate the conservation of satDNA repeats in the W chromosomes of both species. Finally, since sexing individuals of this species is not possible unless mitotic chromosomes were obtained, we developed noninvasive methods for sexing multiple individuals of M. macrocephalus that utilizes a novel quick-FISH protocol and qPCR, which will be useful for future studies and in aquaculture.

Material and Methods

Material, DNA extraction and chromosome preparation

Live specimens of M. macrocephalus and M. obtusidens were collected from fish tanks at the Aquaculture Center of São Paulo State University, CAUNESP, Jaboticabal, SP. The specimens were anesthetized, and total blood was collected from the caudal vein, and then the individuals were returned to the fish tanks at CAUNESP. The procedures used for the sampling, maintenance and analysis of the fishes were performed in compliance with the standards of the Brazilian College of Animal Experimentation (COBEA) and approved (protocol 1094/2018) by the Bioscience Institute/UNESP Ethics Committee on the Use of Animals (CEUA). One female and one male specimen of M. macrocephalus were selected for next generation sequencing (NGS), and the blood collected from both individuals was stored in 95% ethanol. Genomic DNA extraction from the total blood was performed using the DNeasy kit (Qiagen) according to the manufacturer’s instructions and included a RNA removal step that utilized RNAse A (Invitrogen). To obtain the cell suspensions that were used for the chromosome preparations for the M. macrocephalus and M. obtusidens specimens, we performed lymphocyte culture experiments in 12 individuals of M. macrocephalus (9 males and 3 females) and 3 individuals of M. obtusidens (3 females) according to the protocol described by Fenocchio and Bertollo[32]. Subsequently, we performed a random sampling of 20 individuals from among the offspring of M. macrocephalus for testing of our noninvasive tools. In total, we sampled two, twelve and twenty specimens for the purposes of NGS, lymphocyte culture and the diagnosis of sex, respectively.

Whole-genome sequencing and satellitome characterization

Genomic DNA sequencing was performed on the Illumina MiSeq platform (2 × 250 bp paired-end), which yielded 0.77 Gb for the female (ZW) and 0.375 Gb for the male (ZZ) (approximately 0.5x and 0.25x genome coverage, respectively)[33]. We deposited the two libraries into the Sequence Read Archive (SRA) database under accession numbers SRR7263033 and SRR7263034 for the male and the female, respectively. To perform a high-throughput analysis of the satellite DNA within the M. macrocephalus genome, we used the satMiner bioinformatic protocol as described by Ruiz-Ruano et al.[10]. Briefly, we performed quality trimming of the female and male gDNA libraries using Trimmomatic software (options used: ILLUMINACLIP:TruSeq. 3-PE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:20 MINLEN:250) to select pairs of reads for which Q > 20 for all nucleotides[34]. We then randomly selected 2 × 200,000 reads from the female and the male and joined both selections. We performed a clustering of these 2 × 400,000 reads using RepeatExplorer[9] with the default options to select clusters with a structure typical of satDNA (e.g., spherical or ring-shaped), and searched for contigs with tandem repetitions using the dotplot tool that is integrated into Geneious 8.1 software (Biomatters). The assembled contigs of all the clusters identified by RepeatExplorer were filtered using DeconSeq software[35] and the remaining sequences were clustered using RepeatExplorer, which duplicated the number of reads for the second round (2 × 400,000 reads for the female and the male) to generate a total of 2 × 800,000 reads. We repeated these clustering and filtering steps until no new satDNA sequences appeared, while maintaining the number of reads during each iteration. After satDNA mining, we performed a homology search of all the repeated unit sequences that were found and grouped them into the same sequence variant, family and superfamily if their identity was greater than 95%, 80% and 40%, respectively. We determined the abundance and divergence of each variant in the male and female libraries using the RepeatMasker software[36] with the cross_match search engine and a selection of 1,494,006 reads per genome, which corresponded to the number of reads in the smallest library (male) after trimming. We estimated the abundance of each satDNA in the male (ZZ) and female (ZW) genomes based on the proportion of the aligned nucleotides within the number of total reads. Subsequently, we sorted the satDNA families found in the female genome in decreasing order of abundance and assigned a catalog number to each satDNA family according to the criteria described by Ruiz-Ruano et al.[10]. The consensus sequences for each satDNA family were deposited in GenBank under the accession numbers MG818994-MG819157. We also estimated the average divergence generated within a repeat landscape by considering the distances between sequences based on the Kimura 2-parameter model using the script calcDivergenceFromAlign.pl within the RepeatMasker suite[36]. Thus, by comparing the abundance and divergence of different subclasses of satellite DNAs in males and females, we constructed a subtractive repeat landscape that reveals the elements with the greatest differences in abundance in the two types of libraries, which provided the first indication of which satDNA sequences are overrepresented in the female library with respect to the male library. Finally, we calculated the quotient between the female and male abundance values of each satDNA (F/M ratio) in order to reveal the differences in abundance that may be due to differences in the autosomes and sex chromosomes. Thus, to identify potential satDNAs that are clustered on the W chromosome, we selected the 31 satDNA families that had the highest F/M ratios. Primers were then manually designed for each orientation with similar melting temperatures and less stable extensive dimers based on predictions made by the software PerlPrimer[37] to amplify the satDNAs (Supplementary Table S2). We designed primers for all selected satDNAs, with the exception of MmaSat148, because it contained a deletion with respect to the remaining members of SF13 and there were no available regions that could anchor a pair of primers. As a result, we designed primer pairs for 30 satDNA families in total. Statistical analysis was performed using the Shapiro-Wilks test to ascertain whether the variables fit a normal distribution, as well as the nonparametric Mann-Whitney U test, using Statistica 6.0 software.

Retrieving of monomers from raw reads and intragenomic analysis

We collected monomers from four satDNAs families clustered on the W chromosome (MmaSat9, MmaSat97, MmaSat122 and MmaSat12) to obtain reliable scores for the haplotype abundances of these four satDNAs. For this purpose, we used a selection of 1,494,006 reads each from the female and male genomes and aligned these reads against each of the abovementioned satDNAs using with RepeatMasker to trim the ends to obtain the full monomers as described in Utsunomia et al.[11]. Subsequently, the collected monomers were aligned separately using the MUSCLE algorithm[38] with the default parameters, and singletons (e.g., sequence variants found only once) were discarded at this stage. Finally, we constructed minimum spanning trees (MSTs) for each satDNA on the basis of the pairwise differences and consideration of the relative abundances of the haplotypes using PHYLOViZ 2.0 software[39].

FISH analysis for chromosomic satDNA mapping

Prior to the FISH experiments, all probes were labeled with digoxigenin-11-dUTP via PCR. FISH was performed in highly stringent conditions using the method described by Pinkel et al.[40] with small modifications that were described in Utsunomia et al.[11]. Chromosomal preparations from lymphocyte cultures were counterstained with DAPI (4′,6-diamino-2-phenylindole, Vector Laboratories) and analyzed using an optical microscope (Olympus BX61). The images were captured using Image Pro Plus 6.0 software (Media Cybernetics). A minimum of 10 cells from each individual were analyzed to confirm the FISH results.

Use of quick-FISH and qPCR for non-invasive sexing

A novel quick-FISH method was employed to develop a rapid and noninvasive tool for sexing live specimens of M. macrocephalus. We used a mixture of probes that were labeled with digoxigenin-11-dUTP that corresponded to two satDNAs: i) MmaSat97-39, a satDNA that maps to a single band that is exclusive to the W chromosome that can be used to distinguish males and females; ii) MmaSat98-37, a satDNA that maps to a single band in an autosomal pair that shows no differences between males and females and that was used as positive control for FISH. The results of FISH in interphasic cells using this combination of probes showed two spots in males, which corresponded to one per autosome of the pair containing MmaSat98-37, and three spots in females, due to the presence of the additional band corresponding to MmaSat97-39 in the W chromosome. To verify its reliability, we performed the experiment using 20 live specimens that were not subject to previous sexing. For each specimen, we extracted a single scale and a small piece of caudal fin and fixed this material directly in 200 μL of Carnoy’s solution. Subsequently, 20 μL from each cell suspension was placed onto a slide and dried for 15 min at 60 °C. We then proceeded with the FISH protocol as described above using 15 min of probe hybridization. The post-hybridization washes were similar to those performed during “regular” FISH. For each specimen, we counted between 40 to 90 cells. After the scale and fin collection, metaphasic chromosomes were obtained from the specimens using the protocol described in Foresti et al.[41] in order to confirm the presence/absence of the W chromosome. The determinations of the comparative relative abundances of the satDNAs in the male and female genomes was performed using qPCR to measure the copy number differences between males and females in both species. We selected MmaSat97 and MmaSat98 for analysis, since both were used with the quick-FISH protocol to confirm the FISH and bioinformatics results and because these may provide an additional noninvasive tool for sexing individuals. For this purpose, we selected three individuals of each sex (previously genotyped via karyotyping) from M. macrocephalus and M. obtusidens. Since the selected satDNAs are clustered in both species (Supplementary Fig. S1), this tool would be useful for both of them. qPCR was used to determine the satDNA dose using the ΔCt method of relative quantification (RQ), with the single-copy gene hypoxanthine phosphoribosyltransferase (Hprt) being utilized as a reference. Amplification was performed with the following primers: HprtF: 5′-GGCCAGGGAGATCATGAAGG-3′ and HprtR: 5′-TGGAGCGGTCACTATTTCGG-3′. This primer pair was designed based on the assembly of Hprt gene of Astyanax paranae (Silva et al., unpublished). The qPCR was performed using a StepOne Real-Time PCR System (Life Technologies, Carlsbad, CA). The target and reference sequences were simultaneously analyzed using two independent replicates. After amplification, we analyzed the melting curves to verify the presence of a unique product of amplification. The results from samples with inconsistent Ct values were discarded. We determined the ΔCt values, which are represented as the mean ± standard error of the mean (SEM). Statistical analysis was performed using the Shapiro-Wilks test to ascertain whether the variables fit a normal distribution, followed by the nonparametric Kruskal-Wallis test and the Student-Newman-Keuls test.

Results

The W chromosome of M. macrocephalus is enriched in satDNA

Based on 10 iterations of the satMiner protocol, we assembled 514 satDNA variants that belonged to 164 satDNA families that had repeat unit lengths (RUL) that ranged from 5 to 1969 bp (median value 54 bp; Supplementary Table S1 and Fig. S2). In total, satellite DNAs represented 13.47% (female) and 11.99% (male) of the genome, with 2.78% being represented by the most abundant repeat and 1.8E-08% being represented by the less abundant repeat (Supplementary Table S1). The distribution of the lengths was biased due to a predominance of short satellites, with more than half (107) being shorter than 100 bp. The A + T content of the consensus satDNA sequences varied between 29.7% and 84.2% among the families, with a median value of 58.3%, which indicated a slight bias towards A + T rich satellites. The Shapiro-Wilks test showed that the A + T content was the only satellitome feature that fit a normal distribution (W = 0.98, P = 0.08), while the remaining variables (RUL, abundance and divergence) were not normal (P < 0.05 in all cases). For this reason, we used nonparametric tests for the subsequent analysis. Short (<100 bp) and long (>100 bp) satellites had a similar amount of A + T content (U = 2710, P = 0.242). In addition, long satDNAs were more abundant in males (U = 2205, P = 0.007) but were also less diverse (U = 1474, P = 0.001). Sequence comparison between all satDNA families revealed some homology between several of them (Supplementary Table S1, Supplementary Fig. S3). Notably, we found that some long satDNAs shared a conserved motif (68–73 bp), but no evidence of a common origin among them was found (Supplementary Fig. S1). Evidence of higher order repeats (HORs) were found in some members of SF2, including MmaSat19, MmaSat20, MmaSat39, MmaSat66 and MmaSat90 (Supplementary Fig. S4). Several satDNA families were more abundant within the female (ZW) library (Supplementary Fig. S5). Among the 164 satDNA families, we found 95 satDNAs with an F/M ratio higher than 1 and 65 satDNAs with an F/M ratio lower than 1. In addition, three satellites were not found within the male library (Table 1 and Supplementary Table S1). The subtractive repeat landscape revealed a high proportion of MmaSat1 in the female library (Supplementary Fig. S5c); however, we did not design primers for this sequence, since there were several other satDNAs with a higher (F/M) ratio (Supplementary Table S1). This could also be related to the fact that MmaSat1 is likely spread over several autosomes.
Table 1

Repeat unit lengths (RULs, in nt), A + T content (%), number of variants (V), and abundances (% of the libraries) in females (F) and males (M); divergence (%) in females (F) and males (M); clustering patterns and chromosomal locations of the 31 satDNAs families and superfamilies (SF) showing the highest F/M ratio (Females/Males) values.

SFsatDNA familyRULA + TVAbundance (%)Divergence (%)(F/M)PatternLocation
FMFM
15MmaSat155-717150.710.00015215.94BW(i)
13MmaSat158-39394110.00009326.05DW(p,i)
MmaSat162-484847.910.00000335.77DW(i)
MmaSat036-747462.210.0683420.0002026.4226.04338.5455746BW(i)
MmaSat097-393953.830.0243610.00027410.3914.2688.86575875BW(i)
MmaSat122-545457.460.0130870.0002669.9616.6849.1743487BW(p,i)
13MmaSat063-474742.670.0413420.00094312.2620.1143.81854155BW(i)
4MmaSat139-474770.210.0050290.00012620.2720.9339.95338983BW(i)
MmaSat128-383857.940.0110090.0003739.4715.1429.52932761BW(p)
MmaSat127-424264.360.0114880.00068210.825.7616.8478686BW(p)
4MmaSat113-525261.550.0166110.00126012.0518.2513.18306878BW(i)
MmaSat092-464654.3110.0289440.00368514.7433.827.853690304BW(i)
15MmaSat058-717149.330.0444100.0060819.579.957.303293426DBW(p) + A
7MmaSat153-40406510.0003770.00007326.615.545.141818182NS
8MmaSat061-333345.510.0428970.00877910.8612.194.886482382DBW(d) + A
8MmaSat151-333348.510.0006460.00015826.0732.664.086003373NS
MmaSat111-333357.630.0170220.00486216.7230.693.501042124BW(i)
12MmaSat145-676755.210.0011070.00037010.5710.842.994227994DBW(d) + A
MmaSat017-727257.960.1280010.0563039.019.72.273440312DBW(i,d) + A
8MmaSat150-313158.110.0006850.00031915.8621.312.147157191BW(p)
MmaSat152-313161.310.0004130.00021129.9131.531.957016435DBW(i) + A
1MmaSat009-535358.550.2515900.1372159.1510.231.833544197DBW(d) + A
MmaSat099-313161.370.0234340.01316510.8911.751.779963135DBW(i) + A
MmaSat154-303043.310.0003050.00018125.4432.571.685840708NS
MmaSat029-323246.980.0846980.05037414.1215.341.68137639DW(i) + A
MmaSat107-444454.610.0177500.0108995.847.031.628611974DBA
MmaSat098-373745.970.0240990.01494613.6915.411.61236797BA
MmaSat048-1298129849.210.0583760.0365786.37.511.595953778BA
5MmaSat118-666668.220.0150430.0095199.1610.051.580305365NS
2MmaSat108-29529560.310.0176560.0117126.929.631.507559543BA

For each family, the length and A + T content are shown for the most abundant variant. Divergence per family is expressed as a percentage of the Kimura divergence. We designed primers, produced probes and performed FISH for all variants. Pattern: B = banded, NS = no signal, D = dotted, DB = dotted-banded (based on the criteria described by Ruiz-Ruano et al. (2018). Chromosomal location: W = W chromosome, A = autosome, i = interstitial, p = proximal to the centromere, d = distal. When a satDNA was present at two loci in a same chromosome, both locations are indicated and separated by a comma. satDNA families are listed in decreasing order of F/M ratios.

Repeat unit lengths (RULs, in nt), A + T content (%), number of variants (V), and abundances (% of the libraries) in females (F) and males (M); divergence (%) in females (F) and males (M); clustering patterns and chromosomal locations of the 31 satDNAs families and superfamilies (SF) showing the highest F/M ratio (Females/Males) values. For each family, the length and A + T content are shown for the most abundant variant. Divergence per family is expressed as a percentage of the Kimura divergence. We designed primers, produced probes and performed FISH for all variants. Pattern: B = banded, NS = no signal, D = dotted, DB = dotted-banded (based on the criteria described by Ruiz-Ruano et al. (2018). Chromosomal location: W = W chromosome, A = autosome, i = interstitial, p = proximal to the centromere, d = distal. When a satDNA was present at two loci in a same chromosome, both locations are indicated and separated by a comma. satDNA families are listed in decreasing order of F/M ratios.

Chromosomal mapping reveals 14 satDNAs that are exclusive to the W chromosome

Within the 30 satDNA families that were analyzed using FISH, 26 contained conspicuous clusters in at least one chromosome (clustered pattern) in the female specimens of M. macrocephalus, while four satDNAs (MmaSat114, MmaSat151, MmaSat153 and MmaSat154) were not clustered (Table 1 and Supplementary Fig. S1). Based on the resolution of the FISH analysis, 14 were mapped exclusively to the W chromosome, 8 were located on the W chromosome and some autosomes, and 4 were clustered on autosomes (Figs 1–2 and Supplementary Fig. S1). These same probes were hybridized against M. obtusidens chromosomes, 19 of which were shown to be clustered while 11 showed a nonclustered organization (Supplementary Fig. S1). In total, for M. obtusidens, 4 satDNAs were mapped exclusively to the W chromosome, 7 were located on the W chromosome and some autosomes, and 8 were located on autosomes (Fig. 2 and Supplementary Fig. S1). Finally, the W chromosomes of both species were shown to share 10 satDNAs (Fig. 2).
Figure 1

Examples of the chromosomal distribution patterns of the mapped satDNAs in M. macrocephalus; those clustered solely on the W chromosome (MmaSat36); those clustered on the W chromosome and some autosomes (MmaSat17); those clustered on the autosomes (MmaSat48); and those that are nonclustered (MmaSat151). Each cell is shown with the satDNA FISH signal (red) merged with that of DAPI (left) and satDNA FISH (right). Arrowheads indicate the W chromosome. Bar = 10 μm.

Figure 2

FISH analysis showing several satDNA families on the W chromosomes of M. macrocephalus (Mma) and M. obtusidens (Mob). Bar = 10 μm.

Examples of the chromosomal distribution patterns of the mapped satDNAs in M. macrocephalus; those clustered solely on the W chromosome (MmaSat36); those clustered on the W chromosome and some autosomes (MmaSat17); those clustered on the autosomes (MmaSat48); and those that are nonclustered (MmaSat151). Each cell is shown with the satDNA FISH signal (red) merged with that of DAPI (left) and satDNA FISH (right). Arrowheads indicate the W chromosome. Bar = 10 μm. FISH analysis showing several satDNA families on the W chromosomes of M. macrocephalus (Mma) and M. obtusidens (Mob). Bar = 10 μm. Different homogenization patterns were observed in males and females. When the satellite DNA was clustered in both males and females, the satDNA was homogenous in both sexes; however, those satDNAs which clustered solely on the W chromosome had higher homogenization rates than those that were nonclustered in males, which allowed us to infer that homogenization of repeats is higher in clustered repeats (Table 1; U = 20, P = 0.003).

Amplification and diversification of satDNA copies on the W chromosome

We successfully extracted monomers directly from Illumina raw reads representing the MmaSat9, MmaSat97, MmaSat122 and MmaSat128 satDNA families in M. macrocephalus and, after discarding singletons, we obtained 2162 monomers (110 haplotypes), 182 monomers (28 haplotypes), 152 monomers (91 haplotypes) and 141 monomers (9 haplotypes), respectively, for each family (Fig. 3 and Supplementary Fig. S6). The obtained MSTs were different for each satDNA. The MST for MmaSat9 exhibited an interesting pattern of satDNA organization, with the presence of several shared haplotypes in the males and females, which were probably located on the autosomes, as well as some haplotypes that were restricted to females, which were probably located on the W chromosome (Fig. 3). For the other satellites, a few monomers were retrieved from the male library (Supplementary Fig. S6). For MmaSat122, significant diversification in the monomers from the female was observed, while the monomers representing MmaSat128 exhibited less variation and an intermediate level of variation was observed for MmaSat97 (Supplementary Fig. S6).
Figure 3

Minimum spanning tree (MST) showing the relationships between the different haplotypes of MmaSat9 that obtained from Illumina reads from males (blue) and females (pink). The diameter of the circles is proportional to their abundance and the numbers represent the number of mutational steps. Metaphasic plates after FISH of MmaSat9 in female and male specimens are shown; the colors of the borders correspond to the colors of the circles.

Minimum spanning tree (MST) showing the relationships between the different haplotypes of MmaSat9 that obtained from Illumina reads from males (blue) and females (pink). The diameter of the circles is proportional to their abundance and the numbers represent the number of mutational steps. Metaphasic plates after FISH of MmaSat9 in female and male specimens are shown; the colors of the borders correspond to the colors of the circles.

Development of two noninvasive tools for sexing individuals based on quick-FISH and qPCR

For the quick-FISH approach, we selected MmaSat97, a W-specific satellite DNA, and MmaSat98, a satDNA located on a single pair of autosomes. Hybridization using digoxigenin-11-dUTP-labeled versions of these probes generates two signals in male cells, which correspond to the MmaSat98 loci within an autosomal pair, and three signals in female cells, which correspond to the two MmaSat98 loci and the additional MmaSat97 locus on the W chromosome. This approach was successfully applied to the identification of the sex of 20 live specimens. Cells derived from a single scale or a fin clip were hybridized with the two probes, and the results were satisfactory for both samples (Fig. 4). Different hybridization times were also tested (15 min, 1 h, 2 h and overnight), but no visible differences in terms of signal intensity were noticed (data not shown). As predicted, the female cells showed three conspicuous signals, while the male cells showed only two. Overall, the expected FISH signals were observed in almost 90% of all samples, which is likely due to the nuclear conformation within the slides and suggests that this method is 100% reliable for sex identification.
Figure 4

(A) quick-FISH results obtained in male and female samples. (B) Relative quantification (RQ) of MmaSat97 and MmaSat98 in males and females of M. macrocephalus and M. obtusidens. The letters indicate significant differences (P < 0.05) between samples.

(A) quick-FISH results obtained in male and female samples. (B) Relative quantification (RQ) of MmaSat97 and MmaSat98 in males and females of M. macrocephalus and M. obtusidens. The letters indicate significant differences (P < 0.05) between samples. qPCR showed that the genomic abundance of MmaSat97 is different in males and females of M. macrocephalus and M. obtusidens (Fig. 4). This technique also revealed that the genomic abundance of MmaSat98 is similar in males and females of M. macrocephalus and of M. obtusidens, but is different in individuals of M. macrocephalus and M. obtusidens regardless of sex (Fig. 4). Overall, the sexing of individuals using the qPCR method we developed is also 100% reliable.

Discussion

In the present study, we characterized 164 satDNA families in M. macrocephalus that were composed of 514 variants, which is the highest number of satellites characterized for a given species so far; of these, 64 families (155 variants) were grouped into 17 superfamilies. Considering that Astyanax paranae and Characidium gomesi, two other characiform species, contain 45 and 64 satDNA families, respectively[12], it is likely that a great expansion of satellites occurred in the genome of M. macrocephalus. It is well-known that satellites are highly dynamic sequences, which is directly related to the existence of several species- or group-specific satDNAs[2]. In this context, novel satellite DNA families may arise from the independent duplication of different genomic sequences, such as intergenic spacers, portions of transposable elements or even those derived from other satellite DNAs, which leads to a complex scenario[2,10]. Here, we found that almost half of the satDNAs in M. macrocephalus can be grouped into superfamilies, which demonstrates that satDNA expansion in this species is due to the duplication of existing repeats followed by substitutions/deletions/insertion events. Additionally, the existence of higher-order repeats (HORs), which contain alternate subrepeats with greater identity than contiguous repeats[4,42], in some members of SF2 is evidence of the existence of another diversification mechanism within this species. Remarkably, some of the selected satellites used for FISH belong to same superfamily, revealing that some satellites were subject to local duplication and amplification, such as those from SF6 (MmaSat113 and MmaSat139), SF15 (MmaSat158 and MmaSat62) and SF17 (MmaSat58 and MmaSat155) (Fig. 2 and Supplementary Fig. S1), while others were duplicated and disseminated to other chromosomal regions, such as SF10 (MmaSat61, 150 and 151) (Supplementary Fig. S1). Interestingly, one satDNA sequence with a length of 52 bp (MmaSat85-52) was found within the A. paranae (ApaSat29–52) and C. gomesi (CgomSat02–52) genomes[12] (Serrano, unpublished), each of which exhibited different relative abundancies (with a few divergent positions; Supplementary Fig. S7). Considering the proposed phylogeny of the Characiformes order[43,44], one must note that the referenced species (M. macrocephalus, A. paranae and C. gomesi) are distantly related characiform species, each belonging to a different family (Anostomidae, Characidae and Crenuchidae, respectively), which reveals the existence of an interesting and unusual case of satDNA conservation across an entire order, similar to that of the human alpha satellite[42]. Although we were not able to map this satDNA to particular chromosomes, the nucleotide conservation among the distantly related species (92.3% of mean sequence identity) is noteworthy, and further studies will be required to understand the dynamics of this repeat. Our selection of satDNAs via the calculation of relative abundance values (female/male) was effective and revealed the presence of at least 22 satDNAs that had differentially accumulated within the heteromorphic sex chromosomes of M. macrocephalus, which suggested the presence of a high degree of differentiation between the Z and W chromosomes that was likely the result of a loss of recombination between these chromosomes, as has been theoretically predicted[13]. It is noteworthy that 18 out of the 22 satDNAs (78%) were retrieved exclusively during subsequent iterations of RepeatExplorer and DeconSeq that utilized the satMiner protocol[10], which allows for the identification of low abundance satellites. These results will certainly produce an improved assembly for the W chromosome of this species by utilizing long reads. Most importantly, the existence of satDNAs that are exclusively clustered within the W chromosome (nonclustered in males) has provided an opportunity to corroborate the idea that low copy number satDNAs can escape from homogenization mechanisms[7,11,45], as the divergence values of these specific satDNAs are higher in males. As expected in a monophyletic ZZ/ZW system, our results demonstrated that some satellites are conserved in the W chromosomes of M. macrocephalus and M. obtusidens, which suggests that they were present in these chromosomes prior to the split of these species. On the other hand, there are satellites with differential accumulation in or that are exclusive to the W chromosome of M. macrocephalus, which corroborates the occurrence of an independent and continuous differentiation of the W chromosomes in this genus and is reinforced by the presence of some common satDNAs at different abundances[46]. From a practical perspective, the identified satDNAs that are differentially clustered on the W chromosomes of both species were valuable during the development of quick and noninvasive tools for sex identification in M. macrocephalus and M. obtusidens. Even though a conventional PCR-based approach for sex identification would be more practical for fish farmers[47], one must note that both of the methods presented here are suitable for the identification of Megaleporinus superfemales (WW), which would be useful for aquaculture and future genomic studies (Fig. 5).
Figure 5

A schematic summary of the practical steps of utilizing satDNAs for sex identification in M. macrocephalus. From a single scale or a fin clip, it was possible to obtain cell suspensions or extract DNA and perform quick-FISH or qPCR, respectively.

A schematic summary of the practical steps of utilizing satDNAs for sex identification in M. macrocephalus. From a single scale or a fin clip, it was possible to obtain cell suspensions or extract DNA and perform quick-FISH or qPCR, respectively. In summary, our study revealed the existence of a second fish species that has a characterized satellitome and allowed for the confirmation of some insights of satellite biology, the evolution of sex chromosomes and applications of satellite discoveries. The presented data, when combined with that from future analyses, will be useful for the precise characterization of the composition of the sex chromosomes in Megaleporinus. In addition, the development of quick and noninvasive tools for sexing individuals will be useful for future genomic and aquaculture studies. Supplementary data 1
  38 in total

1.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

Review 2.  Satellite DNA evolution.

Authors:  M Plohl; N Meštrović; B Mravinac
Journal:  Genome Dyn       Date:  2012-06-25

3.  Comparative chromosomal mapping of microsatellites in Leporinus species (Characiformes, Anostomidae): unequal accumulation on the W chromosomes.

Authors:  J Poltronieri; V Marquioni; L A C Bertollo; E Kejnovsky; W F Molina; T Liehr; M B Cioffi
Journal:  Cytogenet Genome Res       Date:  2013-11-06       Impact factor: 1.636

4.  A new genus of Anostomidae (Ostariophysi: Characiformes): Diversity, phylogeny and biogeography based on cytogenetic, molecular and morphological data.

Authors:  Jorge L Ramirez; José L O Birindelli; Pedro M Galetti
Journal:  Mol Phylogenet Evol       Date:  2016-11-25       Impact factor: 4.286

5.  RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads.

Authors:  Petr Novák; Pavel Neumann; Jiří Pech; Jaroslav Steinhaisl; Jiří Macas
Journal:  Bioinformatics       Date:  2013-02-01       Impact factor: 6.937

Review 6.  Satellite DNA evolution: old ideas, new approaches.

Authors:  Sarah Sander Lower; Michael P McGurk; Andrew G Clark; Daniel A Barbash
Journal:  Curr Opin Genet Dev       Date:  2018-03-23       Impact factor: 5.578

7.  Polymorphic nature of nucleolus organizer regions in fishes.

Authors:  F Foresti; L F Almeida Toledo; S A Toledo
Journal:  Cytogenet Cell Genet       Date:  1981

8.  Fast identification and removal of sequence contamination from genomic and metagenomic datasets.

Authors:  Robert Schmieder; Robert Edwards
Journal:  PLoS One       Date:  2011-03-09       Impact factor: 3.240

9.  Satellite DNA-like elements associated with genes within euchromatin of the beetle Tribolium castaneum.

Authors:  Josip Brajković; Isidoro Feliciello; Branka Bruvo-Mađarić; Durđica Ugarković
Journal:  G3 (Bethesda)       Date:  2012-08-01       Impact factor: 3.154

10.  High-throughput analysis unveils a highly shared satellite DNA library among three species of fish genus Astyanax.

Authors:  Duílio M Z de A Silva; Ricardo Utsunomia; Francisco J Ruiz-Ruano; Sandro Natal Daniel; Fábio Porto-Foresti; Diogo Teruo Hashimoto; Claudio Oliveira; Juan Pedro M Camacho; Fausto Foresti
Journal:  Sci Rep       Date:  2017-10-10       Impact factor: 4.379

View more
  13 in total

1.  Satellite DNA content of B chromosomes in the characid fish Characidium gomesi supports their origin from sex chromosomes.

Authors:  Érica A Serrano-Freitas; Duílio M Z A Silva; Francisco J Ruiz-Ruano; Ricardo Utsunomia; Cristian Araya-Jaime; Claudio Oliveira; Juan Pedro M Camacho; Fausto Foresti
Journal:  Mol Genet Genomics       Date:  2019-10-17       Impact factor: 3.291

2.  Satellitome analysis illuminates the evolution of ZW sex chromosomes of Triportheidae fishes (Teleostei: Characiformes).

Authors:  Rafael Kretschmer; Caio Augusto Gomes Goes; Ricardo Utsunomia; Marcelo de Bello Cioffi; Luiz Antônio Carlos Bertollo; Tariq Ezaz; Fábio Porto-Foresti; Gustavo Akira Toma
Journal:  Chromosoma       Date:  2022-01-31       Impact factor: 4.316

3.  High dynamism for neo-sex chromosomes: satellite DNAs reveal complex evolution in a grasshopper.

Authors:  Ana B S M Ferretti; Diogo Milani; Octavio M Palacios-Gimenez; Francisco J Ruiz-Ruano; Diogo C Cabral-de-Mello
Journal:  Heredity (Edinb)       Date:  2020-06-04       Impact factor: 3.821

4.  Satellitome Analysis in the Ladybird Beetle Hippodamia variegata (Coleoptera, Coccinellidae).

Authors:  Pablo Mora; Jesús Vela; Francisco J Ruiz-Ruano; Areli Ruiz-Mena; Eugenia E Montiel; Teresa Palomeque; Pedro Lorite
Journal:  Genes (Basel)       Date:  2020-07-13       Impact factor: 4.096

Review 5.  Decoding the Role of Satellite DNA in Genome Architecture and Plasticity-An Evolutionary and Clinical Affair.

Authors:  Sandra Louzada; Mariana Lopes; Daniela Ferreira; Filomena Adega; Ana Escudeiro; Margarida Gama-Carvalho; Raquel Chaves
Journal:  Genes (Basel)       Date:  2020-01-09       Impact factor: 4.096

6.  De novo identification of satellite DNAs in the sequenced genomes of Drosophila virilis and D. americana using the RepeatExplorer and TAREAN pipelines.

Authors:  Bráulio S M L Silva; Pedro Heringer; Guilherme B Dias; Marta Svartman; Gustavo C S Kuhn
Journal:  PLoS One       Date:  2019-12-19       Impact factor: 3.240

7.  Cytogenetics of the small-sized fish, Copeina guttata (Characiformes, Lebiasinidae): Novel insights into the karyotype differentiation of the family.

Authors:  Gustavo Akira Toma; Renata Luiza Rosa de Moraes; Francisco de Menezes Cavalcante Sassi; Luiz Antonio Carlos Bertollo; Ezequiel Aguiar de Oliveira; Petr Rab; Alexandr Sember; Thomas Liehr; Terumi Hatanaka; Patrik Ferreira Viana; Manoela Maria Ferreira Marinho; Eliana Feldberg; Marcelo de Bello Cioffi
Journal:  PLoS One       Date:  2019-12-19       Impact factor: 3.240

8.  Satellitome Analysis of Rhodnius prolixus, One of the Main Chagas Disease Vector Species.

Authors:  Eugenia E Montiel; Francisco Panzera; Teresa Palomeque; Pedro Lorite; Sebastián Pita
Journal:  Int J Mol Sci       Date:  2021-06-03       Impact factor: 5.923

9.  Satellitome Analysis of the Pacific Oyster Crassostrea gigas Reveals New Pattern of Satellite DNA Organization, Highly Scattered across the Genome.

Authors:  Monika Tunjić-Cvitanić; Juan J Pasantes; Daniel García-Souto; Tonči Cvitanić; Miroslav Plohl; Eva Šatović-Vukšić
Journal:  Int J Mol Sci       Date:  2021-06-24       Impact factor: 5.923

10.  A Long-Term Conserved Satellite DNA That Remains Unexpanded in Several Genomes of Characiformes Fish Is Actively Transcribed.

Authors:  Rodrigo Zeni Dos Santos; Rodrigo Milan Calegari; Duílio Mazzoni Zerbinato de Andrade Silva; Francisco J Ruiz-Ruano; Silvana Melo; Claudio Oliveira; Fausto Foresti; Marcela Uliano-Silva; Fábio Porto-Foresti; Ricardo Utsunomia
Journal:  Genome Biol Evol       Date:  2021-02-03       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.