Literature DB >> 28854689

Identification and characterization of a subtelomeric satellite DNA in Callitrichini monkeys.

Naiara Pereira Araújo1, Leonardo Gomes de Lima1, Guilherme Borges Dias1, Gustavo Campos Silva Kuhn1, Alan Lane de Melo2, Yatiyo Yonenaga-Yassuda3, Roscoe Stanyon4, Marta Svartman1.   

Abstract

Repetitive DNAs are abundant fast-evolving components of eukaryotic genomes, which often possess important structural and functional roles. Despite their ubiquity, repetitive DNAs are poorly studied when compared with the genic fraction of genomes. Here, we took advantage of the availability of the sequenced genome of the common marmoset Callithrix jacchus to assess its satellite DNAs (satDNAs) and their distribution in Callitrichini. After clustering analysis of all reads and comparisons by similarity, we identified a satDNA composed by 171 bp motifs, named MarmoSAT, which composes 1.09% of the C. jacchus genome. Fluorescent in situ hybridization on chromosomes of species from the genera Callithrix, Mico and Callimico showed that MarmoSAT had a subtelomeric location. In addition to the common monomeric, we found that MarmoSAT was also organized in higher-order repeats of 338 bp in Callimico goeldii. Our phylogenetic analyses showed that MarmoSAT repeats from C. jacchus lack chromosome-specific features, suggesting exchange events among subterminal regions of non-homologous chromosomes. MarmoSAT is transcribed in several tissues of C. jacchus, with the highest transcription levels in spleen, thymus and heart. The transcription profile and subtelomeric location suggest that MarmoSAT may be involved in the regulation of telomerase and modulation of telomeric chromatin.
© The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Entities:  

Keywords:  Platyrrhini; heterochromatin; repetitive DNA

Mesh:

Substances:

Year:  2017        PMID: 28854689      PMCID: PMC5737874          DOI: 10.1093/dnares/dsx010

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


1. Introduction

New World monkeys (NWM), infraorder Platyrrhini, represent a diverse group of neotropical primates, which are very important in medical, genomics, and evolutionary studies. Among them, the marmosets (tribe Callitrichini, family Cebidae) comprise a group of 21 species of Callithrix, endemic in the Atlantic forest, while Cebuella and Mico are found in the Amazon rainforest. Recent molecular data support a strong relationship between Callithrix and Cebuella + Mico, which are, in turn, sister groups to Callimico., In addition to the geographical separation, marmosets have different diploid numbers (2n). Callithrix species have 2n = 46, whereas both Cebuella and Mico species have 2n = 44. Several studies have shown that NWM genomes are rich in repetitive DNAs, most still uncharacterized. Among them, dispersed repetitive sequences, such as transposable elements (TEs), are major components of primate genomes. For instance, the long interspersed element 1 (LINE-1) and the primate-specific Alu element, a short interspersed element (SINE), were considered as the largest contributors to the genome expansion in primates. Satellite DNA (satDNA) sequences, which are organized as long arrays of head-to-tail tandem repetitions, are also abundant components of primate genomes. SatDNA monomers (repetitive units) form homogeneous arrays, usually enriched in regions of constitutive heterochromatin, and were hypothesized to be related to the maintenance of centromeric function (reviewed by Plohl et al.). Multimers of the same satDNA motif may exhibit high similarity to each other, even when the individual monomers show considerable divergence. This organization is referred to as higher order repeats (HORs). Simian centromeres are mainly composed of α-satellite (AS) consisting of units of ∼170 bp in the infraorder Catarrhini and ∼340 or ∼540 bp in NWM. It was assumed that the HOR blocks of AS was a unique attribute of hominoids. However, Sujiwattanarat et al. have recently reported HORs in the NWM Aotus azarae and C. jacchus. They suggested that this type of organization probably occurs in the AS of a wide range of simians. SatDNAs do not code proteins but their transcription has been reported in many organisms, including vertebrates, invertebrates and plants (reviewed by Pezer et al.) where they were shown to participate in the formation of heterochromatin, centromeres and in gene regulation. Chan et al. showed that AS transcripts are essential for the localization of mitotic centromere proteins including CENP-C, determining the kinetochore structure on centromeric chromatin during mitosis in humans. In spite of the great biomedical and evolutionary interest of primates, their satDNAs have only been studied in a few groups. In the common marmoset Callithrix jacchus, the only satDNA reported to date is the AS DNA, located in the centromeres of all chromosomes., The recent availability of genomic data for C. jacchus provides an excellent new opportunity to study how satDNAs are organized and influence NWM genome evolution. In this study, we employed an integrated approach, using whole-genome sequence analysis and molecular cytogenetics, to get an in-depth insight into a new satDNA of Callithrix, termed MarmoSAT. Our intention was to better understand its evolution by analyzing an array of NWM genomes.

2. Materials and methods

2.1. Identification of satDNA in Callithrix jacchus and sequence analysis

Similarity-based clustering, repeat identification, and classification were performed using RepeatExplorer with whole-genome shotgun (WGS) Illumina reads from a male C. jacchus (accession number: SRR957684). This pipeline involves an all-to-all comparison of Illumina reads by MEGABLAST and the grouping of similar reads in clusters that represent unique repetitive DNA families. A minimum of 55 nt overlap is required for clustering different reads. A total of 1540214 100 bp reads, representing ∼5% coverage of the C. jacchus genome were utilized in the analysis (Supplementary Fig. S1). All clusters with an abundance of at least 0.01% that of the top cluster were analysed in detail (Supplementary Table S1). As the reads utilized represent a random sample of the genome, the abundance of a given repetitive DNA family can be determined by the number of reads present in that specific cluster divided by the total number of reads utilized. The reads from each cluster are further aligned and partially assembled to produce contigs to be used in repeat consensus reconstruction and annotation. All contigs were compared with the mammalian repeat library in Repbase., Whenever a significant number of reads from two distinct clusters match the similarity parameters, RepeatExplorer indicates these clusters as ‘connected component', pointing to a potential relationship between the repeats. C. jacchus MarmoSAT repeats were retrieved from this species sequenced genome by BLAST searches on the assembled genome (accession number: ACFV00000000.1) using as query a consensus sequence obtained from the RepeatExplorer analysis. Hits with e-values lower than 1 × 10−5 were considered significant. Furthermore, BLAST searches on C. jacchus WGS database present on NCBI were used to retrieve long MarmoSAT arrays on unmapped contigs. In some cases, the Tandem Repeats Finder program was used to help in the delimitation of MarmoSAT monomers. The MarmoSAT arrays analysed in unmapped contigs and in assembled chromosomes files were carefully analysed through dot plots to determine the start and end of each repeat. Dot plots were also used to check for similarity between MarmoSAT and AS. These plots were generated with the Dotlet application with a 15 bp word size and 60% similarity cutoff. Multiple sequence alignments were performed using Muscle 4.0. The MEGA software version 5.05 was used for the calculation of genetic distances and construction of Neighbor-Joining (NJ) trees.

2.2. Samples, DNA extractions, PCR amplifications, cloning and sequencing

Chromosome preparations and genomic DNAs were obtained from fibroblast cultures of one male of each Callithrix penicillata, C. geoffroyi, Callimico goeldii and Mico argentatus. Both Callithrix specimens are kept by Dr Alan Lane de Melo in animal facilities at Universidade Federal de Minas Gerais (permits 1/31/94/0000-8 and 3106.6995/2012-MG from IBAMA and 167/2006 from CETEA/UFMG, revalidated on 16 March 2012). The M. argentatus cells were provided by Dr Yatiyo Yonenaga-Yassuda from the Universidade de São Paulo (Brazil). AS and MarmoSAT were amplified by polymerase chain reaction (PCR) from genomic DNAs of the three species with the following primer sets: Alpha-F (ACAGGGAAATATCTGCTTCTAAATC) and Alpha-R (GCTTACTGCTGTTTCTTCCATATG); MarmoSAT-F (ACAGAGTAGAATAGGGCATTG) and MarmoSAT-R (CCAACTCAGTATGCTCTCTCATG). The MarmoSAT set of primers were designed from consensus sequences from an unidentified C. jacchus satDNA. PCR reactions consisted of an initial denaturation step of 94°C for 3 min, followed by 30 cycles at 94°C for 60 s, 55°C for 60 s and 72°C for 60 s and a final extension at 72°C for 10 min. PCR products were excised from a 1% agarose gel and purified with the Wizard SV Gel and PCR Clean-up System kit (Promega). Selected MarmoSAT repeats were cloned using the pGEM-T-Easy cloning kit (Promega). Recombinant plasmids were sequenced on the ABI3130 platform (Myleus Biotechnology). The sequences generated in this study have GenBank accession numbers KX686899 (MarmoSAT – Cebuella pygmaea), KX686900 and KX686901 (MarmoSAT – Mico argentatus).

2.3. CBG-banding and fluorescence in situ hybridization

CBG-banding was obtained according to Sumner. The Callithrix, Callimico and M. argentatus karyotypes were mounted following Sherlock et al., Neusser et al. and Dumas et al., respectively. Fluorescence in situ hybridization (FISH) was performed using AS, MarmoSAT and telomeric sequences as probes. The satDNA probes were prepared from PCR purified products labelled by nick translation with digoxigenin-11-dUTP (DIG-Nick Translation mix, Roche Applied Science). A biotinylated telomeric sequence (TTAGGG)4 (Invitrogen) was synthesized and used as probe for FISH. Chromosomes were denatured in 70% formamide/2xSSC at 65% for 1–2 min. The hybridization mix, consisting of 100 ng of labelled probe in 50% formamide/2xSSC, was denatured for 10 min at 98°C and applied to the chromosome preparations. Hybridization was carried out at 37°C for 16–20 hours. Slides were washed in 2xSSC at 37°C for 5 min. Immunodetection was performed with antidigoxigenin conjugated with FITC and neutravidin coupled with rhodamine (Roche Applied Science). The analyses and image acquisition were performed under a Zeiss Axioimager 2 epifluorescence microscope using the AxioVision software (Zeiss).

2.4. Transcription analysis

We investigated the transcription of MarmoSAT in several tissues of C. jacchus using the RNA-seq data generated by the Non-Human Primate Reference Transcriptome Resource (NHPRTR; Peng et al.). These data are publicly available at NCBI under the BioProject PRJNA271912 and include total Ribo-Zero transcriptomes for bone marrow, the left and right brain hemispheres, the pituitary, colon, heart plus thymus, heart only, kidney, liver, lung, lymph node, muscle and spleen from a female marmoset. The reads were mapped to a consensus sequence of MarmoSAT using the ‘–sensitive-local' preset of Bowtie2 implemented on the Galaxy platform (http://usegalaxy.org; Langmead and Salzberg; Giardine et al.; Goecks et al.). The Neural Network Promoter Prediction tool was used to investigate potential transcription start sites within MarmoSAT (http://www.fruitfly.org/seq_tools/promoter.html; Reese). Transcription of the abundant LINE-1 and Alu elements was also examined as described above and compared with that of MarmoSAT using the Spearman's rank correlation coefficient.

3. Results

3.1. Identification and characterization of MarmoSAT

We identified the most abundant families of repetitive DNAs present in the C. jacchus genome using the similarity-based clustering method implemented on RepeatExplorer. We found that highly repetitive elements comprise ∼15% of the common marmoset genome, mostly represented by TEs and satDNA families. LINE-like elements represented the most abundant repetitive family, comprising 6.7% of the genome and including 95,112 reads in 21 different clusters. The second most abundant repetitive element, Alu-like/SINE-like, spans 3.6% (54,301 reads) of the common marmoset genome. The AS DNA, the third most abundant repetitive family, represents 1.5% of C. jacchus genome (20,308 reads) and is organized on three different clusters inside this species genome. The fourth most abundant cluster of repetitive DNA was a still uncharacterized tandem repeat named herein MarmoSAT, which represents 1.09% of the genome and is composed of 171 bp AT-rich (61%) motifs. Considering an estimated genome size of 3.4 Gb for C. jacchus this new satDNA family would account for more than 37 Mb or ∼216,000 copies. MarmoSAT arrays were identified in all assembled chromosome files, including the sex chromosomes, and presented an average size of 2,223 bp, ranging from 296 bp on chromosome 13 to 8,188 bp on chromosome 3. The average nucleotide divergence among repeats is 20.18%, ranging from 11.5% (on chromosome 14) to 39.1% (on chromosome 22) (Supplementary Table S2). The amount of MarmoSAT differed among the assembled chromosome files, but this variation may be related to technical limitations. We also identified arrays present in different loci inside chromosomes 1, 3, 8, 9, 10, 15, 16, 18, 19, and 21, whereas on chromosomes 8, 15, 16 and 21, MarmoSAT is organized as one single array interspersed with several TEs insertions (Supplementary Table S3). In order to investigate whether MarmoSAT repeats form long-arrays, we performed BLAST searches on unmapped C. jacchus contigs and retrieved the five contigs with the highest score values (accession numbers: ACFV01174585.1, ACFV01176989.1, ACFV01177303.1, ACFV01181345.1and ACFV01184555.1). As a result, we obtained 492 copies of MarmoSAT with an average array size of 99.8 copies per contig, suggesting the presence of long arrays of MarmoSAT in the C. jacchus unmapped data. Because MarmoSAT and AS repeat units have similar sizes (∼171 bp), we performed extensive sequence comparisons with described AS sequences from several primates, including C. jacchus and humans, but dot plot analyses could not find any significant similarity hits between the reads in the MarmoSAT cluster and the ones in the ASor CarB clusters, indicating lack of homology (Supplementary Fig. S2). Moreover, BLAST searches revealed no significant similarity between this sequence and any other deposited in the GenBank or RepBase databases. In order to investigate the homogenization dynamics of MarmoSAT related to their genomic location (from different arrays or chromosomes), we constructed NJ phylogenetic trees using 458 copies extracted from all chromosome files, three BACs, one from the X (accession number: AC146662.3) and two from the Y chromosome (accession numbers: AC243896.4 and AC243459.3), and 492 copies retrieved from unassembled contigs (Supplementary Fig. S3). The resulting trees showed that MarmoSAT sequences were not clustered in chromosome-specific branches. Similarly, the repeats found in the three BACs did not cluster together with the assembled sequences of the corresponding X and Y chromosomes. We also produced chromosome-specific trees, but still could not find any array specificity for MarmoSAT repeats (data not shown).

3.2. MarmoSAT flanking regions are enriched with Alu-like and L1-Cja-like retrotransposons

Aiming to better understand the genomic distribution and possible association with different genetic elements, we analysed 42 flanking regions (from 500 bp to 1 kb, when available) of the MarmoSAT arrays found on the assembled chromosome files (Supplementary Table S3). Our analysis revealed that TEs Alu-like and L1-Cja-like were often associated with MarmoSAT arrays. We found 20 insertions of Alu-like elements adjacent to MarmoSAT arrays, whereas 17 L1-Cja-like elements were found neighbouring MarmoSAT sequences. In contrast, we did not find AS sequences associated with MarmoSAT. We found no preferential positions for TE insertion inside MarmoSAT monomers, and no micro-sequence similarities that could indicate any insertion bias (data not shown). We also found the subtelomeric satDNA family CarB, previously described in M. argentatus, flanking the 5' regions of MarmoSAT arrays on chromosomes 14 and 16 (Supplementary Table S3), and one copy flanking MarmoSAT in one BAC mapped on chromosome Y (accession number AC243896.4). Interestingly, we identified CarB as representing only 0.108% of the C. jacchus genome. We found low copy numbers of CarB on chromosomes 1, 3–8, 13–20, and it was absent on the other chromosomes.

3.3. Phylogenetic distribution of MarmoSAT

In order to ascertain the distribution of MarmoSAT in Cebuella pygmaea and Mico argentatus we amplified this sequence by PCR from genomic DNAs of both species. The sequencing of cloned PCR products from both species confirmed the presence of MarmoSAT in their genomes. The distribution of MarmoSAT in other NWM species was verified by an in silico search using a MarmoSAT query against the GenBank non-redundant nucleotide collection (NCBI). BLAST searches retrieved one significant hit with e-value 2e−13 corresponding to satDNA CgoA (accession number: X52012.1), previously described in Callimico goeldii. This 338 bp satDNA family was described as restricted to the C. goeldii genome after Southern blot hybridizations with 70% stringency. Dot-plot analysis of the CgoA repeat unit sequences showed that this satDNA is composed of two monomers (Fig. 1), one with 170 bp (CgoA1), and another with 168 bp (CgoA2). Pairwise sequence comparisons between CgoA monomers revealed 32.9% nucleotide divergence, indicating a HOR organization in C. goeldii. The comparison of CgoA1 and CgoA2 against MarmoSAT revealed 30.3% and 33.3% of divergence, respectively.
Figure 1

Schematic illustration of MarmoSAT repeat units of 171 bp in Callithrix jacchus and 338 bp HOR of CgoA1 and CgoA2 units in Callimico goeldii.

Schematic illustration of MarmoSAT repeat units of 171 bp in Callithrix jacchus and 338 bp HOR of CgoA1 and CgoA2 units in Callimico goeldii. We searched for MamoSAT in the outgroup species Aotus nancymaae and Saimiri boliviensis (accession numbers SRR1692997 and SRR315548), but did not find homologous sequences. The absence of MarmoSAT in the Aotus and Saimiri lineages suggests that this satDNA family probably amplified after the split of Callitrichinae (Fig. 2).
Figure 2

Possible evolutionary pathway of MarmoSAT and CarB amplification in Callitrichini and Callimico goeldii. Phylogenetic relationships are based on Perelman et al.. Colored branches indicate the presence of MarmoSAT (green, lighter color) and CarB (red, darker color) satellite DNA families. The traced lines in the Callimico and Cebuella lineages indicate insufficient data to verify the hypothesis.

Possible evolutionary pathway of MarmoSAT and CarB amplification in Callitrichini and Callimico goeldii. Phylogenetic relationships are based on Perelman et al.. Colored branches indicate the presence of MarmoSAT (green, lighter color) and CarB (red, darker color) satellite DNA families. The traced lines in the Callimico and Cebuella lineages indicate insufficient data to verify the hypothesis.

3.4. Chromosomal location of MarmoSAT on Callitrichini and Callimico

The chromosome location of MarmoSAT was investigated after FISH on chromosomes from Callithrix penicillata, C. geoffroyi, Mico argentatus and Callimico goeldii. CBG-banding was also performed in order to compare the distribution of MarmoSAT in relation to the constitutive heterochromatin in the four species (Figs 3A and C and 4).
Figure 3

Callithrix penicillata metaphases after (A) CBG-banding and (B) FISH with the MarmoSAT (green) and telomeric (red) probes. C-D show the results of the same experiments in C. geoffroyi. Bar = 10 µm. Colour visible in online version.

Callithrix penicillata metaphases after (A) CBG-banding and (B) FISH with the MarmoSAT (green) and telomeric (red) probes. C-D show the results of the same experiments in C. geoffroyi. Bar = 10 µm. Colour visible in online version. The MarmoSAT probe hybridized to both subtelomeric regions of all biarmed chromosomes in C. penicillata, with the exception of pair 3 and the sex chromosomes, which had only their short arms labelled (Fig. 3B). Pairs 1 and 18 showed interstitial labelling on their short and long arms, respectively, whereas pairs 16 and 17 had no hybridization signals. C. geoffroyi chromosomes presented the same hybridization pattern, with the exception of pair 15, which did not display any signal (Fig. 3D). In M. argentatus, MarmoSAT sequences were visualized on both ends of the biarmed pairs 3, 20 and in the sex chromosomes; in the subtelomeric regions of the short arm of pairs 10 and 21 and in the long arms of pairs 2, 5–7, 13 and 14. In addition, interstitial signals were detected in the short arms of pairs 2 and 3 and in the long arms of pairs 5 and 14. Among the acrocentric chromosomes, MarmoSAT sequences hybridized to the long arms of pairs 15, 17 and 19 and to interstitial regions of pairs 15, 16 and 19 (Fig. 4A). In C. goeldii, MarmoSAT sequences were located at subtelomeric regions of the short and long arms of biarmed chromosomes, with the exception of pair 11, which showed signals only on its long arms. The acrocentrics had only their long arms labelled (Fig. 4B).
Figure 4

Karyotypes of Mico argentatus after (A) CBG-banding and FISH with MarmoSAT (green) and telomeric sequences (red) probes. In (B), CBG-banding and FISH with MarmoSAT sequences probe in Callimico goeldii. Bar = 10 µm. Colour visible in online version.

Karyotypes of Mico argentatus after (A) CBG-banding and FISH with MarmoSAT (green) and telomeric sequences (red) probes. In (B), CBG-banding and FISH with MarmoSAT sequences probe in Callimico goeldii. Bar = 10 µm. Colour visible in online version. Since MarmoSAT had a telomeric localization, we also performed double FISH with a telomeric probe (TTAGGG)4. Telomeric sequences were detected in all telomeres of the three species (Figs 3B, 3D and 4A) and co-localized with MarmoSAT in most chromosome pairs. Additionally, the Y chromosomes of both Callithrix species had their long arms completely labelled with the telomeric probe. A search for sequencing reads that spanned the interface between MarmoSAT and telomeric sequences on the Trace Archive File of C. jacchus (http://www.ncbi.nlm.nih.gov/Traces/home/) revealed two reads composed by MarmoSAT arrays adjacent to telomeric repeats. In both cases, the telomeric repeats were in the terminal 3' position, indicating a subtelomeric location of MarmoSAT (Fig. 5).
Figure 5

Schematic representation of MarmoSAT repeats adjacent to telomeric repeats on reads gnl:1006135623 and gnl:1137852669 found in the NCBI Trace Archive Files database.

Schematic representation of MarmoSAT repeats adjacent to telomeric repeats on reads gnl:1006135623 and gnl:1137852669 found in the NCBI Trace Archive Files database. Besides the MarmoSAT hybridizations, we also report for the first time the chromosome location of AS in C. penicillata, C. geoffroyi, M. argentatus and C. goeldii (Fig. 6). This satDNA is present in large amounts in the (peri)centromeric regions of these marmosets' chromosomes, corroborating the bioinformatics analysis and previous cytogenetic studies of C. jacchus.
Figure 6

FISH with a digoxigenin-labelled α-satellite DNA in Callitrichini and Callimico goeldii cells. CPE = Callithrix penicillata, CGE = C. geoffroyi, MAR = Mico argentatus, CGO = C. goeldii. Bar = 10 µm.

FISH with a digoxigenin-labelled α-satellite DNA in Callitrichini and Callimico goeldii cells. CPE = Callithrix penicillata, CGE = C. geoffroyi, MAR = Mico argentatus, CGO = C. goeldii. Bar = 10 µm.

3.5. MarmoSAT is transcribed on several tissues from C. jacchus

We used the NHPRTR33 data to investigate the transcription of MarmoSAT in the tissues of C. jacchus. This analysis revealed MarmoSAT transcripts in all tissues surveyed albeit at very different abundances (Fig. 7). The 13 tissues analysed displayed over 11-fold variation in the transcription level of MarmoSAT, with a higher-than-average transcript level in brain (left and right hemispheres), heart plus thymus, heart only and spleen (Fig. 7).
Figure 7

Transcription level of MarmoSAT in several tissues of a female Callithrix jacchus. RPKM: Reads Per Kilobase Million.

Transcription level of MarmoSAT in several tissues of a female Callithrix jacchus. RPKM: Reads Per Kilobase Million. Because we found no putative promoter regions in MarmoSAT sequences, since L1 and Alu were the sequences most commonly associated with MarmoSAT arrays, we also analysed their transcripts (Supplementary Table S3). L1 and Alu also displayed ubiquitous transcription, but showed no significant correlation with MarmoSAT transcription levels (Spearman's R = 0.24692 and -0.1318, P = 0.3733 and 0.6693, respectively), and only moderate correlation with one another (R = 0.4835 and P = 0.097; Supplementary Fig. S4).

4. Discussion

In this study, we identified a new satDNA with 171 bp monomers in the common marmoset genome, which was named MarmoSAT. Regardless of the same motif size, MarmoSAT and AS do not share any sequence similarity or conserved structure that could suggest a common origin (Supplementary Fig. S2). The presence of two non-homologous complex satDNA families that share the same motif size was not previously reported in primates. It is possible that the convergence to the same motif size reflects the optimal distance for nucleosome positioning, as suggested by Henikoff et al., indicating that despite sequence heterogeneity, different satDNA families may retain structural features important to heterochromatic domains in C. jacchus.

4.1. Evolutionary turnover of subtelomeric satDNAs MarmoSAT and CarB in Callitrichini

SatDNAs have been identified in the subtelomeric heterochromatin of most marmoset chromosomes,,. In the Callithrix species analysed herein we found MarmoSAT to be a major component of these regions (Fig. 3). The hybridization of MarmoSAT and telomeric sequences showed very similar patterns in both C. penicillata and C. geoffroyi, which may be explained by the recent diversification of the clade (2.6 Mya). Together with the cytogenetic analysis, our searches on assembled genomes and WGS data of NWMs showed that MarmoSAT is apparently restricted to Callithrix, Mico, Callimico and Cebuella, while CarB is present in Callithrix and Mico (Fig. 2). Therefore, MarmoSAT and CarB repeats, a satDNA previously described in Mico species,, were already present in the common ancestor of Callitrichini and C. goeldii. The FISH experiments revealed that MarmoSAT is more abundant in Callithrix and C. goeldii than in Mico (Figs 3 and 4). Accordingly, Fanning et al. CgoA description supports MarmoSAT high abundancy in C. goeldii. Altogether, these results support the hypothesis that MarmoSAT sequences were probably abundant in the ancestor of Callitrichini and C. goeldii and remained copious in Callithrix and Callimico. On the other hand, CarB underwent amplification in Mico, as shown by Alves et al. and Canavez et al., replacing MarmoSAT. This assumption is supported for instance by comparing the homologous M. argentatus chromosome 4 and Callithrix chromosome 13, M. argentatus chromosome 5 short arm and Callithrix pair 20, and M. argentatus chromosome 18 and Callithrix 19, in which the Callithrix counterparts showed MarmoSAT hybridization, whereas in M. argentatus chromosomes there were large heterochromatic blocks mainly composed of CarB. The amplification of specific satDNAs was already reported in Callimico and Cebuella., In marmosets, the differential amplification of unrelated satDNAs does not seem to affect karyotype stability, as this group of NWMs has conserved karyotypes.

4.2. MarmoSAT subtelomeric arrays lack chromosome-specificity

SatDNA repeats on the same array or chromosome tend to present a higher level of sequence similarity to each other when compared with those on non-homologous chromosomes., An NJ phylogenetic tree with 980 copies of MarmoSAT revealed that these sequences lack chromosome specificity (Supplementary Fig. S3). The absence of differential homogenization for specific variants between MarmoSAT subtelomeric arrays in C. jacchus could be the result of exchange events between arrays and chromosomes. This hypothesis is supported by data in chimpanze, in which it has been shown that the subterminal regions of different chromosomes interact with each other to form stable physical contacts in meiosis, which may result in frequent DNA exchanges between chromatids. Moreover, Rudd et al. showed that in humans, 17% of all sister chromatid exchanges occur in the terminal ∼100 kb of chromosomes, translating into a recombination rate on subtelomeric regions 160-fold larger than in euchromatic regions.

4.3. MarmoSAT is present as HORs in Callimico goeldii

Another significant finding was the identification of MarmoSAT organized in HOR structures in C. goeldii. Although HORs are apparently common and widespread in primates,, they are predominantly located at (peri)centromeric, rather than in subtelomeric regions. Herein we showed that the previously described copies of CgoA satDNA18 are in fact two highly different monomers of MarmoSAT organized in HORs (Fig. 1). Probably, the establishment of a HOR organization occurred after the divergence of C. jacchus and C. goeldii ∼8 million years ago (Mya) (Fig. 2). Moreover, we observed that the subtelomeric location of MarmoSAT is shared among Callitrichini species and C. goeldii (Figs 3 and 4) This is the first report of subtelomeric HORs in primates, indicating that HORs sequences may be found in heterochromatic regions outside of centromeres. The example of HORs in closely related species such as C. jacchus and C. goeldii is illuminating since it shows that HORs may develop over a relatively short period of evolutionary time. In contrast with what was found in C. jacchus AS, the sequence similarity between C. goeldii CgoA1 and CgoA2 repeat units is considerably lower than 70%. This pattern is similar to the observed in previously described HORs from primates and bovids.,

4.4. Possible role for L1 in MarmoSAT dispersion

The presence of several MarmoSAT arrays adjacent to L1 opens the possibility that their dispersion could have been mediated by these retroelements (Supplementary Table S3). Interestingly, we detected many copies of MarmoSAT located interstitially on C. penicillata and C. geoffroyi pairs 1 and 18 and M. argentatus pairs 2, 3, 5, 14–16 and 19 (Figs 3 and 4). In humans, it has been shown in vivo and in vitro that the L1 can co-mobilize 3’ downstream sequences to other genomic locations as the result of imperfect transcription events. During the transposition event the transcription of an L1-element may bypass its own polyadenylation signal utilizing a second downstream polyadenylation site for 3’ end processing, leading to the transcription and later transposition of adjacent flanking sequences. Even though most of L1-elements have truncated sequences, these elements are still capable of retrotransposition, suggesting that even incomplete L1s may play a role in such events. The finding of Alu-like and L1-Cja-like flanking MarmoSAT may have resulted from a bias in assembling repetitive DNAs with WGS data, due to the high sequence similarity among repeats. In fact, unmapped sequences have a low frequency of TEs insertions when compared with the sequences present in assembled chromosomes. We also observed that these TEs lack preferential insertion sites in MarmoSAT arrays, either target sites or array positioning, indicating a probable random nature for these events. For example, it has been suggested that L1 elements have a cleavage site preference for sequences rich in AA|TTT, but we did not observe any insertion at these specific sites. Although MarmoSAT sequences have three different AA|TTT regions, the absence of TEs insertions may be affected by the local chromatin structure. Alternatively, TEs loci inside MarmoSAT sequences may have suffered several mutations resulting in different sequences compared with the initial insertion regions.

4.5. Potential functional roles of MarmoSAT

Albeit being devoid of protein-coding capacity, satDNAs may possess structural and/or functional roles, usually via expression of non-coding RNAs (ncRNAs; reviewed by Biscotti et al.). Interestingly, we found MarmoSAT-derived transcripts in all tissues analysed (Fig. 7) and because this satDNA does not possess promoter regions we conclude that its transcription is probably initiated in flanking sequences. The most frequent sequences flanking MarmoSAT arrays are L1 and Alu retroelements, but MarmoSAT transcription levels do not correlate with those from these elements (Supplementary Fig. S4). Because of its abundance and presence in almost all chromosomes, MarmoSAT could be transcribed from a number of different loci and have different promoters, inside or outside L1/Alu. Besides the subtelomeric location determined by FISH, we found Sanger sequencing reads that span both MarmoSAT and the telomeric repeats in C. jacchus (Fig. 5). ncRNAs containing telomeric repeats in mammals are known as telomere repeat containing RNAs (TERRAs) and are thought to regulate telomerase and to modulate telomeric chromatin throughout the cell cycle (reviewed in Luke and Lingner). TERRAs transcription has been shown to start at subtelomeric repeats towards the chromosome ends in human, mouse and yeast. MarmoSAT abundance and location make it possible to suggest that it could be part of TERRAs. Moreover, MarmoSAT expression profile is similar to the TERRA expression found in mouse, with higher levels of transcripts in spleen, kidney and thymus. Although the results presented herein are suggestive of interesting functional roles for MarmoSAT transcripts, the validity of these hypotheses must be assessed experimentally.

5. Conclusions

In this study, we identified with a combination of bioinformatics and cytogenetics, a subtelomeric satDNA, termed MarmoSAT, in the common marmoset and provided insights into its organization and evolution. Our data suggest that MarmoSAT originated in the common ancestor of Callitrichini and Callimico goeldii. Interestingly, the MarmoSAT arrays are organized in HORs in C. goeldii. We also propose that MarmoSAT transcripts may play a role in telomeric chromatin. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  54 in total

1.  Concerted evolution and higher-order repeat structure of the 1.709 (satellite IV) family in bovids.

Authors:  William S Modi; Sergey Ivanov; Daniel S Gallagher
Journal:  J Mol Evol       Date:  2004-04       Impact factor: 2.395

Review 2.  Satellite DNA-mediated effects on genome regulation.

Authors:  Z Pezer; J Brajković; I Feliciello; D Ugarkovć
Journal:  Genome Dyn       Date:  2012-06-25

3.  New insights into centromere organization and evolution from the white-cheeked gibbon and marmoset.

Authors:  A Cellamare; C R Catacchio; C Alkan; G Giannuzzi; F Antonacci; M F Cardone; G Della Valle; M Malig; M Rocchi; E E Eichler; M Ventura
Journal:  Mol Biol Evol       Date:  2009-05-08       Impact factor: 16.240

4.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

5.  Somatic expression of LINE-1 elements in human tissues.

Authors:  Victoria P Belancio; Astrid M Roy-Engel; Radhika R Pochampally; Prescott Deininger
Journal:  Nucleic Acids Res       Date:  2010-03-09       Impact factor: 16.971

6.  Regulation of heterochromatic silencing and histone H3 lysine-9 methylation by RNAi.

Authors:  Thomas A Volpe; Catherine Kidner; Ira M Hall; Grace Teng; Shiv I S Grewal; Robert A Martienssen
Journal:  Science       Date:  2002-08-22       Impact factor: 47.728

7.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences.

Authors:  Jeremy Goecks; Anton Nekrutenko; James Taylor
Journal:  Genome Biol       Date:  2010-08-25       Impact factor: 13.583

8.  Higher-order repeat structure in alpha satellite DNA occurs in New World monkeys and is not confined to hominoids.

Authors:  Penporn Sujiwattanarat; Watcharaporn Thapana; Kornsorn Srikulnath; Yuriko Hirai; Hirohisa Hirai; Akihiko Koga
Journal:  Sci Rep       Date:  2015-05-14       Impact factor: 4.379

9.  Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data.

Authors:  Can Alkan; Mario Ventura; Nicoletta Archidiacono; Mariano Rocchi; S Cenk Sahinalp; Evan E Eichler
Journal:  PLoS Comput Biol       Date:  2007-09       Impact factor: 4.475

10.  Phylogenomics of species from four genera of New World monkeys by flow sorting and reciprocal chromosome painting.

Authors:  Francesca Dumas; Roscoe Stanyon; Luca Sineo; Gary Stone; Francesca Bigoni
Journal:  BMC Evol Biol       Date:  2007-08-16       Impact factor: 3.260

View more
  4 in total

1.  An eutherian intronic sequence gave rise to a major satellite DNA in Platyrrhini.

Authors:  Mirela Pelizaro Valeri; Guilherme Borges Dias; Valéria do Socorro Pereira; Gustavo Campos Silva Kuhn; Marta Svartman
Journal:  Biol Lett       Date:  2018-01       Impact factor: 3.703

Review 2.  Decoding the Role of Satellite DNA in Genome Architecture and Plasticity-An Evolutionary and Clinical Affair.

Authors:  Sandra Louzada; Mariana Lopes; Daniela Ferreira; Filomena Adega; Ana Escudeiro; Margarida Gama-Carvalho; Raquel Chaves
Journal:  Genes (Basel)       Date:  2020-01-09       Impact factor: 4.096

Review 3.  Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics.

Authors:  Syed Farhan Ahmad; Worapong Singchat; Maryam Jehangir; Aorarat Suntronpong; Thitipong Panthum; Suchinda Malaivijitnond; Kornsorn Srikulnath
Journal:  Cells       Date:  2020-12-18       Impact factor: 6.600

4.  Cytogenetic Characterization of Seven Novel satDNA Markers in Two Species of Spined Loaches (Cobitis) and Their Clonal Hybrids.

Authors:  Anatolie Marta; Dmitry Dedukh; Oldrich Bartos; Zuzana Majtanova; Karel Janko
Journal:  Genes (Basel)       Date:  2020-06-04       Impact factor: 4.096

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.