Literature DB >> 22908042

Satellite DNA-like elements associated with genes within euchromatin of the beetle Tribolium castaneum.

Josip Brajković1, Isidoro Feliciello, Branka Bruvo-Mađarić, Durđica Ugarković.   

Abstract

In the red flour beetle Tribolium castaneum the major TCAST satellite DNA accounts for 35% of the genome and encompasses the pericentromeric regions of all chromosomes. Because of the presence of transcriptional regulatory elements and transcriptional activity in these sequences, TCAST satellite DNAs also have been proposed to be modulators of gene expression within euchromatin. Here, we analyze the distribution of TCAST homologous repeats in T. castaneum euchromatin and study their association with genes as well as their potential gene regulatory role. We identified 68 arrays composed of TCAST-like elements distributed on all chromosomes. Based on sequence characteristics the arrays were composed of two types of TCAST-like elements. The first type consists of TCAST satellite-like elements in the form of partial monomers or tandemly arranged monomers, up to tetramers, whereas the second type consists of TCAST-like elements embedded with a complex unit that resembles a DNA transposon. TCAST-like elements were also found in the 5' untranslated region (UTR) of the CR1-3_TCa retrotransposon, and therefore retrotransposition may have contributed to their dispersion throughout the genome. No significant difference in the homogenization of dispersed TCAST-like elements was found either at the level of local arrays or chromosomes nor among different chromosomes. Of 68 TCAST-like elements, 29 were located within introns, with the remaining elements flanked by genes within a 262 to 404,270 nt range. TCAST-like elements are statistically overrepresented near genes with immunoglobulin-like domains attesting to their nonrandom distribution and a possible gene regulatory role.

Entities:  

Keywords:  gene regulation; immunoglobulin-like genes; repetitive DNA; satellite DNA; transposon

Mesh:

Substances:

Year:  2012        PMID: 22908042      PMCID: PMC3411249          DOI: 10.1534/g3.112.003467

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Based on the hypothesis of Britten and Davidson (1971), repetitive elements can be a source of regulatory sequences and act to distribute regulatory elements throughout the genome. In particular, mobile transposable elements (TEs) are predicted to be a source of noncoding material that allows for the emergence of genetic novelty and influences evolution of gene regulatory networks (Feschotte 2008). Recently it has been shown that at least 5.5% of conserved noncoding elements unique to mammals originate from mobile elements and are preferentially located close to genes involved in development and transcription regulation (Lowe ). The complete sequence conservation, wide evolutionary distribution, and presence of functional elements such as promoters and transcription factor binding sites within some satellite DNA sequences has led to the assumption that in addition to participating in centromere formation, they might also act as cis-regulatory elements of gene expression (Ugarković 2005). To perform potential regulatory functions, satellite DNA elements are predicted to be preferentially distributed in euchromatic portion of the genomes in the vicinity of genes. Whole-genome sequencing projects enable the presence and distribution of satellite DNA repeats in the euchromatic portion of the genome to be determined. The analysis of satellite DNA-like elements dispersed within euchromatin, and their comparison with homologous elements present within heterochromatin, also may reveal insights into the origin of satellite DNAs and their subsequent evolution (Kuhn ). Satellite DNAs are major building elements of pericentromeric and centromeric heterochromatin in many eukaryotic species, and in certain species they account for the majority of genomic DNA, as in beetles from the coleopteran family Tenebrionidae (Ugarković and Plohl 2002). In the red flour beetle Tribolium castaneum, pericentromeric heterochromatin comprises approximately 40% of the genome, and TCAST satellite DNA has previously been characterized as the major satellite that encompasses centromeric as well as pericentromeric regions of all 20 chromosomes (Ugarković ). TCAST satellite is composed of two subfamilies, Tcast1a and Tcast1b, which together comprise 35% of the whole genome. Tcast1a and Tcast1b have an average homology of 79% and are a similar size at 362 bp and 377 bp, respectively, but they are characterized by a divergent, subfamily specific region of approximately 100 bp (Feliciello ). The genome sequencing project of T. castaneum has recently been completed (Richards ). Sequencing involved the euchromatic portion of the genome, with >20% of the genome, corresponding to heterochromatic regions, excluded due to technical difficulties. In this article, we searched for the presence of TCAST satellite-homologous elements within the assembled T. castaneum genome by using a comprehensive computational analysis. By searching the sequenced T. castaneum genome, we found 68 TCAST satellite DNA arrays within the euchromatin of all chromosomes. They were mapped to 5′ or 3′ ends, as well as within introns, of more than 100 protein-coding genes. Based on sequence characteristics, dispersed TCAST-like elements were classified into two groups. The first group includes partial TCAST satellite monomers or short arrays of tandemly arranged monomers up to tetramers. The second group contains TCAST-like element embedded within complex repeat units that contain two hallmarks of DNA transposons, terminal inverted repeats and target-size duplications. The evolutionary relationship and possible modes of dispersion of the two types of dispersed TCAST-like sequences are discussed. In addition, we examined the sequence divergence, phylogenetic relationship, and chromosomal distribution of the elements. Annotation, characterization, and classification of genes within the region of TCAST-like elements are reported, with the preferential localization of TCAST-like elements near specific groups of genes identified. Our results demonstrate for the first time, the enrichment of satellite DNA-like elements in the vicinity of genes with immunoglobulin-like domains and suggest their possible gene-regulatory role.

Materials and Methods

BLASTN version 2.2.22+ was used to screen the NCBI refseq_genomic database of T. castaneum. All scaffolds that have not been mapped to linkage groups were also screened. The program was optimized to search for highly similar sequences (megablast) to the query sequence [TCAST consensus sequence (Ugarković )]. Genes flanking TCAST–homologous elements were found automatically by NCBI blast. Sequences corresponding to hits, as well as their flanking regions, were analyzed by dot plot (http://www.vivo.colostate.edu/molkit/dnadot/), using standard parameters (window size 9, mismatch limit 0), or more relaxed conditions (window size 11, mismatch limit 1), to determine the exact start and end site of specific TCAST-like elements. The TCAST transposon-like elements were analyzed in detail for the presence of hallmarks such as terminal inverted repeats (TIRs) and target-site duplications with the aid of the Gene Jockey sequence analysis program (for Apple Macintosh). Secondary structures were determined using the default parameters of the MFOLD program available online [http://mfold.rna.albany.edu/?q=mfold (Zuker 2003)]. AT content was analyzed using BioEdit Sequence Alignment Editor (Hall 1999). Repbase, a reference database of eukaryotic repetitive DNA, was screened using WU-BLAST (Kohany ). Sequence alignment was performed using MUSCLE algorithm (Edgar 2004) combined with manual adjustment. All sequences were included in the alignment, with the exception of the ones that did not at least partially overlap with other sequences. Gblocks was used to eliminate poorly aligned positions and divergent regions of the alignments (Talavera and Castresana 2007). Alignments (original fasta files) are available upon request. jModelTest 0.1.1 software (Posada 2008) was used to infer best-fit models of DNA evolution—TPM3uf+G for transposon-like and A type elements and TPM1uf for B type elements. Maximum likelihood (ML) trees were estimated with the PhyML 3.0 software (Guindon and Gascuel 2003) using best-fit models. Markov chain Monte Carlo Bayesian searches were performed in MrBayes v. 3.1.2. (Huelsenbeck and Ronquist 2001) under the best-fit models (two simultaneous runs, each with four chains; 3 × 106 generations; sampling frequency one in every 100 generations; majority rule consensus trees constructed based on trees sampled after burn-in). Branch support was evaluated by bootstrap analysis (1000 replicates) in ML and by posterior probabilities in Bayesian analyses. Pairwise sequence diversity (uncorrected P) was calculated using the MEGA 5.05 software (Tamura ). T. castaneum gene homologs in Drosophila melanogaster were searched using the OrthoDB Phylogenomic database. Each gene has OrthoDB identificator, with Uniprot data linked to OrthoDB (Waterhouse ). To find sets of biological annotations that frequently appear together and are significantly enriched in a set of genes located near TCAST-like elements, program GeneCodis 2.0 available online (http://genecodis.dacya.ucm.es/) was used. GeneCodis generates statistical rank scores for single annotations and their combinations. To find all the possible combinations of annotations, GeneCodis uses the apriori algorithm introduced by Agrawal . Once the annotations were extracted, a statistical analysis based on the hypergeometric distribution or the χ2 test of independence was executed to calculate the statistical significance (P values) for each individual annotation or co-annotations. Two-tailed hypergeometric test with Bonferroni correction (alpha = 0.025) was used to analyze the distribution of TCAST-like elements among T. castaneum chromosomes. In each chromosome the frequency of TCAST-like elements was compared with the frequency in the complete sample and the significance of deviations was calculated.

Results

Identification of dispersed TCAST-like elements

Using the consensus sequence of TCAST satellite DNA (Ugarković ) as a query sequence, we screened the NCBI refseq_genomic database of T. castaneum with the alignment program BLASTN version 2.2.22+. The program was optimized to search for highly similar sequences (megablast) and blast hits on the query sequence were analyzed individually. Alignments were mapped regarding start and end site, chromosome number, and total length. When the distance between two alignments on the same chromosome was short, the genomic sequence was further analyzed by dot plot to identify any potential continuity between the two alignments. Only genomic sequences with at least 140 nt (40% of TCAST monomer length) of continuous sequence and >80% identity to the TCAST consensus sequence were considered for further analysis. The total number of dispersed TCAST-like elements was 68, with 36 elements flanked by genes at both 5′ and 3′ ends, 3 elements flanked by a single gene either at 5′ or 3′ end (sequences no. 36, 39, 50), and the 29 elements positioned within introns (Table 1). Except 68 TCAST-like elements associated with genes, no other dispersed TCAST-like elements were found within the assembled T. castaneum genome. Analysis of scaffolds that have not been mapped to linkage groups revealed the presence of an additional 41 TCAST-like elements, but because they were not mapped to T. castaneum genome and could possibly derive from heterochromatin, we did not consider them for further analysis.
Table 1

TCAST-like elements associated with genes within T. castaneum euchromatin

UniprotEntrezGene NameChrSat_seq.PositionDistance, bpDM HomologFBgnTypeLengthCopies
D6WZP1662564Altered disjunction915′18,773Q9VEH1FBgn0000063Satellite7342.0
D6WZP3662624Ras-related protein Rab-26913′7795Q9VP48FBgn0086913Satellite7342.0
D6WZL9661947Probable serine/threonine-protein kinase92InsideQ0KHT7FBgn0052666Satellite9932.8
D6X226660275Arrest935′99,669Q8IP89FBgn0000114Satellite7162.0
D6X238661741Numb933′115,984P16554FBgn0002973Satellite7162.0
100141832no match on uniprot945′1520Satellite5171.4
D6X2D0660440Short-chain dehydrogenase943′6704Q9VE80FBgn0038610Satellite5171.4
D6X1E7656884Cytochrome P450 306A1955′404,270Q9VWR5FBgn0004959Satellite10582.9
D6X2U7656977Elongase953′9947Q9VCY6FBgn0038986Satellite10582.9
D6X2C4660195Dopamine receptor 196InsideP41596FBgn0011582Satellite3040.8
D6X2U7656977Elongase975′7128Q9VCY6FBgn0038986Satellite3941.1
D6X366657055elongation of very long chain fatty acids protein973′50,111Q9VCY5FBgn0053110Satellite3941.1
D6X0D7657748Ret oncogene985′56,625Q8INU0FBgn0011829Satellite2130.6
D6X0E1657829Dpr9983′62,781Q9VFD9FBgn0038282Satellite2130.6
D6X2H8654954ADAM metalloprotease99InsideQ6QU65FBgn0051314Transposon1107
D6X2U7655561Elongase9105′47,902Q9VCZ0FBgn0038983Transposon1085
D6X2V3655640Putative uncharacterized protein9103′67,953Q9VDB7FBgn0038881Transposon1085
D6X244655011Serine/threonine-protein kinase 32B911InsideQ0KID3FBgn0052944Transposon1062
D6X374100141521Putative uncharacterized protein912InsideQ9VGZ4FBgn0037814Satellite2920.8
D6X2C4660195Dopamine receptor 1913InsideP41596FBgn0011582Transposon900
D6X259656290Transport and Golgi organization 139145′9456Q9VGT8FBgn0040256Satellite2220.6
D6X260656373Protein-tyrosine sulfotransferase9143′33,523Q9VYB7FBgn0086674Satellite2220.6
D6X075658603MICAL-like protein9155′39,684Q9VU34FBgn0036333Satellite2030.6
D6X1P2658891tiptop9153′142,821Q9U3V5FBgn0028979Satellite2030.6
D6X095659195Troponin C9165′3922P47947FBgn0013348Transposon589
D6X0I1659336Troponin C9163′15,143P47947FBgn0013348Transposon589
D6X1J0655713Transporter917InsideQ9NB97FBgn0034136Satellite9152.5
D6WF56100141877zinc finger protein 2503185′125,685Q7KAH0FBgn0027339Satellite12083.4
D6WF61656924Transcription initiation factor TFIID subunit 73183′64,294Q9VHY5FBgn0024909Satellite12083.4
D6WGB1659040Mahya3195′115,537P20241FBgn0002968Satellite6871.9
D6WGB5659201V-type proton ATPase subunit E3193′82,217P54611FBgn0015324Satellite6871.9
D6WII0100141571NADH dehydrogenase, putative3205′4278Q9W3N7FBgn0029971Transposon635
D6WII2100142263Putative uncharacterized protein3203′18,585Q9VIY1FBgn0032769Transposon635
D6WFT8657535WD repeat-containing protein 47321InsideQ960Y9FBgn0026427Satellite6041.7
D6WDY2656125Kynurenine aminotransferase3225′10,696Q8SXC2FBgn0037955Transposon1000
D6WDY4656298Annexin IX3223′11,599P22464FBgn0000083Transposon1000
D6WFK8656174ankyrin 2,3/unc44323InsideQ7KU95FBgn0085445Transposon1081
D6WFX1657874ral guanine nucleotide exchange factor3245′64,025Q8MT78FBgn0034158Transposon888
D6WFX3658031galactose-1-phosphate uridylyltransferase3243′3958Q9VMA2FBgn0031845Transposon888
D6WDQ4659233Putative uncharacterized protein3255′15,051Q8T0R9FBgn0038809Transposon1016
D6WDQ6659376coiled-coil domain containing 963253′9162A1ZA72FBgn0013988Transposon1016
D6WF68655042glucose dehydrogenase3265′25,896Q9VY00FBgn0030598Transposon1067
C3XZ92655348Mitogen-activated protein kinase kinase kinase kinase 23263′92,876Q8SYA1FBgn0034421Transposon1067
D6WE82658463Putative uncharacterized protein327InsideQ9VDK2FBgn0038815Transposon314
D6WHX6658191Putative uncharacterized protein3285′173,881Q1RKQ9FBgn0085382Transposon826
D6WI58658343Cathepsin L3283′82,559Q95029FBgn0013770Transposon826
D6WDJ9656922Putative uncharacterized protein3295′173,548Q8IPJ1FBgn0031859Transposon1084
D6WDN0657559PRMT53293′383,809Q9U6Y9FBgn0015925Transposon1084
D6WGS3656976Putative uncharacterized protein3305′37572A0AMQ8FBgn0034655Satellite2160.6
D6WGT0100142515calpain 33303′226,707Q11002FBgn0008649Satellite2160.6
D6WDS8660532Muscle-specific protein 3003315′378,626Q4ABG9FBgn0260952Transposon1060
D6WDT0654860Phosphatidylinositol-binding clathrin assembly protein3313′7855C1C3H4FBgn0086372Transposon1060
D6WHF2664188Nephrin332InsideQ9W4T9FBgn0028369Transposon666
D6WI96100142620Heat shock protein 70333InsideP11147FBgn0001219Transposon1058
D6WG02654917N-acetylglucosaminyltransferase vi334InsideQ9VUH4FBgn0036446Transposon319
D6WYD1656891Putative uncharacterized protein8355′385,712Q8SY79FBgn0032249Satellite6251.7
D6WYN3654942serine-type protease inhibitor8353′58,583Q9VSC9FBgn0035833Satellite6251.7
D6WYA1657913Copia protein (Gag-int-pol protein)8363′262B6V6Z8??Satellite1960.5
D6WYC9656718Cmp-n-acetylneuraminic acid synthase837InsideB5RJF3FBgn0052220Transposon831
D6WV42656028CG5080838InsideQ7K3E2FBgn0031313Transposon582
D6WYA0100142507Beaten path8395′7165Q94534FBgn0013433Transposon1181
D6WUX6662235Putative uncharacterized protein840InsideQ7KUK9FBgn0036454Transposon440
D6X0E1654938defective proboscis extension response741InsideQ9VFD9FBgn0038282Satellite7222.0
D6WPX8662021Ribosome-releasing factor 2, mitochondrial7425′17,480Q9VCX4FBgn0051159Transposon905
A2AX72662058Gustatory receptor7423′1581Q9VPT1FBgn0041250Transposon905
D6WTD1661895similar to chitinase 6743InsideQ9W2M7FBgn0034580Satellite14404.0
D6WPE6100142073voltage-gated potassium channel744InsideP17970FBgn0003383Transposon814
D2A2C6663849Putative uncharacterized protein4455′9489Q9V3S3FBgn0013300Satellite5491.5
D2A2D1663875Putative uncharacterized protein4453′10,920Q9W191FBgn0034994Satellite5491.5
D2A2I0657017Putative uncharacterized protein4465′5820Q8SZ28FBgn0033786Satellite5581.6
D2A2I1657098Ribonucleoside-diphosphate reductase4463′7000P48591FBgn0012051Satellite5581.6
D1ZZG6660983Kinesin-like protein447InsideQ9VLW2FBgn0031955Transposon508
D2A2P8100142595PiggyBac transposable element448InsideQ9VHL1FBgn0037633Transposon377
D6WB65655028E742495′60,525P20105FBgn0000567Satellite7702.1
D6WB73654962organic cation transporter2493′2638Q7K3M6FBgn0034479Satellite7702.1
D6WBG8659129pre-mRNA-splicing helicase BRR22503′4811Q9VUV9FBgn0036548Satellite728
D6WB14658844monophenolic amine tyramine2515′7955P22270FBgn0004514Transposon567
D6WB15658769Cuticular protein 47Ef2513′16,173A1Z8H7FBgn0033603Transposon567
D6WB29657778Endoprotease FURIN252InsideP30432FBgn0004598Transposon1045
A8DIV5657942Nicotinic acetylcholine receptor subunit alpha112535′13,645P25162FBgn0004118Transposon1021
D6WB29657778Endoprotease FURIN2533′4875P30432FBgn0004598Transposon1021
D6X3I9661787Transcription initiation factor IIF10545′10,025Q05913FBgn0010282Satellite8702.4
D6X3J1661827Putative uncharacterized protein10543′6607Satellite8702.4
D6X4P3655389Neutral alpha-glucosidase ab1055InsideQ7KMM4FBgn0027588Satellite6941.9
D6X3H5661246Neurexin-410565′2234Q94887FBgn0013997Satellite2240.6
D6X3H7661308Succinate semialdehyde dehydrogenase10563′14,901Q9VBP6FBgn0039349Satellite2240.6
D6X4V6655916Tubby, putative1057InsideQ9VB18FBgn0039530Transposon763
D6X3J6662034Putative uncharacterized protein10585′1015Q9VEJ9FBgn0038511Satellite5641.6
D6X3J7657069cdc73 domain protein10583′27,239Q9VHI1FBgn0037657Satellite5641.6
D2A693663231lysine-specific demethylase 4B659InsideQ9V6L0FBgn0053182Satellite4981.4
D2A490659655Facilitated trehalose transporter Tret1-2 homolog660InsideQ8MKK4FBgn0033644Transposon689
D2A6I4659728Putative uncharacterized protein6615′116,030Q9W4G2FBgn0260971Transposon764
D2A6I6659791Putative uncharacterized protein6613′4860Q9VNB4FBgn0037323Transposon764
D2A3V0657272Fasciclin-36625′37,286P15278FBgn0000636Satellite2810.8
D2A3V3657421LIM domain kinase 16623′21,789Q8IR79FBgn0041203Satellite2810.8
D6W8F4660322Disco-relatedx63InsideQ9VXJ5FBgn0042650Satellite5301.5
D6W8D3659123PlexAx645′1973O96681FBgn0025741Transposon848
D6WGD2659272Aldose-1-epimerasex643′6472Q9VRU1FBgn0035679Transposon848
B3MMG1657652Neural-cadherin565InsideO15943FBgn0015609Satellite2730.8
D6WNN6658579Transient receptor potential-gamma protein5665′2547Q9VJJ7FBgn0032593Transposon894
A3RE80658661Cardioacceleratory peptide receptor5663′27,105Q868T3FBgn0039396Transposon894
A1JUG2661207Ultraspiracle567InsideP20153FBgn0003964Satellite3791.1
D6WNB3656063Y box protein5685′14,993O46173FBgn0022959Satellite4551.3
D6WNB6656095Peptide chain release factor 15683′350,365Q9VK20FBgn0032486Satellite4551.3

A list of genes with gene identity numbers, gene name, chromosomal location, position, and distance relative to the associated TCAST-like element, as well as a list of TCAST-like elements, their types (satellite or transposon-like), total length in bp, and copy number of satellite repeats within an array are shown.

A list of genes with gene identity numbers, gene name, chromosomal location, position, and distance relative to the associated TCAST-like element, as well as a list of TCAST-like elements, their types (satellite or transposon-like), total length in bp, and copy number of satellite repeats within an array are shown. There were only three cases in which two different TCAST-like elements were associated with the same gene: gene D6X2C4 contains TCAST-like sequences no. 6 and 13 within introns, gene D6X2U7 is flanked at 5′ and 3′ end by sequences no. 5 and 7, respectively, whereas gene D6WB29 is located at 3′ end of the sequence no. 53 and has sequence no. 52 within an intron. All other TCAST-like elements were positioned near or within different genes. Thus in total, there were 101 genes found in the vicinity of TCAST-like elements. Characteristics of the genes associated with TCAST-like elements, including gene identity number, gene name and chromosomal location, position relative to the associated TCAST-like element, and distances between TCAST-like elements and genes, are shown in Table 1 Distances between TCAST-like elements and genes range from 262 nt (gene positioned at 3′ site of the sequence no. 36), to a maximal distance of 404,270 nt (gene positioned at 5′ site of the sequence no. 5).

Characteristics of TCAST-like elements

TCAST satellite-like elements:

Sequence analysis of the 68 TCAST-like elements identified within the vicinity of genes enabled their classification into two groups. The first group contains partial TCAST satellite monomers or tandemly arranged elements, either complete or partial dimers, trimers, or tetramers (Table 1). The minimal size of satellite repeat was 203 nt (0.6 of complete TCAST monomer; sequence no. 15), whereas the maximal size was 1440 nt (four complete TCAST monomers; sequence no. 43; Table 1). In many sequences, two subtypes of TCAST satellite monomers were mutually interspersed: Tcast1a and Tcast1b. Tcast1b corresponds to the TCAST satellite consensus that was used as a query sequence (Ugarković ), and Tcast1a corresponds to the TCAST subfamily described in Feliciello . Tcast1a and Tcast1b have an average homology of 79% and are of similar sizes at 362 bp and 377 bp respectively, but are characterized by a divergent, subfamily specific region of approximately 100 bp (Feliciello ). There were 34 TCAST satellite-like elements found within or in the region of 53 genes. Lengths of TCAST satellite-like elements (Table 1), their exact start and end sites within genomic sequence and composition (supporting information, Table S1) are provided. To see whether there is any clustering of sequences of TCAST satellite-like elements due to the difference in the homogenization at the level of local array, chromosome, or among different chromosomes, sequence alignment and phylogenetic analysis were performed. Tcast1a and Tcast1b subunits were extracted from TCAST satellite-like sequences and analyzed separately. Alignment was performed on 24 Tcast1a subunits, ranging in size from 136 and 377 bp (File S1). The average pairwise distances between Tcast1a subunits of TCAST satellite-like sequences was 5.8%. Alignment adjustment using Gblocks, which eliminates poorly aligned positions and divergent regions, resulted in few changes; therefore, the original, unadjusted alignment was used for the construction of phylogenetic trees. Because the sequences differ in lengths and comprise regions of divergent variability, methods that take into account specific models of DNA evolution were considered as the most suitable for the construction of phylogenetic trees, ML and Bayesian (Markov chain Monte Carlo). The ML tree showed weak resolution with no significant support for clustering of sequences derived from the same satellite-like array or from the same chromosome. Similarly, the Bayesian tree demonstrated no significant sequence clustering (Figure 1A).
Figure 1 

Bayesian/ML phylogenetic trees of: (A) TCAST satellite-like elements (subunits Tcast1a), (B) TCAST satellite-like elements (subunits Tcast1b), and (C) TCAST transposon-like elements. Sequence numbers correspond to those in Table 1. When a particular sequence is composed of few subrepeats (e.g., Tcast1a or Tcast1b), numbers indicating subrepeats are added (e.g., 43_1, 43_2, 43_3). Numbers in brackets indicate chromosomes on which the corresponding sequences are located. Numbers on branches indicate Bayesian posterior probabilities/ML bootstrap support (above 0.5/50%, respectively).

Bayesian/ML phylogenetic trees of: (A) TCAST satellite-like elements (subunits Tcast1a), (B) TCAST satellite-like elements (subunits Tcast1b), and (C) TCAST transposon-like elements. Sequence numbers correspond to those in Table 1. When a particular sequence is composed of few subrepeats (e.g., Tcast1a or Tcast1b), numbers indicating subrepeats are added (e.g., 43_1, 43_2, 43_3). Numbers in brackets indicate chromosomes on which the corresponding sequences are located. Numbers on branches indicate Bayesian posterior probabilities/ML bootstrap support (above 0.5/50%, respectively). Alignment of 28 Tcast1b subunits, ranging from 159 bp to 363 bp (File S2), was also not significantly affected by adjustment with Gblocks; therefore, the unadjusted alignment was used for the construction of phylogenetic trees (Figure 1B). The average pairwise divergence between Tcast1b subunits, of TCAST satellite-like sequences, was 6.7%. With the ML phylogenetic tree, four groups composed of two or three sequences, were resolved by relatively low bootstrap values. However, the majority of Tcast1b subunit sequences remained unresolved. There was no clustering of subunits derived from the same array or the same chromosome (Figure 1B). Bayesian tree analysis produced one significantly supported cluster composed of 10 sequences derived from 7 chromosomes (Figure 1B).

TCAST transposon-like elements:

The second group of TCAST-like repeats is represented by a complex element that contains an almost complete TCAST (or Tcast1b) monomer, and a TCAST monomer segment of approximately 121 bp in an inverted orientation. These two TCAST segments are separated by a nonsatellite sequence of approximately 306 bp. Both TCAST segments are part of TIRs that are approximately 269 bp long (Figure 2). As a result of the long TIRs, these elements are likely to form stable hairpin secondary structures and therefore resemble transposons. The nonsatellite part of sequence, common for all TCAST transposon-like elements, is unique in that it does not exhibit significant homology to any other sequence within the T. castaneum genome. There were 34 TCAST transposon-like elements found within or in the vicinity of 50 genes. Their lengths (Table 1) and exact start and end sites within genomic sequence (Table S1) are provided. Sequence analysis of TCAST transposon-like elements determined that 13 of them were > 1000 bp, with a maximal size of 1181 bp (Table 1). The remaining TCAST transposon-like elements were shorter, with a minimal size of 314 bp (sequence no. 27), and usually lacking part of, or one or both, TIRs. Conserved TIRs are necessary for transposition, and if they are absent, truncated, or mutated so that the transposase cannot interact with the transposon sequence, the transposon cannot be mobilized and therefore represents a molecular fossil of a once active transposon (Capy ). Despite mutations and partial truncations of TIRs within the TCAST transposon-like elements, and likely because of the length of the TIRs, most of the elements still preserve a stable secondary structure and could potentially remain mobile.
Figure 2 

Organization of TCAST elements within T. castaneum genome in the form of TCAST transposon-like element, tandem arrays, and CR1-3_TCa retrotransposon. Regions corresponding to TCAST element are shown in red. TCAST transposon-like element contains an almost complete TCAST monomer and a monomer segment of approximately 121 bp in an inverted orientation, whereas CR1-3 retrotransposon contains segment corresponding to 1.2 monomer. Within TCAST transposon-like element terminal inverted repeats (arrows) unique nonsatellite sequence (green), target-site duplication in the form of “ACT,” and the insertion point of 925-bp sequence found within TR 1.9, element and coding for the putative transposase are shown. Three short ORFs within TCAST transposon-like element are also indicated. Within nonlong terminal repeat retrotransposon CR1-3_TCa regions corresponding to 5′UTR and to two ORFs are indicated.

Organization of TCAST elements within T. castaneum genome in the form of TCAST transposon-like element, tandem arrays, and CR1-3_TCa retrotransposon. Regions corresponding to TCAST element are shown in red. TCAST transposon-like element contains an almost complete TCAST monomer and a monomer segment of approximately 121 bp in an inverted orientation, whereas CR1-3 retrotransposon contains segment corresponding to 1.2 monomer. Within TCAST transposon-like element terminal inverted repeats (arrows) unique nonsatellite sequence (green), target-site duplication in the form of “ACT,” and the insertion point of 925-bp sequence found within TR 1.9, element and coding for the putative transposase are shown. Three short ORFs within TCAST transposon-like element are also indicated. Within nonlong terminal repeat retrotransposon CR1-3_TCa regions corresponding to 5′UTR and to two ORFs are indicated. Some TCAST transposon-like elements >1000 bp have a 3-bp duplication at the site of insertion in the form of ACT. One TCAST transposon-like element (sequence no. 39) is inserted into another repetitive DNA, indicated as Tcast2, which had been previously identified bioinformatically (Wang ). Sequence analysis of this transposon-like element confirms the continuity of Tcast2 from the duplication site “ACT.” Typically, the size of target-site duplication is a hallmark of different superfamilies of eukaryotic DNA transposons, with mariner/Tc1, the only superfamily whose members are characterized by either 2- or 3-bp target-site duplication (Capy ; Kapitonov and Jurka 2003; Feschotte and Pritham 2007). There are three open reading frames (ORFs) within TCAST transposon-like sequences, but the resulting putative proteins are very short and do not share similarity with any other proteins (Figure 2). The elements therefore do not code for transposases and are considered nonautonomous. Using the whole TCAST transposon-like elements as a query sequence, we searched the T. castaneum Gen Bank database for “full-sized” homologous elements that could potentially code for transposases and be considered autonomous. The search identified an element, named TR 1.9, with a 925-bp sequence inserted within a unique sequence of the TCAST transposon-like elements (Figure 2). This 925-bp sequence contains an ORF of 206 amino acids and a conserved domain belonging to the Transposase 1 superfamily, which also includes the mariner transposase. DNA transposons of the mariner/Tc1 superfamily Mariner-1_TCa and Mariner-2_TCa, were identified within the T. castaneum genome (Jurka 2009a, 2009b). Using BLASTP and the translated sequence from the 925 bp ORF as a query sequence, we identified hits with a partial homology to a Mariner-2_TCa transposase and to a mariner-like element transposase present in two other insects, the beetle Agrilus planipennis (emerald ash borer) and Chrysoperla plorabunda (green lacewing; Neuroptera), but not to Mariner-1_ TCa transposase. To test whether there is any chromosome-specific sequence clustering of TCAST transposon-like sequences that could suggest difference in homogenization within chromosome and among different chromosomes, the alignment and subsequent phylogenetic analysis of TCAST transposon-like sequences was performed. Because TCAST transposon-like elements differ significantly in size (314−1181 nt), the alignment and phylogenetic analyses was performed on 25 elements that mutually overlap in their sequences, whereas the other nine TCAST transposon-like elements were excluded from the analysis due to the very low overlapping with other elements. Alignment was additionally adjusted using Gblocks (File S3). The average pairwise divergence among TCAST transposon-like sequences was 12.7%. ML and Bayesian methods gave similar tree topologies (Figure 1C). The ML tree showed very weak resolution of TCAST transposon-like sequences and a general absence of subgroups with specific sequence characteristics (Figure 1C). Only two clusters were formed whereas, using the Bayesian tree, we identified three well-supported groups; two of them were as for ML tree (Figure 1C).

Distribution of TCAST-like elements on T. castaneum chromosomes

TCAST-like elements found in the vicinity of genes were distributed on all 10 T. castaneum chromosomes (Table 1). Positions of constitutive heterochromatin and euchromatin were assigned on the haploid set of T. castaneum chromosomes, based on C-banding data (Stuart and Mocelin 1995) and Tribolium castaneum 3.0 Assembly data (Figure 3). Within euchromatic segments, the position of each TCAST-like element is specifically indicated (Figure 3) based on the position within the genomic sequence (Table S1). TCAST-like elements were dispersed on both arms of chromosomes 3, 5, 9, and 7, whereas on other chromosomes they were located on a single arm (Figure 3). The number of TCAST-like elements ranged from 2 on chromosome 1(X) to 17 on chromosomes 3 and 9. To detect whether TCAST-like elements were distributed randomly among the T. castaneum chromosomes or whether there was a significant over or underrepresentation of the elements on some chromosomes we performed hypergeometric distribution analysis test. The analysis revealed no statistically significant deviation in the number of TCAST-like elements among the chromosomes (Figure S1), pointing to their random distribution.
Figure 3 

Distribution of TCAST-like elements on T. castaneum chromosomes. The karyotype representing the haploid set of T. castaneum chromosomes, and positions of constitutive heterochromatin (dark) and euchromatin (white) are depicted based on C-banding data (Stuart and Mocelin 1995) and T. castaneum 3.0 assembly (http://www.beetlebase.org). TCAST transposon-like elements (blue) and TCAST satellite-like elements (red) are shown. Two TCAST-like elements are represented as separate lines if they are at least 100 kb distant from each other.

Distribution of TCAST-like elements on T. castaneum chromosomes. The karyotype representing the haploid set of T. castaneum chromosomes, and positions of constitutive heterochromatin (dark) and euchromatin (white) are depicted based on C-banding data (Stuart and Mocelin 1995) and T. castaneum 3.0 assembly (http://www.beetlebase.org). TCAST transposon-like elements (blue) and TCAST satellite-like elements (red) are shown. Two TCAST-like elements are represented as separate lines if they are at least 100 kb distant from each other. To determine whether there was a target preference for the insertion of TCAST-like elements, for example high AT content or another sequence characteristic, we analyzed the AT content within 100 bp of the flanking regions for each TCAST -like element, from both 5′ and 3′ sites (Figure S2 and Figure S3). The average AT content of the flanking regions for both TCAST satellite-like elements and TCAST transposon-like elements did not differ significantly from the average AT content of the whole T. castaneum genome or from the AT content of randomly selected intergenic regions and introns. Thus, this finding suggests that with regard to AT content, there is no target preference for the insertion of TCAST-like elements. Furthermore, alignment and comparison of all flanking sequences of TCAST-like elements did not identify any common sequence motifs.

Genes in the vicinity of TCAST-like elements

Uniprot gene numbers were used as identifiers of genes located in the vicinity of TCAST-like elements (gene names shown in Table 1). Uniprot gene numbers for homologous genes found in Drosophila melanogaster are also indicated (Table 1). Detailed description of the genes, including molecular function of their protein products, biological processes in which these proteins are involved, and their cellular localization (cellular component), are shown (Table S1). Each identified gene is assigned to a particular TCAST-like element within its vicinity, and the precise position of TCAST-like elements in genomic sequence (start and end site) is indicated (Table S1). Functional analysis revealed that 17 of 101 genes correspond to putative uncharacterized proteins, whereas the remaining genes are involved in different molecular functions and diverse biological processes. Among the proteins, a proportion is characterized by ATP binding activity (13 proteins) and involvement in protein phosphorylation and /or signal transduction (9 proteins; Table S1). To determine whether TCAST-like elements are distributed randomly relative to genes or whether they are overrepresented near specific groups of genes, we used GeneCodis 2.0 to provide a statistical representation of the genes associated with TCAST-like elements. Because many genes are still not annotated in T. castaneum and furthermore T. castaneum genomic data are not included in GeneCodis, we used gene numbers for orthologous genes from D. melanogaster for the analysis and compared them with the whole set of 14,869 genes annotated in D. melanogaster. Genecodis analysis revealed that TCAST-like elements are located near nine genes characterized as members of the immunoglobulin protein superfamily. Because there are only 134 immunoglobulin-like genes present within the total set of D. melanogaster genes, random distribution of TCAST-like elements would result in their occurrence near approximately a single immunoglobulin-like gene. The presence of TCAST-like elements in the vicinity of nine immunoglobulin-like genes therefore represents a statistically significant overrepresentation (0.00000427). All nine genes exhibit structural features of immunoglobulin-like, immunoglobulin subtype 1 and immunoglobulin subtype 2 proteins and are associated with the following TCAST transposon-like elements: 25 at the 3′end, 28 and 39 at the 5′ end, 32 and 40 within introns, and TCAST satellite-like elements: 8 at the 3′ end, 19 and 62 at the 5′ end, and 41 within intron (Table 1). A minimal distance between TCAST-like element and immunoglobulin-like gene was 7165 bp and a maximal 173,881 bp (Table 1). Molecular function of most of immunoglobulin-like genes is unknown, and they are involved in different biological processes such as cell adhesion, protein phosphorylation, and axon guidance (Table S1). Although all nine genes belong to immunoglobulin superfamily, they did not exhibit sequence similarity, which could suggest role of duplication in their evolution and spreading. The position of TCAST-like elements relative to the genes also was not consistent with the possibility that TCAST-like elements duplicated along with the immunoglobulin- like genes. Overrepresentation of TCAST-like elements was also found near genes that exhibit ATP-binding activity and axon guidance properties but with a marginal significance (0.0183374 and 0.00865139). For the rest of genes, no significant overrepresentation of TCAST-like elements was detected. Thus, enrichment of TCAST-like elements in the vicinity of immunoglobulin-like genes potentially implicates a role of TCAST-like elements in the regulation of these genes.

Discussion

TEs are classified in several dozen families based on transposition mechanisms and different dynamics properties (Hua-Van ). Active TEs encode the enzymes necessary for their transposition, either to move between nonhomologous regions in the genome or to copy themselves to other positions. In many cases, TEs do not produce their own enzymes but are able to use those from functional copies or even from other TEs families. Defective and inactive TEs often are amplified in regions of low recombination such as heterochromatin and may form tandemly repeated satellite DNAs. The origin of satellite DNA array from transposon-like elements is reported for many insects such as Drosophila melanogaster (Agudo ), Drosophila guanche (Miller ), and the beetle Misolampus goudoti (Pons 2004) whereas the retroviral-like features were first observed in the satellite DNA from rodents of the genus Ctenomys (Rossi ). Transposons can be inserted into other repetitive sequences such as satellite DNAs, as has been observed for the mariner-like element and MITE element, both inserted into satellite DNA of the ant Messor bouvieri (Palomeque ). Searching for repetitive elements homologous to the TCAST repeat within Repbase (http://www.girinst.org/repbase/) revealed that 5′ UTR of nonlong terminal repeat retrotransposon CR1-3_TCa (Jurka 2009c) shares a high similarity of 83% with a 444-bp long TCAST sequence composed of 1.2 tandem monomers (Figure 1). Other CR1 subfamilies identified within T. castaneum such as CR1-1_ TCa, CR1-2_TCa, and CR1-4_TCa, published in Repbase, do not share similarity to CR1-3 and do not contain TCAST similar sequence. We propose that CR1-3 was inserted within TCAST satellite array and through recombination has acquired a part of TCAST sequence. Newly acquired TCAST element could act as a promoter because TCAST satellite DNA has an internal promoter for RNA Pol II (Pezer and Ugarković 2012) and becomes a new functional 5′ UTR. Subsequent retrotransposition of CR1-3_TCa could explain the dispersion of TCAST within the euchromatin (Figure 4). Three CR1-3_TCa elements with TCAST in the 5′UTR were identified within scaffolds that have not been mapped to linkage groups. However, truncated fragments with partial homology to CR1-3_TCa retrotransposon can be mapped within T. castaneum genome, some of them in the vicinity of TCAST elements. Such arrangement also indicates the role of CR1-3_TCa in the spreading of TCAST elements. There is also a possibility that TCAST satellite DNA originates from CR1-3 retrotransposon which was, after inactivation, amplified within the heterochromatin region. In the case of TCAST transposon-like elements, part of the satellite sequence is incorporated within TIRs which are characteristic for DNA transposons. The presence of target-site duplications at the sites of insertions of some TCAST transposon-like elements also indicates transposition as a mode of spreading of TCAST elements. Parts of satellite DNA elements can be found within some transposons, such as pDv transposon (Evgen'ev ; Zelentsova ) whose long direct terminal repeats show significant sequence similarity to the pvB370 satellite DNA, located in the centromeric heterochromatin of a number of species of the Drosophila virilis group (Heikkinen ). The presence of short stretches of PisTR-A satellite DNA sequences within 3′ UTR of Ogre retrotransposons dispersed in the pea (Pisum sativum) genome was reported (Macas ). Furthermore, the mobilization of subtelomeric repeats upon excision of the transposable P element from tandemly repeated subtelomeric sequences has been observed (Thompson-Stewart ).
Figure 4 

Models of spreading of TCAT-like elements based on (A) retrotransposition of CR-3_TCa element. CR1-3_TCa was inserted within TCAST satellite array and through recombination has acquired a part of TCAST sequence, which could act as a promoter and become a new functional 5′UTR. Subsequent retrotransposition of CR1-3_TCa could explain the dispersion of TCAST within the euchromatin. (B) Rolling circle replication of TCAST satellite DNA sequences excised from their heterochromatin loci via intrastrand recombination, followed by reintegration into different genome locations by homologous recombination.

Models of spreading of TCAT-like elements based on (A) retrotransposition of CR-3_TCa element. CR1-3_TCa was inserted within TCAST satellite array and through recombination has acquired a part of TCAST sequence, which could act as a promoter and become a new functional 5′UTR. Subsequent retrotransposition of CR1-3_TCa could explain the dispersion of TCAST within the euchromatin. (B) Rolling circle replication of TCAST satellite DNA sequences excised from their heterochromatin loci via intrastrand recombination, followed by reintegration into different genome locations by homologous recombination. Incorporation of part of a TCAST satellite DNA sequence into a (retro)transposable element, and its subsequent mobilization and spreading by (retro)transposition, may explain the distribution of TCAST element in the vicinity of genes within euchromatin. Satellite DNA sequences are prone to undergo recurrent repeat copy number expansion and contraction in divergent lineages as well as among populations of the same species (Bosco ). This amplification appears to be random and does not correlate with phylogeny of the species (Pons ; Lee ; Bulazel ). Amplification of a satellite sequence is reported to occur as a result of unequal crossing over or duplicative transposition (Smith 1976; Ma and Jackson 2006). The discovery of human extrachromosomal elements originating from satellite DNA arrays in cultured human cells and different plant species indicates the possible existence of additional amplification mechanisms based on rolling-circle replication (Assum ; Navrátilová ). It has been proposed that satellite sequences excised from their chromosomal loci via intrastrand recombination could be amplified in this way, followed by reintegration of tandem arrays into the genome (Feliciello ). Moreover, it is possible that such a mechanism affected TCAST satellite DNA, and that extrachromosamal circles of TCAST were reintegrated into different genome locations by homologous recombination based on short stretches of sequence similarity between TCAST satellite and target genomic sequence (Figure 4). Integrated TCAST sequences are mainly composed of interspersed elements belonging to two major subfamilies, Tcast1a and Tcast1b, which is a prevalent type of organization in pericentromeric heterochromatin (Feliciello ). This finding indicates that the origin of dispersed euchromatic TCAST elements may be duplication of heterochromatin copies. The distribution of TCAST-like elements relative to protein coding genes revealed no specific preference for insertions within introns or at 5′ or 3′ ends of genes. TCAST-like elements are distributed on all chromosomes with no significant deviation in the number among the chromosomes, and phylogenetic analysis did not detect any significant sequence clustering of TCAST-like elements derived from the same chromosome. Dispersed TCAST satellite-like elements produce tandem arrays up to tetramers, but repeats from the same array do not reveal any significant clustering on phylogenetic trees. This finding indicates there is no significant difference in the homogenization of TCAST satellite-like repeats at the level of local arrays or chromosome or among different chromosomes. The average pair-wise sequence divergence (6% for dispersed TCAST satellite-like repeats) is greater than the usual divergence of satellite elements located in heterochromatin of tenebrionid beetles [approximately 2% (Ugarković )]. This difference in homogeneity between repeats located in heterochromatin and euchromatin may be explained by a lower rate of gene conversion affecting dispersed satellite-like elements or by a specific mechanism of DNA repair acting on satellite DNA (Feliciello ). TCAST transposon-like elements dispersed among the genes within euchromatin have an even greater average sequence divergence (approximately 12%) and also exhibit no significant chromosome-specific sequence clustering, indicating a similar rate of homogenization within and among the chromosomes. Relatively high sequence divergence of TCAST transposon-like elements and the significant truncation of the majority of them, indicates that the transposition of these elements did not occur very recently and that these elements could be considered as molecular fossils of the functional TCAST transposon-like elements. Cis-regulatory elements, such as promoters or transcription factor binding sites, are predicted in some satellite DNAs (Pezer ). Transcription from promoters for RNA Pol II is also characteristic for pericentromeric satellite DNAs from the beetles Palorus ratzeburgii and Palorus subdepressus (Pezer and Ugarković 2008, 2009). Temperature-sensitive transcription of TCAST satellite DNA from an internal RNA Pol II promoter has been demonstrated (Pezer and Ugarković 2012). Based on these findings, it can be proposed that TCAST elements located in the vicinity of genes may function as alternative promoters, and transcripts derived from them may interfere with the expression of neighboring gene. This type of regulation is often observed for retrotransposons positioned immediately 5′ of protein genes (Faulkner ). In addition, some tissue-specific gene promoters are derived from retrotransposons (Ting ; Samuelson ). Because of rapid evolutionary turnover, satellite DNA sequences often are restricted to a group of closely related species, or in some instances are species specific. This is the case with TCAST satellite DNA, which is not even detected in the congeneric Tribolium species. If restricted satellite DNAs have regulatory potential, then insertion of these elements in vicinity of genes could contribute to the establishment of lineage-specific or species-specific patterns of gene expression. Annotation of genes in proximity to TCAST-like elements demonstrated a statistical overrepresentation of certain groups of genes, for example, those with immunoglobulin-like domains. Recently, in the fish Salvelinus fontinalis, a regulatory role of a 32-bp satellite repeat, located in an intron of the major histocompatibility complex gene (MHIIβ), on MHIIβ gene expression was demonstrated (Croisetiere ). The level of gene expression depends on temperature, as well as the number of satellite repeats, and indicates a role for temperature-sensitive satellite DNA in gene regulation of the adaptive immune response. Further studies are necessary to determine whether TCAST-like elements exhibit a potential regulatory role on nearby genes. The transcriptional potential of satellite DNAs as well as their distribution close to protein-coding genes, as shown in this study, provides strong support, that in addition to transposons, satellite DNAs represent a rich source for the assembly of gene regulatory systems.
  49 in total

1.  Centromeres from telomeres? The centromeric region of the Y chromosome of Drosophila melanogaster contains a tandem array of telomeric HeT-A- and TART-related sequences.

Authors:  M Agudo; A Losada; J P Abad; S Pimpinelli; P Ripoll; A Villasante
Journal:  Nucleic Acids Res       Date:  1999-08-15       Impact factor: 16.971

2.  Endogenous retroviral sequences are required for tissue-specific expression of a human salivary amylase gene.

Authors:  C N Ting; M P Rosenberg; C M Snow; L C Samuelson; M H Meisler
Journal:  Genes Dev       Date:  1992-08       Impact factor: 11.361

Review 3.  Transposable elements and the evolution of regulatory networks.

Authors:  Cédric Feschotte
Journal:  Nat Rev Genet       Date:  2008-05       Impact factor: 53.242

4.  Transcription of Satellite DNAs in Insects.

Authors:  Zeljka Pezer; Josip Brajković; Isidoro Feliciello; Durđica Ugarković
Journal:  Prog Mol Subcell Biol       Date:  2011

5.  Chromatin immunoprecipitation cloning reveals rapid evolutionary patterns of centromeric DNA in Oryza species.

Authors:  Hye-Ran Lee; Wenli Zhang; Tim Langdon; Weiwei Jin; Huihuang Yan; Zhukuan Cheng; Jiming Jiang
Journal:  Proc Natl Acad Sci U S A       Date:  2005-07-22       Impact factor: 11.205

6.  Detection of a mariner-like element and a miniature inverted-repeat transposable element (MITE) associated with the heterochromatin from ants of the genus Messor and their possible involvement for satellite DNA evolution.

Authors:  Teresa Palomeque; José Antonio Carrillo; Martín Muñoz-López; Pedro Lorite
Journal:  Gene       Date:  2006-02-28       Impact factor: 3.688

7.  Thousands of human mobile element fragments undergo strong purifying selection near developmental genes.

Authors:  Craig B Lowe; Gill Bejerano; David Haussler
Journal:  Proc Natl Acad Sci U S A       Date:  2007-04-26       Impact factor: 11.205

8.  Transcription of pericentromeric heterochromatin in beetles--satellite DNAs as active regulatory elements.

Authors:  Z Pezer; D Ugarković
Journal:  Cytogenet Genome Res       Date:  2009-06-25       Impact factor: 1.636

9.  RNA Pol II promotes transcription of centromeric satellite DNA in beetles.

Authors:  Zeljka Pezer; Durdica Ugarković
Journal:  PLoS One       Date:  2008-02-13       Impact factor: 3.240

10.  Species-specific shifts in centromere sequence composition are coincident with breakpoint reuse in karyotypically divergent lineages.

Authors:  Kira V Bulazel; Gianni C Ferreri; Mark D B Eldridge; Rachel J O'Neill
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

View more
  25 in total

Review 1.  Structural and functional liaisons between transposable elements and satellite DNAs.

Authors:  Nevenka Meštrović; Brankica Mravinac; Martina Pavlek; Tanja Vojvoda-Zeljko; Eva Šatović; Miroslav Plohl
Journal:  Chromosome Res       Date:  2015-09       Impact factor: 5.239

Review 2.  Helitrons shaping the genomic architecture of Drosophila: enrichment of DINE-TR1 in α- and β-heterochromatin, satellite DNA emergence, and piRNA expression.

Authors:  Guilherme B Dias; Pedro Heringer; Marta Svartman; Gustavo C S Kuhn
Journal:  Chromosome Res       Date:  2015-09       Impact factor: 5.239

3.  First evidence of DNA methylation in insect Tribolium castaneum: environmental regulation of DNA methylation within heterochromatin.

Authors:  Isidoro Feliciello; Josip Parazajder; Ivana Akrap; Durđica Ugarković
Journal:  Epigenetics       Date:  2013-04-17       Impact factor: 4.528

4.  Satellite DNA Modulates Gene Expression in the Beetle Tribolium castaneum after Heat Stress.

Authors:  Isidoro Feliciello; Ivana Akrap; Đurđica Ugarković
Journal:  PLoS Genet       Date:  2015-08-14       Impact factor: 5.917

5.  Tandem repeat-containing MITEs in the clam Donax trunculus.

Authors:  Eva Satovic; Miroslav Plohl
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

6.  Satellite DNA as a driver of population divergence in the red flour beetle Tribolium castaneum.

Authors:  Isidoro Feliciello; Ivana Akrap; Josip Brajković; Ivo Zlatar; Đurđica Ugarković
Journal:  Genome Biol Evol       Date:  2014-12-19       Impact factor: 3.416

7.  Genome-wide analysis of tandem repeats in Tribolium castaneum genome reveals abundant and highly dynamic tandem repeat families with satellite DNA features in euchromatic chromosomal arms.

Authors:  Martina Pavlek; Yevgeniy Gelfand; Miroslav Plohl; Nevenka Meštrović
Journal:  DNA Res       Date:  2015-10-01       Impact factor: 4.458

8.  Adjacent sequences disclose potential for intra-genomic dispersal of satellite DNA repeats and suggest a complex network with transposable elements.

Authors:  Eva Satović; Tanja Vojvoda Zeljko; Andrea Luchetti; Barbara Mantovani; Miroslav Plohl
Journal:  BMC Genomics       Date:  2016-12-06       Impact factor: 3.969

9.  Satellitome Analysis of the Pacific Oyster Crassostrea gigas Reveals New Pattern of Satellite DNA Organization, Highly Scattered across the Genome.

Authors:  Monika Tunjić-Cvitanić; Juan J Pasantes; Daniel García-Souto; Tonči Cvitanić; Miroslav Plohl; Eva Šatović-Vukšić
Journal:  Int J Mol Sci       Date:  2021-06-24       Impact factor: 5.923

10.  Tandem repeats derived from centromeric retrotransposons.

Authors:  Anupma Sharma; Thomas K Wolfgruber; Gernot G Presting
Journal:  BMC Genomics       Date:  2013-03-04       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.