| Literature DB >> 30049970 |
Maria Waldl1, Bernhard C Thiel2, Roman Ochsenreiter3, Alexander Holzenleiter4,5, João Victor de Araujo Oliveira6, Maria Emília M T Walter7, Michael T Wolfinger8,9, Peter F Stadler10,11,12,13.
Abstract
The telomerase RNA in yeasts is large, usually >1000 nt, and contains functional elements that have been extensively studied experimentally in several disparate species. Nevertheless, they are very difficult to detect by homology-based methods and so far have escaped annotation in the majority of the genomes of Saccharomycotina. This is a consequence of sequences that evolve rapidly at nucleotide level, are subject to large variations in size, and are highly plastic with respect to their secondary structures. Here, we report on a survey that was aimed at closing this gap in RNA annotation. Despite considerable efforts and the combination of a variety of different methods, it was only partially successful. While 27 new telomerase RNAs were identified, we had to restrict our efforts to the subgroup Saccharomycetacea because even this narrow subgroup was diverse enough to require different search models for different phylogenetic subgroups. More distant branches of the Saccharomycotina remain without annotated telomerase RNA.Entities:
Keywords: homology search; non-coding RNA; secondary structure; synteny; telomerase RNA; yeast
Year: 2018 PMID: 30049970 PMCID: PMC6115765 DOI: 10.3390/genes9080372
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Schematic organization of telomerase RNA (TER). Contact regions for important binding sites are indicated by green circles (EST1, SM, and KU). The green ellipse denotes the contact region with the reverse transcriptase (TERT). Other major features are the template, the pseudoknot region, the template boundary element (TBE), the three-way junction (TWJ), and the Area of Required Connectivity (ARC). Adapted from [13].
Figure 2Features identified in TER sequences. (KU) ku binding hairpin, (T) template region, (EST1) Est1 binding site, (TWJ) three-way junction, and (SM1) SM1 binding site. Elements not shown are either not present in the corresponding species (e.g., the TWJ in Candida glabrata) or could not be located with reasonable certainty. Species marked by * are not part of the phylogenetic tree and were placed next to their closest related neighbor based on the similarity of their TER sequences.
Figure A1Phylogeny of the Saccharomycetales. Bootstrap support is 100% unless otherwise indicated. The Saccharomycetacea are indicted in dark blue. A red dot at tip of the tree indicates a TER sequences listed in Table 1.
Overview of conserved TER substructures in Saccharomycetacea, as identified by the combined synteny/covariance model pipeline. The 3 end is defined as 10 nt downstream of the SM binding site. The 5 end is approximate. Citations refer to publication in which the sequence and/or the coordinates of respective features are reported explicitly. These annotations form the basis of Figure 2.
| Species | Accession | Strand | TER Coordinates | Ku Binding Site | Template Region | Est1 Binding Site | TWJ | SM1 |
|---|---|---|---|---|---|---|---|---|
|
|
| neg | 16,338–17,322 [ | 16,940–16,966 | 16,794–16,862 [ | 16,378–16,485 [ | 16,350–16,359 | |
|
|
| pos | 250–1327 [ | 662–693 | 765–858 [ | 1183–1290 [ | 1307–1316 | |
|
|
| pos | 506,443–507,711 [ | 506,855–506,888 | 506,967–507,049 [ | 507,518–50,7671 [ | 507,691–507,700 | |
|
|
| pos | 461,805–463,090 [ | 462,224–462,257 | 462,337–462,499 [ | 462,905–463,051 [ | 463,070–463,079 | |
|
|
| pos | 611,456–612,727 [ | absent [ | 611,890–611,919 [ | 612,006–612,090 [ | 612,532–612,687 [ | 612,708–612,716 [ |
|
|
| neg | 269,038–270,368 | 269,938–269,968 | ||||
|
|
| pos | 54,147–54,960 | 54,451–54,480 | ||||
|
|
| neg | 677,871–679,048 [ | 678,276–678,305 [ | ||||
|
|
| neg | 693,543–694,708 | 693,942–693,973 | ||||
|
|
| pos | 348,600–349,844 | 348,876–348,930 | 348,957–348,982 [ | 349,129–349,208 | 349,825–349,833 | |
|
|
| pos | 854,162–855,236 | 854,389–854,444 | 854,754–854,820 | 855,217–855,225 | ||
|
|
| neg | 134,961–136,000 | 135,698–135,756 [ | 135,613–135,636 | 135,409–135,470 | 134,973–134,981 | |
|
|
| pos | 702,500–703,549 | 702,730–702,791 [ | 702,853–702,876 | 703,022–703,083 | 703,530–703,538 | |
|
|
| pos | 682,034–682,916 | 682,124–682,181 | 682,261–682,283 | 682,900–682,905 | ||
|
| neg | 441,802–442,700 | 442,582–442,638 | 442,229–442,292 | 441,811–441,820 | |||
|
|
| neg | 306,329–307,150 | 307,076–307,129 | 306,786–306,850 | 306,339–306,348 | ||
|
|
| pos | 575,851–576,676 | 575,886–575,941 | 576,233–576,294 | 576,657–576,666 | ||
|
|
| pos | 690,800–691,797 | 691,218–691,282 | 691,777–691,786 | |||
|
|
| pos | 388,401–389,382 | 388,567–388,624 | 388,937–389,004 | 389,362–389,371 | ||
|
|
| pos | 709,007–709,780 | 709,057–709,086 | 709,267–709,336 | 709,761–709,770 | ||
|
|
| neg | 426,211–427,050 | 427,000–427,028 | 426,726–426,817 | 426,221–426,229 | ||
|
|
| neg | 712,655–713,400 | 712,902–712,974 | 712,665–712,673 | |||
|
|
| pos | 297,087–297,883 | 297,527–297,616 | 297,865–297,873 | |||
|
|
| pos | 455,564–455,975 [ | 455,656–455,728 | 455,957–455,965 | |||
|
|
| neg | 404,150–405,050 | 405,003–405,033 | 404,650–404,733 | 404,165–404,173 | ||
|
|
| pos | 381,827–383,194 [ | 382,404–382,432 [ | 382,506–382,519 [ | 382,647–382,710 [ | 382,994–383,155 [ | 383,176–383,184 [ |
|
|
| neg | 1,519,837–1,521,377 | 1,520,648–1,520,678 | 1,520,550–1,520,562 | 1,520,303–1,520,369 | 1,519,864–1,520,027 | 1,519,849–1,519,857 |
|
|
| neg | 272,769–274,000 | 273,158–273,179 | 272,992–273,085 | 272,781–272,789 | ||
|
|
| pos | 1230–2215 | 2197–2204 | ||||
|
|
| neg | 419,194–421,150 [ | 421,007–421,081 [ | 420,914–420,932 [ | 420,657–420,852 [ | 419,206–419,214 [ | |
|
|
| pos | 2586–4361 | 2836–2854 | 4342–4350 | |||
|
|
| neg | 254,761–256,469 | 256,151–256,169 | 254,773–254,781 | |||
|
|
| pos | 87,530–89,215 | 87,780–87,798 | 89,196–89,204 | |||
|
|
| pos | 45,720–46,940 | 45,996–46,050 | 46,193–46,203 | 46,301–46,377 | 46,703–46,848 | 46,921–46,929 |
|
|
| pos | 476,134–47,7336 | 476,392–476,446 | 476,588–476,598 | 476,694–476,770 | 477,100–477,240 | 477,317–477,325 |
|
|
| pos | 287,410–288,645 | 287,705–287,739 | 287,888–287,898 | 288,019–288,096 | 288,417–288,558 | 288,626–288,634 |
|
|
| pos | 1–1215 [ | 284–320 [ | 424–434 | 585–662 | 981–1128 [ | 1201–1209 |
|
|
| neg | 18,591–19,809 [ | 19,497–19,532 [ | 19,349–19,356 | 19,156–19,232 | 18,687–18,833 [ | 18,603–18,611 |
|
|
| pos | 307,733–308,897 [ | 308,010–308,045 [ | 308,154–308,161 | 308,281–308,353 | 308,660–308,803 [ | 308,878–308,886 |
|
|
| pos | 307,597–308,757 [ | 307,880–307,914 [ | 308,057–308,064 [ | 308,185–308,256 [ | 308,563–308,682 [ | 308,737–308,746 [ |
|
|
| neg | 478,773–479,970 [ | 479,664–479,718 [ | 479,512–479,520 | 479,340–479,417 | 478,866–479,012 [ | 478,785–478,793 |
|
|
| pos | 284,183–285,344 | 284,465–284,501 | 284,645–284,655 | 284,772–284,843 | 285,150–285,269 | 285,325–285,333 |
|
|
| pos | 58,142–59,362 [ | 58,418–58,472 [ | 58,613–58,620 | 58,723–58,799 | 59,125–59,270 [ | 59,343–59,351 |
|
| pos | 287,536–288,696 | 287,818–287,854 | 287,998–288,008 | 288,124–288,195 | 288,502–288,621 | 288,677–288,685 | |
|
| neg | 473,800–474,997 | 474,691–474,745 | 474,537–474,547 | 474,368–474,444 | 473,894–474,038 | 473,812–473,820 | |
|
|
| pos | 1–1163 [ | 278–313 [ | 424–434 | 549–621 | 928–1072 [ | 1147–1155 |
Figure 3Alignment of the core SM-binding site motif: The common pattern of most Saccharomycetaceae (top); and species-specific variants (bottom).
Figure 4Summary of the BLAST-based survey of TER genes. Blue nodes show TERs described in the literature, orange nodes represent TERs that we identified, and grey nodes are additional candidates for which we could not validate characteristic features. TERs outside the Saccharomycetaceae group are presented in light colors. The length of the edges are weighted by the inverse of the length of the BLAST hit. Note that distances in drawing between nodes not connected by an edge are not indicative of their evolutionary distance.