| Literature DB >> 29202864 |
Boas Pucker1, Daniela Holtgräwe1, Bernd Weisshaar2.
Abstract
OBJECTIVE: The Arabidopsis thaliana Niederzenz-1 genome sequence was recently published with an ab initio gene prediction. In depth analysis of the predicted gene set revealed some errors involving genes with non-canonical splice sites in their introns. Since non-canonical splice sites are difficult to predict ab initio, we checked for options to improve the annotation by transferring annotation information from the recently released Columbia-0 reference genome sequence annotation Araport11.Entities:
Keywords: Araport11; Gene prediction hints; Genome annotation; Reciprocal best hit; Splicing
Mesh:
Substances:
Year: 2017 PMID: 29202864 PMCID: PMC5716242 DOI: 10.1186/s13104-017-2985-y
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
The oligonucleotides listed were applied in RT-PCRs to validate non-canonical splice sites selected candidate genes in Nd-1
| Name | Gene | Sequence | Length | Orientation | Recommended annealing temperature [°C] |
|---|---|---|---|---|---|
| S015 | At1g79350 ( | GCTTCCCTGGAGTGCTGATCG | 21 | Forward | 61 |
| S016 | At1g79350 ( | TCGGGTTCATCAATCGAGCATCC | 23 | Reverse | 61 |
| S017 | At1g79350 ( | AAGAACAGGTAGTTTCTCCTGCTCC | 25 | Reverse | 60 |
| S003 | At4g01800 ( | ACTGGTGAAGGGAAAACGCTTG | 22 | Forward | 59 |
| S004 | At4g01800 ( | AATGTATATCCCGCTCAAAGGCTG | 24 | Reverse | 59 |
| S005 | At4g01800 ( | TCTTCTGCTTTTCATCAACAGTGTAATG | 28 | Reverse | 58 |
| S018 | At4g27500 ( | AGCCGCAGAAGGAAGAAAAGC | 21 | Forward | 59 |
| S019 | At4g27500 ( | ACGCGATGAGACGAATTCCGAG | 22 | Forward | 61 |
| S020 | At4g27500 ( | CTCTTGGGATCGTTTCTGGTCC | 22 | Reverse | 59 |
Fig. 1Representative gene structure of missed non-canonical splice sites in ab initio gene prediction on the Nd-1 genome sequence. Gene structures of At1g79350.1 and the corresponding reciprocal best BLAST hit (RBH) of the ab initio gene prediction in Nd-1 (GeneSet_Nd-1_v1.0) are displayed. The non-canonical splice sites were missed leading to a difference at exon 20 (blue arrows). Despite this deviation, the structure of At1g79350 Nd−1 was predicted very well by AUGUSTUS [44, 45]
Fig. 2Representative gene structure of missed non-canonical splice sites in ab initio gene prediction in Nd-1. Gene structure of the At1g79350 RBH in the hint-based gene prediction (GeneSet_Nd-1_v1.1) on the Nd-1 genome sequence is displayed (a). The non-canonical splice sites were missed in the ab initio gene prediction leading to a skipping of exon 20 (highlighted in yellow) (b)