| Literature DB >> 25202477 |
Anna M Roschanski1, Bruno Fady2, Birgit Ziegenhagen1, Sascha Liepelt1.
Abstract
PREMISE OF THE STUDY: We present a protocol for the annotation of transcriptome sequence data and the identification of candidate genes therein using the example of the nonmodel conifer Abies alba. • METHODS ANDEntities:
Keywords: Abies alba; Pinaceae; adaptation; annotation; candidate genes; de novo sequencing
Year: 2013 PMID: 25202477 PMCID: PMC4105350 DOI: 10.3732/apps.1200179
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Statistics of the 454 transcriptome sequencing run and metrics of the Newbler assembly software.
| Size (nt) in quantiles | |||||||||
| Sequence type | Number | % | Nucleotides | Average (nt) | 0% | 25% | 50% | 75% | 100% |
| Reads trimmed | 1521698 | 100 | 546346058 | 359.0 | <21 | <303 | <395 | <444 | <1088 |
| Reads assembled | 381808 | 25.1 | |||||||
| Reads partial | 619615 | 40.7 | |||||||
| Reads singleton | 484576 | 31.8 | 175198711 | 361.6 | <50 | <307 | < 397 | < 443 | <876 |
| Reads repeat | 1617 | 0.1 | |||||||
| Reads outlier | 20389 | 1.3 | |||||||
| Reads too short | 13693 | 0.9 | |||||||
| Contigs | 25149 | 12511848 | 498 | <100 | <365 | <468 | <601 | <2394 | |
| N50 Contig | 704 | ||||||||
Half of all bases are assembled in contigs of this size or longer.
Fig. 1.Workflow of the annotation protocol. Numbers of the output after each step are given. The standard approach starts with 25149 contigs. The specific approach uses them as the reference database for the tBLASTn step.
Primers for resequencing of annotated gene fragments in Abies alba.
| Gene Locus ID | Primer sequences (5′–3′) | No. of introns | Intron length (nt) | Total length (nt) | Annotation approach | BLASTn of gene sequences against nr nucleotide database ( | GO-ID biological process | |
| 95 | F: ACAGAAACTAAAGCTAGTGTCG | 57 | 0 | — | 696 | 1 | reductive pentose-phosphate cycle, photorespiration, oxidation reduction | |
| R: CCTTAATTTCACCCGTCTCAG | ||||||||
| 215 | F: CCAAGGACTCTGATCGAATCC | 56 | 2 | 411 | 1486 | 2 | response to UV, response to wounding, phenylpropanoid metabolic process, response to fungus | |
| R: GAAGCCAGCATTCAAAGACTC | ||||||||
| 241 | F: AACGTCCGTTAATACTTCGG | 56 | 3 | 256 | 1370 | 2 | glycolysis | |
| R: AGTAAGTGTAGCCCTTCACG | ||||||||
| 323 | F: AAGCAAGCTTCTGAAATTCC | 53 | 2 | 278 | 804 | 1 | auxin biosynthetic process, ATP biosynthetic process, proton transport | |
| R: TGGTAGAGTCTACCAAATGAG | ||||||||
| 1362 | F: GAAGAGGTAGCTGCATTGGT | 59 | 0 | — | 871 | 1 | response to hypoxia, sucrose biosynthetic process, nuclear mRNA splicing, via spliceosome | |
| R: GGGCTTATACCGTAAATATACCCA | ||||||||
| 1704 | F: CAACTACTTCAGAGACAGAC | 52 | 2 | 327 | 858 | 2 | ||
| R: AAAGATTCCCTCCAAATCAG | ||||||||
| 2387 | F: TAAATGGCTCAATTCCTCCTACTG | 61 | 1 | 128 | 624 | 1 | ||
| R: GTTCCAAGCTTCCACAATACTC | ||||||||
| 2565 | F: GTGTCTGGAAGGGAATACAAGG | 58 | 0 | — | 432 | 1 | PREDICTED: | embryonic development ending in seed dormancy, one-carbon metabolic process, posttranscriptional gene silencing, methylation-dependent chromatin silencing |
| R: CCTTGACTCCTTCATGGATCAG | ||||||||
| 2774 | F: GTTACAGGAAGCCTTTCTGG | 55 | 0 | — | 502 | 2 | cell wall modification | |
| R: GCGGGATGAATTATCTTGTC | ||||||||
| 2937 | F: TGAGCTGATTGCTAATGCGG | 58 | 0 | — | 622 | 2 | glycolysis | |
| R: GGACATGGTGGTCATTGAGG | ||||||||
| 2986 | F: CTGTCTGTGACGGATCTAGC | 57 | 0 | — | 355 | 1 | ||
| R: TGAGGATGGCTTACAACACG | ||||||||
| 3421 | F: CTCATCTCTGCCAGAAAGAC | 55 | 0 | — | 324 | 2 | phenylpropanoid metabolic process, biosynthetic process, | |
| R: GTAGAGCTTCATCTACGAGG | ||||||||
| 3593 | F: AGGACCTGAAATACCTTGCT | 56 | 0 | — | 337 | 2 | transport, respiratory electron transport chain, photosynthesis | |
| R: TCCGTGTTTATCTCACAGGT | ||||||||
| 3689 | F: CGATTGCATCTCTGTACGCC | 58 | 0 | — | 619 | 2 | thiamin biosynthetic process | |
| R: GCTCTTGAGCCTCTTGACAC | ||||||||
| 3918 | F: TTCCAAGGTCTTCTCAAGGT | 55 | 0 | — | 400 | 2 | cellulose biosynthetic process, cellular cell wall organization, secondary cell wall biogenesis, rhamnogalacturonan I side chain metabolic process | |
| R: TGAAGAGTAGGAGTTTCGGT | ||||||||
| 3942 | F: GTATGATACCGATGTGACGA | 55 | 0 | — | 273 | 2 | ubiquitin-dependent protein catabolic process | |
| R: TTTGTAATGGATGCACTCGG | ||||||||
| 3981 | F: GGAGAAGTCTACAGTTCCAG | 54 | 0 | — | 918 | 1 | oxidation reduction | |
| R: ATAGTCCAGTGTCTTGAACTC | ||||||||
| 4103 | F: ATGGCCACCTTACTAAGAAGC | 57 | 0 | — | 841 | 1 | auxin biosynthetic process, one-carbon metabolic process | |
| R: CCACTTAAGGACCTTTACAGTCTC | ||||||||
| 4492 | F: TGGGTGCAACTGAAGATAGAG | 57 | 0 | — | 698 | 1 | auxin biosynthetic process, chlorophyll biosynthetic process, photosynthesis | |
| R: TTTCTACAACTAGCAAGCCTGAG | ||||||||
| 4921 | F: GAAGGTCGGCTATATCAGGT | 56 | 0 | — | 664 | 2 | PREDICTED: | response to cadmium ion, ubiquitin-dependent protein catabolic process |
| R: AGCTTAGACAGAGACTCAGG | ||||||||
| 5004 | F: CAGATGTGAGCCATTACTTTGAC | 57 | 0 | — | 461 | 1 | chlorophyll biosynthetic process | |
| R: CAACCTCTGAATATAGCTGCCT | ||||||||
| 5823 | F: TGCTTGATATACGTCCTGGG | 57 | 0 | — | 293 | 2 | regulation of transcription, photomorphogenesis, tryptophan biosynthetic process | |
| R: CTAGACAGTGTTGCTCCACG | ||||||||
| 5945 | F: CTGTCACTCAGATCTTCAGC | 55 | 0 | — | 339 | 2 | photosynthesis, light harvesting, protein-chromophore linkage | |
| R: AGATGATCAGCGAGATTCTC | ||||||||
| 6119 | F: AGAGGATGTTGGGCATTATGG | 57 | 0 | — | 567 | 1 | pollen tube development, oxidation reduction | |
| R: CATCACATGGTATCTCATCCGA | ||||||||
| 6594 | F: TGGCTTTATCTTGGAGACTTCAC | 58 | 1 | 348 | 712 | 1 | phosphoinositide biosynthetic process, phosphoinositide phosphorylation, signal transduction, phosphoinositide-mediated signaling | |
| R: GAATAAGGTCATAGCCTGCCG | ||||||||
| 6757 | F: TATCATGCCCTGAAAGCGTC | 58 | 5 | 177 | 939 | 1 | ||
| R: ACTTCCACAAGCAAGACACTC | ||||||||
| 7098 | F: CTTTACTGTTGGAGGTAGATCAG | 55 | 0 | — | 782 | 1 | dTDP-rhamnose biosynthetic process, | |
| R: GTTTGTTTGTCTTTGTACTCCC | ||||||||
| 7208 | F: GTTACATTCGTAAGTAGCTTGG | 54 | 0 | — | 326 | 1 | transport, ATP synthesis coupled electron transport | |
| R: AAATGGTCGAGAAGTCTACTG | ||||||||
| 7324 | F: ATTGGAGATGGAGCCATGAC | 57 | 0 | — | 471 | 1 | terpenoid biosynthetic process, thiamin biosynthetic process | |
| R: TCTCTGCATATGGGTAACCC | ||||||||
| 8248 | F: CAAGTATTCCGAAAGGCAGC | 57 | 1 | 601 | 1128 | 2 | response to water deprivation, water transport, transmembrane transport, response to salt stress | |
| R: ACAAAGGTGCCCACAATCTC | ||||||||
| 8583 | F: TCTCCTACATTGACGATCCC | 56 | 0 | — | 393 | 2 | phenylpropanoid metabolic process, biosynthetic process, | |
| R: CCATCCAAGCACTTGAAGAG | ||||||||
| 8855 | F: TATTTGCTGGTCGGGATTCG | 58 | 2 | 275 | 926 | 2 | photosynthesis, light harvesting | |
| R: CTGCACTAGGTTCTCGAACG | ||||||||
| 9366 | F: AGTGAAAGCAACAACTTAGG | 53 | 0 | — | 598 | 1 | cell redox homeostasis, oxidation reduction | |
| R: TCTGGCTTCATTGATTTGTC | ||||||||
| 9512 | F: GTACTGGAGTAGCTGCACGA | 59 | 1 | 99 | 415 | 2 | regulation of transcription, DNA-dependent | |
| R: TACAAAGTGCTGCACAGCAG | ||||||||
| 9652 | F: TGCAAAGAAAGTCAAGGCGA | 58 | 2 | 418 | 913 | 2 | ||
| R: CCCATACGGTGTTAATGGCT | ||||||||
| 11301 | F: GATGTTGTTCGTGCAAAGAC | 54 | 0 | — | 490 | 2 | malate metabolic process, oxidation reduction, tricarboxylic acid cycle, glycolysis | |
| R: GCGAACTTAATTCCCTTCTC | ||||||||
| 13329 | F: GATATGTGCCCAAGAACATTCTG | 57 | 0 | — | 350 | 1 | PREDICTED: | |
| R: CCTTGCATGCTTCAAGAAGG | ||||||||
| 13536 | F: CTGCTGATTCTGATCAGTCC | 56 | 0 | — | 368 | 2 | regulation of transcription, DNA-dependent | |
| R: TCCACAATGCAAACATAGGC | ||||||||
| 14455 | F: GAACAAGATCGACTACTGCC | 56 | 0 | — | 834 | 2 | root hair elongation, xyloglucan biosynthetic process | |
| R: TTTGATGGCCTTGAAAGCAG | ||||||||
| 14479 | F: CCACTCCCAAGTACTCAAAGG | 57 | 0 | — | 588 | 1 | translational elongation | |
| R: CAAGTGTGGCAATCCAACAC | ||||||||
| 14514 | F: GGGTTCTGATTCTCCAAAGG | 56 | 0 | — | 322 | 2 | pentose-phosphate shunt, response to salt stress, glycolysis, response to cadmium ion | |
| R: CTGCATACTTGGCCAAAGTG | ||||||||
| 14585 | F: TCTTGAATTCTTCCTATGTCCCAG | 57 | 1 | 193 | 915 | 1 | PREDICTED: | homogalacturonan biosynthetic process |
| R: AATTGCACATCTGCACAAACTC | ||||||||
| 14887 | F: GGTTAGACCAGTTCATAACC | 53 | 0 | — | 1156 | 2 | PREDICTED: | |
| R: GTCTTCAAACTCTGACAAGG | ||||||||
| 15135 | F: TTGCAGGACTTCTTTAATGG | 53 | 0 | — | 657 | 2 | oxidation reduction, response to stress, auxin biosynthetic process | |
| R: TCTTCTTGTCAGATGGATCC | ||||||||
| 15337 | F: TTTATTGTATTCCTCCTAGGCCAG | 57 | 1 | 232 | 1086 | 1 | cellulose biosynthetic process, cellular cell wall organization | |
| R: CACAATCTAAGCCACATTCTTCC | ||||||||
| 15484 | F: TTCGACGCCAACGTTATCTG | 58 | 0 | — | 663 | 2 | phenylpropanoid metabolic process, biosynthetic process, | |
| R: GGCCCAGAGAATTGACATCC | ||||||||
| 15727 | F: CACTGAAGGTTGTGGACGAG | 58 | 0 | — | 325 | 2 | ||
| R: GTTCAGAAGGGCTGTGTAGG | ||||||||
| 15811 | F: TTCGAGATCATCTGGACTGC | 57 | 0 | — | 438 | 2 | biosynthetic process | |
| R: CGACTGTTTCGACAGTGAGG | ||||||||
| 15969 | F: GGAAACCTTCTTGTTCACATCTG | 57 | 0 | — | 990 | 1 | auxin biosynthetic process, one-carbon metabolic process | |
| R: CTTGTCTGGAATCCTCCCTG | ||||||||
| 16727 | F: GGTGACTGTGAAGGCAATGG | 58 | 0 | — | 331 | 2 | lipid transport | |
| R: TCCCACATTTCTTTCCAGCT | ||||||||
| 16816 | F: CATCTTGGCTTCGTGATTGTC | 57 | 3 | 132 | 562 | 2 | ||
| R: TGCAATTTGGCGTAATCGAC | ||||||||
| 16883 | F: CTCACAGAGGTCAGAAAGAATGG | 58 | 0 | — | 710 | 1 | auxin biosynthetic process, one-carbon metabolic process | |
| R: CTGCTTCAAAGGTTTGACAATCTC | ||||||||
| 16979 | F: CCTGGATAGTGAAATTGGAGG | 55 | 0 | — | 535 | 1 | auxin biosynthetic process | |
| R: ATCCTTCTCTGAATGAGTTTCG | ||||||||
| 17340 | F: CTTGGTTAATTTCCGTCCTG | 54 | 0 | — | 281 | 2 | transport, photosynthesis, electron transport chain | |
| R: CAGCTCCTACATTTAAACCC | ||||||||
| 17637 | F: TGCTGAGAAAGTTGATTCTTCC | 56 | 0 | — | 424 | 1 | ||
| R: GTATTCGAGGTGTAGATTGCTG | ||||||||
| 17975 | F: CAAACATTGCTGCAAAGCTC | 56 | 2 | 94 | 547 | 1 | cysteine biosynthetic process from serine | |
| R: CCTATTCCAGCAACCAATATGTC | ||||||||
| 18135 | F: GAGACTTTGGATTCGATCCC | 55 | 1 | 132 | 683 | 2 | photosynthesis, light harvesting in photosystem I | |
| R: AGAAGGCCGCAAATATAGTG | ||||||||
| 18444 | F: ATTAATCTTTGCAGGGAAGC | 54 | 0 | — | 313 | 2 | ||
| R: AGACGAGATGAAGTGTAGAC | ||||||||
| 18599 | F: GGAATGCATGATCCATTTCTG | 55 | 0 | — | 678 | 1 | oxidation reduction | |
| R: TACCTGAATTGTTCTTGCGA | ||||||||
| 18680 | F: CTGCGATGGATAAACTACCT | 55 | 1 | 214 | 465 | 2 | regulation of transcription | |
| R: GCTAGTGTTGCTATTGTGGG | ||||||||
| 19005 | F: GGAGATTGAGCAACGAAGAG | 56 | 0 | — | 368 | 1 | auxin biosynthetic process | |
| R: TTTGAATCCCTGAAATCCTGG | ||||||||
| 19173 | F: AGAACCAATCCCTGTTACAC | 55 | 0 | — | 343 | 2 | defense response to bacterium, ubiquitin-dependent protein catabolic process, response to zinc ion | |
| R: GATCAGTTCCAATCACACCT | ||||||||
| 19540 | F: ACCAATTCTCTTGTTCTCGG | 55 | 0 | — | 634 | 1 | plasma membrane ATP synthesis coupled proton transport, auxin biosynthetic process | |
| R: CGAACCATGTAAAGATCATTCC | ||||||||
| 20156 | F: ATGGATCCCTGGAATTTATGC | 55 | 0 | — | 386 | 1 | RNA processing, chlorophyll biosynthetic process | |
| R: ATACTCTACCTACTACAGAATCCC | ||||||||
| 20318 | F: ACAGCTCCCATTAATCTGAC | 55 | 0 | — | 356 | 1 | PREDICTED: | root hair elongation, cellulose biosynthetic process, response to cold, cellular cell wall organization, plant-type cell wall biogenesis |
| R: CCAGAATTGTTCATTTCTCCAC | ||||||||
| 20694 | F: GTCGAACAATGAAGACGAGG | 56 | 0 | — | 346 | 1 | PREDICTED: | cell wall modification, regulation of transcription |
| R: TGTGAGCGAAGAAACAAACC | ||||||||
| 21136 | F: AGACTGGTGTTACATTTGCGT | 57 | 1 | 229 | 535 | 1 | oxidation reduction, chlorophyll biosynthetic process, photosynthesis | |
| R: CCAACAAGCTTCTCACTAATTTCC | ||||||||
| 21165 | F: ATGCACGATGTTCTTGATGC | 57 | 2 | 204 | 644 | 1 | PREDICTED: | response to hypoxia, sucrose biosynthetic process |
| R: GGTGTCATGTTTATATGACAGTGG | ||||||||
| 21173 | F: ACATTGTTGCTAACGATCCG | 56 | 0 | — | 333 | 2 | cell wall macromolecule catabolic process, chitin catabolic process | |
| R: AGACGAGGTAGAGATTGAGC | ||||||||
| 21890 | F: GAAAGCTTACAGGGAAGCAG | 55 | 1 | 358 | 607 | 2 | RNA processing | |
| R: ACGATATCCAAGCATCATCC | ||||||||
| 21957 | F: AACAACTTCACAGTTTCTCC | 54 | 0 | — | 292 | 2 | auxin biosynthetic process, chlorophyll biosynthetic process, oxidation reduction, photosynthesis, dark reaction | |
| R: GGAATCGGTAAATCAACGAC | ||||||||
| 22174 | F: GATGATCCGGTTCGAATACC | 55 | 0 | — | 334 | 1 | regulation of apoptosis, transcription, DNA-dependent | |
| R: AAACGTAAGATACAAGTGGGTG | ||||||||
| 23660 | F: AGGAAGATGTTAGGCTCGGG | 58 | 1 | 781 | 1232 | 2 | response to water deprivation, water transport, transmembrane transport | |
| R: GAAGCCCTTCACAACTCCAG | ||||||||
| 23809 | F: ATGCGCTCTATGTTAGAACG | 55 | 0 | — | 1058 | 2 | oxidation reduction, chlorophyll biosynthetic process, photosynthesis, dark reaction | |
| R: AATCTCAAGACGTTTACCGA | ||||||||
| 23850 | F: GAAGATTTATTCGGCAACTG | 52 | 1 | 449 | 695 | 2 | auxin biosynthetic process, protein amino acid phosphorylation, conjugation, mitosis, cell division | |
| R: ATCTGATCCTCTGTTAAGGT | ||||||||
| 23982 | F: TGAGACTTGCTTGGGAAGAG | 57 | 2 | 586 | 921 | 1 | metabolic process | |
| R: AGCCCATTGTAAACGAAGGA | ||||||||
| 24523 | F: TTCAGACTCGAACGTTTGCA | 58 | 0 | — | 449 | 2 | hydrogen peroxide catabolic process, oxidation reduction | |
| R: AAGCTTTCATTCCCAGACGG | ||||||||
| 24699 | F: AAGATAAGCAGTTTGCTGCA | 56 | 0 | — | 262 | 2 | auxin biosynthetic process, response to stress | |
| R: AACATTCTTCTCGCCAACAG | ||||||||
| 24902 | F: CCCTCTCAATCTTGAGGATGC | 58 | 1 | 240 | 662 | 1 | electron transport chain | |
| R: CAGATGGACCTGTAATTTGAACCT | ||||||||
| 25060 | F: CTGCAAGATACTTCAAAGATGCAC | 58 | 2 | 163 | 624 | 1 | PREDICTED: | acetyl-CoA biosynthetic process, cellular carbohydrate metabolic process |
| R: ATTTGGTGTAGAGAACATCTTCCC | ||||||||
| 26089 | F: GATTATTGATTCTACCACCGGA | 55 | 1 | 281 | 1233 | 1 | translational elongation | |
| R: TTTCTCAACAGCCTTGATGAC | ||||||||
| 26764 | F: GGGAATTGGCTCGTATCTGG | 58 | 0 | — | 359 | 1 | response to glucose stimulus, response to sucrose stimulus, response to fructose stimulus, response to salt stress, pentose-phosphate shunt, oxidation reduction, response to cadmium ion | |
| R: GTTCTGCTTAGCAATCTTTGTCC | ||||||||
| 27033 | F: TTTACTCCACCATTACGAGG | 55 | 0 | — | 948 | 2 | response to virus, auxin biosynthetic process, protein folding, response to heat, response to bacterium, response to cadmium ion, response to high light intensity, response to hydrogen peroxide, protein amino acid phosphorylation | |
| R: TTCGCAATGATAGGATTGCA | ||||||||
| 27963 | F: TAGGCCCATAGCTAACAAACC | 57 | 0 | — | 318 | 1 | transcription, DNA-dependent | |
| R: TCGAATTGTTTCATCCTCCCA | ||||||||
| 28203 | F: TGTGGACGAGGAGATATTCG | 56 | 0 | — | 315 | 2 | ||
| R: TTCAGAAAGGGCTGTGTAGG | ||||||||
| 28456 | F: GATTTCGAGAGCTGGTATCCC | 58 | 0 | — | 853 | 1 | protein amino acid glycosylation | |
| R: AGCTGTCGGTTGATGTTCTG | ||||||||
| 28639 | F: GTAGAATAAGTGGGAGCCGT | 57 | 0 | — | 438 | 2 | ||
| R: ATAGGAAGAGCCGACATCGA | ||||||||
| 29437 | F: CTTCAGGTGCTCGATATCGT | 56 | 0 | — | 403 | 2 | ||
| R: TCAACTGGAAACGTTAGCTC |
Note: — = not available; T = annealing temperature.
Values are based on the sequence of one sample randomly chosen from a sample set of 80 trees from a population at Mont Ventoux (France).