| Literature DB >> 19580677 |
Julio C Vega-Arreguín1, Enrique Ibarra-Laclette, Beatriz Jiménez-Moraila, Octavio Martínez, Jean Philippe Vielle-Calzada, Luis Herrera-Estrella, Alfredo Herrera-Estrella.
Abstract
BACKGROUND: In-depth sequencing analysis has not been able to determine the overall complexity of transcriptional activity of a plant organ or tissue sample. In some cases, deep parallel sequencing of Expressed Sequence Tags (ESTs), although not yet optimized for the sequencing of cDNAs, has represented an efficient procedure for validating gene prediction and estimating overall gene coverage. This approach could be very valuable for complex plant genomes. In addition, little emphasis has been given to efforts aiming at an estimation of the overall transcriptional universe found in a multicellular organism at a specific developmental stage.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19580677 PMCID: PMC2714558 DOI: 10.1186/1471-2164-10-299
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Statistics of the high quality reads from four GS20–454 sequencing runs of the Palomero cDNA library and coverage of the maize unigenes from the NCBI (UniGene) and TIGR (ZMGI) databases after each sequencing run.
| 1 | 367391 | 100.94 | 1 | 36.42 | 0 | 37.89 | 0 |
| 2 | 367699 | 100.73 | + 2 | 43.58 | 7.16 | 44.12 | 6.23 |
| 3 | 394851 | 100.62 | + 3 | 47.83 | 4.25 | 47.74 | 3.62 |
| 4 | 387937 | 100.50 | + 4 | 50.42 | 2.59 | 50.00 | 2.26 |
| Total | 1517878 | 100.38 | |||||
Figure 1Percentage of matching Unigenes from NCBI (purple) and TIGR (red) after four GS20–454 sequencing runs. A comparison by BLAST (e-value < 9e-07) was performed with the Palomero 454 sequences against the NCBI (UniGene) and TIGR (ZMGI) databases.
Comparison of the number of the NCBI maize ESTs and Palomero GS20–454 ESTs aligning by BLAST with the MAGIs.
| Num. seq. | 903624 | 1517878 |
| % Hits | 98.32 | 89.52 |
| % No hits | 1.68 | 10.48 |
| Num. MAGIs matched | 74403 | 77045 |
| Distinct MAGIs | 32110 | 34752 |
Number of unique MAGIs with a match to the Palomero GS20–454 ESTs that did not have prior expression evidence in the NCBI maize ESTs.
| 1 | 47278 | 31380 | 15898 |
| + 2 | 60807 | 37034 | 23773 |
| + 3 | 70348 | 40296 | 30052 |
| + 4 | 77045 | 42293 | 34752 |
Percentage of the Palomero GS20–454 sequences from four sequencing runs that matched by BLAST to maize databases and those that did not align (e-value < 9e-07) to any maize database.
| NCBI_ESTs | 87.50 | |
| MAGI.4 | 89.52 | 5.67 |
| Zm_organelle | 1.75 | |
| MAGI_repeats | 2.70 |
Figure 2Histogram of the length (in nucleotides) distribution of the 86,069 .
Representation of emblematic maize genes in the Palomero GS20–454 cDNA library.
| floricaula/leafy-like2 | fl2 | 2921 | partial CDS | 1 | --- | --- | --- |
| barren stalk1 | ba1 | 4570 | complete CDS | 31 | 15 | 3 | --- |
| rough sheath2 | rs2 | 1420 | complete CDS | 7 | 6 | 2 | 1 |
| alcohol dehydrogenase1 | adh1 | 1389 | partial CDS | 59 | 59 | 56 | 34 |
| alcohol dehydrogenase2 | adh2-n | 3535 | complete CDS | 55 | 48 | 46 | 28 |
| knotted1-like homeodomain protein liguleless4b | lg4b | 1123 | partial CDS | 6 | 4 | 4 | 3 |
| knotted1-like homeodomain protein liguleless3 | lg3 | 1537 | complete CDS | --- | --- | --- | --- |
| indeterminate gametophyte1 | Ig1 | 5431 | complete CDS | 31 | 13 | 9 | 5 |
| ramosa2 | ra2 | 6032 | complete CDS | 111 | 45 | 22 | 4 |
| ramosa1 | ra1 | 1519 | complete CDS | --- | --- | --- | --- |
| teosinte glume architecture1 | tga1 | 3219 | complete CDS | 13 | 9 | --- | --- |
| P gene; transposon | 7753 | complete CDS | 349 | 94 | 23 | 2 | |
| knotted-1 | kn-1 | 1627 | complete CDS | 91 | 37 | 14 | 10 |
| teosinte branched1 | tb1 | 3188 | partial CDS | --- | --- | --- | --- |
| indeterminate growth1 | id1 | 1625 | complete CDS | 4 | 2 | 2 | 2 |
| c1 locus myb homologue | c1 | 4059 | complete CDS | 85 | 41 | 5 | --- |
| Bronze2 | bz2 | 2948 | complete CDS | 19 | 15 | 7 | --- |
| fertilization independent endosperm2 | fie2 | 1171 | complete CDS | 17 | 15 | 15 | 10 |
aA criterion of 90 to 100% identity was used to consider a hit valid
Comparison of transcript abundance of representative maize genes estimated by GS20–454 sequencing and qPCR.
| Similar to UP|LIRP1_ORYSA (Q03200) Light-regulated protein precursor | TC327885 | 2.11 | 11.7808 ± 0.09 |
| Similar to UP|PSAE_HORVU (P13194) Photosystem I reaction center subunit IV, chloroplast precursor (PSI-E) | TC361477 | 0.74 | 13.1666 ± 0.05 |
| UP|Q41754_MAIZE (Q41754) Ubiquitin | TC369342 | 0.49 | 14.6629 ± 0.10 |
| UP|TBA6_MAIZE (P33627) Tubulin alpha-6 chain (Alpha-6 tubulin) | TC326717 | 0.32 | 14.0284 ± 0.03 |
| Homologue to GB|BAD33626.1|50726105|AP005579 Polyubiquitin 2 | TC342043 | 0.32 | 16.3357 ± 0.04 |
| Homologue to UP|Q41772_MAIZE (Q41772) Cytosolic ascorbate peroxidase, complete | TC364641 | 0.22 | 16.5425 ± 0.08 |
| Similar to UP|SODC2_MESCR (O49044) Superoxide dismutase [Cu-Zn]2 | TC327155 | 0.08 | 17.9920 ± 0.08 |
| Similar to UP|Q6YIH2_ORYSA (Q6YIH2) OsCDPK protein | TC327230 | 0.06 | 18.5091 ± 0.13 |