| Literature DB >> 20157478 |
Junjian Ni1, Anuradha Pujar, Ken Youens-Clark, Immanuel Yap, Pankaj Jaiswal, Isaak Tecle, Chih-Wei Tung, Liya Ren, William Spooner, Xuehong Wei, Shuly Avraham, Doreen Ware, Lincoln Stein, Susan McCouch.
Abstract
Gramene is a comparative information resource for plants that integrates data across diverse data domains. In this article, we describe the development of a quantitative trait loci (QTL) database and illustrate how it can be used to facilitate both the forward and reverse genetics research. The QTL database contains the largest online collection of rice QTL data in the world. Using flanking markers as anchors, QTLs originally reported on individual genetic maps have been systematically aligned to the rice sequence where they can be searched as standard genomic features. Researchers can determine whether a QTL co-localizes with other QTLs detected in independent experiments and can combine data from multiple studies to improve the resolution of a QTL position. Candidate genes falling within a QTL interval can be identified and their relationship to particular phenotypes can be inferred based on functional annotations provided by ontology terms. Mutations identified in functional genomics populations and association mapping panels can be aligned with QTL regions to facilitate fine mapping and validation of gene-phenotype associations. By assembling and integrating diverse types of data and information across species and levels of biological complexity, the QTL database enhances the potential to understand and utilize QTL information in biological research.Entities:
Year: 2009 PMID: 20157478 PMCID: PMC2790302 DOI: 10.1093/database/bap005
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Number of published QTL papers for rice, wheat, maize, Arabidopsis and tomato between 1987 and 2007. Graph showing the steady increase in publications reporting QTLs in five major plant species between 1987 and 2007 based on nonredundant data from four publicly available literature databases, PubMed, Agricola, CAB Abstracts and BIOSIS Previews.
Summary of QTL and associated features in the Gramene database for 10 cereal species
| Updated based on build 28. | Rice | Maize | Wheat | Tetraploid wheat | Oat | Barley | Sorghum | Pearl millet | Foxtail millet | Wild rice | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| QTLs | 8646 | 1747 | 23 | 8 | 375 | 299 | 136 | 284 | 65 | 41 | 11624 |
| TO: traits | 237 | 77 | 10 | 3 | 7 | 30 | 19 | 27 | 2 | 10 | 332 |
| TO: trait categories | 9 | 8 | 3 | 2 | 5 | 7 | 5 | 6 | 1 | 5 | 9 |
| PO: structure | 38 | 19 | 5 | 2 | 6 | 8 | 7 | 15 | 2 | 5 | 48 |
| PO: growth-stage | 19 | 9 | 6 | 2 | 5 | 5 | 7 | 7 | 2 | 5 | 20 |
| Map sets | 89 | 8 | 9 | 2 | 1 | 8 | 2 | 5 | 1 | 1 | 126 |
| Parental germplasm | 91 | 4 | 4 | 2 | 2 | 7 | 3 | 12 | 2 | 2 | 108 |
| Co-localized markers | 30950 | 3615 | 73 | 14 | 888 | 335 | 334 | 1031 | 87 | 42 | 37369 |
| Neighboring markers | 16422 | 3120 | 37 | 14 | 561 | 258 | 558 | 535 | 122 | 74 | 21671 |
| Curated papers | 246 | 56 | 11 | 2 | 1 | 9 | 2 | 6 | 1 | 1 | 335 |
aTO: traits—the number of unique phenotypic traits defined by TO terms that have been used to annotate QTLs.
bTO: trait categories—the nine categories of traits named in the TO; this higher order node in the TO serves to cluster related traits.
cPO: structure—the number of anatomy terms used to describe QTLs (total number of unique terms = 48).
dPO: growth-stage—the number of growth-stage terms used to describe QTLs (total number of unique terms = 20).
eMap Sets—the number of unique mapping population marker datasets used in QTL studies.
fParental germplasm—the number of different strains or accessions used as parents in QTL mapping studies.
gCo-localized markers—markers that map within QTL intervals; >37 000 markers have been curated and used to anchor QTLs to the sequence map of rice.
hNeighboring markers—markers flanking QTL intervals; >21 000 neighboring markers have been curated and are used to construct comparative maps.
Distribution of rice QTLs
| Chr. | Chr. length (Mb) | No. of QTLs | QTLs with genome position | QTLs associated with Trait category term, Vigor | QTLs associated with plant height |
|---|---|---|---|---|---|
| 1 | 43.6 | 1274 | 1026 | 255 | 157 |
| 2 | 35.9 | 843 | 590 | 96 | 44 |
| 3 | 36.3 | 1069 | 755 | 181 | 102 |
| 4 | 35.2 | 769 | 500 | 113 | 60 |
| 5 | 29.9 | 667 | 512 | 99 | 53 |
| 6 | 31.2 | 826 | 632 | 84 | 30 |
| 7 | 29.7 | 653 | 448 | 82 | 28 |
| 8 | 28.3 | 646 | 453 | 78 | 31 |
| 9 | 23.0 | 519 | 392 | 78 | 46 |
| 10 | 22.9 | 397 | 312 | 48 | 31 |
| 11 | 28.5 | 546 | 422 | 92 | 40 |
| 12 | 27.5 | 437 | 251 | 49 | 16 |
| Total | 372.0 | 8646 | 6293 | 1255 | 638 |
aThe number of QTLs that have positions on the sequenced rice genome is less than the total number of QTLs curated due to lack of requisite information in some publications (i.e. for AFLP, RAPD, etc.).
bTO trait category term ‘Vigor’ carries annotations for traits related to rate of seedling emergence, biomass accumulation, etc.
Sample of information associated with seven QTL map sets from five species in Gramene
| Map set name | Species | QTL | No. of markers on map | Co-localized markers | Neighboring markers | Parental germplasm | No. of associated QTL studies |
|---|---|---|---|---|---|---|---|
| 1. Cornell IR64/Azu DH QTL 2001 | Rice | 275 | 588 | 287 | 203 | IR64, Azucena | 9 |
| 2. IRRI IR64/Azu DH QTL 2003 | Rice | 1372 | 281 | 2529 | 2636 | IR64, Azucena | 23 |
| 3. JRGP Nip/Kas F2 QTL 2000 | Rice | 351 | 3263 | 12732 | 1185 | Nipponbare, Kasalath | 33 |
| 4. UWM B73/Mo17 RFLP SSR QTL 1996 | Maize | 66 | 186 | 125 | 120 | B73, Mo17 | 3 |
| 5. Synthetic/Opata RI RFLP/SSR QTL 1995 | Wheat | 1 | 943 | 1 | 1 | Synthetic | 4 |
| 6. Steptoe/Morex DH RFLP QTL 2006 | Barley | 33 | 312 | 1 | 66 | Steptoe, Morex | 1 |
| 7. Cornell Kan/Ogl QTL 1995 | Oat | 375 | 249 | 888 | 528 | Kanota, Ogle | 1 |
a‘Synthetic’ means that the female parent is Altar84/A3. squarrose (219) IGM86.940.
Figure 2.Generalized model for candidate gene discovery. The diagram shows a simplified view of the information modules in the Gramene database. The figure has been designed to show the path of forward genetic dissection of complex traits above the dotted line and reverse genetic investigation below the heavy dotted line. In reality, as indicated by the arrows, the database can be accessed from any point of entry and users can navigate to the required information. Gramene provides automated analytical tools shown to the left of the pale grey box, and manual curation using ontologies and controlled vocabularies shown to the right of the dotted box, to enhance the data mining potential of the database. The association between the QTL and EST modules contained within the pale grey box highlights the fact that a user can systematically zero in on information about the relationship between a complex phenotype and a candidate gene/s of interest. As depicted here, a trait or complex phenotype is associated through maps and markers to a genomic region identified as a QTL and to ESTs and gene models previously annotated to the genome region; the QTL database thus performs the crucial function of connecting molecular or genotypic information with complex phenotypes, and more generally connecting the field of molecular biology with that of quantitative genetics and plant breeding.
Figure 3.Iron-uptake pathway in rice: cloned gene to complex traits, via pathway, genes and QTLs. Leaf senescence QTLs aligned to CMap view of rice chromosome 3 shown in the top left corner; Genome Browser view of QTL region of rice chromosome 3 with associated gene models shown in the center; DMA metabolic pathway shown in the top right; rice plants expressing iron chlorosis tolerant phenotype and iron toxicity phenotype shown at the bottom. The Mugenic acids (MAs) or phytosiderophores are end products of the DMA metabolic pathway. Biosynthesis of MAs is closely linked to iron deficiency and is highly regulated to avoid toxicity owing to excess iron uptake. The figure represents parts of the curated DMA pathway; the genomic positions of the cloned genes from this pathway are used to trace suitable QTL candidates in that region. The populations from which these QTLs were analyzed can then be used in breeding programs aimed at developing rice varieties that are tolerant to soil environments characterized by deficiencies or excesses of iron.