Chen Sun, Zhiqiang Hu, Tianqing Zheng, Kuangchen Lu, Yue Zhao, Wensheng Wang, Jianxin Shi, Chunchao Wang, Jinyuan Lu, Dabing Zhang, Zhikang Li, Chaochun Wei.
Abstract
A pan-genome is the union of the gene sets of all the individuals of a clade or a species, and it adds a new dimension of genome complexity through the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome studies are becoming affordable for eukaryotes with large genomes. The Asian cultivated rice, Oryza sativa L., is one of the major food sources for the world and a model organism in plant biology. Recently, the 3000 Rice Genome Project (3K RGP) sequenced more than 3000 rice genomes with a mean sequencing depth of 14.3×, which provided a tremendous resource for rice research. In this paper, we present a genome browser, Rice Pan-genome Browser (RPAN), as a tool to search and visualize the rice pan-genome derived from 3K RGP. RPAN contains a database of the basic information of 3010 rice accessions, including genomic sequences, gene annotations, PAV information and gene expression data of the rice pan-genome. At least 12 000 novel genes absent in the reference genome were included. RPAN also provides multiple search and visualization functions. RPAN can be a rich resource for rice biology and rice breeding. It is available at http://cgm.sjtu.edu.cn/3kricedb/ or http://www.rmbreeding.cn/pan3k.
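The definition in the abstract (a pan-genome as the union of per-individual gene sets, with PAVs recording which genes each genome carries) can be illustrated with a minimal Python sketch. The accession names, gene IDs and function names below are hypothetical toy values chosen for illustration; they are not RPAN data and this is not RPAN's implementation.

```python
from typing import Dict, List, Set


def build_pan_genome(gene_sets: Dict[str, Set[str]]) -> List[str]:
    """Return the sorted union of all per-accession gene sets."""
    pan: Set[str] = set()
    for genes in gene_sets.values():
        pan |= genes
    return sorted(pan)


def build_pav_matrix(gene_sets: Dict[str, Set[str]]) -> Dict[str, Dict[str, bool]]:
    """Map each pan-genome gene to its presence (True) or absence (False) per accession."""
    return {
        gene: {acc: gene in genes for acc, genes in gene_sets.items()}
        for gene in build_pan_genome(gene_sets)
    }


# Toy input: hypothetical accessions and gene IDs (assumption, not RPAN data).
toy_gene_sets = {
    "acc_A": {"gene1", "gene2", "gene3"},
    "acc_B": {"gene1", "gene3"},
    "acc_C": {"gene1", "gene4"},
}

pan_genome = build_pan_genome(toy_gene_sets)
pav = build_pav_matrix(toy_gene_sets)
```

In this toy example, a gene carried by every accession would correspond to a gene present in all genomes, while a gene carried by only some accessions is a PAV.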
Year: 2016 PMID: 27940610 PMCID: PMC5314802 DOI: 10.1093/nar/gkw958
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics about genomes in RPAN
| Accession group | Count |
|---|---|
| Indica | 1764 |
| Japonica | 801 |
| AUS | 221 |
| ARO | 101 |
| ADM | 123 |
| Total | 3010 |
Figure 1. The architecture of RPAN. This rice pan-genome browser contains a table browser, a tree browser, a genome browser and multiple search functions. The table browser allows users to summarize, display and download the contents of tracks. The tree browser can be used to select target genomes, which can then be displayed as tracks in the genome browser. The genome browser contains a reference individual genome as well as the novel sequences not included in the reference genome. Users can also search the pan-genome with a list of genes or accessions to check the presence/absence of the listed genes in the selected rice accessions. Search results can be displayed in the genome browser as well as in tables and figures.
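The gene-list and accession-list search described in this caption can be sketched as a lookup against a presence/absence table. The snippet below is only an illustration under stated assumptions: the table layout, the function names `search_pav` and `accessions_with_all`, and all gene and accession IDs are hypothetical and do not reflect RPAN's actual database or API.

```python
from typing import Dict, Iterable, List

PavTable = Dict[str, Dict[str, bool]]  # gene ID -> accession ID -> present?


def search_pav(pav: PavTable, genes: Iterable[str], accessions: Iterable[str]) -> PavTable:
    """Return presence/absence for the requested genes in the requested accessions."""
    wanted = list(accessions)
    result: PavTable = {}
    for gene in genes:
        row = pav.get(gene)
        if row is None:
            continue  # gene ID not in the table: skipped in this sketch
        result[gene] = {acc: row[acc] for acc in wanted if acc in row}
    return result


def accessions_with_all(pav: PavTable, genes: Iterable[str], accessions: Iterable[str]) -> List[str]:
    """List the accessions that carry every requested gene found in the table."""
    wanted = list(accessions)
    hits = search_pav(pav, genes, wanted)
    if not hits:
        return []  # no known gene was requested, so nothing can match
    return [acc for acc in wanted if all(row.get(acc, False) for row in hits.values())]


# Hypothetical toy table (assumption, not RPAN data).
toy_pav: PavTable = {
    "gene1": {"acc_A": True, "acc_B": True, "acc_C": True},
    "gene2": {"acc_A": True, "acc_B": False, "acc_C": False},
    "gene4": {"acc_A": False, "acc_B": False, "acc_C": True},
}

hits = search_pav(toy_pav, ["gene1", "gene2", "missing"], ["acc_A", "acc_B"])
carriers = accessions_with_all(toy_pav, ["gene1", "gene2"], ["acc_A", "acc_B", "acc_C"])
```

A result such as `hits` could feed a table view, while `carriers` corresponds to a shortlist of accessions containing all queried genes.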
Statistics about rice gene categorization
| Gene category | Count |
|---|---|
| Total genes | 50 995 |
| Core genes | 23 914 |
| Candidate core genes | 4986 |
| Distributed genes | 22 095 |
| Subspecies-unbalanced genes | 13 617 |
| | 5579 |
| | 6038 |
| Subspecies-specific genes | 853 |
| Indica-specific genes | 587 |
| Japonica-specific genes | 147 |
| AUS-specific genes | 67 |
| ARO-specific genes | 52 |
| Subgroup-unbalanced genes | 11 581 |
| | 9816 |
| | 3418 |
| Random genes | 5316 |
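The counts in this table can be cross-checked with simple arithmetic. The sketch below assumes, which the table does not state explicitly, that core, candidate core and distributed genes partition the total, and that the four rows listed under the subspecies-specific category sum to its count. All numbers are copied from the table; the variable names are illustrative only.

```python
# Top-level categories, copied from the table above.
total_genes = 50995
top_level = {
    "core": 23914,
    "candidate_core": 4986,
    "distributed": 22095,
}

# Assumption: the three top-level categories partition the total.
partition_sum = sum(top_level.values())
assert partition_sum == total_genes

# Share of each top-level category in the pan-genome (fractions of 1).
shares = {name: count / total_genes for name, count in top_level.items()}

# Assumption: the four rows under "Subspecies-specific genes" sum to its count.
subspecies_specific_total = 853
subspecies_specific_rows = [587, 147, 67, 52]
specific_sum = sum(subspecies_specific_rows)
assert specific_sum == subspecies_specific_total
```

Both checks pass for the values shown, which suggests the table is internally consistent for these two groupings. The other sub-rows do not sum to their parent counts, so those categories may overlap.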
Figure 2. The tree browser and genome browser of RPAN. The left panel is the tree browser, which represents the clustering of ∼3000 individual rice genomes and can be used to select genomic tracks for visualization in the genome browser. The right panel is the genome browser. From top to bottom, its tracks are the reference, gene annotation, presence frequency, accessions (red) and RNA-seq (blue). Its genomic sequences comprise a reference individual rice genome as well as the novel sequences not included in the reference genome.
Figure 3. Examples of search and visualization functions of RPAN. Os12g0569700, a gene related to rice acclimation to salt and drought stresses, was searched. Search results include (A) the distribution of the gene in high-quality accessions and (B) heat maps of the gene presence frequency in different subspecies and subgroups. (C) Visualization of this gene with three RNA-seq tracks.
Figure 4. An example of shortlisting candidate donors for early-japonica breeding with RPAN. Using the visualization function for the distributed key gene Os08g0174500 (A–C), which controls the day-length sensitivity of rice, the donors were shortlisted by visualizing the PAVs of Os08g0174500 (D).
Figure 5. Examples of search and visualization functions of RPAN. A list of 132 genes associated with domestication was searched. Search results include (A) statistics on the categorization of the genes; (B) the distribution of rice accessions containing all the genes; and (C), (D) and (E) visualizations of the gene categorization in pie charts.