| Literature DB >> 17555605 |
Jing Ding1, Hitoshi Araki, Qiang Wang, Pengfei Zhang, Sihai Yang, Jian-Qun Chen, Dacheng Tian.
Abstract
BACKGROUND: Individuals in the same species are assumed to share the same genomic set. However, it is not unusual to find an orthologous gene only in small subset of the species, and recent genomic studies suggest that structural rearrangements are very frequent between genomes in the same species. Two recently sequenced rice genomes Oryza sativa L. var. Nipponbare and O. sativa L. var. 93-11 provide an opportunity to systematically investigate the extent of the gene repertoire polymorphism, even though the genomic data of 93-11 derived from whole-short-gun sequencing is not yet as complete as that of Nipponbare.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17555605 PMCID: PMC1914357 DOI: 10.1186/1471-2164-8-154
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1An example of sequence comparison between two rice lines, Nipponbare and 93-11. The region of 6,070,001–6,629,564 bp on chromosome 11 (based on IRGSP database) is shown. These 683 kb consecutive sequences, sampled around R gene clusters, were aligned between Nipponbare (559,563 bp) and 93-11 (428,114 bp). Dotted line represents where a sequence in one line cannot be aligned with a sequence at the corresponding position in the other line. Genes are shown as square boxes pointing in the direction of transcription. This region contains 70 non-TE genes in Nipponbare and 62 in 93-11. 53 allelic non-TE genes are colored as green, 8 P/A as red, 4 P/Ad as orange, 14 AL as magenta, and 30 TE genes as blue boxes. All R genes are labeled above or below the boxes. The sizes of genes, intergenic sequences, and indels are proportionally scaled down. In this region, the PAG is 30.7%, and the percentage of asymmetric genes is 39.4% ((8 P/A +4 P/Ad +14 AL)/66).
Statistics of indels between Nipponbare and 93-11 genomes in 35 Mb sequences.
| Cultivar | Inserts | Asymmetric genes in inserts | |||||
| Length | No. | Total length (bp) | Known | Unknown | TEa | Total | |
| Nipponbare | 1–3 kb | 213 | 306,912 | 31 | 4 | 10 | 45 |
| (34.7 Mb | 3–5 kb | 79 | 316,675 | 17 | 3 | 26 | 46 |
| aligned) | 5–10 kb | 97 | 629,958 | 15 | 17 | 65 | 97 |
| 10–20 kb | 58 | 752,324 | 23 | 12 | 80 | 115 | |
| > 20 kb | 9 | 287,930 | 7 | 3 | 25 | 35 | |
| Sub-total | 456 | 2,293,799 | 93 | 39 | 206 | 338 | |
| 93-11 | 1–3 kb | 232 | 410,336 | 23 | 13 | 11 | 47 |
| (34.4 Mb | 3–5 kb | 73 | 283,721 | 26 | 12 | 9 | 47 |
| aligned) | 5–10 kb | 84 | 581,674 | 35 | 17 | 15 | 67 |
| 10–20 kb | 36 | 464,090 | 56 | 11 | 1 | 68 | |
| > 20 kb | 2 | 42,205 | 0 | 4 | 2 | 6 | |
| Sub-total | 427 | 1,782,026 | 140 | 57 | 38 | 235 | |
| Total | 883 | 4,075,825 | 96 | 244 | 573 | ||
Note. The bolded numbers are the same as the numbers in the bottom last column.
a TE is transposable element related gene.
Statistics of genes in genome, chromosome 10 and four gene groups.
| Gene groupa | Symmetric or asymmetric | Known functionb | Multigene familyc | Clustered genesc | Total |
| Inserts in Table 1 | |||||
| In Nipponbare | P/A | 13 | 3 | 1 | 20 |
| P/Ad | 43 | 56 | 8 | 56 | |
| AL | 37 | 46 | 11 | 56 | |
| In 93-11 | P/A | 45 | 1 | 0 | 62 |
| P/Ad | 23 | 35 | 6 | 35 | |
| AL | 72 | 36 | 9 | 100 | |
| Total asy.-genes | 233 | 177 | 35 | 329 | |
| Chromosome 10 | |||||
| Genes in Nip. | Symmetricd | 284 | 2,572 | ||
| P/A | 40(18) | 2(1) | 0(0) | 56(26) | |
| P/Ad | 49(31) | 64(45) | 2(1) | 64(45) | |
| AL | 32(23) | 17(12) | 2(2) | 49(40) | |
| NDe | 80 | ||||
| Genes in 93-11 | Symmetricd | 2,556 | |||
| P/A | 46(30) | 15(11) | 5(4) | 68(48) | |
| P/Ad | 49(40) | 72(60) | 2(2) | 72(60) | |
| AL | 37(24) | 11(4) | 1(0) | 46(31) | |
| NDe | 6 | ||||
| Total asy.-genes | 253(166) | 181(133) | 12(9) | 355(250) | |
| Gene family | Symmetric | 339 | 35 | 132 | 339 |
| P/A | 105 | 0 | 57 | 105 | |
| P/Ad | 42 | 42 | 37 | 42 | |
| AL | 116 | 38 | 88 | 116 | |
| Total asy.-genes | 263 | 80 | 182 | 263 | |
| Total | |||||
| Symmetric | 264 | 99 | 111 | 264 | |
| P/A | 17 | 4 | 10 | 17 | |
| AL | 102 | 60 | 72 | 102 | |
| Total asy.-genes | 119 | 64 | 82 | 119 | |
| Symmetric | 200 | 26 | 22 | 200 | |
| P/A | 6 | 0 | ND | 6 | |
| AL | 4 | 2 | 0 | 4 | |
| Total asy.-genes | 10 | 2 | 0 | 10 |
a Asymmetric genes include P/A, P/Ad and AL genes. The genes in unknown function, in single gene families and in non-clustered genes are equal to the total number of genes subtracts the genes in known function, in multigene families and in clustered genes, respectively.
b The genes in the group of function known are determined by GO (gene ontology) classification.
c The definition of multigene or clustered gene is described in Methods.
d The symmetric genes between Nipponbare and 93-11 should be equal. However when an allele is a pseudogene, we still classify this gene into symmetric group because it has an allelic sequence. Therefore the numbers are not equal; the numbers in parentheses represent the P/A, P/Ad and AL genes identified by the first method (see Methods for details), respectively. Otherwise the numbers stand for the asymmetric genes identified by the second method.
e ND, not determined by the incomplete genome sequences (mainly in 93-11).
Distribution of genes among rice populations or between species.
| Rice lines | In Nipponbare only | In both lines-single gene | In both-multigene | Non- | ||||||||||||||||||||||||||||||
| Country | lines | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 |
| Japan | Nipponbare | - | - | - | - | - | - | - | - | - | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| China | 9311 | + | + | + | + | + | + | + | + | + | + | + | - | - | - | - | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| USA | Calrose | - | - | - | - | - | + | - | - | - | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| USA | S4542 | - | - | + | + | - | + | - | - | - | - | - | - | - | - | + | + | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Haiti | Grassy | + | - | - | - | - | + | - | - | - | - | - | - | - | - | + | + | - | - | + | - | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Guyana | Vintula | + | - | - | + | - | + | - | - | - | - | - | + | + | - | + | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Zaire | CI | + | + | - | + | - | + | + | + | - | - | + | - | + | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Tanzania | WC | - | - | - | + | - | - | - | - | - | - | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Japan | Beiguang | + | - | - | + | - | - | - | - | - | - | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| South Korea | Milyang46 | + | + | + | + | + | + | + | + | + | - | + | - | - | - | - | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| India | Dular | + | - | - | - | - | - | + | + | + | - | - | - | + | - | + | - | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Philippines | IR24 | + | + | + | + | + | + | - | - | + | + | + | + | - | - | + | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| China | NJ16 | + | - | + | + | - | + | + | + | + | + | + | + | + | - | - | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| China | Dadaotou | - | - | - | + | - | - | - | - | - | - | - | + | + | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| China | Zhenshan97 | + | + | + | - | + | + | + | - | + | - | - | - | + | - | - | - | - | + | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + |
| China | Yunjing | - | - | + | - | - | + | - | - | - | - | - | - | + | - | + | + | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| China | C416 | - | + | - | - | + | + | - | - | + | - | - | - | - | - | + | + | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| China | PA64 | - | - | + | + | - | + | + | + | + | - | - | - | - | - | + | - | - | + | - | - | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Wild Rice | S03005 | + | - | + | + | + | + | + | + | + | + | + | + | + | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Wild Rice | S01169 | - | + | - | + | + | + | - | - | - | - | + | - | + | - | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + | + |
| Present frequency% | 55 | 35 | 45 | 65 | 30 | 75 | 40 | 35 | 45 | 20 | 40 | 35 | 65 | 20 | 80 | 60 | 40 | 85 | 90 | 85 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| π value (‰) | 2.4 | 2.0 | 12 | 1.3 | 0.7 | 48 | 2.8 | 0.6 | 0.0 | 0.0 | 12 | 0.0 | 0.5 | 0.0 | 2.9 | 0.0 | 0.0 | 38 | 13 | 0.8 | 2.0 | 1.8 | 2.1 | 4.0 | 0.0 | 135 | 2.9 | 79 | 74 | 41 | 0.0 | 5.0 | 2.8a | |
| Related sp. | - | - | - | - | + | - | - | - | - | - | - | - | + | - | - | + | + | - | + | - | - | + | + | + | + | + | - | - | - | + | + | + | + | |
| Related sp. | - | - | - | - | - | - | + | - | - | - | - | - | + | - | - | + | + | - | + | + | - | - | + | - | - | - | - | + | - | - | + | + | + | |
Note. + stands for the presence of gene, and – for absence of gene suggested by PCR and sequencing; Gene 1-30 are R genes, and 31-33 are house keeping genes. The accession number or name of gene 1-33 is OsIFCC009208, OsIFCC035754, OsIFSC047991, OsIFCC019895, OsIFCC035751, OsIFCC035229, OsIFCC043050, OsIFCC045542, OsIFCC023444, OsIFCC026523, OsIFSC046156, Os12g31620, Os11g45330, Os06g16790, Os12g09730, Os10g04670, Os11g11550, Os12g28250, Os11g15670, Os01g20720, Os12g18360 (Pi-ta), Os04g41370, Os07g29820, Os02g25900, OsIFCC040819, Os12g37270 and Os12g37280 (Pib homologue), Os12g03750 (OsIFCC032042 and OsIFCC032029 in 93-11), Os01g57270 (and Os01g57280, Os01g57310, Os01g57340), Os05g16180 (and Os05g16200, Os11g47780), Os08g10430 (and Os08g10440), Os01g05490 (triosephosphate isomerase), Os03g61120 (anthranilate synthase), and Os05g49760 (isocitrate dehydrogenase), respectively.
aπ, valued as 1/1000, denotes the average nucleotide diversity and the values for the five multigene families and three house keeping genes were calculated from allelic pairs between Nipponbare and 93-11.
Figure 2Grouping of . This tree was created based on the discrete morphology (parsimony) method using the programs SEQBOOT, PARS and CONSENSE of the PHYLIP package v3.6. Percent bootstrap values from 1000 replicate samples are indicated at all the major nodes. a indicates japonica rice.