| Literature DB >> 22639581 |
Hiroshi Mizuno1, Yoshihiro Kawahara, Jianzhong Wu, Yuichi Katayose, Hiroyuki Kanamori, Hiroshi Ikawa, Takeshi Itoh, Takuji Sasaki, Takashi Matsumoto.
Abstract
There is controversy as to whether gene expression is silenced in the functional centromere. The complete genomic sequences of the centromeric regions in higher eukaryotes have not been fully elucidated, because the presence of highly repetitive sequences complicates many aspects of genomic sequencing. We performed resequencing, assembly, and sequence finishing of two P1-derived artificial chromosome clones in the centromeric region of rice (Oryza sativa L.) chromosome 5 (Cen5). The pericentromeric region, where meiotic recombination is silenced, is located at the center of chromosome 5 and is 2.14 Mb long; a total of six restriction-fragment-length polymorphism markers (R448, C1388, S20487S, E3103S, C53260S, and R2059) genetically mapped at 54.6 cM were located in this region. In the pericentromeric region, 28 genes were annotated on the short arm and 45 genes on the long arm. To quantify all transcripts in this region, we performed massive parallel sequencing of mRNA. Transcriptional density (total length of transcribed region/length of the genomic region) and expression level (number of uniquely mapped reads/length of transcribed region) were calculated on the basis of the mapped reads on the rice genome. Transcriptional density and expression level were significantly lower in Cen5 than in the average of the other chromosomal regions. Moreover, transcriptional density in Cen5 was significantly lower on the short arm than on the long arm; the distribution of transcriptional density was asymmetric. The genomic sequence of Cen5 has been integrated into the most updated reference rice genome sequence constructed by the International Rice Genome Sequencing Project.Entities:
Keywords: International Rice Genome Sequencing Project; P1-derived artificial chromosome; centromere; genome sequencing; mRNA-Seq
Year: 2011 PMID: 22639581 PMCID: PMC3355683 DOI: 10.3389/fpls.2011.00016
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Genetic map and PAC/BAC physical map of . Two PAC clones (P0697B04 and P0587F01; black bars) were sequenced. The PAC/BAC contig was mapped by using restriction-fragment-length polymorphism markers S20487S and E3103S, which were located on the short and long arms, respectively, of chromosome 5 at 54.6 cM. Red boxes represent RCS2/CentO clusters.
Improvement of the sequences of PAC clones.
| P0587F01 | P0697B04 | |||
|---|---|---|---|---|
| Accession number | AC146339 | AP011109 | AC137984 | AP011110 |
| Contigs | 12 | 1 | 7 | 1 |
| Status | HTGS_PHASE1 | PLN_PHASE3 | HTGS_PHASE2 | HTGS_PHASE2 |
| Length (bp) | 149,330 | 52,858 | 114,329 | 147,577 |
*The number and orientation of RCS2/CentO repeats were not determined. HTGS, high-throughput genomic sequence; Phase 1: unfinished; may be unordered, unoriented contigs, with gaps. Phase 2: unfinished, ordered, oriented contigs, with or without gaps. Phase 3: finished, no gaps. PLN: plant, fungal, and algal sequences of Phase 3.
Figure 2Distribution of expressed regions in . The positions of restriction-fragment-length polymorphism (RFLP) markers mapped at 54.3–55.4 cM are indicated. The region in which RFLP markers are mapped at 54.6 cM is shown (gray box). The distribution of 36-bp mapped reads on the rice genome was graphed in GBrowse (Stein et al., 2002). The graph indicates the average depths of reads from mRNA-Seq for samples obtained from shoots (green) or roots (red). Only depths <50 are shown (Depths ≥50 are shown as 50). The level of expression is normalized to that of the shoot (standard). Gene models based on Rice Annotation Project (RAP) representative loci (RAP_rep) and RAP predicted genes (RAP_pred) are shown. Expression of an RAP predicted gene is shown (white triangle). The position of Os05g0303000, a homolog of the wheat PSR161 gene mapped on wheat Cen1B (see text), is also indicated (black triangle). Red boxes represent RCS2/CentO clusters.
Figure A1Distribution of expressed regions proximal to . The distributions of reads mapped on Cen4 (A) and Cen8 (B) were graphed in GBrowse, as in Figure 2. Os04g0234600 in Cen4 is extremely highly expressed (black triangle). Os08g0319450 in Cen8 is located in the small RCS2/CentO sequence.
Figure 3Relationships of gene density, transcriptional density, and expression levels. Average scores of gene density, transcriptional density, and expression level in each 1 Mb of sliding windows on chromosomes 4, 5, and 8 are shown. Horizontal axis indicates position on the reference rice genome (Build 5.0 pseudomolecules) constructed by the International Rice Genome Sequencing Project. Lines indicate the boundaries of each pericentromeric region.
Comparison of transcription in centromeric regions and in the whole genomic region.
| Genomic region (bp) | Tissue | Transcribed region (bp) | No. of uniquely mapped reads | Transcriptional density | Expression level | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Centromere | Other | Centromere | Other | Centromere | Other | Centromere | Other | Centromere | Other | ||||
| 2,088,655 | 33,973,212 | Shoot | 115,495 | 5,005,925 | 48,875 | 1,368,096 | 0.055 | 0.147 | <0.0001 | 214.7 | 273.3 | <0.0001 | |
| Root | 101,453 | 5,096,940 | 15,409 | 1,084,734 | 0.049 | 0.150 | <0.0001 | 151.9 | 212.8 | <0.0001 | |||
| 2,139,098 | 27,934,342 | Shoot | 149,160 | 4,687,562 | 34,964 | 1,241,332 | 0.070 | 0.168 | <0.0001 | 234.4 | 264.8 | <0.0001 | |
| Root | 138,618 | 4,758,232 | 24,607 | 1,139,487 | 0.065 | 0.170 | <0.0001 | 177.5 | 239.5 | <0.0001 | |||
| 2,431,594 | 26,098,435 | Shoot | 231,866 | 3,862,076 | 36,813 | 1,197,700 | 0.095 | 0.148 | <0.0001 | 158.8 | 310.1 | <0.0001 | |
| Root | 239,917 | 3,843,450 | 35,396 | 787,908 | 0.099 | 0.147 | <0.0001 | 147.5 | 205.0 | <0.0001 | |||
Statistical significance (P) was based on Fisher's exact test. Expression levels in the centromeric region of chromosome 4 were calculated without the gene Os04g0234600 (see text). The centromeric region was defined as from the start position of the short arm of the pericentromeric region to the end position of the long arm of the pericentromeric regions. Transcribed region, transcriptional density, and expression level are defined in Section “Materials and Methods.”
Comparison of transcription in .
| Genomic region (bp) | Tissue | Transcribed region (bp) | No. of uniquely mapped reads | Transcriptional density | Expression level | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pericent. short arm | Pericent. long arm | Pericent. short arm | Pericent. long arm | Pericent. short arm | Pericent. long arm | Pericent. short arm | Pericent. long arm | Pericent. short arm | Pericent. long arm | |||||||||
| 1,779,938 | 124,271 | 184,446 | Shoot | 88,498 | 140 | 26,857 | 40,984 | 5 | 7,886 | 0.050 | 0.001 | 0.146 | <0.0001 | 463.1 | 35.7 | 293.6 | <0.0001 | |
| Root | 75,230 | 288 | 25,935 | 10,869 | 10 | 4,530 | 0.042 | 0.002 | 0.141 | <0.0001 | 144.5 | 34.7 | 174.7 | <0.0001 | ||||
| 1,063,874 | 97,181 | 978,043 | Shoot | 41,629 | 0 | 107,531 | 4,332 | 0 | 30,632 | 0.039 | 0.000 | 0.110 | <0.0001 | 104.1 | 0.0 | 284.9 | <0.0001 | |
| Root | 37,507 | 0 | 101,111 | 9,725 | 0 | 14,882 | 0.035 | 0.000 | 0.103 | <0.0001 | 259.3 | 0.0 | 147.2 | <0.0001 | ||||
| 935,763 | 76,165 | 1,419,666 | Shoot | 94,769 | 178 | 136,919 | 14,495 | 5 | 22,313 | 0.101 | 0.002 | 0.096 | <0.0001 | 153.0 | 28.1 | 163.0 | <0.0001 | |
| Root | 95,763 | 176 | 143,978 | 14,069 | 5 | 21,322 | 0.102 | 0.002 | 0.101 | 0.496 | 146.9 | 28.4 | 148.1 | 0.0224 | ||||
Statistical significance of the difference in gene expression between the short arm and long arm (P) was based on Fisher's exact test. Transcribed region, transcriptional density, and expression level are defined in Section “Materials and Methods.”
Annotated genes in .
| Gene_ID | S/L | Start | End | Length | Strand | Description | RPKM_shoot | RPKM_root |
|---|---|---|---|---|---|---|---|---|
| Os05g0276500 | S | 11422127 | 11423907 | 1101 | − | Expansin Os-EXPA3 | 0.2 | 40.83 |
| Os05g0277000 | S | 11447481 | 11448493 | 754 | − | Similar to Expansin Os-EXPA3 | 0.22 | 27.83 |
| Os05g0277200 | S | 11463176 | 11464421 | 1246 | + | Conserved hypothetical protein | 2.73 | 3.63 |
| Os05g0277300 | S | 11465333 | 11469546 | 3295 | − | Similar to cDNA clone: 001-013-F11 | 7.59 | 6.48 |
| Os05g0277350 | S | 11474153 | 11475272 | 691 | + | Similar to leucine rich repeat family protein | 0 | 0 |
| Os05g0277500 | S | 11496173 | 11497090 | 840 | + | Similar to germin-like protein subfamily 2 member 4 precursor | 1.6 | 9.45 |
| Os05g0278500 | S | 11638503 | 11644034 | 1551 | − | Transferase family protein | 6.91 | 162.59 |
| Os05g0278550 | S | 11643456 | 11644083 | 628 | + | Hypothetical gene | 5.6 | 144.79 |
| Os05g0278950 | S | 11670044 | 11672821 | 715 | − | Similar to ATP-dependent Clp protease proteolytic subunit | 0 | 0 |
| Os05g0279300 | S | 11676865 | 11685603 | 1209 | − | Similar to tRNA pseudouridine synthase A | 3.23 | 1.39 |
| Os05g0279400 | S | 11689568 | 11695386 | 3310 | + | Zinc-finger, RING-type domain containing protein | 23.83 | 21.99 |
| Os05g0279600 | S | 11700491 | 11709989 | 1352 | + | Endonuclease/exonuclease/phosphatase domain containing protein | 5 | 7.01 |
| Os05g0279750 | S | 11721971 | 11726153 | 4183 | + | Hypothetical gene | 0 | 0.02 |
| Os05g0279900 | S | 11728764 | 11731789 | 1475 | + | Similar to Polygalacturonase A | 7.49 | 2.52 |
| Os05g0280200 | S | 11752678 | 11754728 | 672 | − | Similar to Ras-related protein RGP2 | 52.18 | 55.95 |
| Os05g0280350 | S | 11752728 | 11754718 | + | Hypothetical gene | 57.65 | 63.09 | |
| Os05g0280500 | S | 11782293 | 11785389 | 1881 | − | Phospholipid/glycerol acyltransferase domain containing protein | 0.77 | 73.36 |
| Os05g0280700 | S | 11817709 | 11820877 | 3169 | − | Similar to resistance protein candidate | 0.23 | 0 |
| Os05g0281400 | S | 11920597 | 11921702 | 1013 | + | Protein of unknown function DUF810 domain containing protein | 5.62 | 5.75 |
| Os05g0282500 | S | 12041129 | 12043113 | 600 | − | Hypothetical conserved gene | 0.09 | 0 |
| Os05g0282900 | S | 12079928 | 12081735 | 1808 | + | Conserved hypothetical protein | 0.19 | 0.97 |
| Os05g0283000 | S | 12088257 | 12091692 | 1607 | + | Conserved hypothetical protein | 0.07 | 0 |
| Os05g0283200 | S | 12098520 | 12099575 | 1056 | + | Pectinesterase inhibitor domain containing protein | 0 | 0 |
| Os05g0283600 | S | 12122939 | 12131356 | 3569 | + | Zinc-finger, CCHC-type domain containing protein | 0 | 0 |
| Os05g0285900 | S | 12322935 | 12327534 | 1162 | + | Conserved hypothetical protein | 2.02 | 2.88 |
| Os05g0286100 | S | 12337263 | 12338299 | 1037 | + | Similar to zinc-finger protein KNUCKLES | 0 | 14.26 |
| Os05g0286200 | S | 12353858 | 12356702 | 772 | + | Conserved hypothetical protein | 0 | 0 |
| Os05g0287800 | S | 12482678 | 12486801 | 1445 | + | Conserved hypothetical protein | 6.8 | 17.5 |
| Os05g0289100 | L | 12601354 | 12602492 | 1058 | + | Hypothetical conserved gene | 0 | 0 |
| Os05g0289400 | L | 12630181 | 12635126 | 2682 | − | Similar to CRN (Crooked neck) protein | 19.63 | 29.5 |
| Os05g0289700 | L | 12650476 | 12651976 | 1395 | + | Arbuscular mycorrhizal specific marker 10. Benzyl alcohol benzoyl transferase | 0 | 0 |
| Os05g0290300 | L | 12704171 | 12705720 | 1219 | − | Hypothetical conserved gene | 5.13 | 11.83 |
| Os05g0290400 | L | 12704190 | 12715035 | 2613 | + | Hypothetical gene | 6.92 | 12.32 |
| Os05g0291600 | L | 12860254 | 12860794 | 541 | + | Hypothetical conserved gene | 0 | 0.13 |
| Os05g0291700 | L | 12862505 | 12868432 | 1316 | − | Similar to PTAC16 | 263 | 1.22 |
| Os05g0291800 | L | 12872863 | 12873488 | 526 | + | Similar to predicted protein | 0 | 0 |
| Os05g0292200 | L | 12895006 | 12901403 | 1630 | + | Similar to Transcription factor IIA large subunit (TFIIA-L1) | 30.18 | 29.59 |
| Os05g0292800 | L | 12925027 | 12925834 | 551 | + | Similar to one helix protein (OHP) | 183.51 | 8.6 |
| Os05g0293500 | L | 12962105 | 12967380 | 1237 | − | Similar to Pectate lyase B | 0 | 0 |
| Os05g0293600 | L | 12978536 | 12984017 | 5482 | + | Similar to RNA polymerase beta’ chain | 0 | 0 |
| Os05g0294600 | L | 13018766 | 13021491 | 2425 | − | Pentatricopeptide repeat domain containing protein | 14.97 | 2.73 |
| Os05g0294800 | L | 13035304 | 13039195 | 2262 | + | Hypothetical gene | 10.52 | 10.5 |
| Os05g0295100 | L | 13056572 | 13075697 | 2031 | + | Hypothetical conserved gene | 0.99 | 2.73 |
| Os05g0295200 | L | 13086136 | 13089296 | 2181 | − | Conserved hypothetical protein | 10.32 | 1.34 |
| Os05g0295300 | L | 13093233 | 13094329 | 952 | − | Similar to acetyl-coenzyme A carboxylase | 40.12 | 45.31 |
| Os05g0295700 | L | 13117580 | 13121926 | 2251 | − | Similar to homoserine dehydrogenase-like protein | 10.22 | 11.75 |
| Os05g0295800 | L | 13123210 | 13127786 | 1052 | − | Similar to glyoxalase I | 39.12 | 36.36 |
| Os05g0295900 | L | 13135652 | 13144818 | 3064 | − | Conserved hypothetical protein | 0.71 | 2.97 |
| Os05g0296200 | L | 13169380 | 13171753 | 2374 | + | Conserved hypothetical protein | 0 | 0 |
| Os05g0296600 | L | 13216923 | 13217232 | 310 | + | Non-protein coding transcript | 23.77 | 62.28 |
| Os05g0296700 | L | 13221667 | 13222206 | 540 | − | Similar to small heat shock protein | 3.62 | 3.24 |
| Os05g0296750 | L | 13221730 | 13222352 | 623 | + | Hypothetical gene | 3.23 | 2.34 |
| Os05g0296800 | L | 13226211 | 13228572 | 897 | − | Hypothetical protein | 0.31 | 0.32 |
| Os05g0296900 | L | 13259004 | 13259727 | 508 | − | Conserved hypothetical protein | 0 | 0 |
| Os05g0297001 | L | 13261758 | 13263921 | 2164 | + | Similar to predicted protein | 0 | 0 |
| Os05g0297300 | L | 13287199 | 13288934 | 1736 | + | Protein of unknown function DUF1618 domain containing protein | 0 | 0 |
| Os05g0297400 | L | 13289996 | 13290998 | 992 | − | Similar to CXIP4 | 0 | 0 |
| Os05g0297800 | L | 13304340 | 13307779 | 2408 | − | Conserved hypothetical protein | 0.77 | 0.21 |
| Os05g0297850 | L | 13309305 | 13309728 | 424 | − | Hypothetical conserved gene | 0 | 0 |
| Os05g0297900 | L | 13311413 | 13315153 | 1034 | + | Similar to signal peptidase 18 subunit | 9.67 | 17.76 |
| Os05g0298200 | L | 13337235 | 13341454 | 2401 | + | Ankyrin repeat containing protein | 14.93 | 9.83 |
| Os05g0298600 | L | 13349202 | 13351414 | 2213 | − | Hypothetical conserved gene | 3.43 | 5.1 |
| Os05g0298700 | L | 13357011 | 13359346 | 1220 | − | Similar to xylan endohydrolase isoenzyme X-I | 0 | 0 |
| Os05g0298900 | L | 13395955 | 13396672 | 718 | + | Conserved hypothetical protein | 6.84 | 14.11 |
| Os05g0299000 | L | 13400919 | 13401654 | 736 | + | Hypothetical protein | 0.08 | 0.1 |
| Os05g0299101 | L | 13402647 | 13403283 | 550 | − | Hypothetical gene | 0.41 | 0 |
| Os05g0299200 | L | 13407527 | 13412497 | 1491 | − | Hypothetical conserved gene | 10.04 | 2.98 |
| Os05g0299300 | L | 13414154 | 13420043 | 3226 | − | WD40 repeat-like domain containing protein | 4.48 | 5.13 |
| Os05g0299500 | L | 13434338 | 13439817 | 1563 | + | Protein of unknown function DUF914 | 6.64 | 15.66 |
| Os05g0299600 | L | 13440049 | 13442337 | 2171 | − | Protein of unknown function DUF1677 | 1.18 | 0.67 |
| Os05g0299700 | L | 13450657 | 13453015 | 2359 | − | Similar to expressed protein (zinc-finger-like protein) | 38.25 | 39.38 |
| Os05g0300700 | L | 13504825 | 13512070 | 2425 | + | Cell division cycle-associated protein domain containing protein | 9.19 | 16.98 |
| Os05g0301500 | L | 13558563 | 13563304 | 2162 | + | Similar to ribophorin I | 18.88 | 33.4 |
Genes located between restriction-fragment-length polymorphism (RFLP) markers R448 and R2059 on chromosome 5 are listed. Gene ID (gene_ID); mapped on short arm or long arm (S/L); start position (start); end position (end); total nucleotide length of each transcript (length); coding strand (strand); description in Rice Annotation Project Database (description); RPKM in shoot (RPKM_shoot); and RPKM in root (RPKM_root) are listed. The position of RFLP markers and RCS2/CentO repeats are also shown in bold letter.
Figure 4Relationships between location and RPKM (reads per kilobase of exon models per million mapped reads) of genes. RPKM of each gene on chromosomes 4, 5, and 8 are shown. Only genes with RPKM < 100 are shown. Horizontal axis indicates the positions of genes in the reference rice genome (Build 5.0 pseudomolecules) constructed by the International Rice Genome Sequencing Project. Red triangles indicate the positions of RCS2/CentO clusters.
Figure A2Distributions of . RCS2/CentO clusters in Cen4, Cen5, and Cen8 on rice genome sequence Build 5.0 (blue boxes) are shown. The small RCS2/CentO sequence in Cen8 (Figure A1B in Appendix) is not shown. Gene models based on Rice Annotation Project (RAP) representative genes are shown.