Xiaobao Shi1, Junwei Wu1, Raphael Anue Mensah1, Na Tian1, Jiapeng Liu1, Fan Liu1, Jialan Chen1, Jingru Che1, Ye Guo1, Binghua Wu1, Guangyan Zhong2, Chunzhen Cheng1.
Abstract
Introns exist not only in coding sequences (CDSs) but also in the untranslated regions (UTRs) of a gene. Recent studies in animals and in model plants such as Arabidopsis have revealed that UTR introns (UIs) are widely present in most genomes and are involved in the regulation of gene expression or RNA stability. In the present study, we identified introns in both the 5'UTRs (5UIs) and 3'UTRs (3UIs) of sweet orange genes, investigated their size and nucleotide distribution characteristics, and explored the distribution of cis-elements in the UI sequences. The functional categories of genes with predicted UIs were further analyzed using GO, KEGG, and PageMan enrichment. In addition, the organ-dependent splicing and abundance of selected UI-containing genes in root, leaf, and stem were experimentally determined. In total, we identified 825 5UI- and 570 3UI-containing transcripts, corresponding to 617 and 469 genes, respectively. Among them, 74 genes contain both 5UIs and 3UIs. Nucleotide distribution analysis showed that 5UIs are biased toward both ends of the 5'UTR, while 3UIs are biased toward the start of the 3'UTR. Cis-element analysis revealed that 5UI and 3UI sequences were rich in promoter-enhancing elements, suggesting that UIs might regulate gene expression through these elements. Functional enrichment analysis revealed that genes containing 5UIs are significantly enriched in the RNA transport pathway, whereas genes containing 3UIs are significantly enriched in the spliceosome pathway. Notably, many pentatricopeptide repeat-containing protein genes and disease resistance genes were found to contain 3UIs. RT-PCR confirmed the existence of UIs in the transcripts of the eight selected genes, and alternative splicing events were found in some of them. Meanwhile, qRT-PCR showed that UIs were differentially expressed among organs, and significant correlations were found between the expression of some genes and that of their UIs; for example, the expression of VPS28 and its 3UI was significantly negatively correlated. This is the first genome-wide report on UIs in sweet orange, and it provides evidence for further understanding the role of UIs in gene expression regulation.
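The distinction between 5UIs, CDS introns, and 3UIs described in the abstract can be illustrated with a minimal sketch. It assumes a plus-strand transcript given as exon intervals plus CDS start and end coordinates; the function name `classify_introns` and the toy coordinates are hypothetical and are not taken from the authors' pipeline.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # 1-based, inclusive (start, end)


def classify_introns(
    exons: List[Interval], cds_start: int, cds_end: int
) -> List[Tuple[Interval, str]]:
    """Label each intron of a plus-strand transcript as 5UI, CDS, or 3UI.

    An intron is the gap between two consecutive exons. It is called a 5UI
    if it ends before the CDS start, a 3UI if it begins after the CDS end,
    and a CDS intron otherwise.
    """
    exons = sorted(exons)
    labelled = []
    for (_, left_end), (right_start, _) in zip(exons, exons[1:]):
        intron = (left_end + 1, right_start - 1)
        if intron[1] < cds_start:
            label = "5UI"
        elif intron[0] > cds_end:
            label = "3UI"
        else:
            label = "CDS"
        labelled.append((intron, label))
    return labelled


# Hypothetical transcript: four exons, CDS spanning positions 350..900.
toy_exons = [(1, 100), (301, 500), (701, 950), (1101, 1200)]
result = classify_introns(toy_exons, cds_start=350, cds_end=900)
for intron, label in result:
    print(intron, label)
```

For minus-strand genes the same logic would apply after mirroring the coordinates, which this sketch leaves out for brevity.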
Keywords: Cis-elements; UTR intron; gene expression; nucleotide distribution
Year: 2020 PMID: 32349372 PMCID: PMC7247714 DOI: 10.3390/ijms21093088
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Statistics information of 5′ untranslated regions (UTR), coding sequences (CDSs) and 3′ UTR in Citrus sinensis.
| | Number of Sequences | Sequences with Introns | Total Bases (Genomic) | Intron/Sequence | Number of Introns/Nucleotide (mRNA) |
|---|---|---|---|---|---|
| 5′UTR | 16,916 | 617 | 6.8 × 10⁶ | 0.06 | 1.42 × 10⁻⁴ |
| CDS | 23,394 | 17,897 | 3.8 × 10⁷ | 4.81 | 2.98 × 10⁻³ |
| 3′UTR | 17,408 | 469 | 1.2 × 10⁷ | 0.04 | 6.36 × 10⁻⁵ |
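One plausible reading of the two density columns is introns divided by the number of sequences and introns divided by the total bases. The sketch below checks that reading for the UTR rows. It is an assumption rather than the authors' stated method: the intron totals (965 and 745) are the column sums of the per-chromosome 5UI and 3UI counts reported in the chromosome distribution table of this record, and the total bases are the rounded values shown above.

```python
# Intron totals are sums of the per-chromosome counts (an assumption about
# which totals the table uses); bases are the rounded values from the table.
regions = {
    "5'UTR": {"sequences": 16_916, "introns": 965, "bases": 6.8e6},
    "3'UTR": {"sequences": 17_408, "introns": 745, "bases": 1.2e7},
}


def densities(region: dict) -> tuple:
    """Return (introns per sequence, introns per nucleotide)."""
    per_sequence = region["introns"] / region["sequences"]
    per_base = region["introns"] / region["bases"]
    return per_sequence, per_base


computed = {name: densities(r) for name, r in regions.items()}
for name, (per_sequence, per_base) in computed.items():
    print(f"{name}: {per_sequence:.2f} introns/sequence, {per_base:.2e} introns/nt")
```

Because the total bases are rounded to two significant figures, the per-nucleotide value recomputed for the 3′UTR is close to, but not identical with, the tabulated figure.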
Numbers of UTR introns (UIs) in 5UI- and 3UI-containing gene transcripts (abbreviated as 5UI-Ts and 3UI-Ts, respectively). "1 UI" to "6 UIs" indicate that 1 to 6 UIs are present in the UTR. In total, 825 gene transcripts contain 5UIs and 570 gene transcripts contain 3UIs. NS: not shown.
| UI Number | 5UI-Ts Number/Percentage (Gene ID) | 3UI-Ts Number/Percentage (Gene ID) |
|---|---|---|
| 1 UI | 713/86.43% (NS) | 448/78.60% (NS) |
| 2 UIs | 90/10.91% (NS) | 90/15.79% (NS) |
| 3 UIs | 18/2.18% (Cs1g06160.1, Cs2g16080.1, Cs2g03700.1, Cs2g01050.1, Cs3g12240.1, Cs3g19040.1, Cs4g10860.1, Cs5g07860.1, Cs6g06255.1, Cs7g12410.1, Cs9g09475.1, orange1.1t05830.1, orange1.1t01413.1, orange1.1t02679.1, orange1.1t02923.1, orange1.1t04234.1, orange1.1t05909.1, orange1.1t06043.1) | 18/3.16% (Cs1g09404.1, Cs1g10030.1, Cs3g02530.1, Cs3g21100.1, Cs3g25390.1, Cs4g03945.1, Cs6g08820.1, Cs7g04620.1, Cs7g09600.1, Cs8g11445.1, Cs8g11845.1, Cs8g15200.1, orange1.1t02481.1, orange1.1t05875.1, orange1.1t03487.1, orange1.1t06018.1, orange1.1t05916.1, orange1.1t05924.1) |
| 4 UIs | 2/0.24% (Cs2g17870.1, Cs5g25765.1) | 8/1.40% (Cs2g08495.1, Cs3g10260.1, Cs3g12715.1, Cs5g26550.1, Cs5g28645.2, Cs7g15390.1, orange1.1t05956.1, orange1.1t03536.1) |
| 5 UIs | 2/0.24% (orange1.1t06039.1, Cs7g06380.1) | 4/0.70% (Cs2g11780.1, orange1.1t01460.1, orange1.1t03486.1, Cs1g16990.1) |
| 6 UIs | 0 | 2/0.35% (Cs1g15550.1, Cs1g17000.1) |
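The percentages in this table follow from the transcript counts. A short sketch recomputes them and confirms that the counts sum to the stated totals of 825 and 570; the variable and function names are illustrative only.

```python
# Transcript counts per number of UIs, copied from the table above.
counts_5ui = {1: 713, 2: 90, 3: 18, 4: 2, 5: 2, 6: 0}
counts_3ui = {1: 448, 2: 90, 3: 18, 4: 8, 5: 4, 6: 2}


def shares(counts: dict) -> tuple:
    """Return the total count and the percentage share of each class."""
    total = sum(counts.values())
    return total, {k: round(100 * v / total, 2) for k, v in counts.items()}


total_5, pct_5 = shares(counts_5ui)
total_3, pct_3 = shares(counts_3ui)
print(total_5, pct_5)
print(total_3, pct_3)
```

Recomputed shares can differ from the tabulated ones in the last digit because of rounding.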
Figure 1. Chromosome localization of gene transcripts containing 5′UTR introns (5UIs) (A) and 3′UTR introns (3UIs) (B) in Citrus sinensis. Chr: chromosome, cM: centiMorgan.
Statistics information of the distribution of UTR introns (UIs) and transcripts containing UI (UI-Ts) in the Citrus sinensis chromosomes. 5UI-T represents gene transcript containing 5UI; 3UI-T represents gene transcript containing 3UI. Chr: chromosome.
| Chr No. | 5UI Numbers/Percentage | 5UI Density | 3UI Numbers/Percentage | 3UI Density | 5UI-T Number/Percentage | 5UI-T Density | 3UI-T Number/Percentage | 3UI-T Density |
|---|---|---|---|---|---|---|---|---|
| chr1 | 81 (8.39%) | 2.81 × 10⁻⁶ | 103 (13.83%) | 3.58 × 10⁻⁶ | 70 (8.49%) | 2.43 × 10⁻⁶ | 72 (12.6%) | 2.5 × 10⁻⁶ |
| chr2 | 162 (16.79%) | 5.26 × 10⁻⁶ | 65 (8.73%) | 2.11 × 10⁻⁶ | 137 (16.61%) | 4.45 × 10⁻⁶ | 50 (8.77%) | 1.62 × 10⁻⁶ |
| chr3 | 87 (9.02%) | 3.03 × 10⁻⁶ | 83 (11.14%) | 2.89 × 10⁻⁶ | 73 (8.85%) | 2.54 × 10⁻⁶ | 61 (10.7%) | 2.13 × 10⁻⁶ |
| chr4 | 75 (7.77%) | 3.75 × 10⁻⁶ | 60 (8.05%) | 3.00 × 10⁻⁶ | 66 (8.00%) | 3.3 × 10⁻⁶ | 48 (8.42%) | 2.40 × 10⁻⁶ |
| chr5 | 100 (10.36%) | 2.76 × 10⁻⁶ | 85 (11.41%) | 2.35 × 10⁻⁶ | 84 (10.18%) | 2.32 × 10⁻⁶ | 71 (12.46%) | 1.96 × 10⁻⁶ |
| chr6 | 96 (9.95%) | 4.53 × 10⁻⁶ | 39 (5.24%) | 1.84 × 10⁻⁶ | 85 (10.30%) | 4.01 × 10⁻⁶ | 33 (5.79%) | 1.56 × 10⁻⁶ |
| chr7 | 97 (10.05%) | 3.01 × 10⁻⁶ | 78 (10.47%) | 2.42 × 10⁻⁶ | 85 (10.30%) | 2.64 × 10⁻⁶ | 62 (10.88%) | 1.93 × 10⁻⁶ |
| chr8 | 54 (5.59%) | 2.38 × 10⁻⁶ | 57 (7.65%) | 2.51 × 10⁻⁶ | 48 (5.82%) | 2.11 × 10⁻⁶ | 43 (7.54%) | 1.89 × 10⁻⁶ |
| chr9 | 48 (4.97%) | 2.59 × 10⁻⁶ | 27 (3.62%) | 1.46 × 10⁻⁶ | 43 (5.21%) | 2.32 × 10⁻⁶ | 23 (4.04%) | 1.24 × 10⁻⁶ |
| chrUn | 165 (17.01%) | - | 148 (19.86%) | - | 134 (16.24%) | - | 107 (18.77%) | - |
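The density columns are not defined in this record. A reasonable assumption is that each density is the count divided by the chromosome length in base pairs. The sketch below tests that assumption by back-calculating the chromosome length implied by the 5UI and 3UI columns and checking that the two estimates agree. The helper name `implied_length_mb` is hypothetical, and chrUn is omitted because no density is reported for it.

```python
# (5UI count, 5UI density, 3UI count, 3UI density) per chromosome,
# copied from the table above.
table = {
    "chr1": (81, 2.81e-6, 103, 3.58e-6),
    "chr2": (162, 5.26e-6, 65, 2.11e-6),
    "chr3": (87, 3.03e-6, 83, 2.89e-6),
    "chr4": (75, 3.75e-6, 60, 3.00e-6),
    "chr5": (100, 2.76e-6, 85, 2.35e-6),
    "chr6": (96, 4.53e-6, 39, 1.84e-6),
    "chr7": (97, 3.01e-6, 78, 2.42e-6),
    "chr8": (54, 2.38e-6, 57, 2.51e-6),
    "chr9": (48, 2.59e-6, 27, 1.46e-6),
}


def implied_length_mb(count: int, density: float) -> float:
    """Chromosome length in Mb implied by density = count / length (assumed)."""
    return count / density / 1e6


implied = {
    chrom: (implied_length_mb(n5, d5), implied_length_mb(n3, d3))
    for chrom, (n5, d5, n3, d3) in table.items()
}
for chrom, (from_5ui, from_3ui) in implied.items():
    print(f"{chrom}: {from_5ui:.1f} Mb (5UI), {from_3ui:.1f} Mb (3UI)")
```

If the two estimates for a chromosome agree closely, the assumed definition is consistent with the tabulated values; it does not prove that the authors computed the densities this way.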
Figure 2. Length distributions of 5′UTR, 3′UTR, and CDS introns (A), and position distributions of 5′UTR introns (B) and 3′UTR introns (C) relative to the beginning and end of the associated UTRs. In Figure 2A, the horizontal axis represents intron size and the vertical axis represents the proportion of introns of each size. In Figure 2B,C: (a) Blue bars represent the observed positions of 5′UTR introns relative to the beginning of the 5′UTR and of 3′UTR introns relative to the beginning of the 3′UTR (the stop codon-proximal end). Light blue bars represent the observed positions of 5UIs relative to the end of the 5′UTR (i.e., the start codon ATG) and of 3UIs relative to the end of the 3′UTR. (b) Sierra of UIs relative to the end of the UTR and the end of the UTR with color diversity.
Figure 3. Sequence logos showing the nucleotide bias around the donor site (A) and acceptor site (B) of 5′UTR, CDS, and 3′UTR introns. The x-axis gives the nucleotide position relative to the beginning of the intron, and the letter height reflects the nucleotide bias at each position. Only 5 exonic and 25 intronic nucleotides are shown for the donor site, and only 2 exonic and 25 intronic nucleotides for the acceptor site, because nucleotide usage outside these regions does not differ significantly from the background level.
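In a conventional sequence logo, the total stack height at a position is its Shannon information content. The sketch below computes that quantity for a set of aligned sequences. It is a generic illustration under stated assumptions: the toy donor-site sequences are hypothetical, no small-sample correction is applied, and this record does not say which logo tool or exact measure the authors used.

```python
import math
from collections import Counter
from typing import List


def information_content(seqs: List[str], alphabet: str = "ACGT") -> List[float]:
    """Per-position information content in bits: log2(4) minus Shannon entropy."""
    max_bits = math.log2(len(alphabet))
    bits = []
    for i in range(len(seqs[0])):
        counts = Counter(s[i] for s in seqs)
        n = sum(counts[b] for b in alphabet)
        entropy = 0.0
        for b in alphabet:
            p = counts[b] / n
            if p > 0:
                entropy -= p * math.log2(p)
        bits.append(max_bits - entropy)
    return bits


# Hypothetical aligned intron starts (donor side), for illustration only.
toy_donor_sites = ["GTAAG", "GTAAG", "GTGAG", "GTATG"]
ic = information_content(toy_donor_sites)
print([round(x, 3) for x in ic])
```

Positions where every sequence carries the same base reach the 2-bit maximum, while positions with mixed bases score lower.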
5′UTR intron (5UI) and 3′UTR intron (3UI) numbers and lengths in the UTRs of UI-containing pentatricopeptide repeat-containing protein (PPRP) and disease resistance (R) genes.
| Gene Family | Gene ID | 3UI Number and Length (bp) | 5UI Number and Length (bp) |
|---|---|---|---|
| PPRPs | Cs4g02090.2 | 1 (150) | - |
| | Cs4g03660.1 | 1 (119) | - |
| | Cs4g07420.1 | 1 (1165) | - |
| | Cs4g13530.1 | 2 (366, 98) | - |
| | Cs4g13560.1 | 2 (710, 98) | - |
| | Cs4g20340.4 | 1 (334) | - |
| | Cs4g20340.1 | - | 1 (576) |
| | Cs4g20340.2 | - | 1 (143) |
| | Cs2g05520.1 | 1 (754) | - |
| | Cs2g07840.2 | 1 (486) | - |
| | Cs2g09470.2 | 1 (1044) | - |
| | Cs2g11780.1 | 5 (614, 1389, 178, 567, 78) | - |
| | Cs2g13460.1 | 1 (93) | - |
| | Cs2g19190.1 | 1 (93) | - |
| | Cs2g19710.1 | 2 (113, 442) | - |
| | Cs2g27580.1 | 1 (1233) | - |
| | Cs5g03910.1 | 1 (108) | 1 (147) |
| | Cs5g04860.1 | 1 (422) | - |
| | Cs5g08440.2 | 1 (670) | - |
| | Cs5g17240.1 | 1 (98) | - |
| | Cs5g26200.2 | 1 (771) | - |
| | Cs5g26550.1 | 4 (314, 152, 720, 78) | - |
| | Cs5g34090.1 | 1 (513) | 1 (114) |
| | Cs7g04230.1 | 1 (212) | - |
| | Cs7g04980.1 | 1 (234) | - |
| | Cs7g09600.1 | 3 (781, 330, 112) | - |
| | Cs7g10230.1 | 2 (278, 91) | 1 (490) |
| | Cs7g13700.2 | 1 (949) | - |
| | Cs7g15390.1 | 4 (661, 120, 811, 136) | - |
| | Cs3g02530.2 | 1 (707) | - |
| | Cs3g09780.2 | 1 (103) | - |
| | Cs3g10260.1 | 4 (161, 862, 194, 186) | - |
| | Cs3g11640.1 | 2 (1094, 97) | - |
| | Cs3g19210.1 | 2 (334, 140) | - |
| | Cs3g20090.1 | 2 (506, 107) | 1 (666) |
| | Cs3g20090.2 | - | 1 (462) |
| | Cs3g20480.1 | 2 (220, 166) | - |
| | Cs3g24370.1 | 2 (104, 668) | - |
| | Cs3g25390.1 | 3 (109, 158, 105) | - |
| | Cs6g01290.1 | 2 (280, 132) | 1 (93) |
| | Cs6g07760.1 | 2 (102, 211) | - |
| | Cs6g08820.2 | 1 (107) | - |
| | Cs6g11340.2 | 1 (270) | - |
| | Cs6g11530.1 | 1 (909) | - |
| | Cs6g11910.2 | 1 (388) | - |
| | Cs1g10030.4 | 1 (194) | - |
| | Cs1g10310.2 | 1 (1699) | - |
| | Cs1g12770.2 | 1 (997) | - |
| | Cs1g12780.1 | 2 (89, 306) | - |
| | Cs1g24360.1 | 1 (1265) | - |
| | Cs1g26320.1 | 1 (531) | - |
| | Cs8g15200.1 | 3 (158, 1006, 208) | - |
| | Cs8g18540.1 | 1 (134) | - |
| | Cs9g01900.1 | 1 (99) | - |
| | Cs9g03060.1 | 1 (468) | - |
| | Cs9g17260.1 | 1 (1131) | - |
| | orange1.1t00940.1 | 2 (725, 181) | - |
| | orange1.1t01460.1 | 5 (431, 636, 473, 134, 97) | 1 (268) |
| | orange1.1t01541.1 | 2 (554, 509) | - |
| | orange1.1t04277.2 | 1 (366) | - |
| | orange1.1t04409.1 | 2 (343, 301) | - |
| | Cs4g03945.1 | 3 (633, 89, 99) | - |
| | Cs4g11335.1 | 1 (295) | - |
| | Cs9g14456.1 | 1 (91) | - |
| R genes | Cs4g07730.1 | 2 (92, 142) | - |
| | Cs4g07730.2 | 2 (87, 138) | - |
| | Cs4g10830.1 | 1 (363) | 1 (93) |
| | Cs2g19600.2 | 1 (238) | - |
| | Cs2g30590.1 | 2 (138, 605) | - |
| | Cs5g20470.1 | 1 (89) | - |
| | Cs5g21990.1 | 1 (131) | - |
| | Cs5g22710.1 | 1 (140) | - |
| | Cs5g28770.1 | 2 (239, 685) | - |
| | Cs5g29510.1 | 1 (145) | - |
| | Cs7g02220.1 | 1 (82) | - |
| | Cs3g13340.1 | 2 (187, 175) | - |
| | Cs3g13390.1 | 2 (180, 130) | - |
| | Cs1g06720.1 | 1 (163) | - |
| | Cs1g08080.2 | 1 (99) | - |
| | Cs1g11430.1 | 2 (291, 1488) | - |
| | Cs1g12140.1 | 1 (482) | - |
| | Cs1g14030.1 | 1 (171) | - |
| | Cs1g14090.1 | 1 (327) | - |
| | Cs1g14120.1 | 1 (400) | - |
| | Cs1g15550.1 | 6 (408, 159, 163, 293, 120, 101) | - |
| | Cs1g16990.1 | 5 (174, 104, 82, 71, 95) | - |
| | Cs1g17000.1 | 6 (332, 96, 357, 268, 161, 144) | - |
| | Cs1g18380.2 | 1 (663) | - |
| | Cs9g18740.1 | 1 (178) | - |
| | orange1.1t01926.1 | 1 (163) | - |
| | orange1.1t02481.1 | 3 (137, 210, 362) | - |
| | orange1.1t02498.1 | 1 (290) | - |
| | orange1.1t02751.1 | 1 (401) | 1 (347) |
| | orange1.1t02917.1 | 1 (138) | - |
| | orange1.1t02924.1 | 1 (140) | - |
| | orange1.1t03486.1 | 5 (169, 195, 93, 377, 127) | - |
| | orange1.1t03487.3 | 1 (571) | - |
| | orange1.1t03742.2 | 1 (140) | - |
| | orange1.1t04592.1 | 1 (238) | - |
| | Cs2g30865.1 | 1 (303) | - |
| | Cs1g09404.1 | 3 (133, 117, 473) | - |
| | orange1.1t05891.1 | 1 (4655) | - |
| | Cs4g17710.1 | 1 (712) | - |
| | Cs4g08050.1 | 1 (282) | - |
| | Cs4g08050.2 | 1 (274) | - |
| | Cs4g08110.2 | 1 (277) | - |
| | Cs6g19070.1 | 1 (751) | - |
| | Cs1g14090.2 | 1 (321) | - |
| | Cs1g14090.3 | 1 (321) | - |
| | orange1.1t03332.1 | 1 (133) | - |
Figure 4. Electrophoresis detection of the UTR introns (UIs) in the untranslated regions (UTRs) of the eight selected genes. 1 to 4 represent the PCR products obtained using leaf genomic DNA (gDNA), root complementary DNA (cDNA), leaf cDNA, and stem cDNA as template, respectively. M: DL5000 marker. The UTR structure is available on the GSDS online website (http://gsds.cbi.pku.edu.cn/), with blue for exons and black for introns in the UTR. PPR: Pentatricopeptide repeat superfamily protein (Cs6g01290.1), VPS28: Vacuolar protein sorting-associated 28 (Cs2g06750.1), EIN3: Ethylene-insensitive 3 (Cs2g29100.1), DUF247: domain of unknown function 247 gene (Cs2g24990.1), GRAS: GRAS transcription factors (Cs8g18700.1), TPR: Tetratricopeptide repeat-like superfamily protein (Cs8g15200.1), R: Disease resistance protein (Cs5g21990.1) and LTP: Lipid transfer protein (Cs5g09070.2).
Figure 5. Relative expression of genes and their UTR introns (UIs) in Citrus sinensis root, leaf, and stem. * and ** indicate a significant difference (p < 0.05) and a very significant difference (p < 0.01), respectively, compared with root. 5UI: intron in the 5′UTR; 3UI: intron in the 3′UTR; CDS: coding sequence; each represents the relative expression level of the genes containing these structures. PPR: Pentatricopeptide repeat superfamily protein (Cs6g01290.1), VPS28: Vacuolar protein sorting-associated 28 (Cs2g06750.1), EIN3: Ethylene-insensitive 3 (Cs2g29100.1), DUF247: domain of unknown function 247 gene (Cs2g24990.1), GRAS: GRAS transcription factors (Cs8g18700.1), TPR: Tetratricopeptide repeat-like superfamily protein (Cs8g15200.1), R: Disease resistance protein (Cs5g21990.1) and LTP: Lipid transfer protein (Cs5g09070.2).
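The abstract reports correlations between the expression of genes and that of their UIs. A minimal sketch of how such a relationship could be quantified is given below, using a Pearson coefficient. The choice of Pearson correlation is an assumption, since the statistic used is not named in this record, and the expression values are hypothetical placeholders rather than measurements from the study.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length samples."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical relative expression in root, leaf, and stem (not real data).
gene_expression = [1.0, 2.5, 0.6]
ui_expression = [1.0, 0.4, 1.8]

r = pearson_r(gene_expression, ui_expression)
print(f"r = {r:.3f}")
```

A negative coefficient means that the UI signal tends to fall when the gene signal rises. With only three organs, replicate measurements would be needed before such a value could be tested for significance.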