| Literature DB >> 29258426 |
Rong-Cai Yang1,2, Fred Y Peng3, Zhiqiu Hu4.
Abstract
BACKGROUND: A large number of disease resistance genes or QTLs in crop plants are identified through conventional genetics and genomic tools, but their functional or molecular characterization remains costly, labor-intensive and inaccurate largely due to the lack of deep sequencing of large and complex genomes of many important crops such as allohexaploid wheat (Triticum aestivum L.). On the other hand, gene annotation and relevant genomic resources for disease resistance and other defense-related traits are more abundant in model plant Arabidopsis (Arabidopsis thaliana). The objectives of this study are (i) to infer homology of defense-related genes in Arabidopsis and wheat and (ii) to classify these homologous genes into different gene families.Entities:
Keywords: Arabidopsis; Arabidopsis thaliana; Bread wheat; Comparative genomics; DNA markers; Gene families; Genome analysis; Rust resistance genes; Single nucleotide polymorphism (SNP); Triticum aestivum
Mesh:
Substances:
Year: 2017 PMID: 29258426 PMCID: PMC5738178 DOI: 10.1186/s12864-017-4381-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Flow charts for three approaches to prediction of candidate genes for defense-related traits in Arabidopsis. BLASTN searches a nucleotide BLAST database using a nucleotide query. BLASTX searches a protein database using a translated nucleotide query. GWAS performs genome-wide scan of associations between SNPs and defense-related traits
Fig. 2Venn diagram for numbers of defense protein-coding genes and overlaps between the three approaches. Totals of 2097 (Approach 1), 5253 (Approach 2) and 4460 (Approach 3) predicted candidate genes are overlapped with each other
Fig. 3Distribution of candidate genes in 11 gene families for plant defense on five chromosomes of the Arabidopsis genome as revealed by the three approaches. The circles were arranged according to the three approaches: inner, middle and outer circles for Approaches 1, 2 and 3, respectively. The red, blue and grey lines on each of the three circles represent individual candidate genes in ABC, NLR and other nine (START, Kinase, Transcription factor, Transporter, Cupin, Peroxidase, Protease P450 and Tetratricopeptide repeat) gene families, respectively. The centromeric region is marked by black bumps on each chromosome
The number of predicted candidate genes within 11 gene families for defense-related traits in Arabidopsis
| Protein family | Approach 1 | Approach 2 | Approach 3 | Genome-wide total |
|---|---|---|---|---|
| ABC | 132 | 112 | 91 | 233 |
| NLR | 225 | 177 | 171 | 333 |
| START | 9 | 13 | 9 | 36 |
| Kinase | 406 | 372 | 867 | 1307 |
| TFa | 205 | 123 | 72 | 683 |
| Transporter | 53 | 38 | 65 | 138 |
| Cupin fold | 123 | 68 | 88 | 251 |
| Peroxidase | 73 | 21 | 76 | 84 |
| Protease | 117 | 72 | 72 | 272 |
| P450 | 109 | 58 | 210 | 250 |
| TPRb | 338 | 254 | 507 | 931 |
| Others | 456 | 4066 | 2242 | 18,969 |
|
|
|
|
|
|
aTF, Transcription factor superfamily including bHLH, bZIP, Homeobox, MYB, WRKY
b TPR, Tetratricopeptide repeat superfamily
The number of predicted genes in 11 gene families for 21 defense-related traits in Arabidopsis
| Phenotypea | ABC | Cupin | Kinase | NLR | P450 | Peroxidase | Protease | START | TPRb | TFc | Transporter | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AvrPphB | 4 | 9 | 2 | 1 | 9 | 1 | 1 | 2 | 5 | 6 | 0 | 40 |
| AvrRpm1 | 1 | 8 | 1 | 1 | 12 | 0 | 0 | 4 | 14 | 5 | 2 | 48 |
| AvrB | 1 | 8 | 1 | 1 | 12 | 0 | 0 | 4 | 14 | 6 | 2 | 49 |
| AvrRpt2 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 3 |
| DC3000 | 14 | 16 | 3 | 5 | 32 | 1 | 0 | 8 | 28 | 24 | 7 | 138 |
| LP23.1a | 13 | 22 | 0 | 5 | 18 | 1 | 0 | 2 | 19 | 11 | 4 | 95 |
| RMX23.1a | 40 | 44 | 4 | 28 | 92 | 1 | 1 | 19 | 89 | 34 | 20 | 372 |
| RMX3.1b | 8 | 4 | 0 | 0 | 6 | 3 | 1 | 2 | 5 | 5 | 1 | 35 |
| PNA3.3a | 4 | 5 | 0 | 3 | 9 | 1 | 0 | 2 | 8 | 3 | 2 | 37 |
| ME3.1b | 65 | 79 | 8 | 39 | 126 | 14 | 8 | 36 | 162 | 54 | 36 | 627 |
| LP23.1a CFU2 | 26 | 37 | 0 | 21 | 76 | 12 | 7 | 20 | 99 | 38 | 20 | 356 |
| RMX23.1a CFU2 | 65 | 81 | 8 | 50 | 148 | 13 | 9 | 37 | 161 | 68 | 38 | 678 |
| RMX3.1b CFU2 | 20 | 37 | 3 | 14 | 39 | 5 | 2 | 11 | 54 | 30 | 15 | 230 |
| PNA3.3a CFU2 | 44 | 48 | 6 | 16 | 72 | 5 | 3 | 22 | 83 | 33 | 23 | 355 |
| ME3.1b CFU2 | 11 | 15 | 1 | 8 | 21 | 1 | 1 | 3 | 16 | 5 | 1 | 83 |
| Emco5 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 1 | 7 |
| Emwa1 | 3 | 4 | 1 | 1 | 3 | 0 | 1 | 3 | 6 | 5 | 3 | 30 |
| Emoy2 | 2 | 1 | 0 | 0 | 3 | 0 | 0 | 2 | 2 | 2 | 2 | 14 |
| Hiks1 | 3 | 4 | 1 | 2 | 6 | 0 | 0 | 3 | 8 | 4 | 4 | 35 |
| Noco2 | 1 | 2 | 0 | 1 | 5 | 0 | 0 | 2 | 5 | 2 | 0 | 18 |
| Aphid number | 10 | 6 | 0 | 5 | 4 | 0 | 0 | 1 | 8 | 4 | 1 | 39 |
a See Additional file 2 for detailed description of 21 defense-related phenotypes
bTF, Transcription factor superfamily including bHLH, bZIP, Homeobox, MYB, WRKY
c TPR, Tetratricopeptide repeat superfamily
The total number of SNPs in coding and non-coding regions of predicted genes in 11 gene families for 21 defense-related traits in Arabidopsis
| Phenotypea | ABC | Cupin | Kinase | NLR | P450 | Peroxidase | Protease | START | TPRb | TFc | Transporter | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AvrPphB | 1948 | 868 | 4004 | 3253 | 1001 | 70 | 614 | 285 | 3000 | 1423 | 1169 | 17,635 |
| AvrRpm1 | 3635 | 1223 | 6539 | 5513 | 1493 | 331 | 313 | 207 | 4638 | 1687 | 1549 | 27,128 |
| AvrB | 1839 | 357 | 4331 | 3170 | 1048 | 70 | 551 | 285 | 2722 | 1351 | 886 | 16,610 |
| AvrRpt2 | 2153 | 304 | 3915 | 2021 | 911 | 191 | 211 | 267 | 2190 | 559 | 698 | 13,420 |
| DC3000 | 5325 | 2430 | 9775 | 7387 | 1532 | 601 | 643 | 400 | 7559 | 3113 | 2751 | 41,516 |
| LP23.1a | 4860 | 1908 | 8221 | 5641 | 1598 | 260 | 444 | 196 | 8295 | 3200 | 2155 | 36,778 |
| RMX23.1a | 5538 | 2918 | 11,130 | 7288 | 2586 | 882 | 711 | 309 | 9425 | 3857 | 1908 | 46,552 |
| RMX3.1b | 7567 | 3850 | 17,546 | 11,375 | 3990 | 979 | 1159 | 696 | 13,631 | 5451 | 4076 | 70,320 |
| PNA3.3a | 4989 | 1546 | 6907 | 5570 | 920 | 308 | 278 | 373 | 5271 | 2309 | 1276 | 29,747 |
| ME3.1b | 4653 | 1634 | 8436 | 5870 | 1926 | 321 | 673 | 76 | 4884 | 2038 | 1899 | 32,410 |
| LP23.1a CFU2 | 6753 | 3214 | 13,341 | 8767 | 3201 | 880 | 786 | 487 | 10,726 | 4414 | 3203 | 55,772 |
| RMX23.1a CFU2 | 6844 | 2944 | 13,110 | 8242 | 1925 | 440 | 864 | 570 | 9789 | 3679 | 2514 | 50,921 |
| RMX3.1b CFU2 | 7605 | 4194 | 18,400 | 11,286 | 3755 | 1019 | 1159 | 534 | 14,364 | 5946 | 4139 | 72,401 |
| PNA3.3a CFU2 | 5133 | 2083 | 8710 | 6282 | 2163 | 533 | 405 | 195 | 5865 | 2762 | 1673 | 35,804 |
| ME3.1b CFU2 | 5460 | 2381 | 10,892 | 7700 | 2228 | 489 | 706 | 423 | 8447 | 3650 | 2623 | 44,999 |
| Emco5 | 1872 | 859 | 3848 | 2084 | 571 | 133 | 202 | 345 | 2721 | 945 | 494 | 14,074 |
| Emwa1 | 1117 | 586 | 2910 | 2355 | 452 | 0 | 313 | 138 | 1515 | 715 | 978 | 11,079 |
| Emoy2 | 2267 | 586 | 3615 | 4376 | 1085 | 114 | 464 | 234 | 2435 | 1360 | 1049 | 17,585 |
| Hiks1 | 2394 | 717 | 5451 | 4517 | 744 | 339 | 425 | 344 | 3614 | 1053 | 1193 | 20,791 |
| Noco2 | 1610 | 386 | 2869 | 2948 | 798 | 82 | 202 | 57 | 1634 | 915 | 444 | 11,945 |
| Aphid number | 3471 | 1809 | 5839 | 4195 | 551 | 120 | 667 | 301 | 5456 | 1933 | 1688 | 26,030 |
aSee Additional file 2 for detailed description of 21 defense-related phenotypes
bTF, Transcription factor superfamily including bHLH, bZIP, Homeobox, MYB, WRKY
c TPR, Tetratricopeptide repeat superfamily
The number of homologous SNPs in coding and non-coding regions of rust resistance genes for 11 gene families
| Rust resistancea | ABC | Cupin | Kinase | NLR | P450 | Peroxidase | Protease | START | TPRb | TFc | Transporter | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 0 | 0 | 0 | 4237 | 0 | 0 | 0 | 0 | 0 | 33,270 | 0 | 37,507 |
|
| 0 | 0 | 0 | 2275 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2275 |
|
| 0 | 0 | 490,734 | 193,899 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 684,633 |
|
| 32,659 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 32,659 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5530 | 0 | 0 | 0 | 5530 |
|
| 0 | 2482 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2417 | 0 | 4899 |
|
| 606 | 0 | 0 | 90,923 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 91,529 |
|
| 18,963 | 0 | 0 | 27,110 | 0 | 0 | 0 | 8943 | 0 | 15,209 | 0 | 70,225 |
|
| 70,003 | 0 | 0 | 29,571 | 108,568 | 0 | 0 | 0 | 0 | 0 | 0 | 208,142 |
|
| 2169 | 0 | 0 | 3760 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5929 |
|
| 0 | 74,738 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 74,738 |
|
| 0 | 0 | 1077 | 0 | 114,579 | 0 | 0 | 0 | 459,695 | 0 | 0 | 575,351 |
|
| 4036 | 155,936 | 0 | 0 | 0 | 0 | 118,953 | 0 | 0 | 0 | 0 | 278,925 |
|
| 0 | 0 | 3836 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3836 |
|
| 0 | 0 | 0 | 0 | 0 | 109,350 | 0 | 0 | 0 | 192,982 | 0 | 302,332 |
|
| 0 | 0 | 63,540 | 0 | 0 | 0 | 0 | 0 | 1342 | 0 | 0 | 64,882 |
|
| 93,131 | 0 | 0 | 12,242 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 105,373 |
|
| 1647 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5225 | 0 | 6872 |
|
| 0 | 0 | 1923 | 0 | 0 | 0 | 0 | 0 | 0 | 4278 | 0 | 6201 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21,030 | 0 | 21,030 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 110,296 | 110,296 |
|
| 0 | 0 | 4957 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4957 |
|
| 0 | 0 | 4250 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4250 |
| GSId | 0.698 | 0.450 | 0.247 | 0.640 | 0.500 | 0.000 | 0.000 | 0.472 | 0.006 | 0.481 | 0.000 |
aSome genes are known to confer resistance to multiple rusts and other pathogens and they include Lr34 (=Lr34/Yr18/Bdv1/Pm38/Ltn1), Lr37(=Lr37/Sr38/Yr17), Lr46(=Lr46/Yr29/Pm39/Ltn), Sr2(=Sr2/Lr27/PBC/Pm) and Sr39(=Sr39/Lr35)
bTF, Transcription factor superfamily including bHLH, bZIP, Homeobox, MYB, WRKY
c TPR, Tetratricopeptide repeat superfamily
dGSI = Gini-Simpson diversity index