| Literature DB >> 18366691 |
Hongyu Gao1, Madan K Bhattacharyya.
Abstract
BACKGROUND: A series of Rps (resistance to Pytophthora sojae) genes have been protecting soybean from the root and stem rot disease caused by the Oomycete pathogen, Phytophthora sojae. Five Rps genes were mapped to the Rps1 locus located near the 28 cM map position on molecular linkage group N of the composite genetic soybean map. Among these five genes, Rps1-k was introgressed from the cultivar, Kingwa. Rps1-k has been providing stable and broad-spectrum Phytophthora resistance in the major soybean-producing regions of the United States. Rps1-k has been mapped and isolated. More than one functional Rps1-k gene was identified from the Rps1-k locus. The clustering feature at the Rps1-k locus might have facilitated the expansion of Rps1-k gene numbers and the generation of new recognition specificities. The Rps1-k region was sequenced to understand the possible evolutionary steps that shaped the generation of Phytophthora resistance genes in soybean.Entities:
Mesh:
Year: 2008 PMID: 18366691 PMCID: PMC2330051 DOI: 10.1186/1471-2229-8-29
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1Physical mapping of the locations of individual EZ::TN <. Individual GS_43D16 clones containing the EZ::TN
Figure 2Verification of the restriction maps of GS_43D16. A, KpnI and NotI map of the assembled GS_43D16 sequence. B, KpnI and NotI double digestion of selected GS_43D16 clones carrying the EZ::TN
Restriction fragments produced from the KpnI-NotI double digestion of GS_43D16 clones carrying the EZ::TN
| 120 | 15,171 | ~10 kb, -4 | 0.1 kb, 10.1 kb |
| 29 | 20,255 | ~5.5 kb, ~4.5 kb | 5.2 kb, 4.9 kb |
| 205 | 38,493 | ~15 kb, - | 15.6 kb, 1.5 kb |
| 66 | 39,577 | ~14.6 kb, 2.6 kb with Fragment 3. | 14.6 kb, 2.6 kb |
| 65 | 42,203 | ~11.9 kb, 6 kb | 11.9 kb, 5.2 kb |
| 147 | 45,914 | ~10 kb with Fragment 3, ~8 kb | 8.9 kb, 8.2 kb |
| 99 | 49,522 | ~12.5 kb, 4.6 kb with Fragment 7. | 12.5 kb, 4.6 kb |
| 36 | 68,935 | ~14.5 kb, ~3 kb | 14.8 kb, 2.5 kb |
1The sequence flanking the transposon of the transposon inserted GS_43D16 clones were compared with the assembled GS_43D16 sequence.
2 Fragments originating from transposon insertion.
3The expected fragment sizes based on the restriction map of the assembled GS_43D16 sequence and location of transposon insertion.
4The small fragment was not detected.
Figure 3Molecular characterization of the . A) The overlapping three BAC clones containing two CC-NB-LRR genes (green box) of the Rps1-k region are depicted. Sequences of individual BACs were utilized to show their overlapping regions. The composite Rps1-k region, shown at the top of the figure, is based on the sequences of these three BACs. GS_99I16 does not carry the 5'-end of Rps1-k-1, which is shown with a truncated green box. B) Arrangement of predicted genes and retrotransposons in the Rps1-k region. The green colored boxes represent full-length genes (Rps1-k-1 and Rps1-k-2); the red colored boxes represent partial genes; the blue colored boxes represent retroelements; white boxes represent introns in the predicted genes. Boxes above the ruler represent genes that have coding sequence on the forward strand, whereas the boxes under the ruler indicate the genes that are on the reverse strand. Detailed annotation data are presented in Table 2.
Gene annotations of the Rps1-k region1
| 1 | 18019–21708 (+) | Glycine max | 0 | 14 | |
| 2 | 42452–46201 (+) | Glycine max | 0 | 14 | |
| 3 | 63302–62705 (-) | CBL-Interacting protein kinase 15 | Arabidopsis thaliana | 6e-69 | 4 |
| Serine/threonine Kinase (partial seudogene) | Persea Americana AAL23677 | 3e-68 | |||
| 4 | 69126–68921 (-) | Ribosomal protein S6 | Glycine max | 4e-7 | 17 |
| 5 | 76950–78815 (+) | Conserved hypothetical protein | Medicago truncatula | 3e-40 | 4 |
| 6 | 79282–86280 (+) | Gag/pol polyprotein | Pisum sativum | 0 | 19 |
| 7 | 90317–92266 (+) | Hypothetical 65 kDa avirulence protein in avrBs3 region | Xanthomonas campestris | 5e-5 | 7 |
| 8 | 92658–96559 (+) | Gag-pol polyprotein | Zea Mays | 2e-147 | 21 |
| 9 | 113419–113916 (+) | NADH dehydrogenase subunit 1 (only the N-terminal 70 aa) | Trichosurus vulpecula | 3.9 | 77 |
| 10 | 114088–115118 (+) | MAD2 (only the N-terminal 65 aa) | Triticum aestivum | 3e-17 | 5 |
| 11 | 117048–116804 (-) | Cytochrome c oxidase subunit II (the N-terminal 40 aa) | Cynomys ludovicianus | 5.1 | 2 |
| 12 | 117709–119789 (+) | Cysteine proteinase Vacuolar processing enzyme precursor (the N-terminal 118 aa) | Glycine max BAA06030 P49045 | 3e-49 | 13 |
| 13 | 123937–123409 (-) | Unknown protein (partial pseudogene) | Arabidopsis thaliana | 2e-26 | 3 |
| 14 | 127141–126821 (-) | L-lactate dehydrogenase (partial pseudogene) | 9e-27 | 7 | |
| 15 | 131753–138850 (-) | Glycoside hydrolase Integrase, catalytic region (partial pseudogene) | Medicago truncatula | 0 | 6 |
| 16 | 139054–139575 (+) | Unnamed protein product (C-terminal 173 aa) | Oryza sativa | 5e-60 | 3 |
| 17 | 140361–140014 (-) | Gag-pol polyprotein | Glycine max | 1e-34 | 3 |
| 18 | 145152–148184 (+) | Dynein | Oncorhynchus mykiss | 1e-10 | 4 |
| 19 | 145722–145277 (-) | Prion-like Q/N-rich domain protein PQN-33 | Gallus gallus | 6e-48 | 3 |
| 20 | 154355–155745 (+) | Oxidoreductase (pseudogene) | Arabidopsis thaliana | 2e-21 | 7 |
| 21 | 159487–160392 (+) | Gag/pol polyprotein | Pisum sativum | 5e-28 | 9 |
| 22 | 165713–166447 (+) | Glycoside hydrolase, family 1, Zinc finger, CCHC-type; Ribonuclease H fold | Medicago truncatula | 6e-29 | 1 |
| 23 | 167488–176781 (-) | SIRE1–8 retroelement | Glycine max | 0 | 5 |
1Genes not having matched soybean ESTs were not included.
2 indicates the coding sequence is on the forward sequence, while – indicates the coding sequence is on the reverse sequence.
3Soybean expressed sequence tags showing similarities to the target sequence at a level of significance, E value ≤ e-50
Simple repeat sequences in the Rps1-k region
| 7619–7663 | AT | 22 |
| 9814–9851 | AT | 19 |
| 24196–24231 | AT | 18 |
| 34682–34732 | AT | 25 |
| 38898–38960 | AAT | 21 |
| 41328–41354 | AAT | 9 |
| 51716–51901 | AT | 93 |
| 53915–53944 | AT | 15 |
| 59145–59168 | TC | 12 |
| 64934–64989 | AT | 28 |
| 110292–110313 | AT | 22 |
| 112406–112477 | AT | 36 |
| 116097–116116 | AT | 10 |
| 116665–116714 | AT | 25 |
| 127258–127281 | AG | 12 |
| 181688–181759 | AT | 36 |
Tandem repeat sequences in the Rps1-k region
| 4872–4907 | TTAATAAATTTATT | 2.6 |
| 5279–5311 | TTTATT | 2.5 |
| 7219–7253 | TTTTATTATTTAAATAT | 2 |
| 7328–7366 | TTTTAAGTTAACATAAATT | 2 |
| 13986–14041 | CTTATATTTTTTTTAT | 3.5 |
| 14069–14121 | TTTAAATCTTTTATTTTTACC | 2.5 |
| 28228–28272 | TTTATTTATAAGATTATTTAAT | 2 |
| 34767–34826 | ATGCAAACATATATACATGC | 2.9 |
| 65181–65235 | TCATTACTAAAAAAAAATAG | 2.8 |
| 65966–66017 | GCCAGCATGCATGTATATC | 2.7 |
| 70677–70718 | TAAAAAGTTGAATAGATAC | 2.2 |
| 72634–72694 | CATTAAGTTCTTTTAATTCCTAGGTTAGTGG | 2 |
| 75090–75128 | CGTTCTTCAT | 3.8 |
| 87791–87926 | TGAATATATATAGCATGAAAATGCCTTGCAAAATA | 3.9 |
| 89787–89849 | AAATAGAAAAGGAAAGAAAATG | 2.9 |
| 90350–90511 | AAAAAGAAAAGAAAGGAAATTCCCAATCAAAGAGAAAGC | 3.8 |
| 90381–90538 | GAGAAAGCAAAAAGAAAAGAAAGGAAATTCCCAATCAAAGAGTGG | 3.5 |
| 91333–92076 | TACGCGGAGATACCTTACGGTTATCCGCACCCCCTTTGCCATTCAGACACAGTCGTGTCCGTTGGCAAGCAGAGACCAAGTTTGGTCATTCTGCACACATGA | 7.3 |
| 92743–92779 | GCTCGCCTGGGCGAGCTGA | 1.9 |
| 98273–98333 | CATTAAGTTCTTTCAATTCATAGGTTAGTGG | 2 |
| 113516–113540 | AAAAACCGTCTTA | 1.9 |
| 120828–120857 | TTTTTTTTTCC | 2.7 |
| 122442–122512 | ATCAAATAAAATGCTTGCAGATCA | 3 |
| 124367–124513 | AAAAAAAATTGAAGATTCTAAGACAGTTTTTAGGGAAAACCGTCTTAGAATGTCTTATTTTAAATAAAAAAAAATT | 2 |
| 133966–134004 | AATCAAAGAACAACTCAAGTG | 1.9 |
| 134057–134089 | TCAAGAA | 4.9 |
| 135918–136076 | GATCCACAAGGGATGTACCCTCCCTTATTCTCATTACAACAACCCAAGTAGATGTACCCTCCACT | 2.3 |
| 136235–136365 | AAGGGAGAAGAGAGACACAAAAAGAATTCAGGCGGTTAGTCCTTGTCGATTCTTTTTGGAA | 2.2 |
| 137034–137101 | TCTTCTCTTGAATCTTGAATTCAA | 2.9 |
| 144892–144919 | AGAAAAGGAAAAA | 2 |
| 145379–147112 | GGACTACACGTCCTCGCCTTCAGA | 72 |
| 147479–147972 | GGGATCGCGCCCACAAGACACCCAGTGGACCCGAAGGAGTCCAACAGGGCCCTGGGGTTTCCAGCTCTGGTTACGGGCCTCTGTCAGTCCTACAGGGTGCCCGTCCCCCCCAGCAAGGTCACCCCATCGTAACATAGGTAACTATGCACATCTCTCAACTGATTTCTGATGCCATCCAATATTTGCA | 2.6 |
| 148467–148657 | AAAAATACCTCACAAAATATATATATATTATGTTTAGGTAGCAAGATACCTTGGATACACATGTATATAGC | 2.7 |
| 149273–149361 | AAAGAAAGTTCCCGATCAAAGATCGAAAGAAAACAAAGAGAAAA | 2 |
| 150401–150651 | GTATGGTTATCAGCACCTGTCGTCAACCAGGGGCAAACGAGCCCGTTGACGCGCAGAGACTAACGTCATCTTCTGCACCTTTTGTCAACCAGAGACAGCGAGTCCAATGACATGTGGAGATACCCAAGCGATTATCC | 1.8 |
| 150612–151127 | GCACCTTTTGTCATCCAGAGACAGCGAGTCCGATGACATGCGAGGGTACCGTATGGTTATCC | 8.3 |
| 150799–150931 | CACCTTTCGTCAACCAGGGGCAAACGAGCCCATTGACGCGCAGAGACTAACGTCGTCTTCTG | 2.1 |
| 150550–151365 | GCACCTTTCGTCAACCAGGGGCAAGCGAGCCCGTTGACGCGCAGAGACTAACGTCGTCTTCTGCACCTTTTGTCAACCAGAGATAGCGAGTCCGATGACATGCGAGGGTAACGTATGGTTATCCGCACCTTTTTTCATCCAGAGACAGCGAGTCCGATGACATGCGGGGGTACCGTATGGTTATCCGCACCTTTTGTCATCCACAGACGGCAAGTCCGATGACACGCGGAGGTACCGTATGGTTATCCACACCTTTCGTCAACCAGGGGCAAACGAGCCCATTGACGCACAGAGACTAACGTCGTCTTCC | 2.6 |
| 150536–151060 | CCGTATGGTTATCACACCTTTCGTCAACCAGGGGCAAACGAGCCCATTGACGCGCAGAGACTAACGTCGTCTTCTGCACCTTTCGTCAACCAGAGAGAGCGAGCCCAATGAATGCGAGGCTAACGATCGTTATCCGCACCTTTTATCATCCAGAGACGGCTAGTCCGATGACATGCGGGGGTACCGTATGGTTATCCGCACCTTTTGTCATCCACAGACAGCAAGTCCGATAACACGCAGGGGTA | 2.1 |
| 150901–151386 | CGCAGAGACTAACGTCGTCTTCCGCACCTTTTGTCATCCAGAGATAGCGAGTCCGATGACATGCGGAGGTACCGTATGGTTATCCGCACCTTTTGTCAACCAGAGGCAAGCGAGTCCGTTGACA | 3.9 |
| 151985–152116 | AATCCGTAAAGTTTCGCAACATTCTGGAAGTCAAAACAAGTATTGCTGCAC | 2.6 |
| 152550–152596 | TTCTTCATCG | 4.6 |
| 152558–152597 | CGTTCTTCATCGTTCTTCGTT | 1.9 |
| 153216–153398 | CCAAGAGATCGTTAATGGTCCAACGCCTTAACGTTTCTCTCCTTTCAAAA | 3.6 |
| 153555–153596 | AAAAAAGACAAAAAACAT | 2.3 |
| 156483–156603 | ATCAAACATCACTTGAGATCGTTTCAAGGTCCAACGCCTTAACCATTCTCTCCGCTTTTC | 2 |
| 161707–161856 | ACATCTGAGAAGAAAACTCATTCGACCAGGAGCTCATGGAAAATTCCCAAAGACAATTGTGATAGTAGGGT | 2.1 |
| 162626–162804 | TTTTAGAGGACTCAAAGTCCTCACCTTTATC | 5.8 |
| 163722–163780 | ATCAAAGAACAACTCAAGTGA | 2.9 |
| 163762–163815 | GAATCAAGAACAAGTCAAGACTCAA | 2.1 |
| 163765–163818 | TCAAGAATCAAGAAGAAT | 2.9 |
| 163826–163911 | AATCAAG | 12.3 |
| 164281–164330 | TTCAAAAAGGTTTTAACTTT | 2.5 |
| 164461–164485 | TTGAATCTCT | 2.5 |
| 166799–166839 | AGTATTTTCAAAAAT | 2.9 |
| 168846–168891 | TCATAAATCATGCATAATATCCT | 2 |
| 172682–172713 | TTTTCTGCA | 3.4 |
| 178161–178219 | ATCAAAGAACAACTCAAGTGA | 2.9 |
| 178201–178254 | GAATCAAGAACAAGTCAAGACTCAA | 2.1 |
| 178204–178257 | TCAAGAATCAAGAAGAAT | 2.9 |
| 178265–178343 | AATCAAG | 11.3 |
| 178713–178762 | TTCAAAAAGGTTTTAACTTT | 2.5 |
| 180693–180809 | AAAGGCACGCTAAGCCCAATTCCAACCGAGAGGAAGTGCACTGAGCGGCCC | 2.3 |
| 183350–183378 | AATTTATGGAGCCA | 2.1 |
Figure 4Generation of the . Locations of Rps1-k-1 and Rps1-k-2 on the GS_43D16 sequence are shown. Partial BamHI digested GS_43D16 DNA was cloned into the binary vector pTF101.1 and a library of binary clones was obtained. The library was screened for LRR sequences. Binary clone p43-10 contained the Rps1-k-3 gene, which is not present in GS_43D16 (Figure 3). This gene was presumably originated from intramolecular recombination in E. coli. Three BamHI (B) sites involved in generation of the Rps1-k-3 [1] are shown on the map. Solid line shows the region cloned in p43-10 and broken line indicates the region not found in p43-10. Presumably this region was lost during the recombination process in E.coli. The possible recombination process involved in the evolution of Rps1-k-3 is shown at lower part of the figure. The two identical 174 bp sequences of Rps1-k-1 (red line) and Rps1-k-2 (black line) involved in the recombination process are shown within the blue open circle (21,980 through 22,154 of Rps1-k-1 and 46,473 through 46,647 of Rps1-k-2 in the Rps1-k region shown in Figure 3B). The proposed model for the recombination event in E. coli is based on the article by Weisberg and Adhya [65].