Sook Jung, Dorrie Main, Margaret Staton, Ilhyung Cho, Tatyana Zhebentyayeva, Pere Arús, Albert Abbott.
Abstract
BACKGROUND: Because large genomic sequences are not available for peach or other Prunus species, the degree of synteny conservation between Prunus species and Arabidopsis has not been systematically assessed. Using the recently available peach EST sequences that are anchored to Prunus genetic maps and to the peach physical map, we analyzed the extent of conserved synteny between the Prunus and Arabidopsis genomes. The reconstructed pseudo-ancestral Arabidopsis genome, which existed prior to the proposed recent polyploidy event, was also used in our analysis to further elucidate the evolutionary relationship.
Year: 2006 PMID: 16615871 PMCID: PMC1479338 DOI: 10.1186/1471-2164-7-81
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1. A dendrogram depicting the phylogenetic relationship of peach, Arabidopsis and many other crop species. The probable position of the recent polyploidization event identified by Blanc and coworkers (22) is marked by an arrow. The figure is based on Figure 1 in reference 19 and Figure 5 in reference 22.
Number of conserved syntenic regions between Arabidopsis and Prunus genetic maps.
| TxE¹ (almond × peach) | 306 | 68 (12) |
| PxF² (peach × peach × P. ferganensis) | 188 | 9 (1) |
| JxF³ (peach) | 78 | 7 (1) |
| GxN⁴ (almond × peach) | 82 | 1 (0) |
| FxT⁵ (almond) | 171 | 45 (6) |
| FxB⁶ (almond) | 119 | 9 (0) |
| All Maps | 475 | 139 (20) |
¹ Dirlewanger et al. 2004 (9); ² Dettori et al. 2001 (33); ³ Dirlewanger et al. 1999 (34); ⁴ Jáuregui et al. 2001 (35); ⁵ Joobeur et al. 2004 (36); ⁶ Ballester et al. 2001 (37)
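The "All Maps" row can be cross-checked against the per-map rows. The sketch below is only an illustrative tally, not part of the original analysis. It assumes the last column gives the number of syntenic regions per map and that the figure in parentheses is the subset of larger groups; the variable names are hypothetical.

```python
# Per-map values transcribed from the last column of the table above:
# (syntenic regions, value in parentheses). Interpretation is an assumption.
per_map = {
    "TxE": (68, 12),
    "PxF": (9, 1),
    "JxF": (7, 1),
    "GxN": (1, 0),
    "FxT": (45, 6),
    "FxB": (9, 0),
}

# Sum each component across the six maps.
total_regions = sum(regions for regions, _ in per_map.values())
total_large = sum(large for _, large in per_map.values())

print(total_regions, total_large)
```

Under this reading, the per-map syntenic region counts add up to the "All Maps" entry of 139 (20). The middle column does not behave the same way: its per-map values sum to more than the 475 reported for all maps.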
Figure 2. Number of syntenic groups in each TxE linkage group that match each Arabidopsis chromosome.
Figure 3. Conserved syntenic regions with three or more gene pairs between the Arabidopsis genome and the Prunus genome. Blocks in bold are those with conserved gene order.
Conserved syntenic regions with three or more gene pairs between the Arabidopsis genome and Prunus genetic maps.
| Syntenic Group | Gene Pairs | Arabidopsis Gene | Description | EST Name | Linkage Group |
| gp15 | 3 | AT1G02460 | glycoside hydrolase family 28 protein | PP_LEa0030E14f | FxT-G3F |
| | | AT1G02130 | Ras-related protein (ARA-5) | PP_LEa0010O05f | |
| | | AT1G03000 | AAA-type ATPase family protein | PP_LEa0001O24f | |
| gp21 | 3 | AT1G53750 | 26S proteasome AAA-ATPase subunit (RPT1a) | PP_LEa0010K05f | PxF-G6 |
| | | AT1G54080 | oligouridylate-binding protein | PP_LEa0012K19f | |
| | | AT1G54110 | cation exchanger, putative (CAX10) Ca2+ | PP_LEa0007O07f | |
| gp33 | 3 | AT1G66540 | cytochrome P450 | PP_LEa0013L12f | TxE-G5 |
| | | AT1G66250 | glycosyl hydrolase family 17 protein | PP_LEa0012I12f | |
| | | AT1G66680 | S locus-linked protein | PP_LEa0003H24f | |
| gp42 | 3 | AT2G35330 | zinc finger (C3HC4-type RING finger) protein | PP_LEa0017P13f | JxF-G7 |
| | | AT2G35930 | U-box domain-containing protein | PP_LEa0004C12f | |
| | | AT2G36530 | enolase | PP_LEa0003M24f | |
| gp54 | 3 | AT2G36530 | enolase | PP_LEa0003M24f | TxE-G7 |
| | | AT2G35930 | U-box domain-containing protein | PP_LEa0004C12f | |
| | | AT2G35330 | zinc finger (C3HC4-type RING finger) protein-related | PP_LEa0017P13f | |
| gp74 | 3 | AT3G60340 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | TxE-G5 |
| | | AT3G60510 | enoyl-CoA hydratase/isomerase family protein | PP_LEa0009I06f | |
| | | AT3G60030 | squamosa promoter-binding protein-like 12 (SPL12) | PP_LEa0002J03f | |
| gp75 | 3 | AT3G07160 | glycosyl transferase family 48 protein | PP_LEa0004K19f | TxE-G5 |
| | | AT3G06650 | ATP-citrate synthase, ATP-citrate (pro-S-)-lyase | PP_LEa0005D13f | |
| | | AT3G06880 | transducin family protein | PP_LEa0009A14f | |
| gp76 | 3 | AT3G02770 | dimethylmenaquinone methyltransferase | PP_LEa0030G03f | TxE-G5 |
| | | AT3G01930 | nodulin family protein similar to nodulin-like protein | PP_LEa0012O21f | |
| | | AT3G02420 | expressed protein | PP_LEa0037N22f | |
| gp80 | 3 | AT3G08560 | vacuolar ATP synthase subunit E | PP_LEa0009M17f | TxE-G6 |
| | | AT3G08710 | thioredoxin family protein | PP_LEa0016G12f | |
| | | AT3G08770 | lipid transfer protein 6 (LTP6) | PP_LEa0029C22f | |
| gp85 | 3 | AT4G17720 | RNA recognition motif (RRM)-containing protein | PP_LEa0027L14f | FxT-G2F |
| | | AT4G16900 | disease resistance protein (TIR-NBS-LRR class) | PP_LEa0003A21f | |
| | | AT4G17483 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | |
| gp98 | 3 | AT4G17483 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | TxE-G5 |
| | | AT4G17486 | expressed protein | PP_LEa0005J05f | |
| | | AT4G17615 | calcineurin B-like protein 1 (CBL1) | PP_LEa0009N08f | |
| gp101 | 3 | AT4G32450 | pentatricopeptide (PPR) repeat-containing protein | PP_LEa0009C16f | TxE-G5 |
| | | AT4G31970 | cytochrome P450 family protein | PP_LEa0013L12f | |
| | | AT4G31810 | enoyl-CoA hydratase/isomerase family protein | PP_LEa0009I06f | |
| gp106 | 3 | AT5G61790 | calnexin 1 (CNX1) | PP_LEa0006I23f | FxT-G1F |
| | | AT5G62310 | incomplete root hair elongation (IRE)/protein kinase | PP_LEa0009I05f | |
| | | AT5G62090 | expressed protein | PP_LEa0030I08f | |
| gp109 | 3 | AT5G47350 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | FxT-G2F |
| | | AT5G47710 | C2 domain-containing protein | PP_LEa0011F23f | |
| | | AT5G46870 | RNA recognition motif (RRM)-containing protein | PP_LEa0027L14f | |
| gp114 | 3 | AT5G03520 | Ras-related GTP-binding protein | PP_LEa0010O05f | FxT-G3F |
| | | AT5G03340 | cell division cycle protein 48, putative/CDC48 | PP_LEa0001O24f | |
| | | AT5G03650 | 1,4-alpha-glucan branching enzyme | PP_LEa0009P15f | |
| gp115 | 3 | AT5G07990 | flavonoid 3'-monooxygenase | PP_LEa0007M11f | FxT-G3F |
| | | AT5G07340 | calnexin | PP_LEa0006I23f | |
| | | AT5G08470 | peroxisome biogenesis protein (PEX1) | PP_LEa0001O24f | |
| gp126 | 3 | AT5G08390 | transducin family protein | PP_LEa0010I06f | TxE-G1 |
| | | AT5G07990 | flavonoid 3'-monooxygenase | PP_LEa0007M11f | |
| | | AT5G07340 | calnexin | PP_LEa0006I23f | |
| gp128 | 4 | AT5G47350 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | TxE-G2 |
| | | AT5G46870 | RNA recognition motif (RRM)-containing protein | PP_LEa0027L14f | |
| | | AT5G47810 | phosphofructokinase family protein | PP_LEa0001K06f | |
| | | AT5G47710 | C2 domain-containing protein | PP_LEa0011F23f | |
| gp132 | 3 | AT5G47100 | calcineurin B-like protein 9 (CBL9) | PP_LEa0009N08f | TxE-G5 |
| | | AT5G47350 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | |
| | | AT5G47310 | expressed protein | PP_LEa0005J05f | |
| gp133 | 3 | AT5G10840 | endomembrane protein 70, putative TM4 family | PP_LEa0015M20f | TxE-G5 |
| | | AT5G11110 | sucrose-phosphate synthase | PP_LEa0003F22f | |
| | | AT5G10430 | arabinogalactan-protein (AGP4) | PP_LEa0008B15f | |
Figure 4. Prunus genomic blocks that map to two distinct Arabidopsis regions. Shown are the Prunus blocks that identified Arabidopsis sister regions generated by the proposed polyploidy event. The Prunus blocks with the same color (red or green) are homologous regions that share more than two anchored ESTs.
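The criterion in this caption (blocks on the same Prunus region that match different Arabidopsis regions and share more than two anchored ESTs) can be illustrated with a small sketch. This is not the authors' procedure. The four groups are transcribed from the table of syntenic regions above, and the function name `sister_candidates`, the `min_shared` parameter, and the use of the Arabidopsis chromosome number as a stand-in for "distinct region" are assumptions made for illustration.

```python
from itertools import combinations

# group -> (Prunus linkage group, Arabidopsis chromosome, anchored ESTs),
# transcribed from the gp rows of the table above.
groups = {
    "gp85": ("FxT-G2F", 4, {"PP_LEa0027L14f", "PP_LEa0003A21f", "PP_LEa0012C18f"}),
    "gp109": ("FxT-G2F", 5, {"PP_LEa0012C18f", "PP_LEa0011F23f", "PP_LEa0027L14f"}),
    "gp98": ("TxE-G5", 4, {"PP_LEa0012C18f", "PP_LEa0005J05f", "PP_LEa0009N08f"}),
    "gp132": ("TxE-G5", 5, {"PP_LEa0009N08f", "PP_LEa0012C18f", "PP_LEa0005J05f"}),
}


def sister_candidates(groups, min_shared=3):
    """Return pairs of groups on the same linkage group that match different
    Arabidopsis chromosomes and share at least `min_shared` ESTs."""
    hits = []
    for (a, (lg_a, chr_a, ests_a)), (b, (lg_b, chr_b, ests_b)) in combinations(
        sorted(groups.items()), 2
    ):
        shared = ests_a & ests_b
        if lg_a == lg_b and chr_a != chr_b and len(shared) >= min_shared:
            hits.append((a, b, sorted(shared)))
    return hits


candidates = sister_candidates(groups)
for a, b, shared in candidates:
    print(a, b, shared)
```

With these four groups, only gp98 and gp132 (both on TxE-G5, matching Arabidopsis chromosomes 4 and 5) pass the filter. gp85 and gp109 share two ESTs, which does not meet the "more than two" cutoff.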
Conserved syntenic regions with three or more gene pairs between the pseudo-ancestral Arabidopsis genome and Prunus genetic maps.
| Syntenic Group | Gene Pairs | Arabidopsis Gene | Description | EST Name | Linkage Group |
| ga18 | 3 | AT5G47350 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | FxT-G2F |
| | | AT4G17720 | RNA recognition motif (RRM)-containing protein | PP_LEa0027L14f | |
| | | AT5G47710 | C2 domain-containing protein | PP_LEa0011F23f | |
| ga28 | 3 | AT5G07340 | calnexin, putative | PP_LEa0006I23f | FxT-G3F |
| | | AT5G07990 | flavonoid 3'-monooxygenase | PP_LEa0007M11f | |
| | | AT5G61580 | phosphofructokinase family protein | PP_LEa0001K06f | |
| ga29 | 3 | AT5G14650 | polygalacturonase, putative/pectinase, putative | PP_LEa0030E14f | FxT-G3F |
| | | AT3G01610 | AAA-type ATPase family protein | PP_LEa0001O24f | |
| | | AT5G14370 | expressed protein | PP_LEa0011N22f | |
| ga54 | 3 | AT5G59180 | DNA-directed RNA polymerase II | PP_LEa0026O17f | TxE-G1 |
| | | AT5G59840 | Ras-related GTP-binding family protein | PP_LEa0036D15f | |
| | | AT3G46540 | epsin N-terminal homology (ENTH) domain-containing | PP_LEa0003I01f | |
| ga60 | 4 | AT2G24640 | ubiquitin carboxyl-terminal hydrolase family protein | PP_LEa0006J17f | TxE-G1 |
| | | AT4G32400 | mitochondrial substrate carrier family protein | PP_LEa0009H16f | |
| | | AT2G25420 | transducin family protein | PP_LEa0009H21f | |
| | | AT2G25160 | cytochrome P450 | PP_LEa0013L12f | |
| ga66 | 3 | AT4G17720 | RNA recognition motif (RRM)-containing protein | PP_LEa0027L14f | TxE-G2 |
| | | AT5G47350 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | |
| | | AT5G47710 | C2 domain-containing protein | PP_LEa0011F23f | |
| ga77 | 3 | AT4G17486 | expressed protein | PP_LEa0005J05f | TxE-G5 |
| | | AT5G47350 | palmitoyl protein thioesterase family protein | PP_LEa0012C18f | |
| | | AT4G17615 | calcineurin B-like protein 1 (CBL1) | PP_LEa0009N08f | |
| ga79 | 3 | AT5G25170 | expressed protein | PP_LEa0005J05f | TxE-G5 |
| | | AT5G11110 | sucrose-phosphate synthase | PP_LEa0003F22f | |
| | | AT5G10840 | endomembrane protein 70, putative TM4 family | PP_LEa0015M20f | |
| ga81 | 4 | AT4G31940 | cytochrome P450 | PP_LEa0013L12f | TxE-G5 |
| | | AT2G25190 | expressed protein | PP_LEa0005J05f | |
| | | AT2G25160 | cytochrome P450 | PP_LEa0013L12f | |
| | | AT4G31810 | enoyl-CoA hydratase/isomerase family protein | PP_LEa0009I06f | |
| ga83 | 3 | AT1G66540 | cytochrome P450 | PP_LEa0013L12f | TxE-G5 |
| | | AT1G66250 | glycosyl hydrolase family 17 protein | PP_LEa0012I12f | |
| | | AT1G66680 | S locus-linked protein | PP_LEa0003H24f | |
| ga94 | 3 | AT5G58160 | formin homology 2 domain-containing protein | PP_LEa0035A24f | TxE-G6 |
| | | AT5G57990 | ubiquitin-specific protease 23 | PP_LEa0006J17f | |
| | | AT5G58590 | Ran-binding protein 1, putative/RanBP1, putative | PP_LEa0003G19f | |
| ga95 | 3 | AT5G01870 | lipid transfer protein, putative | PP_LEa0029C22f | TxE-G6 |
| | | AT3G08560 | vacuolar ATP synthase subunit E | PP_LEa0009M17f | |
| | | AT3G08710 | thioredoxin family protein | PP_LEa0016G12f | |
Figure 5. Proposed evolutionary steps involving some syntenic blocks between the Arabidopsis and Prunus genomes. Blocks in the putative ancestral Arabidopsis genome and in Arabidopsis chromosomes 2 and 4 that match the same block in the Prunus TxE map are illustrated. Red and green colors are used to help track the genes. Dashed lines indicate the weaker homology relationship when the same EST was homologous to more than one Arabidopsis gene.
Conserved syntenic regions with three or more gene pairs between the Arabidopsis genome and EST-anchored peach BAC contigs.
| Syntenic Group | Gene Pairs | Arabidopsis Gene | Description | EST Name | BAC Contig |
| pp23 | 3 | AT1G19570 | dehydroascorbate reductase | PP_LEa0036C16f | ctg2264 |
| | | AT1G20010 | tubulin beta-5 chain (TUB5) | PP_LEa0035B10f | |
| | | AT1G20450 | dehydrin (ERD10) | PP_LEa0035C17f | |
| pp48 | 3 | AT2G18470 | protein kinase family protein | PP_LEa0036C20f | ctg2264 |
| | | AT2G18840 | integral membrane Yip1 family protein | PP_LEa0034N14f | |
| | | AT2G18280 | tubby-like protein 2 (TULP2) | PP_LEa0034J18f | |
| pp52 | 4 | AT2G40280 | Putative methyltransferase | PP_LEa0017H06f | ctg58 |
| | | AT2G39750 | Putative methyltransferase | PP_LEa0017H06f | |
| | | AT2G39770 | GDP-mannose pyrophosphorylase (GMP1) | PP_LEa0005L09f | |
| | | AT2G40060 | expressed protein | PP_LEa0017F24f | |
| pp54 | 3 | AT2G19740 | 60S ribosomal protein L31 (RPL31A) | PP_LEa0008A18f | ctg9 |
| | | AT2G19680 | mitochondrial ATP synthase g subunit | PP_LEa0025C15f | |
| | | AT2G19730 | 60S ribosomal protein L28 (RPL28A) | PP_LEa0001M19f | |
| pp69 | 3 | AT3G02200 | proteasome family protein | PP_LEa0025D12f | ctg2264 |
| | | AT3G02310 | developmental protein SEPALLATA2 | PP_LEa0035H10f | |
| | | AT3G01520 | universal stress protein (USP) family | PP_LEa0025L13f | |
| pp94 | 3 | AT4G27880 | seven in absentia (SINA) family protein | PP_LEa0035M04f | ctg2264 |
| | | AT4G27560 | glycosyltransferase family protein | PP_LEa0036D18f | |
| | | AT4G27740 | Yippee putative zinc-binding protein | PP_LEa0035H22f | |
| pp96 | 3 | AT4G10710 | transcriptional regulator-related | PP_LEa0034P24f | ctg2264 |
| | | AT4G11450 | expressed protein | PP_LEa0035H16f | |
| | | AT4G11030 | long-chain-fatty-acid-CoA ligase | PP_LEa0034M07f | |
| pp113 | 3 | AT5G66460 | | PP_LEa0003M21f | ctg1505 |
| | | AT5G66140 | 20S proteasome alpha subunit D2 | PP_LEa0027M15f | |
| | | AT5G66510 | bacterial transferase | PP_LEa0009C17f | |
| pp114 | 4 | AT5G08400 | expressed protein | PP_LEa0011C13f | ctg1565 |
| | | AT5G08380 | alpha-galactosidase | PP_LEa0009B18f | |
| | | AT5G08540 | expressed protein | PP_LEa0027N06f | |
| | | AT5G08410 | ferredoxin-thioredoxin reductase | PP_LEa0009N05f | |
| pp119 | 3 | AT5G47040 | Lon protease homolog 1 | PP_LEa0001P13f | ctg190 |
| | | AT5G47020 | glycine-rich protein | PP_LEa0012O09f | |
| | | AT5G47010 | RNA helicase | PP_LEa0010E19f | |
| pp126 | 3 | AT5G54010 | glycosyltransferase family protein | PP_LEa0036D18f | ctg2264 |
| | | AT5G53940 | Yippee putative zinc-binding protein | PP_LEa0035H22f | |
| | | AT5G53770 | nucleotidyltransferase family protein | PP_LEa0025D10f | |
| pp127 | 3 | AT5G51050 | mitochondrial substrate carrier family protein | PP_LEa0034P07f | ctg2264 |
| | | AT5G50550 | WD-40 repeat family protein/St12p protein | PP_LEa0036H23f | |
| | | AT5G51180 | expressed protein similar to auxin down-regulated protein | PP_LEa0035K24f | |
| pp128 | 3 | AT5G43830 | ARG10 | PP_LEa0034K23f | ctg2264 |
| | | AT5G44340 | tubulin beta-4 chain (TUB4) | PP_LEa0035B10f | |
| | | AT5G44090 | calcium-binding EF hand family protein | PP_LEa0035H07f | |
| pp130 | 3 | AT5G15160 | bHLH family protein | PP_LEa0035P14f | ctg2264 |
| | | AT5G14680 | universal stress protein (USP) family protein | PP_LEa0025L13f | |
| | | AT5G14590 | isocitrate dehydrogenase | PP_LEa0034O16f | |
| pp132 | 3 | AT5G66460 | | PP_LEa0003M21f | ctg2269 |
| | | AT5G66510 | bacterial transferase | PP_LEa0009C17f | |
| | | AT5G66140 | 20S proteasome alpha subunit | PP_LEa0027M15f | |
| pp137 | 3 | AT5G53280 | expressed protein | PP_LEa0027O13f | ctg378 |
| | | AT5G53310 | myosin heavy chain-related | PP_LEa0013H04f | |
| | | AT5G53340 | galactosyltransferase family protein | PP_LEa0003L02f | |
Figure 6. Conserved syntenic regions with three or more gene pairs between the Arabidopsis genome and EST-anchored peach BAC contigs.
Conserved syntenic regions with three or more gene pairs between the pseudo-ancestral Arabidopsis genome and EST-anchored peach BAC contigs.
| Syntenic Group | Gene Pairs | Arabidopsis Gene | Description | EST Name | BAC Contig |
| pa3 | 3 | AT5G07990 | flavonoid 3'-monooxygenase | PP_LEa0010I09f | ctg1172 |
| | | AT5G08100 | L-asparaginase/L-asparagine amidohydrolase | PP_LEa0007L05f | |
| | | AT5G60910 | agamous-like MADS box protein AGL8 | PP_LEa0002N13f | |
| pa4 | 3 | AT2G45560 | cytochrome P450 family protein | PP_LEa0010I09f | ctg1172 |
| | | AT3G61040 | cytochrome P450 family protein | PP_LEa0010I09f | |
| | | AT2G45650 | MADS-box protein (AGL6) | PP_LEa0002N13f | |
| pa5 | 3 | AT1G68020 | glycosyl transferase family 20 protein | PP_LEa0001F16f | ctg1172 |
| | | AT1G23870 | glycosyl transferase family 20 protein | PP_LEa0001F16f | |
| | | AT1G24260 | MADS-box protein (AGL9) | PP_LEa0002N13f | |
| pa23 | 3 | AT5G66510 | contains bacterial transferase hexapeptide repeat | PP_LEa0009C17f | ctg1505 |
| | | AT5G66140 | 20S proteasome alpha subunit D2 | PP_LEa0027M15f | |
| | | AT5G66460 | | PP_LEa0003M21f | |
| pa26 | 4 | AT5G08380 | alpha-galactosidase/melibiase | PP_LEa0009B18f | ctg1565 |
| | | AT5G08540 | expressed protein | PP_LEa0027N06f | |
| | | AT5G08400 | expressed protein predicted proteins | PP_LEa0011C13f | |
| | | AT5G23440 | ferredoxin-thioredoxin reductase | PP_LEa0009N05f | |
| pa35 | 3 | AT5G26030 | ferrochelatase I | PP_LEa0004A06f | ctg1823 |
| | | AT5G11710 | epsin N-terminal homology domain-containing protein | PP_LEa0003I01f | |
| | | AT5G11770 | NADH-ubiquinone oxidoreductase 20 kDa subunit | PP_LEa0001H16f | |
| pa37 | 3 | AT5G47010 | RNA helicase | PP_LEa0010E19f | ctg190 |
| | | AT5G47040 | Lon protease homolog 1, mitochondrial (LON) | PP_LEa0001P13f | |
| | | AT5G47020 | glycine-rich protein | PP_LEa0012O09f | |
| pa59 | 3 | AT4G27740 | yippee family protein | PP_LEa0035H22f | ctg2264 |
| | | AT4G27880 | seven in absentia (SINA) family protein | PP_LEa0035M04f | |
| | | AT4G27560 | glycosyltransferase family protein | PP_LEa0036D18f | |
| pa61 | 3 | AT5G51050 | mitochondrial substrate carrier family protein | PP_LEa0034P07f | ctg2264 |
| | | AT5G51180 | expressed protein | PP_LEa0035K24f | |
| | | AT5G50550 | WD-40 repeat family protein/St12p protein | PP_LEa0036H23f | |
| pa64 | 3 | AT4G14960 | tubulin alpha-6 chain (TUA6) | PP_LEa0035B10f | ctg2264 |
| | | AT3G22170 | far-red impaired responsive protein | PP_LEa0036G03f | |
| | | AT3G22850 | similar to auxin down-regulated protein ARG10 | PP_LEa0034K23f | |
| pa71 | 3 | AT2G18280 | tubby-like protein 2 (TULP2) | PP_LEa0034J18f | ctg2264 |
| | | AT4G30260 | integral membrane Yip1 family protein | PP_LEa0034N14f | |
| | | AT2G18470 | protein kinase family protein | PP_LEa0036C20f | |
| pa82 | 3 | AT5G66510 | contains bacterial transferase hexapeptide repeat | PP_LEa0009C17f | ctg2269 |
| | | AT5G66460 | | PP_LEa0003M21f | |
| | | AT5G66140 | 20S proteasome alpha subunit D2 | PP_LEa0027M15f | |
| pa103 | 4 | AT3G56080 | dehydration-responsive protein-related | PP_LEa0017H06f | ctg58 |
| | | AT2G40060 | expressed protein | PP_LEa0017F24f | |
| | | AT2G39750 | dehydration-responsive family protein | PP_LEa0017H06f | |
| | | AT3G55590 | GDP-mannose pyrophosphorylase | PP_LEa0005L09f | |
| pa108 | 3 | AT4G29410 | 60S ribosomal protein L28 (RPL28C) | PP_LEa0001M19f | ctg9 |
| | | AT4G29480 | mitochondrial ATP synthase g subunit family protein | PP_LEa0025C15f | |
| | | AT2G19740 | 60S ribosomal protein L31 (RPL31A) | PP_LEa0008A18f | |
Figure 7. Proposed evolutionary steps involving some syntenic blocks between the Arabidopsis and peach genomes. Blocks in the putative ancestral Arabidopsis genome and in Arabidopsis chromosomes 3 and 5 that match the same peach BAC contig are illustrated. Red color is used to help track the genes. The order of the ESTs in the BAC contig is not shown because the ESTs were anchored to overlapping BACs.
Number of syntenic groups between Prunus/peach and Arabidopsis that are detected at various significance thresholds.
| Significance threshold | ||||||
| Syntenic Group | 99.90% | 99% | 95% | 90% | 80% | Total |
| gp | 21 (17) | 27 (20) | 56 | 81 | 108 | 139 (20) |
| ga | 11 (8) | 22 (12) | 39 | 64 | 86 | 101 (12) |
| pp | 18 (11) | 36 (16) | 65 | 85 | 102 | 140 (16) |
| pa | 13 (10) | 25 (14) | 50 | 70 | 93 | 111 (14) |
Numbers in parentheses indicate the syntenic groups with three or more gene pairs.
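As a reading aid for the threshold table, the sketch below checks that each row grows as the threshold is relaxed and computes the share of each category's groups that is detected at a given threshold. It assumes the columns are cumulative counts, which the table suggests but this record does not state. The helper names `is_cumulative` and `fraction_at` are hypothetical.

```python
# Counts transcribed from the table above (values in parentheses omitted).
thresholds = ["99.90%", "99%", "95%", "90%", "80%", "Total"]
counts = {
    "gp": [21, 27, 56, 81, 108, 139],
    "ga": [11, 22, 39, 64, 86, 101],
    "pp": [18, 36, 65, 85, 102, 140],
    "pa": [13, 25, 50, 70, 93, 111],
}


def is_cumulative(row):
    """True if the counts never decrease as the threshold is relaxed."""
    return all(a <= b for a, b in zip(row, row[1:]))


def fraction_at(group, threshold):
    """Share of a category's total groups detected at the given threshold."""
    row = counts[group]
    return row[thresholds.index(threshold)] / row[-1]


for group, row in counts.items():
    print(group, is_cumulative(row), round(fraction_at(group, "95%"), 3))
```

Under the cumulative reading, roughly 39% to 46% of the groups in each category are detected at the 95% threshold.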