| Literature DB >> 23706002 |
Xun Chen, Xuemin Li, Bing Zhang, Jinsong Xu, Zhikun Wu, Bo Wang, Haitao Li, Muhammad Younas, Lei Huang, Yingfeng Luo, Jiangsheng Wu, Songnian Hu, Kede Liu.
Abstract
BACKGROUND: The presence of homoeologous sequences and absence of a reference genome sequence make discovery and genotyping of single nucleotide polymorphisms (SNPs) more challenging in polyploid crops.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23706002 PMCID: PMC3665465 DOI: 10.1186/1471-2164-14-346
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Distribution of tags along the A03 reference genome sequence. A. Distribution of physical distances between every two adjacent in silico restriction fragments in the size range of 141–420 bp. B. The distribution of in silico restriction fragments within the size range of 141–420 bp along the B. rapa A03 reference genome sequence within a sliding window of 250 kb.
Figure 2The bioinformatics pipeline RFAPtools. A. Assembly of pseudo-reference sequence. Pair-end sequence reads from ZY821, No2127 and pooled DHs were linked to form artificial sequence tags and collapsed into clusters. Then clusters present in both ZY821 and DHs were concatenated to assemble the ZY821 pseudo-reference sequence (ZY821-PRF). Thirdly, all clusters from No2127 were aligned to ZY821-PRF using SOAP with two nucleotide mismatches and those clusters that could not mapped were joined to the end of ZY821-PRF as clusters specific to No2127 to form the final pseudo-reference sequence. B. Discovery and genotyping of SNPs and PAVs. Sequencing reads of each individual were aligned to the PRF and all alleles at each locus were called as “+” indicating the presence of an allele and “-” indicating the absence of that allele in the two parents and 91 DH lines. C. Discrimination of allelic SNPs from homoeologous loci. A co-occurrence matrix of every two alleles from a complex locus with homoeologous sequences in the BnaNZDH population was made and allelic SNP and PAV loci were discriminated from homoeologous loci. Then genotypes of all individuals were scored for each locus.
Figure 3The SNP bin map constructed using 91 DH individuals. The X axis indicates the genetic distance (cM), and the Y axis indicates the number of SNP loci in each bin. The height of bar is maximized with 30 loci for each bin. The bins with more than 30 loci are marked individually with numbers on the bars.
Summary of the SNP linkage map
| A01 | 95.2 | 38 | 327(2) | C01 | 61.6 | 37 | 229(2) |
| A02 | 72.7 | 35 | 203(3) | C02 | 95.8 | 57 | 1023(2) |
| A03 | 124.4 | 65 | 397(3) | C03 | 175.0 | 101 | 1138(3) |
| A04 | 60.6 | 29 | 195(3) | C04 | 75.1 | 27 | 218(2) |
| A05 | 77.1 | 41 | 278(3) | C05 | 144.1 | 46 | 237(1) |
| A06 | 128.9 | 58 | 493(3) | C06 | 125.5 | 55 | 883(3) |
| A07 | 66.0 | 43 | 296(2) | C07 | 86.0 | 45 | 662(1) |
| A08 | 86.2 | 34 | 165(3) | C08 | 90.6 | 38 | 591(3) |
| A09 | 109.3 | 46 | 476(2) | C09 | 105.6 | 57 | 730(3) |
| A10 | 80.3 | 43 | 286(3) | | | | |
| A genome | 900.7 | 432 | 3116(27) | C genome | 959.3 | 463 | 5711(20) |
The number in the parenthesis represents the number of anchor SSR loci in this linkage group.
Figure 4Alignments between the SNP linkage map and the reference genome sequence. The X axis represents the genetic distance of each A genome linkage group in B. napus, and the Y axis represents the physical distance of reference sequence of each corresponding B. rapa chromosome. The green points represent the best hit (E value < =1e-10) of multiple paralogous loci in the B. rapa genome. The red points represent loci with only one end of the PE read having a unique position in the B. rapa genome. The blue points represent loci with both ends of the PE read having a unique position in the B. rapa genome. The oval circles indicate SNP loci that detected blocks of misassembly in the B. rapa reference genome sequence.