| Literature DB >> 31649061 |
Dongqing Shi1, Jun Wu1, Haibao Tang2, Hao Yin1, Hongtao Wang3, Ran Wang4, Runze Wang1, Ming Qian1, Juyou Wu1, Kaijie Qi1, Zhihua Xie1, Zhiwen Wang5, Xiang Zhao5, Shaoling Zhang1.
Abstract
Genome assemblies from diploid organisms create mosaic sequences alternating between parental alleles, which can create erroneous gene models and other problems. In animals, a popular strategy to generate haploid genome-resolved assemblies has been the sampling of (haploid) gametes, and the advent of single-cell sequencing has further advanced such methods. However, several challenges for the isolation and amplification of DNA from plant gametes have limited such approaches in plants. Here, we combined a new approach for pollen protoplast isolation with a single-cell DNA amplification technique and then used a "barcoding" bioinformatics strategy to incorporate haploid-specific sequence data from 12 pollen cells, ultimately enabling the efficient and accurate phasing of the pear genome into its A and B haploid genomes. Beyond revealing that 8.12% of the genes in the pear reference genome feature mosaic assemblies and enabling a previously impossible analysis of allelic affects in pear gene expression, our new haploid genome assemblies provide high-resolution information about recombination during meiosis in pollen. Considering that outcrossing pear is an angiosperm species featuring very high heterozygosity, our method for rapidly phasing genome assemblies is potentially applicable to several yet-unsequenced outcrossing angiosperm species in nature.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31649061 PMCID: PMC6836740 DOI: 10.1101/gr.251033.119
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Schematic of haploid phasing using a barcoding approach. (A) Pollen is a haploid gamete. Here, we combined a new approach for pollen protoplast isolation with a single-cell DNA amplification technique and then used a barcoding bioinformatics strategy to incorporate haploid-specific sequence data from pollen cells, ultimately enabling the efficient and accurate phasing to assemble a haplotype genome as well as detecting meiotic recombination events. (B) Establishing the relationship between 12 pollen cells and BAC sequences using 12-bit binary codes in which “1” indicates identical and “0” indicates “not identical” or “absent.” Each base was labeled with a 12-bit binary code. Finally, each BAC was labeled with a specific 12-bit binary code. (C) BACs from different haplotypes received different 12-bit binary codes. (D) The chromosomal location of each BAC on the reference genome chromosomes was determined by aligning the BACs to the anchored reference chromosomes. (E) BACs were classified into haplotypes based on the chromosomal location and the 12-bit binary code.
Comparison of the haplotype-resolved genomes with the previous pear reference genome
Comparison of gene annotation between the haplotype-resolved and reference pear genomes
Figure 2.Distribution of mosaic sequences in genome assemblies built from incompletely phased data. Genes were classified as two types: mosaic errors in introns (blue) and mosaic errors in exons (red).
Figure 3.Mosaic assembly of Pbr017687.1 in the reference genome. Four base pairs in yellow only matched haplotype A, and 4 bp in red only matched haplotype B. Five of these eight sequences result in differences in amino acid sequence. (N) Redundant sequences in the reference.
Figure 4.The difference in transposon insertions between haplotype A and haplotype B. (A) Inversion insertions around Pbr020217.2 in a region with high collinearity between the reference, haplotype A, and haplotype B. Orange arrows show the positions of transposons. (B) Four LTR/copia transposon insertions in Pbr030337.1. Orange arrows show the positions of transposons.
Figure 5.Monoallelic expression in the development of pear fruit. The four stages include the falling stage, swelling stage, later swelling stage, and ripeness stage. Blue shows the number of monoallelic expression in haplotype A. Orange shows the number of monoallelic expression in haplotype B.
Figure 6.Meiotic recombination in 12 pollen cells. Haplotype A and haplotype B are shown in different colors, with switching points between the haplotypes representing meiotic crossover (MCO) events.