| Literature DB >> 26990124 |
Cheng Zou1, Pingxi Wang1, Yunbi Xu2,3.
Abstract
Biological assay has been based on analysis of all individuals collected from sample populations. Bulked sample analysis (BSA), which works with selected and pooled individuals, has been extensively used in gene mapping through bulked segregant analysis with biparental populations, mapping by sequencing with major gene mutants and pooled genomewide association study using extreme variants. Compared to conventional entire population analysis, BSA significantly reduces the scale and cost by simplifying the procedure. The bulks can be built by selection of extremes or representative samples from any populations and all types of segregants and variants that represent wide ranges of phenotypic variation for the target trait. Methods and procedures for sampling, bulking and multiplexing are described. The samples can be analysed using individual markers, microarrays and high-throughput sequencing at all levels of DNA, RNA and protein. The power of BSA is affected by population size, selection of extreme individuals, sequencing strategies, genetic architecture of the trait and marker density. BSA will facilitate plant breeding through development of diagnostic and constitutive markers, agronomic genomics, marker-assisted selection and selective phenotyping. Applications of BSA in genetics, genomics and crop improvement are discussed with their future perspectives.Entities:
Keywords: DNA-seq; RNA-seq; breeding; bulked sample analysis; bulked segregant analysis; chips; microarrays; protein-seq
Mesh:
Year: 2016 PMID: 26990124 PMCID: PMC5043468 DOI: 10.1111/pbi.12559
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Figure 1Plant populations and their relationships. H&D, chromosome haploidization and doubling; BC, backcross; BIL, backcross inbred line; DH, double haploid; IM, intermating; MAGIC, multiparent advance generation intercross; CP, common parent; NAM, nested association mapping; NIL, near‐isogenic line; RIL, recombinant inbred line; TC, testcross; TTC, triple testcross; sTTC, simplified triple testcross; NCD, North Carolina Design. Modified from Xu (2016).
Figure 2Four types of bulked sample analysis (BSA). (a) BSA for qualitative traits such as disease resistance with two distinct phenotypes (R, resistance; S, susceptible). (b) BSA for quantitative traits with normal distribution, among which samples from two tails (L: lower; U, upper) are selected and bulked. (c) BSA for multiple parallel bulks with individuals selected independently from the two tails of a normal distribution. (d) BSA with only one bulk available for the target trait, while the other tail was killed by lethal genes or due to severe stresses, when compared with individuals randomly selected from a control population under no stress with normal allele frequencies for the target trait; CK: plants from the control population, R: plants selected from the stressed environment.
Figure 3The pipeline and affecting factors of bulked sample analysis. Two major types of populations, artificial population and nature variants, are taken as an example, with two bulks (Bulk A and Bulk B) formed by selection of individuals with extreme phenotypes. Artificial populations: a segregating population derived from biparents and multiparents. Natural variants: a population including various natural variants, which are selected from either single or multiple sources. The pipeline starts from original populations and ending‐ups with applications.
Analytical methods based on individual markers, microarrays and sequencing at DNA, RNA and protein levels
| Individual/low throughput | Microarray | Sequencing | |
|---|---|---|---|
| DNA |
PCR‐based markers (RAPD, STS, SCAR, RP‐PCR, AP‐PCR, OP‐PCR, SSCP‐PCR, SODA, DAF, AFLP, SRAP, TRAP, Indels) |
DArT |
Next‐generation sequencing (SNP, Indels, PAV, CNV, DNA rearrangement) |
| RNA |
mRNA‐based markers (DD, RT‐PCR, DDRT‐PCR, RDA, EST, STS, SAGE) | Transcriptome array (gene expression) | RNA‐seq (novel transcript, SNP, Indel, alternative splicing) |
| Protein |
Western blotting |
Protein array |
MS, MS/MS |
AFLP, amplified fragment length polymorphism; AP‐PCR, arbitrary primer‐PCR; CGH, comparative genomic hybridization; CNV, copy number variation; DAF, DNA amplification fingerprinting; DArT, diversity array technology; DD, differential display; DDRT‐PCR, differential display reverse transcription PCR; EST, expression sequence tags; ICAT, isotope‐coded affinity tags; ICPL, isotope‐coded protein labelling; Indel, insertion/deletion polymorphism; iTRAQ, isobaric tags for absolute and relative quantification; LC‐MS, liquid chromatography–mass spectrometry; MCAT, mass‐coded abundance tag; MS, mass spectrometry; MS/MS, tandem MS; OP‐PCR, oligo primer‐PCR; PAV, presence‐absence variation; RAPD, randomly amplified polymorphic DNA; RDA, representational difference analysis; RFLP, restriction fragment length polymorphism; RP‐HPLC, reversed phase liquid chromatography; SSCP‐RFLP, single strand conformation polymorphic RFLP; DGGE‐RFLP, denaturing gradient gel electrophoresis RFLP; RP‐PCR, random primer‐PCR; RT‐PCR, reverse transcription PCR; SAGE, serial analysis of gene expression; SCAR, sequence characterized amplified region; SILAC, stable isotope labelling with amino acids in cell culture; SNP, single‐nucleotide polymorphism; SODA, small oligo DNA analysis; SRAP, sequence‐related amplified polymorphism; SRS, short repeat sequence; SSCP‐PCR, single strand conformation polymorphism‐PCR; SSR, simple sequence repeat; STS, sequence tagged site; 2D‐PAGE, two‐dimensional polyacrylamide electrophoresis; TILLING, targeting induced local lesions in genomes; TRAP, target region amplified polymorphism; TRS, tandem repeat sequence.
Examples of bulked sample analysis for gene mapping in plants
| Traits | Population type | Population size | Tail size | References | |
|---|---|---|---|---|---|
| Chip‐based analysis | |||||
| Maize | Root‐lodging | F2 | 450 | 30, 30 | Farkhari |
| Maize | Dozen recessive mutants | F2 | – | 20, 20 | Liu |
| Wheat | Leaf rust | F3:4 | 124 | 15, 15 | Forrest |
|
| Sulphur and selenium content | F2 | 412 | 31, 33 | Becker |
| Cotton | Salt resistance | BC2F1 | 99 | 10, 10 | Rodriguez‐Uribe |
| Common bean | Bean common mosaic virus | Natural population | 506 | – | Bello |
| Pepper | Phytophthora root rot | F2 | 200 | 20, 20 | Liu |
| Poplar | Photosynthetic traits | F2 | 1200 | 15, 15 | Wang |
| DNA‐seq‐based analysis | |||||
| Rice | Salt tolerance | F2 | – | 20, 20 | Takagi |
| Rice | Male sterility | F2 | 946 | – | Frouin |
| Rice | Blast disease and seedling vigour | RILs | 241 | 20, 20 | Takagi |
| F2 | 531 | 50, 50 | |||
| Rice | Blast disease | F2 | – | 20, 20 | Takagi |
| Rice | Cold tolerance | F3 | 10 800 | 430, 385 | Yang |
| Rice | Pale green leaves and semidwarfism | F2 | – | 20, 20 | Abe |
| Cotton | Nulliplex‐branch | F2 | 168 | 30, 30 | Chen |
| Cotton | Short‐fibre mutant | F2 | 536 | 100, 100 | Thyssen |
| Cucumber | Early flowering | F2 | – | 10, 10 | Lu |
| Maize | Multiple traits | Natural population | 7000 | 200, 200 | Yang |
| RNA‐seq based analysis | |||||
| Maize |
| F2 | – | 32, 31 | Liu |
| Wheat | Grain protein content | RSLs | – | 14, 14 | Trick |
| Wheat | Yellow rust | F2 | 232 | – | Ramirez‐Gonzalez |
| Sunflower | Downy mildew | F2 | 2141 | 16, 16 | Livaja |
| Sand pear | Pericarp russet pigmentation | F2 | – | 10, 10 | Wang |
| Onion | Restorer‐of‐fertility | F2:5 | 251 | 10, 10 | Kim |
| Radish | Cytoplasmic male sterility | F2 | 224 | 10, 10 | Lee |
RILs‐Recombinant inbred lines;? RSLs‐Recombinant substitution line.
‘–’: The information unavailable.