| Literature DB >> 25874132 |
M Govindaraj1, M Vetriventhan1, M Srinivasan2.
Abstract
The importance of plant genetic diversity (PGD) is now being recognized as a specific area since exploding population with urbanization and decreasing cultivable lands are the critical factors contributing to food insecurity in developing world. Agricultural scientists realized that PGD can be captured and stored in the form of plant genetic resources (PGR) such as gene bank, DNA library, and so forth, in the biorepository which preserve genetic material for long period. However, conserved PGR must be utilized for crop improvement in order to meet future global challenges in relation to food and nutritional security. This paper comprehensively reviews four important areas; (i) the significance of plant genetic diversity (PGD) and PGR especially on agriculturally important crops (mostly field crops); (ii) risk associated with narrowing the genetic base of current commercial cultivars and climate change; (iii) analysis of existing PGD analytical methods in pregenomic and genomic era; and (iv) modern tools available for PGD analysis in postgenomic era. This discussion benefits the plant scientist community in order to use the new methods and technology for better and rapid assessment, for utilization of germplasm from gene banks to their applied breeding programs. With the advent of new biotechnological techniques, this process of genetic manipulation is now being accelerated and carried out with more precision (neglecting environmental effects) and fast-track manner than the classical breeding techniques. It is also to note that gene banks look into several issues in order to improve levels of germplasm distribution and its utilization, duplication of plant identity, and access to database, for prebreeding activities. Since plant breeding research and cultivar development are integral components of improving food production, therefore, availability of and access to diverse genetic sources will ensure that the global food production network becomes more sustainable. The pros and cons of the basic and advanced statistical tools available for measuring genetic diversity are briefly discussed and their source links (mostly) were provided to get easy access; thus, it improves the understanding of tools and its practical applicability to the researchers.Entities:
Year: 2015 PMID: 25874132 PMCID: PMC4383386 DOI: 10.1155/2015/431487
Source DB: PubMed Journal: Genet Res Int ISSN: 2090-3162
Figure 1Changes in the relative global production of crops since 1961 (when relative production scaled to 1 (m.t) in 1961) (source: http://faostat.fao.org/default.aspx (2010)).
Some basic statistical concept on genomic data for genetic diversity assessment.
| Concept terms | Description/features | Formulae/pros/cons |
|---|---|---|
| Band-based approaches | Easiest way to analyze and measure diversity by focusing on presence or absence of banding pattern. | Routinely use individual level. |
|
| ||
| (1) Measuring polymorphism | Observing the total number of polymorphic bands (PB) and then calculating the percentage of polymorphic bands. | This “band informativeness” (Ib) can be represented on a scale ranging from 0 to 1 according to the formula |
|
| ||
| (2) Shannon's information index ( | It is called the Shannon index of phenotypic diversity and is widely applied. |
|
|
| ||
| (3) Similarity coefficients | Utilize similarity or dissimilarity (the inverse of the previous one) coefficients. |
(i) |
|
| ||
| (4) Allele frequency based approaches | Measure variability by describing changes in allele frequencies for a particular trait over time, more population oriented than band-based approaches. | These methods depend on the extraction of allelic frequencies from the data. |
|
| ||
| (5) Allelic diversity ( | Easiest ways to measure genetic diversity is to quantify the number of alleles present. |
|
|
| ||
| (6) Effective population size ( | It provides a measure of the rate of genetic drift, the rate of genetic diversity loss, and increase of inbreeding within a population. | Effective size of a population is an idealized number, since many calculations depend on the genetic parameters used and on the reference generation. Thus, a single population may have many different effective sizes which are biologically meaningful but distinct from each other. |
|
| ||
| (7) Heterozygosity ( | There are two types of heterozygosity observed ( | Expected |
|
| ||
| (8) | In population genetics the most widely applied measurements besides heterozygosity are | Three indexes can be calculated as follows: |
List of analytical programs for measuring molecular (genetic) diversity.
| Analytical tools | Data type | Main features | Source links | Reference |
|---|---|---|---|---|
| Arlequin | RFLPs, DNA sequences, SSR data, allele frequencies, or standard multilocus genotypes. | (i) Estimation allele and haplotype frequencies. |
| Schneider et al. [ |
|
| ||||
| DnaSP | DNA sequence data | (i) Estimating several measures of DNA sequence variation within and between populations (in noncoding, synonymous, or nonsynonymous sites or in various sorts of codon positions), as well as linkage disequilibrium, recombination, gene flow, and gene conversion parameters. |
| J. Rozas and R. Rozas, [ |
|
| ||||
| PowerMarker | SSR, SNP, and RFLP data | (i) Computes several summary statistics for each marker locus, including allele number, missing proportion, heterozygosity, gene diversity, polymorphism information content (PIC), and stepwise patterns for microsatellite data. |
| Liu and Muse [ |
|
| ||||
| DARwin | Single data (for haploids, homozygote diploids, and dominant markers), allelic data, and sequence data | (i) Most widely used for various dissimilarity and distance estimations for different data, tree construction methods including hierarchical trees with various aggregation criteria (weighted or unweighted), Neighbor-Joining tree (weighted or unweighted), Scores method and principal coordinate analysis, and so forth. |
| Perrier and Jacquemoud-Collet [ |
|
| ||||
| NTSYSpc | Single data (for haploids, homozygote diploids, and dominant markers), allelic data, and sequence data | (i) Used for clustering analysis, ordination analysis, principal component analysis, principal coordinate analysis, scaling analysis, and comparison of two matrices (Mantel test, Mantel [ |
| Rohlf [ |
|
| ||||
| MEGA | DNA sequence, protein sequence, evolutionary distance, or phylogenetic tree data | (i) Molecular evolutionary genetics analysis (MEGA) is most widely used for aligning sequences, estimating evolutionary distances, building tree from sequence data, testing tree reliability, and so forth. |
|
Kumar et al. [ |
|
| ||||
| PAUP | Molecular sequences, morphological data, and other data types | (i) Used for inferring and interpreting phylogenetic trees using parsimony, distance matrix, invariants, maximum likelihood methods, and many indices and statistical analyses. |
| Swofford [ |
|
| ||||
| STRUCTURE | All types of markers including mostly used markers like SSRs, SNPs, RFLPs, dArT, and so forth. | (i) A free program to investigate population structure; it includes inferring the presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. |
| Pritchard et al. [ |
|
| ||||
| fastSTRUCTURE | SNP | (i) An algorithm for inferring population structure from large SNP genotype data. |
| Raj et al. [ |
|
| ||||
| ADMIXTURE | SNP | (i) ADMIXTURE is a program for maximum likelihood estimation of individual ancestries from multilocus SNP genotype datasets. |
| Alexander et al. [ |
|
| ||||
| fineSTRUCTURE | Sequencing data | (i) A fast and powerful algorithm for identifying population structure using dense sequencing data. |
| Lawson et al. [ |
|
| ||||
| POPGENE | Use the dominant, codominant, and quantitative data for population genetic analysis | (i) Used to calculate gene and genotype frequency, allele number, effective allele number, polymorphic loci, gene diversity, observed and expected heterozygosity, Shannon index, homogeneity test, |
|
Francis et al. [ |
|
| ||||
| GENEPOP | Haploid or diploid data | (i) Used to compute exact tests or their unbiased estimation for Hardy-Weinberg equilibrium, population differentiation, and two-locus genotypic disequilibrium. |
| Raymond and Rousset [ |
|
| ||||
| GenAIEx | Codominant, haploid, and binary genetic data. It accommodates the full range of genetic markers available, including allozymes, SSRs, SNPs, AFLP, and other multilocus markers, as well as DNA sequences | (i) GenAIEx runs within Microsoft Excel enabling population genetic analysis of codominant, haploid, and binary data. Used to compute allele frequency-based analyses including heterozygosity, |
| Peakall and Smouse [ |