| Literature DB >> 32235513 |
Bin Jiang1, Yikun Zhao1, Hongmei Yi1, Yongxue Huo1, Haotian Wu1, Jie Ren1, Jianrong Ge1, Jiuran Zhao1, Fengge Wang1.
Abstract
The high variability and somatic stability of DNA fingerprints can be used to identify individuals, which is of great value in plant breeding. DNA fingerprint databases are essential and important tools for plant molecular research because they provide powerful technical and information support for crop breeding, variety quality control, variety right protection, and molecular marker-assisted breeding. Building a DNA fingerprint database involves the production of large amounts of heterogeneous data for which storage, analysis, and retrieval are time and resource consuming. To process the large amounts of data generated by laboratories and conduct quality control, a database management system is urgently needed to track samples and analyze data. We developed the plant international DNA-fingerprinting system (PIDS) using an open source web server and free software that has automatic collection, storage, and efficient management functions based on merging and comparison algorithms to handle massive microsatellite DNA fingerprint data. PIDS also can perform genetic analyses. This system can match a corresponding capillary electrophoresis image on each primer locus as fingerprint data to upload to the server. PIDS provides free customization and extension of back-end functions to meet the requirements of different laboratories. This system can be a significant tool for plant breeders and can be applied in forensic science for human fingerprint identification, as well as in virus and microorganism research.Entities:
Keywords: DNA fingerprint; algorithms; database; genotyping; microsatellites
Year: 2020 PMID: 32235513 PMCID: PMC7230844 DOI: 10.3390/genes11040373
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Schematic representation of the Fingerprint Merging Algorithm.
Conditions to be specified for the Fingerprint Comparison Algorithm.
| Comparison Method | Default Fingerprint Data Range |
|---|---|
| Database Comparison | Entire Local Fingerprint Database |
| Homonymy Comparison | Fingerprints of the same name or synonyms within the entire Local Fingerprint Database |
| Non-homonymy Comparison | Fingerprints with different names and synonyms within the entire Local Fingerprint Database |
| Sub-Database Comparison | Assigned through Excel |
| Paired Comparison | Assigned through Excel |
Parameters required in the Fingerprint Comparison Algorithm.
| Condition | Description |
|---|---|
| Number of comparison loci | To control the matching of locus between different fingerprints, value: X ≥ 0. This parameter is used in the Fingerprint Comparison Algorithm, proper control of a value can reduce invalid comparison to improve fingerprint comparison speed, the default value of min(X) is 20. |
| Number of differential loci | To control the locus difference between samples, which used in the Fingerprint Comparison Algorithm to filter the comparison results. Proper control of a value can reduce the display of useless results, the range of X is ≥0, and the default value of max(X) is 20. |
| Percentage of differential loci | To control the degree of difference between samples, the range of X is 0 ≤ X ≤ 1, detail descriptions can be found in the mixed strain comparison algorithm. The default value of max(X) is 0.05. |
| Base offset | To control the difference between the two loci, the range of X is 0 bp ≤ X ≤ 2 bp. The default MaxX is 2 bp. This parameter is used in the comparison algorithm. |
Figure 2Modular structure of plant international DNA-fingerprinting system (PIDS), including the workflows between the modules and functions. PIDS covers three core fingerprint databases and the data transfer relationship between them. Modules 5 and 6 perform the core functions of identification and data analysis. The workflows ensure that the entire experiment and data analysis is a closed loop process. When problems occur, they can be solved by supplementary experiments or reanalysis, which can improve the efficiency of the data transfer process.
Figure 3Entity relationship and class table models in PIDS. (A) Entity relationship model diagram (ERD) based on Chen’s ERD notation. The rectangular boxes indicate entities; ellipses indicate properties of the entities; and diamonds indicate relationships between entities. LFD, Local Fingerprint Database; SFD, Sample Fingerprint Database; EFD, Experimental Fingerprint Database. (B) Table-like model diagram based on the ERD. The top of each box contains the table name. Primary key (PK) indicates a primary key. Foreign key (FK) indicates a foreign key to another table and is indicated by an arrow.
Figure 4The PIDS interface and representative workflows. (A) Users can import the sample information table and DNA information table of the experiment, and automatically create PCR and electrophoresis plates according to the given condition parameters. (B) Users can print the automatically generated electrophoresis plate hole position information table for actual DNA detection experiments. (C) Users can view the fingerprint data and map information submitted for storage. (D) Users can enter comparison condition parameters and view detailed information of the comparison results. (E) Users can enter genetic analysis parameters and generate a chart of genetic analysis related results.