| Literature DB >> 35518356 |
Fushun Wang1,2, Kang Zhang3,4,5, Ruolan Zhang1, Hongquan Liu6, Weijin Zhang1, Zhanxiao Jia1, Chunyang Wang3,4.
Abstract
Polyploidization plays a critical role in producing new gene functions and promoting species evolution. Effective identification of polyploid types can be helpful in exploring the evolutionary mechanism. However, current methods for detecting polyploid types have some major limitations, such as being time-consuming and strong subjectivity, etc. In order to objectively and scientifically recognize collinearity fragments and polyploid types, we developed PolyReco method, which can automatically label collinear regions and recognize polyploidy events based on the K S dotplot. Combining with whole-genome collinearity analysis, PolyReco uses DBSCAN clustering method to cluster K S dots. According to the distance information in the x-axis and y-axis directions between the categories, the clustering results are merged based on certain rules to obtain the collinear regions, automatically recognize and label collinear fragments. According to the information of the labeled collinear regions on the y-axis, the polyploidization recognition algorithm is used to exhaustively combine and obtain the genetic collinearity evaluation index of each combination, and then draw the genetic collinearity evaluation index graph. Based on the inflection point on the graph, polyploid types and related chromosomes with polyploidy signal can be detected. The validation experiments showed that the conclusions of PolyReco were consistent with the previous study, which verified the effectiveness of this method. It is expected that this approach can become a reference architecture for other polyploid types classification methods.Entities:
Keywords: DBSCAN; chromosome; clustering; collinearity fragment; polyploidy
Year: 2022 PMID: 35518356 PMCID: PMC9065682 DOI: 10.3389/fgene.2022.842387
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1(A) K dotplot between Salix sinopurpurea and Vitis vinifera genome homologous genes (B) DBSCAN cluster recognition effect figure (C) Automatic label result of collinearity fragments based on DBSCAN.
Partial data of grape chromosome 13 cluster.csv file.
| chr1 | chr2 | id | l_x | l_y | num | r_x | r_y | y1-y2 | x1-x2 |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 13 | 1 | 145 | 354 | 32 | 257 | 228 | 126 | 112 |
| 4 | 13 | 1 | 1,266 | 1,153 | 10 | 1,310 | 1,097 | 56 | 44 |
| 6 | 13 | 1 | 838 | 1,267 | 17 | 913 | 1,097 | 170 | 75 |
| 6 | 13 | 2 | 735 | 994 | 26 | 834 | 840 | 154 | 99 |
| 8 | 13 | 1 | 592 | 1,267 | 56 | 671 | 1,155 | 112 | 79 |
Partial data of grape chromosome 13 combine.csv file.
| chr1 | chr2 | id | l_x | l_y | num | r_x | r_y | ∆y | ∆x |
|---|---|---|---|---|---|---|---|---|---|
| 8 | 13 | 1 | 592 | 1,267 | 56 | 671 | 1,155 | 112 | 79 |
| 8 | 13 | 2 | 78 | 1,089 | 239 | 605 | 446 | 643 | 527 |
| 9 | 13 | 1 | 378 | 257 | 63 | 478 | 89 | 168 | 100 |
| 10 | 13 | 1 | 1,327 | 1,267 | 65 | 1,426 | 1,157 | 110 | 99 |
| 10 | 13 | 2 | 1,419 | 1,088 | 276 | 1,934 | 457 | 631 | 515 |
FIGURE 2(A) Collinearity evaluation index line chart of Vitis vinifera Chr.4, Chr.13, and Chr.14 (B) Combination figure of Salix sinopurpurea polyploidy. (A,B) correspond to each other. (1) The grape chromosome 4 (2) The grape chromosome 13 (3) The grape chromosome 14.
Vitis vinifera Chr. 13 result.csv file.
| chr1 | chr2 | id | l_x | l_y | num | r_x | r_y | Sumy |
|
| Comro |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 8 | 13 | 2 | 78 | 1,089 | 239 | 605 | 446 | 947 | 643 | 527 | 1 |
| 8 | 13 | 1 | 592 | 1,267 | 56 | 671 | 1,155 | 947 | 112 | 79 | 1 |
| 16 | 13 | 1 | 1,163 | 283 | 42 | 1,235 | 91 | 947 | 192 | 72 | 1 |
| 10 | 13 | 2 | 1,419 | 1,088 | 276 | 1,934 | 457 | 909 | 631 | 515 | 2 |
| 10 | 13 | 1 | 1,327 | 1,267 | 65 | 1,426 | 1,157 | 909 | 110 | 99 | 2 |
| 9 | 13 | 1 | 378 | 257 | 63 | 478 | 89 | 909 | 168 | 100 | 2 |
FIGURE 3(A) Collinearity evaluation index line chart of Arabidopsis thaliana Chr.1, Chr.3 and Chr.5 (B) Combination figure of Brassica rapa polyploidy. (A,B) correspond to each other. (1) The Arabidopsis thaliana chromosome 1 (2) The Arabidopsis thaliana chromosome 3 (3) The Arabidopsis thaliana chromosome 5
Arabidopsis thaliana Chr. 3 result.csv file.
| chr1 | chr2 | id | l_x | l_y | num | r_x | r_y | Sumy |
|
| Comro |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 3 | 1 | 2,919 | 5,436 | 857 | 4,001 | 2,763 | 4,793 | 2,673 | 1,082 | 1 |
| 9 | 3 | 1 | 3,322 | 1,458 | 827 | 4,386 | 8 | 4,793 | 1,450 | 1,064 | 1 |
| 6 | 3 | 1 | 1,571 | 2,132 | 260 | 2,021 | 1,462 | 4,793 | 670 | 450 | 1 |
| 1 | 3 | 1 | 2,581 | 5,431 | 1,036 | 3,948 | 2,863 | 3,799 | 2,568 | 1,367 | 2 |
| 4 | 3 | 1 | 3 | 1,231 | 501 | 631 | 0 | 3,799 | 1,231 | 628 | 2 |
| 5 | 3 | 1 | 1,955 | 5,432 | 1,374 | 3,668 | 3,021 | 3,533 | 2,411 | 1,713 | 3 |
| 7 | 3 | 1 | 1,555 | 1,130 | 373 | 1,998 | 8 | 3,533 | 1,122 | 443 | 3 |