| Literature DB >> 34145284 |
Teun A P M Huijben1, Hamidreza Heydarian1, Alexander Auer2,3, Florian Schueder2,3, Ralf Jungmann2,3, Sjoerd Stallinga1, Bernd Rieger4.
Abstract
Particle fusion for single molecule localization microscopy improves signal-to-noise ratio and overcomes underlabeling, but ignores structural heterogeneity or conformational variability. We present a-priori knowledge-free unsupervised classification of structurally different particles employing the Bhattacharya cost function as dissimilarity metric. We achieve 96% classification accuracy on mixtures of up to four different DNA-origami structures, detect rare classes of origami occuring at 2% rate, and capture variation in ellipticity of nuclear pore complexes.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34145284 PMCID: PMC8213809 DOI: 10.1038/s41467-021-24106-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Classification pipeline.
N particles are pairwise registered, resulting in N(N-1)/2 dissimilarity values. Multidimensional scaling (MDS) embeds the elements of the dissimilarity matrix in a multidimensional space (only the first 3 dimensions are shown). K-means clustering in this space results in K clusters and the particles are fused per cluster.
Fig. 2Classification of experimental DNA origami datasets.
a Templates of four DNA-origami designs in the digits dataset: the digits 1, 2, and 3 and a 20 nm grid. b Fusion result without classification of 2000 particles (randomly selected from a set of 10,000 images, 2500 per class). This data is imaged separately per class and combined into one dataset prior to the classification. c–f The four classes resulting from the classification of 5000 images (randomly selected from a set of 10,000 images, 2500 per class) containing 1374, 1179, 1214, and 1233 particles, respectively. g Confusion matrix of the classifications c–f with an overall performance of 96.4%. h Fusion result without classification of 2000 particles imaged in one FOV. i–l The four classes resulting from the classification of 5000 particles images in one FOV, containing 1219, 1309, 1278, and 1194 particles, respectively. m Confusion matrix of the classifications i–l, with an overall performance of 96.4%. n Templates of three DNA-origami designs in the letters dataset: letters T, O, and L. o Fusion result without classification of 600 particles (200 per class). This data is imaged separately per class and combined into one dataset prior to the classification. p–s The four classes resulting from the classification of o, containing 207, 122, 176, and 95 particles, respectively. t Average confusion matrix of two independent classifications performed as in p–s with an average performance of 97.3 ± 1.9%, where the class of misfolds, s, is not taken into account. u Fusion result without classification of 800 particles imaged in one FOV. v–z The five classes resulting from the classification of u, containing 170, 238, 130, 139, and 123 particles, respectively. Scale bar of b applies to c–f and h–l. Scale bar of o applies to p–s and u–z.
Fig. 3Classification of flipped DNA Origami data and NPC data.
a Fusion result without classification of 456 DNA-origami structures of the TUD-logo with 80% DoL. b, c The two classes resulting from the classification of a, containing 446 (normal orientation) and 10 (flipped orientation) particles per class, respectively. d–f Same as a–c, but with 50% DoL and 381 and 8 particles per class, respectively. g, h The two classes resulting from the classification of i, containing 168 and 136 particles, respectively. We used K = 4 for the 80% DoL data and K = 40 for the 50% DoL data, followed by further grouping with the eigen image approach with C = 2 (Methods). i Fusion result without classification of 304 NPCs. j–l Same as g, h, but with classification into three classes, with 133, 90 and 81 particles per class, respectively. For g–l, the ellipticity values e are defined as the ratio of the major axis over the minor axis of the ellipse fitted to the localizations, and are indicated below each class. Scale bar of a applies to b–f. Scale bar of g applies to h–l.