Literature DB >> 33664929

A clustering method for graphical handwriting components and statistical writership analysis.

Amy M Crawford1, Nicholas S Berry1,2, Alicia L Carriquiry1.   

Abstract

Handwritten documents can be characterized by their content or by the shape of the written characters. We focus on the problem of comparing a person's handwriting to a document of unknown provenance using the shape of the writing, as is done in forensic applications. To do so, we first propose a method for processing scanned handwritten documents to decompose the writing into small graphical structures, often corresponding to letters. We then introduce a measure of distance between two such structures that is inspired by the graph edit distance, and a measure of center for a collection of the graphs. These measurements are the basis for an outlier tolerant K-means algorithm to cluster the graphs based on structural attributes, thus creating a template for sorting new documents. Finally, we present a Bayesian hierarchical model to capture the propensity of a writer for producing graphs that are assigned to certain clusters. We illustrate the methods using documents from the Computer Vision Lab dataset. We show results of the identification task under the cluster assignments and compare to the same modeling, but with a less flexible grouping method that is not tolerant of incidental strokes or outliers.
© 2020 The Authors. Statistical Analysis and Data Mining published by Wiley Periodicals LLC.

Entities:  

Keywords:  Bayesian; clustering; forensic statistics; handwriting analysis; hierarchical modeling

Year:  2020        PMID: 33664929      PMCID: PMC7894190          DOI: 10.1002/sam.11488

Source DB:  PubMed          Journal:  Stat Anal Data Min        ISSN: 1932-1864            Impact factor:   1.051


  3 in total

1.  Tight clustering: a resampling-based approach for identifying stable and tight patterns in data.

Authors:  George C Tseng; Wing H Wong
Journal:  Biometrics       Date:  2005-03       Impact factor: 2.571

2.  Text-independent writer identification and verification using textural and allographic features.

Authors:  Marius Bulacu; Lambert Schomaker
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2007-04       Impact factor: 6.226

3.  A Set of Handwriting Features for Use in Automated Writer Identification.

Authors:  John J Miller; Robert Bradley Patterson; Donald T Gantz; Christopher P Saunders; Mark A Walch; JoAnn Buscaglia
Journal:  J Forensic Sci       Date:  2017-01-05       Impact factor: 1.832

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.