| Literature DB >> 29322083 |
Anna Agius1, Marie Morelato1, Sébastien Moret1, Scott Chadwick1, Kylie Jones2, Rochelle Epple2, James Brown3, Claude Roux1.
Abstract
The data presented here is related to the article titled, "Using handwriting to infer a writer's country of origin for forensic intelligence purposes" (Agius et al., 2017) [1]. This article reports original writer, spatial and construction characteristic data for thirty-seven English Australian writers and thirty-seven Vietnamese writers. All of these characteristics were coded and recorded in Microsoft Excel 2013 (version 15.31). The construction characteristics coded were only extracted from seven characters, which were: 'g', 'h', 'th', 'M', '0', '7' and '9'. The coded format of the writer, spatial and construction characteristics is made available in this Data in Brief in order to allow others to perform statistical analyses and modelling to investigate whether there is a relationship between the handwriting features and the nationality of the writer, and whether the two nationalities can be differentiated. Furthermore, to employ mathematical techniques that are capable of characterising the extracted features from each participant.Entities:
Year: 2017 PMID: 29322083 PMCID: PMC5752089 DOI: 10.1016/j.dib.2017.12.014
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Summary of the features coded in each of the Excel spreadsheets provided.
| Writer & Spatial Characteristics | Writer's ID number; Nationality; Text indented, Valediction indented; Salutation indented; Paragraph 1; Paragraph 2; Slant; Handedness; Number of letters overhanging the right margin; Age; Years they have learnt English; Height of text; Width of text; Area of text; Average height of the letters ‘a’, ‘e’ and ‘o’; and average number of letters per line |
| Lowercase ‘g’ | Writer ID number; Nationality; Handedness; Slant; Allograph; Letter number and the word the letter came from; Letter position within the word; Descending stroke design; Upper loop design; Connected to previous letter; Connected to subsequent letter; Direction of upper loop; Age; and the years they have learnt English |
| Lowercase ‘h’ and ‘th’ | Writer ID number; Nationality; Handedness; Slant; Allograph; Letter number and the word the letter came from; Letter position within word; Connected to previous letter; Connected to subsequent letter; Curve at the top of staff; Relative height of arch to staff; Arch retrace of staff; Arch shape; Height of ‘h’ staff relative to ‘t’ staff; Age; Number of strokes to form ‘th’ combination; and the years they have learnt English |
| Uppercase ‘M’ | Writer ID number; Nationality; Handedness; Slant; Allograph; Letter number and the word the letter came from; Retrace on initial stroke; Apex 1 shape; Apex 2 shape; Apex 3 shape; Relative height of apex 2 to apex 1; Position of apex 2 relative to 1 and 3; Age; Number of strokes; and years they have learnt English |
| Number ‘0’ | Writer ID number; Nationality; Handedness; Slant; Allograph; Number allocated to each ‘0’ written by the participant; Start position; Finished position; Connection to start stroke; Embellishment; Direction; Age; and years they have learnt English |
| Number ‘7’ | Writer ID number; Nationality; Handedness; Slant; Allograph; Number allocated to each ‘7’ written by the participant; Style; Curve at bottom of staff; Angle between the horizontal and vertical strokes; Age; Number of strokes; and years they have learnt English |
| Number ‘9’ | Writer ID number; Nationality; Handedness; Slant; Allograph; number allocated to each ‘9’ written by the participant; Upper loop design; Descending stroke design; Direction of upper loop; Age; Number of strokes; and years they have learnt English |
Fig. 1Source document (left) and provided lined piece of white A4 copy paper to write out the source document (right).
Fig. 2Upper and lowercase letters, numbers and postal addresses written out by participants.
Fig. 3Space provided for participants to write out two postal addresses.
Fig. 4Diagram of the spatial features recorded from the source document.
Qualitative spatial characteristic codes.
| Salutation indented | 0 | No indent |
| 1 | Indent | |
| Text indented | 0 | No indent |
| 1 | Indent | |
| Valediction indented | 0 | No indent |
| 1 | Indent | |
| Paragraph (1) | 0 | No space between salutation and text body |
| 1 | Space between salutation and text body | |
| Paragraph (2) | 0 | No space between text and valediction |
| 1 | Space between text body and valediction |
Fig. 5Example of measuring the average height of ‘a’, ‘e’ and ‘o’ from the text. The distance between the top and bottom lines of best fit (red lines) was measured.
The number of characters that had their construction characteristics extracted and coded.
| g | 10 |
| h | 28 |
| th | 13 |
| M | 8 |
| 0 | 12 |
| 7 | 8 |
| 9 | 7 |
Examples of all of the construction characteristics extracted for each character and their respective codes.
| Forensic science | |
| Handwriting examination, forensic intelligence | |
| Table | |
| Manual coding | |
| Categorical, numerical data | |
| Handwriting specimens were scanned and coded | |
| Features of each participants’ handwriting were selected, extracted and coded | |
| Handwriting specimens were collected from people living in the Sydney region (and whom had learnt to write English in New South Wales). All original documents are stored at the University of Technology Sydney (UTS), 15 Broadway, Ultimo, Sydney, NSW, 2007. | |
| The data is made available with this article | |
| Agius et al. |