| Literature DB >> 29258477 |
Tracy Dudding-Byth1,2,3,4, Anne Baxter5, Elizabeth G Holliday6,7, Anna Hackett5,7,8, Sheridan O'Donnell5, Susan M White9,10, John Attia6,7, Han Brunner11, Bert de Vries11, David Koolen11, Tjitske Kleefstra11, Seshika Ratwatte7,12, Carlos Riveros6, Steve Brain13, Brian C Lovell13,14.
Abstract
BACKGROUND: Massively parallel genetic sequencing allows rapid testing of known intellectual disability (ID) genes. However, the discovery of novel syndromic ID genes requires molecular confirmation in at least a second or a cluster of individuals with an overlapping phenotype or similar facial gestalt. Using computer face-matching technology we report an automated approach to matching the faces of non-identical individuals with the same genetic syndrome within a database of 3681 images [1600 images of one of 10 genetic syndrome subgroups together with 2081 control images]. Using the leave-one-out method, two research questions were specified: 1) Using two-dimensional (2D) photographs of individuals with one of 10 genetic syndromes within a database of images, did the technology correctly identify more than expected by chance: i) a top match? ii) at least one match within the top five matches? or iii) at least one in the top 10 with an individual from the same syndrome subgroup? 2) Was there concordance between correct technology-based matches and whether two out of three clinical geneticists would have considered the diagnosis based on the image alone?Entities:
Keywords: 2D photography; Clinical genetics; Computational biology; Computer vision; Dysmorphology; Facial gestalt; Intellectual disability; Phenotyping; Syndromic
Mesh:
Year: 2017 PMID: 29258477 PMCID: PMC5735520 DOI: 10.1186/s12896-017-0410-1
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Number of images in the database at different stages of analysis
| Syndrome | Images added to database before this stage of analysis | Number of images for syndrome | Total images in database at time of analysis | |
|---|---|---|---|---|
| 1 | Williams | 1 | 183 | 3145 |
| 1 | Rubinstein-Taybi | 1 | 155 | 3145 |
| 1 | Floating Harbor | 1 | 61 | 3145 |
| 1 | Coffin Lowry | 1 | 154 | 3145 |
| 1 | Kabuki | 1 | 195 | 3145 |
| 1 | Smith Magenis | 1 | 124 | 3145 |
| 2 | PACS1 | 2 | 39 | 3432 |
| 2 | Kleefstra | 2 | 128 | 3432 |
| 2 | Koolen-de Vries | 2 | 120 | 3432 |
| 3 | Cornelia de Lange | 192 before Stage 1, then an additional 249 before Stage 3 | 441 | 3681 |
Observed and expected counts of individuals with a syndrome diagnosis for whom the top match was another unrelated individual within the same syndrome subgroup
| Syndrome | Number of images for syndrome | Observed match/No match | Expected match/No match | Chi-square (1) |
|
|---|---|---|---|---|---|
| Williams | 183 | 110/73 | 11/172 | 938.43 | <.00001 |
| Rubinstein-Taybi | 155 | 92/63 | 7/148 | 1068.28 | <.00001 |
| Floating Harbor | 61 | 42/19 | 2/59 | 806.57 | <.00001 |
| Coffin Lowry | 154 | 88/66 | 7/147 | 969.83 | <.00001 |
| Kabuki | 195 | 106/89 | 12/183 | 776.29 | <.00001 |
| Smith Magenis | 124 | 72/52 | 5/119 | 921.61 | <.00001 |
| PACS1 | 39 | 6/33 | 2/37 | 6.46 | 0.01106 |
| Kleefstra | 128 | 72/56 | 5/123 | 920.40 | <.00001 |
| Koolan-de Vries | 120 | 66/54 | 4/116 | 978.17 | <.00001 |
| Cornelia de Lange | 441 | 329/112 | 53/388 | 1627.70 | <.00001 |
Observed and expected counts of individuals with a syndrome diagnosis for whom at least one in the top five matches was another unrelated individual within the same syndrome subgroup
| Syndrome | Number of images for syndrome | Observed match/No match | Expected match/No match | Chi-square (1) |
|
|---|---|---|---|---|---|
| Williams | 183 | 176/7 | 47/136 | 472.74 | <.00001 |
| Rubinstein-Taybi | 155 | 149/6 | 34/121 | 493.94 | <.00001 |
| Floating Harbor | 61 | 58/3 | 5/56 | 600.47 | <.00001 |
| Coffin Lowry | 154 | 133/21 | 34/120 | 366.21 | <.00001 |
| Kabuki | 195 | 187/8 | 53/142 | 461.78 | <.00001 |
| Smith Magenis | 124 | 122/2 | 23/101 | 517.90 | <.00001 |
| PACS1 | 39 | 19/20 | 3/36 | 86.76 | <.00001 |
| Kleefstra | 128 | 108/20 | 22/106 | 401.25 | <.00001 |
| Koolen-de Vries | 120 | 96/24 | 20/100 | 342.02 | <.00001 |
| Cornelia de Lange | 441 | 418/23 | 208/233 | 399.38 | <.00001 |
Observed and expected counts of individuals with a syndrome diagnosis for whom at least one in the top ten matches was another unrelated individual within the same syndrome subgroup
| Syndrome | Number of images for syndrome | Observed match/No match | Expected match/No match | Chi-square (1) |
|
|---|---|---|---|---|---|
| Williams | 183 | 179/4 | 82/101 | 205.76 | <.00001 |
| Rubinstein-Taybi | 155 | 155/0 | 62/93 | 230.01 | <.00001 |
| Floating Harbor | 61 | 58/3 | 11/50 | 239.81 | <.00001 |
| Coffin Lowry | 154 | 146/8 | 61/93 | 193.83 | <.00001 |
| Kabuki | 195 | 192/3 | 92/103 | 203.73 | <.00001 |
| Smith Magenis | 124 | 123/1 | 41/83 | 242.03 | <.00001 |
| PACS1 | 39 | 28/11 | 4/35 | 153.84 | <.00001 |
| Kleefstra | 128 | 119/9 | 40/88 | 224.08 | <.00001 |
| Koolan-de Vries | 120 | 109/11 | 36/84 | 208.58 | <.00001 |
| Cornelia de Lange | 441 | 432/9 | 317/124 | 147.09 | <.00001 |
Concordance between a software-identified top match and whether two of three clinical geneticists would definitely have considered this diagnosis based on the image alone
| Syndrome | Frequency | Correctly classified by neither | Correctly classified by both | Correctly classified by software, but not clinicians | Correctly classified by clinicians, but not software | McNemars Chi-square |
| Kappa |
|---|---|---|---|---|---|---|---|---|
| Williams | 183 | 31 | 66 | 44 | 42 | 0.05 | 0.82925 | 0.02 |
| Rubinstein-Taybi | 155 | 28 | 61 | 31 | 35 | 0.24 | 0.62246 | 0.11 |
| Floating Harbor | 61 | 7 | 33 | 9 | 12 | 0.43 | 0.51269 | 0.16 |
| Coffin Lowry | 154 | 42 | 41 | 47 | 24 | 7.45 | 0.00634 | 0.10 |
| Kabuki | 195 | 23 | 88 | 18 | 66 | 27.43 | <.00001 | 0.09 |
| Smith Magenis | 124 | 34 | 38 | 34 | 18 | 4.92 | 0.02650 | 0.17 |
| PACS1 | 39 | 33 | 2 | 4 | 0 | 4.00 | 0.04550 | 0.46 |
| Kleefstra | 128 | 52 | 9 | 63 | 4 | 51.96 | <.00001 | 0.05 |
| Koolan-de Vries | 120 | 30 | 37 | 29 | 24 | 0.47 | 0.49221 | 0.12 |
| Cornelia de Lange | 441 | 52 | 264 | 65 | 60 | 0.20 | 0.65472 | 0.26 |
Concordance between a software-identified match within the top five closest matches and whether two of three clinical geneticists would definitely have considered this diagnosis based on the image alone
| Syndrome | Frequency | Correctly classified by neither | Correctly classified by both | Correctly classified by software, but not clinicians | Correctly classified by clinicians, but not software | McNemars Chi-square |
| Kappa |
|---|---|---|---|---|---|---|---|---|
| Williams | 183 | 2 | 103 | 73 | 5 | 59.28 | <.00001 | −0.02 |
| Rubinstein-Taybi | 155 | 4 | 94 | 55 | 2 | 49.28 | <.00001 | 0.06 |
| Floating Harbor | 61 | 1 | 43 | 15 | 2 | 9.94 | 0.00162 | 0.02 |
| Coffin Lowry | 154 | 15 | 59 | 74 | 6 | 57.80 | <.00001 | 0.07 |
| Kabuki | 195 | 2 | 148 | 39 | 6 | 24.20 | <.00001 | 0.01 |
| Smith Magenis | 124 | 2 | 56 | 66 | 0 | 66.00 | <.00001 | 0.03 |
| PACS1 | 39 | 20 | 2 | 17 | 0 | 17.00 | 0.00004 | 0.11 |
| Kleefstra | 128 | 18 | 11 | 97 | 2 | 91.16 | <.00001 | 0.00 |
| Koolan-de Vries | 120 | 17 | 54 | 42 | 7 | 25.00 | <.00001 | 0.18 |
| Cornelia de Lange | 441 | 10 | 311 | 107 | 13 | 73.63 | <.00001 | 0.06 |
Concordance between a software-identified match within the top ten closest matches and whether two of three clinical geneticists would definitely have considered this diagnosis based on the image alone
| Syndrome | Frequency | Correctly classified by neither | Correctly classified by both | Correctly classified by software, but not clinicians | Correctly classified by clinicians, but not software | McNemars Chi-square |
| Kappa |
|---|---|---|---|---|---|---|---|---|
| Williams | 183 | 1 | 105 | 74 | 3 | 65.47 | <.00001 | −0.02 |
| Rubinstein-Taybi | 155 | 0 | 96 | 59 | 0 | . | . | 0.00 |
| Floating Harbor | 61 | 1 | 43 | 15 | 2 | 9.94 | 0.00162 | 0.02 |
| Coffin Lowry | 153 | 5 | 63 | 83 | 2 | 77.19 | <.00001 | 0.02 |
| Kabuki | 195 | 1 | 152 | 40 | 2 | 34.38 | <.00001 | 0.02 |
| Smith Magenis | 124 | 1 | 56 | 67 | 0 | 67.00 | <.00001 | 0.01 |
| PACS1 | 39 | 11 | 2 | 26 | 0 | 26.00 | <.00001 | 0.04 |
| Kleefstra | 128 | 9 | 13 | 106 | 0 | 106.00 | <.00001 | 0.02 |
| Koolan-de Vries | 120 | 8 | 58 | 51 | 3 | 42.67 | <.00001 | 0.09 |
| Cornelia de Lange | 441 | 6 | 321 | 111 | 3 | 102.32 | <.00001 | 0.06 |