| Literature DB >> 32127604 |
François Charih1, Matthew Bromwich2,3,4, Amy E Mark2,4, Renée Lefrançois4, James R Green5.
Abstract
Recent mobile and automated audiometry technologies have allowed for the democratization of hearing healthcare and enables non-experts to deliver hearing tests. The problem remains that a large number of such users are not trained to interpret audiograms. In this work, we outline the development of a data-driven audiogram classification system designed specifically for the purpose of concisely describing audiograms. More specifically, we present how a training dataset was assembled and the development of the classification system leveraging supervised learning techniques. We show that three practicing audiologists had high intra- and inter-rater agreement over audiogram classification tasks pertaining to audiogram configuration, symmetry and severity. The system proposed here achieves a performance comparable to the state of the art, but is significantly more flexible. Altogether, this work lays a solid foundation for future work aiming to apply machine learning techniques to audiology for audiogram interpretation.Entities:
Year: 2020 PMID: 32127604 PMCID: PMC7054524 DOI: 10.1038/s41598-020-60898-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Audiogram set composition.
| Number of presentations | Non-trivial | Trivial | Total |
|---|---|---|---|
| Once | 200 | 20 | 220 |
| Twice | 40 | 10 | 100 |
| Total | 280 | 40 | 320 |
Figure 1The Rapid Audiogram Annotation Environment has (A) an efficient user-interface and (B) a scalable cloud architecture.
Questions posed during audiogram annotation.
| Question | Possible answers |
|---|---|
| Is the audiogram symmetrical? | |
| What is the configuration?a,b | |
| How severe is the loss?a,c | |
| Are there potentially unreliable thresholds?a | Possibility to click on unreliable thresholds |
| Are there notches?a | Possibility to click on thresholds in a notch |
aOn a per-ear basis. bOnly required for ears where there is hearing loss. cThe number of descriptors varies between b and c, depending on the configuration provided.
Figure 2Intra-rater reliability calculated from 50 audiogram replicates (error bars represent the standard error from the mean).
Figure 3Inter-rater reliability between three professional audiologists for 270 audiograms (error bars represent the standard error from the mean).
Figure 4Configuration classification decision forest.
Features defined for the configuration classification models.
| Description | |
|---|---|
| 1 | Slope of the line of best fit |
| 2 | Proportion of positive slopes joining consecutive thresholds |
| 3 | Proportion of negative slopes joining consecutive thresholds |
| 4 | Maximum threshold (worst threshold) |
| 5 | Minimum threshold (best threshold) |
| 6 | Average threshold |
| 7 | Standard deviation of the thresholds |
| 8 | Average of thresholds in the low frequency range (below 1,000 Hz) |
| 9 | Average of thresholds in the mid frequency range (between 1,000 Hz and 3,000 Hz) |
| 10 | Average of thresholds in the high frequency range (4,000 Hz and above) |
| 11 | Proportion of slopes that change signs with respect to the previous slope |
| 12 | Mean absolute residual from the line of best fit |
| 13 | Audiogram curvature; highest-order coefficient of the quadratic of best fit |
| 14 | Audiogram range; difference between the maximum and minimum thresholds |
| 15 | Notch index[ |
Performance of our configuration classifiers.
| DDAE | AMCLASS™ | |||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | Recall | Precision | Accuracy | Recall | Precision | |||
| Flat | 0.94 ± 0.02 | 0.84 ± 0.10 | 0.80 ± 0.05 | 0.81 ± 0.07 | 0.90 ± 0.02 | 0.66 ± 0.10 | 0.47 ± 0.09 | 0.55 ± 0.08 |
| Sloping | 0.81 ± 0.07 | 0.81 ± 0.04 | 0.81 ± 0.04 | 0.81 ± 0.04 | 0.81 ± 0.02 | 0.80 ± 0.03 | 0.88 ± 0.03 | 0.84 ± 0.02 |
| Precipitous | 0.88 ± 0.01 | 0.82 ± 0.03 | 0.85 ± 0.02 | 0.83 ± 0.01 | ||||
| Reverse sloping | 0.96 ± 0.01 | 0.83 ± 0.06 | 0.81 ± 0.05 | 0.81 ± 0.01 | 0.81 ± 0.02 | 0.80 ± 0.03 | 0.88 ± 0.03 | 0.84 ± 0.02 |
| Cookie bite | 0.95 ± 0.01 | 0.62 ± 0.02 | 0.74 ± 0.06 | 0.65 ± 0.01 | 0.96 ± 0.01 | 0.38 ± 0.14 | 0.63 ± 0.19 | 0.46 ± 0.14 |
| Reverse cookie bite | 0.93 ± 0.01 | 0.80 ± 0.06 | 0.83 ± 0.05 | 0.81 ± 0.04 | 0.93 ± 0.02 | 0.66 ± 0.09 | 0.66 ± 0.09 | 0.66 ± 0.08 |
| Notched | 0.84 ± 0.04 | 0.71 ± 0.04 | 0.70 ± 0.06 | 0.70 ± 0.05 | N/A | N/A | N/A | N/A |
| Atypical | 0.84 ± 0.04 | 0.61 ± 0.03 | 0.70 ± 0.13 | 0.62 ± 0.04 | 0.75 ± 0.03 | 0.09 ± 0.03 | 0.61 ± 0.16 | 0.15 ± 0.05 |
Features defined for severity classification.
| Description | |
|---|---|
| 1 | Average threshold |
| 2 | Maximum (worst) threshold |
| 3 | Minimum (best) threshold |
| 4 | Average of thresholds in the low range |
| 5 | Maximum (worst) threshold in the low range |
| 6 | Minimum (best) threshold in the low range |
| 7 | Average of thresholds in the mid range |
| 8 | Maximum (worst) threshold in the mid range |
| 9 | Minimum (best) threshold in the mid range |
| 10 | Average of thresholds in the high range |
| 11 | Maximum (worst) threshold in the high range |
| 12 | Minimum (best) threshold in the low range |
| 13 | Maximum (worst) threshold in notch-susceptible frequencies (between 3,000 and 6,000 Hz, inclusively) |
Low range is defined as frequencies below 1,000 Hz, the mid range includes frequencies between 1,000 Hz and 3,000 Hz inclusively, while the high range comprises all frequencies greater or equal to 4,000 Hz.
Accuracy of the severity prediction module of the DDAE.
| Configuration | Lows | Mids | Highs |
|---|---|---|---|
| Flat | 0.87 ± 0.13 | N/A | N/A |
| Sloping | 0.93 ± 0.02 | N/A | 0.96 ± 0.01 |
| Precipitous | 0.95 ± 0.03 | N/A | 0.97 ± 0.02 |
| Reverse sloping | 0.98 ± 0.05 | N/A | 0.97 ± 0.05 |
| Cookie bite | 0.98 ± 0.04 | 1.00 ± 0.00 | 0.97 ± 0.05 |
| Reverse cookie bite | 0.91 ± 0.09 | 0.99 ± 0.03 | 0.93 ± 0.06 |
| Notched | 0.98 ± 0.02 | 0.72 ± 0.02* | 0.54 ± 0.17 |
| Atypical | 0.81 ± 0.16 | 0.66 ± 0.30 | 0.71 ± 0.13 |
*This descriptor represents frequencies most susceptible to host audiometric notches (i.e. 3,000 Hz, 4,000 Hz and 6,000 Hz), and corresponds to the deepest threshold in the notch.
Features defined for the symmetry classification model.
| Description | |
|---|---|
| 1 | Maximum inter-aural threshold difference |
| 2 | Minimum inter-aural threshold difference |
| 3 | Average inter-aural threshold difference |
| 4 | Average inter-aural threshold difference |
| 5 | Difference in the slopes of the lines of best fit |
| 6 | Difference between the average threshold across ears |
Performance on symmetry classification.
| Configuration | Our method | Existing rule |
|---|---|---|
| Accuracy | 0.98 ± 0.02 | 0.89 ± 0.02 |
| Recall | 0.94 ± 0.05 | 1.00 ± 0.00 |
| Precision | 0.99 ± 0.01 | 0.88 ± 0.02 |
| 0.96 ± 0.04 | 0.93 ± 0.01 |