| Literature DB >> 22595091 |
Muhammad N Anwar1, Michael P Oakes.
Abstract
BACKGROUND: This paper describes the analysis of a database of over 180,000 patient records, collected from over 23,000 patients, by the hearing aid clinic at James Cook University Hospital in Middlesbrough, UK. These records consist of audiograms (graphs of the faintest sounds audible to the patient at six different pitches), categorical data (such as age, gender, diagnosis and hearing aid type) and brief free text notes made by the technicians. This data is mined to determine which factors contribute to the decision to fit a BTE (worn behind the ear) hearing aid as opposed to an ITE (worn in the ear) hearing aid.Entities:
Mesh:
Year: 2012 PMID: 22595091 PMCID: PMC3339393 DOI: 10.1186/1472-6947-12-S1-S6
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Component coefficient vectors of PCA
| PC1 | PC2 | PC3 | PC4 | |
|---|---|---|---|---|
| AC250 | -0.3001 | -0.3811 | 0.2988 | -0.1677 |
| AC500 | -0.3218 | -0.3619 | 0.2754 | -0.0166 |
| AC1000 | -0.3410 | -0.1999 | 0.2427 | 0.2643 |
| AC2000 | -0.3436 | 0.1440 | 0.1910 | 0.2697 |
| AC4000 | -0.3031 | 0.3673 | 0.2409 | -0.1742 |
| AC8000 | -0.2722 | 0.3186 | 0.2629 | -0.4684 |
| BC250 | -0.2510 | -0.2304 | -0.4890 | -0.5087 |
| BC500 | -0.2942 | -0.2404 | -0.4152 | -0.0846 |
| BC1000 | -0.3189 | -0.0760 | -0.3052 | 0.3595 |
| BC2000 | -0.3028 | 0.2699 | -0.2419 | 0.4088 |
| BC4000 | -0.2516 | 0.4870 | -0.2219 | -0.1299 |
Observed and expected frequencies for ITE/BTE aid with gender
| Hearing aid type | Male | Female | Row total |
|---|---|---|---|
| BTE | 3196 | 3850 | 7046 |
| ITE | 3647 | 3617 | 7264 |
| Column total | 6843 | 7467 | 14310 |
Most significant positive and negative keywords in records with BTE/ITE aid [11]
| Positive keywords | Negative keywords | |
|---|---|---|
| BTE | mould, be34, map, gp, 92, audio, inf, be52, ref, staff, reqd, be36, contact | fta, reshel, appt, it, nn, nfa, 2001, rev, lacquer, hn, km, imp, review, 2000 |
| ITE | fta, reshel, appt, it, nn, nfa, 2001, rev, lacquer, hn, km, imp, review, 2000, nh, vent, progress, aid, dt, taken | mould, be34, map, gp, 92, audio, inf, be52, ref, staff, reqd, be36, contact, tri, n, order |
Logistic regression for BC4000
| Regression coefficient b | Standard error se(b) | Z | P | |
|---|---|---|---|---|
| Constant | -0.09 | 0.08 | -1.12 | 0.26 |
| Bc4000_ind1 | -0.15 | 0.11 | -1.33 | 0.18 |
| BC4000_ind2 | -0.20 | 0.09 | -2.12 | 0.03 |
| BC4000_ind3 | 0.09 | 0.09 | 1.01 | 0.31 |
* Note: Bc4000_ind1, Bc4000_ind2 and Bc4000_ind3 represent bone conduction threshold quartiles of 25, 40 and 55 dB respectively.
Logistic regression for masker
| Regression coefficient b | Standard error se(b) | Z | P | |
|---|---|---|---|---|
| Constant | -0.41 | 0.25 | -1.60 | 0.11 |
| Masker(No_masker, OTHERS) | -0.91 | 0.50 | -1.83 | 0.07 |
Logistic regression for keywords
| Regression coefficient b | Standard error se(b) | Z | P | |
|---|---|---|---|---|
| Constant | -0.16 | 0.03 | -5.63 | 0.00 |
| APPT | 0.06 | 0.15 | 0.37 | 0.71 |
| FTA | -0.77 | 0.19 | -4.05 | 0.00 |
| GP | 0.62 | 0.13 | 4.75 | 0.00 |
| MAP | 2.32 | 0.53 | 4.39 | 0.00 |
| NFA | -0.93 | 0.32 | -2.93 | 0.00 |
| REV | 0.12 | 0.10 | 1.12 | 0.26 |
The thresholds corresponding to the first four Principal Components
| Principal Component (PC) | Frequency (in Hz) | |||||
|---|---|---|---|---|---|---|
| 250 | 500 | 1000 | 2000 | 4000 | 8000 | |
| PC1: Flat hearing loss | 42 | 41 | 40 | 39 | 42 | 44 |
| 45 | 45 | 42 | 42 | 45 | ||
| PC2: High tone sensorineural loss | 37 | 38 | 48 | 69 | 82 | 79 |
| 46 | 46 | 55 | 76 | 89 | ||
| PC3: Air-bone gap (flat) | 78 | 77 | 75 | 71 | 75 | 76 |
| 31 | 35 | 42 | 45 | 47 | ||
| PC4: Air-bone gap (predominant at low tone) | 50 | 59 | 76 | 76 | 50 | 32 |
| 29 | 55 | 82 | 85 | 52 | ||
Observed values (O)
| Hearing aid type | PCA1 | PCA2 | PCA3 | PCA4 |
|---|---|---|---|---|
| ITE | 2036 | 1341 | 476 | 75 |
| BTE | 1119 | 1166 | 1165 | 59 |
(O-E)values
| Hearing aid type | PCA1 | PCA2 | PCA3 | PCA4 |
|---|---|---|---|---|
| ITE | 81.99 | 0.22 | 176.14 | 0.25 |
| BTE | 91.78 | 0.24 | 197.18 | 0.28 |
Logistic regression for gender
| Regression coefficient b | Standard error se(b) | Z | P | |
|---|---|---|---|---|
| Constant | -0.23 | 0.04 | -5.93 | 0 |
| Gender | 0.16 | 0.05 | 3.08 | 0 |
Logistic regression for AC250
| Regression coefficient b | Standard error se(b) | Z | P | |
|---|---|---|---|---|
| Constant | -0.72 | 0.04 | -17.23 | 0 |
| AC250_ind1 | 0.54 | 0.07 | 8.15 | 0 |
| AC250_ind2 | 1.29 | 0.07 | 17.26 | 0 |
| AC250_ind3 | 2.18 | 0.12 | 17.91 | 0 |
* Note: Ac250_ind1, Ac250_ind2 and Ac250_ind3 represent Air conduction at 250 dB quartile of 40, 55 and 75 dB respectively.
Predicted Log odds for AC250
| AC250 group | Logistic regression equation | Predicted log odds |
|---|---|---|
| 0<AC250< = 40 | Log odds = bconstant | -0.72 |
| 40<AC250< = 55 | Log odds = bconstant + bAC250_ind1 | -0.18 |
| 55<AC250< = 75 | Log odds = bconstant + bAC250_ind2 | 0.57 |
| 75<AC250 | Log odds = bconstant + bAC250_ind3 | 1.45 |
Logistic regression - worked example
| Candidate variables (database record) | Actual values | Predicted log odds | Overall predicted log odds |
|---|---|---|---|
| Age | 71 | Not-significant | 0 |
| Gender | Male | -0.23 | -0.23 |
| AC250 | 75 | 0.57 | 0.34 |
| AC500 | 70 | 0.72 | 1.06 |
| AC1000 | 80 | 2.08 | 3.14 |
| AC2000 | 90 | 1.19 | 4.33 |
| AC4000 | 100 | 0.40 | 4.73 |
| AC8000 | 100 | 0.09 | 4.82 |
| BC250 | 40 | -0.03 | 4.79 |
| BC500 | 60 | 0.56 | 5.35 |
| BC1000 | 65 | 0.56 | 5.91 |
| BC2000 | 70 | 0.14 | 6.05 |
| BC4000 | 70 | Not-significant | 6.05 |
| Diagnosis | Tinnitus | Not-significant | 6.05 |
| Hearing aid type | BTE | To be found | 6.05 |
| Masker | No masker | Not-significant | 6.05 |
| Mould | 2107 | 4.09 | 10.14 |
| Free-text words | REV | -0.16+0.12 = -0.04 | 10.1 |
Overall results
| Results | Number of records | Percentage |
|---|---|---|
| Similar | 1170 | 81.64 |
| Not-similar | 263 | 18.35 |
| Total | 1433 |
ITE/BTE aid Precision, Recall, F-score
| ITE | BTE | |
|---|---|---|
| Precision | 0.81 | 0.82 |
| Recall | 0.86 | 0.76 |
| F-score | 0.84 | 0.79 |
ITE/BTE aid predicted results
| Machine results | Human (actual data) | ||
|---|---|---|---|
| ITE | 676 (86%) | 106 (14%) | 782 |
| BTE | 157 (24%) | 494 (76%) | 651 |
| Total | 833 | 600 | 1433 |
Logistic regression for diagnosis
| Regression coefficient b | Standard error se(b) | Z | P | |
|---|---|---|---|---|
| Constant | 0.37 | 0.39 | 0.96 | 0.34 |
| Diagnosis | -1.05 | 0.44 | -2.37 | 0.02 |
Logistic regression for age
| Regression coefficient b | Standard error se(b) | Z | P | |
|---|---|---|---|---|
| Constant | -0.08 | 0.05 | -1.49 | 0.13 |
| Age_ind1 | -0.13 | 0.08 | -1.73 | 0.08 |
| Age_ind2 | -0.26 | 0.08 | -3.48 | 0.00 |
| Age_ind3 | 0.14 | 0.08 | 1.88 | 0.06 |
* Note: Age_ind1, Age_ind2 and Age_ind3 represent age quartiles of 60, 70 and 78 years respectively.
Expected values (E)
| Hearing aid type | PCA1 | PCA2 | PCA3 | PCA4 |
|---|---|---|---|---|
| ITE | 1666.38 | 1324.12 | 866.73 | 70.77 |
| BTE | 1488.62 | 1182.88 | 774.27 | 63.23 |