| Literature DB >> 35348008 |
Alanna M Chamberlain1,2, Véronique L Roger2,3, Peter A Noseworthy2, Lin Y Chen4, Susan A Weston1, Ruoxiang Jiang1, Alvaro Alonso5.
Abstract
Background Electronic medical records are increasingly used to identify disease cohorts; however, computable phenotypes using electronic medical record data are often unable to distinguish between prevalent and incident cases. Methods and Results We identified all Olmsted County, Minnesota residents aged ≥18 with a first-ever International Classification of Diseases, Ninth Revision (ICD-9) diagnostic code for atrial fibrillation or atrial flutter from 2000 to 2014 (N=6177), and a random sample with an International Classification of Diseases, Tenth Revision (ICD-10) code from 2016 to 2018 (N=200). Trained nurse abstractors reviewed all medical records to validate the events and ascertain the date of onset (incidence date). Various algorithms based on number and types of codes (inpatient/outpatient), medications, and procedures were evaluated. Positive predictive value (PPV) and sensitivity of the algorithms were calculated. The lowest PPV was observed for 1 code (64.4%), and the highest PPV was observed for 2 codes (any type) >7 days apart but within 1 year (71.6%). Requiring either 1 inpatient or 2 outpatient codes separated by >7 days but within 1 year had the best balance between PPV (69.9%) and sensitivity (95.5%). PPVs were slightly higher using ICD-10 codes. Requiring an anticoagulant or antiarrhythmic prescription or electrical cardioversion in addition to diagnostic code(s) modestly improved the PPVs at the expense of large reductions in sensitivity. Conclusions We developed simple, exportable, computable phenotypes for atrial fibrillation using structured electronic medical record data. However, use of diagnostic codes to identify incident atrial fibrillation is prone to some misclassification. Further study is warranted to determine whether more complex phenotypes, including unstructured data sources or using machine learning techniques, may improve the accuracy of identifying incident atrial fibrillation.Entities:
Keywords: atrial fibrillation; computable phenotype; electronic medical records
Mesh:
Year: 2022 PMID: 35348008 PMCID: PMC9075468 DOI: 10.1161/JAHA.121.023237
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 6.106
Figure 1Flow diagram of patients included in the analysis for the prevalent approach and the incident approach.
Number of patients included in the code only phenotypes are shown in the blue boxes. For the computable phenotypes requiring medications and procedures, the analysis was restricted to 2005 forward (number of patients included are shown in orange boxes). AF indicates atrial fibrillation.
Figure 2Positive predictive value and upper limit of sensitivity of computable phenotypes for atrial fibrillation using the prevalent approach (left panel) and incident approach (right panel).
The sensitivity reported is an upper limit of the true sensitivity of the algorithms and was estimated including patients with electrocardiographic evidence of atrial fibrillation without a diagnostic code. The study period for the computable phenotypes including medications and procedures was January 1, 2005 to December 31, 2014 for the prevalent approach and January 1, 2005 to December 31, 2013 for the incident approach. ICD‐9 indicates International Classification of Diseases, Ninth Revision.
Positive Predictive Value and Upper Limit of Sensitivity* of Various Computable Phenotypes for Atrial Fibrillation Using the Prevalent Approach, January 1, 2000 toDecember 31, 2014
| Code only algorithms | Codes + medications/procedures | |||||||
|---|---|---|---|---|---|---|---|---|
| PPV | Sensitivity | PPV | Sensitivity | |||||
| Computable phenotype | True+ / algorithm+ | % | Algorithm+ / true+ | % | True+ / algorithm+ | % | Algorithm+ / true+ | % |
| 1 code ( | 4376/6135 | 71.3 | 4376/4446 | 98.4 | 1958/2645 | 74.0 | 1958/3143 | 62.3 |
| 2 codes (any type) ≥1 d apart and within 1 y | 4237/5419 | 78.2 | 4237/4446 | 95.3 | 1936/2422 | 79.9 | 1936/3143 | 61.6 |
| 2 codes (any type) >7 d apart and within 1 y | 3933/4814 | 81.7 | 3933/4446 | 88.5 | 1862/2258 | 82.5 | 1862/3143 | 59.2 |
| 2 codes (any type) >30 d apart and within 1 y | 3499/4216 | 83.0 | 3499/4446 | 78.7 | 1737/2074 | 83.8 | 1737/3143 | 55.3 |
| 1 inpatient code OR 2 outpatient codes ≥1 d apart and within 1 y | 4316/5638 | 76.6 | 4316/4446 | 97.1 | 1949/2484 | 78.5 | 1949/3143 | 62.0 |
| 1 inpatient code OR 2 outpatient codes >7 d apart but within 1 y | 4223/5407 | 78.1 | 4223/4446 | 95.0 | 1920/2420 | 79.3 | 1920/3143 | 61.1 |
| 1 inpatient code OR 2 outpatient codes >30 d apart but within 1 y | 4112/5226 | 78.7 | 4112/4446 | 92.5 | 1880/2355 | 79.8 | 1880/3143 | 59.8 |
PPV indicates positive predictive value.
The sensitivity was estimated including patients with ECG evidence of atrial fibrillation without a diagnostic code. However, patients with true atrial fibrillation without an ECG or diagnostic code may have been missed, resulting in an overestimate of the sensitivity. Thus, our sensitivity estimates are an upper limit of the true sensitivity of the algorithms.
The study period for this analysis was January 1, 2005 to December 31, 2014 (N=4459; 4430 with codes and 29 with ECG only). The algorithms incorporating both codes and medications/procedures require the patient to have the listed number and type of diagnostic codes plus a prescription for an anticoagulant drug or an antiarrhythmic drug or an electrical cardioversion procedure within 1 year of the first diagnostic code.
Positive Predictive Value and Upper Limit of Sensitivity* of Various Computable Phenotypes for Atrial Fibrillation Using the Incident Approach, February 1, 2000 toDecember 31, 2013
| Code only algorithms | Codes + medications/procedures | |||||||
|---|---|---|---|---|---|---|---|---|
| PPV | Sensitivity | PPV | Sensitivity | |||||
| Computable phenotype | True+ / algorithm+ | % | Algorithm+ / true+ | % | True+ / algorithm+ | % | Algorithm+ / true+ | % |
| 1 code ( | 3586/5568 | 64.4 | 3586/3652 | 98.2 | 1581/2288 | 69.1 | 1581/2572 | 61.5 |
| Sex | ||||||||
| Male | 1919/2976 | 64.5 | 1919/1954 | 98.2 | 893/1302 | 68.6 | 893/1398 | 63.9 |
| Female | 1667/2592 | 64.3 | 1667/1698 | 98.2 | 688/986 | 69.8 | 688/1174 | 58.6 |
| Age group, y | ||||||||
| Age <65 | 985/1626 | 60.6 | 985/1011 | 97.4 | 431/661 | 65.2 | 431/728 | 59.2 |
| Age 65–74 | 848/1314 | 64.5 | 848/861 | 98.5 | 395/573 | 68.9 | 395/615 | 64.2 |
| Age 75–84 | 1052/1608 | 65.4 | 1052/1064 | 98.9 | 519/720 | 72.1 | 519/719 | 72.2 |
| Age ≥85 | 701/1020 | 68.7 | 701/716 | 97.9 | 236/334 | 70.7 | 236/510 | 46.3 |
| 2 codes (any type) ≥1 d apart and within 1 y | 3480/4929 | 70.6 | 3480/3645 | 95.5 | 1515/2036 | 74.4 | 1515/2578 | 58.8 |
| Sex | ||||||||
| Male | 1865/2627 | 71.0 | 1865/1956 | 95.3 | 860/1157 | 74.3 | 860/1407 | 61.1 |
| Female | 1615/2302 | 70.2 | 1615/1689 | 95.6 | 655/879 | 74.5 | 655/1171 | 55.9 |
| Age group | ||||||||
| Age <65 | 937/1349 | 69.5 | 937/997 | 94.0 | 421/575 | 73.2 | 421/718 | 58.6 |
| Age 65–74 | 834/1160 | 71.9 | 834/866 | 96.3 | 382/509 | 75.0 | 382/620 | 61.6 |
| Age 75–84 | 1027/1470 | 69.9 | 1027/1062 | 96.7 | 491/651 | 75.4 | 491/724 | 67.8 |
| Age ≥85 | 682/950 | 71.8 | 682/720 | 94.7 | 221/301 | 73.4 | 221/516 | 42.8 |
| 2 codes (any type) >7 d apart and within 1 y | 3160/4411 | 71.6 | 3160/3554 | 88.9 | 1485/2004 | 74.1 | 1485/2536 | 58.6 |
| Sex | ||||||||
| Male | 1681/2358 | 71.3 | 1681/1908 | 88.1 | 837/1137 | 73.6 | 837/1385 | 60.4 |
| Female | 1479/2053 | 72.0 | 1479/1646 | 89.9 | 648/867 | 74.7 | 648/1151 | 56.3 |
| Age group, y | ||||||||
| Age <65 | 802/1131 | 70.9 | 802/964 | 83.2 | 397/539 | 73.7 | 397/697 | 57.0 |
| Age 65–74 | 764/1053 | 72.6 | 764/838 | 91.2 | 373/503 | 74.2 | 373/608 | 61.3 |
| Age 75–84 | 962/1346 | 71.5 | 962/1038 | 92.7 | 491/659 | 74.5 | 491/715 | 68.7 |
| Age ≥85 | 632/881 | 71.7 | 632/714 | 88.5 | 224/303 | 73.9 | 224/516 | 43.4 |
| 2 codes (any type) >30 d apart and within 1 y | 2732/3909 | 69.9 | 2732/3461 | 78.9 | 1411/1949 | 72.4 | 1411/2490 | 56.7 |
| Sex | ||||||||
| Male | 1459/2100 | 69.5 | 1459/1858 | 78.5 | 793/1099 | 72.2 | 793/1362 | 58.2 |
| Female | 1273/1809 | 70.4 | 1273/1603 | 79.4 | 618/850 | 72.7 | 618/1128 | 54.8 |
| Age group | ||||||||
| Age <65 | 665/965 | 68.9 | 665/931 | 71.4 | 370/510 | 72.5 | 370/684 | 54.1 |
| Age 65–74 | 680/947 | 71.8 | 680/818 | 83.1 | 360/500 | 72.0 | 360/600 | 60.0 |
| Age 75–84 | 860/1219 | 70.6 | 860/1012 | 85.0 | 466/643 | 72.5 | 466/700 | 66.6 |
| Age ≥85 | 527/778 | 67.7 | 527/700 | 75.3 | 215/296 | 72.6 | 215/506 | 42.5 |
| 1 inpatient code OR 2 outpatient codes ≥1 d apart and within 1 y | 3544/5125 | 69.2 | 3544/3645 | 97.2 | 1520/2073 | 73.3 | 1520/2576 | 59.0 |
| Sex | ||||||||
| Male | 1901/2735 | 69.5 | 1901/1955 | 97.2 | 863/1182 | 73.0 | 863/1403 | 61.5 |
| Female | 1643/2390 | 68.7 | 1643/1690 | 97.2 | 657/891 | 73.7 | 657/1173 | 56.0 |
| Age group | ||||||||
| Age <65 | 960/1416 | 67.8 | 960/998 | 96.2 | 425/590 | 72.0 | 425/719 | 59.1 |
| Age 65–74 | 845/1205 | 70.1 | 845/866 | 97.6 | 385/520 | 74.0 | 385/620 | 62.1 |
| Age 75–84 | 1042/1518 | 68.6 | 1042/1062 | 98.1 | 489/657 | 74.4 | 489/723 | 67.6 |
| Age ≥85 | 697/986 | 70.7 | 697/719 | 96.9 | 221/306 | 72.2 | 221/514 | 43.0 |
| 1 inpatient code OR 2 outpatient codes >7 d apart but within 1 y | 3451/4936 | 69.9 | 3451/3614 | 95.5 | 1509/2066 | 73.0 | 1509/2559 | 59.0 |
| Sex | ||||||||
| Male | 1847/2636 | 70.1 | 1847/1943 | 95.1 | 853/1173 | 72.7 | 853/1399 | 61.0 |
| Female | 1604/2300 | 69.7 | 1604/1671 | 96.0 | 656/893 | 73.5 | 656/1160 | 56.6 |
| Age group | ||||||||
| Age <65 | 901/1314 | 68.6 | 901/981 | 91.8 | 414/576 | 71.9 | 414/708 | 58.5 |
| Age 65–74 | 829/1168 | 71.0 | 829/859 | 96.5 | 383/523 | 73.2 | 383/618 | 62.0 |
| Age 75–84 | 1026/1478 | 69.4 | 1026/1055 | 97.3 | 490/662 | 74.0 | 490/719 | 68.2 |
| Age ≥85 | 695/976 | 71.2 | 695/719 | 96.7 | 222/305 | 72.8 | 222/514 | 43.2 |
| 1 inpatient code OR 2 outpatient codes >30 d apart but within 1 y | 3348/4801 | 69.7 | 3348/3578 | 93.6 | 1497/2058 | 72.7 | 1497/2543 | 58.9 |
| Sex | ||||||||
| Male | 1784/2569 | 69.4 | 1784/1917 | 93.1 | 843/1169 | 72.1 | 843/1387 | 60.8 |
| Female | 1564/2232 | 70.1 | 1564/1661 | 94.2 | 654/889 | 73.6 | 654/1156 | 56.6 |
| Age group | ||||||||
| Age <65 | 846/1244 | 68.0 | 846/962 | 87.9 | 405/565 | 71.7 | 405/700 | 57.9 |
| Age 65–74 | 810/1146 | 70.7 | 810/850 | 95.3 | 381/525 | 72.6 | 381/614 | 62.1 |
| Age 75–84 | 1005/1445 | 69.6 | 1005/1048 | 95.9 | 488/664 | 73.5 | 488/717 | 68.1 |
| Age ≥85 | 687/966 | 71.1 | 687/718 | 95.7 | 223/304 | 73.4 | 223/512 | 43.6 |
PPV indicates positive predictive value.
The sensitivity was estimated including patients with ECG evidence of atrial fibrillation without a diagnostic code. However, patients with true atrial fibrillation without an ECG or diagnostic code may have been missed, resulting in an overestimate of the sensitivity. Thus, our sensitivity estimates are an upper limit of the true sensitivity of the algorithms.
The study period for this analysis was January 1, 2005 to December 31, 2013 (N=3921; 3894 with codes and 27 ECG only). The algorithms incorporating both codes and medications/procedures require the patient to have the listed number and type of diagnostic codes plus a prescription for an anticoagulant drug or an antiarrhythmic drug or an electrical cardioversion procedure within 1 year of the first diagnostic code.
Positive Predictive Value of Various Computable Phenotypes for Atrial Fibrillation Using ICD‐10 codes, February 1, 2016 to December 31, 2018
| Code only algorithms | Codes + medications/procedures | |||
|---|---|---|---|---|
| PPV | PPV | |||
| Computable phenotype | True+ / algorithm+ | % | True+ / algorithm+ | % |
| 1 code ( | 135/200 | 67.5 | 99/143 | 69.2 |
| 2 codes (any type) ≥1 d apart and within 1 y | 130/180 | 72.2 | 97/133 | 72.9 |
| 2 codes (any type) >7 d apart and within 1 y | 120/162 | 74.1 | 94/125 | 75.2 |
| 2 codes (any type) >30 d apart and within 1 y | 112/148 | 75.7 | 89/117 | 76.1 |
| 1 inpatient code OR 2 outpatient codes ≥1 d apart and within 1 y | 133/187 | 71.1 | 98/138 | 71.0 |
| 1 inpatient code OR 2 outpatient codes >7 d apart but within 1 y | 131/184 | 71.2 | 97/136 | 71.3 |
| 1 inpatient code OR 2 outpatient codes >30 d apart but within 1 y | 125/175 | 71.4 | 93/130 | 71.5 |
ICD‐10 indicates International Classification of Diseases, Tenth Revision; PPV, positive predictive value.