Baiju R Shah, Maria Chiu, Shubarna Amin, Meera Ramani, Sharon Sadry, Jack V Tu.
Abstract
BACKGROUND: Surname lists are useful for identifying cohorts of ethnic minority patients from secondary data sources. This study sought to develop and validate lists to identify people of South Asian and Chinese origin.
Year: 2010 PMID: 20470433 PMCID: PMC2877682 DOI: 10.1186/1471-2288-10-42
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1. The derivation of the surname-derived ethnic identification file from the Registered Persons Database.
The 200 most common surnames from the South Asian and Chinese surname lists, and from the general population in Ontario.
| South Asian | n | Chinese | n | General population | n |
|---|---|---|---|---|---|
| Patel | 35,984 | Wong | 34,567 | Smith | 91,575 |
| Singh | 31,820 | Chan | 32,692 | Brown | 57,222 |
| Sharma | 10,216 | Li | 27,608 | Lee | 49,898 |
| Kaur | 7,462 | Chen | 25,618 | Wilson | 43,803 |
| Persaud | 6,982 | Wang | 22,548 | Martin | 38,878 |
| Sandhu | 6,229 | Liu | 18,784 | Taylor | 35,746 |
| Grewal | 6,044 | Zhang | 18,003 | Campbell | 34,551 |
| Sidhu | 5,862 | Lam | 15,910 | Williams | 34,104 |
| Dhaliwal | 5,209 | Leung | 13,696 | Thompson | 33,810 |
| Dhillon | 4,847 | Ho | 12,830 | Jones | 32,644 |
| Agarwal, Aggarwal, Ahluwalia, Ahuja, Akhtar, Akhter, Akram, Anand, Arora, Arumugam, Atwal, Aujla, Aulakh, Bains, Bajwa, Baksh, Balachandran, Balasingam, Balasubramaniam, Banerjee, Bansal, Banwait, Bedi, Begum, Beharry, Bhalla, Bhandari, Bhardwaj, Bhatia, Bhatt, Bhatti, Bhavsar, Bhogal, Bhullar, Boodram, Boparai, Brar, Chadha, Chahal, Chand, Chandra, Chaudhary, Chaudhry, Chauhan, Chawla, Cheema, Chohan, Chopra, Choudhry, Choudhury, Chowdhury, Das, Dass, Datta, Deol, Desai, Dhami, Dhanoa, Dhindsa, Dosanjh, Gandhi, Ganesh, Garcha, Ghosh, Ghuman, Gopaul, Gupta, Heer, Hundal, Jaffer, Jafri, Jain, Jassal, Johal, Joshi, Kahlon, Kalra, Kanagaratnam, Kandasamy, Kandiah, Kanji, Kapadia, Kapoor, Karam, Karimi, Kaushal, Khaira, Khanna, Khatri, Khokhar, Kohli, Kumar, Kumarasamy, Ladha, Lakhani, Lal, Lalani, Lall, Mahabir, Mahadeo, Maharaj, Mahendran, Malhi, Malhotra, Mangat, Manji, Manoharan, Maraj, Matharu, Mathur, Mehta, Mistry, Modi, Mohan, Multani, Nadarajah, Naik, Nair, Naraine, Navaratnam, Nijjar, Panchal, Pandher, Pandya, Panesar, Pannu, Parekh, Parikh, Parmar, Parveen, Pathak, Pathan, Pathmanathan, Persad, Prajapati, Prasad, Prashad, Purewal, Puri, Rai, Raja, Rajaratnam, Rajkumar, Ram, Ramcharan, Ramkissoon, Ramnarine, Rampersad, Rampersaud, Ramroop, Randhawa, Rao, Sahota, Saini, Samra, Sangha, Sanghera, Sankar, Sehgal, Sekhon, Selvarajah, Selvaratnam, Sethi, Shanmuganathan, Shergill, Sheth, Shukla, Sinha, Sinnathamby, Sivakumar, Sivasubramaniam, Sodhi, Sohail, Sohal, Sohi, Sood, Sritharan, Subramaniam, Tharmalingam, Thind, Toor, Trahan, Trivedi, Uppal, Varghese, Verma, Virdi, Virk, Vyas, Walia | | An, Au, Bai, Cai, Cao, Chang, Chao, Chau, Cheng, Cheong, Cheung, Chiang, Chin, Ching, Chiu, Cho, Chong, Chou, Chow, Choy, Chu, Chua, Chui, Chun, Chung, Cui, Dai, Deng, Ding, Dong, Du, Duong, Eng, Fan, Fang, Feng, Fok, Fong, Fu, Fung, Gao, Gong, Gu, Guan, Guo, Ha, Han, He, Hong, Hou, Hsu, Hu, Hua, Huang, Hui, Hum, Hung, Hwang, Ing, Ip, Jang, Ji, Jia, Jiang, Jin, Kam, Kan, Ko, Kong, Koo, Ku, Kung, Kuo, Kwan, Kwok, Kwon, Kwong, La, Lai, Lao, Lau, Lei, Leong, Liang, Liao, Lin, Ling, Lo, Lu, Lui, Luk, Lum, Luo, Luong, Ma, Mah, Mai, Mak, Man, Mao, Mei, Meng, Mian, Mo, Mok, Monk, Ng, Ngai, Ong, Ou, Pan, Pang, Peng, Phung, Poon, Qi, Qian, Qin, Qiu, Quan, Ren, Seto, Shao, Shen, Shi, Shum, Sin, Situ, Siu, So, Song, Su, Sun, Sung, Szeto, Ta, Tai, Tam, Tan, Tang, Tao, Tian, To, Tom, Tong, Tsai, Tsang, Tse, Tsui, Tu, Tung, Wan, Wei, Wen, Wing, Woo, Wu, Xia, Xiao, Xie, Xu, Xue, Yan, Yang, Yao, Yap, Yau, Ye, Yee, Yeh, Yeung, Yi, Yim, Yin, Yip, Yiu, Yong, Yoon, Yu, Yuan, Yue, Yuen, Yung, Zeng, Zhao, Zheng, Zhong, Zhou, Zhu, Zou | | Adams, Ahmed, Alexander, Ali, Allen, Anderson, Andrews, Armstrong, Bailey, Baker, Barnes, Bélanger, Bell, Bennett, Black, Boyd, Bradley, Brooks, Burke, Burns, Butler, Cameron, Carter, Chapman, Choi, Clark, Clarke, Cole, Collins, Cook, Cooper, Cox, Craig, Crawford, Cunningham, Da Silva, Davidson, Davies, Davis, Dawson, Dixon, Douglas, Doyle, Duncan, Dunn, Edwards, Elliott, Ellis, Evans, Ferguson, Fernandes, Ferreira, Fisher, Fleming, Ford, Foster, Fox, Francis, Fraser, Gagnon, Garcia, Gauthier, George, Gibson, Gill, Gordon, Graham, Grant, Gray, Green, Hall, Hamilton, Harris, Harrison, Hart, Harvey, Hassan, Hayes, Henderson, Henry, Hill, Holmes, Howard, Hughes, Hunt, Hunter, Huynh, Jackson, James, Johnson, Johnston, Kelly, Kennedy, Kerr, Khan, Kim, King, Knight, Lalonde, Lawrence, Le, Leblanc, Lewis, Little, Macdonald, Mackenzie, Maclean, Macleod, Mann, Marshall, Mason, Matthews, McDonald, McIntyre, McKay, McKenzie, McLean, McLeod, Miller, Mills, Mitchell, Mohamed, Moore, Morgan, Morin, Morris, Morrison, Murphy, Murray, Nelson, Nguyen, O'Brien, Palmer, Park, Parker, Parsons, Patterson, Paul, Payne, Pereira, Perry, Peters, Phillips, Porter, Powell, Price, Reid, Reynolds, Richards, Richardson, Roberts, Robertson, Robinson, Rogers, Rose, Ross, Roy, Russell, Ryan, Santos, Saunders, Scott, Seguin, Shah, Shaw, Silva, Simpson, Spencer, Stevens, Stevenson, Stewart, Sullivan, Sutherland, Thomas, Thomson, Tran, Tremblay, Turner, Walker, Wallace, Walsh, Ward, Warren, Watson, White, Williamson, Wood, Woods, Wright, Young | |
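As an illustration of how surname lists like these might be applied to a secondary data source, the following is a minimal sketch of a case-insensitive surname lookup. The function name `classify_surname` and the two sets are hypothetical, and the sets hold only the ten most common surnames per list from the table above; this is not the authors' procedure, which is summarized in Figure 1.

```python
# Hypothetical, truncated surname sets: only the ten most common names
# from each list in the table above, not the full published lists.
SOUTH_ASIAN = {
    "patel", "singh", "sharma", "kaur", "persaud",
    "sandhu", "grewal", "sidhu", "dhaliwal", "dhillon",
}
CHINESE = {
    "wong", "chan", "li", "chen", "wang",
    "liu", "zhang", "lam", "leung", "ho",
}


def classify_surname(surname: str) -> str:
    """Return a surname-derived label using exact, case-insensitive matching."""
    key = surname.strip().casefold()
    if key in SOUTH_ASIAN:
        return "South Asian"
    if key in CHINESE:
        return "Chinese"
    # Anything not on either list falls into the general population.
    return "General population"


for name in ["Patel", " wong ", "Lee", "Smith"]:
    print(name.strip(), "->", classify_surname(name))
```

Surnames such as Lee, which appears under the general population in the table above, match neither set and are left unclassified by this sketch.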
Unweighted and weighted baseline characteristics of the study population
| Characteristic | South Asian | Chinese | General population |
|---|---|---|---|
| N | 1,400 | 1,129 | 67,330 |
| Surname-derived ethnicity | |||
| South Asian | 654 | 4 | 129 |
| Chinese | 9 | 899 | 139 |
| General population | 737 | 226 | 67,062 |
| Proportion of the population | 5.4% | 4.0% | 90.6% |
| Sex | |||
| Male | 54.7% | 52.5% | 48.4% |
| Female | 45.3% | 47.5% | 51.6% |
| Age | |||
| 44 or younger | 64.4% | 62.6% | 52.0% |
| 45 to 64 | 29.2% | 27.6% | 31.4% |
| 65 or older | 6.4% | 9.8% | 16.6% |
| Immigration status | |||
| Born in Canada | 7.0% | 9.6% | 73.7% |
| Immigrant ≤10 years | 47.2% | 38.1% | 6.0% |
| Immigrant 11 to 20 years | 25.9% | 28.3% | 4.8% |
| Immigrant > 20 years | 19.9% | 24.0% | 15.5% |
Test characteristics of the South Asian and Chinese surname lists compared against self-reported ethnicity
| Surname list | Sensitivity (%) | Specificity (%) | Positive predictive value (%) | Negative predictive value (%) |
|---|---|---|---|---|
| South Asian | 50.4 | 99.7 | 89.3 | 97.2 |
| Chinese | 80.2 | 99.7 | 91.9 | 99.2 |
| Previously published Chinese list [9] | 82.5 | 99.7 | 91.2 | 99.3 |
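The four test characteristics can be derived from a cross-tabulation of surname-derived against self-reported ethnicity. The sketch below uses the counts from the baseline characteristics table, assuming they are the unweighted counts; the function name `diagnostic_metrics` and the dictionary layout are illustrative choices. Because the published percentages appear to be weighted, the unweighted results here differ from them (for example, the unweighted South Asian sensitivity is 654/1,400, about 46.7%, against the reported 50.4%).

```python
# Counts from the baseline characteristics table, assumed to be unweighted.
# Outer key: surname-derived ethnicity; inner key: self-reported ethnicity.
COUNTS = {
    "South Asian": {"South Asian": 654, "Chinese": 4, "General population": 129},
    "Chinese": {"South Asian": 9, "Chinese": 899, "General population": 139},
    "General population": {"South Asian": 737, "Chinese": 226, "General population": 67062},
}


def diagnostic_metrics(counts, group):
    """Treat `group` as the positive class and return the four test characteristics."""
    tp = counts[group][group]
    # Flagged by the surname list but self-reported as another group.
    fp = sum(n for truth, n in counts[group].items() if truth != group)
    # Self-reported as `group` but not flagged by the surname list.
    fn = sum(row[group] for predicted, row in counts.items() if predicted != group)
    total = sum(sum(row.values()) for row in counts.values())
    tn = total - tp - fp - fn
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


for group in ("South Asian", "Chinese"):
    metrics = diagnostic_metrics(COUNTS, group)
    print(group, {name: round(100 * value, 1) for name, value in metrics.items()})
```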
Positive predictive value (%) of the South Asian and Chinese surname lists compared against self-reported ethnicity, stratified by sex, age and immigration status.
| Characteristic | South Asian | Chinese |
|---|---|---|
| Overall | 89.3 | 91.9 |
| Sex | ||
| Male | 91.4 | 92.1 |
| Female | 86.5 | 91.7 |
| Age | ||
| 44 or younger | 89.2 | 92.6 |
| 45 to 64 | 88.0 | 90.0 |
| 65 or older | 94.6 | 93.0 |
| Immigration status | ||
| Born in Canada | 58.7 | 77.8 |
| Immigrant ≤10 years | 95.2 | 95.9 |
| Immigrant 11 to 20 years | 89.6 | 94.1 |
| Immigrant > 20 years | 88.6 | 88.1 |
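The overall positive predictive values can be cross-checked against the reported sensitivity, specificity and population proportion using Bayes' rule. The sketch below assumes that the "Proportion of the population" row of the baseline table can stand in for prevalence; the function name `ppv_from_rates` is illustrative. With the rounded published inputs it gives roughly 90.6% for the South Asian list and 91.8% for the Chinese list, close to the reported 89.3% and 91.9%, with the gaps plausibly due to rounding of the inputs.

```python
def ppv_from_rates(sensitivity: float, specificity: float, prevalence: float) -> float:
    """Positive predictive value from sensitivity, specificity and prevalence (Bayes' rule)."""
    true_positive_rate = sensitivity * prevalence
    false_positive_rate = (1.0 - specificity) * (1.0 - prevalence)
    return true_positive_rate / (true_positive_rate + false_positive_rate)


# Inputs are the rounded values from the tables above; using the population
# proportion as the prevalence is an assumption made for this check.
south_asian_ppv = ppv_from_rates(sensitivity=0.504, specificity=0.997, prevalence=0.054)
chinese_ppv = ppv_from_rates(sensitivity=0.802, specificity=0.997, prevalence=0.040)

print(f"South Asian PPV: {100 * south_asian_ppv:.1f}%")
print(f"Chinese PPV: {100 * chinese_ppv:.1f}%")
```

As a general property of this formula, the positive predictive value falls as prevalence falls when sensitivity and specificity are held fixed.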