| Literature DB >> 34737324 |
Oliver Lung1, Rebecca Candlish2, Michelle Nebroski2, Peter Kruckiewicz2, Cody Buchanan2, Mariko Moniwa2.
Abstract
Cell lines are widely used in research and for diagnostic tests and are often shared between laboratories. Lack of cell line authentication can result in the use of contaminated or misidentified cell lines, potentially affecting the results from research and diagnostic activities. Cell line authentication and contamination detection based on metagenomic high-throughput sequencing (HTS) was tested on DNA and RNA from 63 cell lines available at the Canadian Food Inspection Agency's National Centre for Foreign Animal Disease. Through sequence comparison of the cytochrome c oxidase subunit 1 (COX1) gene, the species identity of 53 cell lines was confirmed, and eight cell lines were found to show a greater pairwise nucleotide identity in the COX1 sequence of a different species within the same expected genus. Two cell lines, LFBK-αvβ6 and SCP-HS, were determined to be composed of cells from a different species and genus. Mycoplasma contamination was not detected in any cell lines. However, several expected and unexpected viral sequences were detected, including part of the classical swine fever virus genome in the IB-RS-2 Clone D10 cell line. Metagenomics-based HTS is a useful laboratory QA tool for cell line authentication and contamination detection that should be conducted regularly.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34737324 PMCID: PMC8569163 DOI: 10.1038/s41598-021-00779-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Cell lines with species identity determined by sequencing that matched institute records or were previously unknown.
| Cell line | Species | cDNA run | Total reads | Number of reads mapped to | Breadth of coverage of | Mean depth of coverage of | Total variants | GenBank reference accession |
|---|---|---|---|---|---|---|---|---|
| 3T6-Swiss Albino | 2 | 715,514 | 128 | 99.3 | 12.3 | 0 | KY018919.1 | |
| A549 | 2 | 2,306,896 | 1374 | 100.0 | 141.8 | 0 | MW389273.1 | |
| BHK-21 | 2 | 706,654 | 39 | 99.2 | 3.4 | 0 | EU660218.1 | |
| CEF | 2 | 711,596 | 1882 | 100.0 | 238.5 | 0 | MN013407.1 | |
| CHCC-OU2 | 2 | 2,327,710 | 9681 | 100.0 | 1,070.4 | 0 | MN013407.1 | |
| CHO-K1 | 2 | 3,254,926 | 300 | 100.0 | 20.4 | 0 | KX576660.1 | |
| COS-1 | 2 | 1,438,372 | 141 | 99.5 | 11.0 | 0 | MN816163.1 | |
| CPAE | 1 | 5,933,398 | 1676 | 100.0 | 146.6 | 0 | MF663794.1 | |
| CV-1 | 2 | 923,170 | 277 | 100.0 | 22.3 | 0 | MN816163.1 | |
| DE | 2 | 840,902 | 505 | 100.0 | 51.8 | 0 | MH744426.1 | |
| DF-1 | 2 | 1,145,492 | 805 | 100.0 | 88.6 | 0 | MK163563.1 | |
| Efk-1B | 2 | 1,015,466 | 311 | 95.5 | 25.6 | 2 | MF143474.1 | |
| Efk-2F | 2 | 1,137,244 | 257 | 95.5 | 20.9 | 2 | MF143474.1 | |
| Efk-3B | 2 | 1,051,616 | 246 | 95.4 | 18.8 | 2 | MF143474.1 | |
| EL4-IL2 | 2 | 2,016,368 | 735 | 100.0 | 86.3 | 1 | KY018919.1 | |
| H1299 | 2 | 1,322,326 | 392 | 100.0 | 37.2 | 0 | MW389273.1 | |
| HEK-293 | 2 | 2,997,732 | 845 | 100.0 | 95.0 | 0 | X93334.1 | |
| IB-RS-2 Clone D10 | 2 | 4,731,602 | 2846 | 100.0 | 282.8 | 1 | MF183225.1 | |
| IPAM 3C10 | 2 | 6,759,802 | 2827 | 100.0 | 238.2 | 0 | MH603005.1 | |
| IPAM 3C8 | 2 | 1,701,142 | 229 | 100.0 | 17.0 | 0 | MT199606.1 | |
| IPAM 3E8 | 2 | 1,214,488 | 160 | 100.0 | 13.6 | 0 | MG250562.1 | |
| IPAM 3F6 | 2 | 1,948,740 | 434 | 100.0 | 33.5 | 0 | MG250562.1 | |
| L929 | 2 | 1,134,052 | 164 | 100.0 | 21.0 | 0 | EU315228.1 | |
| LK-W(14) | 2 | 3,834,432 | 5324 | 100.0 | 613.8 | 0 | EF490453.1 | |
| LLC-PK1 | 2 | 862,572 | 423 | 100.0 | 48.2 | 0 | AF486866.1 | |
| LMH | 2 | 847,388 | 524 | 100.0 | 56.5 | 0 | MN013407.1 | |
| LT | 2 | 1,165,940 | 452 | 100.0 | 38.0 | 0 | EF490453.1 | |
| MDBK-HS-1 | 2 | 2,071,902 | 846 | 100.0 | 98.2 | 0 | MN714195.1 | |
| MDBK-HS-2 | 2 | 658,222 | 103 | 100.0 | 10.9 | 0 | MN714195.1 | |
| MDCK SIAT1 | 2 | 1,270,062 | 506 | 100.0 | 53.6 | 0 | KM061581.1 | |
| MDCK2 | 2 | 1,474,730 | 2038 | 100.0 | 236.5 | 0 | KM061555.1 | |
| MDCK-PGOK | 2 | 1,242,568 | 153 | 100.0 | 18.9 | 0 | KM061581.1 | |
| OA3.Ts | 2 | 1,148,750 | 647 | 100.0 | 55.0 | 0 | KU681212.1 | |
| OA4K/s1 | 2 | 900,506 | 182 | 98.5 | 13.1 | 0 | MT768116.1 | |
| P3X63-Ag8-653 | 2 | 1,999,368 | 2896 | 100.0 | 305.4 | 0 | AY533105.1 | |
| PaKi | 2 | 1,002,720 | 223 | 99.9 | 19.7 | 0 | KF726143.1 | |
| PK-15 (PCV-) | 1 | 1,116,214 | 1123 | 100.0 | 98.1 | 0 | KT279758.1 | |
| PK-15 (PCV +) | 1 | 3,916,490 | 757 | 100.0 | 68.2 | 0 | KT279758.1 | |
| QT-35 | 2 | 824,636 | 1183 | 100.0 | 133.8 | 0 | KX712089.1 | |
| RK13 | 2 | 729,418 | 320 | 100.0 | 34.3 | 5 | MN296708.1 | |
| SC-1 | 2 | 1,296,288 | 1101 | 100.0 | 91.0 | 0 | GU261694.1 | |
| SIRC | 2 | 614,400 | 146 | 100.0 | 15.2 | 1 | MN296708.1 | |
| SK-6 | 2 | 867,492 | 162 | 100.0 | 18.0 | 0 | MG250562.1 | |
| ST | 1 | 2,051,240 | 707 | 100.0 | 60.5 | 0 | AF486866.1 | |
| TG180 | 2 | 4,749,142 | 1263 | 100.0 | 130.8 | 0 | KP260515.1 | |
| Tni | 2 | 1,449,444 | 348 | 100.0 | 43.9 | 0 | NC_045936.1 | |
| WSL-R-HP | 2 | 913,228 | 804 | 100.0 | 65.8 | 0 | MF183225.1 | |
| ZZR | 2 | 988,600 | 262 | 100.0 | 28.0 | 0 | MH229952.1 | |
| BD41/31 | 2 | 1,020,194 | 413 | 100.0 | 47.5 | 0 | MH603005.1 | |
| MARC-145 | 2 | 594,988 | 222 | 100.0 | 14.6 | 0 | MT481926.1 | |
| MRC5 | 2 | 858,666 | 249 | 100.0 | 23.7 | 0 | MK059615.1 | |
| N418 | 2 | 1,180,212 | 194 | 100.0 | 22.2 | 0 | KY018919.1 | |
| OUR-1 | 2 | 1,615,188 | 129 | 100.0 | 10.1 | 0 | KP260516.1 | |
Threshold for coverage depth for calling breadth and depth of coverage was set to 3, while the minimum variant frequency = 0.75.
Cell lines in which a different species from institution records was identified.
| Cell line | Species | cDNA run | Total reads | Number of reads mapped to | Breadth of coverage of | Mean depth of coverage of | Total variants | Reference mitogenome accession |
|---|---|---|---|---|---|---|---|---|
| LFBK-αVβ6 | 2 | 4,351,208 | JN601075.1 | |||||
| MN200869.1 | ||||||||
| SCP-HS | 2 | 511,706 | MF663794.1 | |||||
| KU681212.1 | ||||||||
| BGMK | 2 | 896,826 | EF597501.1 | |||||
| 320 | 100.0 | 40.1 | 5 | JX983774.1 | ||||
| MN816163.1 | ||||||||
| 287 | 100.0 | 36.8 | 47 | |||||
| CGBQ | 2 | 2,426,176 | MN356388.1 | |||||
| 3363 | 100.0 | 536.1 | 0 | MK102803.1 | ||||
| MN122908.1 | ||||||||
| 3352 | 99.9 | 534.7 | 19 | |||||
| MA-104 | 2 | 947,256 | MT481926.1 | |||||
| 127 | 100.0 | 13.5 | 0 | |||||
| MN816163.1 | ||||||||
| 96 | 100.0 | 9.7 | 49 | |||||
| PaLu | 2 | 1,030,394 | NC_046926.1 | |||||
| 97 | 100.0 | 10.1 | 41 | |||||
| KF726143.1 | ||||||||
| 96 | 100.0 | 10.4 | 37 | |||||
| PaSPT | 2 | 936,338 | NC_046926.1 | |||||
| 93 | 100.0 | 12.7 | 38 | |||||
| KF726143.1 | ||||||||
| 91 | 100.0 | 12.4 | 37 | |||||
| Vero | 2 | 461,114 | JQ256913.1 | |||||
| 223 | 100.0 | 23.3 | 2 | EF597503.1 | ||||
| MN816163.1 | ||||||||
| 161 | 99.1 | 17.9 | 67 | |||||
| Vero Nectin-4 | 2 | 633,176 | JQ256913.1 | |||||
| 294 | 100.0 | 40.2 | 2 | EF597503.1 | ||||
| MN816163.1 | ||||||||
| 223 | 100.0 | 33.5 | 70 | |||||
| Vero-76 | 1 | 1,284,012 | EF597503.1 | |||||
| 598 | 100.0 | 64.8 | 2 | EF597503.1 | ||||
| MN816163.1 | ||||||||
| 534 | 100.0 | 59.5 | 70 | |||||
Species names in bold are the observed species while the non-bolded names are the expected species based on laboratory records. Bold values indicate results to COX1 reference while non-bolded cells are to CytB reference. Variants were only called if the read depth had a minimum coverage of 3×. Threshold for coverage depth for calling breadth and depth of coverage was set to 3, while the minimum variant frequency = 0.75.
Figure 1Reference assemblies of LFBK-αvβ6 and SCP-HS reads to references of the expected species and top BLASTn-matched mitogenomes. The nf-illmap workflow was used to map reads from the LFBK-αvβ6 and SCP-HS cell lines to reference COX1 sequences from the expected species of each cell line and the species which showed the top BLAST match to the de novo-assembled sequences. (A) LFBK reads were mapped to B. taurus and S. Scrofa. (B) SCP-HS was mapped to O. aries and B. taurus. The Y-axis shows the coverage of each genome position. Positions of variants are indicated by the grey lines below the graphs.
List of viral genomes detected in the cell lines.
| Cell line | Species | Virus | Total reads | Number of reads mapped to virus | Breadth of coverage (%) | Mean depth of coverage | Total variants | Viral reference genome accession |
|---|---|---|---|---|---|---|---|---|
| CPAE | Bovine viral diarrhea virus 2 (BVDV) | 5,933,398 | 173 | 82.8 | 2.1 | 3 | MN824468.1 | |
| OA3.Ts | 1,148,750 | 27 | 18.0 | 0.3 | 7 | MH806437.1 | ||
| IB-RS-2 Clone D10 | Classical swine fever virus (CSFV) | 4,731,602 | 53 | 39.5 | 0.8 | 7 | X96550.1 | |
| P3X63-Ag8-653 | Murine leukemia virus (MuLV) | 1,999,368 | 3470 | 99.3 | 76.3 | 124 | KY574512.1 | |
| Tni | Flock house virus | 1,449,444 | 67 | 87.4 | 4.5 | 8 | EF690537.1 | |
| A549 | Human adenovirus C | 2,306,896 | 17 | 3.5 | 0.1 | 0 | KF429754.1 | |
| HEK-293 | 2,997,732 | 221 | 8.8 | 1.3 | 1 | |||
| PK-15 (PCV+) | Porcine circovirus 1 (PCV1) | 3,916,490 | 217,311 | 100 | 15,571.2 | 5 | MK770354.1 | |
| IPAM 3C10 | 6,759,802 | 1566 | 100 | 80.0 | 6 | MK770354.1 | ||
| IPAM 3C8 | 1,701,142 | 3390 | 100 | 186.9 | 6 | MK770354.1 | ||
| IPAM 3E8 | 1,214,488 | 2939 | 100 | 174.3 | 6 | MK770354.1 | ||
| IPAM 3F6 | 1,948,740 | 2384 | 100 | 143.1 | 5 | AY754015.1 | ||
Only retroviruses with > 98% genome coverage were included. Variants were only called if the read depth had a minimum coverage of 3×. Threshold for coverage depth for calling breadth and depth of coverage was set to 3, while the minimum variant frequency = 0.75.