| Literature DB >> 29320760 |
Andreas Kurtz1, Stefanie Seltmann2, Amos Bairoch3, Marie-Sophie Bittner4, Kevin Bruce5, Amanda Capes-Davis6, Laura Clarke7, Jeremy M Crook8, Laurence Daheron9, Johannes Dewender4, Adam Faulconbridge7, Wataru Fujibuchi10, Alexander Gutteridge11, Derek J Hei12, Yong-Ou Kim13, Jung-Hyun Kim13, Anja Kolb- Kokocinski14, Fritz Lekschas4, Geoffrey P Lomax15, Jeanne F Loring16, Tenneille Ludwig17, Nancy Mah4, Tohru Matsui18, Robert Müller4, Helen Parkinson7, Michael Sheldon19, Kelly Smith20, Harald Stachelscheid21, Glyn Stacey22, Ian Streeter7, Anna Veiga23, Ren-He Xu24.
Abstract
Unambiguous cell line authentication is essential to avoid loss of association between data and cells. The risk for loss of references increases with the rapidity that new human pluripotent stem cell (hPSC) lines are generated, exchanged, and implemented. Ideally, a single name should be used as a generally applied reference for each cell line to access and unify cell-related information across publications, cell banks, cell registries, and databases and to ensure scientific reproducibility. We discuss the needs and requirements for such a unique identifier and implement a standard nomenclature for hPSCs, which can be automatically generated and registered by the human pluripotent stem cell registry (hPSCreg). To avoid ambiguities in PSC-line referencing, we strongly urge publishers to demand registration and use of the standard name when publishing research based on hPSC lines.Entities:
Keywords: cell data referencing; cell databases; cell line authentication; cell line nomenclature; human pluripotent stem cells; stem cell registry
Mesh:
Year: 2018 PMID: 29320760 PMCID: PMC5768986 DOI: 10.1016/j.stemcr.2017.12.002
Source DB: PubMed Journal: Stem Cell Reports ISSN: 2213-6711 Impact factor: 7.765
Most Common Risks Leading to Ambivalent and Wrong Cell Line Names due to Absence of a Standard hPSC Nomenclature and Unique Identifier
| Reason | Risk | Risk Mitigation |
|---|---|---|
| Independent use of identical primary source cell for reprogramming (e.g., commercial provider of primary cells, time lapses between donations [e.g., healthy young and diseased old donor, donation in different labs by same donor]) | high | genetic profiling, preferentially by short tandem repeats (STR); use of acronym identifying donor in unique identifier; link unique identifier to STR profile |
| Distribution of line between labs and other resources such as banks and registries, which rename according to internal nomenclatures | high | establish unchangeable unique identifier; link other names as synonym |
| Change of ownership with renaming | very high | establish unchangeable unique identifier; link different owner's name as synonym |
| Renaming after genetic modification of line (subclones, isogenic lines, reporter lines) | high | establish link to unique identifier in the name |
| Using generic lab-style names (iPSC-1, ALS-4) | very high | establish unchangeable unique identifier; link generic name as synonym |
| Cell culture contamination with other lines and subsequent mix-up and misauthentication | high | genetic profiling, preferentially by STR |
Nomenclature: Elements and Scheme
| Name of Element | No. of Digits | Example | Comments | Maximal Possibilities | |
|---|---|---|---|---|---|
| 1 | generator acronym | 2–6 | XXXXXX | self-assigned code provided by the generator; capital letters | 321,272,380 |
| 2 | cell line type | 1 | “e” or “i” | e: ESC line | >2 |
| i: induced PSC line | |||||
| 3 | donor ID | 3 | 001 | alphanumerical 3-digit code | 46,655 |
| 4 | clone number | 2–3 | A | alphabetic capital letter preceded by hyphen | 702 |
| 5 | subclone or subline | 0–3 | −1 | alphanumerical 2-character code preceded by hyphen | 1,330 |
| total characters | 8–16 | >87 × 109 cell lines per generator acronym | |||
| >28 × 1018 unique names | |||||
Structure and elements of the hPSC nomenclature. Five descriptive elements compose a cell line name. (1) An acronym, for example of the generator institution, laboratory, a registry, or a bank. (2) The cell type—other symbols are, for example, (p) for parthenogenic or (s) for somatic cell nuclear transfer PSC lines. (3) A unique alphanumerical identifier. Depending on future needs, this could be extended to 4 digits (1,679,615 possible permutations). These first three elements together form a unique identifier linking the line to a single origin or donor, or blastocyst in case of ESC. Element (4) identifies each cell line, or clone, which was generated from the donor, and element (5) identifies genetically modified subclones or sublines derived from a given clone. The fixed string of elements helps to detect and avoid syntactic naming errors. The nomenclature does not include any personal information that might compromise the privacy and confidentiality of the donor, as recommended in ICLAC's guidelines for cell line naming. Automated generation of an autonomous donor identifier module as part of the name safeguards privacy even further. An example is provided in Table S1.
Figure 1Naming Scheme for hPSC Lines
A donor is recruited and hPSC lines generated by diverse generators x, y, … n or deposited in banks and registries, which submit the essential name elements to hPSCreg to automatically generate a standard name. A BioSamples ID for the line and the donor is automatically provided in parallel for each cell line. The names and BioSamples IDs are locked and registered in a catalog in the hPSCreg database as well as in the Cellosaurus cell line database. Historical lines with non-standard names are often used in hPSC banks and registries. Registration of these lines in hPSCreg will automatically provide a standard name and link it to the pre-existing names, which remain as synonyms.