Literature DB >> 23594443

The fractured genome of HeLa cells.

David Mittelman, John H Wilson.   

Abstract

Whole-genome sequencing of the widely used HeLa cell line provides a nucleotide-resolution view of a greatly mutated and in some places shattered genome.

Entities:  

Mesh:

Year:  2013        PMID: 23594443      PMCID: PMC3663084          DOI: 10.1186/gb-2013-14-4-111

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


Human cell culture has proven an invaluable tool for revealing key aspects of human biology. The trouble with growing human cells in culture, or rather most primary cells, is that they eventually senesce or cease to divide. The trick to exploiting human cells in culture, for long-term studies, is to immortalize them so that they continue to divide indefinitely. Long before Shay and Wright demonstrated that ectopic expression of human telomerase reverse transcriptase (hTERT) enabled cells to proliferate endlessly, with seemingly normal phenotypes [1], scientists relied on naturally immortalized cells, often derived from the tumors of cancer patients. The first such immortalized cell, the HeLa cell line, was established more than half a century ago from the tumor of a cervical cancer patient called Henrietta Lacks. Although Henrietta Lacks eventually died from her cancer in 1951, the HeLa line has continued to proliferate in culture, becoming one of the most commonly used human cell lines in biomedical research. Roughly 60,000 scientific publications have cited the use of HeLa cells and major discoveries have been made using the cell line, including the development of the polio vaccine in 1952, the link between human papillomavirus (HPV) and cervical cancer, and the role of telomerase in chromosome maintenance [2]. Now, for the first time, Lars Steinmetz and colleagues report a comprehensive genomic analysis and expression profile for the popular Kyoto version of the HeLa cell line [2].

A map of the HeLa genome

Despite the tremendous value and widespread use of HeLa cells, it has been known for some time that these cells, like most cancer cells, are genetically abnormal, and perhaps even more so than typical cancer-derived cell lines. A previous study [3] combined spectral karyotyping, fluorescence in situ hybridization and conventional cytogenetic techniques to reveal extensive chromosomal aberrations, including hyper-triploid chromosome number and genetic abnormalities (often used as HeLa markers) on 20 chromosomes. Landry et al. [2] have now used deep DNA and RNA sequencing to define the HeLa genome and transcriptome, revealing the true extent of genetic abnormalities at nucleotide resolution. This study establishes a reference sequence for the HeLa genome, along with genetic variations identified in the cell line: valuable resources for the continued use of HeLa cells in biomedical research. In the study [2], the authors report a plethora of single nucleotide variants (SNVs), indels and copy-number changes. Figure 1 shows a genome-wide view of the genomic changes reported in the HeLa genome. Impressively, the authors report, with nucleotide resolution, 2,893 structural variants dominated by large deletion events. In addition, a total of approximately 4.5 million SNVs and 0.5 million indels were identified, the vast majority of which were common variants that had already been reported in the dbSNP database and the 1000 Genomes Project. Common SNVs that are potentially damaging were found in 1,231 genes. Among the 336,006 HeLa-specific SNVs, just 233 would cause amino acid changes, with the function of only 66 proteins predicted to be adversely affected. The potential contributions of these common and specific changes to the phenotypes of the cells, or to the tumor from which they were derived, are unclear. As the authors note, without normal tissue and tumor cells from Henrietta Lacks, which are unavailable, it is not possible to know whether these HeLa-specific variants are unique to the donor's genome, or the donor's cancer, or are a byproduct of 60 years of propagation in culture [2].
Figure 1

Circos plot illustrating the genomic features of the HeLa genome. From outside to inside, the tracks represent: read depth (100 kb binned coverage); copy number (color gradient from light green for one copy to dark red for more than six copies); zygosity (pink, heterozygous; purple, homozygous); SNV density (1 Mb binned SNV count; darker blue for greater density); and translocation calls (colored arcs; light blue, from paired-end sequencing data; light green, from mate pair data; orange, from both datasets). Reproduced, with permission, from [2].

Circos plot illustrating the genomic features of the HeLa genome. From outside to inside, the tracks represent: read depth (100 kb binned coverage); copy number (color gradient from light green for one copy to dark red for more than six copies); zygosity (pink, heterozygous; purple, homozygous); SNV density (1 Mb binned SNV count; darker blue for greater density); and translocation calls (colored arcs; light blue, from paired-end sequencing data; light green, from mate pair data; orange, from both datasets). Reproduced, with permission, from [2]. Finally, the authors [2] find extensive copy-number heterogeneity, with most loci found in three or more copies, which is consistent with previous studies reporting a 3n+ chromosome state [3]. Surprisingly, although less than 1% of the HeLa genome is present at a copy number of one, there are large stretches of homozygosity non-uniformly distributed in the HeLa genome. The authors partitioned the genome into 100 kb bins and found that 23% of these bins were composed of mostly homozygous SNVs (purple in Figure 1). In contrast, they did not find any 100 kb bins of homozygosity in HapMap samples, which are more representative of normal human variation. Comparisons of copy numbers with transcriptome data, also generated in the study [2], indicated a correlation between copy number and expression level, suggesting that dosage compensation does not occur at a global scale in HeLa cells. Among the more highly expressed genes were those enriched for functions such as proliferation, transcription, and DNA repair - arguably valuable assets for life in culture or in a tumor.

Genetic signatures of cancer

A subset of cervical cancers is caused by HPV infection. The authors identified a known insertion of HPV18 on chromosome 8, consistent with previous studies, but also documented nine additional putative viral integration sites [2]. Remarkably, they also found evidence that four of the HeLa chromosomes had been shattered into pieces, with many of the fragments reassembled randomly into highly rearranged chromosomes. This recently described phenomenon, known as chromothripsis, has been found to be associated with 2 to 3% of all cancers [4]. In HeLa cells, evidence of chromothripsis was most pronounced in chromosome 11, which is a hotspot for loss of heterozygosity associated with cervical cancer, the cancer that killed Henrietta Lacks. Whole-genome sequencing was instrumental in revealing the abnormalities of the shattered chromosome 11. In a previous study that used cytogenetic techniques, it was reported that the rearrangement of chromosome 11 segments could not be fully resolved [3].

Identities and architectures of model cell lines

Human cell lines are important models for studying biological function and disease, but to maximize insight from model cell lines, it is critical to understand the genomic architecture of these cells. HeLa cells are particularly abnormal compared with non-cancer human cell lines, as well as compared with the human reference sequence. At the same time, most genomic studies in HeLa cells have used the human reference sequence. The authors highlight the critical importance of this in a case study in which they re-evaluate small interfering RNAs (siRNAs) designed for a large-scale screen in HeLa cells. Some of the siRNAs designed to target the human reference sequences failed to elicit effects in HeLa cells because they did not match the sequence of the HeLa cell genome. This shows the importance of validating targets of siRNAs and other reagents designed against a specific genomic sequence. It is equally important to understand the transcriptional (and ultimately the proteomic) profile of model systems to confirm expected properties before initiating a new study. High-throughput sequencing offers a fast and cost-effective way to characterize cell lines, with the price of exome and transcriptome sequencing already below $1,000 and dropping. For some popular cell lines, these data are already available in NCBI's Sequence Read Archive and other public repositories. The use of high-throughput sequencing data to characterize cell lines is timely, not just because of cost, but also because methods have matured for detecting SNVs, indels and more complex variants in the genome [5,6]. The authors of the HeLa study [2] are working toward making available not only the HeLa reference sequence, but also all the read data from whole-genome and transcriptome sequencing. In addition to characterizing cell lines, it is just as important to confirm the identity of the lines. Misidentification of cell lines, sometimes because of contamination from other cells, is a continuing concern [7]. Historically, cell lines have been identified using short tandem repeat (STR) markers that can be assayed with commercial kits, or from validated marker lists provided by sample repositories such as the American Type Culture Collection (ATCC). In the HeLa study [2], for example, the authors reported that they first confirmed the identity of their HeLa cells using 16 STRs, of which nine were promoted as standards by the ATCC and the Deutsche Sammlung von Mikroorganismen und Zellkulturen (DSMZ). They reported that more than 80% of the markers matched, the gold standard set by the ATCC for STR-based cell line identification. In the era of high-throughput sequencing, even exome sequencing provides sufficient data from which a multitude of additional markers can be established. Improved methods for accurately identifying STR genotypes from high-throughput sequencing data [8] make STR marker identification eminently feasible and open up the possibility of constructing databases with thousands of STRs that uniquely distinguish one cell line from another. Sequencing the genome and transcriptome of the HeLa cell line is an important milestone in biomedical research. For more than 60 years, scientists have studied many biological processes in HeLa cells, publishing some 60,000 papers along the way. The detailed studies by Steinmetz and his colleagues [2] will undoubtedly foster even more productive research using HeLa cells. By documenting just how aberrant HeLa genomes are, however, they also heighten our awareness of exactly what it means to select a cell line for a particular study, and they raise the bar for making such decisions.

Competing interests

The authors declare that they have no competing interests.
  8 in total

1.  Comprehensive and definitive molecular cytogenetic characterization of HeLa cells by spectral karyotyping.

Authors:  M Macville; E Schröck; H Padilla-Nash; C Keck; B M Ghadimi; D Zimonjic; N Popescu; T Ried
Journal:  Cancer Res       Date:  1999-01-01       Impact factor: 12.701

2.  Absence of cancer-associated changes in human fibroblasts immortalized with telomerase.

Authors:  C P Morales; S E Holt; M Ouellette; K J Kaur; Y Yan; K S Wilson; M A White; W E Wright; J W Shay
Journal:  Nat Genet       Date:  1999-01       Impact factor: 38.330

3.  Cell-line authentication: End the scandal of false cell lines.

Authors:  John R Masters
Journal:  Nature       Date:  2012-12-13       Impact factor: 49.962

4.  The genomic and transcriptomic landscape of a HeLa cell line.

Authors:  Jonathan J M Landry; Paul Theodor Pyl; Tobias Rausch; Thomas Zichner; Manu M Tekkedil; Adrian M Stütz; Anna Jauch; Raeka S Aiyar; Gregoire Pau; Nicolas Delhomme; Julien Gagneur; Jan O Korbel; Wolfgang Huber; Lars M Steinmetz
Journal:  G3 (Bethesda)       Date:  2013-08-07       Impact factor: 3.154

Review 5.  Genome structural variation discovery and genotyping.

Authors:  Can Alkan; Bradley P Coe; Evan E Eichler
Journal:  Nat Rev Genet       Date:  2011-03-01       Impact factor: 53.242

6.  Massive genomic rearrangement acquired in a single catastrophic event during cancer development.

Authors:  Philip J Stephens; Chris D Greenman; Beiyuan Fu; Fengtang Yang; Graham R Bignell; Laura J Mudie; Erin D Pleasance; King Wai Lau; David Beare; Lucy A Stebbings; Stuart McLaren; Meng-Lay Lin; David J McBride; Ignacio Varela; Serena Nik-Zainal; Catherine Leroy; Mingming Jia; Andrew Menzies; Adam P Butler; Jon W Teague; Michael A Quail; John Burton; Harold Swerdlow; Nigel P Carter; Laura A Morsberger; Christine Iacobuzio-Donahue; George A Follows; Anthony R Green; Adrienne M Flanagan; Michael R Stratton; P Andrew Futreal; Peter J Campbell
Journal:  Cell       Date:  2011-01-07       Impact factor: 41.582

7.  DELLY: structural variant discovery by integrated paired-end and split-read analysis.

Authors:  Tobias Rausch; Thomas Zichner; Andreas Schlattl; Adrian M Stütz; Vladimir Benes; Jan O Korbel
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

8.  Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles.

Authors:  Gareth Highnam; Christopher Franck; Andy Martin; Calvin Stephens; Ashwin Puthige; David Mittelman
Journal:  Nucleic Acids Res       Date:  2012-10-22       Impact factor: 16.971

  8 in total
  10 in total

1.  High-throughput imaging of mRNA at the single-cell level in human primary immune cells.

Authors:  Manasi Gadkari; Jing Sun; Adrian Carcamo; Hugh Alessi; Zonghui Hu; Iain D C Fraser; Gianluca Pegoraro; Luis M Franco
Journal:  RNA       Date:  2022-06-28       Impact factor: 5.636

2.  Mapping a Large Number of QTL for Durable Resistance to Stripe Rust in Winter Wheat Druchamp Using SSR and SNP Markers.

Authors:  Lu Hou; Xianming Chen; Meinan Wang; Deven R See; Shiaoman Chao; Peter Bulli; Jinxue Jing
Journal:  PLoS One       Date:  2015-05-13       Impact factor: 3.240

3.  Lysosomal recruitment of TSC2 is a universal response to cellular stress.

Authors:  Constantinos Demetriades; Monika Plescher; Aurelio A Teleman
Journal:  Nat Commun       Date:  2016-02-12       Impact factor: 14.919

4.  Reduced cell size, chromosomal aberration and altered proliferation rates are characteristics and confounding factors in the STHdh cell model of Huntington disease.

Authors:  Elisabeth Singer; Carolin Walter; Jonasz J Weber; Ann-Christin Krahl; Ulrike A Mau-Holzmann; Nadine Rischert; Olaf Riess; Laura E Clemensson; Huu P Nguyen
Journal:  Sci Rep       Date:  2017-12-04       Impact factor: 4.379

5.  Enhancer occlusion transcripts regulate the activity of human enhancer domains via transcriptional interference: a computational perspective.

Authors:  Amit Pande; Wojciech Makalowski; Jürgen Brosius; Carsten A Raabe
Journal:  Nucleic Acids Res       Date:  2020-04-17       Impact factor: 16.971

6.  Chlamydia trachomatis Serovars Drive Differential Production of Proinflammatory Cytokines and Chemokines Depending on the Type of Cell Infected.

Authors:  Robert Faris; Shelby E Andersen; Alix McCullough; Françoise Gourronc; Aloysius J Klingelhutz; Mary M Weber
Journal:  Front Cell Infect Microbiol       Date:  2019-11-26       Impact factor: 5.293

7.  STAU2 protein level is controlled by caspases and the CHK1 pathway and regulates cell cycle progression in the non-transformed hTERT-RPE1 cells.

Authors:  Lionel Condé; Yulemi Gonzalez Quesada; Florence Bonnet-Magnaval; Rémy Beaujois; Luc DesGroseillers
Journal:  BMC Mol Cell Biol       Date:  2021-03-04

8.  Ultra-deep sequencing validates safety of CRISPR/Cas9 genome editing in human hematopoietic stem and progenitor cells.

Authors:  M Kyle Cromer; Valentin V Barsan; Erich Jaeger; Mengchi Wang; Jessica P Hampton; Feng Chen; Drew Kennedy; Jenny Xiao; Irina Khrebtukova; Ana Granat; Tiffany Truong; Matthew H Porteus
Journal:  Nat Commun       Date:  2022-08-11       Impact factor: 17.694

9.  Glycosphingolipid-functionalized nanoparticles recapitulate CD169-dependent HIV-1 uptake and trafficking in dendritic cells.

Authors:  Xinwei Yu; Amin Feizpour; Nora-Guadalupe P Ramirez; Linxi Wu; Hisashi Akiyama; Fangda Xu; Suryaram Gummuluru; Björn M Reinhard
Journal:  Nat Commun       Date:  2014-06-20       Impact factor: 14.919

10.  A patient-derived cellular model for Huntington's disease reveals phenotypes at clinically relevant CAG lengths.

Authors:  Claudia Lin-Kar Hung; Tamara Maiuri; Laura Erin Bowie; Ryan Gotesman; Susie Son; Mina Falcone; James Victor Giordano; Tammy Gillis; Virginia Mattis; Trevor Lau; Vickie Kwan; Vanessa Wheeler; Jonathan Schertzer; Karun Singh; Ray Truant
Journal:  Mol Biol Cell       Date:  2018-09-26       Impact factor: 4.138

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.