| Literature DB >> 30813969 |
Allison Piovesan1, Maria Chiara Pelleri1, Francesca Antonaros1, Pierluigi Strippoli1, Maria Caracausi2, Lorenza Vitale1.
Abstract
OBJECTIVE: Basic parameters commonly used to describe genomes including length, weight and relative guanine-cytosine (GC) content are widely cited in absence of a primary source. By using updated data and original software we determined these values to the best of our knowledge as standard reference for the whole human nuclear genome, for each chromosome and for mitochondrial DNA. We also devised a method to calculate the relative GC content in the whole messenger RNA sequence set and in transcriptomes by multiplying the GC content of each gene by its mean expression level.Entities:
Keywords: GC content; Genome length; Genome weight; Human genome; Mitochondrial DNA
Mesh:
Substances:
Year: 2019 PMID: 30813969 PMCID: PMC6391780 DOI: 10.1186/s13104-019-4137-z
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Human genome length estimates
| References | Meters |
|---|---|
| Web sites | |
| | Diploid: 1.5–3 |
| | Haploid: 1 |
| | Diploid: 2 |
| | Diploid: 1.8a |
| | Haploid: 1.8 |
| Articles | |
| [ | Diploid: 2 |
| [ | Diploid: 2 |
| [ | Haploid: 1.5 |
| Books | |
| [ | Diploid: 2 |
| [ | Haploid: 1 |
| [ | Diploid: 2 |
| [ | Diploid: 2 |
aThis value was converted from the stated 6 feet
Length, weight and GC content of human chromosomes, genome and mitochondrial DNA
| Chromosome | Length (bp) | Length (cm) | Weight (pg) | Weight (fg) | GC% |
|---|---|---|---|---|---|
| 1 |
|
|
| 41.72 | |
| 2 | 242,193,529 | 7.92 ± 0.08 |
| 247.65 | 40.23 |
| 3 | 198,295,559 | 6.48 ± 0.06 | 0.20 | 202.76 | 39.67 |
| 4 | 190,214,555 | 6.22 ± 0.06 | 0.19 | 194.49 |
|
| 5 | 181,538,259 | 5.93 ± 0.06 | 0.19 | 185.63 | 39.51 |
| 6 | 170,805,979 | 5.58 ± 0.05 | 0.17 | 174.65 | 39.61 |
| 7 | 159,345,973 | 5.21 ± 0.05 | 0.16 | 162.94 | 40.70 |
| 8 | 145,138,636 | 4.74 ± 0.05 | 0.15 | 148.41 | 40.16 |
| 9 | 138,394,717 | 4.52 ± 0.04 | 0.14 | 141.51 | 41.28 |
| 10 | 133,797,422 | 4.37 ± 0.04 | 0.14 | 136.81 | 41.54 |
| 11 | 135,086,622 | 4.42 ± 0.04 | 0.14 | 138.13 | 41.54 |
| 12 | 133,275,309 | 4.36 ± 0.04 | 0.14 | 136.28 | 40.77 |
| 13 | 114,364,328 | 3.74 ± 0.04 | 0.12 | 116.94 | 38.55 |
| 14 | 107,043,718 | 3.50 ± 0.03 | 0.11 | 109.46 | 40.83 |
| 15 | 101,991,189 | 3.33 ± 0.03 | 0.10 | 104.29 | 42.03 |
| 16 | 90,338,345 | 2.95 ± 0.03 | 0.09 | 92.38 | 44.58 |
| 17 | 83,257,441 | 2.72 ± 0.03 | 0.09 | 85.14 | 45.32 |
| 18 | 80,373,285 | 2.63 ± 0.03 | 0.08 | 82.18 | 39.78 |
| 19 | 58,617,616 | 1.92 ± 0.02 | 0.06 | 59.95 |
|
| 20 | 64,444,167 | 2.11 ± 0.02 | 0.07 | 65.90 | 43.80 |
| 21 |
|
|
| 40.94 | |
| 22 | 50,818,468 | 1.66 ± 0.02 |
| 51.97 | 47.00 |
| X | 156,040,895 | 5.10 ± 0.05 | 0.16 | 159.55 | 39.53 |
| Y | 57,227,415 | 1.87 ± 0.02 | 0.06 | 58.52 | 40.03 |
| Total (1–22, X, Y)a | 3,088,269,832 | 100.96 ± 0.97 | 3.16 | 3157.87 | 40.87 |
| Unplaced | 153,667,028 | 5.02 ± 0.05 | 0.16 | 157.13 | |
| Total male (46, XY)b | 6,270,605,410 | 205.00 ± 1.97 | 6.41 | 6411.94 | 40.91 |
| Total female (46, XX)b | 6,369,418,890 | 208.23 ± 2.00 | 6.51 | 6512.98 | 40.88 |
| Mean (male and female) | 6,320,012,150 | 206.62 ± 1.99 | 6.46 | 6462.46 | 40.89 |
| mtDNA | 16,569 | 0.00054 | 0.000017 | 0.02 | 44.36 |
| Mean mtDNA per cell | 56,793,727 | 1.86 ± 0.02 | 0.06 | 58.08 |
Italics: minimum and maximum values
Bp, base pairs; cm, centimeters (variation was calculated considering the uncertainty of the bp number per DNA helical turn [55]); pg, picograms; fg, femtograms; GC%, percentage of G (guanine), C (cytosine) and S (G or C) among certain bases
aThe total was obtained summing lengths and weights for the 24 types of human linear DNA molecules and used in order to proportionately calculate the length and weight of unplaced bases, improving whole genome calculation accuracy
bTotal for a male or female diploid cell, including a double complement of unplaced bases
Genomic, mRNA and transcriptomic GC contents in the investigated human conditions and other species
| Species | Genomic GC% | mRNA GC% | Transcriptomic GC% | ∆GC%(mRNA−Genomic) | ∆GC%(Transcriptomic−mRNA) |
|---|---|---|---|---|---|
| 40.89 | 48.80 | 48.21 | 7.91 | − 0.59 | |
| 49.27 | 7.91 | 0.47 | |||
| 48.83 | 48.85 | 7.94 | 0.03 | ||
| 48.92 | 7.94 | 0.09 | |||
| 36.63 | 45.70 | 45.40 | 9.08 | − 0.30 | |
|
| 35.44 | 42.21 | 43.22 | 6.77 | 1.00 |
|
| 38.30 | 39.63 | 40.21 | 1.34 | 0.58 |
|
| 50.79 | 51.99 | 52.13 | 1.20 | 0.15 |
Genomic GC%: percentage of G (guanine), C (cytosine) and S (G or C, present only in H. sapiens and D. rerio assemblies); for H. sapiens the mean GC content between male and female genomes was used; mRNA GC%: percentage of G and C in the analysed messenger RNA set having an expression value together with mRNA sequences available (see “Methods” section for details); transcriptomic GC%: percentage of G and C in the sum of each mRNA GC count multiplied by its mean expression value for each biological condition; ∆GC%(mRNA−Genomic): difference between mRNA and genomic GC%; ∆GC%(Transcriptomic−mRNA): difference between transcriptomic and mRNA GC%