| Literature DB >> 35558164 |
Ziv Attia1, Cloe Pogoda1, Daniela Vergara1, Nolan C Kane1.
Abstract
Currently, the amount of genetic data for Cannabis is lacking due to the illegal nature of the plant. Our study used 73 Cannabis sativa whole-genome shotgun libraries to reveal eight different mtDNA haplotypes. The most common haplotype contained 60 of the 73 samples studied and was composed of only dioecious individuals. However, other haplotypes contained a mix of both mating strategies (i.e. monoecious and dioecious). From these haplotype groupings we further examined the fully annotated mitochondrial genomes of four hemp individuals with different mt haplotypes and recorded gene content, copy number variation and synteny. Our results revealed highly syntenic mitochondrial genomes that contained ~60 identifiable sequences for protein-coding genes, tRNAs and rRNAs and no obvious rearrangements or chimeric genes. We found no clear evidence that modern reproductive patterns are due to simple cytoplasmic male sterility mutations. It is likely the interaction between nuclear genetic components and the X/Y sex chromosomes that determines reproductive strategy. Additionally, we added 50 % more mitochondrial genomes to the publicly available repository.Entities:
Keywords: Genome assembly; haplotype network; hemp; marijuana; reproductive strategy
Year: 2021 PMID: 35558164 PMCID: PMC9089828 DOI: 10.1093/aobpla/plab068
Source DB: PubMed Journal: AoB Plants Impact factor: 3.138
Figure 1.Haplotype network of 73 Cannabis individuals (duplicate cultivars are included in the sample size n). The total number of individuals (n), or the individual’s name is given if it is a group of one. Dioecious sex strategy is represented by blue, monoecious by yellow and unknown is given in grey.
Figure 2.Synteny plot of four hemp haplotypes: Carmagnola (KR_059940), Kompolti (MT361981.1), Euro Oil (MT557709) and Sievers Infinity (KU363807.1). Genetic content for the reference (Carmagnola) is shown at the top of the figure and the figure legend (top right) shows what each coloured object represents. All haplotypes were of similar size (400 kb) and they are aligned to each other.
Gene presence/absence for the two representative dioecious individuals (Carmagnola and Kompolti) and two representative monoecious individuals (Sievers Infinity and Euro Oil). Number of exons for each gene, tRNA, and rRNA are given as well as if it was on the plus or minus strand. Percentage of coding sequence on the plus strand and GC content is given for each genome.
| Gene/haplotype | Carmagnola | Kompolti | Sievers Infinity | Euro Oil | ||||
|---|---|---|---|---|---|---|---|---|
| Number of exons | Strands | Number of exons | Strands | Number of exons | Strands | Number of exons | Strands | |
| ATP1 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| ATP4 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| ATP6 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| ATP8 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| ATP9 | 1 | Plus | 1 | Minus | 1 | Minus | 1 | Minus |
| CcmB | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| ccmc | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| ccmfc | 2 | Minus | 2 | Minus | 2 | Minus | 1 | Minus |
| ccmfn | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| cob | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| cox1 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| cox2 | 2 | Plus | 2 | Plus | 2 | Plus | 2 | Plus |
| cox3 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| matr | 1 | Plus | 1 | Minus | 1 | Minus | 1 | Minus |
| mttB | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| nad1 | 5 | Minus | 4 | Plus | 5 | Minus | 5 | Minus |
| nad2 | 5 | Plus | 5 | Plus | 4 | Plus | 5 | Plus |
| nad3 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| nad4 | 3 | Plus | 3 | Plus | 3 | Plus | 3 | Plus |
| nad4L | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| nad5 | 5 | Minus | 4 | Minus | 2 | Minus | 4 | Minus |
| nad6 | 1 | Plus | 1 | Minus | 1 | Minus | 1 | Minus |
| nad7 | 4 | Plus | 4 | Minus | 4 | Plus | 4 | Plus |
| nad9 | 1 | Plus | 1 | Plus | 1 | Minus | 1 | Plus |
| rpl16 | 1 | Minus | 1 | Minus | 1 | Minus | 1 | Minus |
| rpl2 | 2 | Minus | 2 | Minus | 2 | Minus | 2 | Minus |
| rps12 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| rps13 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| rps3 | 2 | Minus | 2 | Minus | 2 | Minus | 2 | Minus |
| rps4 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| rps7 | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| rrn18 | 1 | Plus | 1 | Plus | NA | NA | 1 | Plus |
| rrn5 | 1 | Minus | 1 | Plus | 1 | Minus | 1 | Plus |
| trnD | 1 | Minus | 1 | Plus | 1 | Minus | 1 | Plus |
| trnE | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| trnF | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| trnK | 1 | Minus | ×2 | Plus | 1 | Plus | ×2 | Plus |
| trnP | 1 | Plus | ×2 | Plus | 1 | Plus | ×2 | Plus |
| trnQ | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| trnS | 1 | Plus | ×2 | Plus | 1 | Plus | ×2 | Plus |
| trnW | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| trnY | 1 | Plus | 1 | Plus | 1 | Plus | 1 | Plus |
| GC content % | 45.6 | 45.6 | 45.6 | 45.6 | ||||
| Genes on plus strand % | 78.6 | 80.0 | 69.0 | 80.0 |
Single-nucleotide polymorphism (SNP) and INDEL differences between the reference dioecious individual (Carmagnola) and the alternate monoecious individuals (Euro Oil and Sievers Infinity). Position is given in bp for the reference Carmagnola annotation. The gene feature corresponds to the reference Carmagnola annotation.
| Position (bp) in Carmagnola | Reference (dioecious) | Alternate (monoecious) | Gene feature in Carmagnola |
|---|---|---|---|
| 3208 | C | T | Non-coding |
| 56 571 | C | A | Non-coding |
| 74 900 | CT | C | Non-coding |
| 122 196 | C | A | Non-coding |
| 144 424 | A | G | Non-coding |
| 150 582 | G | T | Non-coding |
| 167 988 | C | A | Intron trans-spliced nad2 |
| 213 604 | A | T | 5′ Leader sequence nad3 |
| 247 699 | C | G | 5′ Leader sequence nad6 |
| 256 390 | T | G | Non-coding |
| 257 100 | C | G | Non-coding |
| 267 314 | CAG | CAGAG | Exon nad7 |
| 277 630 | T | G | Non-coding |
| 295 239 | A | T | Non-coding |
| 306 408 | T | C | Intron nad4 |
| 318 064 | C | A | Non-coding |
| 331 459 | A | C | Non-coding |
| 363 735 | T | G | Non-coding |
| 378 146 | A | C | Non-coding |
| 384 437 | AGGG | AGGGG | Non-coding |
| 400 445 | A | C | Non-coding |
| 401 584 | C | G | Non-coding |
| 404 499 | G | T | Non-coding |
| 413 301 | ATC | ATCGTTC | Non-coding |
| 413 302 | TC | TCGTCC | Non-coding |
Figure 3.Copy number variation (CNV) for Carmagnola, Sievers Infinity, Kompolti and Euro Oil hemp haplotypes. Values were normalized by dividing by the average CNV of each individual mitochondrial genome.