Literature DB >> 35558164

Mitochondrial genomes do not appear to regulate flowering pattern/reproductive strategy in Cannabis sativa.

Ziv Attia1, Cloe Pogoda1, Daniela Vergara1, Nolan C Kane1.   

Abstract

Currently, the amount of genetic data for Cannabis is lacking due to the illegal nature of the plant. Our study used 73 Cannabis sativa whole-genome shotgun libraries to reveal eight different mtDNA haplotypes. The most common haplotype contained 60 of the 73 samples studied and was composed of only dioecious individuals. However, other haplotypes contained a mix of both mating strategies (i.e. monoecious and dioecious). From these haplotype groupings we further examined the fully annotated mitochondrial genomes of four hemp individuals with different mt haplotypes and recorded gene content, copy number variation and synteny. Our results revealed highly syntenic mitochondrial genomes that contained ~60 identifiable sequences for protein-coding genes, tRNAs and rRNAs and no obvious rearrangements or chimeric genes. We found no clear evidence that modern reproductive patterns are due to simple cytoplasmic male sterility mutations. It is likely the interaction between nuclear genetic components and the X/Y sex chromosomes that determines reproductive strategy. Additionally, we added 50 % more mitochondrial genomes to the publicly available repository.
© The Author(s) 2021. Published by Oxford University Press on behalf of the Annals of Botany Company.

Entities:  

Keywords:  Genome assembly; haplotype network; hemp; marijuana; reproductive strategy

Year:  2021        PMID: 35558164      PMCID: PMC9089828          DOI: 10.1093/aobpla/plab068

Source DB:  PubMed          Journal:  AoB Plants            Impact factor:   3.138


Introduction

Cannabis sativa is an important annual herb which has been cultivated by humans for millennia. It has extensive amounts of phytochemicals that are used in folk medicine (marijuana type Cannabis) and also contains cellulosic fibres (hemp type Cannabis), which are valuable in the textile industry (Russo 2011; Skoglund ; Andre ). Presently in the USA, the plant is being developed for more extensive industrial purposes after the approval of the 2018 Agriculture Improvement Act (Congress 2018). Prejudices surrounding this crop species are lessening and research activity continues to increase. Thus, understanding the genetic nature of the Cannabis plant as a modern agricultural crop will help to inform the development of it as a valuable plant. Even though most cultivated Cannabis for medical and recreational purposes is dioecious, i.e. male and female flowers develop on separate plants when grown from seeds, monoecious populations where male and female flowers are present on the same plant also exist, particularly in industrial hemp varieties. Interestingly, it is unknown what form wild Cannabis took and whether humans impacted the selection for dioecy or monoecy. Most researchers agree that there may no longer be any wild populations that can be examined (Small and Cronquist 1976; McPartland 2018). However, modern monoecious varieties have been obtained by selection from naturally occurring variants (Bocsa 1958; Westergaard 1958). These monoecious varieties offer several agronomic/industrial advantages when compared to dioecious cultivars, such as higher crop homogeneity and increased seed yield due to self-fertilization during breeding (Faux ; Salentijn ). However, monoecy is also associated with some drawbacks, mainly due to inbreeding reducing genetic variation, leading to lower vigour, and slower breeding improvement (Bocsa and Karus 1998). Additionally, when plants produce seeds, they usually devote energy to that process instead of in the production of secondary metabolites such as THCA (delta 9 tetrahydrocannabinolic acid) and CBDA (cannabidiolic acid), making monoecious varieties less desirable for medicinal/recreational uses (Lubell and Brand 2018). Cannabis sativa in the haploid state has nine autosomes and one sex chromosome (either X or Y; Lloyd 1975; Mandolino ; Barrett 2013; Käfer ). Females are typically X/X and males are X/Y. Monoecious individuals also exist, and look cytologically like females, X/X (Van Bakel ; Razumova ). This is not unusual in plants, where femaleness can be influenced by other mechanisms such as nuclear or cytoplasmic male sterility (CMS), i.e. a male-suppressing genotype. Further mutations can create a plant that has a nuclear genome that appears monoecious, but is phenotypically female, and does not express male characteristics (Charlesworth 2002). The factors that determine separate sexes in plants have evolved numerous times over evolutionary history, usually from hermaphroditic origins (Charlesworth 2002; Renner ). Many theoretical models of the transition from hermaphrodites to dioecious individuals suggest that both cytoplasmic and nuclear male sterility can play significant roles (Charlesworth 2002). These mutations allow the production of viable female flowers only (e.g. a dioecious plant) and prevent the natural production of pollen (e.g. a monoecious plant). These mutations arise frequently (Hanson 1991; Balk and Leaver 2001; León ) and can be strongly favoured in nature and by plant breeders, if the reduction in pollen production is accompanied by an increase in number of seeds (Duvick 1999; Vavdiya ). The complex nature of plant sex evolution and control is repeated numerous times in the Plantae kingdom. Some interesting examples include papaya (Carica papaya), which in its diecious state utilizes the Y chromosome to determine maleness in the XY individual. In contrast, papaya can also be gynodioecious due to a domestication mutation where the Y chromosome becoming a hermaphroditic chromosome (Chae ). Another example is Mercurialis annua which can be both feminized and masculinized depending on which genes are mutated and epigenetic control (Cossard and Pannell 2019; Khadka ). Two close relatives of Cannabis, Humulus lupulus and H. japonicus, have been extensively studied and it has been shown that sex is controlled by the X to autosome ratio (Divashuk ). However, given Cannabis’ illegal status, research has been inhibited and it is unclear what specifically affects the determination of sexual strategy in Cannabis. Understanding the distribution of dioecious and monoecious flowering patterns, within different haplotype groups, therefore, has broad importance to plant biology and agronomy and may shed light on the evolutionary origins and genetic basis of Cannabis’ mating strategy. Here, we compare 73 unique Cannabis individuals that represent both monoecious and dioecious reproductive strategies and interrogate their mitochondrial genome content and organization. Mitochondrial genomes are relatively inexpensive to sequence and assemble given their small size and high copy number in the cell, and offer a useful first look at the genetics of an organism. Our goal was to pursue a detailed examination of these mitochondrial genomes, including annotating protein-coding genes, tRNAs and rRNAs, as well as examining copy number variation (CNV), and synteny to identify potential rearrangements. The data produced here are a significant increase (50 %) in the publicly available genetic information for this species that has been previously inaccessible to the research/scientific community.

Materials and Methods

Whole-genome shotgun libraries

We used publicly available whole-genome shotgun libraries (bioproject PRJNA310948) sequenced by Illumina™ Nextera (Lynch ; Vergara ). These genomes have raw read lengths from 100 to 151 bp. Detailed information regarding DNA extraction, sequencing and library preparation are provided in Lynch and Vergara . These libraries included 73 C. sativa individuals, with some cultivars represented multiple times (Carmagnola × 6, Chocolope × 2, Durban Poison × 2, Afghan Kush × 6, Feral Nebraska × 2 and Kompolti × 2; see). Duplicates were included in our subsequent analyses as a positive control.

Variant calling

Genomic libraries for 73 C. sativa individuals, 67 of them identified in Lynch , were processed to remove adapters and low-quality reads by using Trimmomatic v0.39 (Bolger ) with the following parameters: Illuminaclip: NexteraPE-PE.fa:2:20:10 Leading:20 Trailing:20 Sliding window:4:15 Minlen:100. The resulting FASTQ files were checked for quality using FASTQC (Andrews 2010). The quality-checked, trimmed sequences were then aligned to the C. sativa cs10 assembly (GenBank accession GCA_900626175.2) using the Genome Analysis Toolkit (Van der Auwera ). The resulting variant call file (VCF) table was filtered using vcftools (Danecek ) to only include single-nucleotide polymorphisms (SNPs) that specifically aligned to the mitochondrion and had quality scores above 100 (--minQ 100; see). The cs10 assembly (Grassa ) was used as it is currently the most complete (as of time of writing), full annotation publicly available for Cannabis and allowed for the entire whole-genome libraries to be aligned, which avoided spurious sequence alignment due to similarity between reads for the plastids and nuclear genome.

Haplotype determination

To determine the major haplotype groups of the 73 Cannabis individuals, we used the 1356 SNPs identified between the 73 C. sativa individuals that were present in the filtered VCF table. These SNPs were converted into a FASTA consensus sequence using vcf2phylip.py (Ortiz 2019; see). The resulting multi-FASTA was analysed using the R package pegas (Paradis 2010) in R version 3.5.3 to calculate and plot the unique haplotype groups. Each haplotype group was coloured based upon reproductive type (dioecious, monoecious and unknown). Each of the 73 individuals were assigned to a haplotype group 1–8 [see].

Genome assembly

To carefully determine possible differences between the mitochondrial genomes of monoecious and dioecious Cannabis individuals, we focused most of our efforts on two representative monoecious and two representative dioecious cultivars. Two of these, Carmagnola (GenBank accession KR059940.1) and Sievers Infinity (GenBank accession KU363807.1), were already assembled/annotated and were obtained from NCBI. The other two, Kompolti (MT361981.1) and Euro Oil (MT557709), were newly assembled and annotated here. In order to assemble Kompolti, and Euro Oil, de novo assembly of trimmed reads into scaffolds was performed with SPAdes v3.11.1 (Bankevich ). Relative position, order and orientation of scaffolds were determined by comparison to available Carmagnola reference genomes. We selected contigs based on read coverage when multiple contigs represented the same genomic region. Contigs were then placed in the correct order and combined by trimming overlapping sequences. Gaps between scaffolds were filled with either raw or trimmed reads that overlapped (e.g. tiling) from the FASTQ files. Once assembled, zPicture was used to validate the assembly and visualize any potential major differences in structure between the reference (Carmagnola) and each assembled mitochondrial genome (Ovcharenko ). Additionally, samtools tView was used to confirm the absence of chimerically assembled genomes, as well as the presence of high-quality SNPs and/or INDELs (insertions and/or deletions) in the genome. If the mapped reads supported these assembly errors, modifications were made to the FASTA files as needed (Li 2011).

Mitochondrial genome annotation

Annotations of genomic features (protein-coding sequences, tRNAs and rRNAs) were initiated using GeSeq (Tillich ) to find approximate locations of the predicted gene features. In order to identify all possible tRNAs, we used tRNAscan-SE 2.0. Additionally, genes not automatically identified using GeSeq were found by using nucleotide and translated protein sequences that were extracted from the reference Carmagnola mitochondrial genome from NCBI. We used BLAST (blastn and blastx) to identify regions with homology to these known sequences in our newly assembled FASTA sequences and subsequently annotated any missing features. Annotations were then completed in NCBI’s Sequin 15.50 (Bethesda, MD) and submitted to GenBank for publication.

Comparative synteny analysis

The comparative positions of genes and reorganization within mitochondrial genomes based on orthologous relationships were plotted using the GUI program MAUVE with default settings (Darling ). Comparisons were made between two representative dioecious (Carmagnola and Kompolti) and two monoecious (Sievers Infinity and Euro Oil) individuals.

Copy number variation

To interrogate the CNV between two representative monoecious and dioecious individuals, we divided the calculated coverage by depth at every position in the genome. Specifically, we determined coverage by reporting the number of mapped reads (samtools view -c -f) and the depth at every position in the sorted.bam file (samtools depth). Coverage was estimated as (# mapped reads) * (read length)/(mitochondrial genome length). Read length was 150 bp for all four haplotypes. Copy number variation was then determined from the formula (coverage)/(depth at every position). These values were then normalized by dividing the CNV at each position by the average of the CNV of the entire genome. Next, we used the GenBank annotation files to denote the boundaries of each gene and their associated exons. We report the normalized CNV values for each gene’s exon in .

Results

Variant calling and haplotype network for 73 Cannabis individuals

Alignment of the 73 C. sativa individuals to the cs10 assembly reference genome (as it is currently the most complete publicly available assembly; Grassa ) identified a total of 1356 SNPs [see]. Haplotype network prediction using a consensus sequence based on the SNPs [see] produced eight distinct groups. The haplotype groupings are coloured based upon reproductive strategy (Fig. 1; see). The largest group contained 60 of the 73 individuals and showed only the dioecious sex strategy (there were also five individuals that the sex strategy is not yet determined). However, this haplotype grouping suggests that those unknown individuals are likely also dioecious. The second largest group contained six individuals of which two (Dagastani hemp and Sievers Infinity) are monoecious and four (Finola, Feral Nebraska × 2 and Lebanese) dioecious. Duplicate cultivars (Carmagnola × 6, Chocolope × 2, Durban Poison × 2, Afghan Kush × 6, Feral Nebraska × 2 and Kompolti × 2) were assigned to the same haplotype groups (group I, I, I, I, II and IV, respectively) as positive controls. Given this haplotype map and our interest in establishing if the mitochondrial genome might affect reproductive strategy, we focused our analysis on four different hemp individuals. We chose two representative monoecious and two representative dioecious genomes for our subsequent analyses [see]. The first individual was Carmagnola as it was part of the major haplotype group I and is known to be strongly dioecious (Small 2016). The second dioecious individual selected was Kompolti as it is the most distinct representative of that sex strategy based on our haplotype network. Similarly, we chose Sievers Infinity as a representative monoecious individual, because it is part of the second largest haplotype, group II, and Euro Oil as it is the most distinct monoecious individual based on our haplotype groupings.
Figure 1.

Haplotype network of 73 Cannabis individuals (duplicate cultivars are included in the sample size n). The total number of individuals (n), or the individual’s name is given if it is a group of one. Dioecious sex strategy is represented by blue, monoecious by yellow and unknown is given in grey.

Haplotype network of 73 Cannabis individuals (duplicate cultivars are included in the sample size n). The total number of individuals (n), or the individual’s name is given if it is a group of one. Dioecious sex strategy is represented by blue, monoecious by yellow and unknown is given in grey.

Synteny

To begin investigating the possible role of rearrangements of the mitochondrial genome in the potential CMS phenotype, we compared synteny between our four representative individuals. Synteny is shown in Fig. 2 and is well conserved among the four hemp haplotypes. We specifically examined Carmagnola (dioecious) vs. Kompolti (dioecious), Kompolti (dioecious) vs. Euro Oil (Monoecious) and Euro Oil (monoecious) vs. Sievers Infinity (monoecious). All four genomes were aligned to begin with the same nucleotide sequence to ensure appropriate alignment of the genomes. It is clear that there are no rearrangements between any of the individuals and specifically no major or minor difference between the dioecious and monoecious groups was observed [see].
Figure 2.

Synteny plot of four hemp haplotypes: Carmagnola (KR_059940), Kompolti (MT361981.1), Euro Oil (MT557709) and Sievers Infinity (KU363807.1). Genetic content for the reference (Carmagnola) is shown at the top of the figure and the figure legend (top right) shows what each coloured object represents. All haplotypes were of similar size (400 kb) and they are aligned to each other.

Synteny plot of four hemp haplotypes: Carmagnola (KR_059940), Kompolti (MT361981.1), Euro Oil (MT557709) and Sievers Infinity (KU363807.1). Genetic content for the reference (Carmagnola) is shown at the top of the figure and the figure legend (top right) shows what each coloured object represents. All haplotypes were of similar size (400 kb) and they are aligned to each other.

Mitochondrial genomic content

To shed light on any potential differences in mitochondrial genome content of the Cannabis mitochondrial genomes used here, we compared four different hemp mitochondrial genomes: two representative monoecious and two representative dioecious individuals. The mitochondrial gene content was highly similar between the four mitochondrial genomes we analysed. We identified ~60 genes in each mitochondrial genome including protein-coding genes, tRNAs and rRNAs. Forty-two protein-coding genes previously associated with the CMS phenotype were present in all these genomes and are closely compared in Table 1, with only minor observable differences in some cases, such as different numbers of exons or strand orientation of a given gene. These differences, however, could be due to slight variability in annotation rather than underlying genetic differences because genome-wide, nucleotide similarity for the assembled mitochondrial genomes (i.e. FASTA sequences) was 100 % for Carmagnola vs. Sievers Infinity, 99.94 % for Carmagnola vs. Kompolti and 99.96 % for Carmagnola vs. Euro Oil. The differences between dioecious (Carmagnola) and monoecious (Sievers Infinity and Kompolti) and whether they occurred in a coding region or not are presented in Table 2. Out of 25 potential SNPs and INDELs that were different between the Carmagnola representative dioecious individual and both our monoecious individuals (Euro Oil and Sievers Infinity), there were only five that occurred near an annotated gene, with only one (an insertion of AG at position 26 7314 bp) occurring in an exon of nad7 in Carmagnola. However, alignment of this protein did not reveal any amino acid changes, as this insertion occurs outside the coding regions in both the dioecious individuals. GC percentage was identical between the four haplotypes and was found to be 45.6 %.
Table 1.

Gene presence/absence for the two representative dioecious individuals (Carmagnola and Kompolti) and two representative monoecious individuals (Sievers Infinity and Euro Oil). Number of exons for each gene, tRNA, and rRNA are given as well as if it was on the plus or minus strand. Percentage of coding sequence on the plus strand and GC content is given for each genome.

Gene/haplotypeCarmagnolaKompoltiSievers InfinityEuro Oil
Number of exonsStrandsNumber of exonsStrandsNumber of exonsStrandsNumber of exonsStrands
ATP11Plus1Plus1Plus1Plus
ATP41Plus1Plus1Plus1Plus
ATP61Plus1Plus1Plus1Plus
ATP81Plus1Plus1Plus1Plus
ATP91Plus1Minus1Minus1Minus
CcmB1Plus1Plus1Plus1Plus
ccmc1Plus1Plus1Plus1Plus
ccmfc2Minus2Minus2Minus1Minus
ccmfn1Plus1Plus1Plus1Plus
cob1Plus1Plus1Plus1Plus
cox11Plus1Plus1Plus1Plus
cox22Plus2Plus2Plus2Plus
cox31Plus1Plus1Plus1Plus
matr1Plus1Minus1Minus1Minus
mttB1Plus1Plus1Plus1Plus
nad15Minus4Plus5Minus5Minus
nad25Plus5Plus4Plus5Plus
nad31Plus1Plus1Plus1Plus
nad43Plus3Plus3Plus3Plus
nad4L1Plus1Plus1Plus1Plus
nad55Minus4Minus2Minus4Minus
nad61Plus1Minus1Minus1Minus
nad74Plus4Minus4Plus4Plus
nad91Plus1Plus1Minus1Plus
rpl161Minus1Minus1Minus1Minus
rpl22Minus2Minus2Minus2Minus
rps121Plus1Plus1Plus1Plus
rps131Plus1Plus1Plus1Plus
rps32Minus2Minus2Minus2Minus
rps41Plus1Plus1Plus1Plus
rps71Plus1Plus1Plus1Plus
rrn181Plus1PlusNANA1Plus
rrn51Minus1Plus1Minus1Plus
trnD1Minus1Plus1Minus1Plus
trnE1Plus1Plus1Plus1Plus
trnF1Plus1Plus1Plus1Plus
trnK1Minus×2Plus1Plus×2Plus
trnP1Plus×2Plus1Plus×2Plus
trnQ1Plus1Plus1Plus1Plus
trnS1Plus×2Plus1Plus×2Plus
trnW1Plus1Plus1Plus1Plus
trnY1Plus1Plus1Plus1Plus
GC content %45.645.645.645.6
Genes on plus strand %78.680.069.080.0
Table 2.

Single-nucleotide polymorphism (SNP) and INDEL differences between the reference dioecious individual (Carmagnola) and the alternate monoecious individuals (Euro Oil and Sievers Infinity). Position is given in bp for the reference Carmagnola annotation. The gene feature corresponds to the reference Carmagnola annotation.

Position (bp) in CarmagnolaReference (dioecious)Alternate (monoecious)Gene feature in Carmagnola
3208CTNon-coding
56 571CANon-coding
74 900CTCNon-coding
122 196CANon-coding
144 424AGNon-coding
150 582GTNon-coding
167 988CAIntron trans-spliced nad2
213 604AT5′ Leader sequence nad3
247 699CG5′ Leader sequence nad6
256 390TGNon-coding
257 100CGNon-coding
267 314CAGCAGAGExon nad7
277 630TGNon-coding
295 239ATNon-coding
306 408TCIntron nad4
318 064CANon-coding
331 459ACNon-coding
363 735TGNon-coding
378 146ACNon-coding
384 437AGGGAGGGGNon-coding
400 445ACNon-coding
401 584CGNon-coding
404 499GTNon-coding
413 301ATCATCGTTCNon-coding
413 302TCTCGTCCNon-coding
Gene presence/absence for the two representative dioecious individuals (Carmagnola and Kompolti) and two representative monoecious individuals (Sievers Infinity and Euro Oil). Number of exons for each gene, tRNA, and rRNA are given as well as if it was on the plus or minus strand. Percentage of coding sequence on the plus strand and GC content is given for each genome. Single-nucleotide polymorphism (SNP) and INDEL differences between the reference dioecious individual (Carmagnola) and the alternate monoecious individuals (Euro Oil and Sievers Infinity). Position is given in bp for the reference Carmagnola annotation. The gene feature corresponds to the reference Carmagnola annotation. Copy number variation was similar across the monoecious and dioecious hemp individuals and 42 genes analysed (Fig. 3). The average CNV was ~1. The only outlier was the tRNA trnP (Proline), which has much higher copy number only in the Carmagnola mitochondrial genome.
Figure 3.

Copy number variation (CNV) for Carmagnola, Sievers Infinity, Kompolti and Euro Oil hemp haplotypes. Values were normalized by dividing by the average CNV of each individual mitochondrial genome.

Copy number variation (CNV) for Carmagnola, Sievers Infinity, Kompolti and Euro Oil hemp haplotypes. Values were normalized by dividing by the average CNV of each individual mitochondrial genome.

Discussion

This study takes a detailed look at different Cannabis haplotypes to determine any potential differences in the mitochondrial genomes for these various cultivars. Using 73 C. sativa individuals aligned to the cs10 (Grassa ) reference genome, we identified eight distinct haplotypes, with most of the dioecious individuals (60) falling within haplotype group I (Fig. 1). We chose to closely examine four separate individuals, two (Kompolti and Euro Oil) of which were newly assembled and annotated here. The first of the four we examined, Carmagnola, was part of the main haplotype group I and was dioecious. The second, Sievers Infinity, fell within haplotype group II and was monoecious. All individuals examined are hemp varieties of C. sativa and are cultivated for industrial purposes. Carmagnola is strongly dioecious, existing always as separate male and female plants. Kompolti is also dioecious but can express flowers of either sex on the same plant, with the use of chemicals (e.g. Ag+ ions and gibberellins; Ram and Jaiswal 1972; Atsmon and Tabbak 1979; Ram and Sett 1982). In contrast, Sievers Infinity and Euro Oil are monoecious [see]. Our analyses revealed that the mitochondrial genomes of all four individuals were remarkably similar even though they belonged to different haplotype groups. The genetic content, SNPs/INDELs, CNV and synteny were all highly conserved. This observation is very close to previous research that looked for differences in the chloroplast genomes of Cannabis and found that they too were highly similar and conserved (Vergara ; Roman ). There are two types of sterility possible in plants: mitochondrial- or nuclear-encoded. Mitochondrial-encoded sequences conferring CMS are commonly observed in plant species and are the result of genomic conflict between the mitochondrial and nuclear genomes (Horn ; Touzet and Meyer 2014). In contrast, nuclear, or genic, male sterility has been associated with 20 nuclear-associated gene mutations (Neuffer ) and it is also common in flowering plants (Gabay-Laughnan and Laughnan 1994). Our haplotype group results suggest that a single, ancient origin of monoecy from a dioecious ancestor is not supported. If dioecy were ancestral as suggested by previous research (Kovalchuk ), we would expect more diversity in those dioecious individuals and not the single large haplotype observed. Group I of our haplotype network contains 60 individuals and suggests that instead there was a recent selective sweep specific to this mitochondrial type, but it is not the only cytotype associated with dioecy. The lack of observed diversity in both the mitochondrial and chloroplast genomes is in direct contrast to the nuclear genome diversity in these individuals (Lynch ). Our genetic data suggest that instead of a simple CMS mutation controlling reproductive strategy there has been a fairly complex evolution of dioecy vs. monoecy in Cannabis, perhaps involving several distinct mutations and likely involving the nuclear genome.

Conclusions

As the nature of Cannabis continues to evolve towards being an accepted agricultural crop, more information about agronomic traits is required. This crucial information will provide valuable tools for breeding hemp. Currently, most Cannabis for medical and recreational purposes is propagated vegetatively, and stocks of homogenous seed in the USA do not exist. Cannabis lacks the genetic and genomic tools available, for most important agricultural crops, due to its illegal status (Vergara ), and the absence of basic resources such as public isogenic germplasm collections hinders the improvement and development of cultivars. Crop improvement specialists require creative and collaborative solutions to overcome these issues due to years of scientific neglect. Therefore, in order to make Cannabis more appealing and widely available to commercial farmers, ‘true breeding’ (e.g. F1 hybrids) approaches are necessary. In addition, focused crossing will allow the development of biparental populations and introgression lines that facilitate sophisticated genomic approaches, such as genome-wide association studies. These methods are well established and used in other agricultural crops like sunflower, corn and tomato (Vear 2016; Bauchet ; Darrah ). This type of research has important implications for the medical, recreational and industrial industries that rely on Cannabis. Gaining legal acceptance for the scientific study of this plant, given the growing public acceptance, will be a huge step forward in the process of making Cannabis into a profitable, accessible and modern crop. These data presented here offer a first step in the genetic interrogation and understanding the enigmatic sexual strategies of this useful and controversial plant.

Supporting Information

The following additional information is available in the online version of this article— Cultivar name, reproductive type (D for dioecious, H for Hermaphrodite (monoecious)) and haplotype group number for the 73 individual Cannabis sativa samples used here. Duplicate cultivar names are provided. Haplotype group number is 1–8 and corresponds to Fig. 1 in the main text. Variant call format table (SNPs and INDELs) for the 73 Cannabis sativa cultivars aligned to the reference cs10 (VCF was created using Genome Analysis Toolkit (GATK)). FASTA consensus sequence for each of the 73 Cannabis sativa cultivars aligned to the reference cs10 (Grassa ). A total of 1356 SNPs are included for each individual. The normalized copy number variation (CNV) values for each gene’s exon for the four hemp haplotypes. Genomic content for the Carmagnola (KR_059940) and Sievers Infinity (KU363807.1) as well as the two newly sequenced, assembled and annotated Cannabis sativa hemp Kompolti (MT361980.1) and Euro Oil (MT557709) mitochondrial genomes. Dot plot synteny comparisons performed via NCBI’s blastn. Click here for additional data file. Click here for additional data file.
  36 in total

1.  The PET1-CMS mitochondrial mutation in sunflower is associated with premature programmed cell death and cytochrome c release.

Authors:  J Balk; C J Leaver
Journal:  Plant Cell       Date:  2001-08       Impact factor: 11.277

2.  The mechanism of sex determination in dioecious flowering plants.

Authors:  M WESTERGAARD
Journal:  Adv Genet       Date:  1958       Impact factor: 1.944

3.  Mauve: multiple alignment of conserved genomic sequence with rearrangements.

Authors:  Aaron C E Darling; Bob Mau; Frederick R Blattner; Nicole T Perna
Journal:  Genome Res       Date:  2004-07       Impact factor: 9.043

4.  Molecular cytogenetic analysis of monoecious hemp (Cannabis sativa L.) cultivars reveals its karyotype variations and sex chromosomes constitution.

Authors:  Olga V Razumova; Oleg S Alexandrov; Mikhail G Divashuk; Tatiana I Sukhorada; Gennady I Karlov
Journal:  Protoplasma       Date:  2015-07-07       Impact factor: 3.356

5.  A functional decomposition of sex inconstancy in the dioecious, colonizing plant Mercurialis annua.

Authors:  Guillaume G Cossard; John R Pannell
Journal:  Am J Bot       Date:  2019-05       Impact factor: 3.844

Review 6.  Plant sex determination and sex chromosomes.

Authors:  Deborah Charlesworth
Journal:  Heredity (Edinb)       Date:  2002-02       Impact factor: 3.821

7.  The variant call format and VCFtools.

Authors:  Petr Danecek; Adam Auton; Goncalo Abecasis; Cornelis A Albers; Eric Banks; Mark A DePristo; Robert E Handsaker; Gerton Lunter; Gabor T Marth; Stephen T Sherry; Gilean McVean; Richard Durbin
Journal:  Bioinformatics       Date:  2011-06-07       Impact factor: 6.937

8.  GeSeq - versatile and accurate annotation of organelle genomes.

Authors:  Michael Tillich; Pascal Lehwark; Tommaso Pellizzer; Elena S Ulbricht-Jones; Axel Fischer; Ralph Bock; Stephan Greiner
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

Review 9.  The Complex Interactions Between Flowering Behavior and Fiber Quality in Hemp.

Authors:  Elma M J Salentijn; Jordi Petit; Luisa M Trindade
Journal:  Front Plant Sci       Date:  2019-05-16       Impact factor: 5.753

10.  Gene copy number is associated with phytochemistry in Cannabis sativa.

Authors:  Daniela Vergara; Ezra L Huscher; Kyle G Keepers; Robert M Givens; Christian G Cizek; Anthony Torres; Reggie Gaudino; Nolan C Kane
Journal:  AoB Plants       Date:  2019-11-20       Impact factor: 3.276

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.