| Literature DB >> 35741752 |
Gianluigi Paolillo1, Maria Grazia De Iorio1, Joel F Soares Filipe1, Federica Riva1, Alessandra Stella2, Gustavo Gandini1, Giulio Pagnacco2, Barbara Lazzari2, Giulietta Minozzi1.
Abstract
Sexual regulation in Apis mellifera is controlled by the complementary sex-determiner (csd) gene: females (queens and workers) are heterozygous at this locus and males (drones) are hemizygous. When homozygous diploid drones develop, they are eaten by worker bees. High csd allelic diversity in honeybee populations is a priority for colony survival. The focus of this study is to investigate csd variability in the genomic sequence of the hypervariable region (HVR) of the csd gene in honeybee subspecies sampled in Italy. During the summer of 2017 and 2018, worker bees belonging to 125 colonies were sampled. The honeybees belonged to seven different A. mellifera subspecies: A. m. ligustica, A. m. sicula, A. m cecropia, A. m. carnica, A. m. mellifera, Buckfast and hybrid Carnica. Illumina genomic resequencing of all samples was performed and used for the characterization of global variability among colonies. In this work, a pipeline using existing resequencing data to explore the csd gene allelic variants present in the subspecies collection, based on de novo assembly of sequences falling within the HVR region, is described. On the whole, 138 allelic sequences were successfully reconstructed. Among these, 88 different alleles were identified, 68 of which match with csd alleles present in the NCBI GenBank database.Entities:
Keywords: Apis mellifera; biodiversity; csd alleles; hypervariable region; sex determination
Mesh:
Year: 2022 PMID: 35741752 PMCID: PMC9222915 DOI: 10.3390/genes13060991
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Amino acid sequences of alleles belonging to the hypervariable region of the csd gene, their lengths (L = nr of aa) and frequencies. If the allele was identified in GenBank, the accession numbers are shown (GenBank ID). In bold are the novel alleles identified in the study.
| Allele ID | Amino Acid Sequences | L | Freq. in the Population | Freq. in the | GenBank ID |
|---|---|---|---|---|---|
|
|
|
|
|
| |
| Allele 2 | SSLSNNYNYNNYNNNYKPLYYNINYIEQI | 29 | 2.17% | Ligustica: 2.94% | AAS86659.1, AAS86660.1, AAS86661.1, AAS86663.1, ABD14105.1, ABD14106.1, ABD14107.1, ABD14108.1, AGA84527.1 |
| Buckfast: 2.38% | |||||
| Allele 3 | SSLSNNYNSNSYNNYNNNYKKLQYYNIINIEQI | 33 | 1.45% | Ligustica: 2.94% | CCF23480.1, QEN96041.1 |
|
|
|
|
|
| |
|
| |||||
| Allele 5 | SSLSNKTIHNNNNYKYNYNNNYNNNNNYSKKLYYNINYIEQI | 42 | 2.17% | Ligustica: 4.41% | AQZ41186.1, AQZ41196.1, AQZ41212.1, QEN96050.1 |
| Allele 6 | SSLSNNYNYSNYNNYNNYNNNYNNYKKLYYNINYIEQI | 38 | 1.45% | Ligustica: 2.94% | CCF23508.1 |
| Allele 7 | SSLSKNTIHNNNYKYNYNNNNNYNNNYKKLQYYNINYIEQI | 41 | 2.90% | Ligustica: 2.94% | CCF23518.1 |
| Buckfast: 4.76% | |||||
| Allele 8 | SSLSNSCNYSNNYYNKKLYYNIINIEQI | 28 | 2.17% | Ligustica: 2.94% | CCF23469.1 |
| Buckfast: 2.38% | |||||
| Allele 9 | SSLSNKTIHNNNNYKYNYNNKYNYNNNNYNKKLYYKNYIINIEQI | 45 | 3.62% | Ligustica: 1.47% | ABD14097.1, QEN96103.1 |
| Buckfast: 9.52% | |||||
| Allele 10 | SSLSNNYNYSNYNNYNNNYNNYKKLYYNINYIEQI | 35 | 1.45% | Ligustica: 2.94% | ABV56215.1 |
| Allele 11 | SSLSNNYISNISNYNNNNNSKKLYYNINYIEQI | 33 | 2.17% | Ligustica: 4.41% | AGZ61876.1, CCF23481.1, CCF23482.1 |
| Allele 12 | SSLSKNTIHNNNYKYNYNNNNYNNSKKLYYNINYIEQI | 38 | 1.45% | Ligustica: 2.94% | CCF23507.1 |
| Ligustica: 1.47% | |||||
| Allele 13 | SSLSNKTIHNNNNYKYNYNNNNYKNYNNYKKLYYNINYIEQI | 42 | 2.17% | Carnica: 10.00% | AQZ41220.1 |
| Buckfast: 2.38% | |||||
|
|
|
|
|
| |
| Allele 15 | SSLSNNTIHNNNYKYNNYNNYNKKLYYNIINIEQI | 35 | 1.45% | Ligustica: 2.94% | CCF23492.1 |
|
| |||||
|
|
|
|
|
| |
|
| |||||
| Allele 17 | SSLSNNTIHNNNNYNKKLYYNIINIEQI | 28 | 1.45% | Ligustica: 1.47% | AGZ61866.1, AGZ61871.1, AGZ61872.1, AGZ61874.1, CCF23470.1 |
| Buckfast: 2.38% | |||||
| Allele 18 | SSLSNKTIHNNNNYKYNYNNNCKKLYYNINYIEQI | 35 | 1.45% | Hyb. Carn: 25.00% | ABD14096.1, QEN96024.1, QEN96028.1 |
| Ligustica: 1.47% | |||||
| Allele 19 | SSLSNNYKYSNYNNYNNNYNNNYNNNYNNNYKKLYKNYIINIEQI | 45 | 2.90% | Hyb. Carn: 25.00% | ABD14145.1, ABD14146.1, ADJ57940.1 |
| Buckfast: 7.14% | |||||
| Allele 20 | SSLSNNYNSNNYNKYNYNNSKKLYYNINYIEQI | 33 | 2.17% | Buckfast: 7.14% | CCF23483.1 |
| Allele 21 | SSLSNKTIHNNNNYNNNNYNNYKKLYYNIINIEQI | 35 | 1.45% | Carnica: 10.00% | AEI99777.1 |
| Buckfast: 2.38% | |||||
| Allele 22 | SSLSNNYKYSNYNNYNNNYNNYNNNYNNNYKKLYYNINYIEQI | 43 | 1.45% | Carnica: 10.00% | CCF23533.1 |
| Buckfast: 2.38% | |||||
| Allele 23 | SSLSNHYNYNNNKYNNYNNDYKKLYYNINYIEQI | 34 | 1.45% | Ligustica: 2.94% | ADJ57943.1, AQZ41194.1, AQZ41195.1 |
| Allele 24 | SSLSNKTIHNNNNYKYNYKNYNNSKKLYYNVINIEQI | 37 | 1.45% | Buckfast: 2.38% | AEI99780.1, AEI99781.1, AEI99790.1 |
| Cecropia: 50.00% | |||||
| Allele 25 | SSLSNKTIHNNNNYNNYKKLYYNINYIEQI | 30 | 1.45% | Carnica: 20.00% | AQZ41206.1, AQZ41207.1, AQZ41208.1, CCF23474.1 |
| Allele 26 | SSLSNKTIHNNNKYNYNKYNYNNNNYNNYKKLYYNINYIEQI | 42 | 1.45% | Carnica: 10.00% | CCF23526.1 |
| Buckfast: 2.38% | |||||
| Allele 27 | SSLSNNYNYNNNNYNNYNNNYNNNYNKKLYYNIINIEQI | 39 | 2.17% | Carnica: 10.00% | AQZ41204.1, AQZ41205.1, CCF23512.1 |
| Buckfast: 4.76% | |||||
| Allele 28 | SSLSNKTIHNNNNYKYNYNNNNYNNNYNNNCKKLYYNIINIEQI | 44 | 1.45% | Mellifera: 25.00% | AEI99762.1, QEN96035.1 |
| Ligustica: 1.47% | |||||
| Allele 29 | SSLSNNYNYNNNNYNNNYNKKLYYNINYIEQI | 32 | 1.45% | Ligustica: 1.47% | ADJ57960.1, AEI99714.1, AEI99724.1, AEI99729.1, AEI99733.1, AEI99735.1, AEI99736.1, AEI99738.1, AEI99741.1, AEI99743.1, AEI99744.1, AEI99755.1, AEI99759.1 |
| Sicula: 12.50% | |||||
|
|
|
|
|
| |
| Allele 31 | SSLSNKTIHNNNNYKYNYNNKYNYNNNNYNNNNYNKKLYYKNYIINIEQI | 50 | 2.17% | Sicula: 12.50% | AQZ41179.1, AQZ41181.1, AQZ41192.1, AQZ41193.1, QEN96025.1 |
| Ligustica: 2.94% | |||||
| Allele 32 | SSLSNSCNYSNNYNNNYNNTKKLYYNINYIEQI | 33 | 0.72% | Ligustica: 4.41% | AEI99783.1, AEI99788.1, AEI99792.1, AGZ61869.1 |
| Allele 33 | SSLSNKTIHNNNNYKNYNYKKLYYNIINIEQI | 32 | 0.72% | Ligustica: 1.47% | CCF23479.1 |
| Allele 34 | SSLSNNYNYSNYNNNNYKQLCYNINYIEQI | 30 | 0.72% | Ligustica: 1.47% | ABD14139.1, ABD14141.1, ABD14142.1, ABD14143.1, ABD14144.1, AGA84531.1, AGZ61875.1, |
| Allele 35 | SSLSNNYNYSNYNNYNNYNNNYNNYNNNYNNYKKLYYNINYIEQI | 45 | 0.72% | Buckfast: 2.38% | CCF23536.1 |
| Allele 36 | SSLSNNYNSNSYNNYNNNYYNNKKLQYYNINYIEQI | 36 | 0.72% | Ligustica: 1.47% | AGA84533.1, CCF23490.1, QEN96078.1 |
| Allele 37 | SSLSNKTIHNNNNYNNNNYNNYNNNNYNNYKKLYYNIINIEQI | 43 | 0.72% | Buckfast: 2.38% | QEN96087.1 |
| Allele 38 | SSLSSNYNSNNYNNYNNYKQLCYNINYIEQI | 31 | 0.72% | Mellifera: 25.00% | ART88596.1, CCF23477.1 |
| Allele 39 | SSLSNNYNYNNNKYNYNNNNYKQLCYNINYIEQI | 34 | 0.72% | Ligustica: 1.47% | AEI99717.1, AEI99718.1, AEI99720.1, AEI99721.1, AEI99726.1, AEI99732.1, AEI99734.1, AEI99740.1, AEI99742.1, AGA84529.1 |
| Allele 40 | SSLSNKTIHNNNNYNNNNYNNYNNNNYNNYKKLYYNINYIEQI | 43 | 0.72% | Ligustica: 1.47% | CCF23531.1 |
| Allele 41 | SSLSNNYKYSNYNNYNNNNYNNNNYNNNSKKLYYNIINIEQI | 42 | 0.72% | Ligustica: 1.47% | AGA84526.1, CCF23524.1 |
| Allele 42 | SSLSNKTIHNNNNYNNNNYKKLQYYNINYIEQI | 33 | 0.72% | Ligustica: 1.47% | ABV56220.1, AQZ41221.1, QEN96101.1 |
| Allele 43 | SSLSNNYNYNNNNYNNYNNNYNNNYNKKLYYNINYIEQI | 39 | 0.72% | Ligustica: 1.47% | QEN96052.1, QEN96059.1 |
|
|
|
|
|
| |
| Allele 45 | SSLSNNYNYNNNNYNNYNNYNNYNNNYNKKLYYNINYIEQI | 41 | 0.72% | Ligustica: 1.47% | ABV56219.1 |
| Allele 46 | SSLSNNYKYSNYNNYNNYNKKLYYKNYIINIEQI | 34 | 0.72% | Ligustica: 1.47% | ABD14104.1, ADJ57958.1, AGA84523.1, QEN96085.1, QEN96088.1 |
|
|
|
|
|
| |
| Allele 48 | SSLSNNYKYSNYNNNNYNNNSKKLYYNINYIEQI | 34 | 0.72% | Ligustica: 1.47% | AQZ41199.1, CCF23485.1 |
|
|
|
|
|
| |
| Allele 50 | SSLSNKTIHNNNNYKYNYNNNNYNNNYNNYKKLYYNINYIEQI | 43 | 0.72% | Ligustica: 1.47% | CCF23532.1, QEN96032.1, QEN96033.1, QEN96042.1 |
| Allele 51 | SSLSNKTIHNNNNYKNYNNYKNYNNYKKLYYNINYIEQI | 39 | 0.72% | Carnica: 10.00% | CCF23513.1 |
| Allele 52 | SSLSNKTIHNNNNYKYNYNNNNYNNNNYNKKLYYKNYIINIEQI | 44 | 0.72% | Ligustica: 1.47% | CCF23527.1 |
| Allele 53 | SSLSNNTIHNNNYKYNYNNKYNYNNKKLYYNIINIEQI | 38 | 0.72% | Carnica: 10.00% | CCF23510.1, QEN96012.1 |
| Allele 54 | SSLSNKTIHNNNNYKYNYNNNNYNNNNYNNNYNNNCKKLYYNIINIEQI | 49 | 0.72% | Buckfast: 2.38% | AAQ57659.1, QEN96069.1 |
| Allele 55 | SSLSNNYNSNSYNNNYNNNYYNKKLQYYNINYIEQI | 36 | 0.72% | Buckfast: 2.38% | CCF23486.1 |
|
|
|
|
|
| |
| Allele 57 | SSLSNKTIHNNNNYKKLYYNINYIEQI | 27 | 0.72% | Buckfast: 2.38% | CCF23466.1 |
| Allele 58 | SSLSNNYNYSNYNNYNNYKKLYYNINYIEQI | 31 | 0.72% | Buckfast: 2.38% | CCF23475.1 |
| Allele 59 | SSLSNNTIHNNNYKYNYNNNNYNNNNYNKKLYYNIINIEQI | 41 | 0.72% | Buckfast: 2.38% | ABD14102.1, ABD14103.1, AEI99760.1, AEI99784.1 |
| Allele 60 | SSLSNNYNYSNYNNYNNNNNYNNYKKLYYNINYIEQI | 37 | 0.72% | Ligustica: 1.47% | ABD14109.1, ABD14110.1, ABD14111.1, ABD14112.1, ABD14113.1, ABD14114.1, ABD14115.1, ABD14116.1, AQZ41223.1, QEN96102.1 |
| Allele 61 | SSLSNNYKYSNYNNYNNNYNNYNNNYKKLYYNINYIEQI | 39 | 0.72% | Ligustica: 1.47% | AEI99754.1, AEI99786.1, AEI99787.1, AEI99789.1 |
| Allele 62 | SSLSSSCNYSNNYNNYYNNNKKLYYNIINIEQI | 33 | 0.72% | Ligustica: 1.47% | AGA84525.1, AIS73042.1, AIS73050.1, CCF23484.1, QEN96007.1, QEN96009.1, QEN96018.1, QEN96019.1, QEN96026.1, QEN96061.1, QEN96074.1, QEN96084.1, QEN96097.1, QEN96105.1 |
|
|
|
|
|
| |
| Allele 64 | SSLSNNRNSNNYNNYNYKKLYYNINYIEQI | 30 | 0.72% | Buckfast: 2.38% | ABV56222.1, CCF23473.1 |
| Allele 65 | SSLSNNYNYSNYNNYNNNYNNNYNNNDYKKLYYKNYIINIEQI | 43 | 0.72% | Buckfast: 2.38% | ABV56216.1, QEN96106.1 |
| Allele 66 | SSLSNNYNYSNYNNYNNNNYNNYKKLYYNINYIEQI | 36 | 0.72% | Hyb. Carn: 25.00% | CCF23496.1 |
| Allele 67 | SSLSNNYNYSNNYNNYYNNNNNYNNYKKLYYNIINIEQI | 39 | 0.72% | Hyb. Carn: 25.00% | AEI99727.1, AEI99728.1 |
| Allele 68 | SSLSKNTIHNNNYNNSKKLYYNIINIEQI | 29 | 0.72% | Buckfast: 2.38% | AEI99716.1 |
|
|
|
|
|
| |
|
|
|
|
|
| |
|
|
|
|
|
| |
| Allele 72 | SSLSNKTIHNNNNYKYNYNNNNYKPYYNINYIEQI | 35 | 0.72% | Ligustica: 1.47% | AEI99745.1 |
|
|
|
|
|
| |
|
|
|
|
|
| |
|
|
|
|
|
| |
| Allele 76 | SSLSNNYKYSNYNNYNNYNNNSKKLYKNYIINIEQI | 36 | 0.72% | Sicula: 12.50% | AQZ41187.1, AQZ41197.1, AQZ41198.1, ART88598.1, CCF23491.1, QEN96020.1, QEN96040.1 |
| Allele 77 | SSLSNKTIHNNNNYNNNNYNNYNNNNYNYKKLYYNINYIEQI | 42 | 0.72% | Ligustica: 1.47% | CCF23528.1 |
| Allele 78 | SSLSNKTIHNNNNYKYNYNNNYNNNSKKLYYNINYIEQI | 39 | 0.72% | Ligustica: 1.47% | AGA84537.1, CCF23514.1 |
| Allele 79 | SSLSNNYNYSNYNNYNNNYNNYNKKLYYNINYIEQI | 36 | 0.72% | Buckfast: 2.38% | ABD14117.1 |
|
|
|
|
|
| |
| Allele 81 | SSLSNNYNYNNYNNTNNINKQLYYNINYIEQI | 32 | 0.72% | Buckfast: 2.38% | ABV56218.1, AQZ41218.1, QEN96077.1, QEN96095.1 |
| Allele 82 | SSLSNNYSYNNYNNNNYNKKLYYNINYIEQI | 31 | 0.72% | Buckfast: 2.38% | CCF23476.1, QEN96068.1 |
| Allele 83 | SSLSNNYNYNNNNYNNYNNNYNKKLYYNINYIEQI | 35 | 0.72% | Buckfast: 2.38% | AQZ41215.1, AQZ41216.1, AQZ41217.1, CCF23487.1 |
|
|
|
|
|
| |
|
|
|
|
|
| |
| Allele 86 | SSLSNNTIHNNNNYKYNYNNNYNNYNNYNNKKLYYNIINIEQI | 43 | 0.72% | Buckfast: 2.38% | CCF23529.1 |
|
|
|
|
|
| |
| Allele 88 | SSLSNNYISNISNYNNNNNYNKKLYYNINYIEQI | 34 | 0.72% | Ligustica: 1.47% | AAQ67418.1, ABD14119.1, DAA06292.1, QEN96014.1, QEN96029.1, QEN96031.1, QEN96036.1, QEN96038.1, QEN96045.1, QEN96055.1, QEN96057.1, QEN96079.1 |
Figure 1Phylogenetic tree based on the amino acid sequence of the csd alleles HVR region in 7 different Italian honeybee subspecies. Colors are given according to evolutionary lineages: A. m. sicula belongs to the African type (pink); A. m. mellifera belongs to the western and northern Europe lineage (blue); A. m. carnica, A. m. cecropia and A. m. ligustica belong to the eastern Europe group (green); Buckfast and hybrid Carnica are crosses of European subspecies belonging to C type (red). The tree was rooted on Apis cerana (black).
Figure 2Venn diagram of the csd alleles among the 7 honeybee subspecies.