| Literature DB >> 25710878 |
Rafael Bayarri-Olmos1, Soren Hansen2, Maiken Lumby Henriksen2, Line Storm3, Steffen Thiel3, Peter Garred1, Lea Munthe-Fog1.
Abstract
Collectin liver 1 (CL-L1, alias CL-10) and collectin kidney 1 (CL-K1, alias CL-11), encoded by the COLEC10 and COLEC11 genes, respectively, are highly homologous soluble pattern recognition molecules in the lectin pathway of complement. These proteins may be involved in anti-microbial activity and in tissue development as mutations in COLEC11 are one of the causes of the developmental defect syndrome 3MC. We studied variations in COLEC10 and COLEC11, the impact on serum concentration and to what extent CL-L1 and CL-K1 serum concentrations are correlated. We sequenced the promoter regions, exons and exon-intron boundaries of COLEC10 and COLEC11 in samples from Danish Caucasians and measured the corresponding serum levels of CL-L1 and CL-K1. The median concentration of CL-L1 and CL-K1 was 1.87 μg/ml (1.00-4.14 μg/ml) and 0.32 μg/ml (0.11-0.69 μg/ml), respectively. The level of CL-L1 strongly correlated with CL-K1 (ρ = 0.7405, P <0.0001). Both genes were highly conserved with the majority of variations in the non-coding regions. Three non-synonymous variations were tested: COLEC10 Glu78Asp (rs150828850, minor allele frequency (MAF): 0.003), COLEC10 Arg125Trp (rs149331285, MAF: 0.007) and COLEC11 His219Arg (rs7567833, MAF: 0.033). Carriers of COLEC10 Arg125Trp had increased CL-L1 serum levels (P = 0.0478), whereas promoter polymorphism COLEC11-9570C>T (rs3820897) was associated with decreased levels of CL-K1 (P = 0.044). In conclusion, COLEC10 and COLEC11 are highly conserved, which may reflect biological importance of CL-L1 and CL-K1. Moreover, the strong inter individual correlation between the two proteins suggests that a major proportion are found as heterooligomers or subjected to the same regulatory mechanisms.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25710878 PMCID: PMC4339841 DOI: 10.1371/journal.pone.0114883
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Distribution of polymorphisms and pair wise LD analysis (r2) observed in COLEC10.
A schematic drawing of the gene COLEC10 is given at the top. Exons are numbered and promoter and non-translated regions are indicated. The numbers in the grid refer to the r2 values of a given pair of SNPs. Darker colours of the boxes means higher r2 values.
COLEC10 and COLEC11 primer sets.
| Forward primer | Reverse primer | Internal primer | |
|---|---|---|---|
| COLEC10 | |||
| Promoter region | |||
| Fragment 1 | 5′-cctttcacctaactcttagttga-3′ | 5′-gcctttctggtacactatctc-3′ | |
| Fragment 2 | 5′-ttcgcctttctgccacatcc-3′ | 5′-gaactccagtgtgcatgtcag-3′ | |
| Fragment 3 | 5′-tcaagtactcagcagaaggca-3′ | 5′-caaagccattcattgctgtgg-3′ | |
| Exonic regions | |||
| Exon 1 | 5′-ggtccatttggagcactgag-3′ | 5′-ataagtaacagaccagatctgg-3′ | |
| Exon 2 | 5′-aagtgcatgtgaacatacatcc-3′ | 5′-atgcaacttgagaaacatatgag-3′ | |
| Exon 3 | 5′-tagtatatattcatactcactcac-3′ | 5′-ccaccatgcccagctaattc-3′ | |
| Exon 4 | 5′-ttatgactcatttaattagttgcac-3′ | 5′-actgccagagttttctcttgg-3′ | |
| Exon 5 | 5′-tttagaattgtcttatagttcattc-3′ | 5′-cagacttgaggtgtcctagg-3′ | |
| Exon 6 | 5′-tctgtgactacctcataatagg-3′ | 5′-tgtaacctggcgagttattgg-3′ | |
| 5′-gagtgccatcttaccatgtac-3′ | 5′-aaggagaacttccaagtatagc-3′ | ||
| Pyrosequencing | |||
| –7C>T | 5′-tgtgtgttccaaatacttccc-3′ | 5′-[B]gactctgaatttgcaaaagaaatag-3′ | 5′-tgggagacccttttctgagg-3′ |
| Glu78Asp | 5′-acattagccacatttcaaggga-3′ | 5′-[B]acctcggtctctgagtgttg-3′ | 5′-gtttgcttttcaggaattaaagg-3′ |
| Arg125Trp | 5′-[B]cgtgtcttcatcggcatca-3′ | 5′-ctccttgtcaaactcacacatga-3′ | 5′-ccacttgttgaaggtccgca-3′ |
| COLEC11 | |||
| Promoter region | |||
| Fragment 1 | 5′-tgtctggttttggtatcagggt-3′ | 5′-gcttctatgcagcccgaatc-3′ | |
| Fragment 2 | 5′-tcacctttgaacctagcttctta-3′ | 5′-cctggggagtgtgggaac-3′ | |
| Fragment 3 | 5′-ttgggatgttcgggttggag-3′ | 5′-atcaaactttggaactctggtg-3′ | |
| Exonic regions | |||
| Exon 1 | 5′-ttgggatgttcgggttggag-3′ | 5′-atcaaactttggaactctggtg-3′ | |
| Exon 2 | 5′-ttggtgcctggtgctgacc-3′ | 5′-ggaattccatggccgaagac-3′ | |
| Exon 3 | 5′-aaacacaggccctttctaagag-3′ | 5′-ccacaggtcagcacaaggag-3′ | |
| Exon 4 | 5′-ctccattctatagtgtgtatgac-3′ | 5′-agctgcagcccctccatcc-3′ | |
| Exon 5 | 5′-aaagaatgatcactcgataatcc-3′ | 5′-gtgtccaccgtgggatgtg-3′ | |
| Exon 6 | 5′-ttgaaatacatgtgtctgccct-3′ | 5′-gcacagccacaggcagcag-3′ | |
| Exon 7 | 5′-acctcccagccctgtcctg-3′ | 5′-cattttctcagtttagacaacagc-3′ | |
| 5′-cccaacaatgcctacgacg-3′ | 5′-catggataatagtgtgaaggaacc-3′ | ||
| Pyrosequencing | |||
| His219Arg | 5′-[B]tggaaagagagcaaatgttcct-3′ | 5′-actggaattgtagtgttgatgga-3′ | 5′-actaatatccagttgtccaacaaa-3′ |
Forward and reverse primers were used for PCR amplification, internal primer was used for pyrosequencing. All forward primers for the promoter and exonic regions include a 5′-T7 sequence (5′-TAATACGACTCACTATAGGG-3′). Biotinylated oligos are indicated as [B].
Genetic variation observed in the COLEC10 gene.
| Variation | dbSNP | Region | Amino acid change | AA | Aa | aa | MAF |
|---|---|---|---|---|---|---|---|
| −996C>T | rs117863403 | Promoter | - | 93 (96.9%) | 3 (3.1%) | 0 | 0.016 |
| −796delT | ss991382312 | Promoter | - | 95 (99%) | 1 (1%) | 0 | 0.005 |
| −743C>T | ss991382313 | Promoter | - | 95 (99%) | 1 (1%) | 0 | 0.005 |
| −742G>A | rs3812490 | Promoter | - | 94 (98%) | 2 (2%) | 0 | 0.010 |
| −663C>G | rs1485297 | Promoter | - | 81(84.4%) | 14 (14.6%) | 1 (1%) | 0.083 |
| −315G>C | rs1485298 | Promoter | - | 62 (64.6%) | 28 (29.2%) | 6 (6.3%) | 0.208 |
| −277T>C | rs2450048 | Promoter | - | 56 (58.3%) | 33 (34.4%) | 7 (7.3%) | 0.245 |
| −161_−157 delAAAAT | rs148350292 | Promoter | - | 95 (99%) | 1 (1%) | 0 | 0.005 |
| −150C>T | rs3829048 | Promoter | - | 93 (96.9%) | 3 (3.1%) | 0 | 0.016 |
| −145G>A | ss991382317 | Promoter | - | 95 (99%) | 1 (1%) | 0 | 0.005 |
| −7C>T | ss749616235 | 5′ UTR | - | 299 (99.7%) | 1 (0.3%) | 0 | 0.005 |
| +228A>G | rs2465383 | Intron 1 | - | 26 (26%) | 55 (55%) | 19 (19%) | 0.465 |
| +22418A>G | rs16891987 | Exon 2 | - | 97 (97%) | 3 (3%) | 0 | 0.015 |
| +22547T>C | rs149290883 | Intron 2 | - | 99 (99%) | 1 (1%) | 0 | 0.005 |
| +23881A>C | rs150828850 | Exon 3 | Glu78Asp | 298 (99.3%) | 2 (0.7%) | 0 | 0.005 |
| +23998A>T | rs4512407 | Intron 3 | - | 97 (97%) | 3 (3%) | 0 | 0.015 |
| +36545C>T | rs149331285 | Exon 5 | Arg125Trp | 296 (98.7%) | 4 (1.3%) | 0 | 0.005 |
| +38501C>A | rs11987106 | Intron 5 | - | 97 (97%) | 3 (3%) | 0 | 0.015 |
| +39097A>G | rs1064556 | 3′ UTR | - | 97 (97%) | 3 (3%) | 0 | 0.015 |
| +39391C>G | rs1064557 | 3′ downstream | - | 97 (97%) | 3 (3%) | 0 | 0.015 |
The base position refers to the location of the polymorphism considering the A from the translation start site ATG as +1, and the adjacent 5′ base as -1. All variations adhered to the Hardy-Weinberg equilibrium. ‘AA’, individuals homozygote for the major allele; ‘Aa’, heterozygous individuals; ‘aa’, individuals homozygous for the minor allele; MAF, minor allele frequency.
Genetic variation observed in the COLEC11 gene.
| Base position | dbSNP | Region | Amino acid change | AA | Aa | aa | MAF |
|---|---|---|---|---|---|---|---|
| −10007_− 10004delCATT | ss991382311 | Promoter | - | 95 (99%) | 1 (1%) | 0 | 0.005 |
| −9970T>C | rs1864480 | Promoter | - | 53 (55.2%) | 40 (41.7%) | 3 (3%) | 0.240 |
| −9766T>C | rs4849953 | Promoter | - | 44 (45.8%) | 44 (45.8%) | 8 (8.4%) | 0.313 |
| −9591G>T | rs34596301 | Promoter | - | 86 (89.6%) | 10 (10.4%) | 0 | 0.052 |
| −9570C>T | rs3820897 | Promoter | - | 52 (54.2%) | 41 (42.7%) | 3 (3%) | 0.245 |
| −9175T>C | rs77246730 | 5′ UTR | - | 193 (98.5%) | 3 (1.5%) | 0 | 0.010 |
| +8822C>T | rs3811531 | Intron 2 | - | 36 (36%) | 46 (46%) | 18 (18%) | 0.410 |
| +8926G>A | rs7567724 | Intron 2 | - | 77 (77%) | 20 (20%) | 3 (3%) | 0.130 |
| +8939G>A | rs17017752 | Intron 2 | - | 95 (95%) | 4 (4%) | 1 (1%) | 0.030 |
| +9072G>A | rs67826307 | Intron 3 | - | 95 (95%) | 4 (4%) | 1 (1%) | 0.030 |
| +9091C>T | rs72769325 | Intron 3 | - | 97 (97%) | 3 (3%) | 0 | 0.015 |
| +33064T>G | ss749616236 | Intron 3 | - | 99 (99%) | 1 (1%) | 0 | 0.005 |
| +33116C>T | rs3811528 | Intron 3 | - | 49 (49%) | 40 (40%) | 11 (11%) | 0.310 |
| +33119G>T | rs36024497 | Intron 3 | - | 99 (99%) | 1 (1%) | 0 | 0.005 |
| +33125G>A | ss749616237 | Intron 3 | - | 99 (99%) | 1 (1%) | 0 | 0.005 |
| +33144A>G | rs3811527 | Intron 3 | - | 49 (49%) | 40 (40%) | 11 (11%) | 0.310 |
| +33145C>T | rs148763047 | Intron 3 | - | 98 (98%) | 2 (2%) | 0 | 0.010 |
| +33233T>C | rs34347318 | Exon 4 | - | 98 (98%) | 2 (2%) | 0 | 0.010 |
| +33245T>C | rs17017791 | Exon 4 | - | 99 (99%) | 1 (1%) | 0 | 0005 |
| +35844C>T | rs17017804 | Intron 4 | - | 99 (99%) | 1 (1%) | 0 | 0.005 |
| +35846C>T | rs144784212 | Intron 4 | - | 99 (99%) | 1 (1%) | 0 | 0.005 |
| +39141C>T | rs34436491 | Exon 6 | - | 99 (99%) | 1 (1%) | 0 | 0.005 |
| +39618C>G | rs7567833 | Exon 7 | His219Arg | 280 (93.3%) | 20 (6.7%) | 0 | 0.025 |
| +39739C>T | rs114716171 | Exon 7 | - | 96 (96%) | 4 (4%) | 0 | 0.020 |
The base position refers to the location of the polymorphism considering the A from the translation start site ATG as +1, and the adjacent 5′ base as -1. All variations adhered to the Hardy-Weinberg equilibrium. ‘AA’; individuals homozygote for the major allele, ‘Aa’; heterozygous individuals, ‘aa’; individuals homozygous for the minor allele; MAF, minor allele frequency.
Non-synonymous variations in COLEC10 and COLEC11.
| Variation | Region | PolyPhen-2 | Blosum62 | SIFT | Align-GVGD | PhD-SNP | Consurf |
|---|---|---|---|---|---|---|---|
|
| Exon 3 | Benign | 2 | 1 | C0 | Neutral (7) | Variable(3) |
|
| Exon 5 | Probably damaging | -3 | 0 | C65 | Disease (3) | Conserved (9) |
|
| Exon 7 | Benign | 0 | 0.462 | 29 | Neutral (3) | Variable(3) |
PolyPhen-2 classifies qualitatively a mutation as benign, possibly damaging, or probably damaging. BLOSUM62 matrix values range from 4 (more likely) to -4 (less likely substitutions). SIFT values >0.05 indicate substitutions with little effect on protein function, while values <0.05 point out possible deleterious substitutions. Align-GVGD categorizes a mutation in seven graded classifiers from less likely (class C0) to most likely to interfere with protein function (class C65). PhD-SNP produces a binary prediction (neutral vs disease-related polymorphism), together with the reliability score of the prediction (0–9). Consurf position-specific conservation scores are divided in 9 grades, from the most variable (grade 1) to the most conserved positions (grade 9).
Fig 2Distribution of polymorphisms and pair wise LD analysis (r2) observed in COLEC11.
A schematic drawing of the gene COLEC11 is given at the top. Exons are numbered and promoter and non-translated regions are indicated. The numbers in the grid refer to the r2 values of a given pair of SNPs. Darker colours of the boxes means higher r2 values.
Fig 3Impact of allelic variants on serum levels.
A-C, the serum levels of CL-L1 and CL-K1 and the corresponding genotype. Horizontal bars indicate median values; number in parenthesis, number of individuals and white circles, outliers. Statistical significance was determined by non-parametric Mann-Whitney and Kruskal-Wallis tests. D, the distribution of CL-L1 and CL-K1 levels in the cohort. The strength of the correlation was calculated using Spearman Rank.