Literature DB >> 21660233

A Method to Assess Linkage Disequilibrium between CNVs and SNPs Inside Copy Number Variable Regions.

Nathan E Wineinger¹, Nicholas M Pajewski, Hemant K Tiwari.

Abstract

Since the discovery of the ubiquitous contribution of copy number variation to genetic variability, researchers have commonly used metrics such as r (2) to quantify linkage disequilibrium (LD) between copy number variants (CNVs) and single nucleotide polymorphisms (SNPs). However, these reports have been restricted to SNPs outside copy number variable regions (CNVR) as current methods have not been adapted to account for SNPs displaying variable copy number. We show that traditional LD metrics inappropriately quantify SNP/CNV covariance when SNPs lie within CNVR. We derive a new method for measuring LD that solves this issue, and defaults to traditional metrics otherwise. Finally, we present a procedure to estimate CNV-SNP allele frequencies from unphased CNV-SNP genotypes. Our method allows researchers to include all SNPs in SNP/CNV LD measurements, regardless of copy number.

Entities: Disease Gene Species

Keywords: CNV–SNP haplotype; copy number variation; linkage disequilibrium

Year: 2011 PMID： 21660233 PMCID： PMC3109359 DOI： 10.3389/fgene.2011.00017

Source DB: PubMed Journal: Front Genet ISSN： 1664-8021 Impact factor: 4.599

Introduction

Examination of linkage disequilibrium (LD) between single nucleotide polymorphisms (SNPs) has played a key role in our understanding of worldwide patterns of genetic variation, including determining the extent of haplotype diversity (Conrad et al., 2006), detecting regions of positive selection (Sabeti et al., 2007), and guiding the design of most current genotyping arrays through the selection of appropriate haplotype tagging SNPs. Traditional pairwise metrics of LD, including r2, D, and D′, have been designed to quantify the degree of non-independence between neighboring genetic polymorphisms (Lewontin and Kojima, 1960; Lewontin, 1964; Hill and Robertson, 1968). With the current understanding that copy number variation (CNV) also significantly contributes to genetic variation (Redon et al., 2006), research has turned to the role for CNV in disease risk (Gonzalez et al., 2005; Aitman et al., 2006; McCarroll and Altshuler, 2007; Sebat et al., 2007), particularly as a partial explanation for the so-called missing heritability (Manolio et al., 2009; Eichler et al., 2010). Recently, genome-wide CNV surveys such as that performed by the Wellcome Trust Case Control Consortium (WTCCC) have concluded that common CNVs were adequately tagged by SNPs; and thus unlikely to substantially contribute to the genetic basis of common human diseases (Conrad et al., 2010; Wellcome Trust Case Control Consortium et al., 2010). However, current methods have restricted these studies to only include SNPs that fall outside of copy number variable regions (CNVR) – the ramifications being that more tagging SNPs are being missed, particularly in DNA segments of higher copy number. In this paper, we explicitly derive the covariance between SNPs and CNVs under a range of scenarios where SNPs either fall inside (interior) or outside (exterior) of a CNVR. We find that traditional LD metrics are sufficient for exterior SNPs; however, these same metrics inappropriately quantify covariance for interior SNPs. Specifically, we show that the covariance estimated from common metrics using interior SNPs will: (1) always be non-zero at any polymorphic loci; (2) differ based upon the arbitrary choice of reference SNP allele; and (3) potentially lead to high values of LD despite any meaningful correlation between the copy number state and SNP allele. Based on this result, we modify traditional techniques to appropriately quantify the covariance in the case of SNPs residing within CNVRs.

Materials and Methods

We begin with a brief review of current statistical metrics for the quantification of LD, discuss their performance in the presence of CNV, and conclude with our proposed statistics based on CNV–SNP covariance.

Review of current LD metrics for SNPs and CNVs

In accordance with current LD metrics, let X denote the integer copy number state for a CNVR on a single maternal/paternal chromosome or haploid, where we assume for simplicity that X can take three values representing a deletion (0), normal copy number (1), and duplication (2). Similarly define Y as the count of reference alleles at a SNP on the same chromosome, where we arbitrarily label the SNP alleles as A (reference) and B. The marginal probability distributions for X and Y can then be defined as: Assuming that the joint frequencies (f) are known, the covariance between X and Y can be written as: We consider this covariance between CNVs and SNPs in the following four scenarios. Scenario 1a: The SNP is outside a CNVR (exterior SNP) that contains a normal (one copy) variant and deletion (zero copies). Then: Scenario 1b: The SNP is outside a CNVR (exterior SNP) that contains a normal (one copy) variant and duplication (two copies). Then: In both of the above scenarios, the covariance between the CNV and SNP will appropriately be zero when X and Y are independent (i.e., the joint frequency is equivalent to the product of the marginal frequencies). Also, any inference concerning the relationship between the CNV and SNP does not depend on as the choice of reference allele, since only the direction of the covariance differs. Given these features, traditional measurements of LD between CNVs and SNPs are sufficient for exterior SNPs. Scenario 2a: The SNP is inside a CNVR (interior) that contains a normal (one copy) variant and deletion (zero copies). Table 1 provides definitions of CNV–SNP allele frequencies based on haploid, three copy number state model (zero to two copies per haploid). In situations where the SNP lies within the CNVR, SNP allele counts are dependent on copy number state. For example, whenever a deletion is present, both X and Y must be equal to zero. Thus,

Table 1

Copy number variant–single nucleotide polymorphism (CNV–SNP) alleles based upon a haploid three copy number state model (zero to two copies per haploid).

CNV–SNP allele	Copy number	Number of A alleles	Frequency
0	0	0	f₀
A	1	1	f_1,A
B	1	0	f_1,B
AA	2	2	f_2,AA
AB	2	1	f_2,AB
BB	2	0	f_2,BB

Copy number variant–single nucleotide polymorphism (CNV–SNP) alleles based upon a haploid three copy number state model (zero to two copies per haploid). Scenario 2b: The SNP is inside a CNVR (interior) that contains a normal (one copy) variant and duplication (two copies). This final scenario represents the most complex case. The sample space of Y needs to change to reflect the possibility of zero to two copies of the A allele. Namely: The covariance then becomes, Based on the covariances calculated in scenarios 2a and 2b, we find two undesirable features of current metrics when used to assess LD between interior SNP and CNVs: (1) polymorphic SNPs inside CNVRs will never be uncorrelated with the CNV; and (2) the correlation between variants will differ based upon which SNP allele is considered as the reference. In these scenarios the use of traditional LD measurements could impact association results. Consider a population where a monomorphic SNP lies within a CNVR that includes a moderately frequent deletion (for instance: f0 = 0.1 and f1,A = 0.9). Traditional metrics would conclude that the SNP and CNV are in perfect LD; and that any inference based upon the SNP would apply to the CNV. However, in the absence of copy number data, an association analysis based upon the SNP would be completely uninformative – leading to, perhaps, the incorrect conclusion that CNV is also not associated with the trait. In general, we show that high values of r2 between an interior SNP and deletion are obtained whenever the SNP minor allele frequency is low (Figure 1). However, in the absence of CNV data, the same incorrect conclusion would again be applied to the CNV. In these situations we would hope LD measurements would conclude independence. However that is not the case. We also note the result that the correlation between the SNP and CNV depends on the SNP allele considered as the reference. We have provided an example in the results section signifying this property. Together, these features demonstrate that traditional LD metrics are inappropriate when applied to interior SNPs and CNVs.

Figure 1

Linkage disequilibrium (.

Linkage disequilibrium (. To address these deficiencies, we now propose a new metric to quantify LD between CNVs and SNPs that functions equivalently to traditional measures for exterior SNPs, and solves these issues for interior SNPs.

Derivation of new proposed statistic

We consider a bi-allelic SNP present within a CNVR with three potential haploid copy number states: zero, one, or two copies – although methods here can be expanded to higher copy number, or multiple SNP alleles (Kalinowski and Hedrick, 2001). Define the CNV–SNP allele at this locus to be a combination of the haploid copy number state and nucleotide frequency with two differing, generically labeled SNPs A and B. Then this model can be treated similar to a multiallelic locus with alleles: 0, A, B, AA, AB, and BB; where 0 represents a deletion (Table 1). Combined in pairs, these alleles form a CNV–SNP genotype which provides information on the total number of copies of each nucleotide (Table 2). This model is consistent with those in the majority of copy number calling algorithms for array-based CNV detection (Wang et al., 2007; Korn et al., 2008; Coin et al., 2010). Note, however, that while CNV–SNP genotypes can be inferred from common genotyping platforms (Korn et al., 2008; Coin et al., 2010), the phase, particularly in duplicated regions, may be ambiguous. For example, an AAB genotype may have either of the phased haploid configurations AA/B or AB/A.

Table 2

Copy number variant–single nucleotide polymorphism (CNV–SNP) genotypes based upon a haploid three copy number state model, CNV–SNP haploid configurations, and respective frequencies.

CNV–SNP genotype	Haploid configuration	Frequency
0	0/0	f02
A	A/0	2f_1,Af₀
B	B/0	2f_1,Bf₀
AA	A/A	f1,A2
	AA/0	2f_2,AAf₀
AB	A/B	2f_1,Af_1,B
	AB/0	2f_2,ABf₀
BB	B/B	f1,B2
	BB/0	2f_2,BBf₀
AAA	AA/A	2f_2,AAf_1,A
AAB	AA/B	2f_2,AAf_1,B
	AB/A	2f_2,ABf_1,A
ABB	BB/A	2f_2,BBf_1,A
	AB/B	2f_2,ABf_1,B
BBB	BB/B	2f_2,BBf_1,B
AAAA	AA/AA
AAAB	AA/AB	2f_2,AAf_2,AB
AABB	AA/BB	2f_2,AAf_2,BB
	AB/AB	f2,AB2
ABBB	AB/BB	2f_2,ABf_2,BB
BBBB	BB/BB	f2,BB2

Frequency estimates are based upon haploid configurations falling into their appropriate Hardy–Weinberg equilibrium proportions.

Copy number variant–single nucleotide polymorphism (CNV–SNP) genotypes based upon a haploid three copy number state model, CNV–SNP haploid configurations, and respective frequencies. Frequency estimates are based upon haploid configurations falling into their appropriate Hardy–Weinberg equilibrium proportions. We note that in the case of interior SNPs, a deletion should not provide any information on the relationship between the copy number state and SNP allele(s) present. Therefore, let X be the integer haploid copy number state and Y represent the presence of a particular SNP allele, conditional on haploid copy number state not equal to zero, so that: f1 = f1,A + f1,B; f2 = f2,AA + f2,AB + f2,BB; fA = f1,A + f2,AA + 1/2f2,AB; and fB = f1,B = f2,BB + 1/2f2,AB according to the CNV–SNP allele frequencies listed in Table 1. The covariance between X and Y then becomes, which does not depend on the particular choice of the reference allele. We denote the inner factor in formula {9} as DC, noting its equivalence to Lewontin's D (Lewontin and Kojima, 1960) in situations for exterior SNPs. Specifically, let where Similar to D, the range of values for DC is difficult to interpret without proper scaling. Therefore, we propose a method nearly identical to the construction of D′(Lewontin, 1964). Define the maximum value that DC can take based upon allele frequencies as Then: Finally, let Meanwhile, we can also calculate the correlation between X and Y to be: or, alternatively: We again note that and are identical to the traditional LD measurements D′ and r2, respectively, for exterior SNPs; and both are an appropriate measurement for interior SNPs.

Estimation of CNV–SNP allele frequencies

Calculation of and is straightforward when the CNV–SNP haplotype frequencies are known. However, current methods for array-based genotype/CNV calling do not directly infer the haploid configuration (phase), though methods for estimating this configuration have been recently proposed (Kato et al., 2008; Su et al., 2010). Here we present a novel method to estimate CNV–SNP allele frequencies based on unphased CNV–SNP genotypes. The method is a direct result of an EM algorithm and nearly identical in construction to the gene-counting, allele frequency estimation procedure in Ceppellini et al. (1955) and Smith (1957). Consider a CNVR with CNV–SNP haploid configurations S/T such that S, T ∈ {0, A, B, AA, AB, BB}. In the E-step, haploid configuration counts are estimated based on the expected counts from estimated CNV–SNP allele frequencies. That is, for each CNV–SNP haploid configuration S/T: where NST is number of CNV–SNP genotypes that could possibly result in an S/T haploid configuration, fS/T, is the estimated frequency of the S/T haploid configuration, and fS/T, is the estimated frequency of CNV–SNP genotypes that could result in an S/T haploid configuration for the kth iteration. In the M-step, CNV–SNP allele frequencies estimates are updated: as well as new CNV–SNP haploid configuration frequencies estimates: The algorithm is based upon haploid configurations falling into their appropriate Hardy–Weinberg equilibrium proportions. As a result, this approach may perform poorly in de novo mutation hot-spots and CNVs found only in somatic cells.

Results

We provide calculations of for various CNV–SNP allele frequencies and compare them to the traditional measurements for SNPs inside CNVRs (Table 3). We define, and are the traditional metrics of LD using SNP allele A or B as the reference allele, respectively. Note how vastly different results can be obtained depending on which allele is used as the reference. The value of is the same irrespective of SNP allele considered as the reference allele.

Table 3

Measurements of linkage disequilibrium (LD) between copy number variants and single nucleotide polymorphisms (SNPs) within the copy number variable region.

Type	Frequency	ρΑ2	ρΒ2	ρΧ2
Deletion only (1)	f₀ = 0.1	0.111	0.074	0*
	f_1,A = 0.5
	f_1,B = 0.4
Deletion only (2)	f₀ = 0.5	0.429	0.250	0*
	f_1,A = 0.3
	f_1,B = 0.2
Duplication only (1)	f_1,A = 0.5	0.667	0.910	0.818
	f_2,AB = 0.1
	f_2,BB = 0.4
Duplication only (2)	f_1,A = 0.3	0.146	0.146	0
	f_1,B = 0.3
	f_2,AA = 0.1
	f_2,AB = 0.2
	f_2,BB = 0.1
Duplication only (3)	f_1,A = 0.3	0.098	0.098	0
	f_1,B = 0.3
	f_2,AA = 0.2
	f_2,BB = 0.2
Multiallelic (1)	f₀ = 0.2	0.014	0.656	0.758
	f_1,A = 0.5
	f_2,AB = 0.1
	f_2,BB = 0.2
Multiallelic (2)	f₀ = 0.2	0.222	0.222	0
	f_1,A = 0.3
	f_1,B = 0.3
	f_2,AA = 0.1
	f_2,BB = 0.1

0*: .

Measurements of linkage disequilibrium (LD) between copy number variants and single nucleotide polymorphisms (SNPs) within the copy number variable region. 0*: . We theoretically demonstrated how current metrics of LD are inappropriate in certain cases and proposed a new method that solves these issues. Note that the CNV–SNP allele frequencies are critical in calculating . We evaluated our method for estimating CNV–SNP allele frequencies via an EM algorithm, as described above in the methods section, using a simulation procedure. These results are provided in Table 4. In summary, our metric accurately and precisely measures SNP/CNV covariance, regardless of the location of the SNP and type of CNV. In particular, high values of will always lead to a proper conclusion about role of CNVs from the study of SNPs. In CNVRs that only include a deletion, our proposed method will always correctly assign independence between interior SNPs and the CNV. Meanwhile, in duplicated regions our metric will provide a value that appropriately quantifies the correlation between SNP allele(s) and the number of copies present.

Table 4

Type	Simulated frequency	Mean difference
No CNVs	f_1,A = 0.5	0*
	f_1,B = 0.5
Deletion only (1)	f₀ = 0.1	0*
	f_1,A = 0.5
	f_1,B = 0.4
Deletion only (2)	f₀ = 0.5	0*
	f_1,A = 0.3
	f_1,B = 0.2
Duplication only (1)	f_1,A = 0.5	0.0019
	f_1,B = 0.1	0.0019
	f_2,AB = 0.1	0.0019
	f_2,BB = 0.3	0.0019
Duplication only (2)	f_1,A = 0.3	0.0050
	f_1,B = 0.3	0.0050
	f_2,AA = 0.1	0.0048
	f_2,AB = 0.2	0.0082
	f_2,BB = 0.1	0.0048
Duplication only (3)	f_1,A = 0.3	0*
	f_1,B = 0.3
	f_2,AA = 0.2
	f_2,BB = 0.2
Multiallelic (1)	f₀ = 0.1	0.0040
	f_1,A = 0.3	0.0069
	f_1,B = 0.3	0.0108
	f_2,AA = 0.1	0.0047
	f_2,AB = 0.1	0.0076
	f_2,BB = 0.1	0.0100
Multiallelic (2)	f₀ = 0.1	0.0028
	f_1,A = 0.4	0.0038
	f_1,B = 0.3	0.0102
	f_2,AA = 0.1	0.0020
	f_2,BB = 0.1	0.0097

Mean difference represents the mean difference between the true and estimated CNV–SNP allele frequencies.

0*: Less than 1 × 10.

Copy number variant–single nucleotide polymorphism (CNV–SNP) allele frequency estimation procedure diagnostics based upon 1,000 simulations of a sample size of 1,000 (2,000 haploids) and various CNV–SNP allele frequencies. Mean difference represents the mean difference between the true and estimated CNV–SNP allele frequencies. 0*: Less than 1 × 10.

Discussion

We have provided rationale for why current metrics used to assess LD between CNVs and interior SNPs are inappropriate. Given that difficulties arise only for these SNPs, one potential solution, as previous studies have done, would be to rely upon exterior SNPs for tagging CNVs. Though this approach been successful for deletions, duplications tend to be in very low LD with exterior SNPs (Kato et al., 2010). It is possible that duplicate copies are not simply positioned in tandem next to a neighboring SNP in relation to the reference genome. The more extreme case arises when a duplicate copy has been translocated onto a different chromosome. In this situation an exterior SNP will be completely unlinked to the translocated duplicate. However, interior SNPs will segregate within the duplicate – particularly if this copy is not suitably matched for recombination. A similar argument can be made for duplicated segments inserted downstream of its reference location. A larger physical distance between the duplicated copies and an exterior SNP allows for a greater probability of recombination to eliminate LD. However, this distance will be irrelevant in regards to the allelic content within the duplicated genomic segment. We have included a new method to quantify LD between CNVs and SNPs which provides accurate estimates for interior SNPs and defaults to the traditional measurements otherwise. As our methods require knowledge of CNV–SNP allele frequencies, we have provided an estimation procedure that performs well under a wide range of scenarios. We hope CNV researchers, particularly those hoping to draw conclusions about CNVs from SNPs, will use this method to identify tagging SNPs which may or may not exist within the CNV boundary.

Web Resources

R code to measure and CNV–SNP allele frequencies from CNV–SNP genotypes is available from the corresponding author upon request.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

22 in total

1. The estimation of gene frequencies in a random-mating population.

Authors: R CEPPELLINI; M SINISCALCO; C A SMITH
Journal: Ann Hum Genet Date: 1955-10 Impact factor: 1.670

2. The Interaction of Selection and Linkage. I. General Considerations; Heterotic Models.

Authors: R C Lewontin
Journal: Genetics Date: 1964-01 Impact factor: 4.562

Review 3. Copy-number variation and association studies of human disease.

Authors: Steven A McCarroll; David M Altshuler
Journal: Nat Genet Date: 2007-07 Impact factor: 38.330

4. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.

Authors: Kai Wang; Mingyao Li; Dexter Hadley; Rui Liu; Joseph Glessner; Struan F A Grant; Hakon Hakonarson; Maja Bucan
Journal: Genome Res Date: 2007-10-05 Impact factor: 9.043

5. The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility.

Authors: Enrique Gonzalez; Hemant Kulkarni; Hector Bolivar; Andrea Mangano; Racquel Sanchez; Gabriel Catano; Robert J Nibbs; Barry I Freedman; Marlon P Quinones; Michael J Bamshad; Krishna K Murthy; Brad H Rovin; William Bradley; Robert A Clark; Stephanie A Anderson; Robert J O'connell; Brian K Agan; Seema S Ahuja; Rosa Bologna; Luisa Sen; Matthew J Dolan; Sunil K Ahuja
Journal: Science Date: 2005-01-06 Impact factor: 47.728

6. Global variation in copy number in the human genome.

Authors: Richard Redon; Shumpei Ishikawa; Karen R Fitch; Lars Feuk; George H Perry; T Daniel Andrews; Heike Fiegler; Michael H Shapero; Andrew R Carson; Wenwei Chen; Eun Kyung Cho; Stephanie Dallaire; Jennifer L Freeman; Juan R González; Mònica Gratacòs; Jing Huang; Dimitrios Kalaitzopoulos; Daisuke Komura; Jeffrey R MacDonald; Christian R Marshall; Rui Mei; Lyndal Montgomery; Kunihiro Nishimura; Kohji Okamura; Fan Shen; Martin J Somerville; Joelle Tchinda; Armand Valsesia; Cara Woodwark; Fengtang Yang; Junjun Zhang; Tatiana Zerjal; Jane Zhang; Lluis Armengol; Donald F Conrad; Xavier Estivill; Chris Tyler-Smith; Nigel P Carter; Hiroyuki Aburatani; Charles Lee; Keith W Jones; Stephen W Scherer; Matthew E Hurles
Journal: Nature Date: 2006-11-23 Impact factor: 49.962

7. Origins and functional impact of copy number variation in the human genome.

Authors: Donald F Conrad; Dalila Pinto; Richard Redon; Lars Feuk; Omer Gokcumen; Yujun Zhang; Jan Aerts; T Daniel Andrews; Chris Barnes; Peter Campbell; Tomas Fitzgerald; Min Hu; Chun Hwa Ihm; Kati Kristiansson; Daniel G Macarthur; Jeffrey R Macdonald; Ifejinelo Onyiah; Andy Wing Chun Pang; Sam Robson; Kathy Stirrups; Armand Valsesia; Klaudia Walter; John Wei; Chris Tyler-Smith; Nigel P Carter; Charles Lee; Stephen W Scherer; Matthew E Hurles
Journal: Nature Date: 2009-10-07 Impact factor: 49.962

8. Strong association of de novo copy number mutations with autism.

Authors: Jonathan Sebat; B Lakshmi; Dheeraj Malhotra; Jennifer Troge; Christa Lese-Martin; Tom Walsh; Boris Yamrom; Seungtai Yoon; Alex Krasnitz; Jude Kendall; Anthony Leotta; Deepa Pai; Ray Zhang; Yoon-Ha Lee; James Hicks; Sarah J Spence; Annette T Lee; Kaija Puura; Terho Lehtimäki; David Ledbetter; Peter K Gregersen; Joel Bregman; James S Sutcliffe; Vaidehi Jobanputra; Wendy Chung; Dorothy Warburton; Mary-Claire King; David Skuse; Daniel H Geschwind; T Conrad Gilliam; Kenny Ye; Michael Wigler
Journal: Science Date: 2007-03-15 Impact factor: 47.728

9. Population-genetic nature of copy number variations in the human genome.

Authors: Mamoru Kato; Takahisa Kawaguchi; Shumpei Ishikawa; Takayoshi Umeda; Reiichiro Nakamichi; Michael H Shapero; Keith W Jones; Yusuke Nakamura; Hiroyuki Aburatani; Tatsuhiko Tsunoda
Journal: Hum Mol Genet Date: 2009-12-05 Impact factor: 6.150

10. Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans.

Authors: Timothy J Aitman; Rong Dong; Timothy J Vyse; Penny J Norsworthy; Michelle D Johnson; Jennifer Smith; Jonathan Mangion; Cheri Roberton-Lowe; Amy J Marshall; Enrico Petretto; Matthew D Hodges; Gurjeet Bhangal; Sheetal G Patel; Kelly Sheehan-Rooney; Mark Duda; Paul R Cook; David J Evans; Jan Domin; Jonathan Flint; Joseph J Boyle; Charles D Pusey; H Terence Cook
Journal: Nature Date: 2006-02-16 Impact factor: 49.962

4 in total

1. FCGR Genetic Variation in Two Populations From Ecuador Highlands-Extensive Copy-Number Variation, Distinctive Distribution of Functional Polymorphisms, and a Novel, Locally Common, Chimeric FCGR3B/A (CD16B/A) Gene.

Authors: Manuela Moraru; Adriana Perez-Portilla; Karima Al-Akioui Sanz; Alfonso Blazquez-Moreno; Antonio Arnaiz-Villena; Hugh T Reyburn; Carlos Vilches
Journal: Front Immunol Date: 2021-05-24 Impact factor: 7.561

2. Genomic predictions combining SNP markers and copy number variations in Nellore cattle.

Authors: El Hamidi A Hay; Yuri T Utsunomiya; Lingyang Xu; Yang Zhou; Haroldo H R Neves; Roberto Carvalheiro; Derek M Bickhart; Li Ma; Jose Fernando Garcia; George E Liu
Journal: BMC Genomics Date: 2018-06-05 Impact factor: 3.969

3. High density LD-based structural variations analysis in cattle genome.

Authors: Ricardo Salomon-Torres; Lakshmi K Matukumalli; Curtis P Van Tassell; Carlos Villa-Angulo; Víctor M Gonzalez-Vizcarra; Rafael Villa-Angulo
Journal: PLoS One Date: 2014-07-22 Impact factor: 3.240

4. A Highly Polymorphic Copy Number Variant in the NSF Gene is Associated with Cocaine Dependence.

Authors: Judit Cabana-Domínguez; Carlos Roncero; Lara Grau-López; Laia Rodríguez-Cintas; Carmen Barral; Alfonso C Abad; Galina Erikson; Nathan E Wineinger; Bàrbara Torrico; Concepció Arenas; Miquel Casas; Marta Ribasés; Bru Cormand; Noèlia Fernàndez-Castillo
Journal: Sci Rep Date: 2016-08-08 Impact factor: 4.379

4 in total