
MarkerSet: a marker selection tool based on markers location and informativity in experimental designs.

Abstract

BACKGROUND: The recent sequencing of full genomes has led to the availability of many SNP markers, which are very useful for mapping complex traits. In livestock production, there are still no commercial arrays, and many studies use home-made sets of SNPs. Because current SNP genotyping methodologies remain expensive, selecting the SNPs to use is a crucial step. Indeed, the main factors affecting the power of linkage analyses are the density of the genetic map and the heterozygosity of the markers in the parents of the tested animals.
FINDINGS: We have therefore developed a PERL program that selects a defined number of markers based on their locations on the genome and their informativity in specific experimental designs. As an option, different experimental designs can be combined in order to select the best possible common marker set. The program has been tested under different conditions of marker informativity and density with both real and simulated datasets. The results show that the program efficiently selects the most informative markers for whole genome scan mapping analyses, even when informativity varies widely. When different experimental crosses are combined, the multidesign mode can optimize the selection of SNP markers.
CONCLUSION: Written in PERL, MarkerSet is highly portable across operating systems (OS), and its source code is available for user modifications. Except for the simulation mode, which can be time consuming, MarkerSet computes results in a very short time.


Year: 2008 PMID: 18710478 PMCID: PMC2525642 DOI: 10.1186/1756-0500-1-9

Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500

Findings

The recent sequencing of full genomes has led to the availability of many SNP markers ([1] for human and [2] for chicken). Current methodologies for genotyping home-made SNP sets are still expensive, meaning that only a few thousand SNPs can be used. Selecting the best-suited SNPs is therefore a crucial step for a specific study. For linkage analyses, the main criteria that increase the power of the analysis are the distances between markers and the ability to follow the segregation of marker alleles in the experimental design. The markers must therefore be heterozygous, as far as possible, in the parents of the phenotyped animals. This is why the heterozygosity in the parents of phenotyped animals (hereafter called reference animals) must be included in the marker selection. In this manuscript, this heterozygosity in reference animals is referred to as the informativity of the markers.

From our point of view, if no SNP arrays are available, the best strategy is a two-step genotyping: first, the informativity of a large panel of SNPs is tested on reference animals from the studied experimental design; then all the animals are genotyped for the markers selected from the results of the first step. Marker selection is complicated by the fact that the markers most heterozygous in reference animals are not homogeneously spaced across the genome, and the number of markers to handle has greatly increased. It is therefore no longer possible to select the markers without dedicated software. Different tools have already been proposed to select tag SNPs [3-11], but most of them rely on very high marker density and linkage disequilibrium information. They cannot be used in exotic species or in species without SNP arrays, for which linkage disequilibrium information is not always available. We propose here a tool that selects the best possible markers for further linkage analysis without any use of linkage disequilibrium information.
Its originality is the use of both marker location in the genome and heterozygosity in parental animals. The MarkerSet software was written in the PERL programming language and can be downloaded with manual and example files at . The software is designed to use already available information about marker informativity, expressed as the number of heterozygous animals out of all the reference animals tested in the experimental design. This allows the use of any kind of marker, as well as combinations of marker types if needed. If more than one experimental design is to be genotyped, a specific set of SNPs can be selected for each design, or the marker informativity of all the experimental designs can be used simultaneously to select a common set of markers. When a marker set common to all designs is selected, both the general informativity score and the design-specific scores are reported, so the informativity of the marker set can be evaluated for each experimental design.

In most species, the only available information for the markers is their physical location (especially for SNP markers), as not all markers have been tested on a reference population to estimate genetic distances. Nevertheless, genetic distances are the key quantity for QTL mapping, because the recombination rate can vary greatly between species. MarkerSet therefore takes physical distances as input and converts them into cM. This conversion can be adapted to the studied species (for example, in pigs, 1 cM can be considered to correspond to approximately 1 Mb).

Basically, the algorithm selects the most informative markers in two windows separated by a constant gap and sliding along the genome (see Figure 1A). When several markers in a window have similar informativity, MarkerSet selects the marker closest to the middle of the window.
Using this strategy, the distance between two markers is the first selection criterion, and informativity is used to discriminate between closely located markers. The two main variables are the starting point of the first window on the genome and the size of the gap separating the two windows. Depending on the number of markers to select and the size of the genome, MarkerSet computes different window starting points to obtain the best genome coverage.
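
The window-based selection described above can be illustrated with a minimal Python sketch. It is not the MarkerSet PERL code: the names `Marker` and `select_markers` are hypothetical, a single window is advanced by one AMI per step (a simplification of the two-window scheme), and positions are assumed to be already expressed in cM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Marker:
    name: str
    position_cm: float   # position along the genome, in cM (assumed already converted)
    informativity: int   # number of heterozygous reference animals


def select_markers(markers, genome_size_cm, n_markers, start_cm=0.0, window_pct=0.20):
    """Pick at most one marker per window; window starts are spaced by one AMI.

    Assumption: a single window slides along the genome, which simplifies
    the two-window scheme described in the text.
    """
    ami = genome_size_cm / n_markers   # Average Marker Interval
    window = window_pct * ami          # window size as a fraction of the AMI
    selected = []
    left = start_cm
    while left < genome_size_cm:
        right = left + window
        centre = (left + right) / 2
        candidates = [m for m in markers if left <= m.position_cm <= right]
        if candidates:
            # Highest informativity first; ties go to the marker nearest the centre.
            best = min(
                candidates,
                key=lambda m: (-m.informativity, abs(m.position_cm - centre)),
            )
            selected.append(best)
        left += ami
    return selected
```

Calling `select_markers` with several values of `start_cm` would produce the alternative panels that correspond to different starting points.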

Figure 1

Principles of MarkerSet and main parameters. a) MarkerSet selects markers in two windows separated by the Average Marker Interval (AMI), which is the whole genome size divided by the number of markers to select. The window size is a percentage of the AMI (20% by default). Iteratively shifting the windows by the AMI gives full genome coverage. Different sets are created by using all the possible starting points (x and y). b) Several parameters and options are available to improve the quality of the sets. The space_plus and space_resampling parameters are used to enlarge the window in case of low (or no) informativity. By default, space_plus is set to 50% of the window size on each side; this enlargement is performed automatically if the informativity of the available markers is lower than the defined informativity threshold. space_resampling is used to iteratively enlarge the window (by default, +1 cM on each side at each step) until markers with informativity higher than the defined resampling threshold are found (resampling option mode).

The gap size and the window size are defined by the average marker interval (AMI), which is the ratio of the whole genome size to the number of markers to select. The percentage of the AMI used to calculate the window size is defined in the config.pm file (20% of the AMI by default), so the size of the selection window is set automatically by the software. These two parameters (AMI and window size) determine the number of possible starting points (i.e. the number of selected marker panels). Thus, for each combination of these parameters, a marker selection is performed with a fixed starting point and multiple iterations over the genome (Selection Frame). At each iteration step, the starting point of each pickup box is increased by AMI + window size (see Figure 1A).
For all analyses, an informativity threshold is set: if the best available marker in a window has an informativity strictly lower than this threshold, the window is enlarged (space plus: by default, 50% of the window size is added to each side of the window; see Figure 1B) and a more informative marker is searched for. By default, this threshold is half of the best possible informativity score for one marker (i.e. half of the total number of animals tested). If no marker with a higher informativity is found, the previous best marker is kept, as this results in a shorter distance between markers. As an option, the window can be enlarged repeatedly until a marker more informative than the resampling threshold (set by the user) is found (resampling option). The size of each enlargement is defined by the user through the space resampling parameter (see Figure 1B). The working principle of the software is shown in Figure 2.
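
The threshold rule and the single "space plus" enlargement can be sketched as follows. This is an illustration in Python under stated assumptions, not the MarkerSet implementation: markers are plain `(position_cm, informativity)` tuples, and the function names are hypothetical. The iterative resampling option is not covered.

```python
def best_in_window(markers, left, right):
    """Return the most informative (position, informativity) pair in [left, right].

    Ties are broken by distance to the window centre; None if the window is empty.
    """
    centre = (left + right) / 2
    inside = [m for m in markers if left <= m[0] <= right]
    if not inside:
        return None
    return min(inside, key=lambda m: (-m[1], abs(m[0] - centre)))


def pick_with_space_plus(markers, left, right, n_animals, space_plus=0.5):
    """Select a marker, enlarging the window once if informativity is too low."""
    threshold = n_animals / 2          # default threshold: half of the animals tested
    best = best_in_window(markers, left, right)
    if best is not None and best[1] >= threshold:
        return best                    # informative enough, no enlargement needed
    # Enlarge by space_plus * window size on each side (50% by default).
    extra = space_plus * (right - left)
    wider = best_in_window(markers, left - extra, right + extra)
    # Keep the previous best unless the enlarged window offers a strictly
    # more informative marker.
    if wider is not None and (best is None or wider[1] > best[1]):
        return wider
    return best
```

Keeping the original candidate when the enlarged window brings no improvement mirrors the stated preference for the marker closest to its expected position.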

Figure 2

Working principle of MarkerSet.

In order to score the different panels obtained, one approach is simply to sum the informativity values of the selected markers (i.e. the number of heterozygous parental animals in our case). This linear scoring is effective for markers with an extreme informativity value (i.e. 0 or 1 heterozygous animals or, at the other extreme, all animals heterozygous), but it does not discriminate well between "middle-range" markers. For example, with a total of 6 tested animals, we prefer to give much more weight to a marker with 4 heterozygous animals than to one with 3 heterozygous animals. To best represent the informativity of a marker, we decided to transform the informativity value of each marker onto a sigmoid scale (see Figure 3). This approach maximises the score of the most informative markers and minimises that of the least informative ones but, more importantly, it discriminates between "middle-range" informative markers. Finally, a panel score is obtained by summing the scores of all the markers selected for this panel. In addition, the software computes some information describing each experimental design: the maximum informativity score (i.e. the sum of the informativity scores of all available markers) and the distribution of the number of markers in each informativity class. These data are available to the user in a log file.

Figure 3

Computation of informativity weight. Empirically, this sigmoid scale is obtained by applying the arctangent function to values between -5 and +5 (corresponding to transformed informativity scores of -1.37 to +1.37). For each experimental design, the informativity values are re-assigned to a -5 to +5 scale (see Figure 3). Let X = {X_0, X_1, ..., X_n} denote the informativity values, where n is the number of animals tested in one experimental design. Each value is determined as X_i = X_{i-1} + 10/n, with X_0 = -5, so that the scale runs from -5 to +5. The informativity scores then range from -1.37 to +1.37 (the arctangent of -5 and +5, respectively). The scores are finally adjusted to a 0 to 2.74 range so that all score values are positive. The vertical axis represents the informativity weight and the horizontal axis the informativity values.

When studying several experimental designs in the same species, the user may want to compare two options: selecting markers perfectly fitted to each experimental design (for example, heterozygous in all F1 sires), or selecting a larger set of markers common to all experimental designs (in which case some markers will be homozygous in some families, resulting in a loss of power in the linkage analysis). To help with this dilemma, a multidesign option has been implemented. The principle of marker selection and panel scoring is exactly the same, except that the software uses a global informativity value generated by summing, for each marker, the informativity values of all experimental designs. Based on this global informativity, MarkerSet selects the most informative markers for the multidesign and scores the set with the multidesign informativity values (score A).
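
The arctangent weighting and the multidesign summation can be made concrete with a short Python sketch. The function names are hypothetical, and the final adjustment to the 0 to 2.74 range is assumed here to be a shift by arctan(5) ≈ 1.37, which is consistent with the stated range but is not confirmed by the text.

```python
import math


def informativity_weight(k, n):
    """Weight of a marker heterozygous in k of the n reference animals tested."""
    x = -5.0 + 10.0 * k / n                 # X_k = X_0 + k * 10/n, with X_0 = -5
    # Assumption: the adjustment to positive values is a shift by arctan(5).
    return math.atan(x) + math.atan(5.0)


def panel_score(het_counts, n):
    """Panel score: sum of the weights of all selected markers."""
    return sum(informativity_weight(k, n) for k in het_counts)


def global_informativity(per_design_counts):
    """Multidesign value: per-marker sum of heterozygous counts over designs.

    per_design_counts holds one list of counts per design, in the same marker order.
    """
    return [sum(counts) for counts in zip(*per_design_counts)]
```

With 6 tested animals, this gives a weight of about 1.37 for 3 heterozygous animals and about 2.40 for 4, whereas the step from 5 to 6 heterozygous animals adds less than 0.1. This is the behaviour sought for "middle-range" markers.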
As mentioned above, the multidesign option makes it possible to evaluate which solution fits best for a defined number of markers: a set of markers common to all experimental designs, or several sets of markers each specific to one design. To measure the loss of informativity, MarkerSet simulates a marker selection specific to each design using the same selection frame (with the number of markers to select in the multidesign option) and scores it (score B). A ratio between the multidesign score (score A) and the design-specific score (score B) is then calculated (called MD/Sim, r in the logfile). This ratio estimates the informativity score "conserved" between the multidesign and the design-specific marker selections: for example, a ratio of 0.82 means that only 18% of the informativity score is lost with the multidesign option. As the results can fluctuate greatly with the informativity and density of the available markers, a simulation can be performed to determine the best-suited window size percentage. This simulation can also be combined with all the available options (resampling and multidesign).

In order to test the core functions and options of the program, MarkerSet was run on several different data files. First, a small data file corresponding to a real case was generated with 206 low-informativity markers (162 of which are not informative at all) located on one chromosome of 63 Mb, and 4 tested animals. Running MarkerSet on this file in verbose mode, we checked that the algorithm effectively selects the most informative marker, takes the marker location into account in case of similar informativity, and enlarges the window in case of low or no informativity. The resampling function was also validated. The test file and the results log are available on the website as examples.
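
The MD/Sim ratio introduced earlier in this section reduces to a simple quotient. The following Python lines are only a worked illustration of that arithmetic, with hypothetical function names; the scores passed in the usage line are made-up values chosen to reproduce the 0.82 example.

```python
def md_sim_ratio(score_a, score_b):
    """Ratio of the multidesign score (A) to the design-specific score (B)."""
    if score_b <= 0:
        raise ValueError("design-specific score must be positive")
    return score_a / score_b


def informativity_lost_pct(ratio):
    """Percentage of the informativity score lost with the multidesign option."""
    return (1.0 - ratio) * 100.0


# Made-up scores: A = 82 and B = 100 give the ratio of 0.82 used in the text.
ratio = md_sim_ratio(82.0, 100.0)
```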
Once the main concept of the program had been tested and validated with the small data file, we extended the tests to various other situations by generating simulated data files with different marker densities and informativity distributions. Finally, the program was also tested on a real informativity file of 9216 markers with five experimental designs (see Figure 4 for the distribution of the number of markers in each informativity class). For each informativity file, a selection of 384 markers was performed in the basic mode, with or without the resampling option, and in the multidesign mode (1536 markers requested), with or without the resampling option. Score results for the simulated data files and the real data file are shown in Tables 1 and 2, respectively (see additional files 1 and 2 for complete results). As expected, MarkerSet results are very sensitive to marker density and informativity distribution. Notably, with our real data file, there are not enough informative markers to select 1536 SNPs. Moreover, the multidesign option can have a drastic impact on the scores and on the loss of informativity (Ratio) with low-informativity files (especially with the resampling option).

Figure 4

Marker informativity distribution in the experimental designs. Each bar represents the number of markers for each informativity value in each experimental design.

Table 1

Testing results for simulated data files, requesting 384 and 1536 markers.

Mode     Design   Low density (5K)               High density (40K)
                  Score     Ratio   -r gain      Score     Ratio   -r gain
Basic    HI       1002.86                        1051.61
         VI       927.44                         1047.9
         LI       317.92                         500
R        HI       1008.3            0.54%        1051.61           0%
         VI       932.38            0.53%        1047.9            0%
         LI       320.59            0.84%        500               0%
MD       MD       1879.48                        3513.19
         HI       2774.49   0.99                 3959.82   0.96
         VI       1940.27   0.96                 3760.68   0.94
         LI       428.14    0.80                 769.18    0.49
R + MD   MD       2428.19           29.19%       3513.19           0%
         HI       3487.45   0.98    25.70%       3959.82   0.96    0%
         VI       2519.01   0.82    29.83%       3760.68   0.94    0%
         LI       539.56    0.53    26.02%       769.18    0.49    0%

For this purpose, six files with different marker informativity status were generated from two variables. The first is the marker density (5K markers, LD for low density, or 40K markers, HD for high density, spread homogeneously over the genome). The second is the distribution of marker informativity. Considering a total of 100 reference animals, the following conditions were explored: markers with heterozygosity values ranging from 50 to 100 (High Informativity, HI), from 0 to 100 (Various Informativity, VI) or from 0 to 50 (Low Informativity, LI). For each marker panel and condition, the maximal available informativity score (max info), the score of the selected set, the multidesign/monodesign ratio and the score gain obtained by using the resampling option (-r gain) are detailed. R and MD refer to activation of the resampling option and of the multidesign option, respectively. Score results depend on marker density and informativity distribution (higher with HI files and lower with LI files). Nevertheless, there is only a slight score difference between HI and VI, showing the efficiency of MarkerSet in selecting the most informative markers. The resampling option is more useful with LD files but can affect the loss of informativity (Ratio) in multidesign mode with the LI file.
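
The "-r gain" column is not defined by a formula in the text. A plausible reading, which reproduces the values printed in Table 1, is the relative increase of the score with resampling over the score without it. The Python sketch below encodes that assumption; the function name is hypothetical.

```python
def resampling_gain_pct(basic_score, resampling_score):
    """Relative score gain (in %) brought by the resampling option.

    Assumption: gain = (score with -r - score without -r) / score without -r.
    """
    return round((resampling_score - basic_score) / basic_score * 100.0, 2)


# Values taken from Table 1 (low density, HI file, Basic vs R rows).
hi_low_density_gain = resampling_gain_pct(1002.86, 1008.3)   # 0.54
```

Under this reading, the high-density files show a gain of 0% because the scores with and without resampling are identical.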

Table 2

Testing results for the real data set, requesting 384 and 1536 markers.

Real dataset
Mode     Design   Max info   Score     Ratio   -r gain   Dmax   Dmin   AveD   StD   Markers   0 markers
Basic    Exp1     3509.86    528.67                      28.6   0.4    9.2    2.2   380       72
         Exp2     5958.31    808.76                      28.7   0.4    9.2    2.2   380       32
         Exp3     5685.11    680.55                      20.6   0.6    9.2    2     380       10
         Exp4     6503.60    785.52                      17.8   0.2    9.1    2.1   382       7
         Exp5     5293.64    673.66                      17.5   0.2    9.1    2     382       26
R        Exp1                605.03            14.44%    90.3   0.4    9.5    5.8   366       0
         Exp2                887.82            9.78%     20.6   0.3    9.2    2.5   383       0
         Exp3                701.85            3.13%     16.8   0.5    9.1    2.1   384       0
         Exp4                803.9             2.34%     16.6   0      9.1    2.1   384       0
         Exp5                713.43            5.90%     26.9   0.5    9.2    2.5   380       0
MD       MD       3581.64    979.58                      11.7   0.1    2.5    0.9   1461      137
         Exp1                801.79    0.81
         Exp2                1512.85   0.92
         Exp3                1282.7    0.89
         Exp4                1494.43   0.9
         Exp5                1229.34   0.89
R + MD   MD                  1114.33           13.76%    11.3   0.1    2.4    0.9   1483      0
         Exp1                898.48    0.47    12.06%
         Exp2                1720.82   0.6     13.75%
         Exp3                1446.47   0.72    12.77%
         Exp4                1695.42   0.73    13.45%
         Exp5                1400.07   0.63    13.89%

The data file includes the genotypes of 9216 SNPs covering the whole genome for the 26 F1 sires of five real chicken F2 designs (4 in Exp1, 5 in Exp3 and Exp5, and 6 in Exp2 and Exp4). For each marker panel and condition, the following are detailed: the maximal available informativity score (max info), the score of the selected set, the multidesign/monodesign ratio, the score gain obtained by using resampling (-r gain), the maximal (Dmax), minimal (Dmin) and average (AveD) distances between two markers and their standard deviation (StD), the number of selected markers, and the number of non-informative markers in the set. R and MD refer to activation of the resampling option and of the multidesign option, respectively. With the resampling option, the gain is inversely proportional to the maximum informativity, except for Exp2, because of an overrepresentation of markers heterozygous for 0 and 6 animals in this experimental design. The results for multidesign mode (1536 markers) are similar to those obtained with the 5K markers file: the ratio is about 0.90, and the resampling option increases the number of selected markers (and thus the final score) without significantly modifying the average distance and the standard deviation.

The simulation option was also tested on the simulated and real data files (see additional files 1 and 2). As expected, the highest score is always obtained with the highest AMI percentage, since the windows are larger (see Figure 5). Depending on the priority given to marker locations or to marker informativity, users should test different conditions to find the parameters best suited to their experimental designs.

Figure 5

Impact of window size on informativity score and standard deviation. The horizontal axis represents the percentage of the AMI used to define the window size (15 to 40%). The left vertical axis represents the best marker set score (full squares), and the right vertical axis the standard deviation (white diamonds). The simulation mode was run on experimental design 1 with 384 markers requested and without the resampling option.

Availability and requirements

Project name: MarkerSet
Project homepage:
Operating system: Platform independent
Programming language: PERL
Other requirements: POSIX PERL module
License: GNU GPL
Any restrictions to use by non-academics: license needed

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

OD carried out the general objectives of the program and the data analyses, and drafted the manuscript. FL carried out the general objectives of the program, the definition of the algorithms and the PERL code implementation, and drafted the manuscript. Both authors read and approved the final manuscript.

Additional file 1

Complete results for 5K and 40K markers datasets (including simulation mode results).

Additional file 2

Complete results for real dataset (including simulation mode results).

References

1. Initial sequencing and analysis of the human genome.

Authors: E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; et al.
Journal: Nature Date: 2001-02-15 Impact factor: 49.962

2. Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.

Authors: Christopher S Carlson; Michael A Eberle; Mark J Rieder; Qian Yi; Leonid Kruglyak; Deborah A Nickerson
Journal: Am J Hum Genet Date: 2003-12-15 Impact factor: 11.025

Review 3. Tag SNP selection for association studies.

Authors: Daniel O Stram
Journal: Genet Epidemiol Date: 2004-12 Impact factor: 2.135

4. Linear reduction methods for tag SNP selection.

Authors: Jingwu He; Alex Zelikovsky
Journal: Conf Proc IEEE Eng Med Biol Soc Date: 2004

5. Combining functional and linkage disequilibrium information in the selection of tag SNPs.

Authors: P C Sham; S I Ao; J S H Kwan; P Kao; F Cheung; P Y Fong; M K Ng
Journal: Bioinformatics Date: 2006-10-23 Impact factor: 6.937

6. Informative SNP selection methods based on SNP prediction.

Authors: Jingwu He; Alexander Zelikovsky
Journal: IEEE Trans Nanobioscience Date: 2007-03 Impact factor: 2.935

7. A genetic variation map for chicken with 2.8 million single-nucleotide polymorphisms.

Authors: Gane Ka-Shu Wong; Bin Liu; Jun Wang; Yong Zhang; Xu Yang; et al.
Journal: Nature Date: 2004-12-09 Impact factor: 49.962

Review 8. Software for tag single nucleotide polymorphism selection.

Authors: Daniel O Stram
Journal: Hum Genomics Date: 2005-06 Impact factor: 4.639

9. A model-based approach to selection of tag SNPs.

Authors: Pierre Nicolas; Fengzhu Sun; Lei M Li
Journal: BMC Bioinformatics Date: 2006-06-15 Impact factor: 3.169

10. A comparison of five methods for selecting tagging single-nucleotide polymorphisms.

Authors: Kelly M Burkett; Mercedeh Ghadessi; Brad McNeney; Jinko Graham; Denise Daley
Journal: BMC Genet Date: 2005-12-30 Impact factor: 2.797
