| Literature DB >> 30541448 |
Odile Rogier1, Aurélien Chateigner1, Souhila Amanzougarene1, Marie-Claude Lesage-Descauses1, Sandrine Balzergue2,3, Véronique Brunaud2, José Caius2, Ludivine Soubigou-Taconnat2, Véronique Jorge1, Vincent Segura4.
Abstract
BACKGROUD: Populus nigra is a major tree species of ecological and economic importance for which several initiatives have been set up to create genomic resources. In order to access the large number of Single Nucleotide Polymorphisms (SNPs) typically needed to carry out a genome scan, the present study aimed at evaluating RNA sequencing as a tool to discover and type SNPs in genes within natural populations of P. nigra.Entities:
Keywords: Bioinformatics pipeline; Black poplar; DNA polymorphisms; Transcriptomics
Mesh:
Substances:
Year: 2018 PMID: 30541448 PMCID: PMC6291945 DOI: 10.1186/s12864-018-5239-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Variant calling pipeline. Schematic representation of the bioinformatic pipeline devised for variant calling and genotyping from mRNA sequencing reads. References and parameters for each tools are indicated in the method section
Number of variants detected for each of the 7 calling modalities tested in the present study
| Variant calling modality | Missing value in genotype calls | ||
|---|---|---|---|
| noNA | 2NA | anyNA | |
| GATK | 464,829 | 555,828 | 902,256 |
| gVCF_GATK | 407,037 | 497,314 | 927,522 |
| FreeBayes | 492,073 | 640,445 | 795,459 |
| Mpileup | 496,688 | 594,932 | 949,411 |
| 3Callers | 341,584 | 366,123 | 400,392 |
| 4Callers | 252,887 | 262,447 | 271,399 |
| 3CallersConsensus | 356,275 | 442,931 | 785,377 |
“noNA”: no missing value; “2NA”: lower or equal to 2 missing values; “anyNA”: no filter on missing values. “3Callers”: SNPs detected with at least 3 callers; “4Callers”: SNPs detected with 4 callers; “3CallersConsensus”: SNPs detected with at least 3 callers with correction of the genotype calling when discrepancies existed between callers (see details in Methods)
Fig. 2Variant discovery in 12 Populus nigra genotypes. Number of variants discovered with each of the four callers (“GATK” in red; “gVCF_GATK” in blue; “FreeBayes” in green and “Mpileup” in purple) after applying different filters (“Raw”: no filters; “Biallelic SNPs”: indel removed; only biallelic SNPs retained; “QUAL ≥ 30”: SNP quality greater than 30 retained; “Intranigra/anyNA”: SNP polymorphic in P. nigra retained; “Intranigra/2NA”: SNP polymorphic in P. nigra with at most 2 missing genotype values retained; “noNA”: SNP polymorphic in P. nigra without missing genotype value retained)
Fig. 3Relationship between genotyping accuracy and the number of SNPs detected. Number of SNPs detected and genotyping accuracy for 7 calling modalities times 3 options for missing values. See Table 1 for the corresponding denominations
Fig. 4Positions of SNPs discovered and genotyped with RNAseq across 12 Populus nigra individuals and along four genes. a. “HCT1” (Potri.003G183900); b. “CCR7” (Potri.003G181400); c. “4CL3” (Potri.001G036900); d. “CAD4” (Potri.009G095800). “Coverage” in red refers to the mean depth at each position among the 12 individuals; “Frag. Marroni” and “SNP Marroni” in blue refer to sequenced fragments and SNPs discovered and typed in [3, 4]; “SNP RNA-seq” in green refers to SNPs discovered and typed in the present study with the modality “3CallersConsensus-noNA”. Point symbol size for each SNP is proportional to its MAF across the 12 individuals
Fig. 5Description of the final set of discovered SNPs. a Minor Allele Frequency distribution; b. Density of SNPs across the genome (100kb windows); c. Relationship between SNP density and gene expression; d. Annotation of SNPs
Fig. 6Hierarchical ascendant clustering of 12 Populus nigra individuals. The analysis was carried out with 250,784 SNPs from the “3CallersConsensus” modality after filtering SNPs with a minor allele frequency below 0.05