| Literature DB >> 35139814 |
Daniel C Frailey1, Qian Zhang2,3, David J Wood2, Thomas M Davis2.
Abstract
BACKGROUND: Like most legumes, chickpeas form specialized organs called root nodules. These nodules allow for a symbiotic relationship with rhizobium bacteria. The rhizobia provide fixed atmospheric nitrogen to the plant in a usable form. It is of both basic and practical interest to understand the host plant genetics of legume root nodulation. Chickpea lines PM233 and PM405, which harbor the mutationally identified nodulation genes rn1 and rn4, respectively, both display nodulation-deficient phenotypes. Previous investigators identified the rn1 mutation with the chickpea homolog of Medicago truncatula nodulation gene NSP2, but were unable to define the mutant rn1 allele. We used Illumina and Nanopore sequencing reads to attempt to identify and characterize candidate mutation sites responsible for the PM233 and PM405 phenotypes.Entities:
Keywords: Chickpea; Cicer; Mutant; Nodulation; Variant identification
Mesh:
Substances:
Year: 2022 PMID: 35139814 PMCID: PMC8827291 DOI: 10.1186/s12870-022-03446-7
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Outcomes of variant and candidate gene identification pipeline in chickpea mutants
| Sample | Filter Criterion | ||||||
|---|---|---|---|---|---|---|---|
| Total | CDS | Unique | 2 Alleles | Read Depth | Min Allele Frequency | Manual IGV Check | |
| PM233 | 829,368 | 12,791 | 412 | 411 | 189 | 168 | 15 |
| PM405 | 1,504,184 | 10,823 | 133 | 130 | 70 | 48 | 10 |
| PM638 | 1,413,448 | 11,083 | 133 | 128 | 71 | 48 | 11 |
| PM665 | 2,205,449 | 10,427 | 138 | 135 | 49 | 44 | 13 |
| PM796 | 1,586,765 | 10,138 | 90 | 87 | 35 | 23 | 6 |
Legend: The listed variant counts are those remaining after implementation of each respective filtration step. The Total column includes all variants found before filtering. The CDS (coding sequence) column includes remaining variants after filtering only for variants found within coding sequence. The Unique column are all variants found only within a single chickpea line. The 2 Alleles column are variants with 2 alelles. The Read Depth column are variants with a read depth of at least 2. The Min Allele Frequency are variants where the alternate allele (the one not matching to the reference sequence) had a frequency of at least 0.9. The Manual IGV (Integrative Genomics Viewer) Check column included variants confirmed after manual confirmation using IGV (i.e. variants which were present in other chickpea lines but not called as variants were excluded)
Fig. 1Alignment of ICC 640 (top) and PM233 (bottom) Illumina and Nanopore reads to pseudochromosome Ca5 of the ICC 4958 v3.0 desi reference genome assembly [36], and expected amplicons (brown bars) A1 through A4, as defined in (Additional file 2: Table S1). The location of the CaNSP2 gene is depicted as a red arrow in the reference track. Its 5′-3′ orientation is left to right. The deletional gap in PM233 is shaded in yellow. The PM233 Nanopore reads that aligned to the left of the deletional gap are not continuous with those that aligned to the right of the gap
Fig. 2Alignment of genomic region contigs Nanopore ICC 640 contig c636 (middle) to the desi and kabuli (top) and the M. truncatula (bottom) reference genomes. The top diagram represents the Ca5 region of interest in both the desi ICC 4958 v3.0 and kabuli CDC Frontier reference genomes, which we found to be collinear with each other in this region. The coordinates along the top bar show the positions in the desi and the kabuli (in parentheses) assemblies, respectively. In the middle diagram, the red bar indicates the CaNSP2 gene, while the yellow region is the deletional gap within PM233. The orange region is immediately upstream of, and adjacent to, the yellow deleted region and mapped adjacent to the yellow region in both chickpea (top diagram) and the M. truncatula version Mt4.0 genome assembly (bottom diagram) assemblies. The sequence block containing the orange and yellow regions and including the deletional region in PM233 resided in between the blue and green regions in contig c636 and in the M. truncatula (bottom diagram) assembly, but were positioned elsewhere in the desi and kabuli Ca5 assemblies, as shown (top diagram)
Coding sequence variants identified in chickpea mutant PM405
| Chromosome | GenBank Sequence | Position | Ref | Alt | GeneID | Top tBLASTn Hit | Stop | Frameshift | Syn/NonSyn |
|---|---|---|---|---|---|---|---|---|---|
| Ca2 | CM003629.3 | 37,700,934 | G | T | Ca_06416 | Heat Shock Protein 90–2 | No | No | Syn |
| Ca3 | CM003630.3 | 42,275,344 | C | T | Ca_09456 | Component of oligomeric protein | No | No | NonSyn |
| Ca6 | CM003633.3 | 19,663,461 | A | T | Ca_18764 | Glutathione S-transferase-like protein | No | No | Syn |
| Ca6 | CM003633.3 | 20,028,458 | T | G | Ca_05252 | Predicted: serine/threonine-protein kinase GRIK2-like | No | No | NonSyn |
| Ca6 | CM003633.3 | 57,850,387 | TAA | – | Ca_20904 | Predicted: phosphatidylinositol/phosphatidylcholine transfer protein SFH3 | No | No | NonSyn |
| Ca6 | CM003633.3 | 58,492,000 | A | C | Ca_20961 | Predicted: 4-coumarate--CoA ligase 1-like | No | No | NonSyn |
| Ca7 | CM003634.3 | 36,911,781 | A | G | Ca_23613 | Predicted: ATP-dependent helicase BRM isoform X1 | No | No | Syn |
| Ca7 | CM003634.3 | 6,585,411 | G | A | Ca_21616 | Uncharacterized | No | No | Syn |
| Ca7 | CM003634.3 | 7,929,900 | G | A | Ca_21755 | Uncharacterized | No | No | Syn |
Legend. Top tBLASTn hits were taken from the NCBI nucleotide collection database. The position coordinates are those of the ICC 4958 v3.0 genome reference assembly. Ca_06500, the primary candidate gene, is indicated in bold. The Ref (reference) allele is the allele in the ICC 4958 v3.0 genome reference assembly. The Alt (alternate) allele is the allele seen in the PM405 Illumina reads
Fig. 3Candidate gene identification pipeline