Literature DB >> 31708953

Seed Coat Pattern QTL and Development in Cowpea (Vigna unguiculata [L.] Walp.).

Ira A Herniter1, Ryan Lo1, María Muñoz-Amatriaín1, Sassoum Lo1, Yi-Ning Guo1, Bao-Lam Huynh2, Mitchell Lucas1, Zhenyu Jia1, Philip A Roberts2, Stefano Lonardi3, Timothy J Close1.   

Abstract

The appearance of the seed is an important aspect of consumer preference for cowpea (Vigna unguiculata [L.] Walp.). Seed coat pattern in cowpea has been a subject of study for over a century. This study makes use of newly available resources, including mapping populations, a reference genome and additional genome assemblies, and a high-density single nucleotide polymorphism genotyping platform, to map various seed coat pattern traits to three loci, concurrent with the Color Factor (C), Watson (W), and Holstein (H) factors identified previously. Several gene models encoding proteins involved in regulating the later stages of the flavonoid biosynthesis pathway have been identified as candidate genes, including a basic helix-loop-helix gene (Vigun07g110700) for the C locus, a WD-repeat gene (Vigun09g139900) for the W locus and an E3 ubiquitin ligase gene (Vigun10g163900) for the H locus. A model of seed coat development, consisting of six distinct stages, is described to explain some of the observed pattern phenotypes.
Copyright © 2019 Herniter, Lo, Muñoz-Amatriaín, Lo, Guo, Huynh, Lucas, Jia, Roberts, Lonardi and Close.

Entities:  

Keywords:  Vigna unguiculata; cowpea; pattern; pigment; quantitative trait loci; seed coat

Year:  2019        PMID: 31708953      PMCID: PMC6824211          DOI: 10.3389/fpls.2019.01346

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Cowpea (Vigna unguiculata [L.] Walp.) is a diploid (2n = 22) warm season legume which is primarily grown and serves as a major source of protein and calories in sub-Saharan Africa. Further production occurs in the Mediterranean Basin, Southeast Asia, Latin America, and the United States. Just over 7.4 million metric tonnes of dry cowpeas were reported worldwide in 2017 (FAOSTAT, 2019), though these numbers do not include Brazil, Ghana, and some other relatively large producers. Most of the production in sub-Saharan Africa is by smallholder farmers in marginal conditions, often as an intercrop with maize, sorghum, or millet (Ehlers and Hall, 1997). Due to its high adaptability to both heat and drought and its association with nitrogen fixing bacteria, cowpea is a versatile crop (Ehlers and Hall, 1997; Boukar et al., 2018). The most common form of consumption is as dry grain. The seeds are used whole or ground into flour (Singh, 2014; Tijjani et al., 2015). Seed coat pattern is an important consumer-related trait in cowpea. Consumers make decisions about the quality and presumed taste of a product based on appearance (Kostyla et al., 1978; Jaeger et al., 2018). Cowpea displays a variety of patterns, including varied eye shapes and sizes, Holstein, Watson, and Full Coat pigmentation, among others ( ). Each cowpea production region has preferred varieties, valuing certain color and pattern traits above others for determining quality and use. In West Africa consumers pay a premium for seeds exhibiting certain characteristics specific to the locality, such as lack of color for use as flour or solid brown for use as whole beans (Langyintuo et al., 2004; Mishili et al., 2009; Herniter et al., 2019a). In the United States consumers prefer varieties with tight black eyes, commonly referred to as “black-eyed peas” (Fery, 1985).
Figure 1

Seed coat pattern traits. Images of lines from various populations demonstrating the phenotypes which were scored as part of this study.

Seed coat pattern traits. Images of lines from various populations demonstrating the phenotypes which were scored as part of this study. Seed coat traits in cowpea have been studied since the early 20th century, when Spillman (1911) and Harland (1919, 1920), reviewed by Fery (1980), explored the inheritance of factors controlling seed coat color and pattern. In a series of F2 populations Spillman (1911) and Harland (1919, 1920) identified genetic factors responsible for color expression, including “Color Factor” (C), “Watson” (W), “Holstein-1” (H-1), and “Holstein-2” (H-2). A three-locus system controlling seed coat pattern was established by Spillman and Sando (1930) and was confirmed by Saunders (1960) and Drabo et al. (1988), though “O” was used in place of “C.” A genotyping array for 51,128 single nucleotide polymorphisms (SNP) was recently developed for cowpea (Muñoz-Amatriaín et al., 2017) which offers opportunities to improve the precision of genetic mapping. Numerous biparental populations have been used to map major quantitative trait loci (QTL) for various traits, including root-knot nematode resistance (Santos et al., 2018), domestication-related traits (Lo et al., 2018), and black seed coat color (Herniter et al., 2018) and to develop consensus genetic maps of cowpea (Muchero et al., 2009; Lucas et al., 2011; Muñoz-Amatriaín et al., 2017). In addition, new populations have been developed for higher-resolution mapping including an eight-parent Multi-parent Advanced Generation Inter-Cross (MAGIC) population containing 305 lines (Huynh et al., 2018). A reference genome sequence of cowpea (Lonardi et al., 2019; phytozome.net) and genome assemblies of six additional diverse accessions (Muñoz-Amatriaín et al., 2019) have been produced recently. Here, we make use of these resources to map a variety of seed coat pattern traits, determine candidate genes, and develop a model for genetic control of seed coat pattern. Additionally, we posit a developmental pattern for the cowpea seed coat to explain some of the observed variation.

Materials and Methods

Plant Materials

Ten populations were used for mapping: an eight-parent MAGIC population containing 305 lines (Huynh et al., 2018), four biparental recombinant inbred line (RIL) populations, and five F2 populations. Descriptions of each pattern discussed below can be found in Seed Coat Phenotyping Section and examples can be seen in . One biparental population consisted of 87 RILs developed at the University of California, Riverside (UCR), derived from a cross between California Blackeye 27 (CB27), which has a black Eye 2 pattern, and IT82E-18, also known as “Big Buff” (BB), which has a brown Full Coat pattern (Muchero et al., 2009). The second biparental RIL population consisted of 80 RILs developed at UCR derived from a cross between CB27 and IT97K-556-6 (556), which has a brown Full Coat pattern (Huynh et al., 2015). The third biparental RIL population consisted of 101 RILs developed at UCR, derived from a cross between California Blackeye 46 (CB46), which has a black Eye 2 pattern, and IT93K-503-1 (503), which has a brown Eye 1 pattern (Pottorff et al., 2014). The fourth biparental RIL population consisted of 76 RILs developed at UCR and at the International Institute for Tropical Agriculture in Nigeria, derived from a cross between 524B, which has a black Eye 2 pattern, and IT84S-2049 (2049), which has a brown Eye 1 pattern (Menéndez et al., 1997). The F2 populations were developed at UCR as part of this work. Two F2 populations, consisting of 176 and 132 individuals, were developed from independent crosses between CB27 and Bambey 21 (B21), which has the No Color phenotype. One F2 population, consisting of 143 individuals, was developed from a cross between B21 and California Blackeye 50 (CB50), which has a black Eye 2 pattern. Two F2 populations, consisting of 175 and 119 individuals, were developed from independent crosses between Tvu-15426, which has a purple Full Coat pattern, and MAGIC014, a line developed as part of the MAGIC population but not included in the final population, which has a black Watson pattern. To temporally describe seed coat development four accessions were examined: CB27, MAGIC059, Sanzi, and Sasaque. CB27 is described above. MAGIC059 has the Starry Night pattern in black and purple and is one of the lines included in the MAGIC population. Sanzi has a Speckled pattern in black and purple. Sasaque has the Full Coat pattern in red and purple.

SNP Genotyping and Data Curation

DNA was extracted from young leaf tissue using the Qiagen DNeasy Plant Mini Kit (Qiagen, Germany). A total of 51,128 SNPs were assayed in each sample using the Illumina Cowpea iSelect Consortium Array (Illumina Inc., California, USA; Muñoz-Amatriaín et al., 2017). Genotyping was performed at the University of Southern California Molecular Genomics Core facility (Los Angeles, California, USA). The same custom cluster file as in Muñoz-Amatriaín et al. (2017) was used for SNP calling. In the F2 populations the extracted DNA was bulked by phenotype, with DNA from 20 individuals combined in each genotyped sample. For the MAGIC population, SNP data and a genetic map were available from Huynh et al. (2018). The map included 32,130 SNPs in 1,568 genetic bins (Huynh et al., 2018). For the biparental RIL populations, SNP data and genetic maps for the CB27 by BB and the CB46 by 503 populations were available from Muñoz-Amatriaín et al. (2017), and SNP data and a genetic map were available for the 524B by 2,049 population from Santos et al. (2018). The CB27 by 556 genetic map was created using MSTMap (Wu et al., 2008). The CB27 by BB genetic map included 16,566 polymorphic SNPs in 977 genetic bins (Muñoz-Amatriaín et al., 2017); the CB27 by 556 genetic map contained 16,284 SNPs in 2,604 bins; the CB46 by 503 genetic map contained 16,578 SNPs in 683 bins (Muñoz-Amatriaín et al., 2017); the 524B by 2,049 genetic map contained 14,202 SNPs in 933 bins (Santos et al., 2018). For each F2 population, SNPs were filtered to remove non-polymorphic loci between the respective parents. The number of markers used for each population is as follows: the two CB27 by B21 populations, 8,550 SNPs ( ); the B21 by CB50 population, 8,628 SNPs ( ); the two Tvu-15426 by MAGIC014 populations, 20,010 SNPs ( ).

Seed Coat Phenotyping

Phenotype data for seed coat traits were collected by visual examination of the seeds. The scored phenotypic classes consisted of No Color, Eye 1, Eye 2, Holstein, Watson, and Full Coat ( ). No Color indicates no pigmentation present on the seed coat. Eye 1 consists of a loose eye in the shape of a teardrop with spots of color outside the eye on the wider side. Eye 2 consists of a tight eye in the shape of two wings with no pigment observed outside the edge of the eye. Holstein consists of an eye with a defined edge and additional spots of pigmentation spread over the seed coat up to almost completely covering the coat. Watson consists of an eye with an indefinite edge. Full Coat consists of pigment completely covering the seed coat. Two of the lines used for observing seed coat development had other seed coat patterns than those mapped. MAGIC014 had the Starry Night pattern, which consists of incomplete pigmentation covering the entire seed. Sanzi had the Speckled pattern, which consists of small dots of pigment covering the seed coat. Seeds with a paler brown color are often difficult to distinguish between the Eye 1 and Watson patterns. The MAGIC population was scored for Eye 1, Eye 2, Holstein, Watson, and Full Coat patterns ( ). The CB27 by BB ( ) and CB27 by 556 ( ) biparental RIL populations were scored for Eye 2, Holstein, Watson, and Full Coat patterns. The CB46 by 503 ( ) and 524B by 2,049 ( ) biparental RIL populations were scored for Eye 1, Eye 2, Holstein, Watson, and Full Coat patterns. The CB27 by B21 and B21 by CB50 F2 populations were scored for the No Color and Eye 2 patterns. The Tvu-15426 by MAGIC014 F2 populations were scored for the Watson and Full coat patterns. For mapping purposes, each observed pattern was scored individually and mapped independently with scores assigned as “1” indicating presence of the trait and a “0” indicating absence. For example, a line expressing the Eye 1 pattern would be scored as “1” for the Eye 1 trait and “0” for all other traits. Pattern phenotypes are mutually exclusive. As the Eye 1 pattern appears to be epistatic towards the H and W loci, any lines with the Eye 1 phenotype were scored as missing data for other seed coat phenotypes to avoid biasing the mapping. This was the case in all populations other than the MAGIC population, as the mpMap script could not operate with such an extent of missing data. In the MAGIC population, for traits other than Eye 1 (Eye 2, Holstein, Watson, and Full Coat), individuals with the Eye 1 phenotype were scored as “0” instead of as missing data since marking too many lines as missing data caused r/mpMap to fail.

Segregation Ratios

Expected segregation ratios reported in were determined based on the type of population, parental and F1 phenotypes. For example, the F2 populations were expected to segregate in a 3:1 ratio for traits controlled by single genes with complete dominant/recessive relationships, while the biparental RIL populations were expected to segregate in a 1:1 ratio. Expected segregation ratios were tested by chi-square analysis.
Table 2

Phenotypes, segregation ratios, and probability values for the tested populations.

Population (# of lines)Eye 1Eye 2HolsteinWatsonFull CoatNo ColorPred. Seg. RatioX2 Probability
MAGIC (305)12102113141192:5:35:35:2456.410.17
CB27 by Big Buff (87)202816231:1:1:13.530.32
CB27 by 556 (80)143017191:1:1:17.300.06
CB46 by 503 (101)4912178154:1:1:1:13.730.44
524B by 2049 (76)47586104:1:1:1:15.820.21
CB27 by B21 A (176)129473:10.270.60
CB27 by B21 B (132)88443:14.890.027
B21 by CB50 (143)112313:10.840.36
Tvu-15426 by MAGIC014 A (175)441311:30.00190.97
Tvu-15426 by MAGIC014 B (120)27931:30.400.53
For the MAGIC population, based on how the population was constructed (Huynh et al., 2018) it was assumed that each fully homozygous parent had a roughly 1/8 probability to pass its genotype at a particular locus to a given RIL. For example, at the C locus, three parents (IT84S-2049, IT89KD-288, and IT93K-503-1) express the Eye 1 phenotype and are proposed to have a C genotype, while the other five parents are proposed to have a C genotype. Based on this, a given line in the population is expected to have a 3/8 probability of having a C genotype and a 5/8 probability of have a C genotype. At the W and H loci, one parent (CB27) is proposed to have the H and W genotypes, while the other seven parents are proposed to have the W and H genotypes. Based on this, a line should have a 1/8 probability of having the W and a 1/8 probability of having the H genotype. By multiplying the probabilities at each locus, the probability of a given genotype can be determined using the following equation: where PC is the probability of a given allele at the C locus, PW is the probability of a given allele at the W locus, PH is the probability of a given allele at the H locus, and Pnet is the probability of a given genotype. For example, the probability of a C genotype, which would have a Holstein phenotype would be 35/512 ([5/8]*[7/8]*[1/8]). The above method results in a predicted 192:5:35:35:245 phenotypic ratio for the Eye 1 (C), Eye 2 (C), Holstein (C), Watson (C), and Full Coat (C) patterns, respectively.

Trait Mapping

Trait mapping was achieved with different methods for each type of population. In the MAGIC population, the R package “mpMap” (Huang and George, 2011) was used as described by Huynh et al. (2018). The significance cutoff values were determined through 1,000 permutations, resulting in a threshold of p = 8.10E−05 [−log10(p) = 4.09]. Due to the high number of markers in the genotype data, imputed markers spaced at 1 cM intervals were used. In the biparental RIL populations, the R packages “qtl” (Broman et al., 2003) and “snow” (Tierney et al., 2015) were used as in Herniter et al. (2018). Briefly, probability values were assigned to each SNP using a Haley-Knott regression, tested for significance with 1,000 permutations, and marker effects were determined using a hidden Markov model. For the F2 populations, the genotype calls of each bulked DNA pool in the population were filtered to leave only the markers known to be polymorphic between the parents, and these were then sorted based on physical positions in the pseudochromosomes available from Phytozome (Lonardi et al. 2019; phytozome.net). Each population’s genotype was then examined visually in Microsoft Excel for areas where the recessive bulk was homozygous, and the dominant bulk was heterozygous. Duplicated populations were examined in conjunction.

Determining Haplotype Blocks

Once significant regions were established through mapping analysis, the overlapping area shared between the four biparental RIL populations was examined to determine the minimal area where all four biparental populations had overlapping haplotype blocks. SNPs located in the hotspots of pseudochromosomes Vu07, Vu09, and Vu10 were examined visually in Microsoft Excel for regions of identity within phenotypic groups. SNPs located in the hotspots which had been removed during trait mapping due to high levels of missing data were added back as presence/absence variations and segregated similar to nucleotide polymorphisms.

Determining Candidate Genes

Genes were examined within each minimal haplotype block. Gene expression data (Yao et al., 2016), from the cowpea reference genome (IT97K-499-35), which has a black Eye 1 (C) pattern available from the Legume Information System (legumeinfo.org) were examined for expression in developing seed tissue. Genes encoding proteins known to be involved in regulation of the flavonoid biosynthesis pathway were prioritized.

Determining Allelic Series

Dominance relationships were determined by examining the phenotypes of several F1 progeny in addition to segregation ratios in the F2 populations. Crosses were made between CB27 and three lines from the CB27 by BB population (BB-090, BB-113, and BB-074). Seeds from these F1 plants were visually examined for seed coat patterns. CB27/BB-090 seeds had a Watson pattern (C), CB27/BB-113 seeds had a Holstein pattern (C), and CB27/BB-074 seeds had a Full Coat pattern (C). An additional cross was available from the early development of the MAGIC population, where the phenotype of the seed coat on seeds from a maternal C heterozygote was Full Coat. IT84S-2246 (Full Coat, C) was crossed with IT93K-503-1 (Eye 1, C) to yield this C maternal parent.

Comparing Sequence Variation

The genome sequences of the candidate genes from each of five genome sequences (the reference genome sequence and four additional genome assemblies) and about 3 kb of upstream sequence were compared using A plasmid Editor (ApE; jorgensen.biology.utah.edu/wayned/ape/). Transcription factor binding sites were predicted in the upstream regulatory region of each gene using the binding site prediction function available from the Plant Transcription Factor Database (Jin et al., 2017; planttfdb.cbi.pku.edu.cn/). The species input was Vigna radiata (mung bean), as a map of cowpea was unavailable. The cowpea reference sequence is of IT97K-499-35. Among the additional sequenced genomes, CB5-2 has the Eye 2 pattern (C), Suvita-2 has the Full Coat pattern (C), Sanzi has a Speckled pattern, and UCR779 has the Full Coat pattern (C). See Seed Coat Phenotyping Section for pattern descriptions and for examples. A larger set of SNPs (about 1 million), discovered from whole-genome shotgun sequencing of 37 diverse accessions (Muñoz-Amatriaín et al., 2017; Lonardi et al. 2019), was available from Phytozome (phytozome.net). Among the 37 accessions, 28 had phenotype data available. These lines were examined for variations in the SNP selection panel that were in the gene-coding and regulatory regions of the candidate genes.

Correlation Test

The 28 lines from the SNP selection panel with phenotype and genotype data available were tested for correlation in R, using the native “cor.test” function. For input, the phenotype was recorded as “+1” for accessions with the Eye 1 (C) phenotype and “−1” for those without. The genotype was recorded as “+1” for accessions matching the reference genotype, “−1” for the alternate homozygote, and “0” for the heterozygote ( ).

Seed Color Development

The four accessions for which pattern development was recorded (CB27, MAGIC059, Sanzi, and Sasaque) were grown in a greenhouse at the University of California, Riverside (Riverside, California; 33.97° N 117.32° W) at a constant temperature of about 32°C from March through May 2018. Three plants were used for each accession. Upon flowering, each flower was tagged with the date it opened. The flowers were permitted to self-fertilize. For each day after the flower opened, beginning on the second day, on each of the three test plants a pod was collected until no more green pods were observed. Seeds from each collected pod were photographed using a Canon EOS Rebel T6i at a 90º angle under consistent lighting conditions. The length of the most advanced seed within the pod was measured using ImageJ (imagej.nih.gov). A developmental scale from 0 to 5 was designed based on the visual observations of the spread of pigmentation (see Results section). Each photograph was scored using this scale.

Results

Phenotypic Data and Segregation Ratios

Phenotypic data and proposed genotypes for each parent in the observed populations can be found in . A summary of the phenotypic data, along with predicted segregation ratios, chi-square values, and probability can be found in .
Table 1

Parental phenotypes and expected genotypes of the examined populations.

PopulationPopulation typeParentPhenotypeProposed Genotype
MAGIC8-Parent RILCalifornia Blackeye 27Eye 2C2C2W0W0H0H0
IT00K-1263Full CoatC2C2W1W1H1H1
IT82E-18Full CoatC2C2W1W1H1H1
IT84S-2049Eye 1C1C1W1W1H1H1
IT84S-2246Full CoatC2C2W1W1H1H1
IT89KD-288Eye 1C1C1W1W1H1H1
IT93K-503-1Eye 1C1C1W1W1H1H1
SuVita 2Full CoatC2C2W1W1H1H1
CB27 by BBBiparental RILCalifornia Blackeye 27Eye 2C2C2W0W0H0H0
IT82E-18Full CoatC2C2W1W1H1H1
CB27 by 556Biparental RILCalifornia Blackeye 27Eye 2C2C2W0W0H0H0
IT97K-556-6Full CoatC2C2W1W1H1H1
CB46 by 503Biparental RILCalifornia Blackeye 46Eye 2C2C2W0W0H0H0
IT93K-503-1Eye 1C1C1W1W1H1H1
524B by 2049Biparental RIL524BEye 2C2C2W0W0H0H0
IT84S-2049Eye 1C1C1W1W1H1H1
CB27 by B21F2California Blackeye 27Eye 2C2C2W0W0H0H0
Bambey 21No ColorC0C0W0W0H0H0
B21 by CB50F2Bambey 21No ColorC0C0W0W0H0H0
California Blackeye 50Eye 2C2C2W0W0H0H0
Tvu-15426 by MAGIC014F2Tvu-15426Full CoatC2C2W1W1H1H1
MAGIC014WatsonC2C2W1W1H0H0
Parental phenotypes and expected genotypes of the examined populations. Phenotypes, segregation ratios, and probability values for the tested populations.

Identification of Loci Controlling Seed Coat Pattern

A total of 35 SNP loci were identified using different methods for each population type (see Materials and Methods section for details) and were concentrated on three chromosomes: Vu07 (C locus), Vu09 (H locus), and Vu10 (W locus). Mapping results can be found in . The overlapping mapping results allowed a narrowing of the area examined for candidate genes.

Determination of Minimal Haplotype Blocks

Following trait mapping, all called SNPs on chromosomes Vu07, Vu09, and Vu10 were examined for minimal haplotype blocks in the overlapping significant regions in the four biparental RIL populations. On Vu07 (C locus) the minimal haplotype block was between 2_12939 and 2_09638 (228,331 bp) and contained ten genes. On Vu09 the minimal haplotype block was between 2_33224 and 2_12692 (166,724 bp) and contained seventeen genes. On Vu10 the minimal haplotype block was between 2_12467 and 2_15325 (120,513 bp) and contained eleven genes. The list of candidate genes can be found in and on Phytozome (Lonardi et al., 2019; phytozome.net). The minimal haplotype block .0regions can be found in .

I890-dentification of Candidate Genes

A predominant candidate gene was identified at each locus based on high relative expression in the developing seeds ( ) and a review of the literature on the regulation of the flavonoid biosynthesis pathway (see Discussion section for details). This led to the determination of a single major candidate gene on each of Vu07, Vu09, and Vu10. Each of the candidate genes belongs to a class which is known to be involved in transcriptional control of the later stages of flavonoid biosynthesis. No Color, Eye 1, and Full Coat mapped to an overlapping area on Vu07, where the gene Vigun07g110700, encoding a basic helix–loop–helix protein, was noted as a strong candidate gene. Eye 2, Holstein, Watson, and Full Coat mapped to a similar area on Vu09, where the gene Vigun09g139900, encoding a WD-repeat gene, was noted as a strong candidate gene. Eye 1, Eye 2, Holstein, Watson, and Full Coat mapped to an overlapping area on Vu10, where the gene Vigun10g163900, encoding an E3 ubiquitin ligase protein with a zinc finger, was noted as a strong candidate gene.

Determination of Allelic Series

Segregation ratios indicated the dominance of H over H (Holstein locus, , Gii ), W over W (Watson locus, ), C over C (Color Factor locus, ), and C over C (Color Factor locus, ). The dominance relationship between the C and C0 alleles could not be determined from these data.
Figure 2

Interaction of seed coat pattern loci. (A) Table displaying the pattern loci identified in mapping, their locations, the trait encoded, alleles identified, and phenotypes. (B) Table displaying the allelic series and relative dominance of alleles. (C) Segregation patterns for the CB27 by BB and CB27 by 556 F8 populations. (D) Segregation patterns for the CB46 by 503 and 524B by 2049 F8 populations. (E) Segregation pattern for the Tvu-15426 by MAGIC014 F2 populations. (F) Segregation pattern for the CB27 by B21 and B21 by CB50 F2 populations. (G) Phenotype of seeds from the F1 plants resulting from a series of crosses (i) Cross between CB27 and line from the CB27 by BB population with a Watson pattern, resulting in Watson pattern. (ii) Cross between CB27 and a line from the CB27 by BB population a Holstein pattern, resulting in Holstein pattern. (iii) Cross between CB27 and a line from the CB27 by BB population with a Full Coat pattern, resulting in a Full Coat pattern. (iv) Cross between IT84S-2246 and IT93K-503-1 from the early development of the MAGIC population, resulting in a Full Coat pattern in the seed coats on seeds of the F1 maternal parent.

Interaction of seed coat pattern loci. (A) Table displaying the pattern loci identified in mapping, their locations, the trait encoded, alleles identified, and phenotypes. (B) Table displaying the allelic series and relative dominance of alleles. (C) Segregation patterns for the CB27 by BB and CB27 by 556 F8 populations. (D) Segregation patterns for the CB46 by 503 and 524B by 2049 F8 populations. (E) Segregation pattern for the Tvu-15426 by MAGIC014 F2 populations. (F) Segregation pattern for the CB27 by B21 and B21 by CB50 F2 populations. (G) Phenotype of seeds from the F1 plants resulting from a series of crosses (i) Cross between CB27 and line from the CB27 by BB population with a Watson pattern, resulting in Watson pattern. (ii) Cross between CB27 and a line from the CB27 by BB population a Holstein pattern, resulting in Holstein pattern. (iii) Cross between CB27 and a line from the CB27 by BB population with a Full Coat pattern, resulting in a Full Coat pattern. (iv) Cross between IT84S-2246 and IT93K-503-1 from the early development of the MAGIC population, resulting in a Full Coat pattern in the seed coats on seeds of the F1 maternal parent.

Sequence Comparisons of Candidate Genes

Multiple sequence alignments for each of the three candidate genes and regulatory regions (∼3 kb upstream of the transcription start site) revealed SNPs and small insertions or deletions ( , , and ). None of the variants in the transcript sequence were predicted to cause changes in the amino acid sequence. The regulatory region of Vigun07g110700 (C locus candidate gene) showed a C/T SNP variation between the reference genome and the four other genome sequences on Vu07 at 20,544,306 bp. The reference genome has a T at this position while the other four sequences have a C. Transcription factor binding site prediction from the Plant Transcription Factor Database (planttfdb.cbi.pku.edu.cn/) indicated that this variation constitutes either a WRKY binding site in the C allele or an ERF binding site in the T allele. Of the 28 accessions in the SNP selection panel, eleven expressed the Eye 1 (C) pattern and 17 did not. Twenty accessions had a CC genotype, six had a TT genotype, and two had a TC genotype. The correlation test gave an estimated correlation value of 0.75, with a p-value of 3.51E-06, indicating significant correlation between the genotype and phenotype values such that this SNP is a reliable marker for distinguishing between the C (Eye 1) and the C (Eye 2) alleles. Two of the 28 lines had the No Color (C) phenotype, but had the CC genotype, indicating that this SNP is not a good marker for the C allele (for a possible explanation see Discussion section). The regulatory region of Vigun09g139900 (W locus candidate gene) showed a C/T variation between the reference genome and CB5-2 against the other three genome sequences on Vu09 at 30,207,722 bp. This SNP was not included in the list from the SNP selection panel and so could not be examined like the previous SNP. Transcription factor binding site prediction did not indicate that the site was a target for any transcription factor in either form. The upstream regulatory region of Vigun10g163900 (H locus candidate gene) did not have any distinguishing variation.

Stages of Color Development

A model of seed coat development has been formulated consisting of six stages based on the spread of pigmentation. In Stage 0, there is no color on the seed coat. In Stage 1, color appears at the base of the hilum. In Stage 2, color appears around the hilum. In Stage 3, color begins to spread along the outside edges of the seed. In Stage 4, color begins to fill in on the edges of the testa. In Stage 5, the color has completely developed to the mature level. After Stage 5 the pod and seeds begin to desiccate. Of the observed varieties, only Sasaque and Sanzi completed all six stages. MAGIC059 reached Stage 4, while CB27 only reached Stage 2. No seeds in Stage 0 were observed for Sasaque. Images of each tested variety at various stages can be seen in . Color development was associated with seed size; the pigmentation spread as the seeds grew larger.
Figure 3

Seed coat color development. Images showing the development the seed and the spread of pigmentation.

Seed coat color development. Images showing the development the seed and the spread of pigmentation.

Discussion

Segregation Ratios and Epistatic Interaction of Seed Coat Pattern Loci

Segregation ratios and dominance data ( and ) in the tested populations were consistent with a three gene system with simple dominance and epistatic interactions that matches the C (Color Factor), W (Watson), and one of the H (Holstein) factors identified by Spillman (1911) and Harland (1919, 1920). In brief, the C locus encodes a “constriction” factor while the W and H loci encode distinct “expansion” factors. The C locus is the primary locus controlling seed coat pattern. Pigmentation may be not visible (No Color, C), constrained to an eye (Eye 1, C), or distributed throughout the seed coat (Eye 2, Holstein, Watson, or Full Coat, C). The extent of distribution is modified by the H and W loci, whose contribution is visible only with an unconstrained allele (C) at the C locus. In the presence of Holstein (H) and absence of Watson (W), a Holstein pattern is expressed. Conversely, in the presence of Watson (W) and absence of Holstein (H), a Watson pattern is expressed. In combination, the Watson (W) and Holstein (H) factors result in the Full Coat phenotype. Based on the above proposed allelic series, an individual with the C genotype will express the No Color pattern, regardless of the genotypes at the W and H loci, and an individual with the C genotype will express the Eye 1 pattern, regardless of the genotypes at the W and H loci. However, when not constricted by a C or C allele (having the C allele) the “expansion” factors can be observed. An individual with the C genotype expresses the Holstein pattern, while and individual with the C genotype expresses the Watson pattern. An individual with the C genotype, with both “expansion” factors, expresses the Full Coat pattern. An individual with theC genotype expresses the Eye 2 pattern. In this latter case the eye pattern is observed despite the unconstricted C allele due to the absence of the “expansion” factors. Based on this model, the CB27 by BB and CB27 by 556 populations segregate at the W and H loci ( ), while the MAGIC, CB46 by 503, and 524B by 2,049 populations segregate at all three loci ( ). Similarly, the Tvu-15426 by MAGIC014 populations segregate at the W locus ( ) and the CB27 by B21 and B21 by CB50 populations segregate at the C locus ( ). An additional pattern phenotype of Blue-grey Ring was noted in some of the tested populations. Blue-grey Ring consists of a pale ring of bluish-grey surrounding the eye ( ). It appears only with the Eye 1 (C) phenotype but is not always present when the phenotype is Eye 1 (C). The Blue-grey Ring phenotype may represent another (fourth) allele at the C locus, or it may result from a combination of the C (Eye 1) allele and other pigmentation genes. However, from other unpublished work on seed coat color there does not appear to be a strict correlation between seed coat color and presence of the Blue-grey Ring. Further research is required to clarify the basis of the Blue-grey Ring phenotype.

Pattern Traits Qtl Overlap

Several regions of the genome are hotspots for seed coat pattern traits ( ). These correspond to locations of genetic factors identified by Spillman (1911) and Harland (1919, 1920), who identified four factors controlling seed coat patterning: Color Factor (C), Watson (W), Holstein-1 (H-1), and Holstein-2 (H-2). The present data suggest the presence of only one Holstein locus or that the two loci are very closely linked in the tested populations. To avoid possible confusion, the Holstein locus discussed here is simply termed “H.” The major QTL and regions of interest for No Color and Eye 1 are clustered in an overlapping region on Vu07, suggesting that the “constriction” factor at locus C is at that position with allelism at the locus. Mapping results from the Tvu-15426 by MAGIC014 F2 populations indicate that the H locus is on Vu10. Additional evidence for the H locus being located on Vu10 comes from Wu et al. (2019), who identified the Anasazi locus (equivalent to the cowpea H locus) on chromosome 10 of common bean, which is homologous to Vu10 (Lonardi et al., 2019). While none of the biparental F2 populations segregated solely for the W locus, the identification of the C locus on Vu07 and the H locus on Vu10 must, by process of elimination, identify the location of the W “expansion” locus on Vu09.

Seed Coat Pattern Is Due to Failure to Complete the Normal Color Developmental Program

It was noted that the varieties with the Full Coat pattern at maturity followed the developmental pattern described in Stages of Color Development Section and shown in to completion. In contrast, varieties which do not display the Full Coat pattern appear to have color development arrested at certain points. This is most obvious in CB27 (Eye 2, C), where color development proceeds only to Stage 2. It is likely that other varieties which have distinct eye sizes proceed to varied stages of development. For example, varieties with the No Color (C) phenotype would not proceed past Stage 0. However, the three gene model presented here does not explain every seed coat pattern. An example is the pattern observed in mature Sanzi seed, which exhibits a Speckled black and purple seed coat (see Seed Coat Phenotyping Section for a description and ). According to this analysis, Sanzi completes all six stages of seed coat development, indicating that the Speckled pattern is controlled separately. A biparental RIL population, consisting of lines derived from a cross between Sanzi and Vita 7, which has a brown Full Coat pattern (C), was used for mapping the black seed coat color; there was a perfect correlation between black seed coat color and the Speckled pattern (Herniter et al., 2018). This indicates that genetic control of the Speckled pattern is colocalized with black seed coat color and may be an allele at the Bl locus, which is located on Vu05. Further research is needed to determine if all cowpea accessions follow the pattern observed in the four tested lines shown in . It may be that each of the observed stages of seed coat pigmentation development is controlled by a different gene, and that failures of normal gene function cause the observed variation in patterning. Evidence for this model is furnished by the noted developmental pattern of the seed coats where development appears to be arrested at Stage 2 in CB27, which expresses the Eye 2 (C) pattern, and at Stage 4 in MAGIC059, which expresses the Starry Night pattern (see Seed Coat Phenotyping Section for a description and ). The mechanism by which this occurs is not elucidated here and requires further research. Transcriptome data could be gathered for the seed coat at each developmental stage. The currently available transcriptome data (Yao et al., 2016; legumeinfo.org) used whole seeds at specific days post flowering and do not distinguish between transcripts in the seed coat and those in the embryo or cotyledons, and further do not separate transcripts by developmental stage.

Candidate Gene Function

The later steps in flavonoid biosynthesis are controlled by a transcription factor complex composed of an R2-R3 MYB protein, a basic helix-loop-helix protein (bHLH), and a WD-repeat protein (WD40; Xu et al., 2015). E3 Ubiquitin ligases (E3UL) are believed to negatively regulate this complex (Shin et al., 2015). The color and location (leaf, pod, seed coat) of the pigmentation are determined by expression patterns (Wu et al., 2003; Iorizzo et al., 2018). Candidate genes on Vu07 (C locus) and Vu09 (W locus) encode a bHLH and WD40 protein, respectively. A candidate gene on Vu10 (H locus) encodes an E3UL protein. This information lends itself to a model in which Vigun07g110700 (bHLH) serves as a “master switch” controlling the extent of pigmentation constriction while Vigun09g139900 (WD40) and Vigun10g163900 (E3UL) act as “modulating switches” controlling the type of expanded pattern, altering the effect of the pathway to result in the observed Holstein and Watson patterns ( ). The R2-R3 MYB directs the DNA binding of the complex, with expression of different genes in different tissues resulting in the observed color and location of the pigments. For example, MYB genes identified by Herniter et al. (2018) are required for black seed coat and purple pod tip color. Further, Vigun07g110700 (bHLH) was identified as a candidate gene controlling flower color in cowpea by Lo et al. (2018), indicating a possible dual function of the gene. Indeed, Harland (1919, 1920) noted that a lack of pigment in the flower was often associated with a lack of pigment in the seed coat. Finally, homologs of Vigun07g110700 have been identified in other legumes as Mendel’s A gene controlling flower color in Pisum sativum (Hellens et al., 2010) and as the P gene in Phaseolus vulgaris (McClean et al., 2018).
Figure 4

Proposed roles of the C, W, and H genes. Transcription of flavonoid biosynthesis pathway genes are controlled by a complex composed of three types of proteins (Xu et al., 2015), a basic helix-loop-helix protein (bHLH; e.g., Vigun07g110700, C locus), a WD-repeat protein (WD40; e.g., Vigun09g139900, W locus), and an R2R3 MYB transcription factor. This complex is in turn negatively regulated by an E3 Ubiquitin ligase (E3UL; e.g., Vigun10g163900, H locus). Sequence comparisons suggest that bHLH transcription may be controlled by ERF and WRKY proteins. The observed seed coat pattern phenotypes are a result of different alleles and expression patterns.

Proposed roles of the C, W, and H genes. Transcription of flavonoid biosynthesis pathway genes are controlled by a complex composed of three types of proteins (Xu et al., 2015), a basic helix-loop-helix protein (bHLH; e.g., Vigun07g110700, C locus), a WD-repeat protein (WD40; e.g., Vigun09g139900, W locus), and an R2R3 MYB transcription factor. This complex is in turn negatively regulated by an E3 Ubiquitin ligase (E3UL; e.g., Vigun10g163900, H locus). Sequence comparisons suggest that bHLH transcription may be controlled by ERF and WRKY proteins. The observed seed coat pattern phenotypes are a result of different alleles and expression patterns. Two R2R3 MYB genes (Vigun10g165300 and Vigun10g165400) are located only 110 kb downstream of Vigun10g163900 (H locus candidate gene). However, these fall outside of the haplotype blocks identified in the CB27 by BB and CB27 by 556 populations, indicating that they are not the source of the observed phenotypic variation. However, there may be interaction between one or both of these MYBs and the E3UL responsible for the Holstein pattern; this hypothesis could be investigated through additional research. The observed C/T SNP variation in the regulatory sequence of Vigun07g110700 (bHLH) at 20,544,306 bp constitutes a difference between a WRKY binding site in the C (Eye 2) allele versus an ERF binding site in the C (Eye 1) allele. WRKY proteins are positive regulators of seed coat pigment biosynthesis in Arabidopsis (Lloyd et al., 2017) while ERF proteins negatively regulate the same pathway (Matsui et al., 2008). This SNP could be used as a genetic marker to distinguish between the C and C alleles. The lack of correlation between an observed marker and the C (No Color) allele may be caused by other variants, such as a small deletion interrupting gene function, which has been shown in P. vulgaris (McClean et al., 2018). Such a variation would not be detected by the genotyping platform used for this study. Similarly, the observed C/T SNP variation in the regulatory region of Vigun09g139900 at 30,207,722 bp could be used as a marker to distinguish between the W (not Watson) and W (Watson) alleles, despite not necessarily being the cause of the observed phenotypic variation. No single variation was identified for Vigun10g163900 alleles. However, haplotype blocks determined from the biparental RIL populations can be used for future breeding efforts. Two SNPs which fall within the genome sequence of Vigun10g163900 segregate with the phenotype in the biparental RIL populations. At 2_24359, the lines with the H (not Holstein) allele have an A genotype and the lines with the H (Holstein) allele have a G genotype. At 2_24360, the lines with the H (not Holstein) allele have an A and the lines with the H (Holstein) allele have a C. Future research is needed to develop more perfect markers for the three loci.

Data Availability Statement

All datasets [SNPs] for this study are included in the manuscript/ .

Author Contributions

IH performed all trait mapping, statistical analysis, and interpretation. RL performed analysis of the seed coat development. MM-A assisted in trait mapping and provided SNP data. SaL assisted in trait mapping. Y-NG extracted DNA for genotyping. B-LH provided the MAGIC population and its genotypic information. ML performed crosses used for allelic series analysis. ZJ assisted in statistical analysis. PR and TC provided guidance and access to population and genetic resources. StL assisted with the SNP selection panel data. TC assisted IH with the writing.

Funding

This study was supported by the Feed the Future Innovation Lab for Climate Resilient Cowpea (USAID Cooperative Agreement AID-OAA-A-13-00070), the National Science Foundation BREAD project “Advancing the Cowpea Genome for Food Security” (NSF IOS-1543963), and Hatch Project CA-R-BPS-5306-H.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  23 in total

1.  R/qtl: QTL mapping in experimental crosses.

Authors:  Karl W Broman; Hao Wu; Saunak Sen; Gary A Churchill
Journal:  Bioinformatics       Date:  2003-05-01       Impact factor: 6.937

2.  R/mpMap: a computational platform for the genetic analysis of multiparent recombinant inbred lines.

Authors:  B Emma Huang; Andrew W George
Journal:  Bioinformatics       Date:  2011-01-08       Impact factor: 6.937

3.  White seed color in common bean (Phaseolus vulgaris) results from convergent evolution in the P (pigment) gene.

Authors:  Phillip E McClean; Kirstin E Bett; Robert Stonehouse; Rian Lee; Stephanie Pflieger; Samira Mafi Moghaddam; Valerie Geffroy; Phil Miklas; Sujan Mamidi
Journal:  New Phytol       Date:  2018-06-13       Impact factor: 10.151

4.  A multi-parent advanced generation inter-cross (MAGIC) population for genetic analysis and improvement of cowpea (Vigna unguiculata L. Walp.).

Authors:  Bao-Lam Huynh; Jeffrey D Ehlers; Bevan Emma Huang; María Muñoz-Amatriaín; Stefano Lonardi; Jansen R P Santos; Arsenio Ndeve; Benoit J Batieno; Ousmane Boukar; Ndiaga Cisse; Issa Drabo; Christian Fatokun; Francis Kusi; Richard Y Agyare; Yi-Ning Guo; Ira Herniter; Sassoum Lo; Steve I Wanamaker; Shizhong Xu; Timothy J Close; Philip A Roberts
Journal:  Plant J       Date:  2018-02-24       Impact factor: 6.417

Review 5.  Advances in the MYB-bHLH-WD Repeat (MBW) Pigment Regulatory Model: Addition of a WRKY Factor and Co-option of an Anthocyanin MYB for Betalain Regulation.

Authors:  Alan Lloyd; Austen Brockman; Lyndsey Aguirre; Annabelle Campbell; Alex Bean; Araceli Cantero; Antonio Gonzalez
Journal:  Plant Cell Physiol       Date:  2017-09-01       Impact factor: 4.927

6.  Identification of Mendel's white flower character.

Authors:  Roger P Hellens; Carol Moreau; Kui Lin-Wang; Kathy E Schwinn; Susan J Thomson; Mark W E J Fiers; Tonya J Frew; Sarah R Murray; Julie M I Hofer; Jeanne M E Jacobs; Kevin M Davies; Andrew C Allan; Abdelhafid Bendahmane; Clarice J Coyne; Gail M Timmerman-Vaughan; T H Noel Ellis
Journal:  PLoS One       Date:  2010-10-11       Impact factor: 3.240

7.  Identification of QTL controlling domestication-related traits in cowpea (Vigna unguiculata L. Walp).

Authors:  Sassoum Lo; María Muñoz-Amatriaín; Ousmane Boukar; Ira Herniter; Ndiaga Cisse; Yi-Ning Guo; Philip A Roberts; Shizhong Xu; Christian Fatokun; Timothy J Close
Journal:  Sci Rep       Date:  2018-04-19       Impact factor: 4.379

8.  Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph.

Authors:  Yonghui Wu; Prasanna R Bhat; Timothy J Close; Stefano Lonardi
Journal:  PLoS Genet       Date:  2008-10-10       Impact factor: 5.917

9.  Identification of candidate genes and molecular markers for heat-induced brown discoloration of seed coats in cowpea [Vigna unguiculata (L.) Walp].

Authors:  Marti Pottorff; Philip A Roberts; Timothy J Close; Stefano Lonardi; Steve Wanamaker; Jeffrey D Ehlers
Journal:  BMC Genomics       Date:  2014-05-01       Impact factor: 3.969

10.  Identification of Candidate Genes Controlling Black Seed Coat and Pod Tip Color in Cowpea (Vigna unguiculata [L.] Walp).

Authors:  Ira A Herniter; María Muñoz-Amatriaín; Sassoum Lo; Yi-Ning Guo; Timothy J Close
Journal:  G3 (Bethesda)       Date:  2018-10-03       Impact factor: 3.154

View more
  5 in total

Review 1.  Breeding of Vegetable Cowpea for Nutrition and Climate Resilience in Sub-Saharan Africa: Progress, Opportunities, and Challenges.

Authors:  Tesfaye Walle Mekonnen; Abe Shegro Gerrano; Ntombokulunga Wedy Mbuma; Maryke Tine Labuschagne
Journal:  Plants (Basel)       Date:  2022-06-15

Review 2.  Constraints and Prospects of Improving Cowpea Productivity to Ensure Food, Nutritional Security and Environmental Sustainability.

Authors:  Olawale Israel Omomowo; Olubukola Oluranti Babalola
Journal:  Front Plant Sci       Date:  2021-10-22       Impact factor: 6.627

Review 3.  Revisiting the Domestication Process of African Vigna Species (Fabaceae): Background, Perspectives and Challenges.

Authors:  Davide Panzeri; Werther Guidi Nissim; Massimo Labra; Fabrizio Grassi
Journal:  Plants (Basel)       Date:  2022-02-16

Review 4.  Unraveling Origin, History, Genetics, and Strategies for Accelerated Domestication and Diversification of Food Legumes.

Authors:  Muraleedhar S Aski; Aladdin Hamwieh; Akshay Talukdar; Santosh Kumar Gupta; Brij Bihari Sharma; Rekha Joshi; H D Upadhyaya; Kuldeep Singh; Rajendra Kumar
Journal:  Front Genet       Date:  2022-07-22       Impact factor: 4.772

5.  Gamma Rays and Sodium Azide Induced Genetic Variability in High-Yielding and Biofortified Mutant Lines in Cowpea [Vigna unguiculata (L.) Walp.].

Authors:  Aamir Raina; Rafiul Amin Laskar; Mohammad Rafiq Wani; Basit Latief Jan; Sajad Ali; Samiullah Khan
Journal:  Front Plant Sci       Date:  2022-06-14       Impact factor: 6.627

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.