Cheng-Chieh Wu1,2, Fu-Jin Wei1, Wan-Yi Chiou1, Yuan-Ching Tsai3, Hshin-Ping Wu1, Dhananjay Gotarkar1, Zhi-Han Wei3, Ming-Hsin Lai4, Yue-Ie Caroline Hsing1,5. 1. Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan. 2. Institute of Plant Science, National Taiwan University, Taipei, Taiwan. 3. Department of Agronomy, National Chia-yi University, Chiayi, Taiwan. 4. Crop Science Division, Taiwan Agriculture Research Institute, Taichung, Taiwan. 5. Department of Agronomy, National Taiwan University, Taipei, Taiwan.
Abstract
Rice domestication/adaptation is a good model for studies of the development and spread of this important crop. Mutations that caused morphological and physiological change, followed by human selection/expansion, finally led to the improvement of phenotypes suitable for different kinds of environments. We used the sequence information for Heading date 1 (Hd1) gene to reveal the association between sequence changes and flowering phenotypes of rice in different regions. Seven loss-of-function hd1 haplotypes had been reported. By data-mining the genome sequencing information in the public domain, we discovered 3 other types. These loss-of-function allele haplotypes are present in subtropical and tropical regions, which indicates human selection. Some of these haplotypes are present locally. However, types 7 and 13 are present in more than one-third of the world's rice accessions, including landraces and modern varieties. In the present study, phylogenetic, allele network and selection pressure analyses revealed that these two haplotypes might have occurred early in Southeastern Asia and then were introgressed in many local landraces in nearby regions. We also demonstrate that these haplotypes are present in weedy rice populations, which again indicates that these alleles were present in rice cultivation for long time. In comparing the wild rice sequence information, these loss-of-function haplotypes occurred in agro but were not from wild rice.
Rice domestication/adaptation is a good model for studies of the development and spread of this important crop. Mutations that caused morphological and physiological change, followed by human selection/expansion, finally led to the improvement of phenotypes suitable for different kinds of environments. We used the sequence information for Heading date 1 (Hd1) gene to reveal the association between sequence changes and flowering phenotypes of rice in different regions. Seven loss-of-function hd1 haplotypes had been reported. By data-mining the genome sequencing information in the public domain, we discovered 3 other types. These loss-of-function allele haplotypes are present in subtropical and tropical regions, which indicates human selection. Some of these haplotypes are present locally. However, types 7 and 13 are present in more than one-third of the world's rice accessions, including landraces and modern varieties. In the present study, phylogenetic, allele network and selection pressure analyses revealed that these two haplotypes might have occurred early in Southeastern Asia and then were introgressed in many local landraces in nearby regions. We also demonstrate that these haplotypes are present in weedy rice populations, which again indicates that these alleles were present in rice cultivation for long time. In comparing the wild rice sequence information, these loss-of-function haplotypes occurred in agro but were not from wild rice.
Rice (Oryza sativa) is one of the most important crops in the world. It is a short-day (SD) plant and was domesticated originally in a temperate zone, then brought to subtropical and tropical zones with warmer temperature and different photoperiod. One of the main reasons for the spread of rice cultivation to a wide range in Asia as well as the increase in production is the diversification of flowering time [1].Flowering time, also known as heading date (Hd) for rice, is an important trait in the cultivation period [2]. Flowering time is affected by photoperiod (day length) and also temperature [3]. As a facultative SD plant, rice flowering is promoted under SD conditions and delayed under long-day (LD) conditions, with a critical day length of about 13 hours [4, 5]. Several recent reviews provided detailed information on the regulation of rice flowering (e.g., Itoh and Izawa, 2013 [6]; Tsuji et al., 2013 [7]; Shrestha et al, 2014 [8]).A few key genes have been suggested to affect flowering time, including Heading date 1 (Hd1), which encodes a B-box zinc finger protein and is the ortholog of ArabidopsisCONSTANS (CO) [9]; Early heading date 1 (Ehd1), which encodes a B-type response regulator with no ortholog in Arabidopsis [10]; Hd3a, which encodes a phosphatidylethanolamine-binding protein and is an ortholog of ArabidopsisFLOWERING LOCUS T (FT) [11]; and Grain number, plant height, and heading date 7 (Ghd7), an important regulator of Hd and yield potential [12].In the temperate area, including Japan, Korea, northern China and Europe, rice crops grow once a year (i.e., they are germinated/transplanted in early summer and harvested in late fall). However, in subtropical and tropical areas, where the temperature is suitable for plant growth from February to November or even the whole year round, changes in sensitivity to day length played important roles during rice domestication/adaptation. If the cropping season each year may be increased from 1 to 2 or 3, the total seed production would at least double. Today, there are 2 cropping seasons per year in Taiwan and Vietnam and 2 or 3 in the Philippines, Thailand, India, and Pakistan. Thus, these varieties must be insensitive to photoperiod.Decades ago, cultivated rice varieties were found to have a large variation in flowering time, also known as Hd [5]; that is, rice flowering was controlled by quantitative trait loci (QTL). By using the progeny derived from a single cross between a japonica rice variety, Nipponbare, and an indica variety, Kasalath, Yano and colleagues identified 15 QTL for Hd: Hd1 to Hd5 were mapped by QTL analysis of an F2 population [13], and Hd7, Hd8, and Hd11 were discovered with BC1F5 lines [14]. Seven loci, Hd6, Hd9, Hd10, Hd12-Hd15, were detected by using advanced backcross generations [15, 16]. The molecular genetic pathway for 2 model plants, Arabidopsis (an LD plant) and rice (an SD plant), has been addressed in many studies and the pathways were recently compared (e.g., Cho et al., 2017 [17]; Hori et al., 2016 [18]; Shrestha et al., 2014 [8]).RiceHd1 was identified as a CO ortholog and encodes a zinc-finger type transcriptional activator containing CO, CO-like, and TOC1 (CCT) domains [9]. It has 2 exons, with the CCT domain at the second exon. This CCT domain functions as a nuclear localization signal, and the mutant without the CCT domain in CO showed a defect in protein function [19]. Early rice genetics studies also assigned this locus as Se1 [20], fl [21], Lm [22], K [23] and Rs [24]. All of these alleles were then confirmed to be identical to Hd1 by linkage analysis (summarized by Kinoshita, 1998 [25]). The functional Hd1 protein is required for suppressing flowering under LD conditions and promoting flowering under SD conditions. Hd1 regulates Hd3a expression [26], the rice ortholog of ArabidopsisFT, which encodes a mobile flowering signal [11]. However, Ehd1, promoting SD flowering in the loss-of-function (LOF) hd1, encodes a B-type response regulator but has no ortholog in the Arabidopsis genome. Variations in Hd1 protein, Hd3a promoters, and Ehd1 expression contribute to the diversity of flowering time [26].To reveal the selection footprint in Hd1 in Asian rice cultivars, Fujino et al. (2010) [27] used a 2.9-kb Hd1 gene region, from the promoter to 3′ UTR, with 60 landraces, including indica, japonica, and aus, for allele network analysis [28]. The authors showed that non-functional alleles were not clearly associated with cultivar groups or locations. Their studies also revealed that multiple introgressions in the Hd1 gene region had occurred and were important for rice accessions to adapt to different habitats. Furthermore, a set of chromosome segregation substitution lines was used to elucidate the allele variations of common rice flowering-time QTL [29]. Hd1 was one of the 8 genes under investigation. The authors also discovered that genomic rearrangements around the Hd1 region occurred at a relatively early time during rice expansion. They suggested that the functional diversity of these genes was not related strongly to the phylogenetic relationship of subspecies.In the present study, we analyzed Hd1 gene sequence changes by using the 3K rice genome sequence database and sequence information for several local rice accessions as well as weedy rice (O. sativa f. spontanea) and wild rice. Together with the haplotypes discovered previously, we investigated 9 LOF hd1 haplotypes. The sequence information was compared with the flowering date data, along with locations of these rice accessions. Two haplotypes were more abundant than others, had been expanded to nearby areas and are used extensively in modern breeding programs. Selection parameters and allele network analyses revealed that these 2 mutations occurred in tropical insular Asia areas early during rice cultivation, then expanded to nearby areas.
Materials and methods
Classification of Hd1 haplotypes and DNA sequence analyses
Three sets of whole-genome sequencing data were downloaded from the public domain: rice 3K project [30], Chinese weedy rice [31] and Asian wild rice [32, 33]. The paired reads were mapped against the Os-Nipponbare-Reference-IRGSP-1.0 database [34, 35]. SAMtools [36] and VCFtools [37] were used to manipulate and transform the sequence alignment/map format (SAM) and variant call format (VCF) of the file. To detect single nucleotide polymorphisms (SNPs) and small indels, we used the command lines in the section “Variant Calling” in “Workflows” of the SAMtools manual without any restriction on depth or mapping quality. The information on SNPs and small indels was recorded in VCF files.The functional impact of nucleotide variants was analyzed by using the rice genome sequencing data with SnpEff [25]. From the genome annotation, sequence variants were classified according to their location (open reading frame, intron, splice sites, etc.) and predicted functional impact (missense, frame shift, early stop, etc.).To search for the 1.9-kb deletion in the Hd1 region in the 3K dataset, we performed a quick screening by using a python script (S1 File). We first identified the corresponding location of the 1.9-kb insertion in Nipponbare using TC65 sequence data, then tried to find any “unreasonable unpaired read” among the Illumina paired-end reads in this region because the distance between each pair should be about 200 to 300 bp. We then verified each candidate by using Integrative Genomics Viewer (IGV, [38-40]). Candidates with similar signature to Taichung65 (TC65) at the site of the 1.9-kb insertion were the accessions carrying the type 19 hd1 gene.To search for the 36-bp insertion at the first exon that was not present in the Nipponbare genome but was present in Kasalath [9], we used the insertion position with BAM files and IGV for visualized identification. To accelerate the process, we used the batch function of IGV to produce snapshots of the insertion region. The IGV images for Nipponbare (without the 36-bp insertion) and Kasalath (with the 36-bp insertion) (S1 Fig) were different and were used as a standard to scan all accessions.To reveal the relationship of 3K landraces carrying type 7 and 13 hd1 mutations, we used phylogenetic analysis with next-generation sequencing data. S1 Table lists the names, types, origins and sequence information for these lines. The clean reads were mapped to the Nipponbare reference genome (IRGSP v1.0) by using BWA v0.7.13-r1126 mem with default parameters [35, 41]. The mapped results were merged, and data with low mapping quality (q<20) were removed as BAM files by using Samtools v1.3 [36, 42]. Picard v2.1.1 MarkDuplicates was used to identify and remove duplicate reads in the same DNA fragments (http://picard.sourceforge.net). The Genome Analysis Toolkit v3.5-0-g36282e4 RealignerTargetCreator was used to identify regions around indels, then the Genome Analysis Toolkit IndelRealigner was used for local realignment [43]. Samtools and Bcftools were used for variant calling including SNPs and indels with filter by depth and mapping quality. Genetic distance was calculated with the p-distances model, and a neighbor-joining tree was constructed with 1,000 bootstraps by using PHYLIP v3.695 [44]. MEGA v7 [45] was used to display the phylogenetic tree.High-quality SNPs were selected from VCF files according to 3 criteria: 1) homozygous, 2) genotype quality >20, and 3) allelic depth > 4x. The VCF files were then converted to the NEXUS multiple alignment format by using vcf2phylip v2.0 [46]. Allelic haplotype networks were then created by using TCS networks v1.13 [28] with the parsimony connection limit as 2 steps and gaps treated as missing data.
Estimation of diversity of different Hd1 haplotypes
DNA sequences were aligned by using MUSCLE [47, 48]. The −10- to +10-kb region of Hd1 genes, corresponding to Nipponbare genome chromosome 6 from 9,326,376 to 9,348,569, was used for the analysis. Statistical analysis involved using DNAsp.v6 [49]. Items analyzed included number of polymorphic (segregating) sites (S); total number of mutations (Eta); average number of nucleotide differences (k); nucleotide diversity (π [50]); theta (per sequence) from Eta θ; and Watterson’s estimator of theta (per site) from Eta θw [51]. The neutrality test, Tajima's D value [52], was used to test the neutral mutation hypothesis. The D value was based on the discrepancy between π and θw. Thus, negative values indicate excess low-frequency polymorphism. These values were calculated after removing missing data and alignment gaps.
Results
Many LOF haplotypes are present in rice accessions
Takahashi et al. (2009) [26] identified a high degree of sequence diversity in riceHd1 and grouped them into 17 types. The previously recognized Kasalath haplotype [9], with a 2-bp deletion and thus LOF, was classified as type 13. In addition, the authors discovered 4 new mutations leading to LOF hd1, including 3 deletions and 1 SNP. Both types 2 and 3 had a 1-bp deletion, type 7 a 4-bp deletion, and type 12 an SNP inducing an early stop.In the first work for Hd1 gene cloning, Yano et al. (2000) [9] reported 2 haplotypes that induced LOF hd1: the 2-bp deletion in Kasalath (hd1) and the 43-bp deletion (se1) in HS66, a mutant of a Japanese landrace Ginbozu located on the short arm of chromosome 6. This 43-bp deletion was not grouped previously, and here we classified it as type 18. Doi et al. (2004) [10] worked on Ehd1 and Hd1 genes of a photoperiod-insensitive variety TC65. The TC65 hd1 haplotype has a 1.9-kb insertion at exon 2. However, this TC65 haplotype was not mentioned in Takahashi’s nomenclature [26]. In the current study, we classified it as type 19. In our previous study, we demonstrated that this 1.9-kb insertion was introgressed from 2 Taiwan landraces, Muteka and Nakabo, to the early breeding generation of TC65 [53].In the current study, we discovered 3 new LOF hd1 types from the 3K [30] and Oryza Map Alignment Project (OMAP) [32] rice genome sequencing information: type 20 with an A-to-T SNP at position 9338273 that led to an early stop at Arg368; type 21 with another SNP (C to T) at position 9337150 for an early stop at Gln206; and type 22 with a C-to-A change at position 9337112 for an early stop at Ser193.These types could be grouped into 22 types, with 10 mutations inducing loss of Hd1 protein function: types 2, 3, 7, 12, 13, 18, 19, 20, 21 and 22. S2 Table shows the accessions, types and countries for all 10 LOF haplotypes in the 3K resources. The mutations included 3 categories: 1) a small insertion/deletion (indel) leading to a frameshift: types 2, 3, 7, 13 and 18; 2) an SNP leading to an early stop: types 12, 20, 21 and 22; and 3) a large (1.9-kb) insertion leading to a frameshift: type 19. Previously when types 1 to 17 were defined, the genome sequences of landrace Ginbouzu were used [26]. Here we aligned changes to the Nipponbare reference sequence, and Table 1 shows the haplotypes of the LOF hd1 gene in the 3K rice database as well as its sequence localization and changes in amino acid residues.
Table 1
Haplotypes of loss-of-function Hd1 genes in the 3K rice dataset.
NB location
9336853
9337102
9337112
9337150
9337242
9338004
9338032
9338220
9338243
9338273
Types of aa changes
FS*
FS
STOP
STOP
FS
FS
FS
FS
STOP
STOP
Nipponbare seq †
C
G
C
C
-
TTT
-
AAGA
C
A
Type 2
1-bp del
Type 3
1-bp del
Type 7
4-bp del
Type 12
T
Type 13
2-bp del
Type 18
43-bp del
Type 19
1.9 kb
Type 20
T
Type 21
T
Type 22
A
Nipponbare (NB) position, nucleotide sequence changes and types of protein changes are indicated.
*FS: frame shift.
†: The position on Os-Nipponbare-Reference-IRGSP-1.0.
Nipponbare (NB) position, nucleotide sequence changes and types of protein changes are indicated.*FS: frame shift.†: The position on Os-Nipponbare-Reference-IRGSP-1.0.The Hd information for rice grown at the International Rice Research Institute (IRRI) campus, along with abundant phenotype data for about two-thirds of the 3K accessions are available in the IRRI SNP-SEEK phenotype database (https://snp-seek.irri.org/) [54]. Type 18 belongs to a mutation in the mutagen-induced mutant population of Ginbouzu, and type 22 occurs only in O. punctata, an African wild rice, so no phenotype data are available for these 2 types in SNP-SEEK. The Hd information was downloaded from the database and grouped according to 8 LOF haplotypes. The ANOVA results indicated that the Hd for these 8 groups significantly differed from that of the wild type (S3 Table), which did not contain any of these 8 haplotypes. Because only one accession of type 2 had Hd information, the Hd behavior was analyzed by t-test for only 7 haplotypes. Fig 1 illustrates the Hd phenotype for these haplotypes and the wild type (i.e., the accessions without any LOF haplotypes), with the detailed data in S4 Table. The day length for Los Baños, on the IRRI main campus, ranges from 11 hr 17 min to 12 hr 57 min year-round; the critical day length of rice is about 12.5 hr. Wildtype accessions had an average Hd of 106 days, and Hd for the other types ranged from 77 to 93 days (S4 Table). Because all haplotypes tested flowered before the wild type did, we confirmed that the accessions with these 7 types were indeed not sensitive to day length.
Fig 1
Flowering date of the 7 loss-of-function haplotypes of Hd1 gene and the wild type.
Plants were grown at the International Rice Research Institute and the flowering date information was downloaded from the SNP-SEEK website (https://snp-seek.irri.org/). Sample number and detailed statistical data are in S4 Table. WT, wild type. Different letters above bars indicate significant difference at P<0.05.
Flowering date of the 7 loss-of-function haplotypes of Hd1 gene and the wild type.
Plants were grown at the International Rice Research Institute and the flowering date information was downloaded from the SNP-SEEK website (https://snp-seek.irri.org/). Sample number and detailed statistical data are in S4 Table. WT, wild type. Different letters above bars indicate significant difference at P<0.05.
Many accessions have types 7 or 13 LOF hd1, as revealed by the 3K genome data
A total of 408 accessions in the 3K genome had a type 7 deletion and thus LOF hd1; 99% are indica rice. In total, 51 accessions belong to traditional accessions: 16 were from Indonesia, 5 the Philippines, 4 Bangladesh, 4 Laos, and 3 Vietnam. All are indica rice (Table 2). A total of 772 accessions had the type 13 hd1 allele (i.e., the Kasalath type); 291 belong to traditional landraces: 35 are from Bangladesh, most aus rice; 99 from Indonesia, 61 japonica, and 34 indica; and 4 are intermediate types; 62 from the Philippines, 47 japonica and 15 indica; 17 from Malaysia, 16 japonica and 1 indica; 16 from India, all indica or aus; 13 from Pakistan, all aus; 7 from Sri Lanka, 6 indica and 1 japonica; and 5 from Thailand, 3 indica and 2 japonica. All are tropical areas. In addition, many accessions belong to tropical japonica and are from insular countries such as Indonesia, the Philippines and Malaysia (Table 2, S2 and S5 Tables). To summarize, about 800 modern rice varieties had type 7 or 13 LOF hd1. Thus, types 7 and 13 are used intensively in modern breeding programs in many countries.
Table 2
Traditional accessions of 8 haplotypes in each country.
Allele group
Total
Traditional
2
3
7
12
13
19
20
21
Bangladesh
186
104
--
--
4
--
35
--
--
--
Bhutan
19
19
--
--
2
--
2
--
--
--
Cambodia
59
45
--
--
--
--
1
1
--
--
China*
481
--
(11)
(120)
--
(50)
(16)
--
--
India
435
73
--
2
3
--
16
--
9
3
Indonesia
248
203
--
--
16
--
99
--
--
--
Japan
55
11
--
--
--
--
--
--
--
--
Korea
34
19
--
--
1
--
--
--
--
--
Laos
126
84
--
--
4
10
--
--
--
--
Madagascar
66
16
--
--
--
--
8
--
--
--
Malaysia
75
55
--
--
2
--
17
--
--
--
Myanmar
75
21
--
--
1
--
--
--
--
--
Nepal
44
27
--
--
--
--
3
1
--
--
Pakistan
34
24
--
--
1
--
13
--
--
--
Philippines
229
97
--
--
5
--
62
--
--
--
Sri Lanka
54
40
--
--
1
--
7
--
--
--
Taiwan
30
23
--
5
4
--
3
11
--
--
Thailand
147
103
--
--
1
--
5
--
--
--
Viet Nam
55
28
2
--
3
--
1
--
--
--
Total
2457
990
2
7
48
10
272
13
9
3
Most accessions are 3K data, and we added some accessions collected from Taiwan aboriginal villages [53].
*Because of no information on traditional/modern types for Chinese accessions, the total numbers are shown in brackets.
Most accessions are 3K data, and we added some accessions collected from Taiwan aboriginal villages [53].*Because of no information on traditional/modern types for Chinese accessions, the total numbers are shown in brackets.
The 1.9-kb insertion in type 19 is a small retrotransposon
The insertion in type 19 is 1901 bp long at exon 2. Dot matrix analysis of this sequence revealed long terminal repeats (LTRs) at both sides, 448 bp in length, and with exact identity between the 2 LTRs. In addition, the central part of this fragment contains a partial pol protein sequence. Hence, this is the smallest retrotransposon in the rice genome. The cultivated rice genome, including both japonica and indica rice, contains 2 similar retrotransposons, one at chromosome 5 and another at 8, with length 1895 and 1893 bp, respectively. In the Nipponbare genome, the retrotransposon in chromosome 5 is 446 and 452 bp for the right and left LTR, respectively, with 97% identity of these 2 LTRs. The retrotransposon in chromosome 8 is 445 and 450 bp for the left and right LTR, respectively, with identity 97%.
Other haplotypes occurred in local regions
Only 2 accessions in the 3K collection had type 2 hd1. Both are indica rice and were from Vietnam. A total of 21 accessions had type 3, all indica; 11 are from China and 5 from Taiwan. These 5 accessions were brought to Taiwan from southern China in the late Ming Dynasty about 400 years ago and thus must be old landraces. A total of 28 accessions had type 12; 10 are traditional landraces. These landraces were from Laos, and all are tropical japonica. In total, 31 accessions had type 19 hd1 and most are modern varieties, including TC65, Taichung 179 and Taichung 188. According to our previous study of the Hd1 gene in Taiwan aboriginal rice accessions using genomic PCR analysis, 11 had this haplotype [53]. A total of 16 accessions had type 20, all from India and all indica or aus. Only 3 accessions had type 21, all from India and all aus (S2 Table).
LOF haplotypes also found in Chinese weedy rice accessions and an African wild rice accession
In addition to many landraces and modern varieties, much effort has been extended to resequencing the many weedy rice accessions (Qiu et al., 2017 [55] and Sun et al., 2019 [56] for Chinese ones; Li et al., 2017 [57] for American ones) and wild rice accessions [32, 33, 58]. We downloaded the sequence information for 155 Chinese accessions from the US National Center for Biotechnology Information (NCBI) for Hd1 haplotype analysis. The accessions collected from Liaoning (northeastern part) and Ningxia (northwestern part) are japonica and those from Jiangsu (central part) and Guangdong (southern part) are indica [55]. In all, 25 of the 27 Guangdong accessions had type 7, one accession did not have any known LOF haplotype and one accession had heterozygous type 7. However, all 39 Jiangsu accessions and 30 Ningxia accessions had only functional Hd1. For the 59 Liaoning accessions, 2 had type 7 and another 2 type 13; the remaining 55 accessions had functional Hd1. The detailed analysis is in S6 Table.Rice production in the Guangdong region involves 2 cropping seasons, with only one cropping season in the other 3 regions. Sequence analysis also indicated only Guangdong with the LOF hd1 gene but not the other 3 regions. Thus, haplotype analysis indicated that flowering habitat of these weedy rice accessions had been adapted to the nearby rice production, and only those in southern China had an LOF hd1 allele.We also performed detailed analysis of Hd1 haplotypes with the resequencing data for wild rice accessions downloaded from NCBI. Only accessions with at least 10X genome redundancy were used, including those from 1) the O. rufipogon pangenome project: W0123, W0141, W0170, W1687, W1698, W1739, W1754, W1777, W1943, W1979, W2012, W3078, and W3095 [33]; and 2) the Oryza map alignment project: O. barthii, O. glumaepatula, O. meidionalis, O. nivara, O. rufipogon and O. punctata [32]. A new LOF hd1 haplotype was found in O. punctata (S2 Table). This type 22 is an SNP of C to A at position 9337112, leading to an early stop of Hd1 protein.
Phylogenetic and haplotype network analyses of types 7 and 13
The distribution of types 7 and 13 hd1 are relatively wide and the accession numbers high, including the landraces and modern varieties; that is, about 40% of the 3K project had either of these 2 mutations, which indicates their importance in primitive and modern rice production in many areas. Thus, we further analyzed these 2 types by using phylogenetic relationship and allele network analyses. We used the Hd1 gene region, a 3.8-kb fragment from −1-kb to +1-kb, for elucidating the relationship. These landraces include japonica, indica, aus and aromatic types from 4 regions: 1) insular Southeastern Asia areas (i.e., Indonesia, the Philippines, and Malaysia); 2) Indochina areas (i.e., Cambodia, Laos, Thailand, Vietnam, and Myanmar); 3) Indian subcontinent (i.e., Bhutan, Nepal, India, Sri Lanka, Bangladesh, and Pakistan); and 4) Madagascar. Fifteen Asian wild rice accessions were also used in the analysis, including O. rufipogon and O. nivara sequences from the OMAP project [32] and the 13 wild rice pan-genome sequences [33]. The accessions belonged to OrI, OrII, OrIIIa or OrIIIb and were collected from Indonesia, Malaysia, India, Thailand and China [32, 33]. The information on accessions, types and collected regions is in S1 Table.Nucleotide changes in the Hd1 gene region of the 40 landraces with type 7, the 262 landraces with type 13, and the 15 wild rice accessions are in S7 and S8 Tables. These 302 landraces feature 41-nt changes, 22 in the open reading frame (ORF). The 40 landraces with type 7 are mainly indica rice and may be grouped into 5 types, 34 with the first 2 types. However, accessions with type 13 are more complicated: 262 landraces were grouped into 156 types, 28 with the first 2 types. In addition, they belong to tropical japonica, temperate japonica, indica, aus and aromatic. There are more nucleotide changes in the 15 Asian wild rice accessions than landraces, each accession with a unique type (S8 Table). This SNP information was used to construct a phylogenetic tree and network.The phylogenetic tree indicated that the landraces with type 7 are clustered together and are distinct from the 15 wild rice accessions (Fig 2A). However, the relationships of landraces with type 13 are rather complicated. These landraces may be grouped into 5 groups: group I is closest to the wild rice group, which indicates that they belong to the most ancient changes. More than 80% are from insular Southeastern Asia areas, so the early type 13 might occur in the region. Group V were mainly aus from the Indian subcontinent (highlighted in blue), and the clade originated from 7 accessions (highlighted in green at group V) from insular Southeastern Asia areas, with 6 tropical japonica and 1 indica accessions (Fig 2B).
Fig 2
Phylogenic and network analysis of types 7 and 13 Hd1 gene.
Nucleotide changes in the −1000 to + 1000-bp Hd1 region among 3K Asian landraces carrying types 7 and 13 loss-of-function mutation haplotypes as well as 15 Asian wild rice accessions were used. Accession information is in S1 Table. A. Phylogenetic tree of 40 3K accessions carrying type 7 Hd1 mutation and 15 wild rice accessions. The tree was constructed with SNP data by using PHYLIP [44]. Green indicates insular Southeastern Asia areas, red Indochina areas, blue Indian subcontinent, magenta Madagascar and black wild rice species. The bootstrap values determined with 1,000 samples are shown. B. Phylogenetic tree of 262 3K accessions carrying type 13 Hd1 mutation and 15 wild rice accessions. Groups I to V are indicated. Other notes are the same as A. C. Allele network of types 7 and 13 as well as the wild rice accessions was constructed with the TCS program v1.13 [28]. Black circles indicate wild rice accessions, green and blue highlight japonica and indica types from insular Southeastern Asia areas, cyan and red highlight japonica and indica for Indochina areas, purple and orange highlight japonica and indica for Indian subcontinent, and brown highlights indica for Madagascar. Sizes of circles are proportion to the haplotype frequencies. Lines indicate mutation steps.
Phylogenic and network analysis of types 7 and 13 Hd1 gene.
Nucleotide changes in the −1000 to + 1000-bp Hd1 region among 3K Asian landraces carrying types 7 and 13 loss-of-function mutation haplotypes as well as 15 Asian wild rice accessions were used. Accession information is in S1 Table. A. Phylogenetic tree of 40 3K accessions carrying type 7 Hd1 mutation and 15 wild rice accessions. The tree was constructed with SNP data by using PHYLIP [44]. Green indicates insular Southeastern Asia areas, red Indochina areas, blue Indian subcontinent, magenta Madagascar and black wild rice species. The bootstrap values determined with 1,000 samples are shown. B. Phylogenetic tree of 262 3K accessions carrying type 13 Hd1 mutation and 15 wild rice accessions. Groups I to V are indicated. Other notes are the same as A. C. Allele network of types 7 and 13 as well as the wild rice accessions was constructed with the TCS program v1.13 [28]. Black circles indicate wild rice accessions, green and blue highlight japonica and indica types from insular Southeastern Asia areas, cyan and red highlight japonica and indica for Indochina areas, purple and orange highlight japonica and indica for Indian subcontinent, and brown highlights indica for Madagascar. Sizes of circles are proportion to the haplotype frequencies. Lines indicate mutation steps.The allele network revealed 3 groups for type 7 and 13 hd1: group I were all accessions with type 7; groups II and III both had accessions with type 13 (Fig 2C). Most of the accessions in group I were from insular Southeastern Asia areas, with a few from Indochina and the Indian subcontinent. Group II accessions were rather complicated, most being tropical japonica accessions from insular Southeastern Asia areas. The minor ones involved indica in the same area as well as japonica and indica in the Indochina area. Group III was clearly a separate lineage diverged from a small group of japonica/indica accessions from insular Southeastern Asia areas. All these accessions were aus collected from the Indian subcontinent and all except 2 had identical SNPs at the intron, the second ORF, and 3′ region (S7 Table). The first several “dots” of group III connecting to group II were collected from Bangladesh, fit well with the geographic distribution and indicated the footprint of early human movement. About 70% of group III accessions were collected from Bangladesh. This coalescent approach clearly suggested that type 13 occurred at insular areas in Southeastern Asia, followed by introgression and expansion (i.e., brought by human beings) to the Indochina area and Indian subcontinent. In addition, both type 7 and 13 did not have a close relationship with the Asian wild rice accessions tested. Hence, allele network analysis clearly shows the mutations, introgression and human activity of these accessions during rice cultivation/adaptation.During the work on the allele network analysis, we found that 3 wild rice accessions, W3078, W1687 and W1777, must have had introgressions in the 3.8-kb Hd1 gene region. The W3078 genome is completely identical to that of Nipponbare, and the other 2 have 2 and 4 SNPs, respectively. Thus, only 12 wild rice accessions were used for further allele network analysis. This finding also revealed that some of the wild rice accessions had introgression fragment(s) from cultivars.
Discussion
Together with Ehd1, Hd1 was suggested to control rice panicle development and showed their effect in crop yield [59]. Thus, Hd1 could have dual effect on rice production (i.e., involved in rice development and controlling flowering time). Insensitivity to photoperiod in tropical and subtropical regions has been beneficial to rice production because of 1) increased crop yield per year and 2) escape/avoidance from stresses caused by seasonable typhoons, monsoons, or drought. Thus, insensitivity was one of the targets to select for heading gene mutations during the long rice cultivation history. In this study, we analyzed the LOF hd1 mutations that occurred in rice in different regions of Asia by using whole-genome sequencing data for thousands of accessions.
Hd1 was suggested to be the main allele associated with adaptation of rice plants to tropical regions
Rice was domesticated about 8,000 to 10,000 years ago in the Yangtze River region of China, which today still has one cropping season. Domesticated rice had spread southward thousands of years ago, and one of the new traits beneficial to crop yield would be loss of sensitivity to photoperiod. If rice could be cultivated for 2 or 3 seasons each year, production would at least double.Takahashi and Shimamoto [2] performed a cDNA sequencing analysis of Hd1 and Hd3a genes in leaf tissues at tillering stage under an SD condition from several rice cultivars and 38 wild rice accessions. Cultivated and wild rice accessions showed no nucleotide changes in Hd3a. Also, the 38 wild rice accessions showed no change affecting Hd1 function, but several changes appeared in cultivated rice. Thus, the authors suggested that Hd1 was a possible target of selection to generate different flowering-time responses in different regions. Kim et al. [60] selected about 60 diverse rice accessions to analyze the Hd trait of various types, including 4 aus accessions, 20 indica accessions, 4 tropical japonica accessions, 17 temperate japonica accessions, and 12 tropical adapted temperate japonica accessions. The authors analyzed 7 major flowering genes, including Hd1 [9], OsPPR37 [61], DTH8 [62], Ghd7 [12], Ehd1 [10], RFT1 [63] and Hd3a [11] as well as heading behavior under 3 different field conditions in temperate and tropical regions. They concluded that only the accessions from tropical/subtropical regions preferentially had the non-functional alleles of hd1 but not the other flowering genes tested. Therefore, in the present study, we focused on the Hd1 haplotype.
Several Hd1 haplotypes were limited to small regions and some were widely spread
Table 2 summarizes the traditional accessions for each haplotype in rice-growing countries of Asia. As indicated in Results, types 2, 3, 12, 19, 20 and 21 were limited to small regions. Thus, mutations of these 6 LOF haplotypes must have occurred recently during adaptation, and all were in accessions in tropical/subtropical regions. Most were indica or aus rice, with only types 12 and 19 as tropical japonica. Fig 3 illustrates the sample size and frequency of traditional landraces collected for each haplotype in rice-growing countries in Asia. S2 Fig illustrates the distribution of subtypes in each of these countries to show ratios of aus, indica, temperate japonica, tropical japonica, aromatic, and admixture. Fig 3 clearly shows that no or only a few number or types of LOF Hd1 haplotypes were found in high-altitude and -latitude regions such as Bhutan, Nepal, Korea and Japan. However, most of the accessions from tropical/subtropical regions had the non-functional Hd1 alleles, with multiple haplotypes in the same region.
Fig 3
Geographic distribution of functional and non-functional Hd1 alleles.
The number in each pie indicates the number of cultivars belonging to each Hd1 allelic group. Colors indicate different haplotypes and the pie size indicates the sample size. Only the traditional landraces in the 3K dataset are illustrated. Type 2, lime green; Type 3, sky blue; Type 7, yellow; Type 12, green stripe; Type 13, navy; Type 19, brown; Type 20: yellowish green; Type 21, light blue; other (without loss-of-function mutation), gray. The Asia map was downloaded from https://freevectormaps.com/world-maps/WRLD-EPS-01-0017.
Geographic distribution of functional and non-functional Hd1 alleles.
The number in each pie indicates the number of cultivars belonging to each Hd1 allelic group. Colors indicate different haplotypes and the pie size indicates the sample size. Only the traditional landraces in the 3K dataset are illustrated. Type 2, lime green; Type 3, sky blue; Type 7, yellow; Type 12, green stripe; Type 13, navy; Type 19, brown; Type 20: yellowish green; Type 21, light blue; other (without loss-of-function mutation), gray. The Asia map was downloaded from https://freevectormaps.com/world-maps/WRLD-EPS-01-0017.For the 1.9-kb small retrotransposon integrated into Hd1 of type 19, because the length and sequences of both LTRs are identical, this transposition into chromosome 6 might have occurred about 2,000 years ago. Two similar retrotransposons at chromosome 5 and 8 had different lengths for the LTR pair, and the sequences were not identical (97% for each), so the transposition into these 2 locations must have occurred a long time ago. In addition, these 2 small retrotransposons were located in chromosomes 5 and 8 in indica rice accessions, which further confirmed that they were present in the rice genome before the split of japonica and indica rice. In the Asian AA-genome wild rice, O. nivara has the chromosome 5 copy only, and O. rufipogon has both chromosome 5 and 8 copies.Many traditional landraces were relatively limited to small areas, but this was not the case for accessions with types 7 and 13 (Table 2). S5 Table shows accession numbers with type 7 and 13 Hd1 alleles in the traditional landraces of each country, including information on their subspecies. Most of the type 7 accessions were indica rice, with more japonica rice for type 13 accessions, especially in Indonesia and the Philippines. Both haplotypes are still used for many modern indica varieties worldwide, such as most IR series inbred lines. The phylogenetic and allele network analyses suggested that these two mutations might have occurred first in insular southeastern Asia areas and then introgressed to many local landraces and moved to other regions.
Some weedy rice or wild rice accessions also contained LOF hd1
Weedy rice (O. sativa f. spontanea), also called “red rice”, has been considered a conspecific weed of cultivated rice. Most accessions are shattering, with a long awn, brown pericarp, and dark seed hull. Several origins for weedy rice have been suggested, including gene flow resulting from natural hybridization[64], evolving directly from domesticated lineages[65], introgression with local cultivars coupled with selection that maintained weedy identity[66], influenced by proximity to reproductively compatible wild and domesticated populations[67], originated from indica-japonica hybridization [31], hybridization of modern indica/indica or japonica/japonica [68], or evolved from maternal hybrid rice derivatives [69]. As mentioned in the Results section, data mining of the Chinese weedy rice accessions illustrates that type 7 LOF hd1 occurred in 96% of the accessions collected in Guangdong, a sub-tropical region. However, most of the accessions collected in another 3 locations (all temperate regions) had functional Hd1. Because of the ecological meaning of the weedy rice, type 7 hd1 must have existed in southern China, including Guangdong, for a long time.With the detailed studies of several wild rice accessions, we also found 1 LOF haplotype in O. punctata, a BB genome African wild rice. Therefore, sequence changes leading to LOF hd1 occurred not only in cultivated rice but also in its wild relatives.
LOF hd1 mutations occurred in agro
Several independent origins of LOF hd1 mutations led us to raise interesting questions about the adaptation of the Hd1-negative phenotype. One novel mutation, type 22, existed only in an African wild rice accession, and none of the other haplotypes preexisted in the wild rice accessions tested. Thus, most mutations, except type 22, did not originate in wild species but occurred in agro.We then checked for the existence of positive selection in the LOF hd1 mutation. We calculated selection parameters, including π [50], θw [51], as well as Tajima’s D [52], to test the neutral mutation hypothesis. Hd1 gene and the upstream 10-kb and downstream 10-kb regions were used. A significant negative D value indicates strong selection. We analyzed 5 haplotypes in 22 regions. Data are summarized in Table 3 and the complete data are in S9 Table. Type 7 was positively selected in Indonesia but not in Bangladesh, India, Laos, Malaysia, the Philippines or Taiwan. Thus, this mutation occurred in the indica rice accession in Indonesia, similar to the allele network analysis. Type 12 was present only in Laos and it was positively selected. For type 13, we found highly significant selection (P <0.001) in landraces of Indonesia and the Philippines, especially in japonica rice. The selection was also significant (P <0.05) in Malaysia japonica rice. However, the selection was not obvious in Bangladesh or India, etc. These results again verified the allele network studies. Type 19 mutation was mainly found in the aboriginal villages of Taiwan [53] and the parameters shown in Table 3 also indicated positive selection in these lines.
Table 3
Selection pressure analysis of four Hd1 types in several countries.
Haplotype
Country
Subspecies
No. of accessions
π
θw
Tajima's D
Significance of D
Type 7
Bangladesh
Indica
4
0.00065
0.00066
-0.15923
P >0.10
Type 7
Indonesia
Indica and japonica
16
0.00138
0.0027
-2.12711**
P <0.01
Type 7
Laos
Indica
4
0.00044
0.00044
-0.15777
P >0.10
Type 12
Laos
Japonica
10
0.00115
0.00179
-1.75625*
P <0.05
Type 13
Bangladesh
Indica
35
0.00294
0.00603
-1.95450*
P <0.05
Type 13
India
Indica
16
0.00320
0.00267
0.86860
P >0.10
Type 13
Indonesia
Indica and japonica
99
0.00120
0.00547
-2.65204**
P <0.001
Type 13
Indonesia
Japonica
61
0.00137
0.00534
-2.65105**
P <0.001
Type 13
Philippines
Indica and japonica
62
0.00125
0.00432
-2.52823**
P <0.001
Type 13
Philippines
Japonica
46
0.00137
0.00432
-2.50592**
P <0.001
Type 19
Taiwan
Japonica
11
0.00205
0.00328
-1.81600*
P<0.05
Hd1 gene region and nearby ± 10 kb of traditional landraces were used for calculation.
Hd1 gene region and nearby ± 10 kb of traditional landraces were used for calculation.We detected significant Tajima’s D values in several types/locations, as shown described above. The negative D values indicate either high selection pressure and thus reducing sequence variation or a recent population expansion. A total of 1181 accessions, including landraces and modern varieties, in the 3K resource have types 7 or 13, which indicates that the traits are important locally and globally. Therefore, we suggest that strong selections were applied to these LOF haplotypes (types 7, 12, 13, 19). In total, 203 Indonesian landraces were used in 3K data. However, according to the Genesys website (www.genesys-pgr.org), 6899 Indonesian landraces are in the IRRI collection. That is, we are only studying a small proportion of the total population in Indonesia and in many other countries.
A 36-bp deletion in the first ORF of Nipponbare genome
While working on the map-based cloning of Hd1 gene decades ago, Yano et al. pointed out that in addition to the 2-bp deletion in the second ORF in Kasalath that caused LOF, there was a 36-bp insertion in the first ORF of Kasalath as compared with Nipponbare [9]. However, this 36-bp fragment was present in the CO gene. Thus, Nipponbare contained an in-frame 12 amino acid deletion. The residues in Arabidopsis are LARRHQRVPILP and in Kasalath are LARRHQRVPVAP (i.e., identical in 10 amino acids and similar in 2). This 36-bp fragment is rich in GC and may form a stem-loop structure with the stable free energy of -2.06 kcal mol−1 (S3 Fig). Here we screened the presence of this 36-bp fragment in the 3K landraces, weedy rice and wild rice accessions. Among about 2000 accessions screened, most had this 36-bp fragment (S10 Table). That is, all wild rice and indica accessions had this short insertion. Few temperate japonica accessions are the same as Nipponbare, including 10 from Korea, 6 from Japan, 4 from China and 1 from Taiwan. For the 4 groups of Chinese weedy rice [55], those from Ningxia (temperate japonica), Jiangsu (indica), and Guangdong (indica) also had the 36-bp insertion. Only one-third of Liaoning accessions (temperate japonica) did not have the insertion, which might be due to the geographical closeness to Korea. This short deletion might have occurred in northeastern Asia and spread to nearby regions, without changing the flowering behavior.
Introgressions and human activities played important roles for rice diversification/adaptation as revealed by the Hd1 gene region
Previous studies using sequencing information of the Hd1 gene region of 60 landraces [27] or 429 CSSL materials [29] illustrated that multiple introgressions and thus dynamic rearrangements occurred around the Hd1 gene. In the present study, we showed that the type 7 mutation occurred in indica rice in insular Southeastern Asia areas. The mutation was introgressed into 3 japonica rice and other indica accessions in the same region. The indica types were also moved to the Indochina region through human activities. However, the type 13 mutation occurred in tropical japonica rice in insular Southeastern Asia areas and some were introgressed into local indica types. Some of these indica accessions were expanded to Indochina by human activities, most if not all being indica type. A few accessions with type 13 mutation were also brought from the islands to Bangladesh and introgressed to the local aus accessions. They were now aus lines carrying type 13 mutation in the Indian subcontinent. Thus, there should have been frequent transportation among the islands of insular Southeastern Asia areas and also from islands to Indochina as well as the Indian subcontinent hundreds or thousands of years ago. By studying Hd1 gene sequences, we illustrated that both introgression and human activities played important roles in rice diversification and adaptation.
Conclusions
In the current study, we chose landraces carrying LOF hd1 mutations for detailed analysis. We demonstrated that many LOF hd1 alleles existed in accessions in subtropical and tropical Asia rice-growing areas. Some of these haplotypes were present locally, whereas 2, types 7 and 13, originated from insular Southeastern Asia areas. They were introgressed to other accessions, then expanded with human migration. These 2 types were spread to many regions and are now used in most of the modern varieties in many countries in different continents.
Python script used for screening of 1.9-kb insertion in the 3K data.
(TXT)Click here for additional data file.(RAR)Click here for additional data file.
The accessions used for the phylogenetic and allele network analysis.
The accessions and collected regions of the landraces and wild rice are listed.(XLSX)Click here for additional data file.
The accession, types and collected nations and resource of loss-of-function Hd1 gene in the 3K dataset.
(XLSX)Click here for additional data file.
ANOVA for flowering dates of Hd1 haplotypes and the wild type.
(DOCX)Click here for additional data file.
Summary of statistical analysis of flowering dates of Hd1 haplotypes and wild type.
(DOCX)Click here for additional data file.
Distribution of japonica and indica rice in traditional accessions from different countries.
Only types 7 and 13 are illustrated. Most are from 3K data and we added some accessions collected from Taiwan aboriginal villages.(DOCX)Click here for additional data file.
Accessions, types, collected regions and source for 10 loss-of-function haplotypes of Hd1 in weedy rice and wild rice.
(XLSX)Click here for additional data file.
Nucleotide changes in the hd1 gene region for landraces carrying types 7 and 13 loss-of-function mutation.
The 3.8-kb fragments from −1000 to + 1000-bp of Hd1 were used. Position indicates the corresponding location of chromosome 1 in Nipponbare (IRGSP v1.0). Dot indicates the same as Nipponbare reference sequences, “-” indicates deletion in that position. The 3K code, accession names, subspecies, origin, haplotype group and numbers and sequence changes are listed.(XLSX)Click here for additional data file.
Nucleotide changes in the Hd1 gene region for 15 Asian wild rice accessions.
The 3.8-kb fragments range from −1000 to + 1000-bp of Hd1. Position indicates the corresponding location of chromosome 1 in Nipponbare (IRGSP v1.0). Dot indicates the same as Nipponbare reference sequences, “-” indicates deletion in that position. The accession names, subspecies, origin and sequence changes are listed.(XLSX)Click here for additional data file.
Selection parameters of five Hd1 types in several countries.
The Hd1 gene region and nearby ± 10-kb of traditional landraces were used for calculation.(DOCX)Click here for additional data file.
The presence or absence of the 36-bp fragment in 3K landraces, weedy rice and wild rice.
(XLSX)Click here for additional data file.
View of the 36-bp deletion in the Hd1 gene first exon.
The Integrative Genomics Viewer view of aligned reads of a 162-bp first exon region of Hd1 gene is shown. The pair read sequencing was performed on an Illumina platform. Alignments toward the Nipponbare IRGSP 1.0 (top row) are represented as gray polygons and mismatched nucleotides as orange, blue, red and green bars. The second row is the alignment of Nipponbare Illumina reads and the third row the alignment of Kasalath reads. Blue arrow at bottom points to the 36-bp deletion region.(TIF)Click here for additional data file.
Geographic distribution of rice subtypes of each Asian country shown in Fig 3.
Colors indicate different subtypes and the pie size indicates the sample size. Only the traditional landraces in the 3K dataset are illustrated. Temperate japonica: light blue; tropical japonica: green; indica: yellow; aus: orange; aromatic: dark blue; admixture: grey. Because of no information about tradition/modern types for Chinese accessions, all accessions from China are used for illustration. The Asia map was downloaded from https://freevectormaps.com/world-maps/WRLD-EPS-01-0017. Republished from https://freevectormaps.com/world-maps/WRLD- EPS-01-0017 under a CC BY license, with permission from FreeVectorMaps.com, original copyright 2020.(TIF)Click here for additional data file.
The stem-loop structure of the 36-bp deletion located at the first exon in Napponbare.
The mfold package at http://unafold.rna.albany.edu/?q=mfold was used. The free energy (dG) is -2.06 kcal mol-1.(TIF)Click here for additional data file.26 May 2020PONE-D-20-11893Studies of riceHd1 haplotypes worldwide reveal adaptation of flowering time to different environmentsPLOS ONEDear Dr. Hsing,Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. In particular, one of the reviewers criticizes the part of Champa rice because it still needs to be supported by more lines of evidence to reach a solid conclusion. Because the part may not be really necessary for the main story of current manuscript, thus you may reduce or delete it.Please submit your revised manuscript by Jul 10 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.Please include the following items when submitting your revised manuscript:A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocolsWe look forward to receiving your revised manuscript.Kind regards,Jong-Seong Jeon, Ph.D.Academic EditorPLOS ONEJournal Requirements:When submitting your revision, we need you to address these additional requirements.1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found athttps://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf andhttps://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf2. Thank you for stating the following in the Competing Interests section:'The authors have declared that no competing interests exist.'We note that one or more of the authors are employed by a commercial company: FourOneFour Studio.a. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.Please also include the following statement within your amended Funding Statement.“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.b. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc.Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.c. Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests3. We note that Figure 2 in your submission contain map images which may be copyrighted.All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.We require you to either (a) present written permission from the copyright holder to publish this figure specifically under the CC BY 4.0 license, or (b) remove the figures from your submission:a. You may seek permission from the original copyright holder of Figure 2 to publish the content specifically under the CC BY 4.0 license.We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”b. If you are unable to obtain permission from the original copyright holder to publish this figure under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.The following resources for replacing copyrighted map figures may be helpful:USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.htmlNASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/Landsat: http://landsat.visibleearth.nasa.gov/USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/#Natural Earth (public domain): http://www.naturalearthdata.com/[Note: HTML markup is below. Please do not edit.]Reviewers' comments:Reviewer's Responses to QuestionsComments to the Author1. Is the manuscript technically sound, and do the data support the conclusions?The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.Reviewer #1: YesReviewer #2: Partly**********2. Has the statistical analysis been performed appropriately and rigorously?Reviewer #1: YesReviewer #2: No**********3. Have the authors made all data underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: Yes**********4. Is the manuscript presented in an intelligible fashion and written in standard English?PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.Reviewer #1: YesReviewer #2: No**********5. Review Comments to the AuthorPlease use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)Reviewer #1: The manuscript entitled “Studies of riceHd1 haplotypes worldwide reveal adaptation of flowering time to different environments” by Wu et al. investigates how the variations in riceHd1 halotypes contribute to rice domestication and adaptation. This study analyzed the Hd1 gene sequence alterations in various local rice accessions as well as weedy rice and wild rice to find out different hd1 halotypes.This manuscript is generally well-written and organized in a logical manner. The data presented by these authors support their conclusions. Here are some minor to make it easier for the readers to understand.Minor points:1. Page 3, line 67: Early heading date (Ehd1) => Early heading date 1 (Ehd1)2. Page 3, lines 66-71: Overall, it is confused whether they are gene name or protein name. I understood that the authors meant them gene names(Hd1, Ehd1, Hd3a, and Ghd7). If so the words “which encodes” after each gene names.3. Pages 6-7, lines 164~176: Simply listing synonyms for the Hd1 gene does not seem to provide that much information. It does not seem bad to make a table and insert references.4. Page 13, lines 286-287: 31 accessions type 19 hd1 =>31 accessions are type 19 hd1.5. Page 14, line 315: rufupogon => rufipogon6. Page 17, line 374 and 375: Hd1 => Hd17. Page 22, line 466: japonica => japonica8. Page 22, line 467: indica => indica9. Page 23, line 498: includingπ => including π10. Page 27, line 580: CONSTAINS => CONSTANS, Early heading date => Early heading date 111. The table 2 and 3 are a little hard to see and understand. I hope it can be fixed more easily and in a better way.12. At the beginning of the discussion session, the authors mentioned that photoperiod insensitivity has been beneficial to rice production. However, Endo-Higashi and Izawa (2011) showed that flowering time gene Hd1 could control tiller and panicle development in rice.Therefore, it might be necessary to mention that the Hd1 gene could be involved in the rice developments, along with controlling of the flowering timing, which affect rice productivity.Reviewer #2: The manuscript entitled “Studies of riceHd1 haplotypes worldwide reveal adaptation of flowering time to different environments” describes the relationship between sequence information and flowering time using published 3K rice genome data, accession of weedy rice sequences, and newly determined genome sequences of likely Champa lines in Taiwan. They found three new loss-of-function alleles of Hd1 from the data. Then they found that two of them, type 7 and type 13 alleles, are major ones among loss-of-function Hd1 alleles and have a long history after their selection. More than one-third of the world’s rice accessions contain these two mutations. Interestingly, they distributed among sub-tropical and tropical accsssion.In addition, they also found that “Champa” lines may not belong to aus, but to the indica cluster. The allele distribution of rice weedy were also examined.Since Hd1 gene is a major QTL gene to control rice flowering time, it make sense to analyze Hd1 alleles in details. However, there are several related papers already published. Thus, the novelty of this manuscript is not so high. Unfortunaltely, the authors may not recognize a few key papers related to Hd1 haplotypes such as Fujino et al. (2010) and Itoh et al. (2017), thus their discussion should be reorganized with them. I am afraid that the authors just focused on the SNPEffect data of Hd1 among 3K genome too much and missed some important aspect of Hd1 diversity. Especially, haplotype network classification of 3K data are necessary in this manuscript. The analysis of Hd1 introgression as shown in the not-cited papers, instead of selective sweep analysis, such as Tajima’s D would be appropriate in these data.In addition, it is very funny that 1.9kb insertion in type 19 was detected among 3K data since 3K data use short-reads less than 100 bp and mapped the read into IRGSP1.0. Thus, the authors should describe how they got information on type19 mutations.Furthermore, they should describe the allele of previously known 36bp deletion in japonica rice in this manuscript. This allele contain the in-frame deletion which caused 12 amino acid from very conserved region of CO like genes though there is no evidence which affect the function of Hd1 before. I am afraid that all this allele information is missing in the public 3K vcf file.The Champa story and weedy rice story looks extra. They should be independent manuscript if possible.Finally, when I read this manuscript, I am often confused that which data is new and novel from this work or ones from previous works. The authors should try to clarify this point better.Minor points,1) In the introduction section, description of previous analysis on Hd1 haplotypes are largely missing. I am afraid that it may become a biased presentation to pretend the novelty of this work.2) The previous paper should cited with the first authors. “Shimamoto and coworkers” and “Yoshimura and coworkers” are inappropriate. Only in some reviews, I saw this style before.3) How they got the flowering time graph in Fig.1 are not well explained yet. If they compared distinct lines containing the same mutation, they should plot all the data including outlier data.**********6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: Yes: Lae-Hyeon ChoReviewer #2: No[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.14 Jul 2020Reply to reviewers’ commentsReviewer 1Thank you very much for the suggestions. I hereby answer accordingly.Points 1 through 10:Yes, we changed it according to the comments.Points 11:Table 3 was moved to S3 Table in the revision accordingly.Point 12:Thank you very much for the suggestion. We added it to the discussion and cited the reference accordingly. Hd1 could have dual effect on rice production, i.e. involving in the rice development and controlling of flowering time.Reviewer 2:Thanks a lot for the critical comments and they certainly made improvements on this work. There are many major points so I reply one by one.1. The part related to Champa rice was indeed a different story and it was removed in this version.2. We performed allele network analysis as suggested. Because we only used the land races carrying types 7 and 13, a clear-cut network was shown (Fig. 2 in the current submission). We were able to show the area and types the mutations occurred and how they expanded by human migration. Together we demonstrated that introgressions and human activities played important roles for hd1 diversification/adaptation.3. Thank you very much for pointing out how we detected 1.9 kb insertion in 3K data. I then double checked with my former PhD student FJW. He indeed wrote a python script 3 years ago and quickly scanned the whole data set followed by IGV confirmation. The method was updated and script provided as Supplementary Material.4. About the 36-bp insertion in the first exon. We performed the analysis and the results were shown in the revision. Because 36-bp insertion is very short, it is not easy to write a suitable script to scan the whole data set in large/fast scale. Thus, we used visualization identification as mentioned in the revision. Of the 1070 traditional landraces in 3K, all indica and tropical japonica accessions had this short insertion. Only few temperate japonica accessions were the same as Nipponbare, including 10 accessions from Korea, 6 from Japan, 4 from China, and 1 from Taiwan. For the 6 Japanese ones, 4 are real landraces (from NARO Genebank website), 1 is Nipponbare, and the other CX330 Yueguang. It took us a while to figure out this Yueguang should be Koshihikari as its Han characters (Kanji) pronounced as Yueguang in Chinese. It was picked up by CAAS and maybe even IRRI breeders do not know its real name is Koshihikari.All 15 Asian wild rice tested also have this 36-bp fragment, however, 3 of them have identical or similar seq SNP with Nipponbare. We suggest the resource could be Japanese accession with 36-bp fragment. We do not have enough sequence information to check all though.5. About which data was new and novel or which ones from previous work. Yes, we improve it accordingly. Thank you very much for pointing out.Minor points1. We added more discussion accordingly.2. Yes, it was improved in the current session.3. For the ~2500 accessions with seeds in IRRI germplasm, IRRI scientists performed a series of phenotyping analyses and data were available in SNP-SEEK, as mentioned in the ms. We downloaded these data and ran analysis. “Wild type” means the accessions without any LOF mutation and thus was used as outliers. Wild type accessions had the average heading date of 106 days, and the other types ranged from 77 to 93 days (S4 Table). The accession numbers used for WT, type 3, 7, 12, 13, 19, 20, 21 are 1263, 19, 213, 21, 552, 16, 16, and 3, respectively. We hope these data were enough to confirm they were LOF mutations.4. There are 10 supplementary tables in the current version. For instances, we provided the SNPs information of hd1 gene region of landraces carrying type 7 and 13 mutations, we also listed the accessions belong to each haplotype, etc. Because there are around 400 landraces consisting types 7 and 13 mutations, they were shown as Excel files in Supplementary tables.Submitted filename: Response to Reviewers.docxClick here for additional data file.14 Aug 2020PONE-D-20-11893R1Studies of riceHd1 haplotypes worldwide reveal adaptation of flowering time to different environmentsPLOS ONEDear Dr. Hsing,Thank you for submitting your manuscript to PLOS ONE. Both reviewers and I feel that this revised manuscript deserves PLOS ONE’s publication. Before its publciation we invite you to revise only few minor points raised during the review process.Please submit your revised manuscript by Sep 28 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.Please include the following items when submitting your revised manuscript:A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocolsWe look forward to receiving your revised manuscript.Kind regards,Jong-Seong Jeon, Ph.D.Academic EditorPLOS ONE[Note: HTML markup is below. Please do not edit.]Reviewers' comments:Reviewer's Responses to QuestionsComments to the Author1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.Reviewer #1: All comments have been addressedReviewer #2: (No Response)**********2. Is the manuscript technically sound, and do the data support the conclusions?The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.Reviewer #1: YesReviewer #2: Yes**********3. Has the statistical analysis been performed appropriately and rigorously?Reviewer #1: YesReviewer #2: Yes**********4. Have the authors made all data underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: Yes**********5. Is the manuscript presented in an intelligible fashion and written in standard English?PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.Reviewer #1: NoReviewer #2: Yes**********6. Review Comments to the AuthorPlease use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)Reviewer #1: Thank you for revising the manuscript. Most of my comments have been well revised. However, I still have some minor issues.ArabidopsisCO => CO(many times in the manuscript)Page 4, line 92; page 5, line 101 : orthologue => orthologPage 14, line 295 : 446 bp and 452 bp => 446 and 452 bpPage 16, line 327 : Guangdona => Guangdong(?)Page 19, Line 422 : Hd1 gene => Hd1Reviewer #2: This version of manuscript has been much improved since the authors have addressed my previous comments politely.Minor comments;In Table3, each circle may contain all subgroups in 3K. Thus, similar data for only a single subgroup such as aus or temperate japonica should be presented as a supplemental figure to imply that local requirement of the Hd1 function may not be affected by the subgroup difference.Explanation of Tajima'D results may not be restricted to selective sweeps. Thus, the authors should also describe other possibility such as population size changes (or bottle neck effect).**********7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: Yes: Lae-Hyeon ChoReviewer #2: No[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.27 Aug 2020We hereby submit the 2nd revision for our previous manuscript entitled “Studies of riceHd1 haplotypes worldwide reveal adaptation of flowering time to different environments”. Thanks a lot to the reviewers, and we modified the original manuscript accordingly.Here are the points-to-points replies for the comments:Reply to Reviewer #1:1. ArabidopsisCO => COYes, we changed them accordingly. Thank you very much.2. typos or mistakes in pages 4, 14, 16 and 19Yes, we changed them accordingly. Thank you very much.Reply to Reviewer #2:1. About the display of these subgroups (Fig 3). Yes, we added S2 Fig in the supplementary materials. It indeed improves the interpretation of Fig 3. Thank you very much.2. About explanation of Tajima’s D results. Thank you very much for the reminding. We added a short paragraph for the interpretation accordingly.Submitted filename: Rebuttal_letter_0826.docxClick here for additional data file.31 Aug 2020Studies of riceHd1 haplotypes worldwide reveal adaptation of flowering time to different environmentsPONE-D-20-11893R2Dear Dr. Hsing,We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.Kind regards,Jong-Seong Jeon, Ph.D.Academic EditorPLOS ONEAdditional Editor Comments (optional):Reviewers' comments:7 Sep 2020PONE-D-20-11893R2Studies of riceHd1 haplotypes worldwide reveal adaptation of flowering time to different environmentsDear Dr. Hsing:I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.If we can help with anything else, please email us at plosone@plos.org.Thank you for submitting your work to PLOS ONE and supporting open access.Kind regards,PLOS ONE Editorial Office Staffon behalf ofProfessor Jong-Seong JeonAcademic EditorPLOS ONE
Authors: Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo Journal: Genome Res Date: 2010-07-19 Impact factor: 9.043
Authors: James T Robinson; Helga Thorvaldsdóttir; Aaron M Wenger; Ahmet Zehir; Jill P Mesirov Journal: Cancer Res Date: 2017-11-01 Impact factor: 12.701