| Literature DB >> 29661190 |
Sarah M Pilkington1, Ross Crowhurst1, Elena Hilario1, Simona Nardozza1, Lena Fraser1, Yongyan Peng1,2, Kularajathevan Gunaseelan1, Robert Simpson3, Jibran Tahir3, Simon C Deroles3, Kerry Templeton1, Zhiwei Luo1, Marcus Davy4, Canhong Cheng1, Mark McNeilage1, Davide Scaglione5, Yifei Liu6, Qiong Zhang7, Paul Datson1, Nihal De Silva1, Susan E Gardiner3, Heather Bassett3, David Chagné3, John McCallum8, Helge Dzierzon3, Cecilia Deng1, Yen-Yi Wang1, Lorna Barron1, Kelvina Manako1, Judith Bowen1, Toshi M Foster3, Zoe A Erridge3, Heather Tiffin3, Chethi N Waite3, Kevin M Davies3, Ella P Grierson3, William A Laing3, Rebecca Kirk1, Xiuyin Chen1, Marion Wood1, Mirco Montefiori1, David A Brummell3, Kathy E Schwinn3, Andrew Catanach8, Christina Fullerton1, Dawei Li7, Sathiyamoorthy Meiyalaghan8, Niels Nieuwenhuizen1, Nicola Read2, Roneel Prakash1, Don Hunter3, Huaibi Zhang3, Marian McKenzie3, Mareike Knäbel3, Alastair Harris2, Andrew C Allan1,2, Andrew Gleave1, Angela Chen2, Bart J Janssen1, Blue Plunkett1, Charles Ampomah-Dwamena1, Charlotte Voogd1, Davin Leif1,2, Declan Lafferty2, Edwige J F Souleyre1, Erika Varkonyi-Gasic1, Francesco Gambi1, Jenny Hanley2, Jia-Long Yao1, Joey Cheung2, Karine M David2, Ben Warren1, Ken Marsh1, Kimberley C Snowden1, Kui Lin-Wang1, Lara Brian1, Marcela Martinez-Sanchez1, Mindy Wang1, Nadeesha Ileperuma1, Nikolai Macnee1, Robert Campin1, Peter McAtee1, Revel S M Drummond1, Richard V Espley1, Hilary S Ireland1, Rongmei Wu1, Ross G Atkinson1, Sakuntala Karunairetnam1, Sean Bulley4, Shayhan Chunkath2, Zac Hanley1, Roy Storey4, Amali H Thrimawithana1, Susan Thomson8, Charles David8, Raffaele Testolin5,9, Hongwen Huang6,7, Roger P Hellens10, Robert J Schaffer11,12.
Abstract
BACKGROUND: Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models.Entities:
Keywords: Actinidia chinensis; Genome sequencing; Manual annotation
Mesh:
Substances:
Year: 2018 PMID: 29661190 PMCID: PMC5902842 DOI: 10.1186/s12864-018-4656-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1The pedigree tree for Red5 kiwifruit. Two siblings were crossed from open-pollinated seed from a red Actinidia chinensis. An F2 sibling cross from two individuals from this population resulted in Red5 with a predicted inbreeding coefficient of 37.5%
Construction metrics for the Actinidia chinensis genome
| Chromosome | Size (Mb) | Number of Scaffolds | Scaffold N50 (Kb) | Longest Scaffold (Mb) | Manually Annotated Genes | ||
|---|---|---|---|---|---|---|---|
| Number of Genes | Number of Transcripts | Genes Per 100 Kb | |||||
| 1 | 18.6 | 94 | 595.4 | 1.51 | 1125 | 1133 | 6.06 |
| 2 | 14.6 | 83 | 536.6 | 1.41 | 1091 | 1094 | 7.46 |
| 3 | 21.7 | 92 | 1154.9 | 2.96 | 1726 | 1729 | 7.94 |
| 4 | 13.8 | 75 | 420.4 | 2.39 | 781 | 781 | 5.67 |
| 5 | 18.6 | 113 | 655.2 | 4.44 | 960 | 976 | 5.16 |
| 6 | 17.4 | 109 | 474.8 | 1.56 | 1059 | 1069 | 6.09 |
| 7 | 20.0 | 99 | 666.0 | 1.82 | 1068 | 1075 | 5.33 |
| 8 | 26.1 | 134 | 779.6 | 1.68 | 1448 | 1459 | 5.55 |
| 9 | 16.6 | 66 | 598.4 | 1.95 | 1118 | 1123 | 6.74 |
| 10 | 19.3 | 111 | 413.0 | 0.95 | 986 | 994 | 5.10 |
| 11 | 16.9 | 74 | 520.3 | 1.26 | 1119 | 1137 | 6.63 |
| 12 | 19.2 | 122 | 480.2 | 1.42 | 1053 | 1054 | 5.49 |
| 13 | 19.5 | 77 | 926.8 | 1.65 | 1388 | 1402 | 7.11 |
| 14 | 17.9 | 75 | 651.2 | 2.32 | 1106 | 1129 | 6.18 |
| 15 | 15.9 | 67 | 707.9 | 2.13 | 1106 | 1121 | 6.94 |
| 16 | 23.8 | 152 | 388.5 | 1.46 | 1252 | 1255 | 5.26 |
| 17 | 17.4 | 100 | 387.2 | 0.91 | 931 | 934 | 5.34 |
| 18 | 20.7 | 85 | 850.4 | 2.80 | 1213 | 1219 | 5.85 |
| 19 | 15.4 | 117 | 345.9 | 2.12 | 653 | 656 | 4.24 |
| 20 | 17.9 | 89 | 512.5 | 1.53 | 1055 | 1056 | 5.88 |
| 21 | 17.3 | 71 | 871.2 | 2.16 | 1046 | 1064 | 6.04 |
| 22 | 18.9 | 105 | 462.5 | 1.42 | 1092 | 1096 | 5.76 |
| 23 | 27.7 | 79 | 802.6 | 2.15 | 2324 | 2327 | 8.39 |
| 24 | 17.8 | 59 | 1032.0 | 3.02 | 1198 | 1201 | 6.72 |
| 25 | 19.6 | 83 | 966.1 | 2.45 | 1008 | 1010 | 5.14 |
| 26 | 20.4 | 79 | 1086.4 | 3.41 | 1237 | 1247 | 6.07 |
| 27 | 21.0 | 112 | 461.8 | 1.27 | 1013 | 1040 | 4.83 |
| 28 | 15.8 | 75 | 964.5 | 2.80 | 1011 | 1012 | 6.38 |
| 29 | 18.0 | 130 | 402.3 | 0.69 | 993 | 1002 | 5.53 |
| Total for Chrs 1–29 | 548.0 | 2727 | 575.6 | 4.44 | 33,160 | 33,395 | 6.05 |
| 5.9 | 1206 | 5.4 | 0.66 | 97 | 97 | 1.64 | |
| Total | 553.9 | 3933 | 33,257 | 33,492 | 6.00 | ||
Fig. 2a. Genome construction with respect to the original submitted Actinidia chinensis ‘Hongyang’ genome, showing the rearrangement of some assigned chromosomes and b. the distribution of unallocated sequence (Chromosome 30) to the rest of the genome
Tissue for RNA-Seq data for the Actinidia chinensis genome
| Red5 tissue description | Number of reads |
|---|---|
| tissue culture whole plant | 16 M |
| orchard plant growing bud | 16 M |
| orchard plant flower | 19 M |
| orchard fruit 0 DAFB1 | 18 M |
| orchard fruit 14 DAFB1 | 0.2 M |
| orchard fruit 60 DAFB1 | 14 M |
| orchard fruit 76 DAFB1 | 59 M |
| orchard fruit 139 DAFB1 | 60 M |
| glasshouse pot plant root tip | 56 M |
| glasshouse pot plant root main | 58 M |
1DAFB days after full bloom
Fig. 3Duplicated regions in Actinidia chinensis Red5. a. whole genome lined up with whole genome at 85% homology. b. Alignments of Chr9 and Ch8 with Chr1, suggesting a translocation event has occurred. c. A schematic of the translocated chromosomes. A dashed “x” suggests a Robertsonian translocation has occurred at the centromeric regions. Chromosomes not aligned with a dashed “x” show regions of homology i.e. other proposed chromosomal rearrangements. Asterisk marks the sex chromosome (Chr25)
Fig. 4Examples of genes with transcription, and no transcription in intronic regions highlighting some of the challenges associated with manual annotation. a RNA-Seq reads in a single intron of gene Acc00023. b and c a comparison of RNA-Seq reads in two independent RNA-Seq libraries of gene Acc00022. B) RNA-Seq from a root library showing transcription across the intron and C) RNA-Seq from a flower library showing no intronic RNA-Seq reads. d Transcription between gene models Acc00024 and Acc00025, which could not be constructed into a gene model; the Actinidia chinensis ‘Hongyang’ predicted model line-up Achn243511 is shown in blue
Alignment of DNA for 812 bidirectionally sequenced Hort1A cDNA sequences (A) and 550 Hort16A predicted protein sequences (B & C) with the Red5, original ‘Hongyang’ [14], and revised ‘Hongyang’ [18] gene model sets
| Red5 Models | Original Hongyang Models | Revised Hongyang Models | |
|---|---|---|---|
| A | |||
| Hort16A cDNA with BLAT match identitya > = 60% | 635 | 465 | 510 |
| Hort16A cDNA with BLAT match identitya < 60% | 165 | 289 | 263 |
| Hort16A cDNA with no match | 12 | 58 | 39 |
| 812 | 812 | 812 | |
| B | |||
| Hort16A predicted protein with BLAT match identitya > = 60% | 541 | 462 | 481 |
| Hort16A predicted protein with BLAT match identitya < 60% | 6 | 76 | 60 |
| Hort16A predicted protein with no match | 3 | 12 | 9 |
| 550 | 550 | 550 | |
| C | |||
| % Identity Rangea | |||
| 0–59 | 6 | 76 | 60 |
| 60–64 | 2 | 23 | 7 |
| 65–69 | 2 | 21 | 15 |
| 70–74 | 5 | 20 | 16 |
| 75–79 | 2 | 36 | 19 |
| 80–84 | 13 | 39 | 28 |
| 85–89 | 17 | 54 | 47 |
| 90–94 | 17 | 72 | 67 |
| 95–99 | 339 | 168 | 231 |
| 100 | 144 | 29 | 51 |
a% identity was calculated as the number of matched bases (A) or matched amino acids (B & C) between Hort16A and the named gene set divided by the total length of the Hort16A cds sequence (A) or predicted protein sequence (B & C)
Fig. 5Phylogenetic alignment of alignable protein sequences from EXPANSIN and ACC SYNTHASE proteins. Proteins in red are reference Arabidopsis proteins; blue and purple are published Actinidia chinensis protein sequences (blue are correctly annotated published unnamed ‘Hongyang’ proteins represented by gene models, purple models are partially represented); black are models are unannotated in the 2 published ‘Hongyang’ protein sets. Each tree is rooted with a basal species. Bootstrap values of 1000 iterations are given. A. EXPANSIN proteins; vertical line represents EXPANSIN proteins (Grey) EXPANSIN LIKE B proteins (Yellow) and EXPANSIN LIKE A proteins (blue). Rooted with DdEXP2 (NCBI # gi|74,854,151) B. ACC SYNTASE Proteins. Rooted with PpACL1 (NCBI# EDQ51432.1)
Fig. 6Gene density along Actinidia chinensis chromosomes. a Comparison of gene density of Chr6 (depicted by image of genes along chromosome) and physical distance measured in bp along the chromosome and genetic distance in centimorgans (cM), measured in a mapping population. b The relationship between gene density and regions of homology between sister chromosomes. When the gene plots were aligned to a subset of homeologous chromosome arrangements, translocation events appear to be linked to areas of lower gene density (cut point marked by small vertical arrows), RC represents gene alignment images that have been reversed to aid alignments. c) An expanded view of Chr19 which demonstrates two regions of low homology to other kiwifruit chromosomes, and low gene density, surrounding a region of high homology and higher gene density marked by arrows
Analysis of single copy genes in the Actinidia chinensis genome
|
| Kiwifruit | ||||||
|---|---|---|---|---|---|---|---|
| Best hit 1 | Chr | Best hit 2 | Chr | Best hit 3 | Chr | ||
|
| AT2G13360 | Acc30351.1 |
| Acc32422.1 |
| ||
|
| At3g47810 | Acc29850.1 | 26 | Acc04108.1 |
| Acc02358.1 |
|
| At2g32520 | Acc13638.1 | 12 | |||||
|
| At3g52300 | Acc25925.1 | 23 | Acc30188.1 |
| Acc32243.1 |
|
|
| At5g06360 | Acc12882.1 |
| Acc17591.1 |
| ||
|
| At5g04600 | Acc04134.1 |
| Acc27568.1 |
| ||
|
| At2g21870 | Acc05500.1 | 5 | Acc10590.1 | 9 | ||
|
| At4g33250 | Acc12920.1 |
| Acc23419.1 |
| ||
|
| At4g30010 | Acc12893.1 |
| Acc17577.1 |
| Acc27046.1 | 23 |
|
| At1g77710 | Acc05095.1 |
| Acc24370.1 |
| ||
|
| At4g08230 | Acc05092.1 | 4 | ||||
|
| At4g37830 | Acc05943.1 |
| Acc30914.1 |
| ||
|
| At5g47570 | Acc16748.1 | 15 | Acc15793.1 |
| Acc19901.1 |
|
|
| At5g23290 | Acc28316.1 |
| Acc31379.1 |
| ||
|
| At1g27530 | Acc04831.1 |
| Acc23863.1 |
| ||
|
| At3g20390 | Acc21627 | 19 | ||||
|
| At5g63135 | Acc22027.1 | 19 | Acc26769.1 |
|
Best BLAST hits to the manually annotated gene models. Chromosome numbers in bold indicates the predicted homeologous chromosomes