Pierre Balmer, Anina Bauer, Shashikant Pujar, Kelly M McGarvey, Monika Welle, Arnaud Galichet, Eliane J Müller, Kim D Pruitt, Tosso Leeb, Vidhya Jagannathan.
Abstract
Keratins represent a large protein family with essential structural and functional roles in epithelial cells of skin, hair follicles, and other organs. During evolution, the genes encoding keratins have undergone multiple rounds of duplication, and humans have two clusters with a total of 55 functional keratin genes in their genome. Due to the high similarity between different keratin paralogs and species-specific differences in gene content, the currently available keratin gene annotation in species with draft genome assemblies, such as dog and horse, is still imperfect. We compared the National Center for Biotechnology Information (NCBI) (dog annotation release 103, horse annotation release 101) and Ensembl (release 87) gene predictions for the canine and equine keratin gene clusters to RNA-seq data that were generated from adult skin of five dogs and two horses and from adult hair follicle tissue of one dog. Taking into consideration the conserved exon/intron structure of keratin genes, we annotated 61 putatively functional keratin genes in each of the dog and horse genomes. Subsequently, curators in the RefSeq group at NCBI reviewed their annotation of keratin genes in the dog and horse genomes (Annotation Release 104 and Annotation Release 102, respectively) and updated the annotation and gene nomenclature of several keratin genes. The updates are now available in the NCBI Gene database (https://www.ncbi.nlm.nih.gov/gene).
Year: 2017 PMID: 28846680 PMCID: PMC5573215 DOI: 10.1371/journal.pone.0180359
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Comparative map of the keratin type I gene cluster.
Type I keratin genes except KRT18 in the dog, human and horse genomes. Arrows indicate the orientation of the genes. Note that CFA 9 and ECA 11 are represented in reverse orientation with decreasing coordinates from top to bottom. The genes’ nomenclature in the figure corresponds to our updated nomenclature for keratin genes (see Methods). S3 Table details the correspondence of gene symbols to NCBI Gene IDs and RefSeq IDs.
Fig 2. Frameshift deletion in exon 6 of the equine KRT9 gene.
(A) Alignment of exon 6 of equine KRT9P against the human ortholog KRT9. The yellow block marks the deletion of a single base in the horse exon, which leads to a frameshift. (B) Alignments of Illumina whole genome sequence and RNA-seq reads confirm that the deletion is not a sequencing error in the equine reference sequence. (C) Expanded view of the alignment region containing the frameshift deletion.
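The effect of a single-base deletion on the reading frame can be illustrated with a minimal Python sketch. The sequence below is a made-up toy string, not the KRT9 exon 6 sequence, and the helper names (`codons`, `delete_base`, `first_changed_codon`) are hypothetical names chosen for this illustration.

```python
def codons(seq):
    """Split a nucleotide sequence into complete codons, starting in frame 0."""
    usable = len(seq) - len(seq) % 3  # ignore trailing bases that do not fill a codon
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def delete_base(seq, pos):
    """Return seq with the single base at 0-based index pos removed."""
    return seq[:pos] + seq[pos + 1:]


def first_changed_codon(ref_codons, mut_codons):
    """Return the index of the first codon that differs, or None if none differ."""
    for i, (ref, mut) in enumerate(zip(ref_codons, mut_codons)):
        if ref != mut:
            return i
    return None


# Toy sequence for illustration only (assumption: not taken from any keratin gene).
reference = "ATGGCCAAGCTGGAATTC"
mutant = delete_base(reference, 7)  # remove one base inside the third codon

ref_codons = codons(reference)
mut_codons = codons(mutant)

print(ref_codons)
print(mut_codons)
print(first_changed_codon(ref_codons, mut_codons))
```

Every codon from the deletion onward is read in a shifted frame, which is why a single missing base can be enough to classify a gene as a putative pseudogene once a sequencing error has been ruled out.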
Details of the genes in the keratin type I gene cluster on HSA 17, CFA 9, and ECA 11.
| HSA 17 (GRCh38.p2) | | | CFA 9 (CanFam3.1) | | | ECA 11 (EquCab 2.0) | | |
|---|---|---|---|---|---|---|---|---|
| Gene symbol | Exons | Encoded amino acids | Gene symbol | Exons | Encoded amino acids | Gene symbol | Exons | Encoded amino acids |
| | 8 | 983 | | 8 | 674 | | (9) | (P) |
| | 8 | 584 | | 8 | 568 | | 8 | 559 |
| | - | - | | - | - | | 8 | 596 |
| | 8 | 494 | | 8 | 507 | | 8 | 488 |
| | 8 | 458 | | 8 | 452 | | 8 | 451 |
| | 8 | 472 | | 8 | 482 | | 8 | 478 |
| | 8 | 456 | | 8 | 472 | | 8 | 469 |
| | 8 | 473 | | 8 | 477 | | 8 | (472) |
| | 8 | 432 | | 8 | 433 | | 8 | 433 |
| | 6 | 400 | | 6 | 399 | | 6 | 411 |
| | 8 | 424 | | 8 | 434 | | 8 | 424 |
| | 8 | 422 | | 8 | 419 | | 8 | 428 |
| | 8 | 525 | | 8 | 541 | | 8 | 506 |
| | 8 | 450 | | 8 | 450 | | 8 | 450 |
| | 8 | 468 | | 8 | 464 | | 8 | 469 |
| | 8 | 459 | | 8 | 446 | | 8 | 460 |
| | 8 | 464 | | 8 | 464 | | 8 | 464 |
| | 7 | 416 | | 7 | 409 | | 7 | 416 |
| | 7 | 448 | | 7 | 448 | | 7 | 448 |
| | 7 | 404 | | 7 | 404 | | 7 | 404 |
| | 7 | 404 | | 7 | 407 | | 7 | 404 |
| | 7 | 436 | | 7 | 393 | | 7 | 393 |
| | 7 | 455 | | 7 | 455 | | 7 | 455 |
| | 7 | 467 | | 7 | 467 | | 7 | 461 |
| | 7 | 449 | | 7 | 440 | | 7 | 444 |
| | 7 | 456 | | 7 | 457 | | - | - |
| | 7 | 491 | | 7 | 483 | | 7 | 492 |
| | 7 | 431 | | 7 | 431 | | 7 | 431 |
| | Pc | P | | 7 | 437 | | 7 | (P) |
| | Pc | P | | 10 | (492) | | 8 | 453 |
| | Pc | P | | - | - | | - | - |
| | 6 | 295 | | 6 | 295 | | 6 | 295 |
| | Pc | P | | - | - | | - | - |
a The gene symbols have been updated by NCBI RefSeq curators. A comprehensive listing of gene symbols and NCBI Gene IDs is given in S3 Table.
b Data in brackets are of low reliability due to gaps in the genome reference assemblies and/or insufficient coverage in the canine and equine RNA-seq data.
c P indicates pseudogenes.
Fig 3. Comparative map of the keratin type II gene cluster.
Type II keratin genes and KRT18 in the dog (CanFam3.1), human (GRCh38.p2) and horse genomes (EquCab 2.0). Arrows indicate the orientation of the genes. Note that CFA 27 is represented in reverse orientation with decreasing coordinates from top to bottom. The genes’ nomenclature in the figure corresponds to our updated nomenclature for keratin genes (see Methods). S3 Table details the correspondence of gene symbols to NCBI Gene IDs and RefSeq IDs.
Details of the genes in the keratin type II gene cluster on HSA 12, CFA 27, and ECA 6.
| HSA 12 (GRCh38.p2) | | | CFA 27 (CanFam3.1) | | | ECA 6 (EquCab 2.0) | | |
|---|---|---|---|---|---|---|---|---|
| Gene symbol | Exons | Encoded amino acids | Gene symbol | Exons | Encoded amino acids | Gene symbol | Exons | Encoded amino acids |
| | 9 | 644 | | 9 | 619 | | 9 | 626 |
| | 9 | 639 | | 9 | 635 | | 9 | 629 |
| | - | - | | - | - | | 9 | 618 |
| | - | - | | - | - | | (9) | (618) |
| | 9 | 628 | | 9 | 610 | | 9 | 630 |
| | 9 | 520 | | 9 | 530 | | 9 | 607 |
| | 9 | 590 | | 9 | 597 | | 9 | 593 |
| | 9 | 564 | | 9 | 570 | | 9 | 574 |
| | 9 | 564 | | 9 | 570 | | 9 | 562 |
| | 9 | 564 | | - | - | | 9 | 562 |
| | - | - | | - | - | | P | P |
| | 9 | 469 | | 9 | 468 | | 9 | 465 |
| | 9 | 511 | | 9 | 491 | | 9 | 435 |
| | 7 | 430 | | 7 | 431 | | 6 | 430 |
| | 9 | 523 | | 9 | 525 | | 9 | 525 |
| | 9 | 511 | | 9 | 523 | | 9 | 523 |
| | 9 | 540 | | 9 | 589 | | 9 | 540 |
| | 9 | 529 | | 9 | 533 | | (6) | (392) |
| | 9 | 551 | | 9 | 551 | | 9 | 549 |
| | 9 | 638 | | 9 | 648 | | 9 | 640 |
| | 9 | 578 | | (9) | (561) | | 9 | 593 |
| | 9 | 520 | | 9 | 517 | | 9 | 499 |
| | 9 | 535 | | 9 | 535 | | 9 | 535 |
| | 9 | 452 | | 9 | 453 | | 9 | 453 |
| | 9 | 505 | | 9 | 513 | | 9 | 507 |
| | 9 | 513 | | 9 | 518 | | 9 | 518 |
| | 9 | 493 | | 9 | 487 | | 9 | 491 |
| | 9 | 600 | | 9 | 586 | | 9 | 575 |
| | 9 | 507 | | 9 | 507 | | 9 | 507 |
| | 9 | 486 | | 9 | 485 | | 9 | 486 |
| | P | P | | P | P | | (8) | (P) |
| | P | P | | (5) | (153) | | (9) | (483) |
| | P | P | | 9 | 495 | | 9 | 494 |
| | P | P | | (9) | (531) | | 9 | 521 |
| | P | P | | - | - | | - | - |
| | P | P | | 9 | 630 | | (n.d.) | (P) |
| | P | P | | 9 | 581 | | - | - |
| | P | P | | - | - | | (10) | (514) |
| | - | - | | - | - | | (P) | (P) |
a The gene symbols have been updated by NCBI RefSeq curators. A comprehensive listing of gene symbols and NCBI Gene IDs is given in S3 Table.
b Data in brackets are of low reliability due to gaps in the genome reference assemblies and/or insufficient coverage in the canine and equine RNA-seq data.
c P indicates pseudogenes.
Fig 4. Keratin gene structure.
(A) Typical type I keratin gene with eight exons. (B) Typical type II keratin gene with nine exons. The lengths of conserved exons are given in bp. Untranslated regions are shown as open rectangles.
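The typical exon counts in this figure suggest a simple sanity check when reviewing predicted gene models. The Python sketch below flags models whose exon count differs from the typical value for their keratin type. The gene identifiers and the `flag_atypical` helper are hypothetical and for illustration only. An atypical count is not an error in itself, since the tables above also list putatively functional genes with six or seven exons, so flagged models are only candidates for manual inspection.

```python
# Typical exon counts per keratin type, as stated in the Fig 4 caption.
TYPICAL_EXON_COUNT = {"I": 8, "II": 9}


def flag_atypical(models):
    """Return the (gene_id, keratin_type, exon_count) tuples whose exon count
    differs from the typical count for that keratin type."""
    return [
        (gene_id, keratin_type, exon_count)
        for gene_id, keratin_type, exon_count in models
        if exon_count != TYPICAL_EXON_COUNT[keratin_type]
    ]


# Hypothetical gene models; the identifiers are placeholders, not real gene symbols.
models = [
    ("geneA", "I", 8),
    ("geneB", "I", 7),
    ("geneC", "II", 9),
    ("geneD", "II", 6),
]

for gene_id, keratin_type, exon_count in flag_atypical(models):
    print(f"{gene_id}: type {keratin_type}, {exon_count} exons (review manually)")
```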