Literature DB >> 28540354

Data on the exon-intron organization of genes coding for B-cell receptor-like proteins.

Sergey Y Morozov1, Anna V Pankratenko1, Anastasia K Atabekova1, Andrey G Solovyev1.   

Abstract

B-cell receptor-associated protein (BAP) family plays important roles in the ER homeostasis and stress responses of eukaryotic cells [1]. We reported the analysis of plant BAP-like (PBL) genes and the encoded proteins of higher land plants [2]. The origin and functional divergence of these genes among all eukaryotes, however, are poorly studied, which impedes our understanding of the functional relationships and diversity among BAP-like proteins. One possible reason for the potential functional diversity may be the differences in the exon-intron structure of PBL genes. In this study, we first performed analysis of the exon-intron organization of these genes in the genome sequences of the Viridiplantae species in addition to previously published data on Angiosperms [2]. To further address the distribution of BAP-like genes in other eukaryotes, we extended our dataset to include the representative genes encoded by non-plant bikonts and unikonts [3].

Entities:  

Keywords:  B-cell receptor protein; Bikonts; Exons; Gene organization; Introns; Unikonts

Year:  2017        PMID: 28540354      PMCID: PMC5430146          DOI: 10.1016/j.dib.2017.05.011

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data The BAP-like genes are usually combined in small gene families and encoded by all well-studied eukaryotic taxons belonging to bikonts and unikonts. Data on BAP-like gene organizations enable researchers to compare how these genes evolved during progression of different eukaryotic branches resulted, particularly, in appearance of mammals and flowering plants. Data on BAP-like proteins and genes are intriguing to understand their unique features in different taxons. Data on gene organizations enable researchers to infer the possible ranges of time frames in the drastic divergence events (particularly, intron gain and loss) of BAP-like genes.

Data and experimental design

Annotation information was obtained for 29 representative genes in bikonts (Table 1) and 81 genes in unikonts (Table 2) [[1], [2], [3]]using Blast2GO program.
Table 1

Features of BAP-like proteins and genes in the selected Biconta speciesa.

Species nameFeatures of proteins and genes
NCBI accession numberTaxonProtein lengthProteinpINumber of intronsLength of introns
Amborella trichopoda (NW_006496807)Magnoliophyta Amborellales220 aa9.18119293 bp
Amborella trichopoda (NW_006499207)Magnoliophyta Amborellales220 aa9.88114781 bp
Pinus lambertiana (LMTP010039273)Acrogymnospermae Pinales228 aa9.61182221 bp
Pseudotsuga menziesii (LPNX010213588)Acrogymnospermae Pinales228 aa9.59171087 bp
Pinus taeda (APFE020507172)Acrogymnospermae Pinales230 aa9.37166410 bp
Selaginella moellendorffii (NW_003314330)Lycopodiopsida Selaginellales233 aa9.46168 bp
Physcomitrella patens (XP_001766710)Bryopsida Funariales257 aa9.402222 bp; 71 bp
Sphagnum fallax (Sphfalx0532s0001b)Sphagnopsida Sphagnales238 aa9.582996 bp; 329 bp
Sphagnum fallax (Sphfalx0012s0168b)Sphagnopsida Sphagnales238 aa9.582992 bp; 329 bp
Sphagnum fallax (Sphfalx0223s0006b)Sphagnopsida Sphagnales213 aa9.9021010 bp; 331 bp
Sphagnum fallax (Sphfalx0012s0164b)Sphagnopsida Sphagnales210 aa9.2821013 bp; 331 bp
Marchantia polymorpha (OAE31879)Marchantiopsida Marchantiales189 aa5.912140 bp 681 bp
Klebsormidium flaccidum (BANV01001480)Klebsormidiophyceae Klebsormidiales247 aa8.672487 bp 330 bp
Chlorella variabilis (XP_005842886)Chlorophyta Chlorellales291 aa5.645176 bp 191 bp 98 bp 189 bp 255 bp
Chlamydomonas reinhardtii (XP_005842886)Chlorophyta Chlamydomonadales259 aa9.115291 bp 264 bp 327 bp 352 bp 171 bp
Monoraphidium neglectum (XP_013903892)Chlorophyta Sphaeropleales232 aa9.483182 bp 190 bp 243 bp
Volvox carteri (XP_002952600)Chlorophyta Chlamydomonadales255 aa9.304166 bp 752 bp 2055 bp 434 bp
Galdieria sulphuraria (XP_002952600)Rhodophyta Cyanidiales254 aa9.91446 bp 51 bp 56 bp 53 bp
Phytophthora infestans (XP_002898022)Oomycetes Peronosporales275 aa5.601114 bp
Saprolegnia diclina (XP_008621469)Oomycetes Saprolegniales268 aa6.75251 bp 47 bp
Aphanomyces invadans (XP_008869471)Oomycetes Saprolegniales270 aa6.32253 bp 88 bp
Albugo candida (CCI47927)Oomycetes Albuginales281 aa7.62349 bp 30 bp 57 bp
Guillardia theta (XP_005830762)Cryptophyta Pyrenomonadales200 aa8.78851 bp 158 bp 49 bp 47 bp 45 bp 50 bp 47 bp 51 bp
Nannochloropsis gaditana (EWM21023)Eustigmatophyceae Eustigmatales261 aa9.782160 bp 263 bp
Ectocarpus siliculosus (CBN78175)Phaeophyceae Ectocarpales141 aa10.283782 bp 1315 bp 1091 bp
Plasmodium fragile (XP_012337233)Alveolata Aconoidasida209 aa9.480cNo introns
Toxoplasma gondii (KYF38827)Alveolata Conoidasida290 aa9.390No introns
Perkinsus marinus (XP_002786272)Alveolata Perkinsida188 aa6.33448 bp 59 bp 457 bp 200 bp
Bodo saltans (CUF18891)Euglenozoa Kinetoplastida163 aa10.140No introns

Amborella trichopoda is included as most basal Angiosperm representative.

Phytozome accession number.

Only introns in the protein coding region are counted.

Table 2

Features of BAP-like proteins and genes in the selected Uniconta species.

Species nameFeatures of proteins and genes
NCBI accession numberTaxonProtein lengthProteinpINumber of intronsLength of introns
Homo sapiens BAP 31 (EAW72818)Chordata246 aa8.4462181 bp 5183 bp 11448 bp 901 bp 828 bp 1032 bp
Mammalia
Homo sapiens BAP 29 (AAP35627)Chordata241 aa9.5563018 bp 9973 bp 1762 bp 4395 bp 2827 bp 8198 bp
Mammalia
Bos taurus BAP31 (AC_000187)Chordata245 aa9.2061421 bp 3620 bp 19548 bp 856 bp 795 bp 868 bp
Mammalia
Bos taurus BAP29 (NP_001033164)Chordata Mammalia240 aa9.6065274 bp 8892 bp 1426 bp 4620 bp 21723 bp 4683 bp
Egretta garzetta(NW_009259250)Chordata Aves243 aa9.1163508 bp 2158 bp 4591 bp 2594 bp 7753 bp 1416 bp
Anolis carolinensis(XP_003216939)Chordata Lepidosauria247 aa8.7461679 bp 14262 bp 4948 bp 10491 bp 6440 bp 8714 bp
Xenopus laevis BAP29 (AAH76818)Chordata Amphibia243 aa9.3962408 bp 1685 bp 2474 bp 795 bp 3921 bp 555 bp
Xenopus laevis BAP31 (NP_001086173)Chordata Amphibia244 aa8.656950 bp 4065 bp 3477 bp 273 bp 1233 bp 6242 bp
Danio rerio(AAH49014)Chordata Actinopterygii247 aa8.7361202 bp 1868 bp 15175 bp 18469 bp 3950 bp 12955 bp
Teleostei
Latimeria chalumnae(XP_005999374)Chordata Actinopterygii246 aa8.90410199 bp 16405 bp 1106 bp 20131 bp
Coelacanthidae
Ciona savignyi (H2YK40a)Chordata Tunicata Ascidiacea251 aa6.8461005 bp 789 bp 570 bp 469 bp 226 bp 1015 bp
Oikopleura dioica(CBY14290)Chordata Tunicata Appendicularia235 aa7.614179 bp 47 bp 58 bp 47 bp
Branchiostoma floridae(XP_002606682)Chordata Cephalochordata Branchiostomidae242 aa6.986512 bp 3061 bp 1594 bp 869 bp 450 bp 598 bp
Strongylocentrotus purpuratus(XP_791898)Echinodermata Eleutherozoa Echinozoa161 aa7.763386 bp 826 bp 715 bp
Capitella teleta(ELU03196)Lophotrochozoa Annelida Polychaeta236 aa8.87552 bp 105 bp 732 bp 3501 bp 54 bp
Helobdella robusta(XP_009022177)Lophotrochozoa Annelida Clitellata234 aa9.226144 bp 249 bp 91 bp 3212 bp 116 bp 176 bp
Lingula anatine(XP_013411827)Lophotrochozoa Brachiopoda Linguliformea244 aa6.56710030 bp 354 bp 2478 bp 123 bp 243 bp 209 bp 242 bp
Crassostrea gigas(XP_011435954)Lophotrochozoa Mollusca Bivalvia237 aa7.576819 bp 2358 bp 677 bp 403 bp 613 bp 776 bp
Aplysia californica(XP_005090966)Lophotrochozoa Mollusca Gastropoda246 aa6.2551043 bp 921 bp 5421 bp 332 bp 1603 bp
Lottia gigantea(XP_009058881)Lophotrochozoa Mollusca Gastropoda237 aa9.196611 bp 1442 bp 446 bp 1115 bp 433 bp 591 bp
Biomphalaria glabrata(XP_013084046)Lophotrochozoa Mollusca Gastropoda244 aa6.415988 bp 3021 bp 808 bp 146 bp 499 bp
Octopus bimaculoides(XP_014771618)Lophotrochozoa Mollusca Cephalopoda240 aa9.240No introns
Philodina roseola(ACI90387)Lophotrochozoa Rotifera Bdelloidea250 aa9.44673 bp 60 bp 50 bp 63 bp 95 bp 68 bp
Clonorchis sinensis(GAA53952)Platyhelminthes Trematoda256 aa7.807345 bp 52 bp 4525 bp 415 bp 2538 bp 3249 bp 3650 bp
Opisthorchiida
Dictyocaulus viviparous(KJH45191)Ecdysozoa Nematoda Rhabditida221 aa8.715171 bp 78 bp 68 bp 50 bp 186 bp
Caenorhabditis elegans(NP_500267)Ecdysozoa Nematoda Rhabditida213 aa9.552109 bp 55 bp
Ascaris suum(ERG85477)Ecdysozoa Nematoda Ascaridida235 aa6.2361187 bp 1089 bp 270 bp 626 bp 711 bp 315 bp
Wuchereria bancrofti(EJW86459)Ecdysozoa Nematoda Spirurida210 aa8.80594 bp 121 bp 190 bp 209 bp 216 bp
Ixodes scapularis(XP_002403043)Ecdysozoa Arthropoda Acari243 aa9.087106 bp 1304 bp 319 bp 248 bp 1219 bp 1795 bp 4918 bp
Parasteatoda tepidariorum(XP_015910252)Ecdysozoa Arthropoda Araneae236 aa8.6962745 bp 2307 bp 2605 bp 2023 bp 2331 bp 7025 bp
Limulus polyphemus(XP_013785233)Ecdysozoa Arthropoda Merostomata233 aa8.3965228 bp 3518 bp 2708 bp 673 bp 10338 bp 1550 bp
Daphnia pulex(EFX80917)Ecdysozoa Arthropoda Crustacea236 aa7.80470 bp 95 bp 62 bp 62 bp
Hyalella azteca(XP_018009243)Ecdysozoa Arthropoda Crustacea222 aa8.843502 bp 354 bp 224 bp
Orchesella cincta(ODM98038)Ecdysozoa Arthropoda Hexapoda Entomobryomorpha226 aa9.344100 bp 78 bp 83 bp 83 bp
Bombyx mori(XP_004923642)Ecdysozoa Arthropoda Hexapoda Insecta224 aa8.510No introns
Lepidoptera
Papilio machaon(XP_014360347)Ecdysozoa Arthropoda Hexapoda Insecta222 aa7.770No introns
Lepidoptera
Plutella xylostella(XP_011563833)Ecdysozoa Arthropoda Hexapoda Insecta Lepidoptera223 aa8.810No introns
Agrilus planipennis(XP_018325264)Ecdysozoa Arthropoda Hexapoda Insecta Coleoptera228 aa8.89360 bp 63 bp 719 bp
Nicrophorus vespilloides(XP_017778042)Ecdysozoa Arthropoda Hexapoda Insecta Coleoptera230 aa9.222101 bp 57 bp
Tribolium castaneum(XP_015833546)Ecdysozoa Arthropoda Hexapoda Insecta Coleoptera238 aa9.37347 bp 1864 bp 50 bp
Drosophila elegans(XP_017127683)Ecdysozoa Arthropoda Hexapoda Insecta Diptera228 aa9.041968 bp
Aedes albopictus(KXJ82346)Ecdysozoa Arthropoda Hexapoda Insecta Diptera232 aa9.4629892 bp 208 bp
Musca domestica(XP_005189051)Ecdysozoa Arthropoda Hexapoda Insecta Diptera233 aa6.8616715 bp
Limnephilus lunatus(JDSM01012005)Ecdysozoa Arthropoda Hexapoda Insecta Trichoptera230 aa9.020No introns
Apis mellifera(XP_397055)Ecdysozoa Arthropoda Hexapoda Insecta Hymenoptera229 aa6.921128 bp
Solenopsis invicta(XP_011173333)Ecdysozoa Arthropoda Hexapoda Insecta Hymenoptera232 aa9.46178 bp
Polistes dominula(XP_015172100)Ecdysozoa Arthropoda Hexapoda Insecta Hymenoptera234 aa9.141571 bp
Zootermopsis nevadensis(KK853387)Ecdysozoa Arthropoda Hexapoda Insecta Isoptera231 aa6.540No introns
Acyrthosiphon pisum(NP_001155477)Ecdysozoa Arthropoda Hexapoda Insecta Hemiptera212 aa7.900No introns
Pediculus humanus corporis(XP_002429237)Ecdysozoa Arthropoda Hexapoda Insecta Phthiraptera224 aa9.323436 bp 92 bp 216 bp
Homalodisca liturata(GECU01024317)Ecdysozoa Arthropoda Hexapoda Insecta Hemiptera229 aa8.890No introns
Exaiptasia pallida(KXJ12234)Cnidaria Anthozoa Hexacorallia Actiniaria245 aa6.395167 bp 801 bp 134 bp 69 bp 454 bp
Acropora digitifera(XP_015771945)Cnidaria Anthozoa Hexacorallia Scleractinia178 aa4.935114 bp 528 bp 3385 bp 456 bp 748 bp
Hydra vulgaris(XP_004209484)Cnidaria Hydrozoa242 aa9.3245752 bp 18903 bp 9438 bp 75 bp
Intoshia linei(OAF65985)Mesozoa Orthonectida220 aa9.62144 bp
Trichoplax adhaerens(XP_002113615)Placozoa Trichoplax167 aa9.3431456 bp 340 bp 267 bp
Amphimedon queenslandica(XP_011406596)Porifera Demospongiae172 aa6.21563 bp 1057 bp 47 bp 54 bp 107 bp
Monosiga brevicollis(XP_001745234)Choanoflagellida Codonosigidae230 aa9.044189 bp 125 bp 139 bp 98 pb
Salpingoeca rosetta(XP_004993388)Choanoflagellida Salpingoecidae185 aa9.745143 bp 251 bp 250 bp 239 bp 162 bp
Capsaspora owczarzaki(XP_004345792)Ichthyosporea Capsaspora216 aa9.436151 bp 181 bp 87 bp 96 bp 171 bp 103 bp
Fonticula alba(XP_009495214)Nucleariidae and Fonticula group226 aa9.246109 bp 176 bp 115 bp 148 bp 313 bp 232 bp
Saccharomyces cerevisiae Yet3p (AJU84316)Fungi Dikarya Ascomycota Saccharomycotina203 aa5.930No introns
Candida glabrata(XP_445444)Fungi Dikarya Ascomycota Saccharomycotina191 aa9.370No introns
Penicillium italicum(KGO75036)Fungi Dikarya Ascomycota Pezizomycotina210 aa9.133140 bp 57 bp 57 bp
Aspergillus calidoustus(CEN62854)Fungi Dikarya Ascomycota Pezizomycotina213 aa8.91366 bp 51 bp 125 bp
Mycena chlorophos(GAT59077)Fungi Dikarya Basidiomycota Agaricomycotina208 aa9.36655 bp 54 bp 54 bp 50 bp 57 bp 52 bp
Ustilago maydis(XP_011388692)Fungi Dikarya Basidiomycota Ustilaginomycotina192 aa9.42282 bp 95 bp
Allomyces macrogynus(KNE58291)Fungi Blastocladiomycota Blastocladiomycetes239 aa6.99481 bp 73 bp 80 bp 73 bp
Spizellomyces punctatus(XP_016610704)Fungi Chytridiomycota Chytridiomycetes210 aa9.33972 bp 72 bp 61 bp 78 bp 46 bp 64 bp 64 bp 57 bp 62 bp
Gonapodya prolifera(KXS12654)Fungi Chytridiomycota Monoblepharido- mycetes197 aa7.89666 bp 65 bp 76 bp 60 bp 64 bp 64 bp
Conidiobolus coronatus(KXN68246)Fungi Entomophthoro-mycota207 aa7.75663 bp 42 bp 53 bp 54 bp 58 bp 52 bp
Rhizophagus irregularis(ESA18893)Fungi Glomeromycota Glomeromycetes211 aa7.77885 bp 87 bp 72 bp 66 bp 68 bp 72 bp 81 bp 77 bp
Encephalitozoon intestinalis(XP_003072841)Fungi Microsporidia Unikaryonidae199 aa9.340No introns
Edhazardia aedis(EJW03664)Fungi Microsporidia Edhazardia192 aa9.680No introns
Nosema bombycis(EOB11729)Fungi Microsporidia Nosematidae197 aa9.300No introns
Thecamonas trahens(XP_013761453)Apusozoa Apusomonadidae150 aa9.751427 bp
Entamoeba invadens(XP_004260006)Amoebozoa Archamoebae Entamoebidae167 aa6.591121 bp
Acanthamoeba castellanii(XP_004337423)Amoebozoa Discosea Longamoebia223 aa6.062100 bp 280 bp
Polysphondylium pallidum(EFA82930)Amoebozoa Mycetozoa Dictyosteliida206 aa9.400No introns
Acytostelium subglobosum(XP_012752758)Amoebozoa Mycetozoa Dictyosteliida203 aa9.400No introns
Dictyostelium lacteum(KYR02632)Amoebozoa Mycetozoa Dictyosteliida206 aa9.370No introns

UniProt (http://www.uniprot.org/) accession number.

Features of BAP-like proteins and genes in the selected Biconta speciesa. Amborella trichopoda is included as most basal Angiosperm representative. Phytozome accession number. Only introns in the protein coding region are counted. Features of BAP-like proteins and genes in the selected Uniconta species. UniProt (http://www.uniprot.org/) accession number.

Materials and methods

Annotation of the predicted genes and proteins was mined at the National Center for Biotechnology Information database (NCBI) and Phytozome database, version 11. Additional annotation of other predicted BAP-like proteins and genes was extracted from The UniProt Knowledgebase database (http://www.uniprot.org/).
Subject areaBiology
More specific subject areaPhylogenomics
Type of dataTable
How data was acquiredGene sequence organizations and protein parameters were derived using ExPASy software, NCBI, Dendrome and Phytozome databases (see below)
Data formatAnalyzed
Experimental factorsAmino acid and nucleotide sequences were retrieved from NCBI, Dendrome and Phytozome databases (see below).
Experimental featuresGene and protein sequences were derived using NCBI, Dendrome and Phytozome databases (see below)
Data source locationNCBI: http://www.ncbi.nlm.nih.gov/
Phytozome v11: http://www.phytozome.net/
ExPASy: http://www.expasy.org/tools/
Dendrome: http://dendrome.ucdavis.edu/
Data accessibilityWith this article
  3 in total

1.  Phylogenetic and functional analyses of a plant protein related to human B-cell receptor-associated proteins.

Authors:  Anastasia K Atabekova; Anna V Pankratenko; Svetlana S Makarova; Ekaterina A Lazareva; Robert A Owens; Andrey G Solovyev; Sergey Y Morozov
Journal:  Biochimie       Date:  2016-10-19       Impact factor: 4.079

Review 2.  Molecular paleontology and complexity in the last eukaryotic common ancestor.

Authors:  V Lila Koumandou; Bill Wickstead; Michael L Ginger; Mark van der Giezen; Joel B Dacks; Mark C Field
Journal:  Crit Rev Biochem Mol Biol       Date:  2013 Jul-Aug       Impact factor: 8.250

3.  Structural and biophysical characterization of the cytoplasmic domains of human BAP29 and BAP31.

Authors:  Esben M Quistgaard; Christian Löw; Per Moberg; Fatma Guettou; Karthik Maddi; Pär Nordlund
Journal:  PLoS One       Date:  2013-08-13       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.