Literature DB >> 28480320

Data of 10 SSR markers for genomes of homo sapiens and monkeys.

K K V V V S Reddy1, S Viswanadha Raju2, Chinta Someswara Rao3.   

Abstract

In this data, we present 10 Simple Sequence Repeat(SSR) markers TAGA, TCAT, GAAT, AGAT, AGAA, GATA, TATC, CTTT, TCTG and TCTA which are extracted from the genomes of homo sapiens and monkeys using string matching mechanism [1]. All loci showed 4 Base Pair(bp) in allele size, indicating that there are some polymorphisms between individuals correlating to the number of SSR repeats that maybe useful for the detection of similarity among the genotypes. Collectively, these data show that the SSR extraction is a valuable method to illustrate genetic variation of genomes.

Entities:  

Keywords:  Genomes; Homo sapiens; Monkeys; SSR

Year:  2017        PMID: 28480320      PMCID: PMC5407499          DOI: 10.1016/j.dib.2017.04.010

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data Data sets obtained from genomes of homo sapiens and monkeys with string matching have shown the high specificity. These data suggest that SSR extraction is an useful method for providing information for various detections. Access to the raw sequencing data allows researchers to perform further bio-informatics analysis based on their own computational algorithms.

Data

10 SSR markers data which were extracted from genomes of Homo sapiens and Monkeys are shown in Table 1. The data presented here shows that the SSR extraction with string matching was very useful and was able to reveal variation of selected genome collections. These SSR markers can be used to assess maternity, paternity, personal and theft identifications. All chromosomes of Homo sapiens and monkeys (Callithrix jacchus, Chlorocebus sabaeus, Gorilla gorilla,Macaca fascicularis, Macacamulatta, Nomascus leucogenys, Pan troglodytes, Papio anubis and Pongo abelli) are considered for the extraction of the 10 SSR markers which are shown in Table 1 [1].
Table 1

Genome sequences used to extract 10 SSRs.

Genome setNumber of chromosomesTotal Number of Tandem Repeats extracted(≥3)
Homo sapiens1 to 22, MT, X, Y and Un(26)12,83,780
Callithrix jacchus1 to 22, X, Y and Un(25)12,21,444
Chlorocebus sabaeus1 to 29, MT, X, Y and Un(33)12,48,422
Gorilla gorilla1, 2A, 2B, 3 to 22, MT, X and Un(26)12,23,871
Macaca fascicularis1 to 20, MT, X and Un(23)
Macaca mulatta1 to 20, MT, X and Un(23)13,73,963
Nomascus leucogenys1 to 6, 7b, 8 to 21, 22a, 23 to 25, X and Un(27)12,77,214
Pan troglodytes1, 2A, 2B, 3 to 22, MT, X, Y and Un(27)13,07,857
Papio anubis1 to 20, MT, X and Un(23)13,97,131
Pongo abelli1, 2 A, 2B, 3 to 22, MT, X and Un(26)13,88,580
10259

10 SSR markers overall count in homo sapiens and monkeys

Table 2 shows the 10 SSRs overall count of Callithrix jacchus, Chlorocebus sabaeus, Gorilla gorilla, Homo sapiens, Macaca fascicularis, Macacamulatta, Nomascus leucogenys, Pan troglodytes, Papio anubis and Pongo abelli respectively.
Table 2

10 SSRs successive occurrences for all chromosomes of homo sapiens and monkeys.

callithrix_jacchus
chlorocebus_sabaeus
gorilla_gorilla
homo_sapiens
macaca_fascicularis
SSRs
COUNT
%
COUNT
%
COUNT
%
COUNT
%
COUNT
%
TAGA54,3674.45154,2324.34455,0324.49758,5154.55855,1205.317
TCAT126,04610.319129,34810.361129,57910.588134,40110.469132,75612.805
GAAT125,57110.281117,5809.418118,0959.649121,2199.442117,72511.355
AGAT63,6395.2164,0765.13364,2235.24867,6095.26665,7996.347
AGAA321,54126.325337,17227.008310,95325.407335,77426.155383,09036.952
GATA50,4364.12952,5844.21251,1714.18155,2154.30154,6095.267
TATC49,4574.04952,7974.22951,2264.18655,0394.28753,1955.131
CTTT287,85623.567298,31423.895299,73724.491308,98024.06831,0322.993
TCTG88,9887.28587,4577.00588,5787.23888,7416.91288,3528.522
TCTA53,5434.38454,8624.39555,2774.51758,2874.5455,0545.31
macaca_mulattanomascus_leucogenyspan_troglodytespapio_anubispongo_abelli
TAGA54,6543.97854,7174.28459,3964.54154,4483.89763,8424.598
TCAT1,34,7119.8051,26,8819.9341,36,35610.4261,34,4819.6261,45,07410.448
GAAT1,16,7688.4991,14,9408.9991,23,9039.4741,17,8438.4351,35,5339.761
AGAT65,0134.73265,9105.1668,0385.20265,0484.65672,6845.234
AGAA4,49,81132.7383,80,61829.8013,36,97625.7664,64,60633.2543,57,40925.739
GATA53,9963.9351,3644.02255,3284.2354,0733.8759,0924.256
TATC53,0173.85952,1874.08655,1054.21354,1453.87559,2964.27
CTTT3,03,43122.0842,91,06422.7893,23,37424.7253,06,62821.9473,32,50223.945
TCTG88,0966.41284,6506.62890,2066.89791,3066.53598,0987.065
TCTA54,4663.96454,8834.29759,1754.52554,5533.90565,0504.685
Fig. A1 (presented in Appendix A (figures part) from A1(a) to (e)) shows the successive occurrence percentage of 10 SSRs for all chromosomes of Callithrix jacchus, Chlorocebus sabaeus, Gorilla gorilla, Homo sapiens, Macaca fascicularis, Macacamulatta, Nomascus leucogenys, Pan troglodytes, Papio anubis and Pongo abelli
Fig. A1

(a). Successive occurrences percentage of 10 SSRs for all chromosomes of homo sapiens and monkeys (b). Successive occurrences percentage of 10 SSRs for all chromosomes of homo sapiens and monkeys (c). Successive occurrences percentage of 10 SSRs for all chromosomes of homo sapiens and monkeys (d). Successive occurrences percentage of 10 SSRs for all chromosomes of homo sapiens and monkeys (e). Successive occurrences percentage of 10 SSRs for all chromosomes of homo sapiens and monkeys.

Position and MAX number of occurrences of 10 SSRs for each chromosome of homo sapiens and monkeys

Table A1 shows ( presented in Appendix A (tables part)) the 10 SSRs position and MAX number of occurrences for each chromosome of Callithrix jacchus, Chlorocebus sabaeus, Gorilla gorilla, Homo sapiens, Macaca fascicularis, Macacamulatta, Nomascus leucogenys, Pan troglodytes, Papio anubis and Pongo abelli respectively.
Table A1

The 10 SSRs position and MAX number of occurrences for each chromosome of homo sapiens and monkeys.

callithrix_jacchus
chlorocebus_sabaeus
gorilla_gorilla
SSRsChr.PositionMAXChr.PositionMAXChr.PositionMAX
NameNo.occ.NameNo.occ.NameNo.occ.
TAGAchr37973547221chr21618649619chr153864688719
chr710417738819chr319131039619
chr66745474619
TCATchr4282841614chr227993735314chr144297781814
GAATchr161066838113chr63707450014chr11940490812
chr13821724712
chr15253411412
chr1011864863112
chr112190371312
chr49202409212
chr64150392312
chr67108701412
chr616820328012
chr813226708512
AGATchr37973547321chr21618649320chr153864688420
chr66745474320
AGAAchr18552574057chr14860660754chr316436610741
GATAchr37973547420chr21618649420chr153864688520
TATCchr119389063718chr203246361520chr109361647726
chr108947752318
chrX3531433418
CTTTchr71349435251chr131291403642chrUn3306427166
TCTGchr1211670635514chr63975197914chr126594303816
TCTAchr108947752118chr111759355520chr109361647526
chrX3531433218chr203246361720
chr131291403642
homo_sapiensmacaca_fascicularismacaca_mulatta
TAGAchr67874494321chr86808265929chr110240482531
TCATchr181904096912chr37086945119chr1112136941819
chr19570515112
GAATchr72444732612chr12990393114chr120762781815
chr913009461014
chrX12548276214
AGATchr67874494421chr86808266028chr110240482631
AGAAchr15915285042chr630665766218chr120842696184
chr220203975242
GATAchr67874494122chr86808265729chr110240482731
TATCchrX274539125chr219093925833chr214564243421
CTTTchr118300177678chr7135702116221chr173165352879
TCTGchr16695818312chr118380895216chr151658749312
chr312815938012chr810680640512
chr64220996612
chrX3306690512
TCTAchrX274539325chr219093926033chr1113920670921
chr214564243221
nomascus_leucogenyspan_troglodytespapio_anubis
TAGAchr179969442217chr221870575417chr156784106431
chr203742953017chr411412121917
chrX13445235117chr73020890917
chr811756572217
TCATchr117666075412chr116670319610chr33533635315
chr58145437610
chr59896145210
chr514624252410
chr910358809410
GAATchr810736045611chr34986823111chr206137080614
AGATchr203742953117chr73020890618chr156784106132
AGAAchr1111090751352chr1211242039643chr164512473854
GATAchr203742953217chr73020890718chr156784106231
chr811756572018
TATCchr189038495623chr38509874419chr14202045221
CTTTchr117198779533chr1112238598330chr194116969147
TCTGChr22a9816310013chr22691911113chr114546352915
TCTAchr189038495822chr38509874618chr29111429122
pongo_abelli
TAGAchr105130434720
TCATchr203750419812
GAATchr145152504213
AGATchr105130434820
AGAAchr98603459437
GATAchr105130434919
TATCchr417196066219
CTTTchr161349890563
TCTGchr2B10849060811
chr312968964511
TCTAchr2A4711532518
chr417196066418

Data for all the chromosomes of homo sapiens and monkeys for every 10 SSR count

Table A2, Table A3, Table A4, Table A5, Table A6, Table A7, Table A8, Table A9, Table A10, Table A11 show (presented in Appendix A (tables part)) every SSR count for all chromosome of Callithrix jacchus, Chlorocebus sabaeus, Gorilla gorilla, Homo sapiens, Macaca fascicularis, Macacamulatta, Nomascus leucogenys, Pan troglodytes, Papio anubis and Pongo abelli respectively.
Table A2

The 10 SSR counts for all the chromosomes of callithrix_jacchus.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr1412094919433481523363381837542074667044079
chr2402891279149466324108367836932096461554071
chr3436390319048482123465397537982073653224158
chr4339477687769376420392301230481792651433306
chr5246964416405290816343222721561512653012409
chr6295270866983352418332280828021637246442985
chr7283870937017336516919266623961547448742553
chr8247556415654302414685242623101285137852444
chr9230654395431279713781221522331265039522381
chr10239657155668280314578210921381296541982276
chr11235558365888284814173216021991245743782376
chr12199251265036234312582185417861163441841904
chr13222351654983266013253211620541202735142235
chr14201048074598240312301184318281117834082104
chr1517684616439920461108215971565992131291777
chr16205946884711240211486195220021017328502075
chr171427345334831729874213741301769322121469
chr18893211520861019563282981948001459851
chr1985722322169992553979874851241640819
chr2090519672020939463873063744991661691
chr211137228021351291608110491077528514661211
chr2258814811451685442751654041561739587
chrUn1249289632081688839111121258808828901441
chrX348964426724402116932349232821472342613303
chrY7411012389316803328811938
Table A3

The 10 SSR counts for all the chromosomes of chlorocebus_sabaeus.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr1238860385637285814992247622811300541182353
chr215854194385218821025615131538927831291602
chr320804689405323051116819201985971126922048
chr4209942933853239213253187519381043226672029
chr51202334131251419906111481109813226441223
chr672417591527813519265866350901856686
chr7307366876048364417595301329691523836453119
chr8289368156134352817227303626861531243132797
chr9227858315284272615660218121881404142342309
chr10263761375500308715639243424721398638492617
chr11246460965790298915189240123721384840742573
chr12209449794507249112692202119671129635691964
chr13204647274241229112300185919981077627932035
chr14194149414420231712821196618181123334642080
chr15180944553975222411229180017781031527061875
chr161070278226721286826710051010756126301117
chr171448327929751646856813611341808221781466
chr181415342431171627861113931385754921951442
chr1942712931195511306238635429071248370
chr20230262315876292615377223621561380245642256
chr21256059985495311915798243724531370637692630
chr22195847544522231811758185719221075931592035
chr231591413036411859975415531515886526291617
chr241662397536161968962615151467917527171567
chr2516644086376619061075215771701946927081751
chr269742491228011276141910911577919211009
chr27108524212252125660141029100651691439999
chr2838480770344926043704402276737415
chr29437104388550629874244462410759435
chrMT1111
chrUn477548419534476268811642534493695
chrX328768026007385117890334935811580042783568
chrY178301213221926193182778280179
Table A4

The 10 SSR counts for all the chromosomes of gorilla_gorilla.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr14108104439603475323769374238422177369573964
chr2A205548044254232211643179818681096732891945
chr2B263460265603305114902240223481326438442504
chr3387792788506433421974357736571989458053895
chr4417192558315484621606397439511920549554106
chr5287470926414328917005258127011586550382857
chr6336377487087397519327312430721717248673374
chr7292868356231337716910273927341641944042857
chr8282067676128341116193281326261457341792805
chr9215051774864242712210191019451371636392088
chr10240959885448267614344221721151736347692987
chr11238560665625285613794224823191288242682488
chr12247160555671298114649240823251378339302501
chr1321744798411624131086119711962974726882087
chr141597404236371933911615491522893127161538
chr151360342930361508800612151242732824091366
chr161190344631011404758011271157775527981282
chr1718694008372021271048017541716933526531877
chr1816743427316518028645147114631216324011571
chr1959917521513735496158958548231836685
chr20955272924981086593585590980502116924
chr2180215281346882391870566233391067738
chr2240313351286483294040438925661281401
chrMT312
chrUn7787878401604254252853427042297791
chrX338667616088394817643347035821611943723644
Table A5

The 10 SSR counts for all the chromosomes of homo_sapiens.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr143851092810062508325987401641042346471674258
chr24898111329911559527772443944652524373344596
chr3407394468601454923020377337312067058583995
chr4429993938513494622740413841182019350594316
chr5386485417658444921362355735581900753583914
chr6344779637123402120548325432071829749273462
chr7307671736480359618774290429791806146093115
chr8307268376262361017958299127841581142412990
chr9229454985063258313365203421791211937102279
chr10244260375365281014966232122161372042682473
chr11247162905812302115318244924521399844552665
chr12260361265751307915686251224621460639752610
chr13221448204182249111360203920711020626802181
chr141708420237402064983416181644956627521706
chr151498361831961676870913551315805725151480
chr161282363833481500844012141228855128211334
chr171204300327501417820511051129799726061207
chr181574352032381856882515361512811022611609
chr1972521971693889640268569661792001763
chr20887274125151078621885994861512117949
chr2186815961419942430681776737041112802
chr2248414331329575327846242329811313432
chrMT311
chrUn911681126541457421224241105
chrX390070286199445118805393838921706744653964
chrY1156107089712633482114211173997896
Table A6

The 10 SSR counts for all the chromosomes of macaca_fascicularis.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA

chr14089104899620491727911391037412440572583940
chr2381190638322438023717350838042082457013930
chr3377283747569448423349357137782034256423737
chr4349781627062397321109333231821893148873412
chr5408693408308483723873398840182113050934176
chr6363686597594422024461336335442004654173703
chr7304176236699357519529284727411749153052993
chr8299970706210357519030306429041628044652798
chr9226558995320270115208210721441383242852294
chr101286408937761603968712131226897234561253
chr11257863385856337817447303824101525142112579
chr12254061695460298815796237724791406439352583
chr13192249994368237513477202319051143534552077
chr14232260795608288617473241923031338341762394
chr15205050064534242212577193719171158135012019
chr161151294926391415849911851083829724851114
chr1720754781401323641175919871976999927212044
chr181440349730831682875014111388766622271444
chr197981827147410948318108572464861849735
chr201240326931101565841514821169829926481203
chrMT1242
chrUn12891968112515903352214891394517912081250
chrX323371065975377519182327333631642544273374
Table A7

The 10 SSR counts for all the chromosomes of macaca_mulatta.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr13992105929687467626903377038052351872784024
chr2386092468178451522654373635332026356823794
chr3362083667519435423341344036731998956483624
chr4335879026990386820838312932961787549303430
chr5405891698161482122761397839561987348814097
chr6362085687478419123428333735121934653763699
chr7297175526665350518814276126961692052012923
chr8294069576126355417995306228081541343582770
chr9228858295257268714584213521501334342232277
chr101285398437761574945412431196871432991280
chr11258863415783342118892312424601528441932548
chr12207549974462242112608194420101121031072084
chr13204348374260232812718186520171129034331895
chr14228859915538275215042224521981255141062286
chr15202949564484238612241192618861103934471970
chr161124292626291367816711531030791724591086
chr1720414768398123301121319811962985026722058
chr181423336530061701855713581395748321811429
chr197581802148610027446106368059681888672
chr201206326530831549806314551145770726101172
chrMT1131
chrUn1808620422942219114114199422571077127312025
chrX327970945925379219977329733511710443933322
Table A8

The 10 SSR counts for all the chromosomes of nomascus_leucogenys.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr1a230055045024268413392213621821257835032318
chr2301974976746353317774280828461610749813046
chr3311072076388359517557296428681598843763078
chr4275865885938330016612268128661510845782947
chr5302067526098352516522283629181458140393015
chr6240751514644271112872211222801158937042362
chr7b213950304579258512619208922671149032572377
chr8204851554639245612414189919041152637142011
chr9262156584892297214597246323281215531872485
chr10196845344159232211685188517851047433121843
chr11252456605195291014199236423391302232162538
chr12206248394587242312463196819291140532211991
chr13187548174433226412864172817501040832921917
chr1417884044352319901030815291525954728571604
chr15208754384916250611784200721101065134132181
chr16209848424387242111473202619881053528652130
chr17167440123686199410746157416431009129791787
chr18186444794025213211676168217871058831171808
chr191316333330521664848113031370763825001477
chr201863401637442104965717571734838823911818
chr211793410137302099986516651670922524571735
chr236791545142674938326035903441914643
chr2437711161004454233333225522901047289
chr256851380122473537656056423164919679
chrUn63118661620272668386429426104412745565
chrX339464965803395717985348737541525641253700
hr22a261758215478309914757243224311337839412539
Table A9

The 10 SSR counts for all the chromosomes of pan_troglodytes.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr14398108499972510825438398440852344172464282
chr2A218949974341248312234195019671118034361998
chr2B274762085655314915379252225581400839382672
chr3420395868698467623135390238472091359164177
chr4447497648766509824659428142362196752704535
chr5388086337663443620696360635001965952463921
chr6370982437363421520730340434091921050313695
chr7326675116653382919252308030681869647183207
chr8310769726350366118035307328701642443073005
chr9233454104974263213248210721351212537002322
chr10245260625463286115007242923711476744302533
chr11256563045849306215090245724641401244402643
chr12267462265948312315441254824541503239972737
chr13225348834223250611275205521311016626812224
chr141731420837062063967716461634950127541692
chr151506360432181625847313111299781524881433
chr161342363533051602848012571249865728721400
chr171275298927911510819511471038800726701163
chr181634359032381886878215421543817422541619
chr1977320131596834585867167863561954784
chr20933281726151074611684894663472144981
chr2175815041305839396670769033661028729
chr2241113681313506304639040727691275405
chrMT11
chrUn909147023956844134445536124351409965
chrX333866095848385917083342733841511140713438
chrY53590065571735475396053236931615
Table A10

The 10 SSR counts for all the chromosomes of papio_anubis.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr13872104259563468725568378337802344672734049
chr2373191438112437822407361134772027356793784
chr3359683247402427821550342934881928955663564
chr4325078186868377520282304731431763148113273
chr5396391218075469222233394138691944949214048
chr6359685127414414822403333734471909752703616
chr7296774606613357018373279226461658651152868
chr8283069046048342417552290628151512243252747
chr9220757335110260914258205220641325241782224
chr101272393736931547922612391149872132371256
chr11254562065736309415648253123841416741222592
chr12205149984484243112527192319791117631002038
chr13243458445083276214789215022721314241822263
chr14227360215576278014306223321311246141462234
chr15201749624446238112168193518381094534381942
chr1610972903261312437918966964751525111062
chr1719274673388721881075318291911965626811952
chr181446345430691680864413921411754621571411
chr1970218141428923655587565055221825694
chr201152323630411390781712381131772125911152
chrMT1
chrUn2366601637883392141376366743051808058602499
chrX315469775794367618253319732911583143183284
Table A11

The 10 SSR counts for all the chromosomes of pongo_abelli.

TAGATCATGAATAGATAGAAGATATATCCTTTTCTGTCTA
chr147791168510749567728928455443432634880644696
chr2A223553884906270213275219220971187338042437
chr2B295568066049337716502273726881492743982870
chr34347100829272506824397409241542218761554420
chr44801105319292561625309461146602236757514874
chr5416992078162465022580374138631995958054181
chr6369285707637434021391352134491884652103774
chr7326175786898385619078307632741716148523396
chr8338375466745393818813329131321663846183232
chr9248757765215280113646225922171301839572351
chr10259771359532306115947247024591506447032716
chr11255662765961312414353250523651299742292520
chr12275867556034328916312271526341496343602944
chr13240452294512275412205226722791079528742421
chr1419164396391022081020317521674984629091738
chr151611385933641735921714461331838227021532
chr161399376334381624857513261303870631651420
chr171234326630781534886211811373848228861445
chr181612364633971862924015201638843924741709
chr1987420471790972606774475561492136869
chr2096730312639117865629071034624822951089
chr2194616941490997452087174739251125837
chr2244413621306548315241640928901328436
chrMT22
chrUn225719723392110781387388941389932842565
chrX415874746765466620137416045221839350144576
Fig. A2 shows (presented in Appendix A (figure part) from A1(a) to A2(e)) the each SSRs percentage of all chromosomes of Callithrix jacchus, Chlorocebus sabaeus, Gorilla gorilla, Homo sapiens, Macaca fascicularis, Macacamulatta, Nomascus leucogenys, Pan troglodytes, Papio anubis and Pongo abelli.
Fig. A2

(a). Each SSRs percentage of all chromosomes of homo sapiens and monkeys (b). Each SSRs percentage of all chromosomes of homo sapiens and monkeys (c). Each SSRs percentage of all chromosomes of homo sapiens and monkeys (d). Each SSRs percentage of all chromosomes of homo sapiens and monkeys (e). Each SSRs percentage of all chromosomes of homo sapiens and monkeys.

Experimental design, materials and methods

SSR extraction

In this paper all chromosomes of homo sapiens and monkeys(Callithrix jacchus, Chlorocebus sabaeus, Gorilla gorilla,Macaca fascicularis, Macacamulatta, Nomascus leucogenys, Pan troglodytes, Papio anubis and Pongo abelli) and the ten(TAGA, AGAA, GATA, TCTA, TCAT, GAAT, AGAT, CTTT, TATC, TCTG) SSRs are considered. SSRs are extracted from homo sapiens and monkeys using string matching approach. The string matching is a searching mechanism that searches the repeats in a given chromosomal file. Search process: The chromosomes and SSRs are given to main function, then the main function calls the shift process by providing right most character of the SSRs. The shift position is returned to main function by the shift process. The search process compares character by character from both the directions until a complete match or mismatch occurs. If match occurs the successive occurrence of the pattern is searched. If the successive occurrence size is greater than 1 then the data is stored in database [1]. This process is continued for all the SSRs and for entire data in the chromosomes. The detailed description is given in [1].

Paternity identification with similarity measures [2]

In cases related to paternity tests, two or more persons might claim that a child is their biological son/daughter. In such cases, the genome sequence of the child as well as the persons can be compared to identify the similarity of the loci that is stored in the Tandem Repeat Database(TandemRepeatDB). The person having more similarity of the loci with the child DNA will be considered to be the actual biological father/mother. Genome sequence of the child as well as the persons(A and B) is taken. The continuously occurred 10 loci׳s from child and persons (A and B) are extracted and stored in TandemRepeatDB using multiple pattern multiple(2) shaft parallel string matching algorithms [3]. The loci from TandemRepeatDB are extracted. Correlation coefficient, Rank correlation coefficient and Cosine similarity measures are applied to measure the similarity between loci of child and persons(A and B). Similarity measures return the percentage of similarity between the loci of child and persons (A and B). Using the similarity percentage, the similarity can be noticed in both the positive and negative terms. Example of similarity between child and persons (A and B) is shown in Table 3.
Table 3

Example of similarity between child and persons (A and B).

Child vsCorrelation CoefficientRank Correlation CoefficientCosine Similarity
Person A111
Person B0.220.240.21
In Table 3, correlation coefficient, rank correlation coefficient and cosine similarity measures show a positive correlation (1) between the child and person A, whereas between the child and person B show a positive correlation for all the three measures but it is very low compared to child and person A.

DNA finger printing

Performing pattern search in the entire genome of an organism in traditional approach i.e., using laboratory experiments is very time consuming. Even for a small part of a genome, the process will take several hours, moreover the related laboratory experiments are quite expensive. Due to the latest developments in genome sequencing, in the near future, a person can get their entire genome sequenced in a diagnostics centre just like the medical diagnostics. In this situation, the multiple pattern multiple(2) shaft parallel string matching algorithms [3] will play a key role to search the loci in the person׳s genome and will return the occurrence positions, chromosome name, loci name etc., in a quick time and at no cost. DNA finger printing—It is a method used to identify an individual from sample genome sequence by searching the patterns in the locations on all chromosomes. Genome sequences of one family members (father, mother, daughters and sons) are considered. The 10 loci in all the family members genomes are searched using multiple pattern multiple(2) shaft parallel string matching algorithms [3]. If exact match occurs then successive logic is applied. If successive occurrence of the loci is found then its sample name, position, chromosome name, pattern and number of times of occurrence related to all family members genomes are stored in TandemRepeatDB [1] The loci of all family members are extracted from TandemRepeatDB, their position, chromosome name, pattern and number of times of occurrence is compared. If they are matched then where the genomes of father and mother are matched to their child are shown.
Subject areaBio-informatics
More specific subject areaGenomes of homo sapiens and monkeys
Type of dataTables, figures
How data was acquiredSSR markers extraction with string matching
Data formatAnalyzed
Experimental factorsTen SSR motifs: TAGA, TCAT, GAAT, AGAT, AGAA, GATA, TATC, CTTT, TCTG, TCTA were targeted. String matching process is applied on genomes of homo sapiens and monkeys. 10 SSR markers to be used in various detection purposes are extracted with this approach.
Experimental featuresEach of the 10 SSR markers are extracted from genomes of homo sapiens and monkeys. All the 10 SSRs showed the 4 bp in allele size. These differences showed that there are some polymorphisms among the genomes to the number of SSR repeats.
Data source locationBHIMAVARAM, INDIA
Data accessibilityThe data is provided with this article
  2 in total

1.  Next generation sequencing (NGS) database for tandem repeats with multiple pattern 2°-shaft multicore string matching.

Authors:  Chinta Someswara Rao; S Viswanadha Raju
Journal:  Genom Data       Date:  2016-01-29

2.  Similarity analysis between chromosomes of Homo sapiens and monkeys with correlation coefficient, rank correlation coefficient and cosine similarity measures.

Authors:  Chinta Someswara Rao; S Viswanadha Raju
Journal:  Genom Data       Date:  2016-01-07
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.