Muhamad Aidil Zahidin1, Wan Bayani Wan Omar2, Wan Rohani Wan Taib3, Jeffrine Rovie Ryan Japning4, Mohd Tajuddin Abdullah1,2.
Abstract
The Orang Asli are the aboriginal people of Peninsular Malaysia, who are recognized as indigenous to the country and still practise a traditional lifestyle. Molecular interest in the Orang Asli stems from the earliest prehistoric migration, which occurred approximately 200 kya and entered Peninsular Malaysia in stages about 50 kya. Three groups of Orang Asli are present in Peninsular Malaysia, namely the Negrito (also known as Semang), the Senoi and the Proto Malays. According to available records, no research has been conducted on mtDNA variation in the Semoq Beri population, one of the tribes of the Senoi group. In this report, mtDNA variation was analysed in the population in Hulu Terengganu as an initial effort to establish its genetic characterisation and to elucidate the history of Orang Asli expansion in Peninsular Malaysia. An array of mtDNA parameters was estimated, and the observed polymorphisms and their respective haplogroups were inferred by comparison with the rCRS. The DNA sequences are registered in the NCBI with accession numbers KY853670-KY853753.
Year: 2018 PMID: 30761343 PMCID: PMC6288409 DOI: 10.1016/j.dib.2018.10.158
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
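The accession range quoted in the abstract can be expanded into individual identifiers, for example to build a download list. The following is a minimal sketch; the helper name `expand_accessions` is hypothetical, and it assumes the range is contiguous and shares a single alphabetic prefix.

```python
def expand_accessions(first: str, last: str) -> list[str]:
    """Expand an inclusive accession range such as KY853670-KY853753.

    Assumes both accessions share the same alphabetic prefix and that
    the numeric part is contiguous (an assumption, not stated in the source).
    """
    digits = "0123456789"
    prefix = first.rstrip(digits)
    if last.rstrip(digits) != prefix:
        raise ValueError("accessions must share the same prefix")
    width = len(first) - len(prefix)
    start = int(first[len(prefix):])
    stop = int(last[len(prefix):])
    # Zero-pad so that the identifiers keep their original width.
    return [f"{prefix}{number:0{width}d}" for number in range(start, stop + 1)]


accessions = expand_accessions("KY853670", "KY853753")
print(len(accessions), accessions[0], accessions[-1])
```

Under these assumptions the range covers 84 accession numbers.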
Details of the primers used for PCR amplification [3].
| HVI | 16,024–16,569 | conL1 (F) | TCAAGCTTACACCAGTCTTGTAAACC | 600 |
| | | conH1 (R) | CCTGAAGTAGGAACCAGATG | |
| HVII | 0–576 | conL4 (F) | GGTCTATCACCCTATTAACCAC | 600 |
| | | conH4 (R) | CTGTTAAAAGTGCATACCGCCA | |
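Basic properties of the four primers listed above can be checked with a few lines of code. This is an illustrative sketch only: the helper names `gc_content` and `reverse_complement` are hypothetical and are not part of the original workflow.

```python
# Primer sequences copied from the table above (F = forward, R = reverse).
PRIMERS = {
    "conL1": "TCAAGCTTACACCAGTCTTGTAAACC",  # HVI forward
    "conH1": "CCTGAAGTAGGAACCAGATG",        # HVI reverse
    "conL4": "GGTCTATCACCCTATTAACCAC",      # HVII forward
    "conH4": "CTGTTAAAAGTGCATACCGCCA",      # HVII reverse
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the strand."""
    return seq.translate(COMPLEMENT)[::-1]


for name, seq in PRIMERS.items():
    print(f"{name}: length={len(seq)} nt, GC={gc_content(seq):.1%}, "
          f"reverse complement={reverse_complement(seq)}")
```

The reverse complement is useful when locating a reverse primer on the reference strand, since reverse primers are written 5' to 3' on the opposite strand.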
C-stretch of the HVII region between nucleotide positions 233 (*233C) and 250 (250C).
| ns | Unbroken C series | n |
| 24 | CCCCCCCC | 8 |
| 13 | CCCCCCC | 7 |
| 6 | CCCCCCCCC | 9 |
| 1 | CCCCCCCCCCCCCCCC | 16 |
| 1 | CCCCCCCCCCCCCCCCC | 17 |
N - deleted base, ns - total number of sequences, n - number of bases in the unbroken C series.
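The quantity n in the table above is the length of the longest uninterrupted run of C bases. A minimal sketch of how such a run could be measured is given below; the function name `longest_c_run` and the two example fragments are hypothetical and are not taken from the dataset, while the tally reproduces the ns and n values from the table.

```python
import re


def longest_c_run(seq: str) -> int:
    """Length of the longest uninterrupted run of C in a sequence."""
    runs = re.findall(r"C+", seq.upper())
    return max((len(run) for run in runs), default=0)


# Hypothetical fragments, for illustration only.
examples = {
    "example_a": "AACCCCCCCCTG",   # one run of eight C
    "example_b": "AACCCCCCCTCCG",  # a run of seven C interrupted by T
}
for name, fragment in examples.items():
    print(name, longest_c_run(fragment))

# Tally from the table above: run length n -> number of sequences ns.
run_length_counts = {8: 24, 7: 13, 9: 6, 16: 1, 17: 1}
most_common_length = max(run_length_counts, key=run_length_counts.get)
print("most frequent run length:", most_common_length)
```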
Sequence variation for the HVI and HVII regions.
| Parameter | HVI | HVII |
| Nucleotide position (%) | 16,024 to 16,504 (88%) | 72 to 351 (49%) |
| Base pair | 481 bp | 280 bp |
| No. of polymorphic sites | 18 | 26 |
| No. of observed transitions | 16 | 17 |
| No. of observed transversions | 2 | 9 |
| No. of indels | – | 5 |
| Nucleotide composition (%): C | 31.17 | 27.66 |
| Nucleotide composition (%): T | 23.74 | 27.39 |
| Nucleotide composition (%): A | 31.20 | 28.75 |
| Nucleotide composition (%): G | 13.89 | 16.20 |
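The lengths and percentages in the table above can be reproduced with simple arithmetic. The sketch below assumes that each percentage is the analysed span divided by the full amplified range given in the primer table (16,024–16,569 for HVI and 0–576 for HVII), with both end positions counted; the source does not state this explicitly.

```python
def span(start: int, end: int) -> int:
    """Number of positions in an inclusive range."""
    return end - start + 1


# Ranges copied from the primer table and the sequence variation table.
regions = {
    "HVI": {"amplified": (16024, 16569), "analysed": (16024, 16504)},
    "HVII": {"amplified": (0, 576), "analysed": (72, 351)},
}

coverage = {}
for name, ranges in regions.items():
    analysed = span(*ranges["analysed"])
    amplified = span(*ranges["amplified"])
    coverage[name] = (analysed, round(100 * analysed / amplified))
    print(f"{name}: {analysed} bp analysed, {coverage[name][1]}% of the amplified range")

# Nucleotide composition should sum to 100% in each region.
composition = {
    "HVI": [31.17, 23.74, 31.20, 13.89],
    "HVII": [27.66, 27.39, 28.75, 16.20],
}
for name, values in composition.items():
    print(name, "composition total:", round(sum(values), 2))
```

Under this assumption the computed values agree with the 481 bp (88%) and 280 bp (49%) reported in the table.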
Frequency distribution of the mtDNA haplotypes.
| Haplotype | N | Samples | Frequency |
| Hap 1 | 1 | Semaq Beri 19 | 0.025 |
| Hap 2 | 2 | Semaq Beri 3, 43 | 0.050 |
| Hap 3 | 1 | Semaq Beri 45 | 0.025 |
| Hap 4 | 18 | Semaq Beri 1, 5, 6, 8, 11, 12, 17, 18, 21, 24, 29, 32, 33, 35, 39, 40, 42, 47 | 0.450 |
| Hap 5 | 7 | Semaq Beri 7, 13, 23, 30, 36, 37, 49 | 0.175 |
| Hap 6 | 4 | Semaq Beri 20, 27, 28, 34 | 0.100 |
| Hap 7 | 7 | Semaq Beri 2, 9, 14, 22, 31, 38, 48 | 0.175 |
| Hap 8 | 1 | Semaq Beri 44 | 0.023 |
| Hap 9 | 1 | Semaq Beri 46 | 0.023 |
| Hap 10 | 1 | Semaq Beri 21 | 0.023 |
| Hap 11 | 1 | Semaq Beri 36 | 0.023 |
| Hap 12 | 1 | Semaq Beri 35 | 0.023 |
| Hap 13 | 10 | Semaq Beri 2, 9, 14, 20, 22, 27, 28, 31, 38, 48 | 0.227 |
| Hap 14 | 1 | Semaq Beri 3 | 0.023 |
| Hap 15 | 1 | Semaq Beri 25 | 0.023 |
| Hap 16 | 1 | Semaq Beri 26 | 0.023 |
| Hap 17 | 3 | Semaq Beri 10, 15, 41 | 0.068 |
| Hap 18 | 2 | Semaq Beri 19, 50 | 0.045 |
| Hap 19 | 14 | Semaq Beri 1, 5, 6, 8, 11, 12, 17, 18, 24, 29, 32, 39, 40, 47 | 0.318 |
| Hap 20 | 7 | Semaq Beri 4, 13, 16, 23, 30, 43, 49 | 0.159 |
N - number of individuals carrying the haplotype.
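The frequencies in the table above are consistent with each count being divided by the total within its own haplotype set. The sketch below assumes that Hap 1 to Hap 7 form one set (40 sequences) and Hap 8 to Hap 20 a second set (44 sequences); this grouping is inferred from the arithmetic and is not labelled in the table. The names `set_a`, `set_b` and `frequencies` are hypothetical.

```python
# Counts (N) copied from the table above.
set_a = {"Hap 1": 1, "Hap 2": 2, "Hap 3": 1, "Hap 4": 18,
         "Hap 5": 7, "Hap 6": 4, "Hap 7": 7}
set_b = {"Hap 8": 1, "Hap 9": 1, "Hap 10": 1, "Hap 11": 1, "Hap 12": 1,
         "Hap 13": 10, "Hap 14": 1, "Hap 15": 1, "Hap 16": 1,
         "Hap 17": 3, "Hap 18": 2, "Hap 19": 14, "Hap 20": 7}


def frequencies(counts: dict[str, int]) -> dict[str, float]:
    """Relative frequency of each haplotype within one set, to 3 decimals."""
    total = sum(counts.values())
    return {hap: round(n / total, 3) for hap, n in counts.items()}


freq_a = frequencies(set_a)
freq_b = frequencies(set_b)

print("set A total:", sum(set_a.values()), freq_a)
print("set B total:", sum(set_b.values()), freq_b)
```

With these denominators the computed values match the tabulated frequencies, for example 18/40 = 0.450 for Hap 4 and 14/44 = 0.318 for Hap 19.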
Fig. 1. The current Asian and Pacific mtDNA within the Manju clan. The tree was reconstructed based on [11]. The uppercase letters (E-East, N-North, S-South, NA-North Asia, EA-East Asia, SEA-Southeast Asia and PM-Peninsular Malaysia) refer to the geographical location.
Fig. 2. The current Asian and Pacific mtDNA within the Nasreen clan. The tree was reconstructed based on [11]. The uppercase letters (E-East, N-North, S-South, NA-North Asia, EA-East Asia, SEA-Southeast Asia and PM-Peninsular Malaysia) refer to the geographical location.
| Subject area | Forensic science |
| More specific subject area | Forensic genetics |
| Type of data | Tables and figures |
| How data were acquired | Data were acquired by extracting, amplifying, purifying, sequencing and analysing the target mtDNA region using PureLink™ Genomic DNA Mini Kit (Invitrogen, USA), QIAquick Purification Kit (QIAGEN Ag., Germany), DNA sequencer (First Base Laboratories, Malaysia), Sequencher 5.4 software ( |
| Data format | Raw and analysed |
| Experimental factors | Blood sample collection, DNA extraction, PCR amplification, DNA purification, sequencing and data interpretation |
| Experimental features | Sequence analysis followed by haplogroup identification |
| Data source location | Kampung Sungai Berua, Hulu Terengganu, Terengganu, Malaysia |
| Data accessibility | The mtDNA sequences are registered in the NCBI with accession number |
| Related research article | Zahidin |