Literature DB >> 24956385

Novel population specific autosomal copy number variation and its functional analysis amongst Negritos from Peninsular Malaysia.

Siti Shuhada Mokhtar1, Christian R Marshall2, Maude E Phipps3, Bhooma Thiruvahindrapuram4, Anath C Lionel4, Stephen W Scherer2, Hoh Boon Peng1.   

Abstract

Copy number variation (CNV) has been recognized as a major contributor to human genome diversity. It plays an important role in determining phenotypes and has been associated with a number of common and complex diseases. However CNV data from diverse populations is still limited. Here we report the first investigation of CNV in the indigenous populations from Peninsular Malaysia. We genotyped 34 Negrito genomes from Peninsular Malaysia using the Affymetrix SNP 6.0 microarray and identified 48 putative novel CNVs, consisting of 24 gains and 24 losses, of which 5 were identified in at least 2 unrelated samples. These CNVs appear unique to the Negrito population and were absent in the DGV, HapMap3 and Singapore Genome Variation Project (SGVP) datasets. Analysis of gene ontology revealed that genes within these CNVs were enriched in the immune system (GO:0002376), response to stimulus mechanisms (GO:0050896), the metabolic pathways (GO:0001852), as well as regulation of transcription (GO:0006355). Copy number gains in CNV regions (CNVRs) enriched with genes were significantly higher than the losses (P value <0.001). In view of the small population size, relative isolation and semi-nomadic lifestyles of this community, we speculate that these CNVs may be attributed to recent local adaptation of Negritos from Peninsular Malaysia.

Entities:  

Mesh:

Year:  2014        PMID: 24956385      PMCID: PMC4067311          DOI: 10.1371/journal.pone.0100371

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Southeast Asia is believed to be one of the earliest regions of Homo genus habitation recorded outside Africa. This may have occurred nearly 2 million years ago, following the arrival of the ancient Javanian, known as Homo erectus [1]. The Negrito people are believed to be direct descendants of humans who arrived in Peninsular Malaysia more than 60,000 years ago [2]–[4]. Ancestral Homo sapiens who originated from Africa [5] migrated into Asia along the coastal route [6]. The Negritos from Peninsular Malaysia are of Austroasiatic origin [7] and thought to be related to the Philippine Aeta and Andaman Islanders as well as the Melanesians, Tasmanians, and certain tropical Australian rainforest foragers based on superficial anatomical features and foraging lifestyles [8]. In Malaysia, Negritos are divided into six tribes based on linguistics, socio-cultural practices, and geographical region inhabited namely, Bateq, Mendriq, Jehai, Kensiu, Lanoh and Kintak, numbering approximately 0.15% of the total population [9]. Studies on various genetic markers including autosomal microsatellite markers and mitochondrial DNA suggest that these tribes are genetically similar and may have experienced high levels of genetic drift [8], [10], [11]. It is believed that they may have adapted to the environmental changes throughout the centuries to cope with limited food resources and the tropical rainforest environment. Currently, the number of Negritos is dwindling rapidly as Malaysia becomes more developed and forests are cleared. Characterizing the genetic variation of the isolated populations such as Negritos provides valuable information to the gene mapping of complex diseases [12]. Thus it is crucial to unveil their genetic makeup in order to better understand how genetic variation contributes to the well-being and health of human populations especially in the Southeast Asian region. Copy number variations (CNV) typically range from 1 kb to several megabases in size [13] and are acknowledged as a major contributor to genetic diversity. This variability plays an important role to determine phenotypes such as physical features and conferring susceptibility to a number of common and complex diseases including HIV, psoriasis, and a number of neuropsychiatric diseases [14]–[17]. This occurs via potentially altering gene expression levels and influencing the gene dosage [18], [19]. They account for a significant proportion of the genome [13], [20], are highly variable, and often harbor regions with genes sensitive to the environmental stimulation such as those involved in immunity, metabolism, olfactory receptors [21]–[23]. Due to their non-random distribution across the genome, it is believed this phenomenon may have trended towards selection bias [21], [24]. Most genetic diversity data in indigenous populations have been based on single nucleotide polymorphisms (SNP)/single nucleotide variations (SNV) [6], [25], [26] and maternal lineage mitochondrial DNA [4], [8], except for a handful of studies [27]–[33]. To date, CNVs in indigenous populations of Peninsular Malaysia have not been reported. As a complement to the existing SNP data, we explored the first CNV map of Negrito individuals from Peninsular Malaysia and report the distribution of novel and population-specific CNVs. Our findings may be able to provide fundamental insights to the genetic architecture of the Negritos which can be translated to aid biomedical and evolutionary investigations.

Materials and Methods

Sample Recruitment

This study was reviewed and approved by the Research and Ethics Committee of Universiti Teknologi MARA [Ref no: 600-RMI (5/1/6)], and Department of Orang Asli Development (Jabatan Kemajuan Orang Asli Malaysia, JAKOA) [Ref no: JHEOA.PP.30.052.Jld 5(17)]. Prior to sample collection, the headman of the tribe and/or the community members were first consulted in a customary courtesy visit and their consent were obtained. During sampling, all participants were interviewed, and informed and written consent were obtained. Process of interview and informed written consent was conducted in Malay language and witnessed by the officer from JAKOA. Only Negrito participants 18 years who gave consent were selected. We collected 10 ml of peripheral blood from 34 unrelated individuals (17 males and 17 females) after obtaining informed consent. The samples consisted of both males and females from sub-tribes Jahai, Bateq, Mendriq and Kensiu. DNA was extracted from using Qiagen Blood Extraction Kit (Qiagen, Hilden, Germany) according to the manufacturer's protocol.

Microarray Genotyping

Genotyping was performed using the Affymetrix SNP6.0 Array platform according to the manufacturer's instructions. Briefly, 250 ng of genomic DNA was digested and ligated. The ligated products were then PCR amplified. Amplicons were electrophoresed, purified and quantified to ensure that the samples passed quality control (QC) measures before further experiment. The products were then fragmented, hybridized onto the Affymetrix SNP6.0 chips and stained. Chips were scanned and raw data was generated using Affymetrix Genotyping Console Software (GTC) version 3.0.2 with default settings.

Copy Number Variation Analysis and Validation

CNVs were called independently using three algorithms, Affymetrix GTC, Birdsuite and iPattern (TCAG) as described previously [35]. We applied stringent filtering criteria such that CNV had to be a minimum of 1 kb and span 5 consecutive probes, and be detected by at least 2 out of the 3 algorithms. In addition we excluded CNVs that were on the X and Y-chromosomes, or approximately 300 kb adjacent to the centromeres and telomeres. To define a set of rare CNVs we excluded known polymorphic loci (ie. Copy number polymorphism, CNP, targeted by the array) and those CNVs with more than 50% reciprocal overlap with those reported in DGV. The filtered CNV calls were then compared with the HapMap3 dataset and subsequently the Singapore Genome Variation Project (SGVP) (http://www.statgen.nus.edu.sg/~SGVP/), to further identify CNVs unique to the Peninsular Malaysia Negritos. We defined a CNV as putative novel and unique to Negritos (denoted as population-specific CNVs) when it is not present in any of the HapMap3 and SGVP samples (defined as <50% reciprocal overlap with HapMap3 and SGVP CNVs). Annotated CNVs unique to the Negrito samples studied with underlying genes were validated with qPCR SyBr Green assay as previously described [34]. A total of 50 ng (10 ng/µl) genomic DNA was amplified in a reaction mixture containing 12.5 µl iQ Sybr Green Supermix (Biorad), 1 µl (7 µM/µl) of respective forward and reverse primers, and top up to total volume of 25 µl with ddH2O. Cycling conditions were 95°C for 3 min, and then 40 cycles of 95°C for 30 s, followed by respective annealing temperatures of each locus for 15 s and 72°C for 30 s. Melting curve was performed to check for specificity of the assay. Efficiency of the assay was observed by the generation of standard curve by created a serial of five-fold dilutions of a top standard of 50 ng/µl to 0.08 ng/µl (10 ng to 0.016 ng) of a single genomic DNA sample. All reactions were run in triplicate, except a few when the genomic DNA was insufficient, were run in duplicate. Normalization to the control gene Forkhead Box P2 (FOXP2) (primers 5'-TGACATGCCAGCTTATCTGTTT-3' and 5'-GAGAAAAGCAATTTTCACAGTCC-3') was used to give an estimate of copy number. The reproducibility of the qRT-PCR assay for each sample was calculated by estimating the within-sample variation measured through the coefficient of variation (C.V. %  = 100*[standard deviation]/mean). Copy number of the target sequence in each test sample is determined by using comparative CT (2-ΔΔCT). Eight out of 12 (66.7%) CNVs were true positive (8 out of 9 were CNVs >10 kb in length). However, all 3 CNV less than 10 kb failed to validate. Considering the low replication rate, we removed the CNVs sized <10 kb from further analysis. The primer sequences and the copy number amplified for the candidate CNVs is listed in Table 1.
Table 1

Candidate genes primer sequences and copy number amplified in SyBr Green qPCR assay.

Locus nameCNV size spanned (bp)Primer sequenceExpected amplicon size (bp)Annealing temp (°C)Copy number
ADH7153,593Forward: gaaggcacaagctgctgttatReverse: catcctgtctttgtcttggatct9959.6°C3(2.80, 0.108)
CSMD1301,535Forward: actctgaacggtgtcctggttt Reverse: ttcctaagctgcaaaggtgtg9262.2°C3(3.11, 0.062)
SH2D4B15,464Forward: atgttctatgctgtggtggatg Reverse: acgaactttgtcagaaacgtga10159.9°C1(0.45, 0.042)
NPAS325,484Forward: ctgttggcttagaggctgagatReverse: agcccttgagatgattcctaca10960°C1(1.32, 0.65)
WDR4165,544Forward: acaggtttgtgagccgtatctcReverse: tcaagaatccagaggtgagtga10660°C2(2.10, 0.14)
LRRC309,547Forward: cttgcacgtgggctcgaatcReverse: ggatgttgttgccctctgcg9566.3°C-
TNFRSF1B83,214Forward: cattaggagatgtgtggtcctgReverse: aacagtatgtcccgttctgtctc9059.6°C3(3.09, 0.008)
PRIMER 136,283Forward: acagaacctaagcggaaatcctReverse: aactggaagcaagatgctgact10764.0°C3(3.40, 0.08)
PRIMER 265,481Forward: ccctgaagcgtgagtctctaat Reverse: tgataacacctctgcacattcc8963.5°C3(2.50, 0.12)
PRIMER 342,399Forward: ggtcttcagtttgtgcttcagat Reverse: catcacttcctagcgccttc8063.4°C3(2.90, 0.07)
PRIMER 463,260Forward: tcctaaagtttccgcaggagReverse: ctcacttcactggtgtcaggtt9963.2°C1(1.14, 0.32)
QCNV29,812Forward: caggcaagttcatatgttccaReverse: agaggaatgccagatagagcag11363.6°C3(2.90, 0.11)
QCNV44,021Forward: acttggtaaattgtgttgaReverse: tgtcagtcctgcattt10452.4°C2(2.20, 0.17)

WDR4 and QCNV4 showed copy number normal and therefore considered as false positive. QCNV2 was detected as a CN gain by microarray, inconsistent with the qPCR validation, therefore considered as false positive. Parentheses, unrounded copy number values calculated using the relative quantification, standard deviation.

WDR4 and QCNV4 showed copy number normal and therefore considered as false positive. QCNV2 was detected as a CN gain by microarray, inconsistent with the qPCR validation, therefore considered as false positive. Parentheses, unrounded copy number values calculated using the relative quantification, standard deviation. The microarray dataset has been submitted to NCBI dbGaP. The accession number assigned is: phs000664.v1.p1.

Gene Ontology Analysis

We submitted the annotated genes list underlying the Negrito-specific CNVs observed to PANTHER (Protein ANalysis THrough Evolutionary Relationships) (http://www.pantherdb.org/) and DAVID (the Database for Annotation, Visualization and Integrated Discovery, version 6.7) (http://david.abcc.ncifcrf.gov/summary.jsp).

Results

General Characteristics of CNV and CNVR

We identified 1,333 autosomal CNVs from Genotyping Console (Affymetrix), with an average 39.2 CNVs per genome, whilst the total number of CNVs being called by Birdsuite and iPattern were 2,636 and 3,692, respectively (mean number of calls per genome 77.5 and 108.6 respectively, Table 2). After applying stringent filtering criteria, 1,111 overlapping CNVs were successfully merged, with an average 32.7 CNVs per genome (CNV call per genome ranged from 19–54), corresponding 105,909,572 bp of the total autosomal genome (Figure 1). These corresponded to 263 CNVRs comprising of 161 losses, 94 gains and 8 multi-allelic sites. Figure 2 shows the length distribution of CNVRs in this study.
Table 2

General characteristics of CNV and CNVR among 34 Negrito genomes from Peninsular Malaysia.

GTCBirdsuiteiPatternMerged*
Total CNV count:
Gain5307351,430330
Loss8031,9012,262781
Complex CNV40
Total1,3332,6363,6921,111
Average number per genome:
Gain15.521.642.09.7
Loss23.655.966.523.0
Total39.277.5108.632.7
Size (bp):
Min1,0001,0191,0101,134
Max1,768,000985,8071,033,7841,033,785

*Merged: stringent CNV calls by at least 2 out of 3 algorithms applied.

Figure 1

CNVR map of Negrito samples.

The ideogram summarizes the distribution of CNVRs on each human chromosome. The red indicates copy number loss, the blue indicates copy number gain while the green indicates multi-allelic loci.

Figure 2

Length distribution of the CNVs in Negrito from Peninsular Malaysia.

CNVR map of Negrito samples.

The ideogram summarizes the distribution of CNVRs on each human chromosome. The red indicates copy number loss, the blue indicates copy number gain while the green indicates multi-allelic loci. *Merged: stringent CNV calls by at least 2 out of 3 algorithms applied.

Comparison of Common CNVs

We first compared the diversity of common CNVs with the HapMap3 populations derived from 10 populations (consisting 1,072 samples). A set of CNVs that showed significant differences of allele frequencies are listed in Table 3. Notably, CNV losses at chromosome 3p22.2 (37,957,108–37,961,932) were observed in 56% of the Negrito samples in this study as compared to the rest of the HapMap3 populations (Table 3; Figure 3). The gene CTDSPL involved in this CNV was found to be associated with prostate cancer (https://www.genome.gov/26525384). The CNV in chromosome 15q13.3 was another region of interest. Frequency of this CNV was found to be higher (0.44) as compared to the HapMap3 samples (ranging from 0.09–0.21). The gene CHRNA7 involved in this CNV was found to be associated with schizophrenia and epilepsy [35], [36].
Table 3

Common CNV with significant difference in allele frequencies compare to the HapMap3 dataset.

CNVsChrStartEndCNV Frequencies
NEGASWCEUCHBCHDGIHJPTLWKMEXTSIYRI
1.240,780,87940,803,1100.210.005.6×103 0.060.020.000.000.000.000.000.00
2.252,605,07452,635,0460.880.170.530.800.800.700.860.040.440.490.09
3.337,957,10837,961,9320.560.070.110.150.160.210.030.010.110.270.03
4.478,495,57978,500,3670.120.000.000.030.070.000.030.000.010.000.01
5.6202,353326,1490.030.150.290.260.370.160.290.170.270.300.09
6.1532,487,97532,617,6800.440.210.090.160.170.180.220.100.150.170.21
7.1614,897,36415,016,0880.090.300.240.420.350.300.360.330.310.210.22
8.1741,750,18742,107,4790.180.000.000.010.020.000.030.000.020.000.00
Figure 3

UCSC Genome Browser view of CNV on chromosome 3p22.2.

Figure produced by custom tracks listing CNV call of Negrito and uploaded to http://genome.ucsc.edu.

UCSC Genome Browser view of CNV on chromosome 3p22.2.

Figure produced by custom tracks listing CNV call of Negrito and uploaded to http://genome.ucsc.edu.

Population Specific CNVs

Our dataset was further compared with HapMap3 dataset. Analysis revealed 62 CNVs (corresponded to 36 CNVRs) unique to our Negrito samples. However, due to the high false discovery, the CNVs sized <10 kb were excluded from further analysis, hence 48 CNVs remained (24 gains; 24 losses), of which 32 were singletons (Table 4). Length distribution of the CNVRs specific to Negritos is shown in Figure 4.
Table 4

Population specific CNVs in 34 genomes of Negrito from Peninsular Malaysia.

Chromosome CytobandStartEndSizeCNV frequency * CNV type(gain/loss)Genes involvedDisrupted genes
1p36.2212,117,03812,200,25183,2140.03gainTNFRSF1B, TNFRSF8TNFRSF8
1q43237,656,846237,667,78610,9410.03loss--
2p2141,716,28841,781,08164,7940.06loss--
41,717,14941,780,40863,2600.06loss--
41,717,14941,785,21468,0660.03loss--
2p1275,457,87875,486,63228,7550.03gain--
75,469,90475,486,63216,7290.03gain--
2q13109,015,589109,051,83136,2430.03loss--
2q37.1232,156,839232,246,17189,3330.03gainC2orf57-
3p26.18,096,025811,997423,9500.03loss--
3q25.33161,089,639161,131,30941,6710.03gainSCHIP1, IQCJ-SCHIP1SCHIP1
4q22.294,375,62594,778,350402,7260.03lossGRID2GRID2
4q23100,542,893100,696,485153,5930.03gainRG9MTD2, C4orf17, ADH7RG9MTD2
100,651,864100,686,03434,1710.03gainC4orf17C4orf17
100,656,576100,695,57238,9970.03gainC4orf17, RG9MTD2C4orf17, RG9MTD2
4q31.23149,474,120149,512,76738,6480.03lossNR3C2-
4q32.3169,960,297169,996,57936,2830.03lossPALLDPALLD
169,961,956169,979,04017,0850.03lossPALLD-
6p12.251,564,00051,583,51919,5200.06loss--
7p22.23,073,6053,094,44920,8450.03gain--
8p23.22,672,7532,738,23365,4810.03gain--
2,672,7532,974,287301,5350.03gainCSMD1CSMD1
2,675,4722,766,57891,1070.03gain--
2,780,1462,947,279167,1340.03gainCSMD1CSMD1
8q24.3143,097,891143,112,52414,6340.03loss--
9p2310,734,1351,0754,09719,9630.03gain--
9p21.128,746,34428,839,94993,6060.03loss--
9q21.3386,893,36586,953,89660,5320.03loss--
10q23.182,374,11482,389,57715,4640.03lossSH2D4B-
14q13.132,721,47132,746,95425,4840.03lossNPAS3-
14q32.1290,730,51490,754,44323,9300.03gainC14orf159C14orf159
18p11.23, 18p11.316,863,3547,424,32956,09760.03gainLRRC30, LAMA1, ARHGAP28, LOC400643ARHGAP28
18q21.3358,951,16858,970,36319,1960.03gainBCL2-
19p13.34,795,1524,837,55042,3990.09gainPLIN3PLIN3
20p132,436,2942,547,940111,6470.03gainZNF343, TMC2ZNF343, TMC2
2,436,2942,555,429119,1360.03gainZNF343, TMC2ZNF343, TMC2
2,436,2942,556,157119,8640.03gainZNF343, TMC2ZNF343, TMC2
21q21.223,740,93123,752,18911,2590.06loss--
21q22.343,121,09343,286,636165,5440.03gainNDUFV3, WDR4, PKNOX1PKNOX1

Position of CNVs were coordinated based on Human Genome Assembly NCBI (hg18).

*CNV frequencies calculated based on the 34 Negrito genomes genotyped.

Figure 4

Length distribution of the CNVs unique to the Negrito from Peninsular Malaysia.

Position of CNVs were coordinated based on Human Genome Assembly NCBI (hg18). *CNV frequencies calculated based on the 34 Negrito genomes genotyped. To confirm the uniqueness of these CNVs in Negritos, we further compared our dataset with the metropolitan Chinese, Indians and Malays from SGVP. Seven CNVRs were covered in SGVP but none of these putative CNVs we found had been previously reported.

Gene Ontology and Pathway Analyses

To understand the putative functional implications of these CNVs, we performed the Gene Ontology (GO) and pathway analyses on the gene set within the Negrito-specific CNVs using PANTHER and DAVID (Figure 5). Of the 48 CNVs specific to Negritos, 29 carried annotated genes while the remaining were gene-poor regions (Table 4). For all the CNVRs enriched with genes, copy number gains were significantly higher than the losses (15 gain versus 6 losses) (P<0.001). GO analysis by PANTHER revealed fourteen genes involved in immune system function and regulation, response to stimulus and metabolic pathways; whereas DAVID revealed that transcription regulation, and regulation of RNA metabolic processes to be the most significant GO term. The list of genes involved in the major biological processes is listed in Table 5.
Figure 5

Gene Ontology and pathway analyses on the gene set within the Negrito-specific CNVs using PANTHER and DAVID.

(a) PANTHER analysis suggests a major involvement of the genes harboring the population specific CNVs in the immune system process and response to stimulus, as well as the metabolic process; (b) DAVID analysis suggests the involvement of the genes harboring the population specific CNVs in the transcription and regulation of RNA metabolic processes.

Table 5

Pathways and biological processes of the genes underlying the population specific CNvs in Negrito from Peninsular Malysia.

Pathways/Biological functionsGO TermGenes
Immune systems and processesGO:0002376TNFRSF8, CSMD1, SH2D4B, TNFRSF1B, LRRC30
Response to stimulusGO:0050896TNFRSF8, CSMD1, SH2D4B, TNFRSF1B
System processGO:0003008SCHIP1, LAMA1, GRID2, PALLD
Metabolic processesGO:0008152NDUFV3, WDR4, NPAS3, ADH7, PKNOX1
Cellular processesGO:0009987TNFRSF8, TNFSR1B, LAMA1, GRID2
Cell communicationGO:0007154TNFRSF8, TNFSR1B, LAMA1, GRID2
TranscriptionGO:0006350NPAS3, NR3C2, ZNF343
Regulation of transcription, DNA-dependentGO:0006355PBX1, NPAS3, NR3C2, ZNF343
Regulation of RNA metabolic processGO:0051252PBX1, NPAS3, NR3C2, ZNF343

Analysis was performed using PANTHER DAVID.

Gene Ontology and pathway analyses on the gene set within the Negrito-specific CNVs using PANTHER and DAVID.

(a) PANTHER analysis suggests a major involvement of the genes harboring the population specific CNVs in the immune system process and response to stimulus, as well as the metabolic process; (b) DAVID analysis suggests the involvement of the genes harboring the population specific CNVs in the transcription and regulation of RNA metabolic processes. Analysis was performed using PANTHER DAVID.

Discussion

It is estimated that approximately 96% of the current genome-wide association studies were conducted on individuals of European ancestry [37]. There is a growing need to unveil the spectrum of human genetic diversity by investigating minority populations, for instance the aboriginal populations in Southeast Asia (SEA) countries. The Negrito populations from Peninsular Malaysia are of interest, as they are known to be the descendants of earliest migrants to Southeast Asia. Due to their relatively long period of isolation and semi-nomadic lifestyles, they have had less exposure to urbanization. Their genomes are therefore perceived to be considerably less diverse owing to genetic drift and possibly founder effects. This makes them ideal for investigating genetic forces acting in human evolution, which provides fundamental knowledge to inform disease-based genetic studies as well as gene mapping. In this study, we identified 263 CNVRs in 34 Negrito subjects from Peninsular Malaysia, of which 27 we believe are novel and unique to Negritos. After excluding the small CNVs, an average 23 CNVs was observed per Negrito genome. It was found to have more losses (72.6%) than gains, in line with most reported studies [12], [29], [32]. Overall size of the CNV observed also corresponded well. Approximately 58% of the CNVs found in Negrito were <30 kb, in line with reports by Yim et al. [27] on the Korean genomes and Ku et al. [32]; but was relatively higher than the reported by Zhang et al. [30] and McElroy et al. [33]. The average number of CNVs detected in the HapMap3 dataset (average CNV call per genome  = 102.2) (data not shown) and the Chinese populations (average CNV call per genome  = 140.9) [29] were much higher. The number of novel CNVRs identified in Negrito was also lower (0.85 per genome) than those previously reported [29]–[30], [32]–[33]. This is expected as we have excluded all the small CNVs <10 kb from our analyses in this study (comprised ∼30.8% of the total CNVs identified). Moreover, more populations being genotyped, the CNV map gets more saturated consequently hence less novel variants are observed. Collectively we observed less CNVRs but more alleles (CNVs) in the Negrito genomes. Though in general, the CNV profile of Negrito genomes looks similar to those reported especially by Ku et al. [32] in three other SEA populations except for the X-chromosome which was not considered in our study. The variation of the number of CNVs detected could be attributed to several reasons: i) the technology applied for CNV detection and its resolution; ii) levels of stringency applied when performing the CNV call; iii) the algorithms applied when performing CNV call; iv) we excluded the X-chromosome, telomeric and centromeic CNVs. The application of three independent CNV algorithms would minimize the false positive result rates, as evidenced by Pinto et al. [38]. The poor validation rate for the small CNVs (<10 kb) could be attributed to several reasons: (i) poor signal to noise ratio of the samples thus leading to false positive calls by the algorithms; and (ii) inaccurate estimation of breakpoints for the small CNVs due to the limitation of the probe density, thus leading to inaccuracy when identifying a precise CNV during qPCR validation. Therefore precautious should be taken when analysing the small CNVs. Collectively, our approach would increase the confidence of higher quality calls, at an expense of fewer positives being called. Although the number of CNVs was relatively lower then the previous studies, this report is considerably more stringent and with a higher confidence level. We believe more novel CNVRs unique to Negrito could be identified if larger sample sizes were to be investigated. Interestingly, the CNVRs enriched with genes showed a significantly higher copy number gains. In addition to that, these genes were known to be involved in immunity and response to stimuli, as well as metabolic pathways. We speculate that the Negrito may have undergone processes of local adaptation and positive selection, which necessitated their expansions and eventual settlement in forest habitats. This hypothesis is supported by several previously reported studies [24], [30], [39]. However, possibilities of other processes such genetic drift due to random duplications or deletions should not be ruled out [40]. Nevertheless, further investigations should be carried out to confirm the findings. The health of Negritos has not been studied comprehensively for several decades and there are few recent publications [9]. However, early studies indicated that Negritos were under various medical stresses especially with high prevalence of communicable diseases including malaria, tuberculosis, leptospirosis and various intestinal infections [41]. This could be attributed to their life style in the early days, whereby the hunting-gathering activities were practiced hence are exposed to a variety of transmissible diseases. Malnutrition has been reported to be common amongst aboriginal communities, especially women [42]. Although there are no specific reports on the nutritional status on Negritos of late, our observations and direct communications with the Negrito tribes lead us to believe that the majority is undernourished. Although we cannot provide unequivocal evidence, it is conceivable that their biomedical stresses experienced in over the years resulted in the enrichment of selected genes in these Negrito specific CNVs. This is the first study of genome-wide CNVs in the Negrito population from Peninsular Malaysia. We identified putative novel CNVs unique to the Negrito populations from Peninsular Malaysia. Although the smaller sample size does not allow us to perform functional and statistical analysis, our data was analyzed with most stringent QC criteria and were then compared with a number of datasets including DGV, HapMap3 and the SGVP. As such, we think our data is highly reliable. Population studies to catalogue the patterns, frequencies and distribution of CNV in non-disease based cohort is crucial to provide fundamental understanding of its impact to the phenotypic diversity and disease susceptibility. Hence, characterization of more diverse populations is needed to improve the saturation of CNV map for human genome. To this end, we are continuing our investigations amongst the indigenous in Malaysia.

Summary

Our findings provide fundamental knowledge, different perspectives and insights to the genetic diversity of Negritos of Peninsular Malaysia. This can inform studies of local adaptation, natural selection and also potentially influence health programmes in the near future.
  37 in total

1.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

2.  Mapping human genetic diversity in Asia.

Authors:  Mahmood Ameen Abdulla; Ikhlak Ahmed; Anunchai Assawamakin; Jong Bhak; Samir K Brahmachari; Gayvelline C Calacal; Amit Chaurasia; Chien-Hsiun Chen; Jieming Chen; Yuan-Tsong Chen; Jiayou Chu; Eva Maria C Cutiongco-de la Paz; Maria Corazon A De Ungria; Frederick C Delfin; Juli Edo; Suthat Fuchareon; Ho Ghang; Takashi Gojobori; Junsong Han; Sheng-Feng Ho; Boon Peng Hoh; Wei Huang; Hidetoshi Inoko; Pankaj Jha; Timothy A Jinam; Li Jin; Jongsun Jung; Daoroong Kangwanpong; Jatupol Kampuansai; Giulia C Kennedy; Preeti Khurana; Hyung-Lae Kim; Kwangjoong Kim; Sangsoo Kim; Woo-Yeon Kim; Kuchan Kimm; Ryosuke Kimura; Tomohiro Koike; Supasak Kulawonganunchai; Vikrant Kumar; Poh San Lai; Jong-Young Lee; Sunghoon Lee; Edison T Liu; Partha P Majumder; Kiran Kumar Mandapati; Sangkot Marzuki; Wayne Mitchell; Mitali Mukerji; Kenji Naritomi; Chumpol Ngamphiw; Norio Niikawa; Nao Nishida; Bermseok Oh; Sangho Oh; Jun Ohashi; Akira Oka; Rick Ong; Carmencita D Padilla; Prasit Palittapongarnpim; Henry B Perdigon; Maude Elvira Phipps; Eileen Png; Yoshiyuki Sakaki; Jazelyn M Salvador; Yuliana Sandraling; Vinod Scaria; Mark Seielstad; Mohd Ros Sidek; Amit Sinha; Metawee Srikummool; Herawati Sudoyo; Sumio Sugano; Helena Suryadi; Yoshiyuki Suzuki; Kristina A Tabbada; Adrian Tan; Katsushi Tokunaga; Sissades Tongsima; Lilian P Villamor; Eric Wang; Ying Wang; Haifeng Wang; Jer-Yuarn Wu; Huasheng Xiao; Shuhua Xu; Jin Ok Yang; Yin Yao Shugart; Hyang-Sook Yoo; Wentao Yuan; Guoping Zhao; Bin Alwi Zilfalil
Journal:  Science       Date:  2009-12-11       Impact factor: 47.728

3.  Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants.

Authors:  Dalila Pinto; Katayoon Darvishi; Xinghua Shi; Diana Rajan; Diane Rigler; Tom Fitzgerald; Anath C Lionel; Bhooma Thiruvahindrapuram; Jeffrey R Macdonald; Ryan Mills; Aparna Prasad; Kristin Noonan; Susan Gribble; Elena Prigmore; Patricia K Donahoe; Richard S Smith; Ji Hyeon Park; Matthew E Hurles; Nigel P Carter; Charles Lee; Stephen W Scherer; Lars Feuk
Journal:  Nat Biotechnol       Date:  2011-05-08       Impact factor: 54.908

4.  De novo rates and selection of large copy number variation.

Authors:  Andy Itsara; Hao Wu; Joshua D Smith; Deborah A Nickerson; Isabelle Romieu; Stephanie J London; Evan E Eichler
Journal:  Genome Res       Date:  2010-09-14       Impact factor: 9.043

5.  The 'human revolution' in lowland tropical Southeast Asia: the antiquity and behavior of anatomically modern humans at Niah Cave (Sarawak, Borneo).

Authors:  Graeme Barker; Huw Barton; Michael Bird; Patrick Daly; Ipoi Datan; Alan Dykes; Lucy Farr; David Gilbertson; Barbara Harrisson; Chris Hunt; Tom Higham; Lisa Kealhofer; John Krigbaum; Helen Lewis; Sue McLaren; Victor Paz; Alistair Pike; Phil Piper; Brian Pyatt; Ryan Rabett; Tim Reynolds; Jim Rose; Garry Rushworth; Mark Stephens; Chris Stringer; Jill Thompson; Chris Turney
Journal:  J Hum Evol       Date:  2006-10-01       Impact factor: 3.895

6.  Genomic drift and copy number variation of sensory receptor genes in humans.

Authors:  Masafumi Nozawa; Yoshihiro Kawahara; Masatoshi Nei
Journal:  Proc Natl Acad Sci U S A       Date:  2007-12-05       Impact factor: 11.205

Review 7.  Mutational and selective effects on copy-number variants in the human genome.

Authors:  Gregory M Cooper; Deborah A Nickerson; Evan E Eichler
Journal:  Nat Genet       Date:  2007-07       Impact factor: 38.330

8.  Genetic and fossil evidence for the origin of modern humans.

Authors:  C B Stringer; P Andrews
Journal:  Science       Date:  1988-03-11       Impact factor: 47.728

9.  Psoriasis is associated with increased beta-defensin genomic copy number.

Authors:  Edward J Hollox; Ulrike Huffmeier; Patrick L J M Zeeuwen; Raquel Palla; Jesús Lascorz; Diana Rodijk-Olthuis; Peter C M van de Kerkhof; Heiko Traupe; Gys de Jongh; Martin den Heijer; André Reis; John A L Armour; Joost Schalkwijk
Journal:  Nat Genet       Date:  2007-12-02       Impact factor: 38.330

10.  Bias of selection on human copy-number variants.

Authors:  Duc-Quang Nguyen; Caleb Webber; Chris P Ponting
Journal:  PLoS Genet       Date:  2006-02-17       Impact factor: 5.917

View more
  5 in total

1.  A genome-wide characterization of copy number variations in native populations of Peninsular Malaysia.

Authors:  Ruiqing Fu; Siti Shuhada Mokhtar; Maude Elvira Phipps; Boon-Peng Hoh; Shuhua Xu
Journal:  Eur J Hum Genet       Date:  2018-02-23       Impact factor: 4.246

2.  A 3.4-kb Copy-Number Deletion near EPAS1 Is Significantly Enriched in High-Altitude Tibetans but Absent from the Denisovan Sequence.

Authors:  Haiyi Lou; Yan Lu; Dongsheng Lu; Ruiqing Fu; Xiaoji Wang; Qidi Feng; Sijie Wu; Yajun Yang; Shilin Li; Longli Kang; Yaqun Guan; Boon-Peng Hoh; Yeun-Jun Chung; Li Jin; Bing Su; Shuhua Xu
Journal:  Am J Hum Genet       Date:  2015-06-11       Impact factor: 11.025

3.  Differential positive selection of malaria resistance genes in three indigenous populations of Peninsular Malaysia.

Authors:  Xuanyao Liu; Yushimah Yunus; Dongsheng Lu; Farhang Aghakhanian; Woei-Yuh Saw; Lian Deng; Mohammad Ali; Xu Wang; Fadzilah Mohd Nor; Fadzilah Ghazali; Thuhairah Abdul Rahman; Shahrul Azlin Shaari; Mohd Zaki Salleh; Maude E Phipps; Rick Twee-Hee Ong; Shuhua Xu; Yik-Ying Teo; Boon-Peng Hoh
Journal:  Hum Genet       Date:  2015-01-30       Impact factor: 5.881

4.  A map of copy number variations in the Tunisian population: a valuable tool for medical genomics in North Africa.

Authors:  Lilia Romdhane; Nessrine Mezzi; Hamza Dallali; Olfa Messaoud; Jingxuan Shan; Khalid A Fakhro; Rym Kefi; Lotfi Chouchane; Sonia Abdelhak
Journal:  NPJ Genom Med       Date:  2021-01-08       Impact factor: 8.617

5.  Rare Copy Number Variants Identified Suggest the Regulating Pathways in Hypertension-Related Left Ventricular Hypertrophy.

Authors:  Hoh Boon-Peng; Julia Ashazila Mat Jusoh; Christian R Marshall; Fadhlina Majid; Norlaila Danuri; Fashieha Basir; Bhooma Thiruvahindrapuram; Stephen W Scherer; Khalid Yusoff
Journal:  PLoS One       Date:  2016-03-01       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.