Literature DB >> 17597891

GCSDB: an integrated database system for the Georgia Centenarian Study.

Jianliang Dai1, Adam Davey, Ilene C Siegler, Jonathan Arnold, Leonard W Poon.   

Abstract

UNLABELLED: GCSDB is a web-oriented integrated database system for the Georgia Centenarian Study, a phase III, population-based, multidisciplinary study of centenarians. The Study recruited 244 centenarians and near-centenarians (age 98 and older), 80 octogenarians and 400 young controls in Northern Georgia. GCSDB incorporates more than 40 relational tables containing data about the participants including demographics, family longevity, physical health, cognition, neuropsychology, mental health, neuropathology, functional capacity, and genetics. The GCSDB web site includes detailed information about these tables and functions for genetic and other kinds of data analysis. More data and functions will be added as the study progresses. GCSDB provides a resource that could be used to identify what biological, psychological, and social factors as well as their epistatic interactions help these centenarians achieve long life. AVAILABILITY: http://qa.genetics.uga.edu (login information can be obtained from authors).

Entities:  

Year:  2006        PMID: 17597891      PMCID: PMC1891685     

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Over the past two decades, interest in study of centenarians has increased steadily [1,2,3], and several countries have initiated their own centenarian studies, such as the United States [4,5,6], Japan [7], Italy [8], Hungary [9], France [10 ], Sweden [11], Finland, and Denmark. [ 12] The fundamental question is how centenarians live longer and what specific biological, psychological, and sociological factors help centenarians survive to become the oldest of the old. [1,2 ,13] Bringing together these data from multiple disciplines in ways that are simple and useful is a common problem that must be solved for research to progress. Phase 3 of the Georgia Centenarian Study (2001-2007) is a multi-disciplinary program project designed to identify further the roles of biological, psychological, and social factors as well as their epistatic interactions contributing to the successful late-life aging. [2] Four specific projects within the study are to examine: 1) genetic structure of the Georgia population with a focus on the oldest-old; 2) relations between dementia and neuropathology among centenarians with additional neuropsychological data collected for persons who give permission for autopsy; 3) determinants of functional capacity from neuropsychological, cognitive, health, and demographic variables; and 4) predictors differentiating centenarians who are independent, healthy, and experience a sense of well-being from those who are dependent, frail, and do not experience a sense of wellbeing. A population-based sample of individuals was recruited from 44 counties in Northeast Georgia, consisting of 244 centenarians and near-centenarians (age 98 and older), 80 octogenarians primarily for the cognitive and adaptation studies, and 400 controls in equal numbers from the 2nd to 5th decades of life for the genetic studies. Data were collected using standardized data forms and protocols by interviewers from the Data Acquisition Core, then scanned, checked, corrected, verified, and saved as separate electronic files according to the source of the data using the Teleform software package. [14] For ease of data management and data sharing in a multidisciplinary and multi-institutional setting, a weboriented, integrated database-GCSDB-was established to serve as a resource for effective collaborative research. This changes the paradigm for most social science studies. In this paper, we report on the status of GCSDB and highlight the database table features and web interface functions that were incorporated or developed for genetic data analysis as well as for other data analyses. The database and its interface are designed to promote cross-project questions and analyses of the data.

Methodology

Database Implementation

GCSDB was developed on a Sun microsystems SunBlade with the Solaris 5.8 operating system; database management was Oracle version 10g. GCSDB was constructed following the relational database schema from the data collection instruments and booklets. All tables in GCSDB share a common field-the unique participant ID number as the primary key (GCSID) for linking tables generated by multiple projects and/or cores. Web interface to the database is produced from Java Sever Page (JSP) technology and the Struts framework under the SQL query tool. Web pages are served using the Apache-Tomcat web server version 5.0.30. In addition to the relational database, underlying statistical analyses were implemented via Java or FORTRAN.

Database Content

GCSDB contains more than 40 relational tables for 4 interrelated projects. Number of columns in these tables ranges from 23 to 519. Column attributes are scale (double), ordinal (integer), or nominal (String). Each data entry in every table corresponds to one study participant (centenarian, octogenarian, or young control). Depending upon a participant's level of involvement in the project, these tables include the participants' information in the following tests or domain areas: (1) Demographics and Family Longevity. Data include age, date of birth, gender, racial and ethnic backgrounds, handedness, years of education, county and place of residence, and occupational history. Family longevity includes ages and causes of death of mother, father, grandparents, siblings, number of siblings, lifetime residential locations, and military services. (2) Physical Health. The protocol is composed of 3 domains: (a) Physical Examination and Health History, including vital signs, presence/absence of certain diseases, anthropometrics, neurological/musculoskeletal measures and current medications; (b) Blood Chemistry Profile, including glucose, BUN, creatinine, albumin, complete blood count, , C-reactive protein, hemoglobin A1c, ferritin, thyroxine, thyroid stimulating hormone, vitamin B12, and vitamin D; and (c) Physical Function Assessment includes measures of bed mobility, bed to chair transfer skills, standing balance, walking, step-up, and chair standing abilities; (3) Cognition, Neuropsychology, Mental Health, and Neuropathology. Tests and assessments include: the Mini-Mental State Examination (MMSE) [15], Global Deterioration Scale [16], and Severe Impairment Battery [17], Fuld Object Memory Evaluation (FOME) [18], Wechsler Adult Intelligence Scale-III, Similarities sub-test, Letter Number Sequencing sub-test, and Matrix Reasoning subtest [19], Behavioral Dyscontrol Scale (DBS) [20], ILS Health & Safety Scale [21], COWAT [22], Clock Drawing Test [23,24], Geriatric Depression Scale [25], CERAD battery [26], and Clinical Dementia Rating Scale. [27] Magnetic Resonance Imaging data of the voluntarily donated brain tissue were also available for some centenarians who passed-away before the completion of the project. (4) Functional Capacity and Independence. The basic functional capacity was assessed using both a self-report as well as performance-based measures of basic/physical and instrumental activities of daily living (BADL and IADL). Physical functional capacity was measured with NIA Short Performance Battery (SPPB) [28] and Physical Performance and Mobility Examination (PPME). [29] The performance-based measures include selected subtests of the Direct Assessment of Functional Status-Revised (DAFS-R). [30] The measures and Performance Rating Scale was taken from OARS. [ 31] In addition, there are other tables summarizing personality, life events, social support, and economic resources. (5) Genetics. The Single Nucleotide Polymorphism (SNPs) table includes 15 SNPs from three candidate longevity genes, i.e., APOE [32,33], HRAS1 [34], and LASS1/LAG1 [35,36] for all of the participants. The SNPs are located in exon or promoter regions in these genes.

Database Interface

The GCSDB homepage contains three sections (Figure 1): an introduction to the Georgia Centenarian Study; links to web pages for the four individual projects (Figure 2); ways to cross link among the tables (users may retrieve data either by selecting table names and column names or directly by SQL; a plot function is available to obtain a preliminary plot for any two column variables of interest from any table in the study to help investigators choose variables for further statistical analysis; Regression, Principal Component Analysis, and canonical correlation analysis tools are available to identify relationships among set of variables of interest.
Figure 1

Composite of screen displays demonstrating the homepage of GCSDB. (1) A brief introduction of The Georgia Centenarian Study Project with a link to a detailed introduction. (2) Four links to the web pages of four individual projects (see Figure 2 for Project 3). (3) Methods to cross link between four projects. The lower left-hand corner shows the example result of Principal Component Analysis, PCA

Figure 2

Screen display showing the web page for project 3. The page consists of the proposed project 3 research model which connects the potentially interrelated domain areas together. Clicking the domain area picture in the model brings the user to the web page for all the tables in that area

For each table, the web page provides the following information and functions: a dictionary listing the data-type, data length, column name, full name, and each category value for ordinal columns; a PDF file showing frequencies and descriptive statistics for most columns; a raw data table and function giving values for the user selected column for a given participant; missing value counts for every column and a function shows what columns are missing for a given participant; frequency counts for a user selected ordinal column and frequency counts for ordinal columns; maximum, minimum, and mean values and number of participants with values less or larger than mean values for all scale columns; number of participants with values larger or less than a given cut-off value for the user selected scale column; For the SNPs table (Figure 3), the following important information and genetic analyses are available;
Figure 3

A portion of the web page for the SNPs and Demographics and Family Longevity data. (A) Information and functions provided for SNPs data analyses. (B) Information and functions provided for preliminary analyses of Demographics and Family Longevity data as well as other data

SNPs descriptor includes the name, identity, cytogenetic map, chromosomal and physical location for every SNP; the 2 x 2 x 2 contingency table (age, race, SNP allele) and log odds ratios for each SNP [37] ; exact tests of association for each SNP [37] ; exact test of Hardy-Weinberg Law for each SNP; Linkage Disequibrium Test for pair-wise SNPs; haplotype analysis results for SNPs from APOE, HRAS1 and LASS1, respectively and for all the SNPs. [38] Data dictionaries, forms, code books, questionnaires, and other relevant metadata are on the Web site or are being added as requested. The database is passworded to protect the anonymity of centenarians, and centenarian names have been replaced with a GCSID.

Future Development

Data acquisition for functional capacity, and adaptation and resource projects has been completed; genetic data analysis is still under way from the collected blood samples. Neuropathology collection is expected to continue until 2007. GCSDB is updated as data become available. Functions and tools are added as data reduction and analysis begin. The database/Web interface constitutes a virtual resource for interested researchers. At present, GCSDB is only available to our project personnel, but will be released to the research community one year following completion of our project according to the data sharing plan on the Web site. It is our obligation and desire to share data, cell lines, and brain tissues with qualified researchers at that time. This data archive is part of making results available online. We expect that these rare resources can be used to test new hypotheses to gain new knowledge on contributors for the extreme longevity of these centenarians and that our system can serve as a model to other cross-disciplinary projects.
  26 in total

1.  Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data.

Authors:  D Fallin; N J Schork
Journal:  Am J Hum Genet       Date:  2000-08-22       Impact factor: 11.025

2.  Exact sample size needed to detect dependence in 2 x 2 x 2 tables.

Authors:  Jianliang Dai; Li Li; Sangkyu Kim; Beth Kimball; S Michal Jazwinski; Jonathan Arnold
Journal:  Biometrics       Date:  2007-12       Impact factor: 2.571

3.  [Centenarian--a contribution to longevity research].

Authors:  U Lehr
Journal:  Z Gerontol       Date:  1991 Sep-Oct

Review 4.  Biodemographic trajectories of longevity.

Authors:  J W Vaupel; J R Carey; K Christensen; T E Johnson; A I Yashin; N V Holm; I A Iachine; V Kannisto; A A Khazaeli; P Liedo; V D Longo; Y Zeng; K G Manton; J W Curtsinger
Journal:  Science       Date:  1998-05-08       Impact factor: 47.728

5.  A new scale for the assessment of functional status in Alzheimer's disease and related disorders.

Authors:  D A Loewenstein; E Amigo; R Duara; A Guterman; D Hurwitz; N Berkowitz; F Wilkie; G Weinberg; B Black; B Gittelman
Journal:  J Gerontol       Date:  1989-07

6.  An allele of HRAS1 3'variable number of tandem repeats is a frailty allele: implication for an evolutionarily-conserved pathway involved in longevity.

Authors:  Massimiliano Bonafè; Cristiana Barbi; Fabiola Olivieri; Anatoli Yashin; Kirill F Andreev; James W Vaupel; Giovanna De Benedictis; Giuseppina Rose; Giuseppina Carrieri; S Michal Jazwinski; Claudio Franceschi
Journal:  Gene       Date:  2002-03-06       Impact factor: 3.688

7.  [Extended family, immediate family and caregiver contacts of 100-year-old patients in Hungary].

Authors:  O Regius; E Beregi; A Klinger
Journal:  Z Gerontol       Date:  1994 Nov-Dec

8.  The Global Deterioration Scale for assessment of primary degenerative dementia.

Authors:  B Reisberg; S H Ferris; M J de Leon; T Crook
Journal:  Am J Psychiatry       Date:  1982-09       Impact factor: 18.112

9.  The Consortium to Establish a Registry for Alzheimer's Disease (CERAD). Part I. Clinical and neuropsychological assessment of Alzheimer's disease.

Authors:  J C Morris; A Heyman; R C Mohs; J P Hughes; G van Belle; G Fillenbaum; E D Mellits; C Clark
Journal:  Neurology       Date:  1989-09       Impact factor: 9.910

10.  Genetic associations with human longevity at the APOE and ACE loci.

Authors:  F Schächter; L Faure-Delanef; F Guénot; H Rouger; P Froguel; L Lesueur-Ginot; D Cohen
Journal:  Nat Genet       Date:  1994-01       Impact factor: 38.330

View more
  3 in total

1.  Cry-Bt identifier: a biological database for PCR detection of Cry genes present in transgenic plants.

Authors:  Vinay Kumar Singh; Sonu Ambwani; Soma Marla; Anil Kumar
Journal:  Bioinformation       Date:  2009-10-23

2.  Predicting successful aging in a population-based sample of georgia centenarians.

Authors:  Jonathan Arnold; Jianliang Dai; Lusine Nahapetyan; Ankit Arte; Mary Ann Johnson; Dorothy Hausman; Willard L Rodgers; Robert Hensley; Peter Martin; Maurice Macdonald; Adam Davey; Ilene C Siegler; S Michal Jazwinski; Leonard W Poon
Journal:  Curr Gerontol Geriatr Res       Date:  2010-09-14

3.  Assessing physical performance in centenarians: norms and an extended scale from the georgia centenarian study.

Authors:  M Elaine Cress; Yasuyuki Gondo; Adam Davey; Shayne Anderson; Seock-Ho Kim; Leonard W Poon
Journal:  Curr Gerontol Geriatr Res       Date:  2010-09-14
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.