Kobus Herbst, Sanjay Juvekar, Tathagata Bhattacharjee, Martin Bangha, Nidhi Patharia, Titus Tei, Brendan Gilbert, Osman Sankoh.
Abstract
The International Network for the Demographic Evaluation of Populations and Their Health (INDEPTH) is a global network of research centers that conduct longitudinal health and demographic evaluation of populations in low- and middle-income countries (LMICs). The centers currently work in 52 health and demographic surveillance system (HDSS) field sites situated in sub-Saharan Africa (14 countries), Asia (India, Bangladesh, Thailand, Vietnam, and Indonesia), and Oceania (Papua New Guinea). Through this network of HDSS field sites, INDEPTH is capable of producing reliable longitudinal data about the lives of people in the research communities as well as how development policies and programs affect those lives. The aim of the INDEPTH Data Repository is to enable INDEPTH member centers and associated researchers to contribute and share fully documented, high-quality datasets with the scientific community and health policy makers.
Keywords: data repository; data sharing; health and demographic surveillance; metadata; research data management
Year: 2015 PMID: 26297754 PMCID: PMC4547208 DOI: 10.1177/1556264615594600
Source DB: PubMed Journal: J Empir Res Hum Res Ethics ISSN: 1556-2646 Impact factor: 1.742
HDSSs That Have Contributed Core Micro Datasets to the INDEPTH Data Repository.
| Network HDSS | Country | Period (from-to) | Current population | Total population | Person years |
|---|---|---|---|---|---|
| Ouagadougou (BF041) | Burkina Faso | 2009-2012 | 82,124 | 121,728 | 381,195 |
| Taabo (CI011) | Côte d’Ivoire | 2009-2012 | 40,625 | 62,331 | 194,054 |
| Gilgel Gibe (ET021) | Ethiopia | 2006-2012 | 58,150 | 73,825 | 402,338 |
| Kilite Awlaelo (ET031) | Ethiopia | 2010-2012 | 64,549 | 74,794 | 210,738 |
| Kersa (ET041) | Ethiopia | 2008-2012 | 60,262 | 67,247 | 294,872 |
| Dabat (ET051) | Ethiopia | 2009-2012 | 43,782 | 51,908 | 186,398 |
| Vadu (IN021) | India | 2009-2012 | 138,072 | 189,075 | 582,408 |
| Kilifi (KE011) | Kenya | 2003-2012 | 256,317 | 547,542 | 3,937,309 |
| Kisumu (KE021) | Kenya | 2003-2012 | 246,403 | 427,815 | 2,796,712 |
| Nairobi (KE031) | Kenya | 2002-2012 | 66,428 | 190,862 | 1,273,514 |
| Mbita (KE041) | Kenya | 2009-2012 | 59,790 | 76,212 | 260,878 |
| Kombewa (KE051) | Kenya | 2011-2012 | 140,312 | 145,814 | 194,338 |
| Karonga (MW011) | Malawi | 2003-2012 | 36,739 | 59,774 | 430,316 |
| IRD–Mlomp (SN012) | Senegal | 1990-2012 | 8,416 | 16,705 | 269,617 |
| IRD–Niakhar (SN013) | Senegal | 1983-2012 | 42,592 | 77,715 | 1,429,640 |
| Agincourt (ZA011) | South Africa | 1992-2012 | 98,923 | 210,384 | 2,696,271 |
| Dikgale (ZA021) | South Africa | 1995-2012 | 37,182 | 46,851 | 281,920 |
| The Africa Centre for Health and Population Studies (ZA031) | South Africa | 2000-2012 | 71,813 | 138,964 | 1,377,854 |
| Ifakara Health Institute–Ifakara Rural (TZ011) | Tanzania | 1997-2012 | 130,256 | 279,450 | 2,415,177 |
| Ifakara Health Institute–Rufiji (TZ012) | Tanzania | 1998-2012 | 93,441 | 172,144 | 1,554,215 |
| Ifakara Health Institute–Ifakara Urban (TZ013) | Tanzania | 2008-2012 | 40,380 | 66,722 | 267,860 |
| Magu (TZ021) | Tanzania | 1994-2012 | 33,058 | 105,632 | 1,117,087 |
| Iganga/Mayuge (UG011) | Uganda | 2005-2012 | 77,113 | 123,052 | 710,676 |
| Hanoi Medical University–Filabavi (VN012) | Vietnam | 1999-2012 | 51,817 | 75,839 | 865,450 |
| Chililab (VN021) | Vietnam | 2004-2012 | 53,399 | 74,491 | 547,633 |
| Total | | | 2,031,943 | 3,476,876 | 24,678,471 |
Note. HDSS = Health and Demographic Surveillance System; INDEPTH = International Network for the Demographic Evaluation of Populations and Their Health. IRD = L’Institut de recherche pour le développement.
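Network HDSS: the text in parentheses is the center code that identifies the HDSS in the dataset.
Current population: the population under observation at the end of the reporting period.
Total population: the total number of individuals who have contributed to the person years of exposure.
Kilifi (KE011) and Kisumu (KE021): awaiting data use approval for placement on the repository.
The Africa Centre for Health and Population Studies (ZA031), current population: resident population only.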
Standard Metadata Template for INDEPTH Data Repository Datasets.
| Section | Description |
|---|---|
| Document description | This section contains information about the metadata itself, which is the DDI document used to describe the dataset. |
| Title | Contains the full authoritative title of the DDI document. Equivalent to Dublin Core Title. |
| DDI document ID number | A unique identifier for the DDI documentation file. The document ID is constructed as follows: |
| Metadata producer | Name of the person(s) or organization(s) who documented the dataset. |
| Date of production | Date the marked-up document was produced (not distributed or archived). Equivalent to Dublin Core Date. |
| DDI document version | A version number and description of this version of the document |
| Version notes | Additional information regarding the version, in particular to indicate what makes a new version different from its predecessor. |
| Study description | This section contains information about the study or data collection that is the source of the dataset/s being documented and shared. This section includes information about how the study should be cited, who collected or compiled the data, who distributes the data, keywords about the content of the data, summary (abstract) of the content of the data, data collection methods and processing |
| Identification | Citation for the data collection/study described by the metadata. |
| Title | Contains the full authoritative title of the data collection. The title will in most cases be identical to the Document Title (see above) |
| ID number | The ID number of a dataset is a unique number that is used to identify that dataset. This number forms the basis of the doi associated with the dataset and is identical to the suffix of the doi. It is of the form: |
| Study type | A broad category defining the type of survey or study, e.g., demographic surveillance, sample survey, clinical trial, etc. |
| Series information | If the dataset is part of a network program or working group, the name of that program or working group. |
| Version | Identify substantive changes to the dataset/s. |
| Description | A version number followed by a version label. |
| Production date | The date of this version. |
| Notes | Additional information regarding the version, in particular to indicate what makes the new dataset different from its predecessor. |
| Overview | |
| Abstract | A summary describing the purpose, nature, and scope of the data collection, special characteristics of its contents, major subject areas covered, and what questions the PIs attempted to answer when they conducted the study. |
| Kind of data | The type of data included in the dataset |
| Unit of analysis | Basic unit(s) of analysis or observation that the study describes |
| Description of scope | A description of the themes covered by the survey. It can be viewed as a summary of the modules that are included in the questionnaire. |
| Topic classifications | The classification field indicates the broad substantive topic(s) that the data cover. The INDEPTH Data Repository makes use of Medical Subject Headings (MeSH) as a controlled vocabulary. |
| Coverage | Information about a study’s chronological and geographic coverage |
| Country | Indicates the country or countries covered in the dataset. |
| Geographic coverage | Information on the geographic coverage of the data. Include the total geographic scope of the data, and any additional levels of geographic coding provided in the variables. Maps to Dublin Core Coverage. |
| Universe | A description of the population covered by the data in the file; the group of persons or other elements that are the object of the study and to which the study results refer. Age, nationality, and residence commonly help to delineate a given universe, but any of a number of factors may be involved, such as age limits, sex, marital status, race, ethnic group, nationality, income, etc. |
| Producers and sponsors | |
| Investigators | The persons, corporate body, or agency responsible for the data collection’s substantive and intellectual content. |
| Other producers | This field is provided to list other interested parties and persons that have played a significant but not the leading technical role in implementing and producing the data. |
| Funding | The source(s) of funds for production of the data collection. |
| Other acknowledgments | This mandatory field is used to acknowledge the data managers involved in producing the dataset. |
| INDEPTH member center | The INDEPTH member center/site of origin. If multi-centre datasets are released as a single unit, then this field will be set to INDEPTH Network. |
| Sampling | |
| Sampling procedure | The type of sample and sample design used to select the survey respondents to represent the population. |
| Response rates | The percentage of sample members who provided information. |
| Data collection | |
| Dates of collection | Contains the date(s) when the data were collected. Provide details of the start and end date of each data collection. |
| Time periods | The time periods covered by the data, not the dates of coding or making documents machine-readable or the dates the data were collected. |
| Frequency of data collection | If the data were collected at more than one point in time, the frequency with which the data were collected. In the case of demographic surveillance sites the number of data collection rounds per year. |
| Mode of data collection | The method used to collect the data |
| Notes on data collection | Used to describe noteworthy aspects of the data collection situation. Include information on factors such as cooperativeness of respondents, duration of interviews, number of call-backs, etc. |
| Questionnaires | The questionnaire(s) used for the data collection. |
| Data collectors | Information regarding the persons and/or agencies that took charge of the data collection |
| Supervision | Information on the oversight of the data collection |
| Data processing | |
| Data editing | Information on how the data were treated or controlled for in terms of consistency and coherence |
| Other processing | As much information as possible on the data entry design, including details such as: |
| Data appraisal | |
| INDEPTH data quality metrics | A listing of the INDEPTH quality metrics (provided in the controlled vocabulary) and the measured value of the quality metric. |
| Data access | |
| Access authority | The contact person or entity to gain authority to access the data. This field is only applicable if the data have restricted access. Most datasets have direct access and can be downloaded without requesting special permission. |
| Access conditions | Access to INDEPTH Network data is governed by the INDEPTH Data Access and Sharing policy |
| Citation requirement | The way that the dataset should be referenced when cited in any publication. Includes a DOI that must be quoted when the dataset is cited. |
| Disclaimer and copyright | Information regarding responsibility for uses of the data collection and the copyright statement for the data collection. |
| File description | Consists of information about the particular data file containing numeric and/or numeric + textual information. The data fingerprint of the data file is included as part of this metadata. |
| Variable description | Consists of elements allowing for detailed descriptive information about each variable in the dataset. This includes information about response and analysis units, question text, interviewer instructions, universe, valid and invalid data ranges, derived variables, and summary statistics |
Note. INDEPTH = International Network for the Demographic Evaluation of Populations and Their Health; DDI = Data Documentation Initiative.
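The top-level sections of the template (document, study, file, and variable description) can be pictured as a nested record. The Python sketch below is illustrative only: the key names, the `REQUIRED_SECTIONS` list, and the `missing_sections` and `fingerprint` helpers are assumptions made for this example and are not part of the DDI standard or the repository software. It uses a SHA-256 digest as a stand-in for the data fingerprint mentioned under File description, because the template does not say which fingerprint algorithm the repository uses.

```python
import hashlib

# Top-level sections of the template (key names are hypothetical).
REQUIRED_SECTIONS = [
    "document_description",
    "study_description",
    "file_description",
    "variable_description",
]


def missing_sections(record: dict) -> list[str]:
    """Return the required top-level sections that are absent or empty."""
    return [name for name in REQUIRED_SECTIONS if not record.get(name)]


def fingerprint(data: bytes) -> str:
    """Hex SHA-256 digest of the file contents (assumed stand-in algorithm)."""
    return hashlib.sha256(data).hexdigest()


# Placeholder values only; none of these describe a real dataset.
example_record = {
    "document_description": {
        "title": "Example DDI document",
        "metadata_producer": "Example center",
    },
    "study_description": {
        "study_type": "demographic surveillance",
        "country": ["Example country"],
    },
    "file_description": {"fingerprint": fingerprint(b"id,sex\n1,F\n")},
    # "variable_description" is deliberately left out to show the check.
}

print(missing_sections(example_record))
```

Run as written, the check reports `variable_description` as the only missing section. A real validator would also descend into the subsections listed in the template, such as the Other acknowledgments field, which the template describes as mandatory.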
Figure 1. Dataset production process.
INDEPTH Data Access Policy Levels Compared With the Built-In Access Levels on the Repository.
| INDEPTH data access policy level | INDEPTH data repository access level | Description |
|---|---|---|
| Data not available | Not applicable | |
| Open access | Direct access data files | The user is not required to be logged into the site and no personal information is collected on the person downloading the data. |
| Licensed access | Public use data files | The user must be logged in and registered on the site before they are able to download the data. The user is required to agree to the terms of use of the data, and the repository keeps a record of who downloads the data. |
| Restricted licensed access | Licensed data files | Users are required to fill in and submit a detailed application form listing their reasons for wanting access to the data. Once the user submits the application form the system informs the system administrator that an application has been made. For the person to get access to the data, the system administrator needs to review the application and approve it. |
| Closed access | Data available in an enclave | No data are shared on the repository. Users submit an application to access the data on-site at the submitting INDEPTH member center. |
| | Data available from external repository | The repository allows studies and their metadata to be listed on the repository, with a link to another site where the data reside. |
Note. INDEPTH = International Network for the Demographic Evaluation of Populations and Their Health.
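The correspondence between policy levels and repository access levels can be expressed as a simple lookup. The Python sketch below restates the table rows that name a policy level; the dictionary names, the `describe` helper, and the wording of the steps are assumptions made for illustration and do not reflect how the repository software is implemented.

```python
# INDEPTH policy level -> built-in repository access level, as listed in the table.
REPOSITORY_LEVEL = {
    "Data not available": "Not applicable",
    "Open access": "Direct access data files",
    "Licensed access": "Public use data files",
    "Restricted licensed access": "Licensed data files",
    "Closed access": "Data available in an enclave",
}

# Steps a user completes before downloading from the repository.
# None means the data cannot be downloaded from the repository at all.
DOWNLOAD_STEPS = {
    "Data not available": None,
    "Open access": [],
    "Licensed access": [
        "log in as a registered user",
        "agree to the terms of use",
    ],
    "Restricted licensed access": [
        "submit a detailed application form",
        "obtain system administrator approval",
    ],
    "Closed access": None,
}


def describe(policy_level: str) -> str:
    """Summarize what a user must do to download data at a policy level."""
    repo_level = REPOSITORY_LEVEL[policy_level]
    steps = DOWNLOAD_STEPS[policy_level]
    if steps is None:
        return f"{policy_level} ({repo_level}): no download from the repository"
    if not steps:
        return f"{policy_level} ({repo_level}): download directly"
    return f"{policy_level} ({repo_level}): " + ", then ".join(steps)


for level in REPOSITORY_LEVEL:
    print(describe(level))
```

Keeping the level mapping separate from the download steps mirrors the table, where the second column names the built-in level and the third describes the user's obligations.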
Dataset Download by Region Between July 1, 2013, and June 29, 2015.
| Region | Number of downloads |
|---|---|
| Africa | 225 |
| Asia | 110 |
| Europe | 171 |
| North America | 217 |
| Other | 1 |
| Total | 724 |
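As a quick arithmetic check, the regional counts can be summed and converted to shares of all downloads. The Python sketch below uses only the figures from the table; the variable names are illustrative.

```python
# Download counts by region, copied from the table above.
downloads = {
    "Africa": 225,
    "Asia": 110,
    "Europe": 171,
    "North America": 217,
    "Other": 1,
}

total = sum(downloads.values())  # should equal the reported total of 724
shares = {region: count / total for region, count in downloads.items()}

for region, share in shares.items():
    print(f"{region}: {share:.1%}")
print(f"Total downloads: {total}")
```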