
Capacity building for whole genome sequencing of Mycobacterium tuberculosis and bioinformatics in high TB burden countries.

Emmanuel Rivière, Tim H Heupink, Nabila Ismail, Anzaan Dippenaar, Charlene Clarke, Gemeda Abebe, Peter Heusden, Rob Warren, Conor J Meehan, Annelies Van Rie.

Abstract

BACKGROUND: Whole genome sequencing (WGS) is increasingly used for Mycobacterium tuberculosis (Mtb) research. Countries with the highest tuberculosis (TB) burden face important challenges to integrate WGS into surveillance and research.
METHODS: We assessed the global status of Mtb WGS and developed a 3-week training course coupled with long-term mentoring and WGS infrastructure building. Training focused on genome sequencing, bioinformatics and development of a locally relevant WGS research project. The aim of the long-term mentoring was to support trainees in project implementation and funding acquisition. The focus of WGS infrastructure building was on the DNA extraction process and bioinformatics.
FINDINGS: Compared to their TB burden, Asia and Africa are grossly underrepresented in Mtb WGS research. Challenges faced resulted in adaptations to the training, mentoring and infrastructure building. Out-of-date laptop hardware and operating systems were overcome by using online tools and a Galaxy WGS analysis pipeline. A case studies approach created a safe atmosphere for students to formulate and defend opinions. Because quality DNA extraction is paramount for WGS, a biosafety level 3 and general laboratory skill training session were added, use of commercial DNA extraction kits was introduced and a 2-week training in a highly equipped laboratory was combined with a 1-week training in the local setting.
INTERPRETATION: By developing and sharing the components of and experiences with a sequencing and bioinformatics training program, we hope to stimulate capacity building programs for Mtb WGS and empower high-burden countries to play an important role in WGS-based TB surveillance and research.
© The Author(s) 2020. Published by Oxford University Press.


Keywords: Mycobacterium tuberculosis; Africa; bioinformatics; capacity building; whole genome sequencing

Year:  2021        PMID: 33009560      PMCID: PMC8293823          DOI: 10.1093/bib/bbaa246

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


Introduction

Tuberculosis (TB) remains a major public health problem with an estimated 10 million new cases annually [1, 2]. Two decades ago, the first complete genome of Mycobacterium tuberculosis (Mtb) was described [3]. Even though bioinformatics and genomics are relatively new biomedical disciplines, they have already made important contributions to the health of patients and populations by mapping TB transmission dynamics and predicting the comprehensive drug resistance profile in individual patients [4]. By unraveling the complete DNA sequence of the Mtb genome, whole genome sequencing (WGS) is also increasingly used in basic science research aimed at understanding the evolution and pathogenicity of Mtb. WGS has thus become an important tool in TB research and surveillance [5]. In Europe and the USA, WGS of Mtb is increasingly being used in routine care settings for species identification, determination of drug resistance profiles and to complement epidemiological source investigation [6, 7]. For example, in 2017, Public Health England introduced WGS in the National Health Service for diagnosis of TB, detection of drug resistance and typing of Mtb strains at the population level [8]. Similarly, the New York State Department of Health’s Wadsworth Centre and the Dutch National Institute for Public Health and the Environment implemented WGS for routine drug resistance profiling [9, 10]. In the past decade, important progress has been made in the standardization of the technical approach to WGS, and its cost has dropped dramatically [5]. Consequently, incorporation of WGS into TB research has now become a realistic option for high TB burden countries. Scientists at reference laboratories and universities in high TB burden countries are thus well positioned to play an important role in WGS-based surveillance and research. 
Unfortunately, most countries face important challenges when implementing WGS in a research setting and even greater challenges in a clinical setting. Bringing the scientific and technological advances of genomics to resource-poor countries in a way that is relevant to the local health priorities poses a major challenge [11, 12]. Establishing WGS facilities requires investment in human, laboratory and computational infrastructure [5]. Laboratory and biosafety equipment is needed for Mtb culture, DNA extraction, library preparation and sequencing. In addition to the initial investment in equipment, WGS requires continuous funding to cover the purchase of reagents and instrument maintenance and insurance. Computational infrastructure has to be upgraded to ensure that it is powerful enough to store, transfer and analyse the vast amounts of genomic data generated by WGS. With regard to human resources, WGS research requires a multidisciplinary team with knowledge of relevant laboratory skills, biology (of Mtb) and bioinformatics, and preferably also knowledge of computer sciences, genetics, epidemiology and medicine. Many countries with a high burden of TB suffer from limited capacities in education and human skills development [13]. Currently, the interdisciplinary field of bioinformatics is still in its infancy in most high TB burden countries, and the number of institutes that offer formal bioinformatics degrees is too small to meet the demand. As a consequence, many universities and reference laboratories in high TB burden countries lack bioinformaticians with experience in Mtb research. Organizing effective training programs to advance the genome sequencing and bioinformatics skills of current academic researchers and staff of reference laboratories will thus be critical to facilitate the integration of WGS into Mtb research and surveillance activities in developing countries [14, 15].
In this article, we present an overview of the global status of Mtb WGS research, outline the development of a training program and highlight the main challenges faced during the first two trainings.

Methods

Global status of Mtb WGS research

To explore the status of Mtb WGS research capacity at the global level, we performed a PubMed search on 3 March 2020, to identify all published manuscripts on WGS of Mtb using the following search terms (‘whole genome sequencing’ AND tuberculosis). We extracted data from 466 eligible articles. We assessed the number of publications over time by geographic location of the samples included in the analysis and by region of affiliation of the first author. To assess for possible imbalance between regional participation in Mtb WGS research and the burden of TB, we compared the location of data collection, location of affiliation of first author and the burden of TB in 2018 (most recent data reported by WHO [1]) between four regions: Europe, North America and Oceania (predominantly Australia and New Zealand); Asia; Africa and South and Central America.
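The search described above could be scripted for reproducibility. The following is a minimal sketch, assuming the search is run through the public NCBI E-utilities ESearch endpoint rather than the PubMed web interface (the source does not say which was used). The lower date bound, the `retmax` value and the function name `build_pubmed_query` are illustrative assumptions. The code only builds the request URL and does not contact NCBI.

```python
from urllib.parse import urlencode

# Public NCBI E-utilities ESearch endpoint.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_pubmed_query(term, max_date, min_date="1900/01/01", retmax=1000):
    """Return an ESearch URL for a PubMed term limited by publication date.

    min_date and retmax are assumptions made for this sketch; the article
    only reports the search terms and the search date.
    """
    params = {
        "db": "pubmed",
        "term": term,
        "datetype": "pdat",   # restrict on publication date (assumption)
        "mindate": min_date,  # ESearch expects mindate and maxdate together
        "maxdate": max_date,
        "retmax": retmax,
        "retmode": "json",
    }
    return f"{ESEARCH_URL}?{urlencode(params)}"


# Search terms and search date as reported in the text.
url = build_pubmed_query('"whole genome sequencing" AND tuberculosis', "2020/03/03")
print(url)
```

The returned identifiers would still need manual screening of title, abstract and full text, as the authors describe.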

Development of the Mtb WGS and bioinformatics short course

Instead of organizing a ‘fly in, teach and leave’ workshop, which often fails to have durable impact [16], we aimed to build a critical mass of junior researchers and scientists who, upon completion of the training course, would integrate their newly acquired skills into their TB research and/or surveillance work. The short course focused on acquisition of both theoretical and practical skills related to genome sequencing of Mtb, bioinformatics analysis of Mtb WGS data and development of a research project. The short course was complemented with exposure to relevant research using WGS of Mtb and long-term mentoring of trainees. To have maximal impact, the training was intended for a group of 10–15 scholars from academic institutions (preferably employed at lecturer or assistant professor level, at the start of their career) and reference laboratories (preferably holders of a master’s degree). In addition to human capacity building, the program also aimed to build infrastructure in bioinformatics and sequencing by creating functional bioinformatics and genome sequencing units that use standard operating procedures. WGS has infrastructure requirements for sample preparation, DNA extraction, library preparation, sequencing and data analysis. Because sequencing instruments become outdated very rapidly, we opted to focus on high-quality DNA extraction and data analysis steps. Following DNA extraction, the Mtb DNA can then be shipped for library preparation and sequencing. Outsourcing the library preparation and sequencing steps can be highly cost-effective and is an approach that is also employed by many sequencing research groups in high-income, low TB burden countries. To test and refine the training program, we built upon an existing collaboration with Jimma University in Ethiopia. Ethiopia is one of the 30 high TB burden countries with an estimated 165 000 new TB cases and 1600 new cases of rifampicin-resistant TB in 2018 [1]. 
The Tuberculosis Omics ResearCH (TORCH) consortium acquired funding from VLIR-UOS (Flemish Interuniversities Council – University Development Co-operation) to develop a bioinformatics and sequencing training program through an academic collaboration between Jimma University in Ethiopia, Stellenbosch University in South Africa and the University of Antwerp and the Institute of Tropical Medicine in Belgium.

Results

Global status of Mtb WGS research

After reviewing title, abstract and full text, 466 articles were eligible for inclusion, with the first manuscript being published in 2009. The number of articles published increased gradually over time and reached a peak of 98 articles in 2018 (Figure 1). Of the 444 manuscripts published on Mtb WGS in the past decade (2009–2019, 2020 excluded), the data collection and performance of research took place in Europe, North America or Australia/New Zealand in 57% (n = 254), Asia in 25% (n = 111), Africa in 11% (n = 51) and South and Central America in 6% (n = 28) of studies. For the African region, most (61%) studies took place in South Africa (31 of the 51 manuscripts).
Figure 1

Number of published articles on WGS of Mtb in peer reviewed journals by year and geographic region.

Next, we investigated the region of the affiliation of the first author. Of the 51 manuscripts with study location in Africa, the first author was from an African institution in 33 (65%) of the manuscripts (23 of the 31 papers from South Africa and 10 of 20 manuscripts from other African countries). Of the 28 manuscripts with study location in South or Central America, the first author was from a South or Central American institute in 23 (82%) of the manuscripts. Of the 111 manuscripts with study location in Asia, the first author was from an Asian institute in 104 (94%) of the manuscripts. Lastly, of the manuscripts with study location in Europe, North America or Oceania, all (100%) first authors were affiliated with an institute in one of these regions. In total, the first author was affiliated with Europe, North America or Oceania in 284 (64%) of the manuscripts, while 254 (57%) of the manuscripts had a study location in these regions.
The distribution of the regions where the Mtb WGS data were collected differed from the relative burden of TB (Figure 2). In 2018, 11% of published articles on WGS of Mtb originated from Africa, while the continent accounted for 26% of the global TB burden. Likewise, 23% of articles originated from Asia, which accounted for 69% of the global TB burden. Eight percent of articles originated from South or Central America, which accounted for 3% of the global TB burden. The greatest imbalance occurred in Europe, North America and Oceania, as 57% of articles originated from there even though these regions combined accounted for only 3% of the global TB burden [1].
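The imbalance between publication share and TB burden share can be expressed as a simple ratio. The sketch below uses only the 2018 percentages reported in the text. The ratio itself (publication share divided by burden share) is an illustrative summary chosen here, not a metric used by the authors.

```python
# Shares (%) as reported in the text for 2018.
publication_share = {
    "Africa": 11,
    "Asia": 23,
    "South and Central America": 8,
    "Europe, North America and Oceania": 57,
}
tb_burden_share = {
    "Africa": 26,
    "Asia": 69,
    "South and Central America": 3,
    "Europe, North America and Oceania": 3,
}


def representation_ratio(pub, burden):
    """Publication share divided by TB burden share, per region.

    A value below 1 means the region contributes fewer Mtb WGS articles
    than its share of the global TB burden would suggest.
    """
    return {region: pub[region] / burden[region] for region in pub}


ratios = representation_ratio(publication_share, tb_burden_share)
for region, ratio in sorted(ratios.items(), key=lambda item: item[1]):
    print(f"{region}: {ratio:.2f}")
```

Under these figures, Asia and Africa fall well below 1, while Europe, North America and Oceania together sit far above it.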
Figure 2

Regional comparison of relative burden of TB (proportion of global TB incidence by region), origin of samples included in research articles on WGS of Mtb published in peer reviewed journals by continent in 2018, and region of first author of such publications.


Initial program goals and development: Mtb WGS, bioinformatics and research proposal writing

The Mtb WGS laboratory training started with a theoretical overview of a typical WGS experiment, the underlying mechanics and applications of different WGS technologies. The training mainly focused on Illumina technology and only briefly touched upon PacBio and Oxford Nanopore Technologies platforms as these are currently less commonly used. During the interactive practical sessions, the trainees received instruction on the sample preparation steps of a sequencing experiment. DNA extraction was performed under supervision using a cetyltrimethylammonium bromide (CTAB) method. M. smegmatis solid cultures were used so that inactivation of the samples could be demonstrated outside of biosafety level 3 (BSL-3) laboratory conditions. Other practical sessions included template DNA quality control, library preparation and clean-up, library quality control and starting a sequencing run on an Illumina MiSeq instrument. Bioinformatics training included a hands-on session on computing in UNIX operating systems followed by a theoretical session on WGS reference-mapping approaches and a tutorial on variant calling. To demonstrate a standard bioinformatics analysis of Mtb WGS data, we selected the MTBseq pipeline because this is a modular, easy to install, publicly available pipeline consisting of open source software implemented in Perl. MTBseq can be invoked by a single Linux command, is customizable, expandable and can be used without an Internet connection [17]. Next, the WGS reference-mapping training was taught over 2 days. First, a theoretical explanation was given of the process with extensive examples to highlight the advantages and disadvantages for the three primary tasks of strain identification, drug resistance profiling and transmission studies. Hands-on sessions were split into online tools and UNIX tools. 
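To make the reference-mapping and variant-calling ideas concrete, the following toy sketch calls single-nucleotide variants from reads that are assumed to be already aligned to the reference without gaps, and then counts SNP differences between two isolates, the quantity that transmission studies build on. This is not how MTBseq or any of the tools named in the course work internally. The function names, the depth and allele-fraction thresholds and the ten-base "genome" are all hypothetical.

```python
from collections import Counter, defaultdict


def call_snps(reference, aligned_reads, min_depth=3, min_fraction=0.75):
    """Call SNPs from gap-free reads given as (0-based start, sequence) pairs.

    min_depth and min_fraction are illustrative thresholds, not values
    taken from the course or from any specific pipeline.
    """
    pileup = defaultdict(Counter)
    for start, seq in aligned_reads:
        for offset, base in enumerate(seq):
            pileup[start + offset][base] += 1

    snps = {}
    for pos in sorted(pileup):
        counts = pileup[pos]
        base, support = counts.most_common(1)[0]
        depth = sum(counts.values())
        if depth >= min_depth and support / depth >= min_fraction:
            if base != reference[pos]:
                snps[pos] = (reference[pos], base)
    return snps


def snp_distance(snps_a, snps_b):
    """Count positions at which two isolates differ in their called allele."""
    positions = set(snps_a) | set(snps_b)
    return sum(1 for pos in positions if snps_a.get(pos) != snps_b.get(pos))


reference = "ACGTACGTAC"
reads_a = [(0, "ACCTAC"), (1, "CCTACG"), (2, "CTACGT"), (4, "ACGTAC")]
reads_b = [(0, "ACCTAC"), (1, "CCTACT"), (2, "CTACTT"), (4, "ACTTAC")]

snps_a = call_snps(reference, reads_a)
snps_b = call_snps(reference, reads_b)
distance = snp_distance(snps_a, snps_b)
print(snps_a, snps_b, distance)
```

Real pipelines add read mapping, base and mapping quality filters and masking of repetitive regions, which is why the course relied on established tools rather than custom code.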
For the online tools, PhyResSE and TBProfiler were used, and it was pointed out that each has a command-line equivalent that trainees can use once they are more comfortable with UNIX [18, 19]. A Galaxy interface (allowing UNIX tools to be used through a graphical user interface) was provided for a more lightweight WGS analysis pipeline involving the tools snippy and tb_variant_filter [20-22]. This approach allowed the trainees to see the benefits of WGS without making the UNIX learning curve too steep.
To increase the likelihood that trainees would implement their newly acquired skills after completion of the short course, we included the development of a research proposal by each of the trainees as a component of the course. Trainees were given an overview of past and on-going WGS of Mtb research by the instructors to provide examples of Mtb WGS research. In addition, other opportunities for exposure to relevant research were created through attendance of fora where young researchers present research in the area of bioinformatics, medical informatics and TB research (Biomina lunch talk at the University of Antwerp or annual ‘Acid Fast Club’ at the University of Stellenbosch). For the program, trainees were asked to identify a topic of interest for Mtb WGS research. During an interactive group discussion led by senior TB researchers (epidemiologist, molecular biologist and bioinformatician), the relevance of WGS for the research question, the relevance of the research question for the local setting and the novelty and feasibility were discussed. A few days later, a similar second session took place, so trainees could present their revised or refined research idea based on the feedback received. Half a day was scheduled in the library for trainees to perform a literature review, build the rationale for the proposed question and place their idea in the context of published studies. At the end of the 2nd week, students presented the rationale for and expected impact of their research idea.
In the 3rd week, research ideas were discussed in an interactive manner to demonstrate how one translates a research idea into specific aims and hypotheses, chooses the optimal study design, study setting and study population to address the specific aim, and generates a sample size calculation. After each session, students were asked to apply the information to their own research idea and to present their work at the next session. Finally, issues relating to ethics, risks and risk management of the research projects were highlighted and discussed.
To support the transition of trainees into independent researchers, we aimed to mentor each trainee to support the completion of the research proposal developed during the training. To incentivize this process, each trainee who generated a high-quality research proposal was promised 2100 euro in research funds. We also aimed to assist trainees with the identification of additional freely available software relevant to their research project, to help identify external funding opportunities and to establish a network for sustained dialogue among participants, including peer review of research proposals and journal clubs.
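The sample size step taught in the 3rd week can be illustrated with a standard textbook calculation. The sketch below assumes the simplest case, estimating a single proportion (for example, the prevalence of a resistance-conferring variant) to within a chosen margin of error, using n = z² p (1 − p) / d². The article does not state which sample size methods were taught, so the formula choice, the function name and the example inputs are assumptions.

```python
import math
from statistics import NormalDist


def sample_size_for_proportion(expected_p, margin, confidence=0.95):
    """Sample size needed to estimate one proportion within +/- margin.

    Uses the normal approximation n = z^2 * p * (1 - p) / d^2 and rounds up.
    """
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    n = (z ** 2) * expected_p * (1 - expected_p) / (margin ** 2)
    return math.ceil(n)


# Most conservative assumption (p = 0.5), 5 percentage point margin.
n_conservative = sample_size_for_proportion(0.5, 0.05)
# Hypothetical expected proportion of 10%, same margin.
n_ten_percent = sample_size_for_proportion(0.10, 0.05)
print(n_conservative, n_ten_percent)
```

Comparative or cohort designs need different formulas, so a sketch like this only fits a proposal whose specific aim is a single prevalence estimate.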

Refining the training course based on experiences and trainees’ feedback

In June 2018 and July 2019, 15 individuals from southwest, central and northwest Ethiopia participated in one of the two trainings. During the first two short courses, we faced multiple challenges. Based on the experiences gained and trainees’ feedback on the first training, the program was modified for the 2nd year. After the second training, further adjustments were made to generate the final training program. The main challenges and the solutions implemented to overcome these are discussed below and summarized in Table 1. The complete final program is provided in Table 2.
Table 1

Challenges and implemented solutions experienced in capacity building for Mtb WGS and bioinformatics in Ethiopia

Topic | Challenge | Solution
Identification of junior scientists | Obtaining a gender balance | Lower requirements for female trainees (e.g. BA degree instead of MA)
Identification of junior scientists | Brain–drain after recruitment process | Complement competitive selection process with targeted selection by university, lower requirements to BA
Bioinformatics training | Computing infrastructure is lacking | Develop WGS pipelines that can run on laptops and expand online tool usage
Laboratory skills for DNA extraction | Poor adherence to BSL-3 good laboratory practice | Add a session on good BSL-3 practice
Laboratory skills for DNA extraction | Poor basic laboratory skills | Add hands-on session on pipetting skill training
Laboratory skills for DNA extraction | Difficult transition from supervised DNA extraction training to performing DNA extraction in local laboratory setting | Perform 3rd week training in local laboratory and add independent hands-on trainings to the supervised sessions
Equipment for DNA extraction | Lack of reagents and safety equipment (fume hood) for CTAB DNA extraction method in some laboratories | Switch to use of commercially available DNA extraction kits
Equipment for DNA extraction | Lack of spectrophotometer to assess quality and quantity of extracted DNA in most laboratories | Purchase spectrophotometer
Identification of research ideas and create research proposal | Communication barrier: limited experience with the use of interactive teaching methods | Awareness of this culture clash, use of experienced instructors, use of case studies and explicit creation of a safe atmosphere where opinions can be voiced
Long-term mentoring and creation of trainee network | Poor Internet infrastructure and Internet outages during civil unrest hampered communication and the ability to hold monthly conference and organize e-journal clubs | Use of Slack to create chat rooms by topic and facilitate communication between trainees and between trainees and instructors
Transition towards independent scientist | Limited funding available to support WGS-based research projects | Promote use of existing samples for the first research project
Table 2

Training program for a 3-week short course on WGS of Mtb and bioinformatics for academic and reference laboratory scientists from high burden low-income countries

Day | Area | Topic | Format | Rationale
1 | General | Introduction to training | Formal presentation | Introduce trainees and instructors; give overview of training program
1 | General | Overview of WGS research performed in TORCH consortium | Formal presentation | Give overview of WGS research performed by instructors
1 | Genome sequencing | WGS theory | Formal presentation | Teach the theoretical basis underlying WGS
1 | General | Basic BSL-3 and pipetting skills | Hands-on laboratory training | To ensure everyone implements good BSL-3 and basic laboratory practices
2 | General | Mtb culture in BSL-3 | Hands-on laboratory training | To implement good biosafety and culture practices
2 | Genome sequencing | DNA extraction from liquid clinical Mtb culture: heat killing, pelleting and enzymatic lysis steps | Hands-on laboratory training under supervision | Acquire practical skills for DNA extraction
3 | Genome sequencing | DNA extraction from liquid culture using optimized protocol with Qiagen kit | Hands-on laboratory training under supervision | Acquire practical skills for DNA extraction
3 | Genome sequencing | DNA extraction from solid culture: heat killing, pelleting and enzymatic lysis steps | Hands-on laboratory training under supervision | Important to acquire skills to extract DNA from both liquid and solid culture
3 | Genome sequencing | Spectrophotometer to measure DNA quantity and purity | Hands-on laboratory training under supervision | Validation of DNA extraction by measuring DNA quality and quantity (when fluorometer is not available)
3 | Genome sequencing | DNA library preparation | Formal presentation | Presentation on library preparation with emphasis on outsourcing
4 | Genome sequencing | DNA extraction from solid culture using optimized protocol with Qiagen kit | Hands-on laboratory training under supervision | Important to acquire skills to extract DNA from both liquid and solid culture
4 | Genome sequencing | Spectrophotometer to measure DNA quantity and purity | Hands-on laboratory training under supervision | Validation of DNA extraction by measuring DNA quality (spectrophotometer) and quantity (fluorometer) if fluorometer is available
5 | Research proposal development | Evaluation of research ideas suggested by trainees: relevance for WGS, relevance for Ethiopia, novelty, feasibility | Interactive group discussion led by senior TB researchers | After a week of exposure to WGS, trainees have first opportunity to develop a research idea
5 | General | Local research conference | Conference | Expose trainees to presentations of relevant research projects at different stages of completion
6 | Research proposal development | Evaluation of updated research ideas suggested by trainees: relevance for WGS, relevance for Ethiopia, novelty, feasibility | Interactive group discussion led by senior TB researchers | Based on feedback in first session, trainees present their updated research idea
6 | Bioinformatics | UNIX tutorial: using Linux for Windows | Hands-on computer training | Acquire basic Linux skills for bioinformatic analysis
7 | Bioinformatics | Galaxy WGS pipeline: Snippy | Hands-on computer training | Introduce user-friendly bioinformatics platform for basic WGS analysis
7 | Bioinformatics | PhyResSE and TBProfiler | Hands-on computer training | Use of freely available online bioinformatics pipeline resources
8 | Bioinformatics | Phylogeny and phylodynamics of Mtb | Formal presentation | Teach the theoretical basis underlying WGS transmission studies
8 | General | Findings of Mtb transmission studies | Formal presentation | Expose trainees to the field of transmission research
8 | Research proposal development | Library time | Literature review | Self-study to update research idea
9 | Bioinformatics | Data analysis | Hands-on computer training | Wrap up bioinformatics sessions with Q&A
9 | Genome sequencing | Library preparation demonstration | Supervised laboratory training | Acquire practical skills to understand where library preparation can go wrong
9 | Genome sequencing | DNA shipping | Formal presentation | Present optimal way to package DNA sample for shipping
10 | Research proposal development | Rationale, novelty and impact of research idea | Interactive group discussion led by senior TB researchers | Based on literature review, trainees present updated research idea, its rationale, novelty and impact
10 | General | Social event | | Promote group coherence and social interaction with instructors
11 | Genome sequencing | DNA extraction from liquid and solid clinical Mtb culture: heat killing, pelleting and enzymatic lysis steps | Independent hands-on laboratory training | Apply DNA extraction skills in local setting in an independent manner
11 | Research proposal development | Develop specific aims for research idea | Interactive group discussion led by senior TB researchers | Teach how a research idea is translated into specific aims using one of the proposed research ideas as case study
11 | Research proposal development | Develop specific aims for research idea | Homework assignment | Apply newly acquired skills to own research idea
12 | Genome sequencing | DNA extraction from liquid and solid culture using optimized protocol with Qiagen kit | Independent hands-on laboratory training | Apply DNA extraction skills in local setting in an independent manner
12 | Genome sequencing | Spectrophotometer to measure DNA quantity and purity | Independent hands-on laboratory training | Apply DNA extraction skills in local setting in an independent manner (if infrastructure available)
13 | Research proposal development | Develop specific aims for research idea | Interactive group discussion led by senior TB researchers | Group discussion of specific aims developed by each of the trainees
13 | Research proposal development | Select appropriate study design, study setting and study population for research idea | Interactive group discussion led by senior TB researchers | Teach how to select the appropriate study design, study setting and study population using one of the proposed research ideas as case study
13 | Research proposal development | Select appropriate study design, study setting and study population for research idea | Homework assignment | Apply newly acquired skills to own research idea
14 | Research proposal development | Presentation of homework. Formulate hypothesis and generate sample size | Interactive group discussion led by senior TB researchers | Teach how a specific aim translates into a hypothesis and how to generate a sample size using one of the proposed research ideas as case study
14 | Research proposal development | Formulate hypothesis and define assumptions for sample size calculations | Homework assignment | Apply newly acquired skills to own research idea
15 | Research proposal development | Presentation of homework. Select appropriate study procedures, reflect on ethical issues, risk and risk management strategies | Interactive group discussion led by senior TB researchers | Teach how to translate a specific aim into study procedures using one of the proposed research ideas as case study. Identify key ethical issues and risks in the different studies proposed by trainees
15 | General | Certificate ceremony and social event | | Promote group coherence and wrap up training
Trainees were selected based on their motivation and active involvement in TB research, TB diagnostics or TB surveillance activities. During the selection process, the number of female applicants was low. After the selection process, several trainees dropped out before the start of the training due to a change in employment or emigration. To overcome these challenges, we lowered the requirement from holder of a master’s degree to holder of a relevant bachelor’s degree and complemented open recruitment with hand-picked, targeted recruitment, paying special attention to the recruitment of female scientists.
We experienced many challenges with training on the laboratory aspects of Mtb WGS. Even though we recruited trainees from academic and reference laboratories, we observed during the first hands-on training session that experience with and adherence to good BSL-3 practices, as well as general pipetting skills, were not optimal for all trainees. We therefore added a session on BSL-3 practices and a general pipetting skill training session. A follow-up visit to Ethiopia after the first training highlighted greater than expected challenges with DNA extraction. We therefore decided to focus even more of the practical laboratory training on the DNA extraction process. In the second training, the steps of library preparation and a sequencing run on an Illumina instrument were limited to a theoretical introduction.
On-site follow-up of the trainees also demonstrated that laboratory reagents, such as lysozyme, proteinase K and sodium acetate, required for the standard CTAB DNA extraction method were not readily available and that a fume hood (needed for safe handling of chloroform and isoamyl alcohol) was not present in all laboratories. We therefore switched to a user-friendly commercial DNeasy UltraClean Microbial DNA extraction kit (Qiagen, Hilden, Germany), with an added overnight enzymatic lysis step (lysozyme 10 mg/ml) to maximize the amount of Mtb DNA extracted (supplementary data: supplementary file 1). This kit provides a simple and standardized approach suitable for low-income countries with varied levels of laboratory infrastructure. In the second training, trainees used the kit to perform DNA extraction on both solid and liquid bacterial cultures, to highlight the differences between these two culturing methods as starting material. Additionally, we switched to cultures of Mtb instead of M. smegmatis to more accurately represent a real Mtb experiment and simultaneously emphasize good BSL-3 practices. The validation of the DNA extraction process also proved challenging as a functioning spectrophotometer was not available, and funding had to be freed to purchase a new one.
Finally, because of the many challenges with the transition from supervised demonstration of DNA extraction and WGS techniques in a well-equipped laboratory during the training to implementation in the home laboratory, we changed from a 3-week training in a well-resourced setting (Antwerp University) to a 2-week training in a well-resourced laboratory (Stellenbosch University) complemented with a 1-week hands-on training in the trainees’ local setting (Jimma University) so that trainees could individually apply their acquired skills in their home laboratory.
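The spectrophotometric validation of extracted DNA mentioned above usually comes down to two numbers: an estimated concentration and an A260/A280 purity ratio. The sketch below uses the common conventions that an A260 of 1.0 corresponds to about 50 ng/µL of double-stranded DNA and that a ratio near 1.8 indicates reasonably pure DNA. The acceptance window, the minimum concentration and the function names are assumptions for illustration and are not taken from the course protocol.

```python
def dsdna_concentration(a260, dilution_factor=1.0):
    """Estimate dsDNA concentration in ng/uL (A260 of 1.0 ~ 50 ng/uL dsDNA)."""
    return a260 * 50.0 * dilution_factor


def assess_extract(a260, a280, dilution_factor=1.0,
                   min_conc=20.0, ratio_range=(1.7, 2.0)):
    """Summarize one reading; min_conc and ratio_range are assumed cut-offs."""
    conc = dsdna_concentration(a260, dilution_factor)
    ratio = a260 / a280
    passes = conc >= min_conc and ratio_range[0] <= ratio <= ratio_range[1]
    return {"concentration_ng_per_ul": conc, "a260_a280": ratio, "passes": passes}


# Hypothetical readings: one clean extract, one with a low purity ratio.
good = assess_extract(a260=0.9, a280=0.5)
poor = assess_extract(a260=0.9, a280=0.75)
print(good)
print(poor)
```

Sequencing providers typically publish their own concentration and purity requirements, so the cut-offs would be replaced by those of the facility receiving the shipped DNA.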
With regard to bioinformatics infrastructure, it quickly became clear that the trainees’ personal computers did not meet the hardware specifications required. The computer hardware needed to run the standard bioinformatics pipelines for strain typing, drug resistance profiling and transmission studies was lacking. Furthermore, most existing bioinformatics pipelines are not configured to run on computers with low memory, making even basic analyses impossible. For example, the minimum 8GB RAM required to run MTBseq was not available on many of the trainees’ personal laptops and many had out-of-date operating systems. During the first training, we resolved this by using a local server to run analysis pipelines in a Linux virtual machine. In the second training, we used online tools and a Galaxy implementation of a WGS pipeline running on the Ilifu cloud (https://www.ilifu.ac.za) to circumvent this. Another hurdle was that, for many trainees, the jump from inexperienced computer user to UNIX user for interpreting the WGS outputs was too large. Therefore, for the second training course, the UNIX tutorial was expanded to give more time for trainees to grasp the importance and usefulness of this platform. The switch to online tools proved more intuitive, allowed them to see the benefit of this approach and encouraged them to spend time learning such skills. During the research proposal development session, it became clear that trainees were not familiar with the interactive teaching methods that are considered integral to higher education in many high-income countries. The lack of experience with peer review, critical thinking, freedom of expression and mentoring in Ethiopian trainees resulted initially in poor interaction during the sessions. Awareness of differences in educational styles and use of experienced instructors allowed the program to still strive towards maximal interaction between the instructor and the trainees and between trainees. 
In addition, case studies were used to promote collective analysis of a case in order to identify challenges and propose solutions, which created an atmosphere where students felt safe to formulate and defend their opinions. Once this was established, interesting research plans were developed, indicating that trainees were highly capable of being independent researchers once given the correct guidance.
During the long-term follow-up of the 15 trainees, we were faced with several challenges that we either did not expect or whose impact we underestimated. One student dropped out during the training and two within a few months after completion of the training. The organization of monthly conference calls and journal e-clubs failed due to poor Internet access and disruption of Internet access during civil unrest in Ethiopia. In the second year, we switched to the use of Slack, a chat-based system that provides private message boards organized by topic and direct messaging between users. The initial budget of 2100 euro available per trainee proved to be too small for prospective pilot studies and was raised to make a meaningful project possible. To increase the development of research projects, the use of existing samples was actively promoted in the second training session. Within 1 year of training, nine trainees submitted a fundable proposal. One student successfully obtained additional external funding.

Discussion

We found a gradual increase in the number of Mtb WGS research papers over the past decade but a striking imbalance between the geographic origin of the research and the burden of TB. Specifically, Mtb WGS research from Europe and North America is hugely overrepresented while the African and Asian regions are grossly underrepresented. Furthermore, ownership of research was unjust, especially for research in the African region. This is similar to the findings of a review of genomics research in Africa published between 2004 and 2013, where less than half (47%) of first authors were affiliated with an African institution [23]. This highlights insufficient progress despite the existence of multiple training initiatives, including the African Society for Bioinformatics and Computational Biology founded in 2004 and the H3ABioNet pan-African bioinformatics network founded in 2012 [24-27]. This may be due to the challenges of online courses, including a relatively high drop-out rate [27, 28], or because these courses focus on bioinformatics without hands-on training for the laboratory component of pathogen genomics research. To achieve equity and maximum impact of the genomics revolution for TB, it is essential that scientists from high TB burden countries lead Mtb WGS research activities [29]. Local capacity needs to be built in both the laboratory and bioinformatics aspects of Mtb WGS, as many institutions continue to suffer from an insufficient number of experienced personnel who can perform, supervise and train others in bioinformatics and sequencing research [25, 26].
To address this dearth of expertise, we built an Mtb WGS training program on five pillars: combine short course training with long-term mentoring; include both theoretical training and hands-on laboratory training; focus on DNA extraction, because library preparation and sequencing can be centralized or outsourced; guide trainees in the development of locally relevant WGS research proposals; and use bioinformatics tools that require low computer resources but achieve accurate results. We faced many challenges during the first two trainings. We found that trainees were often underprepared for basic bioinformatics instruction, and we struggled to maintain long-term commitment from the trainees after the trainings. Many challenges could be overcome by tweaking the program, including adding a 1-week in-country training and switching to a commercial DNA extraction kit. Nevertheless, additional resources will be needed to complement training programs: providing secure access to remote computing services, developing light-weight pipelines that can run in resource-poor settings, investment in WGS research in high TB burden settings by both national and international agencies, and ensuring that trainees are kept up-to-date as the field and tools change rapidly. In the future, one could explore the use of video training to reduce the cost of on-site training for the laboratory component of Mtb WGS. In addition to training on the standard Illumina sequencing technology, other platforms such as the Oxford Nanopore Technologies MinION platform could be included in the training. This technology holds great potential for low-income countries given its portability, ease of use and minimal instrument investment cost. It also has a library preparation method that can be performed in a decentralized laboratory, which allows the complete WGS workflow to be carried out in the local setting. However, for Mtb, MinION sequencing and sample processing are still in their infancy and would need to be further developed before replacing Illumina technologies as the core of WGS training [30].

Conclusion

There continues to be an important underrepresentation of scientists from high TB burden countries in Mtb WGS research, especially from the African and Asian continents. While infrastructural issues can in part be overcome by outsourcing the library preparation and sequencing steps, development of expertise in extraction of quality Mtb DNA and acquisition of bioinformatics skills for in-country analysis of WGS data continues to pose great challenges. By developing and sharing the components of a 3-week course on WGS and bioinformatics for TB research, we hope to stimulate the development of such programs and empower scientists from high TB burden countries to play an important role in WGS-based TB surveillance and TB research.
Africa and Asia are grossly underrepresented in Mycobacterium tuberculosis WGS research, compared to their tuberculosis burden. We developed a 3-week training course on WGS of tuberculosis, combined with long-term mentoring and infrastructure building. We faced several challenges, resulting in iterative adaptations to the training program, mentoring and infrastructure building. We present our optimized training concept and framework, which could stimulate other capacity building initiatives.