Introduction
According to the World Health Organization (WHO), around 10.6 million people across the world fell ill with Tuberculosis (TB) in 2021. Until the COVID 19 pandemic, there was a yearly 2% decline in TB incidence being observed over the past 20 years, but the post-pandemic world actually saw an increase of 3.6% in the overall incidence of TB between 2020-2021 [
1]. Between 2015-2021, the African region where a quarter of all TB occurs, saw a decline of 22% in TB incidence, thus meeting the 2020 target of 20% decline [
1]. However, despite all efforts, more than 4 million, approximately 40% of all incident cases remain undiagnosed or unreported to the National TB control programs (NTP). Missing patients, especially the ones who go undiagnosed can unknowingly transmit disease to their families and communities. Interventions need to be strengthened to find these missing TB cases and connect them to care.
Nigeria has the 6th highest burden of TB cases in the world, where 4.4% of all cases are found. The country is also among the top 5 countries that have the highest gap in estimated incidence and reported number of people with newly diagnosed TB [
1]. In 2021, only 44% of the estimated number of incident cases were notified in the country. Finding the missing cases early and treating them remains the single most important priority in the country for TB control.
Apart from treating those with existing TB, it is imperative that the halting of transmission is prioritised. A disease like TB which could remain asymptomatic for long periods of time will need more than just passive case finding to control. Active case finding (ACF) is about proactively reaching out to the community, finding people who have undiagnosed infections and connecting them to care [
2]. Current evidence supports that ACF interventions can help to reach undiagnosed individuals early and are beneficial in low resource settings. A multi-faceted community-based TB case finding intervention in 2 southern states of Nigeria observed that the intervention led to a 138% increase in case detection as compared to the expected notifications in the absence of any intervention [
3].
In another study focused on potentially high transmission settings like urban slums, they found a 6.4% TB positivity among the target population, of which 65% were smear positive. They reported that the high transmission could be due to poor living conditions and overcrowding [
4]. Another ACF project implemented in Ebonyi state in Nigeria, combined house to house outreach with ACF activity to target attendees of a health facility in the general outpatient departments, ANC and MCH clinics, and people living with HIV (PLHIV) reporting to ART centres - including contact tracing of index clients. They found 3.2% TB positivity among those who were evaluated [
5] facility-based screening like these can reach a big population in a relatively short period of time, but the overall impact can be variable. While ACF interventions are necessary and impactful, they can be resource intensive. Innovative and data driven approaches can make them more efficient and help to reduce the overall resources consumed. An early warning outbreak recognition system (EWORS) was implemented in 14 states in Nigeria which helped to identify areas with potentially high TB spread at the ward level, and subsequent ACF interventions in the hotspot wards yielded a significantly lower number needed to screen as compared to non-hotspot wards [
6].
It is now advocated that communities should use locally tailored interventions to target specific areas which could be at increased risk of TB. Subnational estimation of TB burden though extremely valuable to guide local community-based interventions are rather limited in low-middle income countries. Most countries still depend on case notifications, prevalence surveys with small-area estimation and surveys of infection to estimate burden of TB. While such surveys are expensive and time consuming, case notifications are often subject to bias due to differential access to healthcare and under-reporting in low- and middle-income countries [
7]. The MATCH Framework proposed by Rood E et al., emphasises using subnational data such as disaggregated notifications and local screening data for deriving granular insights on a subnational scale [
8]. Also, the risk of TB transmission, delayed diagnosis and treatment is determined by several contextual factors like population demographics, socio economic conditions, nutritional status, access to health services and environmental conditions, which are important to consider along with notification data [
9,
10,
11]. Therefore, there is value in leveraging data and available technology to make evidence based decisions for routine programmatic activities and improve effectiveness of interventions [
12].
The goal of this paper is to describe the approach taken for developing a TB risk predictive model at the community level and discuss the impact it had on the yield of TB ACF interventions. The ACF yield in clusters that overlap with model predicted hotspot locations was compared with those that do not overlap with hotspots, our hypothesis being that if implementers selected sites that had high predicted TB positivity rates for ACF - “TB hotspots” - they would find more cases of undiagnosed TB than at sites not predicted to be hotspots.
Methodology
Data generation methodology. A retrospective analysis was conducted of the data generated from community based ACF interventions led by Society for Family Health (SFH), Nigeria in 4 south-western states namely Lagos, Oyo, Ogun and Osun. The project was funded by The United States Agency for International Development (USAID), which established the Local Organizations Networks (LON) to increase the level of TB cases detected and treated in Nigeria over five years (2020-2025). The project engages Community Based Organizations (CBOs) and Community Volunteers (CVs) for contact tracing of index TB cases enrolled for treatment, community outreaches, active case finding and sensitization meetings in the communities.
Implementing a closed loop dynamic model training pipeline. Principles of Machine Learning were used to develop an epidemiological digital representation (‘twin model’) of the TB situation in the four states of Nigeria. This digital twin model was based on data generated from local ACF program implementation and contextual data and followed the principles of the MATCH framework as implemented in Pakistan TB ACF settings [
13,
14,
15]. The outputs generated by the model were visualised on a web interface (hereafter referred to as the geoportal) that was used by the local teams to identify high priority neighbourhoods for routine activities. The new data thus generated from the routine ACF events was regularly incorporated into the training data. Continuous flow of new data from the program formed a feedback loop, improved the model’s ability to learn from data coming in and distinguish better among the low and high TB risk areas and calibrated previous predictions. This method allowed the outputs to become finetuned over time, keeping in mind the need to support program teams in finding the last missing TB cases. The first model was trained in September 2020 with very limited data including only 24 unique ACF locations. This grew to the present 857 unique locations as at June 2022.
Justification for a Bayesian modelling approach. The approach used Bayesian networks that provided a powerful machine learning technology to reason with uncertainty in complex environments. In this case it refers to the spatially aware programmatic and socio-economic data, multiple variables with potentially nonlinear relationships, large amounts of natural variation, and missing values. The model was queried for TB positivity rate across the four states, including areas where observations were not available, using known local contextual data at the relevant subnational resolution. The Bayesian framework used was a proprietary Naive Bayes implementation. Naive Bayes is founded on the assumption of conditional independence between predictors – called the Naive Bayes Assumption. The benefits of such an approach are well known in terms of ease of implementation, scalability to number of predictors and data points, and ability to be trained on relatively limited input data.
Data geolocation. Routinely collected TB screening data from ACF sites was received on a monthly interval. Each row corresponded to a unique location or community where the screening activity took place and contained corresponding information on the state, local government area (LGA) and ward name. Google maps was used to manually look up geo coordinates of each community, and the ones that did not show up on Google maps were excluded.
Data preparation and transformation. The TB positivity rate (proportion) was derived from the number diagnosed with bacteriologically positive TB and absolute number screened at each ACF event. Similar proportions were also calculated for contact investigations (close contacts of index TB clients diagnosed positive over total contacts screened) and facility-based screening (number of attendees diagnosed positive over number of attendees screened).
Socioeconomic data processing. Indicators of sociodemographic situation and human development known to be associated with TB [
10,
16] were accessed from open-source platforms. Data on age and gender related population estimates [
17], population density [
18], poverty [
19], night-time lights [
18] and elevation [
18] was obtained from WorldPop. Spatially modelled data from the Demographic and Health Surveys (DHS) platform on literacy, access to clean water, sanitation services, stunting in children (proxy for nutritional status), vaccination coverage (indicator of access to care and health seeking behaviour) was used [
20]. Travel time to health care facilities [
21], distance to major roads [
18] and health facility density [
22] were also used as indicators of access to care. Modelled estimates of Human Immunodeficiency Virus (HIV) prevalence [
23] and child mortality [
24] were available from The Institute for Health Metrics and Evaluation (IHME) and Global Health Data Exchange (GHDx) platform. A more detailed view on the variables used has been provided in
Table 1.
Population clustering and thiessen polygon generation. In highly populous countries like Nigeria, the usual administrative units like the wards can be very large in some regions. In order to provide a model output that is able to guide neighbourhood level interventions, wards were divided into further smaller units. This was done by an observation-weighted k-means clustering [
25,
26] algorithm that divided the population in a given ward into polygons of varying shape and population, such that each polygon contained approximately 10,000 population and did not cross the ward borders. This way the four states are disaggregated into 7,088 population clusters (also called thiessen polygons) [
27] of variable sizes. High population density areas had smaller size clusters while low density areas like rural areas had comparatively large sized clusters. K-means clustering is a well-known unsupervised machine learning technique for finding clusters in data. The benefits of using an unsupervised algorithm to cluster population density include estimates of local population density centroids (from the k-means cluster centroids), facilitating microplanning in the absence of high-resolution municipal boundaries. The clusters also allowed our clients to set targets based on the proportion of each Thiessen population to be screened. The model was trained to produce an output for each of these 7,088 population clusters.
Data aggregation and standardization. Each variable mentioned in
Table 1 was aggregated to match the resolution of the newly designed polygon level and scaled to a rate form, such that each population cluster had a unique profile defined by its local contextual information.
Model training and hotspot recommendations. The model was trained on the TB positivity rate derived from ACF events that took place in a limited number of population clusters, to predict a TB positivity rate for all other clusters. The predicted output thus allowed identification of other clusters that could be prioritised for ACF activities. The geoportal allowed the local teams to activate a filter and select sites with the highest predicted TB positivity rate in their region of activity. The location was then communicated to the field teams responsible for organising TB screening events in the community. The yield obtained at the ACF event (observed TB positivity rate) was allocated to the respective population cluster to calculate the new average observed rate. If the site had never been screened, the new yield obtained after ACF was included as a new data point on the dataset. The model was dynamic in nature, receiving new ACF data at regular intervals and retrained every 2-3 months. The outputs were expected to improve over time and the model identified ‘hotspots’ or population clusters that could be at risk of increased TB positivity driven by the local contextual conditions such as poverty, access to health facilities or population density.
Platform-assisted program steering. The hotspot predictions were being used in the 4 States since early 2021 for selecting the most optimal sites for ACF, however the uptake was variable across the region. Some of the ACF events were planned using the model predictions, while others were selected based on the conventional approach. The latter was based on facility level TB notifications, aggregated to the LGA level which serves as the functional unit of the TB control program in Nigeria. LGA level data was submitted to the State TB and Leprosy Control Officer (STBLCO), who then reported to the NTP [
28]. Thus facility level notifications were used to identify the catchment areas with potentially high TB burden, which were then targeted for active case finding. The project implementers who preferred to select their ACF sites using the geoportal commonly chose sites with high predicted TB positivity rates, between the top 10% to top 50%.
Comparin “hotspots” and “non-hotpsots”. The ACF yield in population clusters that overlap with model predicted hotspot locations was compared with those that were not predicted as hotspots (other sites). These other sites were chosen based on the conventional approach of using facility level notifications. The comparison was done between proportions of TB positivity (yield) using chi square test. For the sake of uniformity the threshold was set at 30% - all clusters that had a TB positivity rate falling in the top 30% range of predicted values in each state were classified to be hotspots.
Investigating the relationship between covariates and model output. Pearson’s correlation analysis was performed for all covariates and the output variables in a pairwise manner, to investigate their magnitude and the direction of the relationships as a data quality check and to investigate the impact of various covariates on the model outputs. Significant relationships between covariates and the output variable could provide clues to causal or confounding relationships in the data warranting investigation.
Results
TB positivity rate was predicted at population cluster level (below ward level) in 4 south western states of Nigeria,
Figure 1. The predicted outputs were used to select more suitable ACF sites, a step towards making data driven decisions. To facilitate reporting the predicted TB burden at standard municipal boundaries, population-weighted averages of the predicted rates at cluster level were also calculated at Ward level, which is the lowest municipal level in Nigeria (See Supplementary Data File S1).
Figure 2 shows a further refined view of
Figure 1, limited only to the areas that are inhabited by individuals after eliminating all unsettled areas such as fields, forests, and barren land. The areas with ‘clustering’ of a large number of settlements represent major cities and urban areas, surrounded by less densely packed settlements in peri urban and rural areas.
Table 2 describes the overall administrative division, the number of population clusters further created during this modelling exercise and the total number of ACF locations that could be mapped using the approach described earlier. In the four states combined the mappable population clusters form 12% of the total 7088 clusters and the respective data generated was used for model training.
Figure 3 shows the geographical distribution of ACF sites mapped across the four states (in Blue) together with the population clusters identified as ‘hotspots’ (in Red) by the model. The geoportal allowed the user to visualise several layers of their interest such as their previous ACF activity, locations of TB diagnostic services and newly predicted hotspots.
Figure 4 shows the graphical representation of median TB positivity yield obtained when ACF activity happened in the predicted hotspots in comparison with other sites for each of the states. The median yield in hotspot clusters was higher than the non-hotspot clusters in three out of four states. The lowest median yield was observed in Lagos in the hotspot locations, albeit with exceptionally high yield in some locations, thus the average yield was still higher than the non-hotspot group for the state as a whole. Outliers seemed to be common in all states and in both groups.
Table 3 shows the comparative analysis of average ACF yields obtained in each of the states. For each state, the table depicts outcome of ACF in the predicted hotspots, sites that were identified by the model to be at high risk of TB and other sites which were not predicted to be among high TB positivity locations but were nevertheless chosen for ACF by the local teams based on the conventional approach.
Comparison of yield differences between “hotspot” and “non-hotspot” sites. In terms of ACF coverage of the predicted hotspots within each state, Lagos had the highest coverage at 8%, while Ogun state only selected 3% of its predicted hotspots for screening. Fewer sites were selected based on the model predictions (128), and the conventional approach (729) seemed to be more popular overall in the 4 states. Even though much less screening happened in the predicted hotspots, the proportion of TB positive individuals diagnosed was at least 1.7 times higher individually in each state and the difference was statistically significant. In Osun state, the yield obtained in the predicted hotspots was more than double as compared to that in the other sites. While Ogun state had the lowest uptake of predictive sites for ACF, they reported the second highest overall yield that was 95% more than that observed in the other sites selected conventionally.
Investigating the relationships between model covariates and model predictions. Based on Pearson’s correlation between covariates and the predicted output variables, the variable with the highest correlation with the bacteriologically positive predictions was evaluated_tot_norm_f (observed evaluated rate at facility level, the number tested divided by total screened), with a positive correlation (r=0,73, p-value=0). Similarly, presumptives_tot_norm_f (presumprtive rate at facility level) also correlated highly with predicted B-positive rates (r= 0,719, p-value=0), while all-forms diagnosis and B-positive rates were slightly less correlated with predicted B-positive rates (r= 0,413, p-value= 4,801E-154 and r=0,388, p-value= 6,4685E-135, respectively). Conversely, the variable with the lowest correlation to the predicted bacteriologically positive rate, was the total screened at facility level (r= -0,335, p-value 1,79E-98). The contextual variable with the highest correlation to predicted B-positive rates was HIV prevalence (r=0.12, p-value=0) and the least correlated contextual variable was population density over 60 years (r= -0,133, p-value=0). See Supplementary Data File S2.
Discussion
Using community ACF data for high resolution TB burden modelling. Routine ACF data generated at the community level was used to make a high-resolution TB predictive model for the four southwestern states in Nigeria. The outputs were accessible to the local team via a customised Geoportal which allowed the ground teams to visualise the hotspots on a mapping portal and make data driven decisions for their ACF activities. Apart from planning community-based TB screening, the predictive outputs were also used for other program objectives such as to identify priority areas for engaging private care providers and community engagements.
Using unsupervised population clusters combined with local context to model underserved locations. This predictive model is the first of its kind in Nigeria which predicts hotspots on specially designed population clusters below the ward level. These population clusters were designed in discussion with the local teams such that each cluster contained a manageable size of population (up to 10,000) for ACF. Identifying the high TB transmission areas on the basis of facility level notification data alone can be challenging. This model does not depend heavily on the number of registered TB clients at the facility level, which is often affected by the capacity and quality of diagnostic and treatment services available in an area. Areas that are rather remote and located away from quality TB diagnostic facilities can have low notification rates and might be mistaken for low TB transmission [
7,
16,
29]
In contrast, a study in Kampala, Uganda reported that facility based passive case finding could predict high TB prevalent settings and was sufficient to effectively guide ACF. However, their study area was a 2.2 KM^2 region with a population of approximately 49,000 individuals with quite homogenous access to TB diagnostic and treatment services. The author’s remarks that facility-based notifications may be less useful in bigger regions where access to health services is variable [
30]. Therefore it is plausible that the current approach is well suited for these populous southwestern states in Nigeria.
Leveraging limited ACF data with local context for data-driven program steering. This model leveraged community-based screening data from a limited number of locations, enriched it with other local determinants of TB transmission and predicted the TB positivity rate at the community level across the whole geographic region. The advantage of using ACF data could be that it reflected the TB transmission among the people who live or frequent the screened location. The model thus ‘predicted’ and helped to identify even those communities that were traditionally not known to report many TB cases or were not reached by any previous ACF activities. This can also be seen as an added advantage over the commonly used spatial clustering techniques. These techniques are able to identify statistically significant clustering patterns from heterogenous disease distributions. While they would detect significant clusters if several data points fell adjacent to each other, they can fail to identify small areas of TB transmission if they exist in an isolated and remote location [
31]. Identification of disease hotspots at the local level is believed to be one of the important components of epidemic elimination [
7]. Thus, the local teams could make informed decisions for selecting ACF sites and reach out to the communities to find a higher proportion of undiagnosed TB.
Population density associated with higher predicted TB rates. It can be seen that the highest TB risk areas were predicted to be in and around the densely populated urban centres. While urban centres are often characterised by high accessibility to better health care services, they also typically have overcrowded houses and people living in lower socioeconomic conditions, determinants of TB transmission [
32,
33].
Increased yield from model recommendations obtained. TB positivity yield obtained at the ACF sites that were also predicted by the model to be at high risk of TB were compared to other sites that were selected based on the conventional approach for ACF planning (using notification data). The overall yield in predicted hotspots was 73% higher in Lagos, 95% higher in Ogun, 103% higher in Osun and 75% higher in Oyo state as compared to the other sites, and the results were statistically significant. The results were especially impressive because the model-recommended ACF sites only form a very small proportion of the total ACF that happened across the four states. This supports the fact these model-predicted hotspots were in fact better suited for ACF while the non-hotspot sites did not find as many new TB cases with extensive screening. Ogbudebe et al. who used their EWORS system to identify TB hotspots in 14 states of Nigeria reported that the number needed to screen to diagnose a TB case in the hotspot and non–hotspot areas was 146 and 193 per 10,000 people, respectively, that translated to almost 24% higher yield [
6].
Potential and challenges for targeted ACF in highly populated cities. Looking at Lagos state, the median yield observed in the predicted hotspots is almost zero, that means up to 50% ACF events did not find any TB cases. Although the overall yield (average) is higher as compared to the other sites, it could have been potentially driven by the locations that found an extremely high yield. Lagos is the most densely populated state in Nigeria with a population density of almost 3,791 per square kilometre. The state is also the centre of focus for the National TB control program with several government and non-government organisations implementing their TB case finding interventions programs across the state. It is possible that some population clusters were repeatedly screened by separate organisations at different points of time over the past few years. This could have been one of the reasons for the lowest median yield among the four states despite the highest extent of screening. This also points towards a need for increasing collaboration among locally active organisations that perform ACF and sharing of data to make evidence-based decisions. Pooling of data from different ACF programs across the state could further boost predictive models like these and benefit the whole community. A recent study from Pakistan which retrospectively analysed their ACF program data, found that among more than 1500 individual ACF events conducted in and around Karachi city, almost three fourths did not find any TB cases. However, a small proportion of 5% events accounted for 40% of the TB cases diagnosed. The authors concluded that a more targeted approach for ACF in high population metropolitan cities can increase the yield and cost-effectiveness of interventions [
34].
The benefits of using local context and ACF data for TB burden modelling. The interquartile range of yield as observed in sites other than hotspots reflects that the sites selected by the local teams could have been similar in their local context and TB burden. It is common practice that in most states ACF sites are chosen based on facility level notifications. However, the broader range and extreme values of yield found in predicted hotspot locations shows that even with much fewer sites, these were potentially different from those selected by conventional approach (non-hotspot sites) and had a much higher number of undiagnosed TB. The reason could be that the predictive model learnt from local contextual and ACF data to uncover TB transmission sites which could have been easily missed if only aggregated notification data was used. Further, training models with local context could potentially ameliorate the effects of underreporting or other systematic reasons for low notification rates, that may not accurately reflect the true disease burden on their own.
Limitations of only using notification data for program steering. Another reason for these high yields could be that the predicted hotspots were previously seldom targeted based on reluctance of the local teams to explore communities which are not justifiably represented on the treatment registers (facility notification registers). This leaves a pool of undiagnosed TB waiting to be diagnosed at the earliest opportunity.
Local TB burden heterogeneity necessitates local approaches to ACF. Similarly, a study in Peru mapped geographic coordinates of individuals treated for TB in an urban district over five years and analysed their spatial distributions. They reported heterogeneous distributions of TB with clustering of local hotspots and cold spots across the 74 neighbourhoods in the district. The study found that while the median rate of reported cases in the district was 123.6, the range among neighbourhoods varied from 0 to 800 cases per 100,000 population. While the study utilised notification registers alone, it highlights the possibility of local epidemics and relevance of geographic mapping of individual cases from their addresses to identify areas at risk of TB transmission and its practical implications for decision making [
35].
Challenges to the implementation and uptake of model assisted ACF. However, there are a number of challenges associated with deployment of the geoportal and model driven case finding. It took some handholding and mentorship before the local teams could trust the predictions as a number of the predicted hotspots were not known as high yielding in the past. Also, limited digital literacy of some of the local teams particularly restricted the use to some cadres of field staff. Additionally, it was challenging getting data back into the model as reporting was basically paper based with sparse electronic data capturing.
Using a predictive model allows the identification of new ACF sites. The main strength of this approach was that the local ACF data allowed predictive models to learn about the distribution of undiagnosed TB cases in the community, thus improving their potential of predicting new ACF sites, as opposed to the facility level notification data. This approach leverages incremental learning and does not require large volumes of data from the beginning, allowing data driven decision making in low resource settings.
Limitations of the approach. Our results should be interpreted in the light of certain limitations and assumptions. Individuals screened at a certain population cluster or community were assumed to be belonging to that location, hence the routine data collection sheets reported aggregated and not individual level data. The geo-coordinates of ACF sites were manually searched online and mapped; this could have led to some discrepancies in precise mapping. Also, it was not possible to find all communities on Google maps and thus they had to be excluded. The uptake of using models for ACF planning was variable across the states and we had no control on how frequently or the way this approach was being used. The retrospective nature of this analysis did not allow us to control for any biases that could have an impact on the way the screening events were planned. For example, if the community volunteers knew that they were screening in a predicted hotspot, it could have affected some of their practices. Also, because the total training data at population cluster level was very limited as compared to the total number of clusters the model was predicting for, it was not possible to perform further disaggregation into age groups, gender or train the model on unseen data. The current work was unable to show the overall impact of this case finding intervention on the total notifications in the 4 states as the TB-LON 3 program did not have access to all the notification data on the state level. Nevertheless, notifications to the National TB control program are affected by a number of factors including the diagnostic and treatment initiation capacities, while the model predictions were only used to guide the ACF site selection. Finally, as the analysis was done retrospectively, hotspots predicted from the final training set may not have been predicted as hotspots at the time of screening and could have been selected by the conventional approach, and thus prospective studies are needed to more accurately quantify the improvement in yield of the data driven approach when used for targeted ACF. Potential confounding factors identified by Pearson’s correlation analysis included a negative relationship between total screened at facility level, and the predicted B-positive rates. Since facilities that are in communities with better health infrastructure may tend to screen more patients, this would create an artificial relationship between the total screened and B-positivity, although it can be argued that the total screened does provide indirect evidence for access to care in a community. Overall, the relationships identified between Pearson’s correlation between covariates and predicted B-positivity, were in line with expected relationships in the field, such as the positive correlation between HIV prevalence and predicted B-positivity by the model.
Future directions for locally targeted ACF. Based on the findings of this work, for future ACF programs and interventions, the recording of geographic information such as coordinates should be promoted as much as possible. These results are motivating enough to increase confidence in such data driven approaches. A more systematic and large-scale comparison in a prospective intervention design can provide stronger evidence for advocating this approach in other regions.
Conclusion
This approach leveraged local data, looked beyond facility level aggregated notification and enabled outreach to previously underserved locations. The program implementers were able to find much higher yields when ACF activity happened in the model-predicted ‘hotspots’.
List of Abbreviations
| ACF |
Active Case Finding |
| CBO |
Community Based Organization |
| CV |
Community Volunteers |
| DHS |
Demographic and Health Surveys |
| EWORS |
Early Warning Outbreak Recognition System |
| GHDX |
Global Health Data Exchange |
| HIV |
Human Immunodeficiency Virus |
| IHME |
The Institute for Health Metrics and Evaluation |
| LGA |
Local Government Area |
| LON |
Local Organizations Network |
| NTP |
National Tuberculosis Control Program |
| SFH |
Society for Family Health |
| STBLCO |
State Tuberculosis and Leprosy Control Officer |
| TB |
Tuberculosis |
| USAID |
The United States Agency for International Development |
| WHO |
World Health Organization |
Author Contributions
Abiola Alege: Conception or design of the work. Data analysis and interpretation. Drafting the article. Sumbul Hashmi: Conception or design of the work. Data analysis and interpretation. Drafting the article. Rupert Eneogu: Critical revision of the article. Final approval of the version to be published. Vincent Meurrens: Conception or design of the work. Anne-Laure Budts: Conception or design of the work. Data collection. Data analysis and interpretation. Michael Pedro: Data collection. Olugbenga Daniel: Critical revision of the article. Omokhoudu Idogho: Conception or design of the work. Austin Ihesie: Critical revision of the article. Matthys Potgieter: Conception or design of the work. Critical revision of the article. Obioma Chijioke Akaniro: Final approval of the version to be published. Omoshalewa Oyelaran: Final approval of the version to be published. Mensah Olalekan Charles: Conception or design of the work. Aderonke Agbaje: Conception or design of the work. Data analysis and interpretation. Critical revision of the article.
Funding
The TBLON 3 program is funded by USAID but the funder has no role in the conceptualization, design, data collection, analysis, decision to publish, or preparation of the manuscript.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Availability of data and materials
The data that support the findings of this study are available from the Institute of Human Virology but restrictions apply to the availability of these data, which were used under licence for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of the Institute of Human Virology.
Acknowledgements
Not applicable.
Conflicts of Interest
The authors declare that they have no competing interests.
References
- “Global Tuberculosis Report 2022.” Accessed: Mar. 20, 2024. [Online]. Available: https://www.who.int/teams/global-tuberculosis-programme/tb-reports/global-tuberculosis-report-2022.
- C. M. Yuen et al., “Turning off the tap: stopping tuberculosis transmission through active case-finding and prompt effective treatment,” Lancet, vol. 386, no. 10010, p. 2334, Dec. 2015. [CrossRef]
- A. S. Eyo et al., “A Multi-Faceted Approach to Tuberculosis Active Case Finding among Remote Riverine Communities in Southern Nigeria,” International Journal of Environmental Research and Public Health 2021, Vol. 18, Page 9424, vol. 18, no. 18, p. 9424, Sep. 2021. [CrossRef]
- L. Ogbudebe et al., “Reaching the underserved: Active tuberculosis case finding in urban slums in southeastern Nigeria,” Int J Mycobacteriol, vol. 4, no. 1, pp. 18–24, Mar. 2015. [CrossRef]
- D. Oshi et al., “An evaluation of innovative community-based approaches and systematic tuberculosis screening to improve tuberculosis case detection in Ebonyi State, Nigeria,” Int J Mycobacteriol, vol. 6, no. 3, pp. 246–252, Jul. 2017. [CrossRef]
- C. Ogbudebe,et al. “Identifying Hot Spots of Tuberculosis in Nigeria Using an Early Warning Outbreak Recognition System: Retrospective Analysis of Implications for Active Case Finding Interventions,” JMIR Public Health Surveill, vol. 9, 2023. [CrossRef]
- G. Theron et al., “Data for action: Collecting and using local data to more effectively fight tuberculosis,” Lancet, vol. 386, no. 10010, p. 2324, Dec. 2015. [CrossRef]
- E. Rood et al., “A Spatial Analysis Framework to Monitor and Accelerate Progress towards SDG 3 to End TB in Bangladesh,” ISPRS International Journal of Geo-Information 2019, Vol. 8, Page 14, vol. 8, no. 1, p. 14, Dec. 2018. [CrossRef]
- J. Cai, X. Wang, A. Ma, Q. Wang, X. Han, and Y. Li, “Factors Associated with Patient and Provider Delays for Tuberculosis Diagnosis and Treatment in Asia: A Systematic Review and Meta-Analysis,” PLoS One, vol. 10, no. 3, p. e0120088, Mar. 2015. [CrossRef]
- D. J. Carter et al., “The impact of social protection and poverty elimination on global tuberculosis incidence: a statistical modelling analysis of Sustainable Development Goal 1,” Lancet Glob Health, vol. 6, no. 5, p. e514, May 2018. [CrossRef]
- K. Lönnroth, E. Jaramillo, B. G. Williams, C. Dye, and M. Raviglione, “Drivers of tuberculosis epidemics: The role of risk factors and social determinants,” Soc Sci Med, vol. 68, no. 12, pp. 2240–2246, Jun. 2009. [CrossRef]
- S. Shrestha et al., “Quantifying geographic heterogeneity in TB incidence and the potential impact of geographically targeted interventions in South and North City Corporations of Dhaka, Bangladesh: a model-based study,” Epidemiol Infect, vol. 149, 2021. [CrossRef]
- E. Rood et al., “A Spatial Analysis Framework to Monitor and Accelerate Progress towards SDG 3 to End TB in Bangladesh,” ISPRS International Journal of Geo-Information 2019, Vol. 8, Page 14, vol. 8, no. 1, p. 14, Dec. 2018. [CrossRef]
- “The KIT MATCH Approach for Enhancing TB Care Coverage - KIT Royal Tropical Institute.” Accessed: Mar. 26, 2024. [Online]. Available: https://www.kit.nl/project/the-kit-match-approach-for-enhancing-tb-care-coverage/.
- “Electronic Case based Surveillance: Using predictive modelling and real-time data to plan Active TB Case Finding in Pakistan - KIT Royal Tropical Institute.” Accessed: Mar. 26, 2024. [Online]. Available: https://www.kit.nl/project/electronic-case-based-surveillance-using-predictive-modelling-and-real-time-data-to-plan-active-tb-case-finding-in-pakistan/.
- E. L. N. Maciel et al., “Spatial patterns of pulmonary tuberculosis incidence and their relationship to socio-economic status in Vitoria, Brazil,” Int J Tuberc Lung Dis, vol. 14, no. 11, p. 1395, Nov. 2010, Accessed: Mar. 21, 2024. [Online]. Available: /pmc/articles/PMC3713790/.
- M. Bondarenko, D. Kerr, A. Sorichetta, A. Tatem, and WorldPop, “Estimates of total number of people per grid square broken down by gender and age groupings for 51 countries across sub-Saharan Africa in 2020 using Ecopia.AI and Maxar Technologies building footprints,” Nov. 2020. [CrossRef]
- “WorldPop (www.worldpop.org - School of Geography and Environmental Science, University of Southampton; Department of Geography and Geosciences, University of Louisville; Departement de Geographie, Universite de Namur) and Center for International Earth Science Information Network (CIESIN), Columbia University. Global High Resolution Population Denominators Project - Funded by The Bill and Melinda Gates Foundation (OPP1134076).
- Tatem, P. Gething, S. Bhatt, D. Weiss, and C. Pezzulo, “Pilot high resolution poverty maps, University of Southampton/Oxford.
- “Spatial Data Repository - Modeled Surfaces.” Accessed: Mar. 25, 2024. [Online]. Available: https://spatialdata.dhsprogram.com/modeled-surfaces/#survey=NG|2018|DHS.
- D. J. Weiss et al., “Global maps of travel time to healthcare facilities,” Nature Medicine 2020 26:12, vol. 26, no. 12, pp. 1835–1838, Sep. 2020. [CrossRef]
- “Nigeria: Health facilities - Humanitarian Data Exchange.” Accessed: Mar. 25, 2024. [Online]. Available: https://data.humdata.org/dataset/nigeria-health-facilities.
- “Sub-Saharan Africa HIV Prevalence Geospatial Estimates 2000-2018 | GHDx.” Accessed: Mar. 25, 2024. [Online]. Available: https://ghdx.healthdata.org/record/ihme-data/sub-saharan-africa-hiv-prevalence-geospatial-estimates-2000-2018.
- “Low- and Middle-Income Country Neonatal, Infant, and Under-5 Mortality Geospatial Estimates 2000-2017 | GHDx.” Accessed: Mar. 25, 2024. [Online]. Available: https://ghdx.healthdata.org/record/ihme-data/lmic-under5-mortality-rate-geospatial-estimates-2000-2017.
- “Clustering the US population: observation-weighted k-means | by Carl Anderson | Towards Data Science.” Accessed: Mar. 26, 2024. [Online]. Available: https://towardsdatascience.com/clustering-the-us-population-observation-weighted-k-means-f4d58b370002.
- “View of Population Density Cluster Analysis in DKI Jakarta Province Using K-Means Algorithm.” Accessed: Mar. 26, 2024. [Online]. Available: https://journal-isi.org/index.php/isi/article/view/315/165.
- J. Colston and C. Burgert, “Using Geospatial Analysis to Inform Decision Making in Targeting Health Facility-Based Programs: A Guidance Document.” Accessed: Mar. 25, 2024. [Online]. Available: https://www.measureevaluation.org/resources/publications/ms-14-88.
- U. Onyeonoro, J. U. Onyeonoro, J. Chukwu, C. Nwafor, A. Meka, B. Omotowo, and C. Ogbudebe, “Assessment of the quality of tuberculosis surveillance data in six selected states in Southern Nigeria,” Ann Trop Med Public Health, vol. 8, pp. 94–100, Aug. 2015.
- M. H. Dangisso, D. G. Datiko, and B. Lindtjørn, “Accessibility to tuberculosis control services and tuberculosis programme performance in southern Ethiopia,” Glob Health Action, vol. 8, no. 1, 2015. [CrossRef]
- K. O. Robsky et al., “Spatial distribution of people diagnosed with tuberculosis through routine and active case finding: A community-based study in Kampala, Uganda,” Infect Dis Poverty, vol. 9, no. 1, pp. 1–10, Jun. 2020. [CrossRef]
- D. Shaweno, J. M. Trauer, T. N. Doan, J. T. Denholm, and E. S. McBryde, “Geospatial clustering and modelling provide policy guidance to distribute funding for active TB case finding in Ethiopia,” Epidemics, vol. 36, p. 100470, Sep. 2021. [CrossRef]
- M. Van Gurp et al., “Finding gaps in TB notifications: spatial analysis of geographical patterns of TB notifications, associations with TB program efforts and social determinants of TB risk in Bangladesh, Nepal and Pakistan,” BMC Infect Dis, vol. 20, no. 1, pp. 1–14, Jul. 2020. [CrossRef]
- F. K. C. Ribeiro et al., “Genotypic and Spatial Analysis of Mycobacterium tuberculosis Transmission in a High-Incidence Urban Setting,” Clin Infect Dis, vol. 61, no. 5, p. 758, Sep. 2015. [CrossRef]
- S. M. A. Zaidi et al., “A spatial analysis of TB cases and abnormal X-rays detected through active case-finding in Karachi, Pakistan,” Scientific Reports 2023 13:1, vol. 13, no. 1, pp. 1–11, Jan. 2023. [CrossRef]
- M. B. Brooks et al., “Mapping local hot spots with routine tuberculosis data: A pragmatic approach to identify spatial variability,” PLoS One, vol. 17, no. 3, p. e0265826, Mar. 2022. [CrossRef]
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).