Predicting Food Insecurity Across U.S. Census Tracts: A Machine Learning Analysis Using the USDA Food Access Research Atlas

Authors

  • Oluwatosin Lawal Department of Mathematics Statistical Analytics, Computing and Modeling, Texas, USA
  • Awele Okolie School of Computing and Data Science, Wentworth Institute of Technology, Boston, USA
  • Callistus Obunadike Department of Computer Science and Quantitative Methods, Austin Peay State University, Tennessee, USA
  • Prince Michael Akwabeng Department of Computer Science and Statistics, Austin Peay State University, Tennessee, USA
  • Mark Onons Ikhifa Department of Mathematics and Science Education, Austin Peay State University, Tennessee, USA
  • Paschal Alumona Booth School of Business, University of Chicago, USA

DOI:

https://doi.org/10.54536/ajdsai.v2i1.6285

Keywords:

Food Insecurity, Machine Learning Models, Predictive Modeling, Random Forest, SHAP Interpretability, Socioeconomic Determinants, USDA Food Access Research Atlas, Xgboost

Abstract

Food insecurity still poses a serious public-health and social-equity problem in the United States. The USDA Food Access Research Atlas (N = 72,531 census tracts) served as the basis for this study, which not only created but also evaluated machine-learning models to predict the level of food insecurity in a certain tract, which is determined by the lack of supermarkets being accessible to low-income people. The Logistic Regression, Random Forest, and XGBoost classifiers went through training and standard metric comparison. The tree ensemble models (Random Forest and XGBoost) reached remarkable performance (accuracy ≈ 97%, ROC-AUC ≈ 0.99) far above the logistic regression baseline (ROC-AUC ≈ 0.89). SHAP-based model interpretability recognized the poverty rate, median family income, SNAP participation, and vehicle access as the most critical determinants of food insecurity. These results affirm the value of interpretable machine learning in revealing important socioeconomic factors that contribute to food access inequalities, thereby providing a basis for data-informed interventions. The entire analytic process made use of publicly accessible national data, so the results can be reproduced, and future research and policy applications made more transparent.

References

Almalki, A., Brown, G., & Wong, C. (2021). Geospatial and machine learning regression techniques for food access analysis. ISPRS International Journal of Geo-Information, 10(11), 745. https://doi.org/10.3390/ijgi10110745

Amin, M. D., McCluskey, J., & Syed, S. (2020). Predicting access to healthful food retailers with machine learning at the U.S. census tract scale. Applied Geography, 114, 102053. https://pmc.ncbi.nlm.nih.gov/articles/PMC7564312/

Beaulac, J., Kristjansson, E., & Cummins, S. (2009). A systematic review of food deserts, 1966–2007. Preventing Chronic Disease, 6(3), A105. https://stacks.cdc.gov/view/cdc/20366/

Bitto, E. A., Oakland, M. J., & Sand, M. (2003). Solving the problems of Iowa food deserts: Food insecurity and civic structure. Rural Sociology, 68(3), 371–384. https://doi.org/10.1526/0036011053294628

Bronfenbrenner, U. (1979). The ecology of human development: Experiments by nature and design. Harvard University Press.

Chilukuri, J. (2023). Predicting low food access: An empirical assessment of tree-based machine learning models [Preprint]. TechRxiv. https://doi.org/10.36227/techrxiv.16820688.v1

Coleman-Jensen, A., Rabbitt, M. P., Gregory, C. A., & Singh, A. (2021). Household food security in the United States in 2020 (Report No. ERR-298). U.S. Department of Agriculture, Economic Research Service. https://www.ers.usda.gov/webdocs/publications/102076/err-298.pdf

Esmalian, N. A., Coleman, N., Yuan, F., Xiao, X., & Mostafavi, A. (2022). Characterizing equitable access to grocery stores during disasters using location-based data [Preprint]. arXiv. https://arxiv.org/abs/2201.00745

Food and Agriculture Organization of the United Nations. (2026). FAO – Food and Agriculture Organization of the United Nations. https://www.fao.org

Gundersen, C., & Ziliak, J. P. (2021). Food insecurity and health outcomes. Health Affairs, 34(11), 1830–1839. https://pubmed.ncbi.nlm.nih.gov/26526240/

Hu, Y., Yu, Z., & Chen, X. (2022). Integrating predictive analytics with social vulnerability assessment for food insecurity research. Computers, Environment and Urban Systems, 94, 101787. https://doi.org/10.1016/j.compenvurbsys.2022.101787

Jiao, J., Moudon, A. V., Ulmer, J., Hurvitz, P. M., & Drewnowski, A. (2012). How to identify food deserts: Measuring physical and economic access to supermarkets in King County, Washington. American Journal of Public Health, 102(10), e32–e39. https://doi.org/10.2105/AJPH.2012.300675

Kapoor, S., & Sayer, S. (2024). From bytes to bites: Using country-specific machine learning models to predict famine and food insecurity [Preprint]. arXiv. https://arxiv.org/pdf/2409.09980

Kwan, M. P. (2021). Geospatial analysis and data science in public health research. Annals of GIS, 27(4), 341–354. https://doi.org/10.1080/19475683.2021.1970269

Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems (Vol. 30). https://arxiv.org/abs/1705.07874

Marmot, M., & Wilkinson, R. (2005). Social determinants of health (2nd ed.). Oxford University Press.

Salari, M., Kramer, M. R., Reyna, M. A., Taylor, H. A., & Clifford, G. D. (2023). Combining crowd-sourcing, census data, and public review forums for real-time, high-resolution food desert estimation. BioMedical Engineering OnLine, 22, 69. https://doi.org/10.1186/s12938-023-01108-9

Smith, M. D. (2020). Food insecurity, gender, and international migration in low-income countries. Food Policy, 91, 101838. https://doi.org/10.1016/j.foodpol.2020.101838

Tango, P. (2023). On the forecastability of food insecurity. Scientific Reports, 13, 4309. https://pmc.ncbi.nlm.nih.gov/articles/PMC10038988/

Walker, R. E., Keane, C. R., & Burke, J. G. (2010). Disparities and access to healthy food in the United States: A review of food deserts literature. Health & Place, 16(5), 876–884. https://doi.org/10.1016/j.healthplace.2010.04.013

Downloads

Published

2026-01-22

How to Cite

Lawal, O. ., Okolie, A., Obunadike, C. ., Akwabeng, P. M. ., Ikhifa, M. O. ., & Alumona, P. . (2026). Predicting Food Insecurity Across U.S. Census Tracts: A Machine Learning Analysis Using the USDA Food Access Research Atlas. American Journal of Data Science and Artificial Intelligence, 2(1), 1-14. https://doi.org/10.54536/ajdsai.v2i1.6285