Customer and Product Profitability Analytics for Data‑Driven Financial Planning in U.S. Retail: Integrating Basket-Level Transactions and Accounting Fundamentals

Authors

  • Hailin Zhou Columbia University, NY, USA
  • Yitian Zhang UW-Madison, WI, USA
  • Maoxi Li Fordham University, NY, USA
  • Yibang Liu Baruch College, NY, USA

DOI:

https://doi.org/10.54536/ajise.v5i1.6533

Keywords:

Customer Lifetime Value, Customer Profitability, Financial Planning, Product Profitability, Retail Analytics

Abstract

Retailers operate on razor-thin margins and must leverage data analytics to improve profitability. This study proposes an integrated framework linking customer and product-level profitability analysis with firm-level financial performance for strategic planning. We utilize an open retail transaction dataset of 2,500 U.S. grocery households over two years (the “Complete Journey” data) and public financial statement data (SEC XBRL filings) of a major retail company. We apply descriptive analytics to identify profitable customer segments and products, and we develop machine-learning models (logistic regression, random forests, XGBoost) to predict customer churn and lifetime value. These insights are then connected to corporate financial metrics (e.g. profit margins, return on assets) to evaluate their impact on firm performance. Results show a strong disparity in customer profitability – a small fraction of customers drive a large share of sales (consistent with the Pareto 80/20 rule) – and key products (e.g. staple categories like pasta sauces) contribute disproportionately to revenue. The XGBoost model yields an AUC of ~0.85 in predicting customer churn, enabling targeting of at-risk customers. Scenario analysis indicates that a 5% improvement in customer retention could boost net profit by ~25% (aligning with prior findings that small retention gains can amplify profits significantly). We discuss how these data-driven insights inform financial planning decisions such as budgeting for loyalty programs, product mix optimization, and capital allocation. The study demonstrates the value of integrating granular customer analytics with accounting fundamentals to support strategic financial planning in the retail sector.

Downloads

Download data is not yet available.

References

Bayer, E., Tuli, K. R., & Skiera, B. (2017). Do disclosures of customer metrics lower investors’ and analysts’ uncertainty but hurt firm performance? Journal of Marketing Research, 54(2), 239–259. https://doi.org/10.1509/jmr.14.0185

Boehmke, B. (2025). The Complete Journey User Guide (R package “completejourney” v2.0). Retrieved from CRAN: https://cran.r-project.org/web/packages/completejourney

Chaffey, D. (2020, October 1). Pareto’s 80:20 rule in Marketing – The Pareto principle. Smart Insights (Blog). Retrieved from https://www.smartinsights.com/marketing-planning/marketing-models/paretos-8020-rule-marketing/

Diggs, A., Risner, O., Madrigal, A., Gerzeny, G., & Sitarski, J. (2022). Complete Journey Analysis. (RPubs Publication). Retrieved from https://rstudio-pubs-static.s3.amazonaws.com/952669_d905fdf441ae468a862208d4749d6387.html

dunnhumby. (2019). The Complete Journey [Data set]. 84.51° Source Files. Retrieved from https://www.dunnhumby.com/source-files (Household transactions and demographics for 2,500 U.S. households, 2017–2018)

Eker, O. F. (2021). The Complete Journey: Churn Prediction (GitHub repository). Retrieved from https://github.com/omerfarukeker/The-Complete-Journey

Food Industry Association (FMI). (2024). Food Retailing Industry Speaks – Financial Survey Highlights. Washington, DC: FMI.

Gruca, T. S., & Rego, L. L. (2005). Customer satisfaction, cash flow, and shareholder value. Journal of Marketing, 69(3), 115–130.

Gupta, S., Lehmann, D. R., & Stuart, J. A. (2004). Valuing customers. Journal of Marketing Research, 41(1), 7–18. https://doi.org/10.1509/jmkr.41.1.7.25084

Gupta, S., & Zeithaml, V. (2006). Customer metrics and their impact on financial performance. Marketing Science, 25(6), 718–739. https://doi.org/10.1287/mksc.1060.0221

Kaggle. (2018). US Stocks Fundamentals (XBRL) [Data set]. Retrieved from https://www.kaggle.com/datasets

Kaggle. (2019). Financial Statement Data for Top 200 US Companies [Data set]. Retrieved from https://www.kaggle.com

Kroger Co. (2018). 2017 Annual Report. Cincinnati, OH: Kroger Investor Relations.

Kroger Co. (2019). 2018 Annual Report. Cincinnati, OH: Kroger Investor Relations.

Montgomery, A. L., Li, S., Srinivasan, K., & Liechty, J. (2023). Marketing analytics and big data in accounting (Working Paper). Carnegie Mellon University.

Niraj, R., Gupta, M., & Narasimhan, C. (2001). Customer profitability in a supply chain. Journal of Marketing, 65(3), 1–16. https://doi.org/10.1509/jmkg.65.3.1.18334

Reichheld, F. F., & Sasser, W. E. (1990). Zero defections: Quality comes to services. Harvard Business Review, 68(5), 105–111.

Skiera, B. (2017). Customer analytics in performance measurement and reporting. Accounting Horizons, 31(3), 1–15. https://doi.org/10.2308/acch-51658

Song, T. H., Kim, S. Y., & Kim, J. Y. (2016). The dynamic effect of customer equity across firm growth: The case of online retailers. Journal of Business Research, 69(9), 3755–3764. https://doi.org/10.1016/j.jbusres.2015.12.067

Srinivasan, S., & Hanssens, D. M. (2009). Marketing and firm value: Metrics, methods, findings, and future directions. Journal of Marketing Research, 46(3), 293–312. https://doi.org/10.1509/jmkr.46.3.293

Securities and Exchange Commission (SEC). (2021, Nov 10). The Lessons of Structured Data (Speech by Commissioner C. A. Crenshaw). SEC News. Retrieved from https://www.sec.gov/news/speech/crenshaw-lessons-structured-data-111021

Securities and Exchange Commission (SEC). (2025). Financial Statement Data Sets (2009–2025). Retrieved from https://www.sec.gov/data/financial-statement-data-sets

Xin, Q. (2025). A Deep Reinforcement Learning Approach to Optimizing Cloud Workload Migration. Am. J. Interdiscip. Res. Innov, 4(3), 10–15.

Weissinger, L. (2023). Machine learning in management accounting research: Literature review and pathways for the future. European Accounting Review, 32(3), 611–639. https://doi.org/10.1080/09638180.2022.2137221

Shirakawa, T., Li, Y., Wu, Y., Qiu, S., Li, Y., Zhao, M., Iso, H., & van der Laan, M. (2024). Longitudinal targeted minimum loss-based estimation with temporal-difference heterogeneous transformer. arXiv. https://doi.org/10.48550/arXiv.2404.04399

Downloads

Published

2026-02-13

How to Cite

Zhou, H. ., Zhang, Y. ., Li, M. ., & Liu, Y. . (2026). Customer and Product Profitability Analytics for Data‑Driven Financial Planning in U.S. Retail: Integrating Basket-Level Transactions and Accounting Fundamentals. American Journal of Innovation in Science and Engineering , 5(1), 43-55. https://doi.org/10.54536/ajise.v5i1.6533

Similar Articles

1-10 of 19

You may also start an advanced similarity search for this article.