Customer and Product Profitability Analytics for Data‑Driven Financial Planning in U.S. Retail: Integrating Basket-Level Transactions and Accounting Fundamentals
DOI:
https://doi.org/10.54536/ajise.v5i1.6533Keywords:
Customer Lifetime Value, Customer Profitability, Financial Planning, Product Profitability, Retail AnalyticsAbstract
Retailers operate on razor-thin margins and must leverage data analytics to improve profitability. This study proposes an integrated framework linking customer and product-level profitability analysis with firm-level financial performance for strategic planning. We utilize an open retail transaction dataset of 2,500 U.S. grocery households over two years (the “Complete Journey” data) and public financial statement data (SEC XBRL filings) of a major retail company. We apply descriptive analytics to identify profitable customer segments and products, and we develop machine-learning models (logistic regression, random forests, XGBoost) to predict customer churn and lifetime value. These insights are then connected to corporate financial metrics (e.g. profit margins, return on assets) to evaluate their impact on firm performance. Results show a strong disparity in customer profitability – a small fraction of customers drive a large share of sales (consistent with the Pareto 80/20 rule) – and key products (e.g. staple categories like pasta sauces) contribute disproportionately to revenue. The XGBoost model yields an AUC of ~0.85 in predicting customer churn, enabling targeting of at-risk customers. Scenario analysis indicates that a 5% improvement in customer retention could boost net profit by ~25% (aligning with prior findings that small retention gains can amplify profits significantly). We discuss how these data-driven insights inform financial planning decisions such as budgeting for loyalty programs, product mix optimization, and capital allocation. The study demonstrates the value of integrating granular customer analytics with accounting fundamentals to support strategic financial planning in the retail sector.
Downloads
References
Bayer, E., Tuli, K. R., & Skiera, B. (2017). Do disclosures of customer metrics lower investors’ and analysts’ uncertainty but hurt firm performance? Journal of Marketing Research, 54(2), 239–259. https://doi.org/10.1509/jmr.14.0185
Boehmke, B. (2025). The Complete Journey User Guide (R package “completejourney” v2.0). Retrieved from CRAN: https://cran.r-project.org/web/packages/completejourney
Chaffey, D. (2020, October 1). Pareto’s 80:20 rule in Marketing – The Pareto principle. Smart Insights (Blog). Retrieved from https://www.smartinsights.com/marketing-planning/marketing-models/paretos-8020-rule-marketing/
Diggs, A., Risner, O., Madrigal, A., Gerzeny, G., & Sitarski, J. (2022). Complete Journey Analysis. (RPubs Publication). Retrieved from https://rstudio-pubs-static.s3.amazonaws.com/952669_d905fdf441ae468a862208d4749d6387.html
dunnhumby. (2019). The Complete Journey [Data set]. 84.51° Source Files. Retrieved from https://www.dunnhumby.com/source-files (Household transactions and demographics for 2,500 U.S. households, 2017–2018)
Eker, O. F. (2021). The Complete Journey: Churn Prediction (GitHub repository). Retrieved from https://github.com/omerfarukeker/The-Complete-Journey
Food Industry Association (FMI). (2024). Food Retailing Industry Speaks – Financial Survey Highlights. Washington, DC: FMI.
Gruca, T. S., & Rego, L. L. (2005). Customer satisfaction, cash flow, and shareholder value. Journal of Marketing, 69(3), 115–130.
Gupta, S., Lehmann, D. R., & Stuart, J. A. (2004). Valuing customers. Journal of Marketing Research, 41(1), 7–18. https://doi.org/10.1509/jmkr.41.1.7.25084
Gupta, S., & Zeithaml, V. (2006). Customer metrics and their impact on financial performance. Marketing Science, 25(6), 718–739. https://doi.org/10.1287/mksc.1060.0221
Kaggle. (2018). US Stocks Fundamentals (XBRL) [Data set]. Retrieved from https://www.kaggle.com/datasets
Kaggle. (2019). Financial Statement Data for Top 200 US Companies [Data set]. Retrieved from https://www.kaggle.com
Kroger Co. (2018). 2017 Annual Report. Cincinnati, OH: Kroger Investor Relations.
Kroger Co. (2019). 2018 Annual Report. Cincinnati, OH: Kroger Investor Relations.
Montgomery, A. L., Li, S., Srinivasan, K., & Liechty, J. (2023). Marketing analytics and big data in accounting (Working Paper). Carnegie Mellon University.
Niraj, R., Gupta, M., & Narasimhan, C. (2001). Customer profitability in a supply chain. Journal of Marketing, 65(3), 1–16. https://doi.org/10.1509/jmkg.65.3.1.18334
Reichheld, F. F., & Sasser, W. E. (1990). Zero defections: Quality comes to services. Harvard Business Review, 68(5), 105–111.
Skiera, B. (2017). Customer analytics in performance measurement and reporting. Accounting Horizons, 31(3), 1–15. https://doi.org/10.2308/acch-51658
Song, T. H., Kim, S. Y., & Kim, J. Y. (2016). The dynamic effect of customer equity across firm growth: The case of online retailers. Journal of Business Research, 69(9), 3755–3764. https://doi.org/10.1016/j.jbusres.2015.12.067
Srinivasan, S., & Hanssens, D. M. (2009). Marketing and firm value: Metrics, methods, findings, and future directions. Journal of Marketing Research, 46(3), 293–312. https://doi.org/10.1509/jmkr.46.3.293
Securities and Exchange Commission (SEC). (2021, Nov 10). The Lessons of Structured Data (Speech by Commissioner C. A. Crenshaw). SEC News. Retrieved from https://www.sec.gov/news/speech/crenshaw-lessons-structured-data-111021
Securities and Exchange Commission (SEC). (2025). Financial Statement Data Sets (2009–2025). Retrieved from https://www.sec.gov/data/financial-statement-data-sets
Xin, Q. (2025). A Deep Reinforcement Learning Approach to Optimizing Cloud Workload Migration. Am. J. Interdiscip. Res. Innov, 4(3), 10–15.
Weissinger, L. (2023). Machine learning in management accounting research: Literature review and pathways for the future. European Accounting Review, 32(3), 611–639. https://doi.org/10.1080/09638180.2022.2137221
Shirakawa, T., Li, Y., Wu, Y., Qiu, S., Li, Y., Zhao, M., Iso, H., & van der Laan, M. (2024). Longitudinal targeted minimum loss-based estimation with temporal-difference heterogeneous transformer. arXiv. https://doi.org/10.48550/arXiv.2404.04399
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Hailin Zhou, Yitian Zhang, Maoxi Li, Yibang Liu

This work is licensed under a Creative Commons Attribution 4.0 International License.