Allow covariates in `plot_expected_purchases` #1430

PabloRoque · 2025-01-24T15:45:26Z

Description

Includes covariates when calculation the unconditional frequency expectation. Allows the estimation of future purchases for the "average customer" just after first order for models with covariates.

Additionally:

Introduces a new dataset apparel_trans.csv and accompanying apparel_static_cov.csv. These datasets are extracted from R's CLVTools package.
A new notebook bg_nbd_covariates.ipynb shows an example of the new functionality using the new dataset.

Related Issue

Closes #
Related to Allow static covariates in BGNBDModel #1390. That PR should be merge before this one.

Checklist

Checked that the pre-commit linting/style checks pass. Feel free to comment pre-commit.ci autofix to auto-fix.
Included tests that prove the fix is effective or that the new feature works
Added necessary documentation (docstrings and/or example notebooks) using numpydoc format.
If you are a pro: each commit corresponds to a relevant logical change

📚 Documentation preview 📚: https://pymc-marketing--1430.org.readthedocs.build/en/1430/

…taGeoModel. Add some tests

…thCovariates.test_extract_predictive_covariates

review-notebook-app · 2025-01-24T15:45:31Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

codecov · 2025-01-24T15:50:49Z

Codecov Report

Attention: Patch coverage is 75.00000% with 7 lines in your changes missing coverage. Please review.

Project coverage is 91.51%. Comparing base (f88c98d) to head (9abe758).

Files with missing lines	Patch %	Lines
pymc_marketing/clv/utils.py	75.00%	7 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1430      +/-   ##
==========================================
- Coverage   91.59%   91.51%   -0.08%     
==========================================
  Files          60       60              
  Lines        6782     6802      +20     
==========================================
+ Hits         6212     6225      +13     
- Misses        570      577       +7

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

ColtAllen · 2025-01-24T15:51:06Z

I'll review this in detail later, but it looks like there may be merge conflicts with #1390

PabloRoque · 2025-02-17T09:48:47Z

@ColtAllen Feel free to have a look at this PR whenever you have some spare time. It is ready for review from my side.

ColtAllen · 2025-02-18T23:55:11Z

Introduces a new dataset apparel_trans.csv and accompanying apparel_static_cov.csv. These datasets are extracted from R's CLVTools package.

Is there documentation available for these datasets? I wasn't able to find anything in the R library.

A new notebook bg_nbd_covariates.ipynb shows an example of the new functionality using the new dataset.

Is it possible to consolidate this into the existing BG/NBD notebook?

PabloRoque · 2025-02-19T08:32:35Z

Is there documentation available for these datasets? I wasn't able to find anything in the R library.

Link to apparelTrans and apparelStaticCov

Is it possible to consolidate this into the existing BG/NBD notebook?

Do you propose to add this new dataset, or use the synthetic covariates that we are using in the ParetoNBD notebook?

ColtAllen · 2025-02-21T19:00:54Z

Link to apparelTrans and apparelStaticCov

Nice, so this dataset is simulated, then.

Do you propose to add this new dataset, or use the synthetic covariates that we are using in the ParetoNBD notebook?

The Pareto/NBD notebook is using real data available here. I think using the same dataset would make for a good comparison in resolving #1496. That said, let's just remove the apparel datasets for now and do the notebook consolidation in a separate PR. No need for any additional changes to the dev notebook here unless you want to test visualizations.

ColtAllen

Let's use the covariate data already being saved in a fitted model rather than require the user to add this to the transaction data (see my comment for more details). This should cut down on the additional code/testing requirements and also be more efficient.

One of these days I'd like to refactor _expected_cumulative_transactions to eliminate the FOR loop, but this would require xarray support in the predictive methods.

ColtAllen · 2025-02-21T19:32:32Z

pymc_marketing/clv/utils.py

+    if model.purchase_covariate_cols or model.dropout_covariate_cols:
+        distinct_covariates_cols = list(
+            set(model.purchase_covariate_cols).intersection(
+                set(model.dropout_covariate_cols)
+            )
+        )
+        distinct_covariates = transactions[distinct_covariates_cols].drop_duplicates()
+    else:
+        distinct_covariates_cols = None


There's a customer_id dimension for the covariates in the fitted model idata. Rather than requiring covariates be added to be transaction dataframe, we can obtain the covariates from the model directly and merge them with repeated_and_first_transactions.

juanitorduz · 2025-04-23T13:32:20Z

@PabloRoque can we bring this one to the finish line?

PabloRoque and others added 20 commits January 15, 2025 11:57

Implement ModifiedBetaGeoNBD, ModifiedBetaGeoNBDRV. Modify ModifiedBe…

5f567eb

…taGeoModel. Add some tests

Add test_notimplemented_logp

1897d62

Merge branch 'main' into ModifiedBetaGeoNBDRV

042780e

Sample recency_frequency from the newly introduced RV block

bc83ddc

Add model coords in distribution_new_customers

f0cb25e

Allow covariates in BG/NBD

59d6826

Merge branch 'main' into BGNBD-static-covar

b33b56b

Merge branch 'main' into BGNBD-static-covar

e74745c

Add BetaGeoModel_extract_predictive_variables. Add TestBetaGeoModelWi…

ffced16

…thCovariates.test_extract_predictive_covariates

Add test_logp

02b7c11

Introduce gamma2, gamma3 dropout coefficients

dc7be23

Adapt tests to 3 coefficients. Fix test_logp

dedb36e

Fix test_expectation_method. Fix test_covariate_model_convergence

91d0168

Merge branch 'main' into BGNBD-static-covar

4af6482

Revert explicit dims in RVs

28c15c0

Include dims. Fix test_distribution_method

6ef492f

Increase recency_frequency tolerance

77dad3e

Add tolerance to dropout_covariate tests

857c885

Revert non-centered priors

f796170

Fix plotting for models with covariates

74de286

github-actions bot added docs Improvements or additions to documentation CLV tests labels Jan 24, 2025

PabloRoque mentioned this pull request Jan 24, 2025

Allow static covariates in BGNBDModel #1390

Merged

14 tasks

PabloRoque added 3 commits February 5, 2025 16:45

Merge branch 'main' into allow-covariates-plot-expected-purchases

60372e0

Fix a_scale, b_scale dims

96c2609

Weighted sum over covariates in _expected_cumulative_transactions

00a07a4

Merge branch 'main' into allow-covariates-plot-expected-purchases

fcd187a

PabloRoque marked this pull request as ready for review February 10, 2025 09:25

PabloRoque mentioned this pull request Feb 10, 2025

Allow plot_expected_purchases_pcc in BetaGeoModel and ModifiedBetaGeoModel #1470

Merged

5 tasks

Merge branch 'main' into allow-covariates-plot-expected-purchases

dd8fcc7

PabloRoque requested a review from ColtAllen February 11, 2025 11:55

PabloRoque added 5 commits February 12, 2025 09:56

Merge branch 'main' into allow-covariates-plot-expected-purchases

d31d8d3

Merge branch 'main' into allow-covariates-plot-expected-purchases

4fbc539

Handle no covariates case

cba21c8

Handle no-covariates case

0eaaf4c

Merge branch 'main' into allow-covariates-plot-expected-purchases

064026f

Merge branch 'main' into allow-covariates-plot-expected-purchases

964d6e2

ColtAllen requested changes Feb 21, 2025

View reviewed changes

Merge branch 'main' into allow-covariates-plot-expected-purchases

06437ac

juanitorduz added 2 commits April 25, 2025 19:52

Merge branch 'main' into allow-covariates-plot-expected-purchases

df26a72

Merge branch 'main' into allow-covariates-plot-expected-purchases

9abe758

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow covariates in `plot_expected_purchases` #1430

Allow covariates in `plot_expected_purchases` #1430

PabloRoque commented Jan 24, 2025 •

edited

Loading

Uh oh!

review-notebook-app bot commented Jan 24, 2025

Uh oh!

codecov bot commented Jan 24, 2025 •

edited

Loading

Uh oh!

ColtAllen commented Jan 24, 2025

Uh oh!

PabloRoque commented Feb 17, 2025

Uh oh!

ColtAllen commented Feb 18, 2025

Uh oh!

PabloRoque commented Feb 19, 2025 •

edited

Loading

Uh oh!

ColtAllen commented Feb 21, 2025

Uh oh!

ColtAllen left a comment

Uh oh!

ColtAllen Feb 21, 2025

Uh oh!

juanitorduz commented Apr 23, 2025

Uh oh!

Uh oh!

Allow covariates in plot_expected_purchases #1430

Are you sure you want to change the base?

Allow covariates in plot_expected_purchases #1430

Conversation

PabloRoque commented Jan 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issue

Checklist

Uh oh!

review-notebook-app bot commented Jan 24, 2025

Uh oh!

codecov bot commented Jan 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

ColtAllen commented Jan 24, 2025

Uh oh!

PabloRoque commented Feb 17, 2025

Uh oh!

ColtAllen commented Feb 18, 2025

Uh oh!

PabloRoque commented Feb 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ColtAllen commented Feb 21, 2025

Uh oh!

ColtAllen left a comment

Choose a reason for hiding this comment

Uh oh!

ColtAllen Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

juanitorduz commented Apr 23, 2025

Uh oh!

Uh oh!

Allow covariates in `plot_expected_purchases` #1430

Allow covariates in `plot_expected_purchases` #1430

PabloRoque commented Jan 24, 2025 •

edited

Loading

codecov bot commented Jan 24, 2025 •

edited

Loading

PabloRoque commented Feb 19, 2025 •

edited

Loading