Implemented credibility_interval() #188
base: master
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅

@@ Coverage Diff @@
##            master      #188   +/-  ##
=========================================
  Coverage   100.00%   100.00%
=========================================
  Files           36        36
  Lines         3058      3127    +69
=========================================
+ Hits          3058      3127    +69

☔ View full report in Codecov by Sentry.
Excellent work @Stefan-Heimersheim. The changes are only cosmetic/refactoring, so this should be good to go soon. Very thought provoking.

The sorting breaks with merged data sets, as in #189. Edit: This does not seem to be an issue anymore, it works.
Re CI/lint flake8: not really sure what's causing flake8 to report an error here. Otherwise this should be good to go; only the quantile treatment isn't merged between `quantile` and `credibility_interval`.
I've unified the CDF functions between quantile and credibility_interval. My weighted ECDF was slightly different from yours (defined so that the min sample has CDF 0, the max sample has CDF 1, and the intermediate points take a symmetric average of the weights on either side). This coincides with the unweighted form and is symmetric, but I also have some ideas on how to extend it so that the bounds are appropriately slightly larger than the min and max.
I also added some tests so that the coverage is 100%.
If you're happy with these changes please squash and merge.
Best,
Will
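For concreteness, here is a minimal sketch of the symmetric weighted ECDF described above (the function name and structure are illustrative, not the exact code merged into anesthetic):

```python
import numpy as np
from scipy.interpolate import interp1d


def symmetric_wecdf(data, weights):
    """Weighted ECDF with CDF(min)=0 and CDF(max)=1; intermediate points
    accumulate the symmetric average of the weights on either side."""
    i = np.argsort(data)
    data, weights = data[i], weights[i]
    cdf = np.concatenate([[0.], weights[1:] + weights[:-1]]).cumsum()
    cdf /= cdf[-1]                      # normalise so the last point is exactly 1
    return interp1d(data, cdf)
```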
Excellent plot! I've done some investigating, and these really are second order effects. What we should be most concerned about is whether cdf(b)-cdf(a) approximates the true probability mass between a and b.

```python
import numpy as np
from scipy.interpolate import interp1d
from scipy.stats import norm
import matplotlib.pyplot as plt

# Generate a dataset
np.random.seed(3)
data = np.random.randn(30)
weights = np.random.rand(30)
i = np.argsort(data)
data = data[i]
weights = weights[i]/weights.sum()

# Symmetric empirical CDF
w_ = np.concatenate([[0.], weights[1:] + weights[:-1]]).cumsum()
w_ /= w_[-1]
sym = interp1d(data, w_)

# Left hand empirical CDF
lh = interp1d(data, np.cumsum(weights))

# Right hand empirical CDF
rh = interp1d(data, 1 - np.cumsum(weights[::-1])[::-1])

plt.plot(data, sym(data), label='Symmetric')
plt.plot(data, lh(data), label='Left Hand')
plt.plot(data, rh(data), label='Right Hand')
plt.plot(data, norm.cdf(data), label='True CDF')
plt.xlabel('x')
plt.ylabel('CDF')
plt.tight_layout()
plt.savefig('CDF.png')
plt.close()
```

So we see that all perform pretty equivalently relative to the correct answer. Increasing the number of samples improves performance, but the three empirical curves lie on top of one another; it is the difference from the true answer that enters at lower order. Examining histograms of the performance relative to the true answer shows them all to perform in an unbiased way:

```python
W, SYM, LH, RH, T = [], [], [], [], []
for _ in range(10000):
    a, b = np.sort(np.random.uniform(data.min(), data.max(), size=2))
    w = weights[(data < b) & (data > a)].sum()
    W.append(w)
    T.append(norm.cdf(b) - norm.cdf(a))
    SYM.append(sym(b) - sym(a))
    LH.append(lh(b) - lh(a))
    RH.append(rh(b) - rh(a))
W, SYM, LH, RH, T = np.array([W, SYM, LH, RH, T])

plt.hist(W - T, bins=100)
plt.hist(SYM - T, bins=100)
plt.hist(LH - T, bins=100, alpha=0.8)
plt.hist(RH - T, bins=100, alpha=0.8)
plt.tight_layout()
plt.savefig('hist.png')
```

Increasing to 1000 points just reduces the spread, but not the difference between the methods. Unless you can find a stress test which shows any of these to be better or worse than the others, I'd vote for the symmetric one.
…`method` in `credibility_interval` method
Thanks @lukashergt for implementing the MultiIndex! That's everything done; as you suggested I'll take a look at the code tonight and we can get this merged.
I think this is fantastic, thanks for implementing the extension! The only unintuitive case is calling it for a single variable: you have to do
rather than
but our other functions like
@williamjameshandley, since @Stefan-Heimersheim and I have both added functionality here, would you like to review?
anesthetic/utils.py (outdated):

```python
# Set the last element (tested to be approx 1)
# to exactly 1 to avoid interpolation errors
cdf[-1] = 1
```
I would strongly prefer for there not to be an assert statement (which the normalisation would fix). This kind of thing would be infuriating as part of a large automated pipeline where floating point errors derail a larger workflow.
I agree, and we do not need to check this. This really just checks whether the NumPy dirichlet function is correct, and we don't need to be checking NumPy here.
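As a rough sketch of the suggested alternative (illustrative only, not necessarily the code that was merged), normalising the CDF pins its final value to exactly 1, so no assert is needed to guard against floating-point drift:

```python
import numpy as np

weights = np.random.rand(100)   # stand-in weights, purely illustrative
cdf = np.cumsum(weights)
cdf /= cdf[-1]                  # cdf[-1] is now exactly 1.0; no assert required
```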
Anything else for this PR, @williamjameshandley?
Were there any other changes needed to merge this?
Description

Added a `credibility_interval` method to `MCMCSamples`/`NestedSamples` which computes the credibility / confidence interval.

Fixes #178 and replaces #179

Notes:

- Using `np.cumsum`, the value of the CDF at the first data point is `== its weight > 0`, but at the last data point it is `== 1`. This should be negligible for `weights << 1`, but for significant weights, i.e. small sample sizes, this can have an effect.
- `confidence_interval` … iso-probability intervals, but rely on the same code
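For illustration, a hypothetical usage sketch of the new method (the toy data, column names, and the exact call signature are assumptions, not necessarily the final API):

```python
import numpy as np
from anesthetic import MCMCSamples

# Toy, unweighted samples purely for illustration
data = np.random.randn(1000, 2)
samples = MCMCSamples(data=data, columns=['x0', 'x1'])

# Assumed call: a 68% credibility interval for parameter 'x0'
interval = samples.credibility_interval('x0', 0.68)
print(interval)
```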
Checklist:

- `flake8 anesthetic tests`
- `pydocstyle --convention=numpy anesthetic`
- `python -m pytest`
- feature works