
Define fallback implementations for mean, var, and entropy #1875

Closed · wants to merge 1 commit

Conversation

@ararslan (Member)

We have the internal `expectation` function that uses `quadgk` to compute integrals, as well as a fallback implementation for `kldivergence` that uses `expectation`, so it seems reasonable to similarly define fallbacks for other quantities trivially computable using `expectation`: `mean`, `var`, and `entropy` (see the sketch below). We could probably do `skewness` and `kurtosis` as well.

To do:

  • Determine the set of functions that should support expectation-based fallbacks
  • Add tests
  • Document that the fallbacks exist

(n.b. I skipped CI on the initial commit purposefully as I haven't given this a ton of thought yet and wanted to see whether anybody was strongly for or against it before doing meaningful work here)
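For concreteness, here is a rough sketch of what such fallbacks could look like inside Distributions.jl, assuming the internal `expectation(f, d)` helper that integrates `f` against the density of `d` via `quadgk`. The `mean` and `var` bodies are illustrative guesses; only the `entropy` definition matches the diff further down.

```julia
# Sketch only, not the PR's exact code (apart from `entropy`).
# Assumes the internal `expectation(f, d)` helper is in scope.
mean(d::UnivariateDistribution)    = expectation(identity, d)
var(d::UnivariateDistribution)     = expectation(x -> abs2(x - mean(d)), d)
entropy(d::UnivariateDistribution) = expectation(x -> -log(pdf(d, x)), d)
```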

@devmotion (Member)

Copied from #1874 (comment):

My worry (also expressed in issues such as #968) is that numerical integration is generally challenging, and a fallback might lead to silently incorrect results. It seems such a fallback would be wrong (or at least problematic) if, for example, the moments are not finite (as for Cauchy).

So my general feeling is that numerical integration should maybe be restricted to a smaller subset of distributions, or maybe even only be available as a separate function. In case we want to use it more broadly, I think it would also be safer to error if the integration error estimate is too large, to reduce the probability of silently incorrect results.
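As an illustration of that last suggestion (the helper name and tolerance below are hypothetical, not part of Distributions.jl), a numerical-expectation fallback could refuse to return a value when the quadrature error estimate is too large:

```julia
using Distributions, QuadGK

# Hypothetical sketch of a "checked" expectation: integrate f against the
# density of d, but raise an error instead of silently returning a value
# when the quadrature error estimate exceeds the tolerance.
function checked_expectation(f, d::ContinuousUnivariateDistribution; rtol = 1e-6)
    val, err = quadgk(x -> f(x) * pdf(d, x), extrema(d)...)
    if err > rtol * max(abs(val), one(val))
        error("numerical expectation did not converge: value $val, error estimate $err")
    end
    return val
end
```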

@@ -214,7 +214,7 @@ skewness(d::UnivariateDistribution)

 Compute the entropy value of distribution `d`.
 """
-entropy(d::UnivariateDistribution)
+entropy(d::UnivariateDistribution) = expectation(x -> -log(pdf(d, x)), d)
Contributor

Suggested change:
-entropy(d::UnivariateDistribution) = expectation(x -> -log(pdf(d, x)), d)
+entropy(d::UnivariateDistribution) = expectation(x -> -logpdf(d, x), d)

pdf can overflow
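For context, a small illustration (not from the thread) of why the log-space form is safer: besides the overflow mentioned above, `pdf` can also underflow to zero in the tails, which makes `-log(pdf(d, x))` infinite while `-logpdf(d, x)` stays finite.

```julia
using Distributions

d = Normal()
pdf(d, 40.0)          # 0.0: the density underflows this far out in the tail
-log(pdf(d, 40.0))    # Inf, which would derail the quadrature
-logpdf(d, 40.0)      # ≈ 800.92, finite and well-behaved
```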

@bvdmitri (Contributor)

Generally I agree with @devmotion. I've hit a similar issue for kldivergence here. In general, it's not a good idea to have a silent approximation method; it's better to let the user decide.

@ararslan (Member, Author)

Yep, fair enough.

@ararslan closed this Jul 26, 2024
@ararslan deleted the aa/fallback-expectation branch July 26, 2024 00:16