Added sampling methods of crv_types #18754

Smit-create · 2020-03-01T22:38:29Z

Added sampling methods of continuous random variables

References to other Issues or PRs

Closes #17057
Related to #19061

Brief description of what is fixed or changed

Other comments

Release Notes

stats
- Added sampling methods for continuous variables
- Added library option in sample
- sample returns an iterator object since version 1.7

sympy-bot · 2020-03-01T22:38:39Z

✅

Hi, I am the SymPy bot (v158). I'm here to help you write a release notes entry. Please read the guide on how to write release notes.

Your release notes are in good order.

Here is what the release notes will look like:

stats
- Added sampling methods for continuous variables (#18754 by @Smit-create)
- Added library option in sample (#18754 by @Smit-create)
- sample returns an iterator object since version 1.7 (#18754 by @Smit-create)

This will be added to https://github.com/sympy/sympy/wiki/Release-Notes-for-1.7.

Note: This comment will be updated with the latest check if you edit the pull request. You need to reload the page to see it.

Click here to see the pull request description that was parsed.

<!-- Your title above should be a short description of what
was changed. Do not include the issue number in the title. -->
Added sampling methods of continuous random variables
#### References to other Issues or PRs
<!-- If this pull request fixes an issue, write "Fixes #NNNN" in that exact
format, e.g. "Fixes #1234" (see
https://tinyurl.com/auto-closing for more information). Also, please
write a comment on that issue linking back to this pull request once it is
open. -->
Closes #17057 
Related to #19061
#### Brief description of what is fixed or changed


#### Other comments


#### Release Notes

<!-- Write the release notes for this release below. See
https://github.com/sympy/sympy/wiki/Writing-Release-Notes for more information
on how to write release notes. The bot will check your release notes
automatically to see if they are formatted correctly. -->

<!-- BEGIN RELEASE NOTES -->
* stats
   * Added sampling methods for continuous variables
   * Added `library` option in `sample`
   *  `sample` returns an iterator object since version 1.7
<!-- END RELEASE NOTES -->

Update

The release notes on the wiki have been updated.

Smit-create · 2020-03-03T13:05:16Z

Please review.

Smit-create · 2020-03-04T05:09:20Z

This is ready for review. @czgdp1807

sympy/stats/crv.py

czgdp1807 · 2020-03-04T09:44:33Z

The diff coverage is quite low. Can you add more tests or modify current tests to or merge master to increase the coverage?

Smit-create · 2020-03-04T10:53:57Z

Code coverage now increases to 100%

Smit-create · 2020-03-04T16:31:28Z

@czgdp1807 Does it look good to go?

czgdp1807 · 2020-03-05T11:12:45Z

LGTM. Will merge if no one raises objections.

sympy/stats/crv.py

sympy/stats/crv_types.py

oscarbenjamin · 2020-03-05T11:34:24Z

The code here is very repetitive. It would be better to make a data structure that stores the different functions like in lambdify and the code printers.

Would it make more sense to handle this as part of lambdify and the code printers?

Smit-create · 2020-03-05T20:52:02Z

Would it make more sense to handle this as part of lambdify and the code printers?

I will look into this suggestion.

oscarbenjamin · 2020-03-08T13:06:37Z

It probably makes more sense to group the code together in one class for each of the libraries and have a mapping from sympy types to the other library types rather than having each of the sympy classes repeat the boiler plate for calling into the external library. That is what I mean by saying that it is repetitive. If we have m other libraries and k sympy distribution classes then we will end up having to add m*k methods for connecting them all. Since m is smaller than k it makes more sense to have m classes for the output each of which can have a mapping of size k that efficiently describes how to connect between them.

Smit-create · 2020-03-08T13:23:10Z

Thanks! @oscarbenjamin, I get your point. I will think of some way to work upon that.

Smit-create · 2020-03-08T15:43:56Z

I have designed another way. I will explain with an example of sample_scipy.

We will create 5 dictionaries, one which will import the corresponding random variable from scipy, other four will map scipy arguments with sympy arguments. Generally, scipy has 4 main arguments, i.e, a , b, loc, scale. The other four dictionaries will map these arguments using attributes of the corresponding class of random variable.
Then, finally, return the sample using mapped random variable from first dictionary and arguments from other 4 dictionaries.

Smit-create · 2020-03-10T17:33:41Z

I have designed a function that works fine with scipy:


def _sample_scipy(dist, size):

    dist_list =  ['BetaDistribution', 'BetaPrimeDistribution',
    'CauchyDistribution', 'ChiDistribution', 'ChiSquaredDistribution',
    'ExponentialDistribution', 'GammaDistribution', 'GammaInverseDistribution',
    'LogNormalDistribution', 'NormalDistribution', 'GaussianInverseDistribution',
    'ParetoDistribution', 'UniformDistribution']

    if dist.__class__.__name__ not in dist_list:
        return None
    from scipy.stats import (beta, betaprime, cauchy, chi, chi2, expon, gamma,
                             invgamma, lognorm, norm, invgauss, pareto, uniform)

    scipy_rv_map = {
            'BetaDistribution': lambda dist, size: beta.rvs(a=float(dist.a), b=float(dist.b),
                                    size=size),
            'BetaPrimeDistribution':lambda dist, size: betaprime.rvs(a=float(dist.alpha),
                                    b=float(dist.beta), size=size),
            'CauchyDistribution': lambda dist, size: cauchy.rvs(loc=float(dist.x0),
                                    scale=float(dist.gamma), size=size),
            'ChiDistribution': lambda dist, size: chi.rvs(df=float(dist.k), size=size),
            'ChiSquaredDistribution': lambda dist, size: chi2.rvs(df=float(dist.k), size=size),
            'ExponentialDistribution': lambda dist, size: expon.rvs(loc=0, scale=1/float(dist.rate),
                                    size=size),
            'GammaDistribution': lambda dist, size: gamma.rvs(a=float(dist.k), loc=0,
                                    scale=float(dist.theta), size=size),
            'GammaInverseDistribution': lambda dist, size: invgamma.rvs(a=float(dist.a), loc=0,
                                    scale=float(dist.b), size=size),
            'LogNormalDistribution': lambda dist, size: lognorm.rvs(s=std, loc=0,
                                    scale=exp(float(dist.mean), size=size)),
            'NormalDistribution': lambda dist, size: norm.rvs(float(dist.mean), float(dist.std), size=size),
            'GaussianInverseDistribution': lambda dist, size: invgauss.rvs(
                mu=float(dist.mean)/float(dist.shape), scale=float(dist.shape), size=size),
            'ParetoDistribution': lambda dist, size: pareto.rvs(b=float(dist.alpha),
                                    scale=float(dist.xm), size=size),
            'UniformDistribution': lambda dist, size: uniform.rvs(loc=float(dist.left),
                            scale=float(dist.right)-float(dist.left), size=size),
        }
    return scipy_rv_map[dist.__class__.__name__](dist, size)

This will remove all _sample_scipy methods from individual classes, and just a single function will be used.
@oscarbenjamin @czgdp1807 Does this look good to commit for all libraries?

oscarbenjamin · 2020-03-10T17:38:23Z

That looks better. There's probably a way to organise this nicely into one class for each external library and perhaps factor some code out to a base class.

Smit-create · 2020-04-24T09:13:28Z

Can anyone please have a look at the failing test https://travis-ci.org/github/sympy/sympy/jobs/677325425?

doc/src/modules/stats.rst

Smit-create · 2020-04-28T05:06:45Z

Please review this.

sympy/stats/tests/test_continuous_rv.py

czgdp1807 · 2020-04-28T05:57:31Z

LGTM. Ready for merge.

Smit-create · 2020-05-02T14:49:02Z

@czgdp1807 I think we can merge this if no-one has objection?

czgdp1807 · 2020-05-02T17:12:52Z

Let's merge this after, 1.6 branch is created. Some public API change is made in this PR, and various changes made in the coming months to the sampling APIs, so 1.7 will be more appropriate to have this change.

sympy/stats/rv.py

czgdp1807 · 2020-05-07T07:50:16Z

Finally, it's good to go in. Will merge it by tonight.

Upabjojr · 2020-05-10T18:22:18Z

sympy/stats/frv.py

        return FinitePSpace(domain, density)

-    def sample(self, size=()):
+    def sample(self, size=(1,), library='scipy'):


What is the purpose of substituting () with (1,) or 1 ? The tuple notation should represent the dimensions of an N-dimensional array you want to be returned.

added sampling of crv

6f7b569

Smit-create added 3 commits March 2, 2020 11:42

corrected tests

de9c0d7

corrected tests

75e24d4

code quality

9059eed

sylee957 added the stats label Mar 3, 2020

size correction in python

7b5a955

Smit-create changed the title ~~[WIP] Added sampling methods of crv_types~~ Added sampling methods of crv_types Mar 4, 2020

space correction

9f78580

czgdp1807 reviewed Mar 4, 2020

View reviewed changes

sympy/stats/crv.py Outdated Show resolved Hide resolved

czgdp1807 mentioned this pull request Mar 4, 2020

[WIP] Add _sample_python methods #17057

Closed

3 tasks

code cov

5bcec24

Smit-create requested a review from czgdp1807 March 5, 2020 11:06

czgdp1807 reviewed Mar 5, 2020

View reviewed changes

sympy/stats/crv.py Outdated Show resolved Hide resolved

sympy/stats/crv_types.py Outdated Show resolved Hide resolved

remove if elif

aa53e11

Smit-create added 2 commits March 12, 2020 02:30

changed sampling design

41c8ead

remove unused imports

d739ee1

Smit-create added 6 commits April 14, 2020 10:56

skip if scipy not found

1c48592

xfail samplingE

2415dff

Merge branch 'master' into sample_crv

28c7a48

remove docs for deleted functions

888a9a6

typo

647f6d2

use external libraries for sample

5149cc6

czgdp1807 reviewed Apr 25, 2020

View reviewed changes

doc/src/modules/stats.rst Show resolved Hide resolved

Smit-create added 2 commits April 25, 2020 12:30

Merge branch 'master' into sample_crv

0d0c057

remove import_module

0340d7d

czgdp1807 reviewed Apr 28, 2020

View reviewed changes

sympy/stats/tests/test_continuous_rv.py Show resolved Hide resolved

skip at function start

d54d486

czgdp1807 reviewed May 2, 2020

View reviewed changes

sympy/stats/rv.py Show resolved Hide resolved

czgdp1807 reviewed May 2, 2020

View reviewed changes

sympy/stats/rv.py Show resolved Hide resolved

czgdp1807 added the GSoC label May 5, 2020

Smit-create added 2 commits May 6, 2020 12:03

add deprecated warnings

c9ede55

Merge branch 'master' into sample_crv

14a5441

czgdp1807 reviewed May 6, 2020

View reviewed changes

sympy/stats/rv.py Outdated Show resolved Hide resolved

czgdp1807 reviewed May 6, 2020

View reviewed changes

sympy/stats/rv.py Outdated Show resolved Hide resolved

czgdp1807 reviewed May 6, 2020

View reviewed changes

sympy/stats/rv.py Show resolved Hide resolved

add warnings for sample

5370976

czgdp1807 merged commit 6c4df17 into sympy:master May 7, 2020

Upabjojr reviewed May 10, 2020

View reviewed changes

Smit-create mentioned this pull request May 17, 2020

[GSoC] Sampling from external libraries for all RVs #19342

Merged

czgdp1807 mentioned this pull request Nov 26, 2020

Stats: refactory of sampling methods #20494

Merged

Uh oh!

Added sampling methods of crv_types #18754

Added sampling methods of crv_types #18754

Uh oh!

Conversation

Smit-create commented Mar 1, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

References to other Issues or PRs

Brief description of what is fixed or changed

Other comments

Release Notes

Uh oh!

sympy-bot commented Mar 1, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Smit-create commented Mar 3, 2020

Uh oh!

Smit-create commented Mar 4, 2020

Uh oh!

Uh oh!

czgdp1807 commented Mar 4, 2020

Uh oh!

Smit-create commented Mar 4, 2020

Uh oh!

Smit-create commented Mar 4, 2020

Uh oh!

czgdp1807 commented Mar 5, 2020

Uh oh!

Uh oh!

Uh oh!

oscarbenjamin commented Mar 5, 2020

Uh oh!

Smit-create commented Mar 5, 2020

Uh oh!

oscarbenjamin commented Mar 8, 2020

Uh oh!

Smit-create commented Mar 8, 2020

Uh oh!

Smit-create commented Mar 8, 2020

Uh oh!

Smit-create commented Mar 10, 2020

Uh oh!

oscarbenjamin commented Mar 10, 2020

Uh oh!

Smit-create commented Apr 24, 2020

Uh oh!

Uh oh!

Smit-create commented Apr 28, 2020

Uh oh!

Uh oh!

czgdp1807 commented Apr 28, 2020

Uh oh!

Smit-create commented May 2, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

czgdp1807 commented May 2, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

czgdp1807 commented May 7, 2020

Uh oh!

Upabjojr May 10, 2020

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Smit-create commented Mar 1, 2020 •

edited

Loading

sympy-bot commented Mar 1, 2020 •

edited

Loading

Smit-create commented May 2, 2020 •

edited

Loading