REGR: fix rank algo for read-only data #37439

jorisvandenbossche · 2020-10-27T08:05:41Z

jreback · 2020-10-27T12:49:21Z

hmm i think this is actually failing see the traivs CI: https://travis-ci.org/github/pandas-dev/pandas/jobs/739214922

>   ranked_mat[:, i] = rank_1d(mat[:, i])
E   TypeError: Argument 'in_arr' has incorrect type (expected numpy.ndarray, got pandas._libs.algos._memoryviewslice)
pandas/_libs/algos.pyx:347: TypeError
_ TestDataFrameCorr.test_corr_nullable_integer[spearman-other_column2-nullable_column1] _
[gw0] linux -- Python 3.7.9 /home/travis/miniconda3/envs/pandas-dev/bin/python
self = <pandas.tests.frame.methods.test_cov_corr.TestDataFrameCorr object at 0x7fcbc4f5a4d0>
nullable_column = <IntegerArray>
[1, 2, <NA>]
Length: 3, dtype: Int64
other_column = array([ 1.,  2., nan]), method = 'spearman'
    @td.skip_if_no_scipy
    @pytest.mark.parametrize(
        "nullable_column", [pd.array([1, 2, 3]), pd.array([1, 2, None])]
    )
    @pytest.mark.parametrize(
        "other_column",
        [pd.array([1, 2, 3]), np.array([1.0, 2.0, 3.0]), np.array([1.0, 2.0, np.nan])],
    )
    @pytest.mark.parametrize("method", ["pearson", "spearman", "kendall"])
    def test_corr_nullable_integer(self, nullable_column, other_column, method):
        # https://github.com/pandas-dev/pandas/issues/33803
        data = DataFrame({"a": nullable_column, "b": other_column})
>       result = data.corr(method=method)
pandas/tests/frame/methods/test_cov_corr.py:190: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
pandas/core/frame.py:8300: in corr
    correl = libalgos.nancorr_spearman(mat, minp=min_periods)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
>   ranked_mat[:, i] = rank_1d(mat[:, i])
E   TypeError: Argument 'in_arr' has incorrect type (expected numpy.ndarray, got pandas._libs.algos._memoryviewslice)

jbrockmendel · 2020-10-27T15:31:42Z

looks like you need to update the types in nancorr_spearman and rank_2d (which call rank_1d)

jorisvandenbossche · 2020-10-27T20:51:46Z

Thanks for the note, indeed needed to update some other places where rank_1d is called

jreback · 2020-10-27T22:09:10Z

thanks @jorisvandenbossche

jreback · 2020-10-28T02:23:25Z

@meeseeksdev backport 1.1.x

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

REGR: fix rank algo for read-only data

01018d1

jorisvandenbossche added Regression Functionality that used to work in a prior pandas version Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff labels Oct 27, 2020

jorisvandenbossche added this to the 1.1.4 milestone Oct 27, 2020

jorisvandenbossche mentioned this pull request Oct 27, 2020

BUG: rank raises error with read-only data #37290

Closed

3 tasks

update call sites of rank_1d

1c6248e

jreback merged commit 9c5500e into pandas-dev:master Oct 27, 2020

meeseeksmachine mentioned this pull request Oct 28, 2020

Backport PR #37439 on branch 1.1.x (REGR: fix rank algo for read-only data) #37459

Merged

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Oct 28, 2020

Backport PR pandas-dev#37439: REGR: fix rank algo for read-only data

782ddee

jorisvandenbossche deleted the gh-37290-rank-readonly branch October 28, 2020 07:23

jorisvandenbossche added a commit that referenced this pull request Oct 28, 2020

Backport PR #37439: REGR: fix rank algo for read-only data (#37459)

dc39ee2

Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>

kesmit13 pushed a commit to kesmit13/pandas that referenced this pull request Nov 2, 2020

REGR: fix rank algo for read-only data (pandas-dev#37439)

f57b0ea

ukarroum pushed a commit to ukarroum/pandas that referenced this pull request Nov 2, 2020

REGR: fix rank algo for read-only data (pandas-dev#37439)

1a08ab0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

REGR: fix rank algo for read-only data #37439

REGR: fix rank algo for read-only data #37439

Uh oh!

jorisvandenbossche commented Oct 27, 2020

Uh oh!

jreback commented Oct 27, 2020

Uh oh!

jbrockmendel commented Oct 27, 2020

Uh oh!

jorisvandenbossche commented Oct 27, 2020

Uh oh!

jreback commented Oct 27, 2020

Uh oh!

jreback commented Oct 28, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

REGR: fix rank algo for read-only data #37439

REGR: fix rank algo for read-only data #37439

Uh oh!

Conversation

jorisvandenbossche commented Oct 27, 2020

Uh oh!

jreback commented Oct 27, 2020

Uh oh!

jbrockmendel commented Oct 27, 2020

Uh oh!

jorisvandenbossche commented Oct 27, 2020

Uh oh!

jreback commented Oct 27, 2020

Uh oh!

jreback commented Oct 28, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants