Fix t-tests when variance is zero #621

ivirshup · 2019-04-25T10:00:58Z

Just a quick fix, should probably replace most of this code with scipy.stats.ttest_ind/ scipy.stats.stats.ttest_ind_from_stats with equal_var=False.

ivirshup · 2019-04-25T10:02:05Z

#620

ivirshup · 2019-04-25T10:06:23Z

Someone should review this before it's merged. I think this won't cause any problems, but I'm also not too familiar with the DE code.

LuckyMD · 2019-04-26T09:27:46Z

I wonder why this wasn't done in the first place. Is scipy not already a dependency of scanpy? Or is this slower than the initial implementation?

falexwolf · 2019-04-26T11:06:44Z

Looks great! I wasn't aware of this high-dimensional version of a t-test in Scipy, which seems to be as efficient as the current implementation. I only investigated thoroughly for Wilcoxon rank and found that Scipy doesn't have a scalable version to offer.

But yes, this will get merged after 1.4.1.

falexwolf · 2019-04-26T11:37:14Z

1.4.1 is out, are we sure this is as scalable as it was before and not a backwards breaking change. If yes, we can merge immediately.

ivirshup · 2019-04-27T06:44:30Z

I'm not completely sure this doesn't break anything, but the regression tests pass. The internal code is very similar, so I'm not too worried about these changes.

It does look like it's (very) slightly slower. Running this a thousand times for pbmc68k dataset took ~2.3% longer (about 1.4 ms per run) than the previous version. That said, we're very inefficient about mean and variance calculation, so I think that's a better place to optimize.

Edit: I've force pushed to fix some minor formatting issues (trailing white space, blank line, typo) that I didn't think deserved it's own commit.

falexwolf · 2019-04-28T18:44:50Z

Great! 2.3% is nothing...

Fix t-tests when variance is zero

Don't call absolute certainty not true

d29467a

falexwolf mentioned this pull request Apr 26, 2019

TODO: Backwards-compat breaking changes #453

Open

15 tasks

Simplify t-tests by calling to scipy

c681343

ivirshup force-pushed the de_quick_fix branch from d1c4347 to c681343 Compare April 27, 2019 06:47

falexwolf merged commit 0712168 into scverse:master Apr 28, 2019

awnimo pushed a commit to dpeerlab/scanpy that referenced this pull request Dec 17, 2019

Merge pull request scverse#621 from ivirshup/de_quick_fix

1076a15

Fix t-tests when variance is zero

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix t-tests when variance is zero #621

Fix t-tests when variance is zero #621

Uh oh!

ivirshup commented Apr 25, 2019

Uh oh!

ivirshup commented Apr 25, 2019

Uh oh!

ivirshup commented Apr 25, 2019

Uh oh!

LuckyMD commented Apr 26, 2019

Uh oh!

falexwolf commented Apr 26, 2019

Uh oh!

falexwolf commented Apr 26, 2019

Uh oh!

ivirshup commented Apr 27, 2019 •

edited

Loading

Uh oh!

falexwolf commented Apr 28, 2019

Uh oh!

Uh oh!

Fix t-tests when variance is zero #621

Fix t-tests when variance is zero #621

Uh oh!

Conversation

ivirshup commented Apr 25, 2019

Uh oh!

ivirshup commented Apr 25, 2019

Uh oh!

ivirshup commented Apr 25, 2019

Uh oh!

LuckyMD commented Apr 26, 2019

Uh oh!

falexwolf commented Apr 26, 2019

Uh oh!

falexwolf commented Apr 26, 2019

Uh oh!

ivirshup commented Apr 27, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

falexwolf commented Apr 28, 2019

Uh oh!

Uh oh!

ivirshup commented Apr 27, 2019 •

edited

Loading