-
Notifications
You must be signed in to change notification settings - Fork 54
Closed
Description
more of a docs/reminder rather than an issue
- While trying to split
call-dotsinto modular steps, returned toscroing_stepby @nvictus incall-dots.
Testing on "big"-data yielded somewhat familiarmultiprocessing/pickleerror:
multiprocess.pool.MaybeEncodingError: Error sending result: '[
...
[dump of ~500-1,000 pd.DataFrame-s, ~500,000 rows by 30 columns each ]
...
Reason: 'error("'i' format requires -2147483648 <= number <= 2147483647",)'- For the same input parameters it breaks here: https://github.com/mirnylab/cooltools/blob/441a84ab6c1efd3bcd29de6cfd6ee78551873478/cooltools/cli/call_dots.py#L302 , but not here: https://github.com/mirnylab/cooltools/blob/441a84ab6c1efd3bcd29de6cfd6ee78551873478/cooltools/cli/call_dots.py#L404 , as these objects are slices of histograms and are way smaller than
500,000X30 - @Phlya observed same or similar issue even for https://github.com/mirnylab/cooltools/blob/441a84ab6c1efd3bcd29de6cfd6ee78551873478/cooltools/cli/call_dots.py#L404 or https://github.com/mirnylab/cooltools/blob/441a84ab6c1efd3bcd29de6cfd6ee78551873478/cooltools/cli/call_dots.py#L482 while running "modern"
call-dotsinstance that didn't use @nvictus -'sscoring_step. I could not find a corresponding issue anywhere. - apparently
pickleis calculating total number of elements - looks like it doescolumns*rowsby the number of dataframes, otherwise math does not work out (>=2147483647). Is it indeed the case @nvictus @mimakaev @golobor ? what if it were to be a bunch ofstring-s of total length >2bln ?! https://stackoverflow.com/questions/47776486/python-struct-error-i-format-requires-2147483648-number-2147483647 - indeed says something about calculating elements in each objects ... - I'll work around this BS for testing etc purposes but we should address it eventually -
dask? or at leastmultipro-something that is usingdill- @nvictus ? - Also, looks like it was finally fixed bpo-17560: Too small type for struct.pack/unpack in mutliprocessing.Connection python/cpython#10305 ?! can anyone more knowledgeable confirm @nvictus @mimakaev @golobor ?
Metadata
Metadata
Assignees
Labels
No labels