Skip to content

Pyarrow is not properly detected after importing ray #3744

Closed
@devin-petersohn

Description

@devin-petersohn

System information

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04): WSL
  • Ray installed from (source or binary): pip
  • Ray version: 0.6.1
  • Python version: 3.6
  • Exact command to reproduce: pandas.DataFrame({"col1":[1,2,3], "col2":[2,4,5]}).to_parquet("test.parquet")

Describe the problem

The string version of the pyarrow shipped with ray is not able to be detected by pandas.

Source code / logs

String version in 0.6.1 is:

In [1]: import ray
im
In [2]: import pyarrow as pa

In [3]: pa.__version__
Out[3]: '0.1.dev3212+gc347cd5'

When trying to use pandas to write a parquet file, it does not detect that a valid pyarrow is installed because it is looking for pyarrow>=0.4. Here is the pandas command that reproduces it:

import ray
import pandas
pandas.DataFrame({"col1":[1,2,3], "col2":[2,4,5]}).to_parquet("test.parquet")

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions