Skip to content

Add support for PARQUET file tables #433

@hagenw

Description

@hagenw

In audeering/audformat#419 we introduced storing tables as PARQUET files in audformat.
They differ in two important aspects from tables stored in CSV files:

  • their changes cannot be tracked by audeer.md5(), but by a hash stored in their header
  • they shouldn't be stored in a ZIP file on the server, to make it easier to preview them (and they are already compressed)

Which means, we need to adjust the code to handle them.

As stated in audeering/audformat#419, the hash of those files can be read with:

import pyarrow.parquet as parquet

hash = parquet.read_schema(path).metadata[b"hash"].decode()

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions