Encoding baseline duration exceeds memory limit #147

@james-tanner

I've been trying to use encode_baseline to get baseline word durations inside a SPADE script. The current code:

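from polyglotdb import CorpusContext

# config is the corpus configuration set up earlier in the SPADE script
with CorpusContext(config) as c:
    # only encode the baseline if it is not already stored on word tokens
    if not c.hierarchy.has_token_property('word', 'baseline'):
        print('getting baseline word duration')
        c.encode_baseline('word', 'duration')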

This works fine on smaller corpora (like ICE-Can or Modern RP), but it exceeds the memory limit (even on Roquefort) for corpora of SOTC size and larger.
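
For context, here is a toy sketch of what I understand the measure to compute: the expected duration of a word given the mean duration of each phone it contains. This is plain Python over made-up data, not PolyglotDB's implementation; `phone_means`, `baseline_duration`, and the `words` list are hypothetical names used only for illustration. It keeps only one running sum and count per phone label, so its memory use depends on the phone inventory rather than the corpus size.

```python
from collections import defaultdict

def phone_means(words):
    """One pass over the data: running sum and count per phone label."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for phones in words:
        for label, duration in phones:
            totals[label] += duration
            counts[label] += 1
    return {label: totals[label] / counts[label] for label in totals}

def baseline_duration(phones, means):
    """Expected word duration: sum of the mean durations of its phones."""
    return sum(means[label] for label, _ in phones)

# Hypothetical toy data: each word is a list of (phone label, duration in seconds)
words = [
    [("k", 0.08), ("ae", 0.12), ("t", 0.06)],
    [("t", 0.10), ("ae", 0.16)],
]

means = phone_means(words)
baselines = [baseline_duration(word, means) for word in words]
```

Whether encode_baseline could be restructured along these lines for large corpora is a guess on my part.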
