Skip to content

Improve Memory Efficiency with CDF contests #967

@yezr

Description

@yezr

While researching memory usage for #962 I created 1 million ballot tests for all providers. The goal with these tests was to feel confident that RCTab can run a 1 million ballot contests for any provider, given our suggested minimum hardware guidelines of 16GB of memory. Given Java defaults, RCTab sets aside ~4GB of heap space when running on machines with 16GB of memory.

During 1 million ballot testing, the CDF provider was consistently running close to the 4GB memory usage. Can we look into the CommonDataFormatReader and see if there are any opportunities to be more efficient with our memory usage?

I copy/pasted within one of our shared Unisyn CVRs at test_data/unisyn_cvrs/101-RCV-CVT_TEST_3_UNIOVS2103-2.xml to get 50k votes within one file. That 50k vote file is 536MB and can be downloaded here. I made twenty copies of that on my machine to get to 1,176,000 total votes for this test.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

Status

Todo

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions