20240610 developer call notes #1224
tomwhite started this conversation in Meeting Notes
20240610
Pre-notes
PRs
Issues
Discussions
Notes
Attendees
Discussion
JK: Paper is ready to go! Waiting for sign-off from Tim’s org.
What next?
JH: Good time to step back and reevaluate sgkit, e.g. build a QC pipeline from scratch that works well. E.g. use JAX instead of Numba. Also Cubed.
TW: There is interest in Cubed on HPC too: cubed-dev/cubed#467
JK: Implement part of bcftools view on VCF Zarr. Would help large biobanks a lot.
JH: Look at top of funnel process. We have a large API surface - may be a problem, so good to re-start sgkit from new codebase.
JH: Local alleles?
JK: Helpful in general, but not for GeL data
JK: Add more converters to bio2zarr. PLINK (¾ done). BGEN too.
TW: Move VCF writing to another repo?
JK: Good question - depends what we all think.
JK: I’d like to work on the bio2zarr CLI, but am very overcommitted. Hope more people show up.
TW: I’d like to move Hypothesis VCF to its own repo and Python package.
JK: Does Hypothesis API change much?
RW: No, it is very stable.
Zarr integrity
JH: Does TensorStore have a concept of transactions we could use?
TW: Arraylake is another approach.
New tech
JH: New array store: https://github.com/spiraldb/vortex
EC: Any experience of Mojo?
JH: Focus on PyTorch and JAX…