Looking for collaborators #252
Replies: 6 comments 15 replies
-
I am interested in part #1, "Modernize the build using pyproject setup and uv". I will create an issue for this.
-
Hi @ekzhu, I’d love to help out with some CI improvements and get familiar with the project. Once I’m up to speed, I can contribute to other areas as well. Looking forward to collaborating!
-
Hi @ekzhu, to the best of my knowledge this is my first contribution ever on GitHub.
-
Hey @ekzhu, GPU-based algorithms sound exciting. I would love to contribute to them.
-
Hi Ekzhu, I’d love to start by benchmarking the performance of different storage backends and exploring possible optimizations. I’ll create an issue to discuss my approach and initial findings. This is going to be my first ever open source contribution too.
-
@ekzhu I have started work on a new benchmark framework in this PR. I would love to get some feedback on the initial version, which handles fetching data and caching it to NumPy memory-mapped arrays. I have detailed the next steps and the framework I am following.
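For readers unfamiliar with the caching idea mentioned above, here is a minimal sketch of what "fetch once, then cache to a memory-mapped array" can look like. This is not the code from the PR: `load_or_build_cache` and `build_fn` are hypothetical names made up for illustration, and the only library calls assumed are NumPy's `np.lib.format.open_memmap` and `np.load(..., mmap_mode="r")`.

```python
from pathlib import Path

import numpy as np


def load_or_build_cache(cache_path, shape, dtype, build_fn):
    """Return a read-only memory-mapped array, building the cache on first use.

    `build_fn` is a hypothetical callable that returns the whole dataset as an
    array of the given shape and dtype (e.g. after downloading and parsing it).
    """
    cache_path = Path(cache_path)
    if not cache_path.exists():
        # First call: create a writable .npy-format memmap and fill it.
        out = np.lib.format.open_memmap(
            cache_path, mode="w+", dtype=dtype, shape=shape
        )
        out[:] = build_fn()
        out.flush()  # make sure the data reaches the file
        del out      # drop the writable mapping before reopening read-only
    # Later calls (and the first one too) map the file instead of loading it
    # all into memory; pages are read lazily as they are accessed.
    return np.load(cache_path, mmap_mode="r")
```

Calling it twice with the same path should run `build_fn` only once; the second call just maps the existing file. A real benchmark harness would probably also want to write to a temporary file and rename it, so that an interrupted build does not leave a partial cache behind.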
-
I am looking for open-source collaborators for the datasketch project.
I created datasketch 10 years ago to open-source the code I wrote for my PhD work in dataset search. Many great folks helped build this with me. You can find their contributions.
Now, it is downloaded 4.5M times per month on PyPI, driving about 1% of all NumPy downloads. At this point, datasketch is a foundational building block of the Python ecosystem.
Today, I am spending most of my time building AI agent systems, so I need help modernizing this project. There are several things I would love to do but need help with:
experimental.
The goal is to make datasketch more efficient and scalable, while keeping the API stable.
Please don't direct-message me. Instead, comment on this thread and create an issue to address one of the above topics, and we can discuss it there. I will create a collaborator team and add folks to it as appropriate.
Please, no AI slop. I love using AI coding agents, but I always review AI-generated code carefully and communicate with humans directly. Don't overburden reviewers with massive code changes just because you can now generate code.