Skip to content
View mookiezi's full-sized avatar
💭
Thinking about thinking
💭
Thinking about thinking

Block or report mookiezi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. interface interface Public

    A Python-based interactive CLI interface for chatting with Hugging Face language models, optimized for casual, Discord-style conversation using ChatML. Supports both quantized and full-precision mo…

    Python 2

  2. dataset-cleaning-toolkit dataset-cleaning-toolkit Public

    A dataset toolbox for preparing and analyzing conversational datasets, including CSV splitting, CSV → Parquet conversion, dataset statistics, Parquet cleaning and sorting, HuggingFace–style metadat…

    Python 1

  3. dataset-pipeline dataset-pipeline Public

    A full Discord dataset pipeline with end-to-end flow from raw Discord data to final Parquet dataset with full statistics — every stage independant, idempotent, and CLI-driven for ease of automation.

    1

  4. dataset-toolbox dataset-toolbox Public

    A dataset toolbox for preparing and analyzing conversational datasets, including CSV splitting, CSV → Parquet conversion, dataset statistics, dialogue-turn filtering, turn-based filtering, token an…

    Python