Skip to content
View sebastianbreguel's full-sized avatar
  • Pontificia Universidad Católica de Chile
  • Santiago, Chile

Highlights

  • Pro

Block or report sebastianbreguel

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for building ConceptNet from raw data.

Roff 2,834 354 Updated Jan 19, 2023

The HELMET Benchmark

Jupyter Notebook 127 20 Updated Apr 10, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,771 204 Updated Feb 25, 2025

When Philosophy meets AI Agents

Python 115 17 Updated Apr 10, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,614 2,784 Updated Apr 12, 2025

📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.

HTML 709 74 Updated Mar 10, 2025

A lightweight, powerful framework for multi-agent workflows

Python 8,518 1,051 Updated Apr 12, 2025

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

HTML 14,487 2,124 Updated Mar 7, 2025

Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)

Python 201 14 Updated Apr 11, 2025

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

Python 845 86 Updated Aug 20, 2024

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,935 212 Updated Mar 7, 2025

A simple, easy-to-hack GraphRAG implementation

Python 2,774 279 Updated Apr 12, 2025

[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 2,200 177 Updated Apr 8, 2025

Chilean Humor Database

Jupyter Notebook 10 1 Updated Jun 25, 2024
Python 868 110 Updated Oct 26, 2024

A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!

Python 5,817 212 Updated Oct 26, 2024

MLX: An array framework for Apple silicon

C++ 20,159 1,169 Updated Apr 12, 2025

Comprehensive guide to learn RAG from basics to advanced.

Jupyter Notebook 900 258 Updated Mar 29, 2025

Named Entity Recognition using Claude Citations

Python 70 5 Updated Mar 9, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,347 176 Updated Apr 10, 2025

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

3,050 671 Updated Aug 5, 2024

Gemma open-weight LLM library, from Google DeepMind

Jupyter Notebook 3,156 431 Updated Apr 12, 2025

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 6,649 802 Updated Apr 9, 2025

Evaluating Creative Short Story Generation in Humans and Large Language Models

Jupyter Notebook 3 Updated Mar 20, 2025

A python wrapper for Tavily search API

Python 606 68 Updated Apr 8, 2025

A curated list of 120+ LLM libraries category wise.

3,211 521 Updated Mar 21, 2025

Get your documents ready for gen AI

Python 26,956 1,620 Updated Apr 11, 2025

Neural question generation using transformers

Jupyter Notebook 1,127 349 Updated Apr 5, 2024

tiktoken is an open-source tokeniser for OpenAI, and TiktokenCpp is a C++ported version. TiktokenCpp using modern C++ language features and providing interface functions that are similar to Tiktoke…

C++ 7 Updated Jun 6, 2024
Jupyter Notebook 2,823 383 Updated Mar 21, 2025
Next