Training Large Language Models to Reason in a Continuous Latent Space

Python 781 60 Updated Jan 24, 2025

Code for "QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs" (accepted at COLING 2025)

Jupyter Notebook 2 Updated Jan 14, 2025

Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

Python 43 1 Updated Jul 10, 2024
Jupyter Notebook 27 5 Updated Jul 16, 2023
Python 4 Updated Oct 5, 2024

Composable building blocks to build Llama Apps

Python 7,127 810 Updated Feb 4, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 92 7 Updated Nov 19, 2024

Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!

HTML 280 19 Updated Oct 10, 2024

KokoMind: Can LLMs Understand Social Interactions?

JavaScript 106 7 Updated Oct 3, 2023
Python 13 Updated Oct 24, 2022

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

Python 23 1 Updated Dec 10, 2024

Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)

Python 11 Updated Feb 15, 2024

[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers

Python 120 6 Updated Dec 17, 2024

The code used to train and run inference with the ColPali architecture.

Python 1,433 124 Updated Jan 29, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 549 43 Updated Dec 16, 2024

[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

Jupyter Notebook 106 9 Updated Sep 12, 2024

Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks

Python 84 10 Updated Aug 11, 2023

Python package for measuring memorization in LLMs.

Jupyter Notebook 139 25 Updated Nov 22, 2024
Python 15 Updated Oct 13, 2024
Python 40 6 Updated Apr 30, 2024

The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations> @ ACL 2024

Python 15 3 Updated Aug 7, 2024

A library for generative social simulation

Python 759 169 Updated Feb 3, 2025

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Python 92 2 Updated Aug 18, 2023

Governance of the Commons Simulation (GovSim)

Python 32 10 Updated Jan 19, 2025
Python 1 Updated Oct 24, 2024

ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate

Python 391 55 Updated Oct 3, 2023