Stealth-py

epic

Neemesh Yadav Stealth-py


reasoning, fairness, causality, interp | softmaxxing | lcs2, iiitd'24, ugrip'23 mbzuai

55 followers · 28 following

India
https://stealth-py.github.io/
@stealth_py
https://scholar.google.com/citations?user=cq5zLiMAAAAJ&hl=en


Stars

facebookresearch / coconut

Training Large Language Model to Reason in a Continuous Latent Space

Python 781 60 Updated Jan 24, 2025

aflah02 / QUENCH

Code for "QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs" (accepted at COLING 2025)

Jupyter Notebook 2 Updated Jan 14, 2025

dvlab-research / MR-GSM8K

Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

Python 43 1 Updated Jul 10, 2024

cicl-stanford / procedural-evals-tom

Jupyter Notebook 27 5 Updated Jul 16, 2023

neelnanda-io / CoT-Interp

Python 4 Updated Oct 5, 2024

meta-llama / llama-stack

Composable building blocks to build Llama Apps

Python 7,127 810 Updated Feb 4, 2025

kanishkg / strategic-lms-release

Python 2 1 Updated Mar 19, 2023

EleutherAI / nanoGPT-mup

Forked from karpathy/nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 92 7 Updated Nov 19, 2024

openai / automated-interpretability

Python 988 115 Updated Mar 6, 2024

CHATS-lab / persuasive_jailbreaker

Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!

HTML 280 19 Updated Oct 10, 2024

CHATS-lab / KokoMind

KokoMind: Can LLMs Understand Social Interactions?

JavaScript 106 7 Updated Oct 3, 2023

WadeYin9712 / GeoMLAMA

Python 13 Updated Oct 24, 2022

yrf1 / LLM-MassiveMulticultureNormsKnowledge-NCLB

Python 13 Updated Jan 5, 2025

nlee0212 / BLEnD

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

Python 23 1 Updated Dec 10, 2024

cultural-csk / candle

Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)

Python 11 Updated Feb 15, 2024

zjunlp / KnowledgeCircuits

[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers

Python 120 6 Updated Dec 17, 2024

illuin-tech / colpali

The code used to train and run inference with the ColPali architecture.

Python 1,433 124 Updated Jan 29, 2025

allenai / OLMoE

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 549 43 Updated Dec 16, 2024

i-gallegos / Fair-LLM-Benchmark

Python 120 8 Updated Sep 12, 2023

princeton-nlp / MQuAKE

[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

Jupyter Notebook 106 9 Updated Sep 12, 2024

csitfun / LogiQA2.0

Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks

Python 84 10 Updated Aug 11, 2023

iamgroot42 / mimir

Python package for measuring memorization in LLMs.

Jupyter Notebook 139 25 Updated Nov 22, 2024

3DAgentWorld / LLM-Game-Agent

Python 15 Updated Oct 13, 2024

SALT-NLP / CultureBank

Python 40 6 Updated Apr 30, 2024

YisongMiao / DiSQ-Score

The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations> @ ACL 2024

Python 15 3 Updated Aug 7, 2024

google-deepmind / concordia

A library for generative social simulation

Python 759 169 Updated Feb 3, 2025

nlp-uoregon / Okapi

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Python 92 2 Updated Aug 18, 2023

giorgiopiatti / GovSim

Governance of the Commons Simulation (GovSim)

Python 32 10 Updated Jan 19, 2025

LCS2-IIITD / TOXBART

Python 1 Updated Oct 24, 2024

composable-models / llm_multiagent_debate

ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate

Python 391 55 Updated Oct 3, 2023