Skip to content
View BenItBuhner's full-sized avatar

Highlights

  • Pro

Block or report BenItBuhner

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BenItBuhner/README.md

Bennett Buhner: Beneficial AI Engineering

I love building with AI, and I especially love to scale harnesses and evals for agents. I have made numerous tools and environments before, all with varying use-cases and offerings. From RL environments (for LM certainty) all the way to powerful tools like an open-source Notion/Linear/Calendar alternative with a powerful agent that integrates with it and all your apps, I have significant experience when handling models, refining their harnesses to work as best as possible, and making unique frameworks best optimized for token efficiency, parallelism, and reliability.

If you want to see more, you can also go to my portfolio and check out my various blogs and project highlights.

What I Am Working On

Noetic: An AI-Powered OSS Alternative to Notion, Linear, Todoist, and More (Next.js, TypeScript)

This is one of my favorite developments (and is still a WIP), and I use it myself every single day to manage notes, tasks, and my calendar, all with the power of Noetic AI at the forefront. It offers:

  • Open-source code: It can be ran locally, contributed to by others, and more.
  • The best unification of tools ever: Noetic combines every single tracking/saving tool that millions use daily—Notion, Todoist, Linear, Google Calendar, ChatGPT, and more. It takes the place of all of them and then some, perfectly unifying and melding them into one seamless ecosystem.
  • Powerful AI agency: The agent harness has been rigorously designed and tested to maximize search coverage, speed, and interoperability between features within Noetic as well as remote tools and MCP servers, together.
  • And more: This is super powerful and offers nearly everything else offers, even going as far as including workspaces. You can invite users into your workspace, letting you contribute all simultaneously.

Athena Claude Code: Offering Frontier Long-Horizon Development Workloads in the Cloud (FastAPI, Python)

This is still a WIP, but this is one of my more complex projects. It offers a unique multi-agent orchastration system, that isolates important steps of the development workflow:

  1. Planning: Taking the task at hand into account, reading the codebase rigorously, and drafting an in-depth plan once certain and ready.
  2. Implementation: Implementing the plan created, enabling a dedicated/structured implementation approach.
  3. Testing and debugging: Once done, it is time for more rigorous testing to account for all edge-cases and issues not previously caught. If/when any are found, they will be debugged to find the root causes; if they are found, they will be passed back onto the beginning of the loop, letting the cycle continue until all requirements are satisfied.

The goal of this is to ensure maximal hierarhy, letting each agent perform its task(s). This can work for hours, and when done, it can file a PR for you; followups to Athena Cloud Code can enforce it to iterate further as well if need be.

Model Proxy: Enpowering People to Use Any Model Anywhere, Fallbacks Galore (FastAPI, Python)

This is one of my more exciting little endeavors. I always wanted to use Claude Code with any model, and other "proxies" were either crap, or lacked the features I wanted most. This though, offers everything I ever could've asked for:

  • Both Anthropic and OpenAI Completion formats.
  • API key, provider, and model-level fallbacks, reducing error-rates drastically.
  • Proper tool-calling, mutli-media support, and streaming functionality.

The coolest part as well, was that CrofAI, which has tens of thousands of users, deployed my Anthropic method in beta; thousands of users use this endpoint in production for the likes of Claude Code today, and it has shown to scale well thus far.

Agent Chassis: A Simple Agentic Chassis For Anyone to Build On Top Of (FastAPI, Python)

This was a small project I worked on, and it essentially offers a simple and modular FastAPI framework, enabling anyone to create custom agent frameworks; people can use this (just as I have for Athena Cloud Code), and create their own agents with server-side persistence. It offers auth pipelines, database logic, server-side and client-side chatting/tooling methods, and much more.

Skills & Technologies

  • Languages: Python, Typescript/JavaScript, Tailwind/CSS
  • Frameworks: FastAPI, Django, Next.js, React, Node.js, Bun
  • Platforms: Git/GitHub, Docker, Vercel, Railway, Google Cloud

Strengths

Backend

  • FastAPI and Next apps
  • API design and interfacing
  • Creating long-running agentic systems

Agentic Engineering & Environment Design

  • Creating agentic tools and interfaces for both local and production usage
  • Developing, testing, and scaling RL/eval environments for language models
  • Improving agentic flow, struction, and concision throughout long-running and complex tasks

Pinned Loading

  1. Agent-Chassis Agent-Chassis Public

    A powerful yet malleable foundation for quickly and easily making agents and custom frameworks around them. Out-of-the-box, it comes with MCP and local tool support and ultra-modular classes, allow…

    Python

  2. Model-Proxy Model-Proxy Public

    A model proxy to allow for multiple API key, provider, and model fallbacks, that also translates OpenAI methods to Anthropic, enabling support for the likes of Claude Code and other tools with ease.

    Python 1

  3. Portfolio-SIte Portfolio-SIte Public

    My (work in progress) portfolio site :)

    TypeScript

  4. Insta-AI-Correct Insta-AI-Correct Public

    A rapid auto-correct AutoHotKey script that instantly corrects any punctuation, grammatical, and typo errors with the power of Groq.

    AutoHotkey