ActosVoice

React library for voice applications with LLM and Tool Calling — 100% client-side.

🎯 Overview

ActosVoice is a modular library that combines:

ASR (Speech-to-Text) — Injectable voice recognition
LLM (Large Language Model) — Natural language processing on the client
Tool Calling — Tool pattern inspired by OpenAI/Ollama/Gemini

┌─────────────────────────────────────────────────────────┐
│                    ActosVoice Library                     │
├─────────────────────────────────────────────────────────┤
│                                                         │
│   🎤 ASR                   🧠 LLM                       │
│   ├── webSpeech()          ├── webLLM()                 │
│   ├── whisper()            ├── ollama()                 │
│   └── deepgram()           └── openai()                 │
│                                                         │
│   🛠️ Tools                                              │
│   ├── Built-in tools                                    │
│   └── Custom tools                                      │
│                                                         │
└─────────────────────────────────────────────────────────┘

📦 Installation

npm install @actos-voice/react

🚀 Quick Start

import { ActosVoice, useVoiceAgent } from '@actos-voice/react';
import { webSpeech } from '@actos-voice/asr-webspeech';
import { webLLM } from '@actos-voice/llm-webllm';

const tools = [
  {
    name: 'change_color',
    description: 'Changes the application background color',
    parameters: {
      type: 'object',
      properties: {
        color: { type: 'string', description: 'Color name' }
      },
      required: ['color']
    },
    execute: (args) => {
      document.body.style.backgroundColor = args.color;
      return { success: true };
    }
  }
];

function App() {
  return (
    <ActosVoice
      asr={webSpeech({ language: 'en-US' })}
      llm={webLLM({ model: 'Llama-3.2-1B-Instruct-q4f16_1-MLC' })}
      tools={tools}
    >
      <VoiceInterface />
    </ActosVoice>
  );
}

📚 Documentation

Architecture — How the library works
ASR Providers — Voice recognition providers
LLM Providers — LLM providers
Tool Calling — Tool definition and usage
Configuration — Configuration options
Examples — Use cases

🔧 Requirements

React 18+
Browser with WebGPU (Chrome 113+, Edge 113+) for client-side LLM
Microphone for ASR

📄 License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.changeset		.changeset
demo		demo
docs		docs
packages		packages
.gitignore		.gitignore
.node-version		.node-version
LICENSE		LICENSE
package-lock.json		package-lock.json
package.json		package.json
railway.json		railway.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ActosVoice

🎯 Overview

📦 Installation

🚀 Quick Start

📚 Documentation

🔧 Requirements

📄 License

About

Uh oh!

Releases

Packages

Languages

License

patrick-mns/actos-voice

Folders and files

Latest commit

History

Repository files navigation

ActosVoice

🎯 Overview

📦 Installation

🚀 Quick Start

📚 Documentation

🔧 Requirements

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages