Kasuku

A simple speech transcription web app designed for non-standard speech patterns. The Kasuku webapp helps you convert audio recordings into text with support for Kenyan English and Swahili accents.

🚀 Live Deployment of Demo Webapp

Technology Stack

Category	Technologies
Framework
Language
Infrastructure
Version Control

✨ Features

Audio Recording: Record yourself directly in the browser
Multi-language Support : Transcribe speech in English and Swahili
Real-time Processing: Get instant transcription results
Save & Manage: Keep track of all your transcriptions with timestamps
Search Functionality: Find specific transcriptions by content
Export Options: Copy or save your transcriptions for later use

⚙️ How it works

flowchart TD
   A[🚀 Start Here] --> B[🔐 1. Login]
   
   B --> C[ ✅ 2. Select Language]
   C --> C1[Choose English or Swahili]

   C1 --> D[🎤 3. Record Audio]
   D --> D1[Allow Microphone Access]
   D1 --> D2[Click Microphone Button]
   D2 --> D3[The User Records Themselves]
   D3 --> D4[Click Stop]

   D4 --> E[📝 4. Transcribe]
   E --> E1[Click Transcribe Button]
   E1 --> E2[Wait for Processing]
   E2 --> E3[A Green Card with the Transcribed text Appears with 4 options]

   E3 --> F[💾 5. Manage Transcription]

   F --> F1[Speak Transcription]
   F --> F2[Copy]
   F --> F3[Save]
   F --> F4[Delete Card]

   F1 --> F1a[Select Gender Voice]
   F1a --> F1b[Listen to Transcription]
   F1b --> F

   F2 --> F2a[Text Copied to Clipboard]
   F2a --> F

   F3 --> G[📚 6. View Saved Transcriptions]

   F4 --> D2

   G --> G1[Search Cards]
   G --> G2[Filter by Language]
   G --> G3[View Saved Cards]

   G3 --> H[Saved Card Options]

   H --> H1[Speak]
   H --> H2[Copy]
   H --> H3[Delete]

   H1 --> H1a[Select Language Voice]
   H1a --> H1b[Listen to Card]
   H1b --> H

   H2 --> H2a[Text Copied to Clipboard]
   H2a --> H

   H3 --> H3a[Card Deleted]
   H3a --> G

   style A fill:#ff9800,color:#000,stroke:#e65100,stroke-width:3px
   style B fill:#4caf50,color:#fff,stroke:#2e7d32,stroke-width:2px
   style C fill:#4caf50,color:#fff,stroke:#2e7d32,stroke-width:2px
   style D fill:#4caf50,color:#fff,stroke:#2e7d32,stroke-width:2px
   style E fill:#4caf50,color:#fff,stroke:#2e7d32,stroke-width:2px
   style F fill:#4caf50,color:#fff,stroke:#2e7d32,stroke-width:2px
   style G fill:#4caf50,color:#fff,stroke:#2e7d32,stroke-width:2px

💬 Supported Languages

Currently supports transcription for:

✅ English
✅ Swahili

🎯 Key Benefits

Accessibility: Designed specifically for non-standard speech patterns
User-Friendly: Simple interface that anyone can use
Organized: Keep all your transcriptions in one place with timestamps
Flexible: Bilingual language support for English and Swahili.

📥 Clone This Repository

Follow these steps to set up the project on your local machine:

Open your terminal (or Git Bash).
Navigate to the folder where you want to store the project:
```
cd path/to/your/folder
```

Clone repository

git clone https://github.com/LexMainye/Kasuku-Transcriber

Navigate to the project directory
```
cd your-repo-name
```

Install Dependencies

python -m venv venv
source venv/bin/activate      # On macOS/Linux
venv\Scripts\activate         # On Windows

Install required packages from requirements.txt
```
pip install -r requirements.txt
```
Run the project

After installing the dependencies, run the project locally
```
streamlit run src/app.py
```

🚀 App Deployment on Modal

To iterate the Kasuku streamlit app, you can run it “ephemerally” with modal serve. This will run a local process that watches the files and updates the app if anything changes.
```
modal serve deploy.py
```
To sucessfully deploy the app on modal
```
modal deploy deploy.py
```

For reference, see:

Modal Streamlit Deployment Example

💻 Technical Requirements

Modern web browser with microphone support
Microphone access permissions

Name		Name	Last commit message	Last commit date
Latest commit History 136 Commits
.devcontainer		.devcontainer
.vscode		.vscode
__pycache__		__pycache__
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
deploy.py		deploy.py
gitattributes		gitattributes
prewarm_models.py		prewarm_models.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Kasuku

🚀 Live Deployment of Demo Webapp

Technology Stack

✨ Features

⚙️ How it works

💬 Supported Languages

🎯 Key Benefits

📥 Clone This Repository

🚀 App Deployment on Modal

💻 Technical Requirements

Contact Information

About

Uh oh!

Uh oh!

Languages

License

LexMainye/Kasuku-Transcriber

Folders and files

Latest commit

History

Repository files navigation

Kasuku

🚀 Live Deployment of Demo Webapp

Technology Stack

✨ Features

⚙️ How it works

💬 Supported Languages

🎯 Key Benefits

📥 Clone This Repository

🚀 App Deployment on Modal

💻 Technical Requirements

Contact Information

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages