-
Notifications
You must be signed in to change notification settings - Fork 0
Model loading 2 #4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Single File Benchmark ResultsTest File: ES2004a (1049.4s audio) Accuracy Metrics
⏱️ Performance Timing
Inference Time: 44.734s (96.2% of total) Research Comparison:
|
VAD Benchmark ResultsPerformance Comparison
Industry Leaders:
|
VAD Benchmark ResultsPerformance Comparison
Industry Leaders:
|
VAD Benchmark ResultsPerformance Comparison
Industry Leaders:
|
VAD Benchmark Results❌ Test FailedNo benchmark results were generated. Check the logs for details. |
1 similar comment
VAD Benchmark Results❌ Test FailedNo benchmark results were generated. Check the logs for details. |
VAD Benchmark ResultsPerformance Comparison
Industry Leaders:
|
Diarization Benchmark Results
Performance Timing
Research Comparison
ES2004a • 1049.4s audio • 47.2s inference • Test runtime: 1m 10s • 07/24/2025, 02:43 PM EST |
VAD Benchmark ResultsPerformance Comparison
Industry Leaders:
📊 Detailed Research Comparisons
Note: Direct comparisons should consider dataset differences. MUSAN contains challenging noise conditions. |
ASR Benchmark Results
100 files per dataset • Test runtime: 13m15s • 07/24/2025, 02:55 PM EST RTFx = Real-Time Factor (higher is better) • Calculated as: Total audio duration ÷ Total processing time Note: CI RTFx degraded by M1/M2 Mac virtualization. M1 Mac test: ~28x (clean), ~25x (other). Testing per HuggingFace Open ASR Leaderboard. |
No description provided.