Head-to-head comparison based on real production benchmarks with Gulf Arabic callers.
Speechmatics: Ultra-fast Arabic STT with poor transcription quality.
ElevenLabs Scribe v2: ElevenLabs' realtime STT offering, with poor quality and slow responses for Arabic.
Speechmatics in production: Users had to repeat themselves frequently. Quality was unacceptable for production use.
ElevenLabs Scribe v2 in production: Testers rated the quality as very poor. Not viable for Arabic.

| Feature | Speechmatics | ElevenLabs Scribe v2 |
|---|---|---|
| Real-time streaming transcription | ✓ | ✓ |
| Configurable endpointing | ✓ | ✗ |
| Standard and enhanced operating points | ✓ | ✗ |
| Custom dictionary | ✓ | ✗ |
| Multiple language support | ✗ | ✓ |
| LiveKit inference integration | ✗ | ✓ |
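
The Speechmatics-only rows above (configurable endpointing, operating points, custom dictionary) are normally set in the session's start message. The following is a minimal sketch of such a message, not a verified integration: the field names (`operating_point`, `additional_vocab`, `conversation_config`, `end_of_utterance_silence_trigger`) are recalled from Speechmatics' real-time API documentation and should be checked against the current reference, and the helper name, default values, and vocabulary entry are hypothetical.

```python
import json


def build_start_recognition(language="ar", operating_point="enhanced",
                            silence_trigger_s=0.5, vocab=None):
    """Build a StartRecognition-style payload (hypothetical helper).

    All field names are assumptions to verify against the provider docs.
    """
    if operating_point not in ("standard", "enhanced"):
        raise ValueError("operating_point must be 'standard' or 'enhanced'")

    transcription_config = {
        "language": language,
        "operating_point": operating_point,       # standard vs enhanced model
        "enable_partials": True,                   # stream interim results
        "conversation_config": {
            # Endpointing: seconds of silence before an utterance is closed
            "end_of_utterance_silence_trigger": silence_trigger_s,
        },
    }
    if vocab:
        # Custom dictionary: one entry per domain-specific term
        transcription_config["additional_vocab"] = [{"content": w} for w in vocab]

    return {
        "message": "StartRecognition",
        "audio_format": {"type": "raw", "encoding": "pcm_s16le", "sample_rate": 16000},
        "transcription_config": transcription_config,
    }


if __name__ == "__main__":
    # "ExampleBrand" is a placeholder vocabulary term
    print(json.dumps(build_start_recognition(vocab=["ExampleBrand"]), indent=2))
```

A shorter silence trigger closes utterances sooner and lowers perceived delay, at the risk of cutting off callers who pause mid-sentence.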

| Capability | Speechmatics | ElevenLabs Scribe v2 |
|---|---|---|
| Streaming support | ✓ | ✓ |
| LiveKit plugin | ✗ | ✓ |
| Self-hostable | ✓ | ✗ |
| API style | WebSocket streaming + REST | WebSocket streaming |
| SDKs | Python, Node.js | Python, Node.js |

Speechmatics: Very fast, but Arabic quality is too poor for production. The speed advantage is meaningless when users have to repeat themselves.
ElevenLabs Scribe v2: Poor quality and high latency for Arabic. Not recommended for any Arabic STT use case.
Speechmatics is faster, with an average end-of-utterance delay of 460 ms. That is 1540 ms lower than ElevenLabs Scribe v2, which implies an average delay of about 2000 ms for ElevenLabs.
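As a quick check of the latency figures quoted above, the arithmetic can be sketched as follows. Only the 460 ms delay and the 1540 ms gap come from the comparison; the variable names are illustrative.

```python
# Figures from the comparison
speechmatics_delay_ms = 460   # average end-of-utterance delay
gap_ms = 1540                 # how much slower ElevenLabs Scribe v2 is

# Implied ElevenLabs Scribe v2 delay and the relative slowdown
elevenlabs_delay_ms = speechmatics_delay_ms + gap_ms
slowdown = elevenlabs_delay_ms / speechmatics_delay_ms

print(f"ElevenLabs Scribe v2: {elevenlabs_delay_ms} ms")
print(f"Slowdown vs Speechmatics: {slowdown:.2f}x")
```

In other words, ElevenLabs Scribe v2 takes a little over four times as long to finalize an utterance.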
Speechmatics has a quality rating of 1/5 (Poor). Users had to repeat themselves frequently, and quality was unacceptable for production use.
Neither provider is a viable option for production Arabic STT. Speechmatics is very fast, but its Arabic quality is too poor for production, and the speed advantage is meaningless when users have to repeat themselves. ElevenLabs Scribe v2 has both poor quality and high latency for Arabic and is not recommended for any Arabic STT use case.
Speechmatics starts at $0.0042 per minute for real-time streaming. ElevenLabs Scribe v2 starts at $5 per month, which includes STT credits.
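Because one price is per minute and the other is a flat monthly fee, the two are easier to compare at a given usage level. The sketch below uses only the two entry prices quoted above; the function names are hypothetical, and the number of STT minutes included in the ElevenLabs plan is not stated here, so the result only shows how much Speechmatics usage the same $5 would buy.

```python
SPEECHMATICS_USD_PER_MIN = 0.0042   # real-time streaming entry price
ELEVENLABS_USD_PER_MONTH = 5.0      # entry plan, includes STT credits


def speechmatics_monthly_cost(minutes: float) -> float:
    """Pay-as-you-go cost in USD for a month's transcribed minutes."""
    return minutes * SPEECHMATICS_USD_PER_MIN


def minutes_for_flat_fee() -> float:
    """Speechmatics minutes that cost the same as the ElevenLabs entry plan."""
    return ELEVENLABS_USD_PER_MONTH / SPEECHMATICS_USD_PER_MIN


if __name__ == "__main__":
    print(f"1000 min on Speechmatics: ${speechmatics_monthly_cost(1000):.2f}")
    print(f"$5 buys about {minutes_for_flat_fee():.0f} Speechmatics minutes")
```

At these entry prices, $5 covers roughly 1190 minutes (just under 20 hours) of Speechmatics real-time transcription per month.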