Coming Soon
LAUNCH SPECIAL • 50% OFF WITH CODE: EARLYBIRD

Private
Real-Time
Voice
Transcription.

No APIs. Zero Cloud. 100% Local.

Introducing VoxBar. A customizable UI for private, offline transcription—built for privacy, speed, accuracy, and control.

From vibe-coding and drafting emails, to capturing lectures and Teams or Zoom meetings via mic or system audio, VoxBar Pro transcribes in real time so your words become usable text instantly.

Explore our top-tier offline engines today, and experience what it's like to talk your ideas into reality.

Powered by Voxtral (4B) No API Keys Needed Zero Training — Works Out of the Box
🍎 Now Available on Apple Silicon

VoxBar Pro All our top models now run on Mac

The same S-tier Voxtral engine, now accelerated by Apple Metal GPU. No Docker. No cloud. No subscriptions. Just double-click and talk.

Metal GPU Acceleration

Native Apple Silicon performance. M1, M2, M3, M4 — all supported.

🚫

No Docker. No Hassle.

On Mac, VoxBar Pro runs without Docker. Pure native C engine.

🖱️

One-Click Install

Double-click the launcher. It builds the engine, downloads the model, and launches — all automatically.

🍎 View All Mac-Compatible Models →

Requires Apple Silicon Mac (M1+) with 8GB+ RAM. 16GB recommended.

Choose Your Windows PC Engine ▶ NVIDIA / AMD

Optimized for Windows. Every tier runs 100% locally. No cloud. No subscriptions. One-time purchase, lifetime license.

Performance

VoxBar GLM

GLM-ASR Nano 1.5B • 17 Languages • 4GB VRAM

$39

one-time

🔥 LAUNCH: $19.50 with code EARLYBIRD

  • 8.0 combined — multilingual accuracy
  • 17 languages — EN, ZH, JA, KO + more
  • Generative model — LLM-based ASR
  • Only 4GB — light GPU footprint
  • NVIDIA GPU — 4GB VRAM minimum
  • No Docker — native Python engine
  • 100% offline — your data never leaves

VRAM: ~4GB • NVIDIA only • 17 languages

Get GLM — $39

Performance

VoxBar Nemotron

Nemotron 0.6B • English • 4.8GB VRAM

$39

one-time

🔥 LAUNCH: $19.50 with code EARLYBIRD

  • 8.35 combined — top of mid tier
  • NVIDIA-built — NeMo framework
  • FastConformer — streaming-native
  • Only 4.8GB — light GPU footprint
  • NVIDIA GPU — 5GB VRAM minimum
  • No Docker — native Python engine
  • 100% offline — your data never leaves

VRAM: ~4.8GB • NVIDIA only

Get Nemotron — $39
9.4 Arena

Professional

VoxBar Pro Kyutai 2.6B

Kyutai STT 2.6B • English • 5.8GB VRAM

$59

one-time

🔥 LAUNCH: $29.50 with code EARLYBIRD

  • 9.4 combined — pro-grade accuracy
  • System audio — meetings and lectures
  • Auto-punctuation — context-aware
  • Only 5.8GB — lowest VRAM in Pro tier
  • NVIDIA GPU — 6GB VRAM minimum
  • No Docker — native Python engine
  • 100% offline — your data never leaves

VRAM: ~5.8GB • NVIDIA only • ~2s delay

Get Pro Kyutai 2.6B — $59

Performance

VoxBar Kyutai 1B

Kyutai STT 1B • EN + FR • 2.7GB VRAM

$39

one-time

🔥 LAUNCH: $19.50 with code EARLYBIRD

  • 8.1 combined — real-time streaming
  • <80ms latency — frame-by-frame
  • Auto-punctuation — model-generated
  • Only 2.7GB — tiny GPU footprint
  • NVIDIA GPU — 3GB VRAM minimum
  • No Docker — native Python engine
  • 100% offline — your data never leaves

VRAM: ~2.7GB • NVIDIA only • <80ms

Get Kyutai 1B — $39
9.6 Arena

Flagship

VoxBar Pro Docker

Voxtral 4B • Mistral AI • 14GB VRAM

$59

one-time

🔥 LAUNCH: $29.50 with code EARLYBIRD

  • 9.6 combined — highest arena score
  • True real-time — words as you speak
  • 13 languages — multilingual built-in
  • NVIDIA GPU — 16GB VRAM required
  • Docker Desktop — one-time setup
  • Auto-punctuation — context-aware
  • 100% offline — your data never leaves

VRAM: ~14GB • NVIDIA only • Docker required

Get Pro Docker — $59
9.5 Arena

No Docker

VoxBar Pro Native

Voxtral F16 4B • Mistral AI • 8.5GB VRAM

$59

one-time

🔥 LAUNCH: $29.50 with code EARLYBIRD

  • 9.5 combined — S-tier arena score
  • One-click install — no Docker, no WSL
  • True real-time — sub-200ms latency
  • Only 8.5GB — 40% less than Docker
  • NVIDIA GPU — 10GB VRAM minimum
  • 13 languages — multilingual built-in
  • 100% offline — your data never leaves

VRAM: ~8.5GB • NVIDIA only • No Docker

Get Pro Native — $59

Just want to try it? VoxBar Whisper is free — runs on any hardware, no GPU needed.

Download Free — VoxBar Whisper ($0)

Compare Models

Free GLM Nemotron Kyutai 2.6B Kyutai 1B Pro
Engine VoxBar Faster-Whisper (2GB VRAM) VoxBar Canary Qwen 2.5B (4GB VRAM) VoxBar Nemotron 0.6B (2GB VRAM) VoxBar Kyutai 2.6B (6GB VRAM) VoxBar Kyutai 1B (2.7GB VRAM) VoxBar Voxtral 4B (14GB VRAM)
Speed ⚡ Fast ⚡ Fast ⚡⚡ Very Fast ⚡ Delayed (~4s) ⚡⚡⚡ Instant ⚡⚡⚡ Instant
Accuracy Good Great Great Excellent Very Good S-Tier
Languages English English English EN + FR EN + FR 13+
GPU CPU only NVIDIA NVIDIA NVIDIA NVIDIA NVIDIA/AMD/Apple
Docker Win only
Platform Windows Windows Windows Windows Windows Win + Mac 🍎
Price Free $39 $39 $59 $39 $59

All plans include Overlay Mode • Standard View • Voice Commands • Lifetime License • Free Updates

✨ Just Launched

Overlay Mode

A floating voice bar that disappears into your workflow. Dictate emails in Outlook, narrate code in VS Code, write reports in Google Docs — blending seamlessly into any background with adaptive themes and Glass Mode.

🖤 True Stealth

On dark backgrounds, the bar is completely invisible. Just words appearing out of thin air. Adjust opacity from 25% to 100%.

📌 Drag & Dock Anywhere

Drag the bar to any edge of your screen. It remembers its position between sessions. Resize it to fit any space.

🎤 Voice Commands

Say "delete" to remove the last word. Dictate punctuation naturally: "comma", "full stop", "new paragraph", "open bracket".

✨ Live Demo — Watch Overlay Mode in Action

📁
Projects
💻
Terminal
🌐
Browser
📁
🌐
VS Code
🔊📶11:42 PM
🐍 app.py ×
config.yaml
engine.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
import torch
from transformers import AutoModelForCTC

class TranscriptionEngine:
"""Real-time voice-to-text engine."""

def __init__(self, model_path):
self.model = AutoModelForCTC.from_pretrained(
model_path, device_map="auto"
)
self.sample_rate = 16000

def process_chunk(self, audio_data):
with torch.no_grad():
logits = self.model(audio_data).logits
return self._decode(logits)
main Python 3.11 Ln 13, Col 42 UTF-8 Spaces: 4
Search mail...
To:client@example.com
Cc:team@company.com
Subj:Re: Project Update
|
📄
Q4 Report
File Edit View Insert Format
B I U A
Quarterly Performance Report

The quarterly report shows a 23% increase in user engagement across all platforms. This growth is primarily driven by improvements in our mobile experience and the launch of three new features.

Revenue increased by $2.4M compared to the previous quarter, with customer satisfaction reaching an all-time high of 4.8/5.0 across every region.

Key highlights include: reduced churn by 12%, increased trial-to-paid conversion rate by 18%, and expanded into 3 new markets. The engineering team shipped 47 features with a 99.9% uptime...

|

📁
Documents
🎵
Music
🌐
Chrome
📁
🌐
🎵
🔊📶11:42 PM
Invisible on dark backgrounds

🪟

Multi-Monitor

Open a overlay bar on each display. They work independently.

🧲

Drag Like a Toy

Grab it, move it, resize it. It snaps to edges and remembers position.

🎯

Click to Listen

Listens only when you tell it to. Hit the button, speak, and it auto-stops when you're done.

The Cloud Trap 🔥

  • Your audio is uploaded to Big Tech servers.
  • Requires API Keys & Credit Cards.
  • Monthly subscriptions ($30/mo+).
  • Lag & Buffering: Fails without internet.
  • Censorship: AI filters block "sensitive" words.
  • Data Mining: Your voice trains their future models.
  • Service Outages: If AWS goes down, you stop working.
  • Account Bans: One "violation" and you lose access.
  • Primitive Editors: Can't handle mid-sentence corrections.

VoxBar™ Local 🛡️

  • Air-Gapped: Audio never leaves your RAM.
  • No API Keys, No Training: Pre-tuned AI — just download and go.
  • One-Time Install: Own the software forever.
  • Polyglot Mode: Fluidly switches languages in real-time.
  • Zero Latency: GPU acceleration means instant text.
  • Uncensored: No "safety filters". It types what you say.
  • Unlimited: No "minutes per month" caps. Talk forever.
  • Context Aware: Voxtral Intelligence understands nuance.
  • Smart Editing: Insert text mid-sentence. Auto-correct fixes mistakes.

Standard View — Your Voice, Any App

The full-featured window for when you need to see every word. Dictate documents, emails, clinical notes, and code — in 13 languages, more coming soon.

📄 Chapter12_Draft.docx — Word
Calibri 11 B I

Chapter 12: The Morning Light

The morning light filtered through the curtains, casting long golden shadows across the wooden floor. She sat at her desk, coffee growing cold beside her, fingers hovering above the keyboard.

But today was different. Today she didn't need to type.

|

📥 Inbox
M. Dupont
Re: Projet de collaboration
HR Team
Policy update Q1
Calendar
Meeting at 3:00 PM
J. Smith
Project timeline review
New Message — Re: Projet de collaboration
To: dupont@enterprise.fr
Subject: Re: Projet de collaboration

Composing reply...

Explorer
📁 src
📄 main.py
📄 sort_algo.py ●
📁 tests
📄 test_sort.py
sort_algo.py main.py
def quick_sort(arr):
# TODO: add documentation
if len(arr) <= 1:
return arr
pivot = arr[0]
# |
🏥 Patient Records
García, Carlos
ID: 2024-4851
Müller, Hans
ID: 2024-4850
Tanaka, Yuki
ID: 2024-4849
Clinical Notes — García, Carlos URGENT
Date: 15/02/2026 | Dr. Reyes

Dictating notes...

FileSettingsHelp
VoxBar™
● Ready Words: 0 · Chars: 0

📝 Writers & Authors — Dictate directly into your documents

13 Languages — Powered by Voxtral

EnglishENEnglish
FrenchFRFrench
ChineseZHChinese
ArabicARArabic
GermanDEGerman
JapaneseJAJapanese
SpanishESSpanish
HindiHIHindi
ItalianITItalian
KoreanKOKorean
PortuguesePTPortuguese
RussianRURussian
DutchNLDutch

More languages coming soon.

Who is VoxBar™ Designed For?

🎬

Creators & Gamers

You've got the GPU — put it to work. Dictate scripts, take notes, and transcribe offline.

🎓

Students & Teachers

Transcribe lectures offline. Review notes instantly. No monthly fees.

✍️

Writers & Authors

Write novels by speaking. No subscription limits on word count.

⚖️

Legal & Medical

Dictate sensitive client notes without violating NDAs or HIPAA.

System Requirements

Each tier has different hardware needs. Pick the one that fits your setup.

Free • Whisper+

Lightweight engines — runs on almost anything

OSWindows 10 / 11
GPUNone (CPU) or NVIDIA 4GB+
RAM8 GB
DockerNot required
Storage~3 GB

AI

Canary Qwen 2.5B engine — best balance of speed + accuracy

OSWindows 10 / 11
GPUNVIDIA 6GB+ VRAM
RAM16 GB
DockerDocker Desktop (running)
Storage~8 GB

Pro — Windows

Voxtral 4B engine — maximum intelligence

OSWindows 10 / 11
GPUNVIDIA 12GB+ VRAM
RAM16 GB
DockerDocker Desktop (running)
Storage~12 GB

VoxBar™ Pro requires Docker Desktop and an NVIDIA GPU.

Pro — Mac 🍎

Same Voxtral 4B engine — native Apple Silicon

OSmacOS 13+ (Ventura)
ChipApple Silicon (M1 / M2 / M3 / M4)
RAM8GB+ unified (16GB recommended)
DockerNot required
Storage~11 GB

Metal GPU acceleration • No Docker • One-click install

View All Plans →

All tiers include lifetime license + future updates

Trust & Transparency

🛡️

If you see an "Unknown Publisher" warning

Some Windows users may see a SmartScreen prompt during installation. VoxBar is an independently developed utility — we don't gather or share your data, which means Microsoft doesn't "know" us yet. This is completely normal for independent software.

Quick Installation Steps (Windows):

  1. 1 Click "More Info" on the blue popup.
  2. 2 Select "Run Anyway".

Mac Users:

VoxBar Pro on Mac runs natively — no Docker, no Gatekeeper issues. Just double-click the launcher and the engine builds itself automatically.

No Cloud API
Audio never leaves your RAM
No Data Mining
100% Air-Gapped Operation

The Story

The keyboard was always the bottleneck

Real‑time, multilingual speech‑to‑text isn't science fiction anymore — it's already a reality. Meet VoxBar™: the next generation of human‑to‑machine communication, bridging the gap between your voice and your computer. If you're into coding, creative work, or anything that demands fast output, chances are your hardware is already ready. Imagine all‑day productivity boosts you never believed were possible. That's what happens when you remove the keyboard from the equation.

And here's the part most people miss: the biggest barrier to AI adoption isn't the technology — it's trust. People are reluctant to speak freely when big tech is listening. Client contracts, medical records, business strategy, personal reflections — none of it belongs on someone else's server. No cloud listening in. No big tech harvesting your words. VoxBar™ was built from day one with a simple rule: we don't gather your data. Period.

A developer built it for himself. It was too good not to share. Voice‑first computing isn't a novelty — it's the future of how we work. And the future starts on your machine.

Read the Full Story →