Show-Agnostic · Multi-Voice · Open Source

Markdown script
→ Podcast-ready MP3.

xil-pipeline is an audio production pipeline that takes your markdown script and produces a fully mixed, mastered podcast episode — your choice of TTS backend, zero lock-in.

Three backends.
One pipeline.

Pick the voice quality and cost model that fits your show. Switch backends with a single flag — no code changes.

FREE · NO API KEY
gTTS

Google Text-to-Speech via the free public API. No account, no key, no cost. Perfect for prototyping, local testing, or shows on a budget.

🟢 Zero cost, zero setup 🟢 Works offline-friendly 📦 --backend gtts
OPEN SOURCE · LOCAL
Chatterbox

A high-quality open-source TTS engine that runs locally. No cloud dependency, full privacy, and expressive multi-voice output.

🟣 Runs on your hardware 🟣 No data leaves your machine 📦 --backend chatterbox
COMMERCIAL · PREMIUM
ElevenLabs

Industry-leading voice cloning and ultra-realistic TTS. Ideal for professional productions where voice quality is non-negotiable.

🟡 Highest voice realism 🟡 Voice cloning support 📦 --backend elevenlabs

From install to
episode in minutes.

Every step of the production pipeline has a dedicated command. Scan, parse, produce, mix, and master — all from the terminal.

Everything a podcast
pipeline needs.

Built for creators and developers who want full control over their audio production workflow.

📝

Markdown-First Scripts

Write your script in plain markdown. Assign voices per speaker, add cues, and let the pipeline handle the rest.

🗣️

Multi-Voice Support

Assign different TTS voices to different speakers. Host, guest, narrator — each gets their own voice profile.

🔀

Backend Flexibility

Swap between gTTS, Chatterbox, and ElevenLabs with a single CLI flag. No rewiring, no code changes.

🎚️

Built-in DAW & Mastering

xil-daw mixes segments and xil-master normalises loudness for broadcast-ready output.

🔍

Pre-flight Scanning

Catch script errors before production. xil-scan validates voices, segments, and config before a single byte of audio is generated.

🖥️

Gradio GUI

Not a CLI person? The xil-gui Gradio dashboard gives you a full point-and-click interface for every pipeline stage.

Prefer a dashboard
over a terminal?

xil-gui is a Gradio-powered web dashboard that wraps the entire pipeline in a clean, browser-based UI. Upload your script, pick your backend, hit produce — no CLI required.

  • Visual script upload & voice assignment
  • Backend selector with live cost estimate
  • Real-time production progress bar
  • In-browser audio preview & MP3 download
🎙️ xil-gui · Gradio Dashboard
Script scripts/S01E01.md ✔
Episode S01E01
Backend ▾ Chatterbox (local)
Voices Host · Guest · Narrator
Producing… segment 9 / 12

Start your first episode today.

Free to use, open source, and ready to run. No API key required to get started.

View on GitHub Open an Issue
Made with Unstoppable Domains

Remove this watermark

This site was built with Unstoppable Domains AI Site Builder. The site owner can remove this watermark by subscribing to an AI Credits plan.

View Plans