Voice Cloning (SV2TTS) — Pro Starter

Production-style starter for voice cloning with switchable backends and an API server:

Backends: RTVC (CorentinJ) and Coqui TTS (multi-speaker) via thin wrappers
FastAPI server with /clone endpoint (multipart: speaker WAV + text + backend)
Soft watermark (spread-spectrum) for tagging outputs (embed/detect)
CLI, tests, Docker, CI, pre-commit, and a minimal notebook
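
To make the watermark bullet concrete, here is a minimal sketch of a spread-spectrum embed/detect pair. It is not the code in watermark.py: the function names, the SHA-256 seeding of the noise sequence from key and tag, and the strength and threshold defaults are all assumptions chosen for illustration. Audio is assumed to be a mono float array in the range [-1, 1].

```python
import hashlib

import numpy as np


def _chips(key: str, tag: str, n: int) -> np.ndarray:
    """Derive a deterministic +/-1 pseudo-noise sequence from key and tag (assumed scheme)."""
    digest = hashlib.sha256(f"{key}:{tag}".encode("utf-8")).digest()
    rng = np.random.default_rng(int.from_bytes(digest[:8], "big"))
    return rng.choice(np.array([-1.0, 1.0]), size=n)


def embed(audio: np.ndarray, key: str, tag: str, strength: float = 0.01) -> np.ndarray:
    """Add a low-amplitude pseudo-noise sequence to the signal."""
    marked = audio + strength * _chips(key, tag, audio.shape[0])
    return np.clip(marked, -1.0, 1.0)


def detect(audio: np.ndarray, key: str, tag: str, threshold: float = 4.0) -> bool:
    """Correlate against the expected sequence and compare to a threshold.

    For audio that does not carry this key/tag, the normalized correlation
    behaves roughly like a standard normal variable, so the threshold of 4.0
    (an illustrative value) keeps false positives rare.
    """
    chips = _chips(key, tag, audio.shape[0])
    score = float(np.dot(audio, chips) / (np.linalg.norm(audio) + 1e-12))
    return score > threshold
```

Detection gets more reliable with longer clips, because the correlation with the right sequence grows with the number of samples while the noise term grows only with its square root. A real deployment would also need to survive resampling and lossy compression, which this sketch does not address.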

⚠️ Ethics & Consent

Clone only your own voice or voices with explicit written consent.

Clearly label synthetic audio.

Follow local laws and platform policies.

Quickstart

python -m venv .venv && source .venv/bin/activate     # Windows: py -m venv .venv && .venv\Scripts\activate
pip install -r requirements.txt -r requirements-dev.txt
pre-commit install

Connect a backend

RTVC (recommended classic)

git clone https://github.com/CorentinJ/Real-Time-Voice-Cloning.git ~/rtvc
export VOICE_CLONER_RTVC_PATH=~/rtvc
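
As a rough illustration of how a wrapper could consume this variable, the sketch below resolves and validates the checkout path. The helper name resolve_rtvc_path and its error messages are hypothetical; the actual rtvc_wrapper.py may handle this differently.

```python
import os
from pathlib import Path


def resolve_rtvc_path(env=os.environ) -> Path:
    """Return the RTVC checkout directory named by VOICE_CLONER_RTVC_PATH.

    Hypothetical helper: raises early with a readable message instead of
    failing later on an import error.
    """
    raw = env.get("VOICE_CLONER_RTVC_PATH")
    if not raw:
        raise RuntimeError("VOICE_CLONER_RTVC_PATH is not set")
    path = Path(raw).expanduser().resolve()
    if not path.is_dir():
        raise FileNotFoundError(f"RTVC checkout not found: {path}")
    # A wrapper might then add this directory to sys.path before importing
    # the RTVC modules (an assumption, since RTVC is used from a git clone).
    return path
```

Expanding "~" in Python matters when the variable is set from a context that does not perform shell tilde expansion, such as a Docker ENV line or a .env file.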

Coqui TTS (multi-speaker)

pip install TTS                       # coqui-ai TTS
# Choose a multi-speaker model name, e.g. "tts_models/en/vctk/vits"

CLI

python -m voice_cloner.cli clone \
  --backend rtvc \
  --speaker assets/samples/spk.wav \
  --text "Hello from a consented voice clone." \
  --out out.wav \
  --wm-key "secret123" \
  --wm-tag "demo:soheil:2025-08-13"
# or Coqui
python -m voice_cloner.cli clone \
  --backend coqui \
  --model "tts_models/en/vctk/vits" \
  --speaker assets/samples/spk.wav \
  --text "Hello from Coqui backend." \
  --out out.wav
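
For readers who want to see how the flags above fit together, this is a sketch of an argparse parser that accepts the same command line. It mirrors only the flags shown in the examples; which flags are required, the defaults, and the help strings are assumptions, and the real cli.py may be built differently.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Parser accepting the `clone` subcommand and the flags used above."""
    parser = argparse.ArgumentParser(prog="voice_cloner.cli")
    sub = parser.add_subparsers(dest="command", required=True)

    clone = sub.add_parser("clone", help="synthesize text in a reference voice")
    clone.add_argument("--backend", choices=["rtvc", "coqui"], required=True)
    clone.add_argument("--model", default=None, help="Coqui model name")
    clone.add_argument("--speaker", required=True, help="reference WAV file")
    clone.add_argument("--text", required=True)
    clone.add_argument("--out", default="out.wav")  # default is an assumption
    clone.add_argument("--wm-key", default=None, help="watermark secret")
    clone.add_argument("--wm-tag", default=None, help="watermark tag string")
    return parser
```

argparse converts the dashes in --wm-key and --wm-tag to underscores, so the parsed values are read as args.wm_key and args.wm_tag.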

API Server

uvicorn voice_cloner.server:app --host 0.0.0.0 --port 8000

POST /clone (multipart/form-data):

speaker: WAV file (mono recommended)
text: string
backend: rtvc or coqui
model (optional): for coqui, e.g., tts_models/en/vctk/vits
wm_key (optional): watermark secret
wm_tag (optional): watermark tag string

Response: WAV audio stream.
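
A client for this endpoint can be sketched with the standard library alone. The form field names below come from the list above; the helper names, the spk.wav upload filename, and the localhost URL in the usage comment are illustrative assumptions.

```python
import io
import urllib.request
import uuid


def build_clone_request(url, speaker_wav: bytes, text, backend="rtvc",
                        model=None, wm_key=None, wm_tag=None):
    """Build (but do not send) a multipart/form-data POST for /clone."""
    boundary = uuid.uuid4().hex
    fields = {"text": text, "backend": backend, "model": model,
              "wm_key": wm_key, "wm_tag": wm_tag}
    buf = io.BytesIO()
    for name, value in fields.items():
        if value is None:  # optional fields are simply left out
            continue
        buf.write(f"--{boundary}\r\n".encode())
        buf.write(f'Content-Disposition: form-data; name="{name}"\r\n\r\n'.encode())
        buf.write(value.encode("utf-8") + b"\r\n")
    # The speaker sample is sent as a file part.
    buf.write(f"--{boundary}\r\n".encode())
    buf.write(b'Content-Disposition: form-data; name="speaker"; filename="spk.wav"\r\n')
    buf.write(b"Content-Type: audio/wav\r\n\r\n")
    buf.write(speaker_wav + b"\r\n")
    buf.write(f"--{boundary}--\r\n".encode())
    headers = {"Content-Type": f"multipart/form-data; boundary={boundary}"}
    return urllib.request.Request(url, data=buf.getvalue(), headers=headers, method="POST")


def clone_to_file(request, out_path):
    """Send the request and save the returned WAV stream to disk."""
    with urllib.request.urlopen(request) as resp, open(out_path, "wb") as fh:
        fh.write(resp.read())


# Usage, assuming the server from the uvicorn command above is running locally:
# wav = open("assets/samples/spk.wav", "rb").read()
# req = build_clone_request("http://localhost:8000/clone", wav, "Hello.", backend="rtvc")
# clone_to_file(req, "out.wav")
```

Splitting request construction from sending keeps the multipart encoding easy to inspect without a running server.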

Notebook

See: notebooks/voice_clone_demo.ipynb

Tests

pytest -q

Structure

src/voice_cloner/
  cli.py
  server.py
  watermark.py
  backends/
    rtvc_wrapper.py
    coqui_wrapper.py
  utils/
    audio.py
tests/
  test_cli.py
  test_watermark.py
  test_server_import.py
assets/
  samples/          # put your own sample
...
