Add Baidu transcription provider #47

Open: mitre88 wants to merge 2 commits into nibzard:main from mitre88:add-baidu-transcription-provider

mitre88 commented May 24, 2026

Summary

  • add a Baidu Speech Recognition provider behind --api baidu
  • use environment-based BAIDU_API_KEY / BAIDU_SECRET_KEY configuration with no committed secrets
  • convert audio sent to Baidu to mono 16 kHz MP3, then submit it as a short speech recognition request through Baidu's token + server_api flow
  • add focused unit tests covering token usage, request payload construction, language model routing, missing credentials, and API errors
  • document the new environment variables and CLI option
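
For reviewers unfamiliar with the token + server_api flow, the following is a minimal sketch of how the credentials and request payload might be assembled. It is not the code in this PR. The function names, the `DEV_PID_BY_LANGUAGE` mapping, the `cuid` default, and the endpoint constants are assumptions based on Baidu's public short speech recognition documentation and should be checked against the actual diff.

```python
import base64
import os

# Assumed endpoints, taken from Baidu's public docs rather than from this PR.
TOKEN_URL = "https://aip.baidubce.com/oauth/2.0/token"
SERVER_API_URL = "https://vop.baidu.com/server_api"

# Hypothetical language-to-model routing; the dev_pid values are assumptions.
DEV_PID_BY_LANGUAGE = {"zh": 1537, "en": 1737}


def load_credentials(env=None):
    """Read BAIDU_API_KEY / BAIDU_SECRET_KEY from the environment."""
    env = os.environ if env is None else env
    api_key = env.get("BAIDU_API_KEY")
    secret_key = env.get("BAIDU_SECRET_KEY")
    if not api_key or not secret_key:
        raise RuntimeError("BAIDU_API_KEY and BAIDU_SECRET_KEY must be set")
    return api_key, secret_key


def build_token_params(api_key, secret_key):
    """Query parameters for the OAuth client-credentials token request."""
    return {
        "grant_type": "client_credentials",
        "client_id": api_key,
        "client_secret": secret_key,
    }


def build_recognition_payload(audio, token, language="zh",
                              audio_format="mp3", cuid="sapat"):
    """JSON body for a short speech recognition request.

    `audio` is the already-converted mono 16 kHz audio as raw bytes.
    """
    if language not in DEV_PID_BY_LANGUAGE:
        raise ValueError(f"unsupported language: {language}")
    return {
        "format": audio_format,
        "rate": 16000,
        "channel": 1,
        "cuid": cuid,
        "token": token,
        "dev_pid": DEV_PID_BY_LANGUAGE[language],
        # The audio travels base64-encoded; len is the raw byte count.
        "speech": base64.b64encode(audio).decode("ascii"),
        "len": len(audio),
    }
```

Keeping payload construction in pure functions like these means the token usage, language routing, and missing-credential cases can be unit tested without any network access, which matches the test coverage described above.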

Validation

  • .venv/bin/python -m unittest discover -s tests -v
  • .venv/bin/python -m compileall src tests
  • .venv/bin/sapat --help
  • git diff --check

No secrets, payout details, or private audio are included.
