Bonsai_Tutorials/transcriber
2025-11-01 20:55:51 +11:00

transcriber


A Python script that uses Whisper to transcribe videos and write SRT subtitle files.

UV or Docker - that is the question.

Whether you choose uv or Docker, our Makefile launches the transcriber with a series of interactive questions that have sensible defaults. It uses the second-smallest model, base.en, by default; however, I used medium.en, the largest English-only model, for the SRT files I've committed to this repo. To be honest, I couldn't discern much difference between the two.
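For readers unfamiliar with the SRT format, here is a minimal sketch of how transcription segments could be turned into subtitle text. It is not the code in src/transcriber; the function names format_timestamp and segments_to_srt are made up for illustration. It assumes each segment is a dict with start and end times in seconds plus a text field, which is the shape Whisper returns under the "segments" key.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1_000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text from dicts with 'start', 'end' (seconds) and 'text'."""
    blocks = []
    for index, segment in enumerate(segments, start=1):
        start = format_timestamp(segment["start"])
        end = format_timestamp(segment["end"])
        # Each cue is: a counter, the time range, the text, then a blank line.
        blocks.append(f"{index}\n{start} --> {end}\n{segment['text'].strip()}\n")
    return "\n".join(blocks)


# Hypothetical usage with Whisper (requires the openai-whisper package and ffmpeg):
#   import whisper
#   model = whisper.load_model("base.en")
#   result = model.transcribe("video.mp4")
#   srt_text = segments_to_srt(result["segments"])
```

Keeping the formatting separate from the transcription call means the SRT logic can be tested without loading a model.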

Using Python's uv

If you have uv in your PATH and ffmpeg installed on your system, you should be able to do the following to install and use this package.

  1. make - runs the default target, make help, which lists all the make targets.
  2. make install - installs the virtual environment and pre-commit hooks.
  3. make check - optional, runs the code quality tools.
  4. make test - optional, runs the unit tests.
  5. make docs-test - optional, generates the HTML documentation in the site directory.
  6. make transcribe - by default, transcribes the videos in the directory above our Makefile to .srt subtitle files.
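If make install or make transcribe fails, it may be worth confirming that the two prerequisites are visible on your PATH. The snippet below is a small sketch for doing so; it is not part of this repository, and missing_tools is a hypothetical helper name.

```python
import shutil


def missing_tools(tools=("uv", "ffmpeg")):
    """Return the names of the given tools that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("uv and ffmpeg were both found on PATH.")
```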

Using Docker

If you have Docker installed on your system, you should be able to do the following to install and use this package.

  1. make docker-build - downloads and builds the transcribe-app container image.
  2. make docker-run - runs the transcriber interactively. By default, it transcribes the videos in the directory above our Makefile to .srt subtitle files.
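To illustrate the default behaviour described above, here is a rough sketch of how videos in a directory could be paired with their .srt output paths. The transcriber's own logic may differ: the list of video extensions and the name plan_outputs are assumptions made for this example.

```python
from pathlib import Path

# Assumed set of video extensions; the real script may accept others.
VIDEO_SUFFIXES = {".mp4", ".mkv", ".mov", ".webm"}


def plan_outputs(video_dir):
    """Pair each video file in video_dir with a sibling .srt path."""
    video_dir = Path(video_dir)
    videos = sorted(
        path
        for path in video_dir.iterdir()
        if path.is_file() and path.suffix.lower() in VIDEO_SUFFIXES
    )
    return [(video, video.with_suffix(".srt")) for video in videos]


if __name__ == "__main__":
    # ".." mirrors the default of looking in the directory above the Makefile.
    for video, subtitle in plan_outputs(".."):
        print(f"{video.name} -> {subtitle.name}")
```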