History

Doug Scoular beb900a417 Added the transcriber script and generated SRT subtitle text files		2025-11-01 20:55:51 +11:00
.github	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
docs	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
helpers	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
src/transcriber	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
tests	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
.dockerignore	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
.gitignore	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
.pre-commit-config.yaml	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
.python-version	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
Dockerfile	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
LICENSE	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
Makefile	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
mkdocs.yml	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
pyproject.toml	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
README.md	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
tox.ini	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00
uv.lock	Added the transcriber script and generated SRT subtitle text files	2025-11-01 20:55:51 +11:00

README.md

transcriber

A Python script that uses Whisper to transcribe videos and output SRT subtitle text files.

GitHub repository: https://github.com/dscoular/transcriber/
Documentation: https://dscoular.github.io/transcriber/
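
For readers unfamiliar with the SRT format, here is a minimal sketch of how a single subtitle cue is laid out: a sequence number, a time range written as `HH:MM:SS,mmm --> HH:MM:SS,mmm`, the text, and a blank line. The helper names `format_timestamp` and `format_cue` are hypothetical and are not taken from this repository's code.

```python
def format_timestamp(seconds: float) -> str:
    """Format an offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def format_cue(index: int, start: float, end: float, text: str) -> str:
    """Build one SRT cue: index, time range, text, then a blank line."""
    time_range = f"{format_timestamp(start)} --> {format_timestamp(end)}"
    return f"{index}\n{time_range}\n{text.strip()}\n\n"


if __name__ == "__main__":
    # Hypothetical segment: 0.0 s to 2.5 s of speech.
    print(format_cue(1, 0.0, 2.5, "Hello and welcome."), end="")
```

An `.srt` file is simply these cues concatenated in order, numbered from 1.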

`UV` or `Docker` - that is the question.

Whether you choose uv or Docker, our Makefile launches the transcriber, which asks a series of interactive questions with sensible defaults. By default it uses base.en, the second-smallest language model. However, I used medium.en, the largest English-only model, for the SRT files I've committed to this repo. To be honest, I couldn't discern much difference between the two.

Using Python's `uv`

If you have uv in your PATH and ffmpeg installed on your system, you should be able to do the following to install and use this package.

make - runs the default target, make help, which shows help for all the make targets.
make install - installs the virtual environment and pre-commit hooks.
make check - optional, runs the code quality tools.
make test - optional, runs unit tests.
make docs-test - optional, generates the HTML documentation in the site directory.
make transcribe - by default, converts the videos in the directory above our Makefile to .srt subtitle text files.
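
Before running these targets, it can help to confirm that both prerequisites are actually on your PATH. The following is a minimal sketch using only the standard library. The function name `missing_tools` is hypothetical and is not part of this package.

```python
import shutil


def missing_tools(tools=("uv", "ffmpeg")):
    """Return the names of the given tools that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing from PATH: " + ", ".join(missing))
    else:
        print("uv and ffmpeg were found on PATH.")
```

If either tool is reported missing, install it before running make install.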

Using `Docker`

If you have docker installed on your system, you should be able to do the following to install and use this package.

make docker-build - downloads and builds the transcribe-app container image.
make docker-run - runs the transcriber interactively. By default, this converts the videos in the directory above our Makefile to .srt subtitle text files.
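
Both make transcribe and make docker-run turn a directory of videos into .srt files. The sketch below illustrates that mapping under two assumptions that are not confirmed by this README: that videos are recognised by a small set of file extensions, and that each subtitle file is written next to its video with the same base name. The names `VIDEO_SUFFIXES`, `srt_path_for`, and `plan_outputs` are hypothetical.

```python
from pathlib import Path

# Assumption: the real script may recognise a different set of extensions.
VIDEO_SUFFIXES = {".mp4", ".mkv", ".mov"}


def srt_path_for(video: Path) -> Path:
    """Return the subtitle path that sits next to the given video file."""
    return video.with_suffix(".srt")


def plan_outputs(video_dir: Path) -> list[tuple[Path, Path]]:
    """Pair every video file in video_dir with its planned .srt path."""
    videos = sorted(
        path
        for path in video_dir.iterdir()
        if path.is_file() and path.suffix.lower() in VIDEO_SUFFIXES
    )
    return [(video, srt_path_for(video)) for video in videos]


if __name__ == "__main__":
    # ".." mirrors the default of using the directory above the Makefile.
    for video, srt in plan_outputs(Path("..")):
        print(f"{video} -> {srt}")
```

Running it from the directory that holds the Makefile lists which subtitle files would be produced, without transcribing anything.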
