| .. | ||
| .github | ||
| docs | ||
| helpers | ||
| src/transcriber | ||
| tests | ||
| .dockerignore | ||
| .gitignore | ||
| .pre-commit-config.yaml | ||
| .python-version | ||
| Dockerfile | ||
| LICENSE | ||
| Makefile | ||
| mkdocs.yml | ||
| pyproject.toml | ||
| README.md | ||
| tox.ini | ||
| uv.lock | ||
transcriber
A python script which uses whisper to transcribe videos and outputs SRT subtitle text files.
- Github repository: https://github.com/dscoular/transcriber/
- Documentation https://dscoular.github.io/transcriber/
UV or Docker - that is the question.
Whether you choose to use uv or docker our Makefile will launch it to a series of interactive
questions with sensible defaults. It will use the second smallest language model base.en by
default, however I used the largest model medium.en for the SRT files I've commited to this repo.
To be honest there wasn't much difference that I could discern.
Using python's uv
If you have uv in your PATH and ffmpeg installed in your operating system, you should
be able to do the following to install and use this package.
make- runs the default target ofmake helpto show help on all themaketargets.make install- installs the virtual environment and pre-commit hooks.make check- optional, runs the code quality tools.make test- optional, runs unit tests.make docs-test- optional, generate HTML documents in thesitedirectory.make transcribe- by default, this converts the videos in the directory above ourMakefileto.srtsubtitle text files.
Using Docker
If you have docker installed on your system, you should
be able to do the following to install and use this package.
make docker-build- Download and build thetranscribe-appcontainer image.make docker-run- Run the transcriber interactively, by default, this converts the videos in the directory above ourMakefileto.srtsubtitle text files.