AudioRead (CLI)

Convert a PDF into a narrated audio file using a chunked LLM pipeline that avoids context window limits.

Acknowledgements

This project is inspired by PDF2Audio:

https://github.com/lamm-mit/PDF2Audio

Install (uv)

uv venv
source .venv/bin/activate
uv pip install -e .

Usage

cp .env.example .env
# edit .env to set OPENAI_API_KEY
audior convert /path/to/book.pdf

Options:

--style narration|podcast|lecture|summary
--mode summary|full|chapter
--text-model gpt-4o-mini
--tts-model gpt-4o-mini-tts
--voice alloy
--max-input-tokens 12000
--overlap-tokens 200
--tts-max-chars 3500

Outputs are placed under outputs/<pdf_name>/.
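
The --max-input-tokens, --overlap-tokens and --tts-max-chars options suggest two splitting steps: overlapping windows for the text model, and size-capped pieces for the TTS model. Below is a minimal sketch of both ideas, not the tool's actual implementation. The function names chunk_with_overlap and split_for_tts are hypothetical, and any list of tokens (for example, whitespace-separated words) stands in for real model tokens.

```python
import re


def chunk_with_overlap(tokens, max_tokens=12000, overlap=200):
    """Split a token list into windows of at most max_tokens.

    Each window repeats the last `overlap` tokens of the previous one,
    so context is not lost at chunk boundaries. (Hypothetical helper.)
    """
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    if not 0 <= overlap < max_tokens:
        raise ValueError("overlap must be in [0, max_tokens)")
    step = max_tokens - overlap
    chunks = []
    start = 0
    while start < len(tokens):
        chunks.append(tokens[start:start + max_tokens])
        # Stop once the current window reaches the end of the input
        if start + max_tokens >= len(tokens):
            break
        start += step
    return chunks


def split_for_tts(text, max_chars=3500):
    """Greedily pack sentences into pieces of at most max_chars characters.

    (Hypothetical helper; sentence detection is a simple punctuation split.)
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    pieces, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        # Hard-split any single sentence that exceeds the limit on its own
        while len(sentence) > max_chars:
            if current:
                pieces.append(current)
                current = ""
            pieces.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            pieces.append(current)
            current = sentence
    if current:
        pieces.append(current)
    return pieces
```

Splitting on sentence boundaries keeps each TTS request from cutting a sentence in half, which would otherwise be audible as an unnatural pause where two audio chunks meet.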

Contributing

Create an issue to report bugs or suggest features.
Open a PR for improvements or fixes.

License

MIT

Notes

Audio chunks are concatenated as raw MP3 bytes, which most players handle fine.
For higher-fidelity merges, optional ffmpeg/pydub support may be added later.
script_tts.txt is a cleaned copy of the script with markdown symbols removed so they are not spoken aloud.
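
The two steps in these notes can be sketched as follows. This is an illustration under assumptions, not the project's code: clean_for_tts and concat_mp3 are hypothetical names, and the cleaning rules shown cover only a few common markdown constructs.

```python
import re


def clean_for_tts(script):
    """Remove common markdown markup so it is not read aloud.

    (Hypothetical helper; underscores are left alone so that names
    such as script_tts.txt survive.)
    """
    # Heading markers at the start of a line: "## Title" -> "Title"
    text = re.sub(r"^[ \t]{0,3}#{1,6}[ \t]+", "", script, flags=re.MULTILINE)
    # Inline links: "[label](url)" -> "label"
    text = re.sub(r"\[([^\]]+)\]\([^)]+\)", r"\1", text)
    # Bullet markers at the start of a line: "- item" -> "item"
    text = re.sub(r"^[ \t]*[-*+][ \t]+", "", text, flags=re.MULTILINE)
    # Emphasis asterisks and code backticks
    text = re.sub(r"[*`]+", "", text)
    return text.strip()


def concat_mp3(chunks):
    """Join MP3 byte chunks end to end without re-encoding."""
    return b"".join(chunks)
```

Joining bytes avoids any extra dependency. A re-encoding merge through ffmpeg or pydub would instead produce a single clean stream, at the cost of an additional install.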

Demo

Summary audio sample: docs/demo-output.mp3

Source book: Great Physicists: The Life and Times of Leading Physicists from Galileo to Hawking
