syllabreak

Multilingual library for accurate and deterministic hyphenation and syllable counting without relying on dictionaries.

Supported Languages

🇬🇧 English (eng)
🇷🇺 Russian (rus)
🇷🇸 Serbian Cyrillic (srp-cyrl)
🇷🇸 Serbian Latin (srp-latn)
🇹🇷 Turkish (tur)
🇬🇪 Georgian (kat)
🇩🇪 German (deu)
🇫🇷 French (fra)
🇷🇴 Romanian (ron)
🇪🇸 Spanish (spa)
🇵🇹 Portuguese (por)
🇵🇱 Polish (pol)
🏛️ Latin (lat)

Usage

Auto-detect language

When no language is specified, the library automatically detects the most likely language:

>>> from syllabreak import Syllabreak
>>> s = Syllabreak("-")
>>> s.syllabify("hello")
'hel-lo'
>>> s.syllabify("здраво")  # Serbian Cyrillic
'здра-во'
>>> s.syllabify("привет")  # Russian
'при-вет'

Specify language explicitly

You can specify the language code for more predictable results:

>>> s = Syllabreak("-")
>>> s.syllabify("problem", lang="eng")  # Force English rules
'pro-blem'
>>> s.syllabify("problem", lang="srp-latn")  # Force Serbian Latin rules
'prob-lem'

This is useful when:

The text could match multiple languages
You want consistent rules for a specific language
Processing text in a known language

Language Detection

The library returns all matching languages sorted by confidence:

>>> from syllabreak import Syllabreak
>>> s = Syllabreak()
>>> s.detect_language("hello")
['eng', 'srp-latn', 'tur']  # Matches English, Serbian Latin and Turkish
>>> s.detect_language("čovek")
['srp-latn', 'eng', 'tur']  # Serbian Latin has highest confidence due to č

Name		Name	Last commit message	Last commit date
Latest commit History 211 Commits
.github		.github
syllabreak		syllabreak
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py
test_readme.py		test_readme.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

syllabreak

Supported Languages

Usage

Auto-detect language

Specify language explicitly

Language Detection

Lines of Code

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

apakabarfm/syllabreak-python

Folders and files

Latest commit

History

Repository files navigation

syllabreak

Supported Languages

Usage

Auto-detect language

Specify language explicitly

Language Detection

Lines of Code

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages