🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
-
Updated
Mar 9, 2025 - Python
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
🦜 NLP for Tibetan, in Python.
repo for Tibetan corpora
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
Tibetan phonetics engine in Python
This Tibetan tokenizer based on Bi-LSTM+CRF methods, it was created with the aim of aiding researchers in the field of Tibetan natural language processing.
Basic Universal Dependencies Part-of-Speech Tagger for Tibetan
Tibetan-English neural machine translation for edge devices.
An application of PyBo to Tibetan Spell-Checking
syllable-based diffs that make use of google's diff-match-patch and pybo's preprocess
Add a description, image, and links to the tibetan-nlp topic page so that developers can more easily learn about it.
To associate your repository with the tibetan-nlp topic, visit your repo's landing page and select "manage topics."