wiktionary-parser

Code for the paper "Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary" (Metheniti and Neumann, 2018)

morphology linguistics inflection computational-linguistics wiktionary wiktionary-parser

Updated Jun 23, 2020
Python

snizio / italian-wiktionary-parser

This repository contains a Python script for parsing an XML dump of the Italian Wiktionary (Wikizionario), the parsed dictionary as a JSON file, and a scraper for ONLI (an Italian database of neologisms) with the scraped data in a CSV file.

nlp italian corpus-linguistics italiano onli wiktionary-parser wiktionary-data neologisms

Updated May 4, 2024
Python

Surkal / WiktionnaireParser

A library for parsing the French Wiktionary

python python3 francais french wiktionary wiktionary-parser

Updated Feb 20, 2022
Python

slowwavesleep / RuWiktionaryParser

Extracts Russian word forms and their segmentation from the Russian Wiktionary

morphology segmentation wiktionary russian-language wiktionary-parser word-forms

Updated May 1, 2021
Python

beviah / ezglot

Selected data processing scripts, including a language-agnostic multilingual Wiktionary parser

multilingual dictionary extractor templates pronunciation levenshtein-distance wikitext ipa similarity-measures language-resources wiktionary thesaurus-data wiktionary-parser wiktionary-data wiktionary-tool wiktionary-dataset word-distance

Updated Mar 31, 2024
Python

Vuizur / ruwiktionary-htmldump-parser

Parses the Russian Wiktionary HTML dumps into JSON and generates e-reader dictionaries

parser language-learning russian wiktionary wiktionary-parser

Updated Aug 10, 2023
Python

Vuizur / dewiktionary-htmldump-parser

A scraper which extracts data from the German Wiktionary HTML dump.

german czech wiktionary wiktionary-parser wiktionary-dump

Updated Jul 19, 2022
Python

AtilioA / frenchhomophones

🇫🇷 Source code for the frenchhomophones website. [inactive]

flask herokuapp mongodb-atlas wiktionary-parser

Updated Feb 3, 2022
Python

cpina / kamus-dictionary

Prototype of an interface to use Wiktionary translations

language translation dictionary wiktionary wiktionary-parser wiktionary-tool

Updated Nov 17, 2024
Python

lennon-c / de_wiktio

A Python package to parse and extract data from the German Wiktionary. It allows users to access wikitext content, either by fetching it directly online or by loading a dump file locally.

wikitext wiktionary wiktionary-parser wiktionary-dump dewiktionary