Main Page: Difference between revisions
Line 1: | Line 1: | ||
Welcome to Kurdî Wikibase, a [https://wikibase.cloud Wikibase Cloud] instance. | Welcome to Kurdî Wikibase, a [https://wikibase.cloud Wikibase Cloud] instance. | ||
=Project Goals= | =Project Goals= | ||
Goal of this project is to transfer Kurdish lexical data from [https://github.com/sinaahmadi/KurdishLexicography Ontolex TTL sources] to this Wikibase, and subsequent transfer to Wikidata, | Goal of this project is to transfer Kurdish lexical data from [https://github.com/sinaahmadi/KurdishLexicography Ontolex TTL sources] to this Wikibase, and subsequent transfer to Wikidata, as an enrichment of existing Wikidata lexemes, or creation of new Wikidata lexemes. Open curation tasks (see below) can be done on this Wikibase. If you want to contribute, please register. | ||
=Publications= | |||
A publication about this project is currently being reviewed. | |||
=Datasets integrated into Kurdî Wikibase= | =Datasets integrated into Kurdî Wikibase= |
Revision as of 17:51, 31 March 2023
Welcome to Kurdî Wikibase, a Wikibase Cloud instance.
Project Goals
Goal of this project is to transfer Kurdish lexical data from Ontolex TTL sources to this Wikibase, and subsequent transfer to Wikidata, as an enrichment of existing Wikidata lexemes, or creation of new Wikidata lexemes. Open curation tasks (see below) can be done on this Wikibase. If you want to contribute, please register.
Publications
A publication about this project is currently being reviewed.
Datasets integrated into Kurdî Wikibase
Northern Kurdish (also known as Kurmanji, kmr)
Over 4,000 headwords are provided in Northern Kurmanji in the Latin-based script. Headwords are defined with part-of-speech tags, grammatical gender, and glosses based on distinct senses in Northern Kurdish and English. Usage examples are also provided in some cases.
Central Kurdish (also referred to as Sorani, ckb)
Over 5,000 headwords are provided in Central Kurdish (Sorani) written in the Latin-based script. This script, unlike Northern Kurmanji, is not much used by Central Kurdish speakers; the Perso-Arabic-based script is mostly used for this variant. Entries are described with part-of-speech tags, glosses in English and, sometimes, usage examples. Grammatical gender is not present in Central Kurdish.
Southern Kurdish (sdh)
The Southern Kurdish resource contains over 11,000 headwords, the highest number among the selected resources. The headwords are written in both Perso-Arabic and Latin-based scripts and are described with glosses in Persian and other varieties of Kurdish. Such varieties include words from Kurdish varieties along with Laki and Luri languages.
Gorani (also known as Hawrami, hac)
In comparison to the other resources, The Gorani resource is the smallest one containing around 1,000 headwords written in the Latin-based script and described with part-of-speech tags, grammatical gender, glosses in English and a few usage examples. Similar to Central Kurdish, this language is mostly written in the Perso-Arabic-based script of Kurdish.
Project Log
See Project Log page.
SPARQL Queries
See some queries on SPARQL Queries page, to explore what is on Kurdî Wikibase.
Community tasks
See the Curation Tasks page.
Issue with language codes
Not all language codes used in our Ontolex sources are available on Wikibase/Wikidata. See https://phabricator.wikimedia.org/T325688.