Main Page: Difference between revisions

From Kurdî Wikibase
m No edit summary
Line 19: Line 19:
==Gorani (also known as Hawrami, '''hac''')==
==Gorani (also known as Hawrami, '''hac''')==


In comparison to the other resources, The [[Item:Q15|Gorani]] resource is the smallest one containing around 1,000 headwords written in the Latin-based script and described with part-of-speech tags, grammatical gender, glosses in English and a few usage examples. Similar to Central Kurdish, this language is mostly written in the Perso-Arabic-based script of Kurdish.  
In comparison to the other resources, The [[Item:Q37|Gorani]] resource is the smallest one containing around 1,000 headwords written in the Latin-based script and described with part-of-speech tags, grammatical gender, glosses in English and a few usage examples. Similar to Central Kurdish, this language is mostly written in the Perso-Arabic-based script of Kurdish.


=Project Log=
=Project Log=

Revision as of 17:46, 31 March 2023

Welcome to Kurdî Wikibase, a Wikibase Cloud instance.

Project Goals

The goal of this project is to transfer Kurdish lexical data from Ontolex TTL sources to this Wikibase and subsequently to Wikidata, either by aligning entries to existing Wikidata lexemes or by creating new Wikidata lexemes. Open curation tasks (see below) can be done on this Wikibase. If you want to contribute, please register.

Datasets integrated into Kurdî Wikibase

Northern Kurdish (also known as Kurmanji, kmr)

Over 4,000 headwords are provided in Northern Kurdish (Kurmanji) in the Latin-based script. Headwords are described with part-of-speech tags, grammatical gender, and glosses based on distinct senses in Northern Kurdish and English. Usage examples are also provided in some cases.

Central Kurdish (also referred to as Sorani, ckb)

Over 5,000 headwords are provided in Central Kurdish (Sorani), written in the Latin-based script. Unlike in Northern Kurdish, this script is not widely used by Central Kurdish speakers, who mostly write in the Perso-Arabic-based script. Entries are described with part-of-speech tags, glosses in English and, sometimes, usage examples. Central Kurdish has no grammatical gender.

Southern Kurdish (sdh)

The Southern Kurdish resource contains over 11,000 headwords, the highest number among the selected resources. The headwords are written in both the Perso-Arabic-based and Latin-based scripts and are described with glosses in Persian and in other varieties of Kurdish. These glosses include words from Kurdish varieties as well as from the Laki and Luri languages.

Gorani (also known as Hawrami, hac)

Compared with the other resources, the Gorani resource is the smallest, containing around 1,000 headwords written in the Latin-based script and described with part-of-speech tags, grammatical gender, glosses in English and a few usage examples. Like Central Kurdish, Gorani is mostly written in the Perso-Arabic-based script of Kurdish.

Project Log

See Project Log page.

SPARQL Queries

See the SPARQL Queries page for some queries that explore what is on Kurdî Wikibase.
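
As a rough illustration of the kind of query that can be used to explore a Wikibase holding lexemes, the sketch below builds a SPARQL query that counts lexemes per language item, and a request URL for it. It assumes the standard Wikibase RDF mapping for lexemes (`ontolex:LexicalEntry`, `dct:language`). The endpoint URL and the function names are hypothetical placeholders, not taken from this page, and the queries on the SPARQL Queries page may look different.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the real query service URL is not stated on this page.
ENDPOINT = "https://example.org/query/sparql"

# Prefixes used by the standard Wikibase RDF mapping for lexemes.
PREFIXES = (
    "PREFIX ontolex: <http://www.w3.org/ns/lemon/ontolex#>\n"
    "PREFIX dct: <http://purl.org/dc/terms/>\n"
)


def build_lexeme_count_query(limit: int = 10) -> str:
    """Return a SPARQL query that counts lexemes per language item."""
    if limit < 1:
        raise ValueError("limit must be positive")
    return PREFIXES + (
        "SELECT ?language (COUNT(?lexeme) AS ?lexemes) WHERE {\n"
        "  ?lexeme a ontolex:LexicalEntry ;\n"
        "          dct:language ?language .\n"
        "}\n"
        "GROUP BY ?language\n"
        "ORDER BY DESC(?lexemes)\n"
        f"LIMIT {limit}"
    )


def build_request_url(query: str, endpoint: str = ENDPOINT) -> str:
    """Return a GET URL that asks the endpoint for JSON results."""
    return endpoint + "?" + urlencode({"query": query, "format": "json"})


if __name__ == "__main__":
    print(build_lexeme_count_query(limit=5))
```

The query text can also be pasted directly into the query service user interface; the URL helper is only needed for scripted access.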

Community tasks

See the Curation Tasks page.

Issue with language codes

Not all language codes used in our Ontolex sources are available on Wikibase/Wikidata. See https://phabricator.wikimedia.org/T325688.