Curation Tasks

From Kurdî Wikibase

This page describes issues to be fixed in the lexical data, and proposed practices.

Kurmanji

Superlative used as lexical category

STATUS: TO DO

Superlative describes an adjective form, not a lexical category. These lexemes should be integrated into the entry of the corresponding adjective, which should use its positive form as lemma (and not the superlative). As an example, see how the English adjective good is modeled on Wikidata.

PREFIX kwb: <https://kurdi.wikibase.cloud/entity/>
select ?language ?languageLabel ?lemma where {
  ?lexeme a ontolex:LexicalEntry;
          dct:language ?language;
          wikibase:lemma ?lemma;
          wikibase:lexicalCategory kwb:Q30. # superlative used as POS
  ?language rdfs:label ?languageLabel. filter(lang(?languageLabel)="en")
}

Try it!


Kurmanji / Sorani / Hawrami

English usage example translations that appear as sense glosses

STATUS: TO DO

In some entries, the English translation of the usage example appears as sense gloss (i.e., short definition) at the same time. In most of these cases, the sense gloss should be a subset of the words that make up the usage example translation. For example, in L2015, the only sense for the lemma nîv should be glossed as half, and not as half an hour, which is the translation of the usage example nîvan saetê, correctly translated as half an hour (see uncorrected initial version vs. corrected version of the lexeme sense gloss).

PREFIX kwb: <https://kurdi.wikibase.cloud/entity/>
PREFIX kdp: <https://kurdi.wikibase.cloud/prop/direct/>
PREFIX kp: <https://kurdi.wikibase.cloud/prop/>
PREFIX kps: <https://kurdi.wikibase.cloud/prop/statement/>
PREFIX kpq: <https://kurdi.wikibase.cloud/prop/qualifier/>

select ?lexeme ?languageLabel ?posLabel ?sense ?lemma ?gloss_en ?usgxpl_ku ?usgxpl_en
where {  ?lexeme a ontolex:LexicalEntry;
                 dct:language ?language;
                 wikibase:lemma ?lemma;
                 wikibase:lexicalCategory ?pos;
                 ontolex:sense ?sense.
         ?sense skos:definition ?gloss_en. filter(lang(?gloss_en)="en")
         ?sense kp:P12 [kps:P12 ?usgxpl_ku; kpq:P16 ?usgxpl_en] .
         filter(?gloss_en = ?usgxpl_en) # restricts to senses where EN gloss and EN usage example translation are the same
       SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
       }

Try it!