B1 (Fokus in Gur- und Kwasprachen)

contact: sfb632-B1 Diese E-Mail-Adresse ist vor Spambots geschützt! Zur Anzeige muss JavaScript eingeschaltet sein!

Gur and Kwa Corpus

modality:

spoken, monologue and dialogue, quasi-parallel (QUIS tasks alignable via task numbers), extra experiments specific to each language

formats:

wav (audio), toolbox, EXMARaLDA-XML (annotation)

source data:

focus data elicited with (parts of) QUIS, language specific elicitation tasks as well as quasi-spontaneous speech

languages:

Aja (AJA), Akan (AKA), Awutu-Efutu (AFU), Buli (BWU), Byali (BEH), Dagbani (DAG), Ditammari (TBZ), Ewe (EWE), Fon (FON), Foodo (FOD), Gurene (GUR), Konkomba (XON), Konni (KMA), Lelemi (LEF), Nateni (NTM), Waamma (WWA), Yom (PIL)


metadata:

IMDI-oriented metadata (elicited in .doc tables) included in XML files during processing. Pseudonymisation provided.

subcorpora / versions:

all 17 languages:  part of QUIS experiments for all langugages listed, large collections of extra experimental data (language specific), latest version of June 2007.

selected ANNIS set:

    • Aja:  small sample for ANNIS (QUIS experiments), status: final, latest version of July 2004, manually transcribed, glossed, translated to English and annotated with parts of speech, morphological analysis, phonological tones, definiteness of NPs, syntax and semantic roles, countability of nouns, and animacy.
    • Dagbani:  small sample for ANNIS (QUIS experiments), status: final, latest version of July 2004, manually transcribed, glossed, translated to English and annotated with information status, topic, new information focus, contrastive focus, proposition status .
    • further languages:  small samples of 9 further languages (Buli, Byali, Ditammari, Fon, Foodo, Konni, Nateni, Waamma and Yom) for ANNIS (QUIS experiments), status: in preparation, latest version of July 2007, manually transcribed, glossed, translated to English and annotated with parts of speech, morphological analysis, phonological tones, syntax, definiteness, information status, topic, new information focus, contrastive focus and animacy.