Skip to content

CLDF dataset derived from Kitchen et al.'s "Bayesian phylogenetic analysis of Semitic languages" from 2009

License

Notifications You must be signed in to change notification settings

lexibank/kitchensemitic

Repository files navigation

CLDF dataset derived from Kitchen et al.'s "Bayesian phylogenetic analysis of Semitic languages" from 2009

How to cite

If you use these data please cite

  • the original source

    Bayesian phylogenetic analysis of Semitic languages identifies an Early Bronze Age origin of Semitic in the Near East. Andrew Kitchen, Christopher Ehret, Shiferaw Assefa, Connie J. Mulligan. Proc. R. Soc. B 2009 -; DOI: 10.1098/rspb.2009.0408. Published 29 April 2009

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a https://creativecommons.org/licenses/by-nc/4.0/ license

See also http://rspb.royalsocietypublishing.org/content/early/2009/04/27/rspb.2009.0408

Conceptlists in Concepticon:

Notes

Notes on the Comparison with Original Sources (B. Sapirstein)

Unable to identify original sources for Mɛhri, Jibbali, and Harsusi.

Kitchen et al say:

Wordlists for the Ethiosemitic languages (Amharic, Argobba, Chaha, Gafat, Ge’ez, Geto, Harari, Innemor, Mesmes, Mesqan, Soddo, Tigre, Tigrinya, Walani and Zway) and Ogaden Arabic were drawn from Bender (1971). Wordlists for Moroccan Arabic, South Arabian languages ( Jibbali, Harsusi, Mehri and Soqotri) and extinct non-African Semitic languages (Akkadian, Biblical Aramaic, ancient Hebrew and Ugaritic) were constructed from previously published lexicons (Leslau 1938; Gelb et al. 1956; Sobelman & Harrel 1963; Rabin 1975).

I can't see these three in Leslau, Gelb, Sobelman & Harrel or Rabin or Bender.

Statistics

Glottolog: 100% Concepticon: 100% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 25 (linked to 25 different Glottocodes)
  • Concepts: 97 (linked to 97 different Concepticon concept sets)
  • Lexemes: 2,396
  • Sources: 8
  • Synonymy: 1.04
  • Cognacy: 2,150 cognates in 665 cognate sets (329 singletons)
  • Cognate Diversity: 0.25
  • Invalid lexemes: 0
  • Tokens: 10,911
  • Segments: 110 (0 BIPA errors, 0 CLTS sound class errors, 110 CLTS modified)
  • Inventory size (avg): 37.20

Contributors

Name GitHub user Description Role
Ben Sapirstein orthography profile, integration of original sources Editor
Johann-Mattis List @LinguList maintainer Editor
Simon Greenhill maintainer Editor
Andrew Kitchen data collection Author
Christopher Ehret data collection Author
Shiferaw Assefa data collection Author
Connie J. Mulligan data collection Author

CLDF Datasets

The following CLDF datasets are available in cldf:

About

CLDF dataset derived from Kitchen et al.'s "Bayesian phylogenetic analysis of Semitic languages" from 2009

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 5