Comparing Germanic, Romance and Slavic: Relationships among linguistic distances

W.J. Heeringa, Charlotte Gooskens*, Vincent van Heuven

*Corresponding author for this work

Research output: Contribution to journal/periodicalArticleScientificpeer-review

2 Citations (Scopus)

Abstract

Languages differ along multiple dimensions (lexis, phonology, morphology, syntax). Related languages descend from a common ancestor language but have diverged over time. This paper asks whether languages diverge equally along all dimensions, and, to the extent that they do not, which dimension reflects the traditional language family tree best. We computed measures of (i) lexical distance (ii) phonetic distance, and (iii) syntactic distance. The measures were computed on all words and sentences extracted from a corpus of translations of four relatively short English texts into another four Germanic languages (Danish, Dutch, German, Swedish), five Romance languages (French, Italian, Portuguese, Romanian, Spanish) and six Slavic languages (Bulgarian, Croatian, Czech, Polish, Slovakian, Slovenian). We examined the correlation structure of the distances for all pairs of Germanic (10), Romance (10) and Slavic (15) languages (i.e., within-family comparisons only). The results indicate that the linguistic dimensions are generally correlated (weakly but significantly), and that the correlations are stronger for pairs within families than when all 35 pairs are examined together. Cladistic family trees correlate best with the lexical distance (.851 < r < .887). This confirms that the genealogical language trees are predominantly based on lexical rather than phonetic or syntactic considerations.
Original languageEnglish
JournalLingua
Volume287
DOIs
Publication statusPublished - 15 Mar 2023

Keywords

  • Linguistic distances
  • lexical distance
  • Levenshtein distance
  • syntactic distance
  • genealogical distance
  • language phylogeny

Fingerprint

Dive into the research topics of 'Comparing Germanic, Romance and Slavic: Relationships among linguistic distances'. Together they form a unique fingerprint.

Cite this