Cross-Linguistic Transcription Systems

Contrary to what non-practitioners might expect, the systems of phonetic notation used by linguists are highly idiosyncratic. Not only do linguistic subfields disagree on the specific symbols they use to denote the speech sounds of languages, but considerable variation can also be found in large databases of sound inventories. Inspired by recent efforts to link cross-linguistic data across different resources using reference catalogs such as Glottolog or Concepticon, we present initial efforts to link different phonetic notation systems to a catalog of speech sounds. Our cross-linguistic database of phonetic transcription systems (CLTS) currently registers 5 transcription systems and links to 22 different transcription datasets, in addition to mapping the sounds to 6 different sound class systems.

Cite the database as

Johann-Mattis List, Cormac Anderson, Tiago Tresoldi, Christoph Rzymski, & Robert Forkel. (2024). CLTS. Cross-Linguistic Transcription Systems (v2.3.0) [Data set]. Zenodo.

and the paper introducing CLTS as

Anderson, Cormac, Tiago Tresoldi, Thiago C. Chacon, Anne-Maria Fehn, Mary Walworth, Robert Forkel, and Johann-Mattis List (2018): A Cross-Linguistic Database of Phonetic Transcription Systems. Yearbook of the Poznań Linguistic Meeting 4.1. 21-53. DOI: 10.2478/yplm-2018-0002