Click any symbol to hear a real human recording. Pick a language to see which sounds it uses — sounds outside that language stay clickable, just dimmed.
Hatched cells: articulations judged impossible. Rows = manner of articulation, columns = place (front of the mouth → throat).
Vertical axis: tongue height (close → open). Horizontal: frontness (front → back). Where two symbols share a dot, the left is unrounded and the right is rounded.
Gliding vowels. Each plays a genuine Commons recording of a word containing the diphthong; if a recording can't be reached, the two component vowels are played in sequence.
Co-articulated and alveolo-palatal consonants that sit outside the main grid.