Latent neural dynamics encode temporal context in speech

Hear Res. 2023 Sep 15:437:108838. doi: 10.1016/j.heares.2023.108838. Epub 2023 Jul 4.

Abstract

Direct neural recordings from human auditory cortex have demonstrated encoding of acoustic-phonetic features of consonants and vowels. Neural responses also encode distinct acoustic amplitude cues related to timing, such as those that occur at the onset of a sentence after a silent period or the onset of the vowel in each syllable. Here, we used a group reduced rank regression model to show that distributed cortical responses support a low-dimensional latent state representation of temporal context in speech. The timing cues each capture more unique variance than all other phonetic features and exhibit rotational or cyclical dynamics in latent space from activity that is widespread over the superior temporal gyrus. We propose that these spatially distributed timing signals could serve to provide temporal context for, and possibly bind across time, the concurrent processing of individual phonetic features, to compose higher-order phonological (e.g., word-level) representations.
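
For readers unfamiliar with reduced rank regression, the following is a minimal sketch of the generic, identity-weighted form of the technique on synthetic data. It is not the authors' group model, which pools recordings across participants. The function name, the variable names, the matrix sizes, and the simulated data are all assumptions made for illustration. The idea is to fit an ordinary least-squares map from stimulus features to channel responses, then project that map onto the leading right singular vectors of the fitted responses, so that all channels are driven through a small number of latent dimensions.

```python
import numpy as np

def reduced_rank_regression(X, Y, rank):
    """Fit Y ~ X @ A @ V_r.T with at most `rank` latent dimensions.

    X: (n_samples, n_features) stimulus features (hypothetical design matrix)
    Y: (n_samples, n_channels) neural responses (hypothetical recordings)
    """
    # Step 1: unconstrained least-squares fit.
    B_ols, *_ = np.linalg.lstsq(X, Y, rcond=None)
    # Step 2: principal directions of the fitted responses.
    Y_hat = X @ B_ols
    _, _, Vt = np.linalg.svd(Y_hat, full_matrices=False)
    V_r = Vt[:rank].T            # (n_channels, rank): latent -> channels
    # Step 3: restrict the OLS map to those directions.
    A = B_ols @ V_r              # (n_features, rank): features -> latent
    return A, V_r

# Synthetic example: 6 features drive 40 channels through 2 latent dimensions.
rng = np.random.default_rng(0)
n_samples, n_features, n_channels, true_rank = 500, 6, 40, 2
X = rng.standard_normal((n_samples, n_features))
B_true = (rng.standard_normal((n_features, true_rank))
          @ rng.standard_normal((true_rank, n_channels)))
Y = X @ B_true + 0.1 * rng.standard_normal((n_samples, n_channels))

A, V_r = reduced_rank_regression(X, Y, rank=2)
B_rrr = A @ V_r.T                # rank-constrained coefficient matrix
latent = X @ A                   # (n_samples, 2) latent state trajectory
```

In this sketch, `latent` plays the role of the low-dimensional latent state: plotting its two columns against each other over time is how one would look for rotational or cyclical trajectories of the kind described above.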

Keywords: Auditory; Electrocorticography; Latent state; Reduced-rank regression; Superior temporal gyrus.

Publication types

  • Review
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustic Stimulation
  • Auditory Cortex* / physiology
  • Humans
  • Phonetics
  • Speech / physiology
  • Speech Perception* / physiology
  • Temporal Lobe / physiology