Meaning from Melody: Music as Language

By Chris Chatham

Khalil Gibran said that "music is the language of the spirit," but today many would claim it can convey only a general impression or mood. However, recent research has shown that the meaning of musical passages can be surprisingly specific. In their 2004 article, Koelsch et al. demonstrate that even 10-second musical excerpts can build up concrete semantic expectations, the same as short sentences.

Normally, when people hear a sentence followed by an unexpected word (e.g., "The shackles allow only a little movement. ... Wideness.") their scalp electrical activity sharply dips around 400 miliseconds after the unexpected word. In contrast, no negative dip in electrical activity is seen if the word is more contextually appropriate—such as "restrained." Likewise, this wave does not appear for words that are only grammatically incorrect. Therefore, this wave (called the N400) seems to be a signature of semantic processing by the brain.

The authors wondered whether the same negative wave would occur if the word "wideness" followed a excerpt from Strauss's Salome—a dissonant musical piece with a narrow tonal range—instead of following a short sentence. In contrast, no N400 should exist if "wideness" follows Valpola's E-Minor Piece for the Accordion, which utilizes a wide tonal range.

The results confirmed this idea—the same N400 wave occurs in response to words following incompatible musical clips as to words following incompatible sentences. Conversely, words following compatible musical excerpts elicited no N400.

Somehow, the musical passages were actually communicating the same expectations and eliciting the same neural activity as language! In fact, the N400 wave was just as strong in response to unexpected abstract words as to unexpected concrete words, suggesting music has impressive communicative flexibility.

Even more suprising is that the same effect was found when classifying musical excerpts based on the composers' self-report. For example, no N400 wave appeared if Schoenberg's String Terzett was played before presentation of the related word "needle"—Schoenberg had crafted this piece to describe the stitching pains of a heart attack. In contrast, N400 waves were found if "river" followed the same passge, nor if words like "needle" followed musical clips where the composers had ideas related to "river" in mind.

Importantly none of the 44 people who participated in the experiment were musical experts. Try listening to the excertps to see if you agree with the "related" and "unrelated" words!

The P600 Effect

To what extent is music like language? I've reviewed how music and language share semantic characteristics, at least insofar as similar scalp electrical activity follows incongruent musical passages as follows incongruent words. But is it also possible that music has grammar, just like language?

In the context of language, ungrammatical words are often accompanied by a sharp rise in scalp electrical activity, around 600 milliseconds after the grammatical violation. This is known as the P600, or sometimes as the syntactic positive shift (SPS). Because this scalp electrical component shows sensitivity to grammatical violations, some suggest that it indexes the function of a grammar-specific module. However, the P600 can also be observed in response to violations of number and gender agreement, phrase structure, verb tense, and even case, suggesting the response is much more general than would be expected from a correlate of a "grammar module" (if one were to exist).

In their 1998 JoCN paper, Patel et al. show that the P600 can distinguish between chord changes that are appropriate to western music versus those that violate more typical chord progressions. To demonstrate this, the authors compared the ERP responses of subjects to phrases like "an old idea" when embedded in three different types of sentences: those that were grammatically OK ("Some of the senators had promoted an old idea of justice") grammatically poor ("Some of the senators endorsed promoted an old idea of justice.") and those that were flat-out ungrammatical ("Some of the senators endorsed the promoted an old idea of justice.")

They then did the same thing, except with music: the authors constructed musical passages based on the "circle of fifths" and inserted notes that were either concordant or discordant with those passages. The inserted notes were never "wrong" in and of themselves (i.e., they were not mistuned or unusual); they simply did not seem to match the harmonic progressions of the musical passage.

The recordings of scalp electrical activity in response to these stimuli revealed that ungrammatical phrases and discordant notes alike were associated with a P600 component, when compared with grammatical phrases and concordant notes respectively. The positive shifts of scalp electrical activity in response to structurally/grammatical incongruous stimuli were statistically indistinguishable from one another, at least in terms of their amplitudes and scalp distributions. So, music seems to have a form of implicit rule structure very much like grammar in language, to the extent that similar neural generators may be involved in processing anomalies across those modalities!

Patel et al note that this similarity is even more striking considering that P600s were evoked from musical passages by manipulating the identity of the target, whereas they were evoked from ungrammatical phrases by manipulating the stimuli preceeding the target.

The authors conclude that the P600 may reflect "knowledge-based structural integration" rather than the activity of a grammar-specific cognitive module. I'm not entirely sure what this means, and they don't fully explain, but it is important to note that the P600 may be a very late version of the P300 response. The P300 is thought to represent attentional reorienting or the updating of working memory as a result of context violations. Unfortunately, they do not discuss why the P600 differs from the N400 response, also shown to be sensitive to words that are incongruent with the preceeding context (whether verbal or musical), but only when the violations have to do with mismatches in meaning.

Chris Chatham is a graduate student at the University of Colorado, Boulder, and author of the Developing Intelligence weblog.


User Rating:Your rating: None