Академический Документы
Профессиональный Документы
Культура Документы
2010
Recommended Citation
Culpepper, Sarah Elizabeth. "Musical time and information theory entropy." thesis, University of Iowa, 2010.
http://ir.uiowa.edu/etd/659.
by
Sarah Elizabeth Culpepper
July 2010
CERTIFICATE OF APPROVAL
_______________________
MASTER'S THESIS
_______________
___________________________________
Nicole Biamonte
___________________________________
Jerry Cain
Is their wish so unique –
To anthropomorphize the inanimate
With a love that masquerades as pure technique?
Donald Justice
“Nostalgia of the Lakefronts”
ii
TABLE OF CONTENTS
CHAPTER
I. INTRODUCTION ............................................................................................1
VII. CONCLUSION.............................................................................................115
BIBLIOGRAPHY ............................................................................................................117
iii
LIST OF TABLES
Table
4.1. Pitch entropies in Webern works, compared with Babbitt and Schubert .................36
4.2. Interval class entropies comparing serial and non-serial works ...............................38
4.3. Vertical and horizontal entropy on one serial and one non-serial work ...................39
4.5. Ordered directional interval class entropy in serial and non-serial works ................42
4.6. Interval entropy in Webern and Babbitt ...................................................................44
4.8. CSEG entropies, random string versus Webern, op. 5, no. 1 ...................................51
4.13. Pc-set entropies in op. 16 and op. 25 using discrete segmentation algorithm ..........62
4.14. Pc-set entropies in op. 16 and op. 25 using window algorithm ................................64
4.15. Vertical pc-set entropy in op. 16 and op. 25, no. 1 ...................................................65
6.2. Pitch class entropy in the vocal line of op. 16, no. 1. ...............................................88
iv
6.9. Pitch entropy in op. 5, no. 4, A and B.....................................................................108
v
LIST OF FIGURES
Figure
4.1. Interval class entropies comparing serial and non-serial works ...............................38
4.2. Registrally-ordered interval class entropy in Webern and Babbitt ...........................41
4.3. Ordered directional interval class entropy in serial and non-serial works ...............42
4.8. CSEG entropies, random string versus Webern, op. 5, no. 1 ...................................51
4.12. Melody generated using the CSEG distribution of Webern, op. 5, no. 1 .................57
4.13. Melody generated using the CSEG distribution of a string of random pitches. .......58
4.16. Pc-set entropies in op. 16 and op. 25 using discrete segmentation algorithm ..........63
4.17. Pc-set entropies in op. 16 and op. 25 using window algorithm ................................64
4.18. Vertical pc-set entropy in op. 16 and op. 25, no. 1 ...................................................65
vi
6.2. Pitch class entropy in op. 16, no. 1 ...........................................................................88
6.3. Pitch class entropy in the vocal line of op. 16, no. 1 ................................................89
vii
1
CHAPTER I
INTRODUCTION
In the conclusion of The Time of Music (1988), Jonathan Kramer gives two
anecdotes of his personal experiences with what he calls musical timelessness. The first
The second anecdote concerns the opposite condition, a “happening” dense enough to
The production began at 7:00 p.m. The noise level was consistently high,
and the visual panorama was dizzying. I found myself, although performing,
focusing my attention on one layer, then another, and then various combinations
of layers.... After what seemed to be a couple of hours, everyone spontaneously
agreed that it was time to stop... I loaded my tape and slides into my car. Only
then did I glance at my watch. It was not yet 8:00! What had seemed like a two-
hour performance must have lasted under 25 minutes by the clock.2
Kramer attributes the disparity between these temporal experiences to the amount and
density of information each performance contained. Music that is predictable and easily
“chunked,” he argues, takes up less mental storage space and seems shorter than music
that is less predictable; “Thus a two-minute pop tune will probably seem shorter than a
1 Jonathan Kramer, The Time of Music (New York: Schirmer, 1988): 379.
2 Ibid., 380.
2
temporal progressions, such as the “misplaced” closing gestures Levy finds in Haydn and
the “evolving themes” Hatten finds in Beethoven, draw power from their violation of
with a musical idiom; tendency to disengage from overly predictable works; how
comfortable the chairs are – the existence of a connection between time and predictability
is clear.
This thesis examines the relationship between time and predictability through the
lens of information theory entropy. Just as traditional entropy speaks to the degree of
entropy was initially developed to determine the most efficient way to encode a message
for radio transmission, it has since been adopted as an analytical tool by a variety of
economy of musical materials that seems well-suited for analysis in terms of information
3 Ibid., 337.
4 Ibid., 16ff.
5 Janet Levy, “Gesture, Form, and Syntax in Haydn's Music,” in Haydn Studies:
Proceedings of the International Haydn Conference (New York: Norton, 1981), 355-362; Robert
Hatten, “The Troping of Temporality in Music,” in Approaches to Meaning in Music, ed. Byron
Almen and Edward Pearsall (Bloomington: Indiana University Press, 2006), 66ff.
3
theory – in the sense that no pitch or gesture seems superfluous or reducible, as though its
omission would not have a marked effect on the passage, or as though it had only been
added to fill space before the beginning of the next phrase.6 (In Adorno’s words: “Every
will be shown in a later chapter; I believe entropy can speak to these same qualities in
Webern's work.
between information content and the listener's perception of time, as will be discussed in
Stockhausen writes, “If we realise, at the end of a piece of music... that we have 'lost all
sense of time', then we have in fact been experiencing time most strongly. This is how we
always react to Webern's music.”8 In a different vein, Ligeti describes Webern's music as
the spatialization of time.9 Perception and analysis of time in Webern is, at the very least,
complicated, but entropy provides a useful metaphor for its description and a useful tool
use of information theory as... well, as number fetishism: as a component of the claim
6 Theodor Adorno, “The Aging of the New Music,” in Essays on Music, ed. Richard
Leppert, trans. Susan Gillespie (Berkeley, Los Angeles: University of California Press, 2002),
187.
7 Theodor Adorno, Quasi una Fantasia: Essays on Modern Music, trans. Rodney
Livingstone (New York: Verso, 1998), 180.
8 Karlheinz Stockhausen, “Structure and Experiential Time,” Die Reihe 2 (Bryn Mawr, PA:
Presser, 1959), 65.
9 Gyorgy Ligeti, “Metamorphoses of Musical Form,” Die Reihe 7 (Bryn Mawr, PA:
Presser, 1965), 16.
4
that music theorists can consider themselves scientists who refute or uphold hypotheses
based on empirical evidence, a notion she depicts as quaint and outdated.10 Indeed, early
uses of information theory often relied upon questionable assumptions, as Hessert (1971)
claims, and were often divorced from diachronic perception of music.11 Nevertheless,
diachronic perception of music – it can be a relevant lens for the examination of musical
time.
questionable indeed, but using information theory to analyze and discuss temporality
seems much less problematic. Writing about traditional entropy, Eddington clarifies the
situation:
I think there are the strongest grounds for placing entropy alongside
beauty and melody, and not with the first three. Entropy is only found
when the parts are viewed in association, and it is by viewing or hearing
the parts in association that beauty and melody are discerned. All three
are features of arrangement. It is a pregnant thought that one of these
three associates should be able to figure as a commonplace quantity of
science. The reason why this stranger can pass itself off among the
aborigines of the physical world is that it is able to speak their language,
viz., the language of arithmetic.12
Entropy is discussed in terms of number but is not the fetishism of number; rather, it is a
10 Vanessa Hawes, “Number Fetishism: The History of the Use of Information Theory as a
Tool for Musical Analysis,” in Music's Intellectual History 2009, ed. Zdravko Blazekovic and
Barbara Dobbs Mackenzie (New York: RILM, 2009), esp. 836-838.
11 Norman Hessert, “The Use of Information Theory in Musical Analysis” (Ph.D diss.,
Indiana University, 1971).
12 A. Eddington, The Nature of the Physical World (Ann Arbor: University of Michigan
Press, 1935), 105.
5
information theory entropy need not be a formula divorced from musical experience, but
can instead be a analytical tool and metaphor for the discussion of something deeply
experiential and even – as Meyer (1957) claims – a way to approach the question of
musical meaning.13
and a history of its use in music theory (chapter 3). In chapter 4, a variety of alternative
approaches to entropy are developed, including entropy calculations based on CSEGs and
pc-sets (as opposed to single pitch classes). Chapter 5 makes a more in-depth argument
for the relationship between information theory entropy and time, recasting analyses of
entropy will be used to analyze time in the first of the Fünf Canons, op. 16, and the fourth
of the Fünf Sätze, op. 5 – two movements in which form is created by perceptible shifts
13 Leonard Meyer, “Meaning in Music and Information Theory,” Journal of Aesthetics and
Art Criticism 15, no. 4 (1957): 412-424.
6
CHAPTER II
INFORMATION THEORY ENTROPY
Information theory entropy is based on the idea that in most alphabets, some
letters communicate more information than other letters do, because they occur less
frequently. If a word has been corrupted during transmission and all that remains is q _ _
_ k, the recipient can easily guess what the original word was, since there are very few
words that contain both a q and a k. By contrast, if all that remains of the word is _ _ i c
_, the original word is much more difficult to guess. Since q and k are uncommon, they
communicate more information about the original message than more common letters
can.14
In general, the more unequal the frequencies of letters in an alphabet are, the
easier it is to determine what letters have been corrupted. If an alphabet only has two
letters, A and B, but the former occurs 90% of the time and the latter occurs 10% of the
time, the message recipient has an excellent chance of guessing any letters that have been
corrupted (since there is a 90% chance any given letter will be an A). By contrast, if A
and B appear 50% of the time, our ability to guess a missing letter is diminished.
From the perspective of a person sending a telegram, the former language is very
inefficient. Assume, for simplicity, any message in this language must contain exactly
90% As and 10% Bs (although in a real language, these would be averages). If the
transmitter is limited to ten characters, there are exactly ten words s/he can send:
common that it is practically meaningless; only the position of the less common letter
differentiates between words, but it occurs very rarely. By contrast, in a language that is
50% A and 50% B, the transmitter would have 2^10 or 1024 word choices. By creating
an alphabet in which all letters occur with the same frequency, the efficiency of
transmission is maximized.
Of course, in addition to being more efficient, the latter alphabet is less resistant
to corruption. Ideally, one must find a balance between the most efficient language
possible and the most robust language possible, to be sure that the message arrives to its
recipient intact but without wasting time or resources during transmission. Finding this
balance – generally for the purpose of data compression or encoding – was one of the
first goals of the field of information theory, pioneered by Bell Labs engineer Claude
alphabet is called the Shannon entropy of that alphabet. If Shannon entropy is low, the
language is inefficient but robust; a few letters occur very frequently and the rest are
uncommon. If Shannon entropy is high, the language is efficient; each letter occurs with
roughly the same frequency and therefore each letter conveys the most information
possible.
formula:
Here, p(x) is the probability that a given event occurs; p(x=6) denotes the probability that
8
The example of bits illustrates the purpose of the logarithm in this formula. Each
bit presents two choices; given six bits, the number of combinations that can be
communicated is two to the sixth power. The entropy formula can seen as taking the
number of possible choices (here, expressed in terms of probability) and returning the
(Log base two is necessary to express these results in terms of bits. Another log base
would create meaningful data if used consistently, but these data would be in terms of
other units of measurement.) Effectively, the use of logarithms in this formula ensures
that the highest entropy is created when each possible outcome has an equal probability
of occurring, and that the lowest entropy is created when one event has a very high
probability of occurring. Consider an alphabet that has one letter, A, that occurs 100% of
– that is, since we are absolutely certain every letter will be an A, the language has an
uncertainty of zero and an entropy of zero. The closer any probability gets to 1, the
smaller the language's entropy becomes. For example, if this language had three letters
instead, in which A occurred 98% of the time, and B and C each occurred 1% of the time,
The logarithmic expression makes the contribution of the first term very small, whereas
the small probabilities make the contributions of the second and third terms very small as
well. By contrast, if each option occurs with roughly equal frequency, the entropy of the
language is
which is the highest possible entropy for an alphabet with three letters. Of course, the
more letters in an alphabet, the higher the maximal entropy becomes. If this same
An alphabet with twenty-six letters has a maximal entropy of 4.7; an alphabet with a
It is clear from these examples that entropy is most useful for comparisons. The
claim that an alphabet with a hundred letters has a maximal entropy of 6.64 is not terribly
meaningful on its own; it only takes on meaning when paired with the statement that an
alphabet with three letters has a maximal entropy of 1.58, or with other entropy
entropy values (as computed above) as a percentage of the maximal possible entropy for
an alphabet of that cardinality. For example, the relative entropies of the cardinality three
and
respectively. Thus, we can think of these two alphabets as having equivalent entropies,
passage. Intuitively, a passage of English text that uses only thirteen letters should not
have the same entropy as a passage of Hawai’ian. One imagines the former would seem
more stilted, more restricted than the latter, since a listener would hear it in the context of
a twenty-six letter alphabet, rather than a thirteen-letter alphabet. Similarly, a piece that
only uses the pitches C, C#, Eb, G#, A, A#, and B with given frequencies is very different
from a passage of chant that uses each of its seven tones with the same frequencies as the
above piece. While it is likely that the former piece will be heard as using a restricted
subset of a twelve-pitch alphabet, the latter piece exhausts its alphabet and would not be
heard as restricted in its materials in the same way as the former. The traditional entropy
formula is unable to reflect this distinction, because any unused letters carry with them a
probability of 0, effectively canceling out any entropic contribution from those letters, but
these unused letters are relevant to the computation of relative entropy through their
conclusion in the maximal entropy for an alphabet of a given cardinality.
Nevertheless, the use of relative entropy requires caution. A piece of music that
11
uses three pitches with equal frequency is much more predictable, mathematically and
aurally, than a piece of music that uses twelve pitches with equal frequencies, even
though their relative frequencies are equal. In other words, although relative entropy
must always be considered alongside the alphabets’ respective absolute entropies. In this
paper, relative entropy will only be invoked in the presence of corresponding absolute
entropy figures or some sort of intuitive justification for hearing these alphabets as
perceptually similar.
It is also clear from these examples that the entropy of a message – at least,
that message. Entropy only reflects characteristics of the language in which that message
is written or encoded. However, the meaning of a message may become relevant if the
'alphabet' in question is not a literal alphabet. For years, literary critics – especially
modernists and post-modernists, and in particular those interested in the work of Thomas
Pynchon – have completed information theory analyses of texts using words or images,
instead of literal letters, as the letters of an alphabet. In this case, the most commonly
occurring “letters” are connective words like articles and prepositions. Consider, for
example, a corrupted block of text from F. Scott Fitzgerald's The Great Gatsby, from
When we pulled out into the winter and the real snow, our snow, began
stretch out beside us and twinkle against windows, and the dim lights
of small Wisconsin echoed by, a sharp wild brace came into the air. We
drew in deep of it as we walked back from through the cold vestibules,
unutterably aware of identity with this country for one hour before we
melted indistinguishably into it again.
Although the result is disjointed in places, it is certainly still intelligible; in places the
reader cannot tell the message has been corrupted at all. Every image found in this
12
excerpt is repeated; if the word 'winter' were corrupted, 'snow' and 'cold' would still
convey its meaning. Additionally, the passage contains many connective words – and
when these words are missing (as in, “our snow began stretch out beside us”) the blanks
can be filled in easily. We conclude that this passage has low word-based entropy,
regardless of any entropy figures computed on the basis of individual letter frequency.
the first Irish post-modern novel) has been similarly corrupted below.
I will relate, said Finn. Till a man accomplished twelve books of poetry,
the same is not for want of poetry but is forced away. man is taken till a
black hole is in the world to the depth of his oxters and he put into it to
gaze it with his lonely head and nothing to but his shield and a stick of.
Then must nine warriors fly their at him, one with the other and together.
From this, we can gather that we are listening to a narrator named Finn; word repetition
clues us in that poetry and war are somehow involved, but there is little else we can say
about this passage. This same lack of repetition makes the original, non-corrupted
I will relate, said Finn. Till a man has accomplished twelve books of poetry,
the same is not taken for want of poetry but is forced away. No man is taken
till a black hole is hollowed in the world to the depth of his two oxters and
he put into it to gaze from it with his lonely head and nothing to him but his
shield and a stick of hazel. Then must nine warriors fly their spears at him,
one with the other and together.
corruption. The Fitzgerald is robust against the corruption of readers lacking context, or
readers being sleepy; in the presence of these forms of corruption the passage is still
readable. The O'Brien is much less robust by comparison. We conclude the passage has
higher entropy.
13
Alternately, we can conclude that the O'Brien passage is more efficient than the
Fitzgerald, since each individual word communicates more information. If the reader can
easily guess the meaning of a missing word, as in the Fitzgerald, then that word has a
very low information content; with these words removed, the passage becomes less
elegant but not much less intelligible. This is the same quality that makes this passage
easy to summarize. Since removing words from the O'Brien limits the reader's ability to
comprehend the passage, we can conclude that the missing words had a higher
information content – that overall, there are fewer redundant or repeated words, and that
in the
The result is nearly unintelligible; the reader cannot guess the original narrator, subject,
or purpose of this passage. What remains is interpretable, certainly, but the reader cannot
16 Although this passage is shorter than the others, the same percentage of words has been
removed in each case.
14
be certain of the original text based on this except. Consequently, this passage has high
entropy.
higher entropy. Shared meaning contributes to lower entropy as well, as seen in the
Fitzgerald example dealing with 'snow,' 'winter,' and 'cold.' From these examples, though,
we can also see that clear syntactical structures reduce entropy. If the reader can perceive
the sentence structure underlying “ We drew in deep ____ of it as we walked back from
____ through the cold vestibules,” the reader can make more educated guesses as to what
the missing words could be. The second missing word appears to be some sort of place;
the first missing word is a noun that can be an object of the verb 'to draw in,' so perhaps
the missing word is 'breaths' or 'gasps' or something along those lines. The general import
of the sentence is still clear. Similarly, in the O'Brien sentence “Then must nine warriors
fly their ____ at him,” as long as the reader can parse that warriors are throwing things at
economy of words and imagery, in that every word contributes a great deal of meaning to
a passage. This is why the Celan example is the least intelligible of the above: there are
few redundant words, and the associations between words are specifically designed to be
unexpected and novel. In other words, each word is intended to convey the greatest
entropy as occurring on a higher level than entropy computed based on literal letters of an
written in binary, or encrypted for secure transmission – without changing its vocabulary,
its low-level, alphabet-based entropy would be quite different but its higher-level, word-
15
based entropy would be the same. To achieve a word-based equivalent to encryption, one
would need a paraphrase of this text by another author, or a similar text that
communicates the same images (snowfall; evening; solitude) or the same themes
(introspection; nostalgia; the notion that a person’s actions and mindsets are influenced
static-filled radio broadcast. Listening to a Haydn string quartet in such a broadcast, one
would still be able to identify the key, the time signature, and the instrumentation; one
could make an educated guess as to which movement the quartet was playing, and
probably one could even hum the missing notes. By contrast, listening to such a broadcast
of the Webern Concerto, op. 24, one might not even be able to determine the
orchestration of the piece, let alone guess the missing notes. One can imagine a similar
“corruption” of the original musical signal being created by a poor ensemble; in this
situation, the Haydn can be considered to have a higher entropy because ensemble
generally much more recognizable than the corresponding mistakes would be in the
Webern. Because the listener is (usually) able to form more confident predictions for
upcoming events in the Haydn, violations of these predictions (including mistakes) are
more striking.18
2.1.
17 From the next paragraph: “I see now that this has been a story of the West, after all –
Tom and Gatsby, Daisy and Jordan and I, were all Westerners, and perhaps we possessed some
deficiency in common which made us subtly unadaptable to Eastern life.”
Despite the corruption, the identity of this piece is readily apparent. Even a listener who
had never heard this piece before could make a reasonable guess at every missing note,
similarly corrupted, non-tonal work, shown in Figure 2.2, is less easy to identify.
A listener already familiar with the piece might be able to identify this as the third
movement of Webern's Variations for Piano, op. 27, but a listener unfamiliar with the
piece would not even be able to guess which of the corrupted objects were pitches and
which were rests. A listener who expects a serial work based on a derived row may be
able to fill in the blanks – surmising in retrospect that the first missing pitch must be a
Bb, creating the ordered interval series <-4 -1> to match the <+4 +1> of the inverted row
form beginning in m. 5 – but probably not on first hearing without a score, and certainly
not as readily as in the Bach. In other words, the second work is more efficient, more
condensed. Because the missing pitches cannot be determined easily based on the
radio static, a corrupted score, or poor acoustics, and figurative transmission factors like
poor performance – raise larger questions about the nature of entropy in music. One can
orchestral work, in roughly the same way one could consider a poorly executed English
corruption as something that can happen within the music itself, as opposed to something
imposed upon the music by external factors (things like radio static or performers’
mistakes), it becomes difficult to decide which musical features are the original signal
and which are “corruption”: is a theme an original signal and its variations corruption? is
the original A section of a ternary form an original signal and its altered A’ corruption?
Since entropy is defined as a message’s ability to resist corruption, what can entropy be
said to measure in these cases? It may be meaningful to say that a theme “resists
variation” or that a melody “resists ornamentation,” if the former is not very memorable
or if the latter is already very elaborate, but these states may or may not coincide with
18
entropy figures generated for these passages (in that a very elaborate melody may still be
More to the point, this approach makes questionable implications about the nature
its own right – even if the aesthetic of the transcription is meaningfully different from the
Bach, or for that matter a Wendy Carlos performance of Bach, in the same light? In cases
of music not governed by a score, which performance is the “canonical” performance and
Meyer also raises the issue of “cultural noise”: corruption that occurs in
transmission as “the result of a time-lag between the habit responses which the audience
actually possess and those which the more adventurous composer envisages for it.”19 This
can be understood as avant-garde music whose language an audience has not yet
audiences. In this case, the music is not “corrupted” by any external factors, but the
audience's perception is; the issue is not signal transmission, but signal reception.
It seems most reasonable, for the purposes of this project, to consider each score
accepting changes that arise through arrangement as part of an original signal. (That is to
say, this project accepts Shelley’s philosophy of translation: that a translation is or should
be a new artwork unto itself rather than a derivative work dependent upon an original.20)
20 Percy Shelley, A Defence of Poetry and Other Essays (1840; Project Gutenberg, 2005),
http://www.gutenberg.org/etext/5428. See Part I.
19
The issue of cultural noise is important, because it is important in every work of analysis;
an information-theoretic analysis cannot assume an audience will hear a work the way an
ideal listener would, but nor can any other kind of analysis that wishes to reflect a
In any case, the factors that lead to high or low entropy in a musical example are
the same as in the excerpted Fitzgerald, O'Brien, and Celan texts. If we analyzed these
texts using literal letters as an alphabet, we would be able to identify the texts as English,
and we would probably be able to make general statements about the author's style – for
example, one could determine the average entropy for a passage saturated with Latinate
vocabulary and the average entropy for Anglo-Saxon vocabulary, based on which letters
occur the most frequently and which letters do not occur at all (such as w and j in Latin),
and from this make predictions about the loftiness or folksiness of the author's writing
style. Similarly, if we accept pitch as an alphabet, we can make predictions about how
diatonic or how chromatic a musical excerpt is, based on which pitches occur the most
frequently. However, loftiness of vocabulary does not result from avoiding the letters w
and j, any more than tonality results from using scale degrees 1 and 5 frequently. Low
entropy (on a pitch-by-pitch basis) is generally symptomatic of tonality, but does not
operating as a part of a word (for example, a single pitch within an arpeggiation), while in
other contexts that pitch may be a “word” unto itself. For this reason, pitch-based entropy
may be more relevant to musical analysis than letter-based entropy is to literary analysis.
Nevertheless, it seems reasonable to claim that the analysis of more complex musical
alphabets may strengthen the link between musical style or predictability and entropy
20
text. In both music and in text, entropy (as perceived intuitively by the listener or reader)
parsimonious voice leading), repetition (motivic material, canons, imitation), and larger
structures (a T-P-D-T phrase structure, a serial row). If alphabets are built that can
address the existence or nonexistence of these elements and structures, a more intuitive
which that meaning is communicated. With that said, though, it is impossible to divorce
the two concepts, just as the meaning of a text cannot be separated from the words with
Earlier, Meyer highlights this same relationship as the source of musical meaning:
nature of the expected consequent.”22 Although this relationship has not always been the
focus of music theory's use of information theory entropy, Meyer's comments imply that
information theory entropy has potential insight into musical meaning as well as musical
style.
22 Ibid.
21
CHAPTER III
EXISTING MUSIC-THEORETIC SCHOLARSHIP ON INFORMATION THEORY
ENTROPY
1958 article “Style as Information,” in which entropies are calculated for eight songs
from Schubert’s Die Schöne Müllerin, six arias from Mendelssohn's St. Paul, and six
songs from Schumann’s Frauen-Liebe und Leben. Only melodies in major keys are
indicates tonic, 2 indicated a raised tonic or a lowered subtonic, and so forth up to 12. His
Youngblood finds the Mendelssohn sample to have the lowest entropy (or, alternately, the
Youngblood also compares the entropy values for these composers to the
23 Joseph Youngblood, “Style as Information,” Journal of Music Theory 2, no. 1 (1958): 24-
35.
22
entropies of a collection of randomly chosen Mode I chants. When these chants are
higher relative entropy than the lieder and arias (HR=96.7%). Youngblood attributes this
to the chants' more regular use of non-final and non-tenor tones, as compared to the
lieder's marked preference for diatonic pitches over chromatic ones. Of course, when
entropy than the works of all three later composers (H=2.72, HR=76%).24
reasons, claiming that Youngblood's sample size is too small for the differences he finds
they construct confidence intervals for Youngblood's data, shown in Figure 3.1. When
Youngblood says the entropy of his Mendelssohn sample is 3.03, he makes the implicit
claim that this sample is representative of all of Mendelssohn – that if an analyst were to
compute a total entropy for all extant Mendelssohn works, that result would be fairly
close to Youngblood's. Confidence intervals measure how certain we are that the sample's
entropy is comparable to that of Mendelssohn’s complete body of work. Figure 3.1 shows
In this example, Knopoff and Hutchinson state with 95% confidence that
Mendelssohn's entropy falls between 2.895 and 3.183, and that Schubert's entropy falls
between 3.016 and 3.244. Since the confidence intervals overlap, one cannot conclude
based on this data that Schubert's and Mendelssohn's total entropies differ; it is entirely
possible, based on this data, that Schubert's total entropy is in fact lower than
24 Although most listeners probably hear chant in terms of a seven-note alphabet, one can
imagine factors that would lead the listener to hear chant in terms of a twelve-note alphabet, such
as placement of the chant between or within tonal works (as in the fragment of chant that
concludes Bruckner’s “Os Justi”), or a listener’s lack of familiarity with the repertoire.
23
For simple random samples, confidence intervals are generally computed using
25 Leon Knopoff and William Hutchinson, “Entropy as a Measure of Style: The Influence of
Sample Length,” Journal of Music Theory 27, no. 1 (1983): 75-97.
24
where x-bar is the mean of the sample (in this case, the sample's entropy), s is the
sample's standard deviation, n is the sample size, and µ is the quantity we wish to
establish: the predicted entropy for the musical style or composer in question.26 As is
clear from this formula, there are two factors that influence the size of a confidence
interval: sample size (Knopoff and Hutchinson's focus) and sample variance (the focus of
a 1990 Snyder article). The former is reasonably intuitive; a very small sample could be a
fluke, but if a large sample of Mendelssohn's work supports the conclusion that his total
entropy is 3.03, then it seems much more probable that Mendelssohn's overall entropy
really is close to 3.03. Snyder adds that variance within the sample can also make us
length and finds them to have entropy values of 2, 4.98, 3.6, and 1.4, that analyst would
have difficulty predicting Mendelssohn's total entropy, since the samples are so disparate.
By contrast, if the first four samples came back as 3.01, 3.08, 2.97, and 2.93 the
conclusions drawn about these data would seem much more reasonable, even if the
26 The multiplier 1.96 specifies a 95% confidence interval – that is, if we take 100 samples,
the means of 95 of the samples will fall within this interval. Multiplying by 1.645 instead would
return a 90% confidence interval.
This formula is provided only to illustrate the concept; confidence intervals in this paper
were calculated for binomial proportions, taking propagation of error into account. See Appendix
A of Knopoff and Hutchinson, 1993, for more information.
modulates from C minor to Eb major, one would expect the pitches C, G, Eb, and Bb to
occur with the greatest frequency – which increases that piece's entropy quite sharply,
since such a piece contains four pitches that occur frequently, instead of the two such
pitches found in a nonmodulatory work. (This logic could be expanded to include scale
degree 7 of each key as well as scale degree 5, but the result would be the same.) In a
piece that modulates from C minor to G major, the shift to a new diatonic collection
for this; although he computes entropies based on a scale-degree system, these scale
degrees are never adjusted for modulations. He notes that this lack of regard for
modulation may have disguised the differences between his Schumann and Schubert
samples, noting that (at least in these samples) Schubert's chromatic pitches tended to
arise from modulation, whereas chromatic pitches in his Schumann samples tended to be
initiated by changes in written key signature. Snyder finds this disregarding of implied
piece that modulates from, say, F major to D minor would register as having higher
entropy insofar as the latter tonal area deviates from its la-tonic. Snyder also questions
the assumption that modulations should be normalized away, arguing that a piece that
begins and ends in distantly related keys ought to have a higher entropy than a piece that
28 Youngblood, 78.
26
begins and ends in the same key.29 Alternately, one could argue that a piece that
modulates from I to V ought to have a lower entropy than a piece that modulates from,
say, I to bII – that the predictability (and perhaps even the smoothness) of a modulation
modulations in entropy. Arguably, Youngblood's system does associate distant keys with
higher entropies, since a modulation from C major to G major would contribute much
less to a piece's entropy than a modulation from C major to Ab major would, based on the
number of pitches held in common between the two respective diatonic collections. One
could imagine combining this method with a weighting system, in which pitches
belonging to passages in non-tonic keys contribute less to the piece's total entropy than
pitches in the tonic key do. Ideally, these weights would be determined in part by the
amount of time spent in the new key (as the listener's ability to remember the home key
diminishes over time), but any such system would almost certainly be criticized as
arbitrary.
The debate over how best to represent modulation in entropy calculations for
tonal repertoire highlights the concern that underlies most if not all entropy-based
and G in C minor and Eb and Bb in Eb major. Similarly, when Snyder adopts a twenty-
29 Snyder, 126-128. While the average listener may not realize that a piece has ended in C#
major instead of C major, this same listener would probably notice if the piece begins in C major
and ends in F# minor, if only for reasons of mode and register – a distinction that cannot be made
within this system.
27
flats (of scale degrees 7, 3, 6, and 2) and double sharps (of scale degrees 1, 4, and 5), his
motivation is the notion that listeners hear F and E# as distinct pitches in certain contexts,
The variety of options available to analysts (even in terms of pitch alone) speaks
to the expressive potential of these alphabets, since they can be altered to best reflect
listeners' perceptions of any given repertoire. This same flexibility can limit the analyst's
ability to compare samples from sufficiently different styles, though. It would seem
unfair to compare Wagner's entropy within a twenty-eight-note system with late serial
Schoenberg's, for example, since Schoenberg's disuse of double sharps does not speak to
any increased predictability in his music as compared to Wagner's, nor would it be fair to
say the listener finds Schoenberg's style more constricted because these letters are
omitted. One of the unstated goals of such analysis, then, is the selection of an alphabet
that is sensitive to perceptual concerns for specific repertoires but also general enough in
its applicability that its use on music from other repertoires seems reasonable.
This challenge is even greater for contextual music. The most common and most
universally applicable alphabet, either pitch names or scale degrees accepting octave and
enharmonic equivalence, is all but useless for serial music or any sort of music that
exhausts the aggregate regularly. Any such piece will have maximal entropy for that
alphabet cardinality, regardless of whether the piece is based on a derived row or an all-
interval row and, indeed, regardless of whether or not the piece is atonal at all. This
entropic equality implies that Webern's Variations for Piano is exactly as predictable as
Boulez's Piano Sonata no. 2, which would be in turn just as predictable as the first few
accomplished through the guise of Markov chains. Such constructs would allow the
28
analyst to look for patterns in the ordering of pitches, rather than relying on their
frequency alone. With a simple pitch alphabet, Markov chains could not differentiate
between serial rows, but could at least distinguish between a serial piece and a non-serial
piece that happens to use each pitch equally. Higher-order constructs have even clearer
tonality, since entropy on its own is order-blind. Thus, from the perspective of zero-order
entropy, the progressions in Figures 3.2 and 3.3 are exactly the same, although certainly
one is more predictable than the other within a tonal paradigm, and certainly one is more
tonal than the other. By contrast, Markov chains could differentiate between these two
strings easily.
in addition to entropies based on zero-order data. That is, rather than accepting C, D, and
C#, C followed by D, and so forth as individual letters, creating an alphabet with 144
letters. However, Hessert notes that the continued effectiveness of this strategy is limited;
an alphabet built from consecutive pitch pairs is almost reasonable at 144 letters, but
calculations.30 One can only imagine the complexity of a higher-order alphabet that does
requiring any first-order computations. He also notes that an alphabet based on intervals
avoids the issue of modulation quite nicely, while reflecting motivic content more
accurately than pitch-based analysis can and potentially allowing for more meaningful
alphabet that combines each pitch with its preceding interval.32 Potentially, such an
alphabet would allow the analyst to distinguish between typical and non-typical
resolutions of dissonant tones; a piece in which any scale degree can be left by any
interval is probably less tonal than a piece in which certain scale degrees (4 and 7, e.g.)
can usually only be left by certain intervals (down by step and up by step, respectively).
30 Hessert, 16ff.
31 Ibid., 43-44.
even a composer whose 7s can resolve anywhere but whose b3s always resolve to b6.
Through its reliance on pitch, this sort of analysis nullifies many of the benefits Hessert
interval will be based on the intervals directly preceding it. His analysis goes up to sixth-
order strings (that is, fifth-order computations based on intervals), but it occurs within a
or complications (e.g., the division of a row into verticalities, leading to the creation of
melodic intervals not present in the original row) that would make such higher-order
analysis impractical.33
remember the previous five intervals of Schoenberg's String Quartet, no. 4, the listener
can predict the sixth interval with complete certainty (assuming the row form in question
has not been altered or truncated). This certainty is not an accurate reflection of listeners'
perceptions of this row, though, even under ideal circumstances; if it were, Lewin argues,
the associated musical experience would be quite dull. Therefore, he concludes, the
listener probably only hears back two or three intervals – perhaps more or fewer,
the line, and other factors, but probably not six. Thus, even if such higher-order analyses
were practical, they may not be a reasonable reflection of the listener's experience.
Of course, one can imagine situations in which a less literal sixth-order analysis
and consider six successive intervals, it seems quite reasonable for a listener to remember
a six-element contour, or six intervals expressed in the form of two or three verticalities.
To date, this form of “entropy chunking” has had almost no mention in the relevant
literature.
Hessert also raises the question of duration, finding it problematic that a half-note
analyses.34 Hiller and Bean reflect this same concern in their 1966 analyses of sonata
expositions, in which longer notes are weighted more heavily than shorter notes – but
Hessert criticizes this approach for its lack of attention to attack, arguing that sixteen
sixteenth-note Cs are quite different perceptually from a single whole-note C.35 One
imagines that any method of computation that addresses both concerns would be
prohibitively complex; it seems most likely that any interested analyst must choose
whichever approach seems least inappropriate for that analyst's particular repertoire.
In any case, the most salient of Hessert's concerns – that an ornamental tone is
treated as equal to a chord tone – seems to be more an issue of interval than of duration,
since ornamental tones are approached by step more often than not (and since an
ornamental tone approached by a large leap is probably aurally surprising enough that it
ought well to contribute as much to the piece's entropy as the chord tone it ornaments). It
seems perceptually reasonable to claim that a whole step is a whole step, regardless of
whether that whole step connects a C and a passing D or a chord tone C and an adjacent
34 Hessert, 68.
35 Lejaren Hiller and Calvert Bean, “Information Theory Analyses of Four Sonata
Expositions,” Journal of Music Theory 10, no. 1 (1966): 96-137.
32
chord tone D. From a pitch-based perspective, the distance a line must travel to arrive at
the next pitch may be more relevant from the perspective of musical predictability than
how long the line stays on that pitch – but even in the absence of pitch, it seems the
primary determinant of predictability is not the duration of each individual pitch, but
instead either the rhythmic pattern in which these pitches present themselves or the
unpublished 1959 Master's thesis by John Brawley (Indiana University). Hessert finds
infinite cardinality, which makes the computation of relative entropy and redundancy
pattern. Does a dotted quarter followed by an eighth note constitute a rhythmic “pattern”?
If this configuration begins on a weak beat or is preceded by an eighth note, is it the same
he notes that such calculations almost necessarily conflate “clock time” with “perceptual
time.”37 In other words, by creating calculations based on the notated tempo we implicitly
privilege the former, which is less defensible given the degree to which analysis based on
analysis that claims to be a reflection of perceptual time must almost certainly encompass
multiple musical domains beyond rhythm, tempo, and duration. In priviliging longer
notes over shorter ones, we run the risk of (for example) privileging extended neighbor
36 Ibid., 45-50.
37 Snyder, 125-126.
33
Other than Rhodes, few analysts have attempted to deal with more than one
musical alphabet simultaneously. The notable exception is Hiller and Fuller's 1967
analysis of the op. 21 Symphony, in which pitch (not pitch class) is combined with the
number of eighth notes between successive attacks. Entropies are also computed on
various types of intervals. These entropy calculations are then used to draw conclusions
about formal sections of each section of the first movement. When pitch is considered
alone, results between zero-order entropy and first- or higher-order chains are
inconclusive; although the development is (as one would expect) the least predictable in
terms of individual pitches, its higher-order results are more predictable than either the
bits the message would require to store or transmit) that each letter of an alphabet
conveys. Hiller and Fuller also express their entropy in terms of bits per second (based on
the notated tempo) – that is, examining entropy in terms of the rate at which information
is presented. Their hope is to distinguish between the listener's experience of a great deal
of information presented quickly, and the same amount of information presented over a
longer timespan. Interpreting entropy in terms of bits per second does not change the
entropy results for op. 21, but the idea bears investigation: that the speed with which
Unfortunately, this measure cannot describe how evenly distributed information is across
38 Lejaren Hiller and Ramon Fuller, “Structure and Information in Webern's Symphonie, op.
21,” Journal of Music Theory 11, no. 1 (Spring 1967): 78.
39 Ibid., 84ff.
34
from a passage with a continuous information rate. Of course, the accuracy of Webern's
notated tempos is problematic in any case, and the frequent ritardandos in his music make
it less plausible that a calculation of this type could be relevant to a performance. Despite
any practical limitations, though, the fact that entropy was considered in terms of the rate
diachronic analysis – and arguably, an early connection between entropy and time, as
well.
The first two are basic criteria without which entropy calculations are impossible; the
second two are desiderata but not necessarily requirements. To these one can add that
entropy can most effectively analyze samples with low variances, since entropy is in
some sense a decontextualized measure of central tendency. Smaller sample sizes may
assist in analysis, if they serve to reduce variance; it is more effective to analyze a small
sample that possesses a given characteristic uniformly than to combine this sample with
another sample lacking this characteristic. Imagine a bimodal grade distribution in which
many students have a 90% average and many have a 60% average. Considering these
students in terms of two smaller sample sizes allows one to generalize about the data
easily, but combining the two samples yields both an unhelpful overall average and a
much higher degree of uncertainty. The same logic ought to apply to musical domains.
Considering data across multiple domains is useful, but considering multiple domains
simultaneously – that is, combining entropies of different domains into a single entropy
35
The cautions one can draw from the history of entropy in music are, for the most
part, no different from the cautions that apply to all analysis. In particular, entropy-based
analyses are problematic when they do not reflect musical experience. If one accepts that
all music analysis is necessarily metaphor – that quantitative analyses are simply a
different way of exploring metaphor – then the most important caution is that these
metaphors must be apt, rather than relying upon their quantitative nature to make their
CHAPTER IV
ALPHABETS FOR ENTROPY-BASED ANALYSIS
Interval Entropy
As discussed previously, pitch class entropy is rarely useful for analysis of post-
tonal music. The table below gives pitch class entropy figures for a collection of post-
tonal vocal works; Youngblood’s results for Schubert’s pitch entropy provides a baseline
Table 4.1: Pitch entropies in Webern works, compared with Babbitt and Schubert
40 Since op. 15, no. 5 is a canon, it is included with the op. 16 canons throughout this
section.
37
distinguish between serial and freely atonal works, or derived rows and all-interval rows.
Even canons are seen as maximally unpredictable, although one imagines the second and
between these styles. Pitch class entropy can only recognize canons iterated at the same
pitch level, but a canon interpreted as a series of intervals should be recognizable at any
pitch level. Although the order-blindness of entropy somewhat limits its effectiveness for
canons, interval class entropy can at least distinguish a canon from a non-canonic piece in
the same style. The same logic applies to serial works; a serial work will generally have
lower entropy than a freely atonal work since any interval appearing in the row would be
repeated many times, while any interval not appearing in the row would be heard very
infrequently. (Similarly, a serial work based on an all-interval row should have roughly
the same number of all interval classes, whereas a work based on a derived row would
have roughly proportionate numbers of a few interval classes and very few of any others.)
Such a measure could be unable to distinguish between a serial work based on a derived
row and a freely atonal work saturated with the pitch class set that forms the basis of the
former's derived row, or a serial work based on an all-interval row and a freely atonal
work that simply exhausts the aggregate of interval classes regularly, but arguably, most
These intuitions are somewhat flawed, in that they assume an idealized linear
row amongst several voices will almost certainly create new intervals not represented in
the original row. Nevertheless, entropy is at heart a measure of predictability, and it seems
reasonable that it should reflect the listener's surprise at hearing an interval not linearly
38
present in the row, even if reflection of this surprise comes at the expense of the
Horizontal interval class analysis of the same works from Table 4.1 provides the
Table 4.2: Interval class entropies comparing serial and non-serial works
Figure 4.1: Interval class entropies comparing serial and non-serial works
39
These data indicate that interval class entropy is able to distinguish between
derived and all-interval rows, and between canons and non-canons from approximately
the same period. These are both important tests of the construct's effectiveness; its ability
to make these distinctions speaks toward its ability to reflect musical saturation and
predictability.
These distinctions are not retained when vertical intervals are included.
Table 4.3: Vertical and horizontal interval entropy on one serial and one non-serial work
Although op. 25's entropy is still lower than op. 16's, the difference is no longer
block chords, it is unlikely the listener would be able to distinguish between them based
shown in Table 4.4 and Figure 4.2. Without inversional equivalence, relative entropies are
higher across the board – since what was originally an emphasis on interval class 1
increase for the same reason, which makes statistically significant distinctions less likely.
40
meaningfully between canons and non-canons (Webern op. 15 vs. op. 16) and between
derived rows and all-interval rows (Webern op. 25 vs. Babbitt). The most interesting
difference between Table 4.2 and Table 4.4 is op. 25, which has a lower interval class
entropy than op. 16 but a higher registrally-ordered interval class (ric) entropy. This
works. In op. 16, an ric1 is not the same as an ric11, since a melodic ric1 in the clarinet
line could not be answered with an ric11 in the vocal line without breaking the canon.
Assumptions of inversional equivalence seem much more reasonable in op. 25, since the
juxtaposition of prime rows with inversional rows leads the listener to hear intervals and
The remaining oddity in these data is the similarity between Webern op. 15 and
Babbitt. To investigate this similarity, we expand intervallic entropy into interval entropy
(-72 < x < 72) and ordered directional interval class entropy (-12 < x < 12).41
Ordered directional interval class entropy bears few surprises. The data in Table 4.5 and
Figure 4.3 show the expected distinction between Webern and Babbitt, but from these
data no conclusions can be drawn about any of the Webern works examined – almost the
41 72 (or six octaves) is a number chosen out of convenience – the distance between the
highest and lowest pitch in any of these pieces, rounded up to the nearest octave.
42
Table 4.5: Ordered directional interval class entropy in serial and non-serial works
Figure 4.3: Ordered directional interval class entropy in serial and non-serial works
43
The inferences to be made from this apparent inconsistency – either that ordered interval
class is less a relevant structure in these Webern works, or that Webern's predictability in
terms of ordered interval class remains consistent across a variety of post-tonal styles –
are at first alarming. Either conclusion makes suspect Hessert's claim that interval-based
entropy is capable of dealing meaningfully with works from disparate periods and styles
– given that registrally-ordered interval class entropy lacks the generality to distinguish
between Babbitt and freely-atonal Webern, while another lacks the generality to
distinguish between Webern works of different styles and time periods. Perhaps the more
useful claim to draw from this perceived lack of generality is that any invocation of
implicit assumptions about a given composer's approach to the interval, assumptions that
One must also keep in mind that although statistically significant differences
between works imply differences in style, the lack of statistically significant differences
does not imply stylistic similarities. The lack of distinction between Webern's op. 15 and
interval classes; rather, the differences between the works are simply not profound
enough for us to be certain that they imply a genuine stylistic difference. In short, an
At the very least, these results demonstrate the utility of examining repertoire
from multiple perspectives on the interval. These results also hint at the possibility of
alphabet is extraordinary large. As a result, these results would have extraordinarily small
relative entropies, which would give an impression of predictability not audible in the
music.
Although op. 16 seems to have a much smaller entropy than all other works
considered, this deviation is not statistically significant. Even if it were, the conclusions
drawn would be slightly problematic. One could not conclude even from significantly
smaller interval entropy that Webern op. 16 relies more upon smaller intervals – merely
that the same intervals are repeated more often. This was not a useful distinction in study
of interval class entropy, since within a mod-12 space we can assume, at least to some
degree, that a few repeated intervals will become predictable regardless of absolute size –
an assumption which rests upon the audience's ability to distinguish between intervals
easily, even if not consciously. By contrast, it seems problematic to credit most audiences
with the ability to distinguish between a minor 13th and a major 14th (which occur four
measures apart in op. 25, no. 2) without a tonal context. If intervals are not
Still, it is reasonable that op. 16 should have the most precipitous drop in entropy.
Octave equivalence within each instrumental line is a tenuous assumption in any of these
works, but it is the most questionable in op. 16. A minor sixth in the dux answered by a
major thirteenth in the comes would constitute a break in the canon; they are
unquestionably different intervals for the purposes of this piece. The correlation between
reasonable if one assumes that a work in which octave equivalence holds will, more often
than not, consist of relatively many of one size of an interval and relatively few of its
whole step and occasionally a major ninth, for example. This results in a narrower
intervals, since most ric2s will stay whole steps, and only a few of them will become
major ninths. If an analyst is willing to make this assumption, then interval entropy can
Unfortunately, the variety of alphabet sizes (and more importantly, the variety of
alphabets untrustworthy – for example, claiming that op. 16 has a lower relative interval
entropy than interval class entropy, when the rarity of very large intervals and the
frequency of very small intervals ensure that any work will have a larger interval entropy.
Even relative entropy cannot make this distinction reliably, since both the size of the
alphabet and the letters' distributions within that alphabet are at issue. Using another
piece as a basis for comparison may improve the trustworthiness of these statements, as
in this chapter's claim that op. 16's entropy decreases more sharply than op. 15's.
In short, for these test cases intervallic entropy accords with analytic expectations
for these works more often than not. The cases that seem dissonant against these
expectations are non-significant results, which need not prove either similarity or
CSEG Entropy
discussed thus far and our intuitions of musical predictability is entropy's inability to
47
reflect the predictability inherent in motivic material.42 Although intervallic entropy can
motive, it is unable to connect these intervals into larger chains without incurring the
on contour segments and contour segment reductions. Such an analysis is able to consider
spans of music as ordered, although these ordered spans are unordered within the larger
composition, the same way pitches are unordered in computation of pitch-based entropy.
The construct of CSEG entropy can be said to measure the degree to which a piece is
saturated with a few contours – the likelihood that a listener will be able to guess the
their relative height; larger numbers represent higher pitches. Consider the first four
pitches of Example 4.1, shown on page 49. The lowest pitch, B, is represented by the
number 0; the next lowest pitch, C#, is represented by 1. This numbering process
continues until all pitches have been ranked. Thus, the first four pitches of Example 4.1
have the CSEG (3120) – the highest pitch, then the second-lowest, then the second-
problematic based on its potential to introduce the analyst's biases. It is likely that an
analyst would focus on the CSEGs he or she finds prominent, ignoring the contour
segments that do not contribute to larger patterns – in other words, decreasing the entropy
42 For the purposes of the test cases in this chapter, I use 'motivic' to refer to a work in
which many pitches can be explained or predicted based on motives to which they belong, and
'non-motivic' to refer to a work in which few pitches can. This is an oversimplification, but the
test cases in this chapter generally fall very clearly into one classification or the other.
48
of a sample by concentrating on the events that are the most predictable. To avoid this
bias, this analysis includes every possible CSEG of a given length by examining a
moving window of that length. That is, for a CSEG of length 3, the contour segment
created from the first through third pitches of a composition will first be considered, then
the contour segment created from the second through fourth pitches, and so on. This
Figures 4.5 and 4.6 show CSEG data for two test cases: one built entirely out of
random pitches (chosen between A0 and C8)43 and one in which the motive C4-C5-A4-
F4 is repeated continuously. These data show exactly what one would expect: a stream of
pitches saturated with motives has a much lower entropy than a stream of random
pitches. (The entropies of the motivic sample do not increase as cardinality increases
43 These random pitches are based on data from random.org, which creates random numbers
based on atmsopheric noise.
49
that all possible cardinality-three CSEGs will appear in a sufficiently long sample, but
extremely unlikely that all possible cardinality-six CSEGs will appear. Unfortunately, this
When comparisons are drawn between this random collection of pitches and the
first movement of Webern's op. 5, a highly motivic work, problems become apparent.
According to the data in Table 4.8 and Figure 4.8, from the perspective of contour the
first movement of Webern's op. 5 is less predictable than a collection of random pitches.
Examining the final tallies of cardinality-three CSEGs for each piece clarifies the
situation:
51
Table 4.8: CSEG entropies, random string versus Webern, op. 5, no. 1
Figure 4.8: CSEG entropies, random string versus Webern op. 5, no. 1
52
(001) 23 0
(010) 39 1
(100) 24 0
(011) 23 0
(101) 36 3
(110) 24 0
(012) 71 76
(021) 133 89
(102) 129 87
(120) 88 78
(201) 94 78
(210) 136 86
Webern's op. 5 – and many other musical works, for that matter – are free to move
CSEGs like (000) and (010) occurring – that is, the string of random pitches containing
repetitions – is so small.
Perhaps there is some truth, then, to the claim that a string of random pitches is
“more predictable” in terms of CSEGs than the first movement of Webern's op. 5. Given
repeated pitch – or, for that matter, two adjacent pitches in the same octave. A pitch above
middle C is more likely to be followed by a lower pitch than a higher one, and vice versa.
54
While developing a more useful control – a string of pitches that contains each
possible CSEG the same number of times – is simple enough, these same problems arise
on a less dramatic scale in comparisons between non-random pieces. Table 4.10 and
Figure 4.10 compare the first movement of op. 5 and op. 18.
Any resemblance at all between these results is contrary to intuition. One would
expect op. 5 (a thoroughly motivic work) to have a much lower entropy than op. 18 (one
disorganized”44) – but at the 3-CSEG level, op. 5 has a significantly higher entropy than
op. 18, and at no other cardinality is the distinction between the two significant.
44 Anne Shreffler, “’Mein Weg geht jetzt vorueber‘: The Vocal Origins of Webern’s Twelve-
Tone Composition,” Journal of the American Musicological Society 47, no. 2 (Summer 1994):
277.
55
Table 4.11 and Figure 4.11. Although op. 15 is much less consistently motivic than the
first movement of op. 5, op. 15 consistently has a lower entropy than op. 5, and for some
Perhaps the analogy of pitch entropy can clarify the situation. It is intrinsic to the
entropy formula that the number of pitch classes present affects the final computations
more than the distribution of those pitch classes, although both contribute. In other words,
from the perspective of the formula, it is more important that a wholly diatonic work use
only seven pitch classes than that it use (e.g.) scale degrees 1 and 5 more frequently than
scale degrees 2 and 6. The addition of chromatic pitches will increase the piece's entropy
more dramatically than a more even distribution of the seven diatonic pitches.
In this case, the eight (cardinality-three) CSEGs that do not involve pitch
repetition are analogous to the diatonic pitches, and the five CSEGs that do involve pitch
repetition are analogous to the chromatic pitches. For the purposes of illustration, we
return to the data from Tables 4.8-4.9 – comparing op. 5 with a string of random pitches.
Example 4.3 shows a melody composed using the CSEG frequencies of the first
movement of Webern, op. 5, versus a melody composed using the CSEG frequencies of a
string of random numbers. To create these, common CSEGs were aligned with common
scale degrees in a major key (and since there are thirteen cardinality-three CSEGs but
only twelve scale degrees including alterations up to enharmonic equivalence, the least-
common CSEG was omitted from consideration). Both pitch collections were then
aligned to C major and reordered to create the most plausible C major melody possible.
Figure 4.12: Melody generated using the CSEG distribution from Webern, op. 5, no. 1
58
The results are not terribly jarring or dissonant, but neither are they terribly reminiscent
of C major, especially since the passage contains no Cs. By contrast, the results generated
from the random pitches seem much more aligned with C major:
Figure 4.13: Melody generated using the CSEG distribution of a string of random pitches
The unnevenness of the distribution in Figure 4.12 makes it more likely that an entire
sample could be generated without any instances of the tonic pitch, even though the tonic
was assigned to be the most common pitch. The distribution also leads to the appearance
of less common scale degrees, absent from Figure 4.13. In short, because the first
movement of Webern's op. 5 uses all thirteen possible CSEG classes, from the
perspective of CSEG entropy it is “less diatonic,” whereas the string of random numbers
is “more diatonic.” This causes the unintuitive results shown in Tables 9 and 10.
Similarly, when CSEG entropy is used to compare two freely-atonal works, the degree to
which these works repeat pitches will have a much greater effect on the results than is
justified.
Arguably, this disparity speaks to a fundamental difference in the way we as
non-motivic material can pass without the listener deciding the piece has become
59
unmotivic. (Certainly this is the case for the CSEG entropy algorithm, since it generates a
new CSEG starting on each pitch – in other words, generating a lot of misses for each
hit.) By contrast the number of successive non-diatonic pitches the listener can hear
without concluding that the piece is non-tonal is much more limited. The entropy formula
is better suited to the latter, since the presence of chromatic pitches increases entropy
figures dramatically.
serial music – in particular, serial music in which repeated pitches are disallowed (which
disqualifies, for example, Babbitt's “Widow's Lament in Springtime,” in which the same
pitch is often attacked repeatedly without constituting a departure from the row).
Returning to the analogy of pitch entropy, comparing two serial pieces is akin to
comparing two wholly diatonic (or almost wholly diatonic) pieces; since there are no
“chromatic” CSEGs, the final results are able to reflect the distributions of the “diatonic”
CSEGs.
Table 4.12 shows CSEG entropies for opp. 18, 25, and 27. As expected, no
significant differences are found between opp. 25 and 27 for any CSEG cardinality. For
cardinalities three and four, the algorithm is able to distinguish between op. 25 (motivic)
and op. 18 (non-motivic). Given that the motives in op. 25 are all less than five pitches in
This algorithm loses some of the motivic character of op. 27, since it cannot
Although the CSEG algorithm is able to pick up this pattern each time it occurs in the B
section of the first movement, it cannot recognize the similarity between the two halves
of this motive. An algorithm that computed entropy based on CSEG prime forms instead
of CSEGs would be useless for cardinality-three CSEGs, but could potentially generate
To address the problem of repetitions one could also alter the CSEG algorithm by
removing the least common third of the CSEGs present. In most cases this will remove
the repeated-pitch CSEGs, although if the repeated-pitch CSEG is motivic and occurs
frequently (as in a Berio Sequenza, for example), it would be retained. This allows the
algorithm to generate meaningful results for non-serial works.
PC-Set Entropy
class sets. Such a construct would stop short of reflecting intervallic organization fully,
but would better reflect proximity of intervals than simple interval entropy does. Lack of
regard for the ordering of pitches within a pc-set may even be an advantage, allowing the
tetrachodral combinatoriality in a way that ordered Markov chains cannot. Entropy based
transformations.
The simplest possible form of pc-set entropy, if not the most useful, is computed
based on discrete pitch successions of a given cardinality in a single melodic line. These
data are based on brute force segmentations; the first three pitches in a line make up the
first trichord, the next three make up the second trichord, and so forth. Table 4.13 shows
horizontal (discrete) pc-set entropy in the vocal lines of opp. 16 and 25. As expected, pc-
set entropy is consistently higher in op. 16 than in op. 25, since the latter is based on a
Table 4.13: Pc-set entropies in op. 16 and op. 25 using discrete segmentation algorithm
63
Figure 4.16: Pc-set entropies in op. 16 and op. 25 using discrete segmentation
algorithm
Table 4.14 re-examines the same musical material, but using an algorithm
modified to accept overlapping pc-sets. To generate this data, each pitch of these vocal
deal of background noise. Even if a composer relies heavily on just two or three pc-sets,
new pc-sets are created by the juxtaposition of these sets – pc-sets that may or may not be
simultaneously, which makes it difficult to hear any particular segmentation clearly. For
64
this reason, the window-algorithm scores are almost all higher than those generated by
Table 4.14: Pc-set entropies for op. 16 and op. 25 using window algorithm
Figure 4.17: Pc-set entropies for op. 16 and op. 25 using window algorithm
65
Table 4.15 shows vertical pc-set entropy figures for these same works.
Table 4.15: Vertical pc-set entropy in op. 16 and op. 25, no. 1
Figure 4.18: Vertical pc-set entropy in op. 16 and op. 25, no. 1
66
Although the (014) trichord saturates both the vocal and piano lines of op. 25 when these
lines are considered independently, no similarly pervasive feature emerges from the
verticalities created between the piano and the voice – even though verticalities presented
in the piano alone frequently feature an (014) subset. By contrast, the verticalities
Opus 25, in particular, seems to necessitate an approach that deals with both the
horizontal and the vertical, given the rhythmic independence of the vocal and piano lines;
even if purely vertical data were of unquestionable validity, one would have difficulty
algorithm that would be able to decide the most appropriate segmentations of musical
material presents sufficient complication to place it beyond the scope of this project.
become questionable if any pitches are not included in a segmentation, since the
algorithm cannot account for singleton pitches. In dealing with pc-sets of non-uniform
sizes, subsets and supersets become an issue, although one could imagine a fuzzy
algorithm that counts pc-sets based on the percentage of pitches held in common between
an original pc-set and its subset. Elisions could be handled in the same manner.
Although these steps would refine the results given above, it seems unlikely that
they would change the results from Tables 4.14 and 4.15 in any meaningful way.
separately gives us an impoverished reflection of the work's organization but still reflects
that organization to some extent and (at least in this case) still provides reasonable results.
67
CHAPTER V
INFORMATION AND TIME
work – often discussed in terms of predictability – and the listener's perception of the
[W]ithin the speed at which a piece is played, the greater the density of
information as deviance from the norms of the system, the longer the time
experiences. In other words, within the tempo of the work, the larger the
amount of density (in number of notes, harmonic complexity, extended
range or whatever dimension) and the further it is from the norms, the
more “effort” (processing activity) is needed to draw it to them and shape
it coherently... [Greater density] requires more processing “effort” because
it has a greater uncertainty.45
In other words, an unpredictable work will be perceived as longer than a predictable one.
Predictable works can be fit into mental schemata and can be easily chunked, and
therefore require less processing time, whereas works that disrupt these schemata require
more effort on the listener's part. What constitutes a “norm” for Barry is flexible. She
uses this framework to argue that the beginning of the Eroica has a shorter perceived
length (relative to its actual length) than the beginning of Brahms's Symphony No. 3,
which she attributes to the latter's more variable rhythmic patterns, greater chromaticism,
and wider range. She then uses the same framework to argue for a similar contrast
individual listener. A listener who is familiar with the beginning of the Brahms but not
with the Beethoven may find the Brahms to have a shorter perceived length, since
familiarity has removed a great deal of the piece’s uncertainty. Alternately, such a listener
45 Barbara Barry, Musical Time: The Sense of Order (Stuyvesant, NY: Pendragon Press,
1990), 167.
68
of the piece allows for more in-depth diachronic analysis. Perhaps the listener is
comparing the current performance of the work to previous performances, deriving new
It also seems likely that musical works that deviate from the norm of a listener’s
experience would bear a temporal learning curve. Kramer, in his anecdote regarding
Satie's Pages mystiques, reports feeling bored (and therefore, one imagines, feeling that
time was passing slowly), until he had adjusted to the work’s depiction of time; one
imagines a similar adjustment period would be experienced in a work much denser than
the listener’s experienced norm. Since Kramer is comparing two works that both fall
outside common practice norms (and Barry is comparing two works that both fall pretty
squarely within them, at least by comparison with Satie), the complication this learning
curve introduces seems less problematic – but a comparison between Brahms and Babbitt
would almost certainly require a more in-depth exploration of the works’ learning curves.
The context of the passage in question can also change the listener’s perception of
time. That is, if we were to compare the beginnings of these symphonies’ third
movements, instead of their first movements, our perception of time would almost
certainly be altered (at least at first) by differences in the endings of the second
perhaps the listener could not find parking and ran into the concert hall at the last minute,
and therefore finds even a dense, unpredictable piece to be slow-paced.
generalization, to be used when comparing two pieces in which all other variables
genuinely are equal, even if a listener’s experience is almost certainly more complicated.
more time we have for reflection, the quicker time passes; the lower the
effective density of alteration (not reduced by recollection or the fact that
the alterations coincide with our expectations), the less time the senses
need to react, so that greater intervals of experiential time lie between
the processes, and the slower time passes.46
This formulation resonates with Kramer's anecdotes describing the passage of time in a
highly repetitive work as shortened and the passage of time in an unpredictable work as
elongated.47 Kramer argues that the more easily chunked a melody is, the easier its
memorization will be, and the shorter its subjective duration. He cites as support an
in which sounds were played in a predictable order (each sound playing ten times before
the next sound was heard) as, on average, 75% the length of a tape in which the same
Kramer also links information content and the listener's perception of musical
Kramer does not directly define discontinuity, but gives the example of a tape splice: “the
46 Karlheinz Stockhausen, “Structure and Experiential Time,” Die Reihe 2, trans. Eric Smith
(Bryn Mawr, Pennsylvania: Presser, 1959), 64.
48 Ibid., 337-338.
50 Jonathan Kramer, “New Temporalities in Music,” Critical Inquiry 7, no. 3 (Spring 1981):
544.
70
discontinuities take many forms, but consistently involve the violation of the listener's
Kramer cites these discontinuities as giving rise to new experiences of time, often
phrasing these experiences in terms of the predictability of the music's progress towards
large-scale goals, events comparable to structural cadences. This progress forms the basis
of the temporal categories he sets forth in The Time of Music – differentiating between
music that makes predictable progress towards a goal (directed linear time), music that
makes progress towards several different goals simultaneously or makes this progress in a
moment time results; when there are neither discontinuities nor progress but simply
stasis, vertical time results.52 Generally, the distortion of perceived clock time is less
important to Kramer than the creation of subjective piece-time, but both Kramer and
Given these connections, the use of information theory to describe the passage of
time seems reasonable. After all, information theory entropy measures predictability, the
cannot equate temporality with predictability – and Kramer points out that even though
discontinuity often facilitates non-directed perceptions of time, the former need not
always imply the latter, and vice versa. Nevertheless, examination of a piece's
Although there are aspects of musical predictability that entropy cannot describe
well (for example, scale degree 7 'predicting' scale degree 1, or a flat ‘predicting’ a
descent), predictability in contextual music is often created in the way entropy measures
best: repetition. Entropy-based analyses make the implicit claim that the more a pitch is
repeated throughout the piece, the more referential power it bears – and while this is often
true in tonal music, one can easily imagine melodies that imply a certain tonic without
ever stating it (for example, consistent use of a dominant seventh built on D to imply G
major). Frequent repetition of a few pitches is often symptomatic of tonality, but the
correlation between repetition and referentiality is not exact. This reliance on repetition
(as opposed to the relationships between scale degrees or harmonic context) seems less
problematic for contextual music, though. Kramer describes goals in Webern's music as
24, but attributes its aural intelligibility to rhythmic patterns and repeated row
something definite. Only repetition can demarcate a series of tones and its purpose.
repetition and predictability or referentiality implies that information theory entropy can
Kramer also notes that the relationship between continuity and linear temporality
can occur in the absence of continuity, “since an event may be implied by events that far
54 Martin Boykan, Silence and Slow Time (Lanham, MD: Scarecrow Press, 2004), 144.
precede it.”56 However, he claims that post-tonal music can only be linearly directed if
other musical domains (for example, the gradual crescendo and range limitations that
linearity in music, continuity is the easiest for information theory to detect. Parsimonious
voice-leading can be understood as more frequent use of a few small melodic intervals –
and although entropy cannot tell us much about interval size, it is very good at describing
interval frequency. One imagines entropy calculations on something like dynamic CSEGs
(in which ppp → pp → p is represented as (012), for example) could describe the
directedness of Atmospheres, as well.
discontinuity involves change – whether this change arises from an unexpected shift to a
new pitch collection, the establishment of a new meter, the dissolution of an established
motive, or any of a variety of possible changes. In each case, a higher entropy score
arises when the passage is examined as a whole. While these changes may not be evident
in pitch-based entropy, they may be evident in an alternative alphabet; for example, pitch-
based entropy cannot detect discontinuities that arise from a motive being dissolved, but
sets (if, for example, the string 123456 123456 124536 124536 is analyzed in terms of
non-overlapping trichords).
57 Ibid., 39.
73
juxtaposed in time") and that Kramer claims as an early example of moment time.58 For
Hasty, continuity and connection can always exist between events, insofar as the listener
is able to compare and relate them. Similarly, Hasty argues, the listener is able to place
events in time based on when they occur in the piece, which creates directionality.59
Bearing this interpretation in mind, the fairest label for what entropy detects seems to be
the potential for discontinuous hearings. A listener who is predisposed to hear two
musical moments as related can probably find reason to do so, but a higher entropy score
implies that these comparisons will be made based more on differences and violated
Overall – at least for this repertoire – we see a lower entropy score correlated with
the factors that lead to Kramer's linearly directed musical time (continuity; repetition of a
few elements) and a higher score correlated with factors that lead to multiply-directed or
non-directed time (many goals repeated with more or less equal frequencies;
discontinuities).60 Of course, there are other factors involved in the perception of musical
goals, things entropy cannot detect easily: symmetrical structures, presenting a referential
pc-set at the very beginning of a work, and dynamics, among others. Because of these
factors, an analyst completing an entropy-based analysis faces the same obstacles s/he
would using any other analytical device: the results must be used in the context of their
musical ecosystem and should either resonate with musical experience or conflict with it
60 This correlation seems slightly more problematic when comparing works by composers
from different periods – claiming, for example, that a more chromatic tonal work is necessarily
less directed than a more diatonic tonal work.
74
Hegel argues that science cannot cope with the unrest of temporality, and
is therefore forced to use 'a paralyzed form, viz., as the numerical unit,...
which... reduces what is self-moving to mere material, so as to possess in
it an indifferent, external, lifeless content'... Temporality is drastically
constrained in this quantitative rendering.61
My discussion of musical time depends upon quantifiable results, but my goal is not to
musical time with Adorno's “rationalized,” clock-bound time through such quantification.
Although entropy results are numerical and objective, they are in some sense metaphors
for a musical experience; their goal is not to reduce piece-time but rather to describe it.
These results are not conclusions in and of themselves, but are instead evidence for a
message is written, not the message itself.62 We can think of entropy as measuring the
a large amount of repetition, which often imbues that language with goal-oriented
tendencies; the repeated event will probably be heard as a goal, and a composer would
have to work actively to prevent that hearing. Extremely low entropy implies something
like Kramer's vertical time; directedness is unlikely because the music never leaves its
referential sonority (or motive, ostinato, or rhythmic pattern), but is still possible, through
dynamics, timbre, tempo, or other devices. High entropy indicates very little repetition,
which implies a fair amount of discontinuity – and while a composer working in such a
62 Entropy “relates not so much to what you do say, as to what you could say.” Shannon and
Weaver, A Mathematical Model of Communication, 5.
75
language can still establish and communicate musical goals, creating these goals becomes
much more difficult. It is likely that these goals would be more difficult for the listener to
hear, as well, while it is entirely possible for the listener to predict goals the composer
never intended in a work with lower entropy. In short, the fairest interpretation of a low
entropy score is, “This passage is written in a language that makes goal-oriented hearings
easier for the listener and the composer and which is more likely to facilitate perception
terms of information content and density. Johnson does not adopt temporality as his
focus, but his description of time in “Abendland III” is noteworthy. He describes the song
as one “in which the suggestion of linear movement is largely undermined by the
tendency of each line to contract into a series of ostinati, as if acknowledging thereby that
the 'wanderings' of each line are essentially without direction.”63 Underwood draws a
similar association between ostinati and stasis in op. 10.64 In each case, a period of stasis
is created through a reduction in the amount of information being received, while the
propulsion.
Of course, this statement cannot be made for all ostinati; the cello line in op. 5,
music's sense of propulsion, rather than decreasing it. Perhaps this is because the ostinato
in op. 5, no. 3 does not constitute a reduction in information content. Since it begins the
64 James Underwood, “Time and Activity in Webern's Opus 10,” Indiana Theory Review 3,
no. 2 (Winter 1980): 34-35.
76
with a more complex cello line and then shifted to the ostinato after a few measures, the
“Abendland III.” Alternately, if we compare the beginning of no. 3 with the ending of no.
2, we find a dramatic acceleration. Even though the first six measures of the cello line of
no. 3 have almost the same information content as the last four measures of no. 2, no. 3
Underwood notes that in Webern's op. 10, thinner, pointillistic textures unfold
over a shorter period of time than denser, lyrical or polyphonic passages.65 This matches
Barry's claims that, ceteris paribus, a dense passage requires more time to process than a
Barry also compares perception of time in the op. 22 Quartet and the op. 21
Symphony, with particular interest in the schemata through which the listener processes
the information s/he receives musically. Barry notes that both serial works necessitate
more difficult learning processes, because they contain “little or no redundancy – and it is
redundancy, as repeated ideas, sections and closing figures – which enables difficult
she finds missing in the Symphony, whose greater pointillism resists perceptions of
continuity and audible structure.67 Barry cites continuity and the listener's ability to
music; she finds their absence in op. 21 to be a cause of perceived spatial time. Lippman
concurs: "[E]very feature of music that makes for continuity and propulsiveness and logic
65 Ibid., 32-34.
here is systematically excluded, from sheer tonal persistence and scalar progression, to
forces of resolution and almost all perceptible varieties of repetition."68 In short, Barry
and Lippman find spatial time to be created by an influx of information that the listener is
not able to chunk into large units or process quickly enough as single pitches.
periodicity. Of the beginning of the Variations for Piano, op. 27, Rochberg writes: “The
beat and meter is now a frame, not a process – a frame on which to construct symmetries
of pitch and rhythm.”69 In the Concerto for Nine Instruments, op. 24, by contrast,
Rochberg finds an “intensity of rhythmic drive, and therefore direction.”70 The
the first movement of op. 27, “the beats... have no more relation to each other than the
seconds which a clock ticks off” – that “[u]nless the initial pulse of the meter is activated,
propelling itself upon the next pulse and the next, the beat and the meter become static
The strong beats of a measure acted as the guiding idea that ruled the order
of consonant and dissonant intervals. In the twentieth century... the arc of
tension and release remains in effect, but now is freed from its positioning
by weak and strong beats.72
70 Ibid., 103
71 Ibid., 102
72 Cesar Franchisena, “El tiempo en la composicion actual,” Revista del Instituto Superior
de Musica 3 (Nov. 1993): 118. (“Es asi que el tiempo fuerte de compás se comportaba como la
idea directriz que regía el orden de los intervalos consonantes y disonantes, etc. entrado el s.
XX... [e]l arco tensión-distensión permanece vigente, pero ahora liberado de su ubicación entre
los tiempos débiles-fuertes.”)
78
examines a single pitch from the fifth measure of op. 9, no. 5, only mentioning the rest of
the piece to contextualize the pitch's duration (expressing it as a fraction of the work's
total length, rather than as an eighth note). This approach seems justified by the pitch's
freedom from metrical accent, as in Rochberg's description of the first movement of op.
27; because the surrounding material does not impose any metrical obligations (i.e.,
metrical predictability) upon this particular pitch, Franchisena is free to consider the
temporal weight of this single pitch. Türcke takes a similar but less extreme view,
in different forms.73
In some sense, this is the spirit in which Forte and Escot's proportion-based
analyses are undertaken. Both demonstrate similarities that cross the borders between
discontinuous sections – Forte hearing these passages in terms of duration, and Escot in
terms of duration, tempo, pitch range, and other factors. Both Forte and Escot aim to
demonstrate large-scale structures through these proportions.74 One imagines that if these
structures were audible, they would not help the listener predict upcoming events, but
could help the listener identify the placement of section endings and beginnings as
reasonable in retrospect and thereby providing a framework for old and new musical
information.
73 Berthold Türcke, “Ein Stehenbleiben, das in die Welte geht: Der Gestus der Zeit in
Weberns Spätwerk,” Musik-Konzepte (November 1984): 12-13, quoted in Brunhilde Sonntag,
"Adornos Webern-Kritik," Musik im Diskurs Band 2: Adorno in seinen musikalischen Schriften
(Regensburg: Gustav Bosse Verlag, 1987), 149-150.
74 Allen Forte, “Aspects of Rhythm in Webern’s Atonal Music,” Music Theory Spectrum 2
(1980): 90-109; Pozzi Escot, “Toward a Theoretical Concept: Non-linearity in Webern’s op.
11/1,” Sonus 3: no. 1 (Fall 1982): 18-29.
79
mind Hasty’s and Kramer's hearings of the Stravinsky Symphonies of Wind Instruments:
the latter examining discontinuities and accepting the work's musical moments as discrete
units, and the former identifying similarities and interactions among these moments. This
distinction demonstrates a crucial point: although the rate at which the listener receives
information shapes his or her perception of time, what the listener chooses (or is
predisposed) to do with that information may have an even stronger effect. Within the
radio transmission metaphor, this situation is comparable to how the receiver chooses to
decode and interpret the original message. Any information-theoretic analysis that
extends beyond the presentation of statistics and into the interpretation of statistics
information are by no means coterminous, but the latter almost certainly influences the
insight into the role of musical expectation and prediction in works composed in that
CHAPTER VI
ANALYSES
The analyses of this chapter, which discuss the first of the op. 16 Fünf Canons and
the fourth of the op. 5 Fünf Sätze, deal with contrasting temporalities as determinant of
form. My hope is to demonstrate, through these analyses, the flexibility and utility of
Opus 16 (Fünf Canons) is the last of Webern's fully nonserial works, composed
from 1923-1924. Shreffler hypothesizes that Webern used these canons and the strict
pitch ordering they necessitate as a way of coming to terms with serial technique; she
notes that serial operations and invariance are in play throughout op. 16.75 Fittingly, the
first of these canons (the last one composed) articulates movement from one type of
A listener who is attentive enough to hear “Christus factus est” as a canon will
hear a clear rupture between the first and second phrases, when the canon breaks in m. 8.
The break is not subtle: the soprano enters forte half a beat earlier than expected,
initiating a string of accented eighth notes. Both the clarinet and bass clarinet are silent on
the first beat of this measure, giving the soprano her first unaccompanied moment in the
From a structural perspective, this canon break establishes the beginning of a new
section; the section is just as clearly ended when the canon breaks again in m. 11.
However, these canon breaks also facilitate the creation of many of the musical
characteristics that distinguish the second phrase from the phrases that surround it.
Perhaps most noticeably, the break in m. 8 initiates a period of metric confusion that lasts
until the canon breaks again in m. 11; when the soprano line enters a quarter note early in
m. 8, it disrupts the half-note tactus that had been established unmistakably by the
straight half notes that began each melodic line (and evidenced by seven measures of cut
time). To accommodate this metric disturbance, m. 8 shifts (jarringly) into 5/4. Measure 9
attempts to resolve this disturbance by returning to cut time – a hearing reinforced by the
perceptible downbeat on the word “Deus” – but the metrically-disturbed melodic line
pushes the listener back into 5/4 for measure 10. Canon figures in mm. 8-10 occur closer
together, either as a result or as a cause of this displacement. The soprano and clarinet,
originally separated by two half notes, are now separated by one half note; the soprano
and bass clarinet, originally separated by one half note, are now separated by a quarter
note – a jolting change from the first phrase, but one that allows a return to (something
As one might expect, the metric ambiguity of mm. 8-11 introduces greater
been heard in the piece thus far. By contrast, in m. 11 each part returns to its original clear
half-note rhythm, although a metrical displacement dissonance is evident among the
parts. This more tranquil rhythmic character is made possible by another break in the
This last canon break creates a sort of melodic variation. Although mm. 11-13 are
82
an audible return to the opening material, the canon has not “fixed” itself; what began as
a one-beat canon interval in mm. 1-7 becomes a half-beat interval in mm. 11-13. The
result is a stretto of the opening material, and a synthesis of the A section’s melodic
Vertical minor seconds make this change even more audible, as shown in Figure
6.1. The A section features many prominent minor seconds attacked on beats. In the B
on every eighth-note of m. 8 (once the A section has begin), on every beat of m. 9, and on
every off-beat of m. 10. When the melodic character of the A section returns in A’, the
minor seconds have become metrically displaced between voices, beginning in one voice
half a beat earlier than in the other – as though passage through the less-stable B section
has made the A section’s minor seconds less stable or less predictable. In all sections,
these vertical minor seconds usually occur with written accents or subito dynamic
The ternary form established by these events is form also supported by pitch
content. The soprano line of the first phrase contains only eight pitch classes: A, Bb, B,
C, C#, D, D#, and F#. Of these, D, A, C#, and Bb are by far the most prevalent; D, C#,
and Bb occur four times each, and A occurs three times. By contrast, the other pitch
classes occur at most twice; this is especially noteworthy given that the first phrase
encompasses less than eight measures, with an average of fewer than three notes per
measure. These four most prominent pitches form inversionally-related pairs at I11
(connecting A with D, and Bb with C#), which allows these four pitches to be similarly
prevalent in the bass clarinet line, which is related to the soprano line by I11. 76
76 Perhaps it is noteworthy that, much as Ligeti claims that Webern's music groups around a
central axis temporally, this inversion causes it to group around a central axis in pitch-space: its
axis of inversion. It is this axis that allows for the “tonal” qualities of the first phrase.
83
In fact, the prevalence of these pitches is so thorough in the bass clarinet and
soprano that it requires very little analytical perversity to interpret the first phrase of these
lines in D minor. The most common pitch classes are scale degrees 1, 5, 7, and b6. The
bass clarinet line begins with clear 5-7-1 motion, and the soprano with a slightly weaker
1-b6-5; the first phrase ends with 5-3-7-1 in the bass clarinet, and 1-#3-6-5 in the soprano
(implying a half-cadence at the end of the first phrase, mm. 7-8). In these lines, melodic
cells in the first phrase almost always end on D or A; the only notable exception is m. 4 in
F#-D in mm. 3-4), and its counterpart Ab in the bass clarinet only appears as #4 on its
way to 5 (as in the corresponding Ab-Bb-A neighbor figure in mm. 6-7, or Ab-F-A in
mm. 3-4). The remaining pitches generally serve as ornamentation of the four structural
the soprano. Scale degree 3 is slightly more problematic, appearing as F in the bass
clarinet and F# in the soprano, but it is always in close proximity to scale degree 1, and
can usually be interpreted as filling in the third-gap between scale degrees 1 and 5
soprano line). It also ignores octave displacement, which is jolting enough in places to
make any tonal interpretation difficult. Finally, although both the soprano and bass
clarinet lines can be read in D minor, the harmonic implications of their respective scale
degrees rarely line up; it is easy to hear the bass clarinet in m. 7 as 5-3-7-1 when the line
is played in isolation, but slightly more difficult when this line is overlaid with b2-7-1-#3
in the soprano.
Nevertheless, one can argue that the first phrase is markedly more tonal than the
86
second phrase. Leaps of intervals 11 and 13 are built into an eighth-note motivic cell that
characterizes this phrase; in these measures, the soprano has three leaps of interval 11 and
four of interval 13, and these intervals are of course preserved in the clarinet and bass
clarinet lines. This allows Webern to exhaust the aggregate by the last beat of m. 8. with
the exception of B, which never appears in this phrase.77 Whether or not aggregate
completion per se is an audible feature of this phrase is debatable, but the speed with
which the aggregate is completed certainly speaks to the chromatic nature of this phrase.
Almost paradoxically, the leaps of intervals 11 and 13 that characterize this phrase
allow the canon lines to have more pitch classes in common than were found in the first
phrase. The first four eighth notes in the clarinet and bass clarinet have identical pitch
class contents but in retrograde (possible since the pitch class set {4, 3, 6, 5} is invariant
under I9) – a tradeoff made more noticeable by the closer timing of canon entries. The
eighth note pattern E-D#-D#-D in the soprano in m. 9 can be heard as a fuzzier echo of
this same pitch class set (bearing in mind that this set is immediately followed by a D in
its first and more prominent entrance). These similarities in pitch content break down
fairly quickly, making it difficult to argue them as a unifying feature of the second phrase,
but it is no stretch to consider them a device that emphasizes the break between the first
chromatic fashion (relative to the postulated D minor of the A section). The first two
notes are preserved both in rhythm and pitch – closely enough that the return of the
opening melodic material is easily audible – and the overall contour of the melodic lines
in mm. 11-13 is strikingly similar to the contour of mm. 1-4. The melodic line repeats no
pitches and suggests no tonal interpretations. One can argue that this progression
77 However, this interpretation requires that the Bb and A on beat one of m. 8 in the soprano
be counted as part of this aggregate. Although they are part of the first phrase, they overlap with
the beginning of the second phrase in the clarinet.
87
represents a development of the melodic material of the A section; having passed through
the highly chromatic B section, the melodic material becomes chromatic itself. It is
certainly noteworthy that the most tonal section of “Christus factus est” is also the most
rhythmically straightforward; in fact, we can argue that the rhythmic ambiguity of the
second phrase imbues the return of the first phrase with rhythmic complexity, much in the
same way that the chromatic content of mm. 8-11 imbues mm. 11-13 with chromaticism.
Despite these sharp distinctions, at first glance pitch class entropy – shown in
Table 6.1 and Figure 6.2 – tells us very little about “Christus Factus Est.” These numbers
mask the distinctions drawn above. Strong tonal implications are present in each line of
mm. 1-8, but the three lines implying different keys cancel each other out as far as
11 do not present tonal implications, these measures still contain repeated pitches – and,
as mentioned, quite a few pitch repetitions between instruments. Measures 11-12 present
no repeated pitches (within any given part), but these measures also contain only twenty-
Isolating the vocal line yields results closer to our intuitions for this piece. While
the results in Table 6.2 and Figure 6.3 show exactly what one might expect – lower
entropy in the vaguely-D-minor A section and higher entropies in later sections – none of
these results is statistically significant. As mentioned, the A’ section contains only eight
pitches; the A and B sections contain a slightly more respectable twenty-three and
nineteen, respectively, but neither sample is large enough to generate significant results.78
Table 6.2: Pitch class entropy in the vocal line of op. 16, no. 1
Figure 6.3: Pitch class entropy in the vocal line of op. 16, no. 1
Analysis of CSEG entropy (omitting the least common CSEGs) leads to the results given
in Table 6.3.
The A section is built from a series of arches; the A' section is a single arch. Compared
with these, the B section is much more complicated in terms of contour, despite the
class content, each section is significantly distinct from the other sections – and what was
intuitively the most chromatic section in terms of pitch becomes the least chromatic in
terms of interval class. It is almost as if the pitch and interval contents of the A and A’
sections are inverted across the axis of the B section; we pass through a period of contour
complication, and when we emerge on the other side, pitch and interval entropies have
been reversed.
79 This analyses also points out the convenience of entropy measures. The pitch-based
argument for ternary form took several pages, but since the interval-class argument is based on
entropy, it takes no time at all.
91
For tetrachords and hexachords, we see pc-set predictability increase over time, as
In other words, “Christus factus est” articulates a progression from one type of
linear directed time – corrupted by the canonic repetitions, which obscure the tonal
content of each individual line. The B section lacks this tonal organization, but achieves
similar results by sheer momentum; without context it seems most likely the listener
chromaticism setting up a return to the diatonicism of an expected A’. Even if the listener
makes different predictions for the B section, it seems clear that the section is headed
somewhere – established by meter changes and by shorter rhythmic durations than heard
93
thus far. But rather than returning to anything diatonic for the A’ section, Webern instead
presents a chromaticized version of the contour that opened the piece. This last section
occurs in non-directed linear time; the low interval-class entropy provides a sense of
continuity, but not in a way that prompts the listener to predict goals. Thus, Webern’s
progression from pitch class organization to interval class organization can also be
This progression is a salient feature of the text, as well. The first phrase of the text
is temporally unambiguous: “Christ became obedient even unto death, death on a cross.”
Instead, it is the second phrase (along with the text that accompanies the return of the first
phrase musically) that introduces temporal ambiguity, by calling into question the
succession of events discussed: “Because of this, God raised Him and bestowed upon
Him a name which is above all other names.” From a temporally superficial perspective,
Christ possessed the name to which the text refers long before the events of the
crucifixion upon which this obedience depends and through which this obedience was
demonstrated (at least insofar as the term 'before' retains meaning in this context); thus,
the effect (exaltation) precedes the cause (crucifixion). Traditionally, Catholic theology
offers the explanation that these events are in fact simultaneous (since “in the Eternal...
Webern described the texts of the original three canons as representing a coherent
a reflection on the crucifixion of Christ and a reflection on baptism, dealing with birth
and death simultaneously. Alternately, one can argue that “Asperges Me” represents a
temporal disconnect because its text is in the future tense – “Thou shalt wash me, and I
shall become whiter than snow” – when the event that “washes” the souls of the faithful
has already happened, both in the literal sense (since the crucifixion occurred two
thousand years ago) and in the modular sense (since this particular text is a prominent
part of the mass for Holy Saturday, which is after the crucifixion within the chronology of
possibly stopping.”82 Under this interpretation, the texts of these canons necessitate the
creation of sacred time for its traditional purpose: contemplation of the temporal nature
of the eternal (or the spatial nature of the omnipresent), and the establishment of a
connection between the events that underlie the formation of Christianity with the present
Just as the textual events of the second half of “Christus factus est” cause us to
question our perceptions of the nature of time, the musical events of the second half cause
us to question our assumptions regarding the temporal nature of the piece. Musical
material that seems directed (if not upon first hearing) in the first phrase is made
adirectional upon its return at the end of the piece. Similarly, just as the temporal
discontinuity contained within “Christus factus est” leads us to perceive the texts of the
other canons as temporally ambiguous (or, rather, to interpret them within a larger,
temporally ambiguous framework), the musical atemporality of this canon highlights the
Furthermore, one might argue that placing these texts within this larger temporal
considered within the context of musical form. The act of labeling mm. 1-8 as A and mm.
11-13 as A' confers upon them a degree of simultaneity, insofar as it establishes them as
parallel – but what does it mean for something directional to be parallel to something
simultaneous?
directional with the circuitous, and Bhabha's discussion of the necessity of a “temporality
event of theory becomes the negotiation of contradictory and antagonistic instances that
open up hybrid sites and objectives of struggle, and destroy those negative polarities
between knowledge and its objects, and between theory and practical-political reason.”83
Elsewhere in the same text, Bhabha refers to critical theory as a parallel to political
activism, implying that critical theory is faced with the same seemingly-paradoxical goal
of reconciling conflicting political objectives and also reconciling this conflict with the
Perhaps mathematical set theory also has insight to offer. We define an infinite set
as a set which has itself as a proper subset – in other words, a set that can contain itself as
well as other things, a set which can be therefore intuitively “smaller” than itself. This
seems paradoxical; intuitively, we believe that there ought to be fewer even numbers than
there are whole numbers, since only half of the whole numbers are even, but these two
sets are both the same size, since they are both countably infinite. However, the cause for
alarm lies not in our definition of an infinite set, but in our intuitive (lack of)
a contradiction at all (as Kramer suggests84), but rather an intrinsic quality of non-
Here, Ligeti is referring to retrograde motion and musical palindromes, as well as the
recast this argument in terms of formal structures, however: “Christus factus est” can be
“chromatic” in pitch content. Thus, although Christopher Hasty disagrees with Ligeti –
events, both superficially (“a lyrical continuity which can be felt in much of [Webern's]
music”) and fundamentally – these same aspects of form that unite and give direction to
85 Gyorgy Ligeti, “Metamorphoses of Musical Form,” Die Reihe 7 (Bryn Mawr, PA:
Presser, 1965), 16.
Op. 5, no. 4
are at the heart of the listener’s perception of form in the fourth of Webern's Fünf Sätze
Für Streichquartett, op. 5. As the listener progresses through the movement, he or she
experiences frequent shifts among non-directed linear time, multiply-directed linear time,
and directed time brought about by these changes – eventually creating the same
parallelism between various forms of time heard in op. 16. Since predictability and
directedness are exactly what entropy hopes to measure, it stands to reason that entropy
should have useful things to say about the passage of time in this movement – even
position statistically than the song cycles that typify discussions of entropy in tonal
music.
descending, mostly quarter-note figures that travel from instrument to instrument; the end
(mm. 6-10) is characterized by drones in the cello and second violin and by an
accompaniment part in triplets in the viola, throwing the first violin's arco melodic line
(so zart als möglich) into relief. In m. 10 a triplet variant on the ascending figure found in
m. 6 returns to mark the end of the B section. In the remaining measures, the material
from mm. 1-6 returns in condensed form; the second violin line and viola lines in mm.
11-12 are identical in pitch content (except for octave displacement) to the cello line of
m. 5, and the second violin line in mm. 11-12 is similarly taken from the first violin line
of m. 5, while the first violin line recalls the viola line of this same measure.
Much scholarly work on this movement concerns itself with the ascending figure
heard in mm. 6, 10, and 12-13. Lewin terms this the FLYAWAY motive, and notes a
relationship between its three forms: m. 6 is mm. 12-13 transposed up by the first interval
98
of the series, and m. 10 when transposed up by its last interval produces mm. 12-13.
Lewin diagrams this relationship as in Figure 6.1. In this diagram, Lewin depicts the
closing figure (mm. 12-13, Ab-C-D...) as central, as balanced between its two other
For Lewin, the FLYAWAY motive not only demarcates formal boundaries, but helps us to
interpret the roles of these sections, allowing us to hear the A' as synthetic of and central
Others have noted relationships between the flyaway motive and the pitch content
of the A and B sections. Perle writes that the first four notes of m. 6's motive match the
first violin line in mm. 1-2, uniting material from (0156) and (0167).88 Similarly, Perle
hears the final pitch of m. 13's motive, F#, as recalling the first violin's F#s in m. 2 and m.
12. Arguably, this hearing brings more closure than the previous appearances of the
87 David Lewin, Generalized Musical Intervals and Transformations (New Haven and
London: Yale University Press, 1987), 188-189. Diagram appears on 188.
88 George Perle, Serial Composition and Atonality (Berkeley and Los Angeles: University
of California Press, 1981), 16-18.
99
motive, since m. 13 is the only time the motive ends on the same pitch class as the phrase
immediately before it. The motive in m. 6 ends on Bb, not reaching the C that began and
ended the first violin's line in the same measure (in each case heard an octave lower); the
motive in m. 10 ends on on Eb, overreaching both the first violin's C and the viola's D in
m. 9. This interpretation seems parallel to Lewin's, as the final appearance of the motive
becomes central to the earlier appearances: the first underreaches, the second
Alternately, one can hear the final pitches of the m. 10 and m. 13 motives as
recalling the registral extremes of mm. 1-2. These pitches form the upper extremes of
later sections; F# is the highest pitch present in both the A and A' sections, while Eb is the
highest pitch present in the B section. The motive in m. 10 ends on the highest pitch of
the section it completes, achieving provisional closure, but the motive in m. 13 extends
Perle notes that the viola's E and F# are the only pitches not held in common between {B
C E F} and {B C F F#}; bearing this in mind, the viola's movement between E and F#
The viola line remains preoccupied with these pitches until m. 5, when the first
violin and cello shift from (0167) to (0125) – that is, a transformation of (0156), with the
5 held constant and the other pitches transposed up by ic3 (literally, B, C, and E are
transposed up by ic1 while F is transposed down by ic3). This change marks the
most of the A section – in which case m. 5 marks not a change in pitch material but a
100
change in how this pitch material is realized.89 Measures 1-4 in the violins are heard in
terms of linear (01267)s – but by the cello's imitative entrance at the end of m. 4, these
(01267)s have become cross-sections, indicating a more rapid harmonic motion. The
89 David Clampitt, “Ramsey Theory, Unary Transformations, and Webern's Op. 5, No. 4,”
Integral13 (1999): 84. Diagram appears on 89.
102
previously did not; a pitch that is initially heard in terms of the preceding pc-set (such as
the cello B natural in m. 4) is then heard as member of the succeeding pc-set. Melodies
that in mm. 3-4 seemed directed and clear become open to reinterpretation. In either case,
a distinction between mm. 3-4 and m. 5 is apparent. Changes in the melodic character of
m. 5 sharpen this distinction. The imitated melody of mm. 3-4 is unstable in terms of
range, descending from F#6 to C2 in less than three measures, whereas the imitated
melody of mm. 5-6 sits comfortably within a perfect fourth. In mm. 3-4, steepness and
imitation create propulsion; the listener may not be able to guess C as the “goal” of this
passage, but the passage's direction is clear. No such sense of large-scale direction is
When the flyaway motive makes its first appearance in m. 6, it returns to the pitch
materials and the tessitura of mm. 1-4. Analyzed in terms of pitch class sets, this motive
is an (0167) joined with an (016), one ic1 away from the combined pitch content of mm.
1-2. Boretz analyzes the pitch material of mm. 1-2 as a succession of ic7s joined by ic6s,
impression is that of a conclusion, although not a synthetic one; after the contrasting
material of m. 5, the listener has returned to material from the beginning of the section.
At the same time, m. 6 departs from the few commonalities heard between mm. 3-
4 and m. 5. The melodies in mm. 3-4 and m. 5 were both presented imitatively (the latter
less frequently), and both melodies descended (the latter less drastically) – whereas m. 6
90 Benjamin Boretz, “Meta-Variations Part IV: Analytical Fallout (I),” Perspectives of New
Music 11: no. 1 (Autumn-Winter 1972): 217-223.
103
is a monophonic ascent. The pitch materials of m. 6 resolve the contrasts between mm. 3-
4 and m. 5, but the motive's texture and direction introduce contrasts of their own –
The B section focuses less on the contrasts that characterized the A section. In
terms of pitch material, the B section is stable, and can in fact be analyzed wholly in
terms of two pc-sets: (0347) in the first violin, and (01468) in the lower three voices. The
first violin can be analyzed as a pair of interlocking major thirds, (04) and (37), in
keeping with the augmented triad articulated in the viola, or as a pair of concatenated
minor thirds, (03) and (47), in keeping with the prominent minor thirds it presents
melodically. Certainly the B section is stable in terms of contour, as well. In short, the B
section provides contrast to the A section because it lacks contrasts within itself.
The A' section resolves all three of the contrasts presented in this movement:
(0156) vs. (0167), mm. 1-4 vs. m. 5, and A vs. B. The pc-set of mm. 11-12, (01267), is
(0156) merged with (0167) – and is the pitch-class content of mm. 1-4 presented in the
contour of m. 5. To some degree the imitative character of the A section is retained, most
notably between the second violin/viola and the cello, but the metric alignment between
voices and the shared pitch material among the lower three voices presents homophony
and stasis, as heard in the B section. The stable ranges of m. 5 and mm. 7-9 are preserved,
converting what sounded in mm. 1-2 as a tetrachord plus a singleton into a pentachord.
The A' section also adopts the timbres of the B section: an arco first violin over am Steg
lower voices, presented with harmonics not heard in the A section. Burkhart notes that the
octave doubling between the second violin and viola echoes the octaves between the
second violin's sustained B and the first violin's centric B in mm. 7-10.91
This same resolution of contrasts is evident through entropy-based analysis. The
91 Charles Burkhart, “The Symmetrical Source of Webern's Op. 5, No. 4,” The Music
Forum 5 (1980): 325.
104
following tables show pitch and interval class for each formal section. (These data are
computed without the flyaway motive.) These figures show support for a ternary hearing
– that is, the A and A' sections are both significantly different from the B section, which
has a much lower entropy than either. Officially we cannot conclude anything from the
fact that entropy cannot tell the A and A’ sections apart (as a result of wide confidence
intervals, owed in part to small sample sizes) but unofficially the entropy scores for A and
A’ are very close, leading one to believe that these sections are stylistically very similar.
Pc-set entropy results prompt the same conclusions, as shown in Table 6.8 and
Figure 6.13. The B section has extraordinarily low entropy – significantly lower than any
other section at any other cardinality, with the exception of the A’ section divided into
hexachords – but only the second violin line is long enough to support hexachordal
divisions. As a result, the discrete hexachords are very predictable, since they are very
few. In smaller cardinalities, the A’ section is consistently more predictable than the A
section but less predictable than the B section, which supports the synthetic conclusions
drawn earlier.
Rather than speaking to similar pitch or interval materials, though, the differing
entropy scores speak to the consistency or predictability with which these materials are
used. If Webern had used half steps in the A section with the same consistency as the
major thirds in the B section, both A and B would have the same interval class entropy;
instead, the fact that B has a much lower interval class entropy than A implies that the B
section is more saturated with its primary intervals than the A section is with its. In other
words, the major and minor thirds of the B are aurally more predictable than the minor
This decreased predictability reflects the contrasts heard throughout the A section,
and the diversity of pitch and intervallic material that create these contrasts. If the higher
108
entropy of the A section only resulted from these contrasts, though – that is, from the
juxtaposition of mm. 1-4 and m. 5 – we would expect either section on its own to have an
entropy score more like that of the B section. Tables 6.9ff show that the opposite is the
case.
Measures 1-4 are markedly less predictable than mm. 7-10 in all types of entropy.
Measure 4-5 are not significantly different from mm. 7-10 in terms of pitch entropy –
which is reasonable, since both passages are characterized by small, stable pitch ranges –
but show significant differences in both types of interval class entropy shown above. As a
result, we cannot conclude that the higher entropy of the A section (when considered as a
whole) is owed entirely to contrasts between mm. 1-4 and mm. 4-5. Instead, the
paths that are not taken. This character first becomes apparent in m. 4, the first
transformational activity of the movement. All (0167)s heard thus far have been
111
expressed as the pitch collection {B C F F#}; the movement's sense of propulsion thus far
F#}, which has spanned the first three measures. The second violin's imitation of the
descending figure marks a change from the predictability and propulsion of an alternation
transformations away from these pitch collections – a change reflected in the section's
higher pitch entropy score. Then, after the listener is given evidence that canonical
transforms will be relevant, the first violin shifts to a contextual transform of this material
at the pickup to m. 5.
The imitative figure of mm. 3-4 presents its own sort of directedness. A tonally-
biased listener may hear these figures as closing gestures, especially since each begins
with a descending perfect fifth and ends with a descending perfect fourth – but even in
the absence of these tonal connotations, a gesture that descends as quickly as this one
seems directed simply because it must end when it reaches the bottom of an instrument's
or an ensemble's range. The existence of Shepard tones seems to prove this point; they
are disorienting because they are descending lines that do not land or run out of the
potential for further descent, implying that landing is exactly the behavior we expect from
a descending line.
That these gestures have a “goal” seems clearer when the cello begins to repeat
the exact pitch classes heard in the first violin. Not only does the listener know that the
larger descent must end, but s/he can now predict where it will end: C. The cello does
reach this goal, in m. 5 – but not until after the first violin's new melody has already
begun. By the time the goal is reached, it is no longer the goal; the counterpoint between
the first violin and the cello has become more perceptually salient than the completion of
the cello’s descent. Even if the cello line is heard in isolation, the elision of the
descending (0167) gesture with the cello’s m. 5 melody makes it less likely the C will be
repetition of B natural (which appears each time in this line with a longer rhythmic
duration than the surrounding notes) creates a perception of motion away from a center
(on bts. 2 and 3 of m. 8) followed by a return to this center (m. 9), a perception supported
by the pedal B natural in the second violin.92 It is quite likely that the listener predicts a
return to B following the C “neighbor tone” at the end of m. 9 – a B that can be heard in
the second violin line in m. 10, whose sustained B extends beyond the end of the first
violin’s melodic line. The result is something more like directed linear time – motion
towards and away from a clear pitch center, contrasting with the multiple averted goals of
the A section.93 Of course, this directed melody occurs over the metronome of the viola,
creating a clock-like temporal regularity.
On a large scale, these two approaches to time are united by the flyaway motive.
The motive establishes almost the reverse of multiply-directed linear time: it serves as a
reasonable goal for each section (if a goal that is only evident in retrospect), but each
section makes this goal reasonable in a different way. That is, the presence of the flyaway
motive leads the listener to think of the movement’s formal sections as parallel in some
motive that repeats no pitches could belong to a fully diatonic piece (recall that, as far as
entropy is concerned, all heptatonic scales are the same, maximally even or otherwise), or
92 The E pedal in the bass can be heard as creating a sense of E major, albeit one
complicated by the Gb augmented triad in the viola. Whether or not the listener chooses to hear
this passage in a key, though, it seems clear that B is the “center” of the first violin line – that a
descending-fifth leap down to E at the end of the passage would seem jarring, not conclusive or
confirming.
93 Of course, any sense of propulsion created by the first violin in mm. 7-10 is limited (or at
least complicated) by the scope of the line. One can argue that propulsion is heard for
approximately half a measure, when the first violin moves away from B natural, and that outside
of this window the passage reverts to stasis, as confirmed by the lower three lines.
113
it could belong to a serial work; without repetitions or a larger sample size, we cannot say
which. The motive's intervallic presentation leaves us with the same problem; <+4 +2 +5
+2 +6 +3> only repeats one interval class, so we do not have enough information to draw
Nevertheless, the musical features that make this motive a logical goal (if only
through the combination of predictable pitch material and unpredictable texture, register,
and contour – the former creating continuity between mm. 1-5 and m. 6, and the latter
ensuring that m. 6 is not simply heard as a continuation of mm. 1-5. If the listener were
inclined to predict this motive’s arrival as the beginning of a new subsection of A, rather
than its conclusion, this projected subsection would still bear significant formal change; it
could be heard as parallel to the entire A section thus far, perhaps, but not simply as a
continuation of it, as a result of the unpredictability the motive introductes. When the
motive returns in m. 10, it provides contrast with the more centric contours of the B
section, and presents in a single measure of one instrumental line almost as many distinct
pitches as were heard in the entire B section. These discontinuities lead the listener away
from the single-directedness of the B section and back into the multiply-directed time that
characterizes the piece as a whole – not resolving the first violin’s C, but removing the
sense of linearity and directedness that necessitated the C’s resolution in the first place.
Of course, the motive’s reappearance in m. 10 also accesses the closure the motive
created in m. 6; since it has been designated as a signal of closure previously, we are
A section with those that led to single-directedness in the B section, the result is stasis –
almost moment time. The contours and stable pitch ranges of m. 5 and mm. 7-10 are now
presented without direction; we do not hear movement toward or away from a single
114
motion away from the pc-sets of mm. 1-4. Instead, we hear a pc-set closely related to the
pc-set of the flyaway motive, the (0167)+(0156) that the flyaway motive was heard as a
logical extension of when it first appeared in m. 6. That is, mm. 11-12 present no
propulsion in terms of pitch content because we have already reached the goal. We hear a
pc-set that now implies stasis presented within static contours and ranges.
The pizzicati in m. 12 close off this passage, echoing the pizzicati that closed mm.
1-2 and drawing additional strength from their low register (dropping twenty-two
semitones in the second violin, twenty-three in the viola, and eighteen in the cello).
Ending the passage of stasis in mm. 11-12 makes real closure possible, and we hear the
The stasis of these measures is born of the synthesis of the A and B sections.
Perhaps in some sense moment time is itself synthetic of multiply-directed and singly-
directed time – the stability of the latter combined with the problematized directionality
of the former, the predictability of the latter combined with the unpredictability of the
former. Under this interpretation, the A' section resolves not just the pitch, texture, and
contour contrasts between the A and B sections, but also the contrasting directionalities of
these sections.
115
CHAPTER VII
CONCLUSIONS
Perhaps the greatest utility of information theory entropy as an analytic tool lies in
its versatility. A statement from one of the earliest articles on information theory is often
This is a theory so general that one does not need to say what kinds of
symbols are being considered... the theory is deep enough so that the
relationships it reveals indiscriminately apply to all these and to other
forms of communication.94
Nearly all musical phenomena can, in some sense, be considered information. The analyst
has a great deal of latitude in determining how best to understand a particular musical
description and analysis of almost any musical work, and the calculations that result can
entropy is not the mindless plugging of numbers into formulas, any more than statistical
hypothesis testing is. Rather, it is reflective of the analyst's choices and assumptions.
alphabet based on beat class sets could offer insight into the predictability of a work's
meter, providing an easy method of comparison between works. This analytical approach
would be especially useful for rap, in particular American rap of the late 1990s and
2000s, in which accent patterns created by rhyme scheme can be an important designator
of form. Results obtained using this alphabet may provide insight into larger-scale trends
in the genre with regard to metric dissonance articulated by rhyme scheme. Beat class
entropy, if used in conjunction with CSEG entropy, could also provide a more
This approach would not only provide insight into particular composers' use of
between wildly different musical styles – comparing, for example, canonic and contextual
the transmission and reception of information, however one wishes to define that term. It
is information, in some sense, that Adorno describes when he writes, regarding the op. 5
Fünf Sätze, “Their intensity of concentration is what makes them a totality: a sigh, as
Schoenberg noted with admiration, was worth an entire novel, a tense gesture of three
notes on the violin was literally the equal of a symphony” – a density of information that
alters the listener’s perception of time and informs nearly every aspect of the listening
have explored entropy’s ability to quantify objective musical perceptions; what remains is
entropy’s ability to describe the non-objective, to act as a quantitative metaphor for the
95 Theodor Adorno, “Anton von Webern,” in Sound Figures, trans. Rodney Livingstone
(Stanford, CA: Stanford University Press, 1999), 96.
117
BIBLIOGRAPHY
Adlington, Robert. “Moving Beyond Motion: Metaphors for Changing Sound.” Journal
of the Royal Musical Association 128: no. 2 (2003): 297-318.
Adorno, Theodor. Essays on Music, ed. Richard Leppert, trans. Susan Gillespie. Berkeley,
Los Angeles: University of California Press, 2002.
-------. Quasi una Fantasia: Essays on Modern Music, trans. Rodney Livingstone. New
York: Verso, 1998.
-------. Sound Figures, trans. Rodney Livingstone. Stanford: Stanford University Press,
1999.
Augustine. The Confessions of St. Augustine. Mineola, New York: Dover, 2002.
Bailey, Kathryn. “Rhythm and Meter in Webern’s Late Works.” Journal of the Royal
Musical Association 120: no. 2 (1995): 251-280.
-------. The Twelve-Note Music of Anton Webern. Cambridge University Press, 1991.
Barry, Barbara. Musical Time: The Sense of Order. Stuyvesant, NY: Pendragon Press,
1990.
Bhabha, Homi. The Place of Culture. London, New York: Routledge, 1994.
Boretz, Benjamin. “Meta-Variations Part IV: Analytical Fallout (I).” Perspectives of New
Music 11: no. 1 (Autumn-Winter 1972): 146-223.
Boykan, Martin. Silence and Slow Time: Studies in Musical Narrative. Oxford:
Scarecrow Press, 2004.
Broyles, Michael and John Titchner. “Meyer, Meaning, and Music.” Journal of Aesthetics
and Art Criticism 32: no. 1 (Autumn 1973): 17-25.
Burkhart, Charles. “The Symmetrical Source of Webern's Op. 5, No. 4.” The Music
Forum 5 (1980): 317-334.
Clampitt, David. “Ramsey Theory, Unary Transformations, and Webern's Op. 5, No. 4.”
118
Clifton, Thomas. “The Poetics of Musical Silence.” The Musical Quarterly 62: no. 2
(April. 1976): 163-181.
Drew, James. “Information, Space, and a New Time-Dialectic.” Journal of Music Theory
12: no. 1 (Spring 1968): 86-103.
Eddington, A. The Nature of the Physical World. Ann Arbor: University of Michigan
Press, 1935.
Escot, Pozzi. “Towards a Theoretical Concept: Non-linearity in Webern’s op. 11, no. 1.”
Sonus 3: no. 1 (Fall 1982): 18-29.
Forte, Allen. “Aspects of Rhythm in Webern’s Atonal Music.” Music Theory Spectrum 2
(Spring 1980): 90-109.
-----------. The Atonal Music of Anton Webern. Yale University Press, 1998.
Franchisena, Cesar. “El tiempo en la composicion actual,” Revista del Instituto Superior
de Musica 3 (Nov. 1993): 109-135.
Fraser, J. T. “The Art of the Audible Now.” Music Theory Spectrum 7 (Spring 1985): 181-
184.
Hawes, Vanessa. “Number Fetishism: The History of the Use of Information Theory as a
Tool for Musical Analysis.” Music's Intellectual History 2009, ed. Zdravko Blazekovic
and Barbara Dobbs Mackenzie. New York: RILM, 2009.
Hellmuth Margullis, Elizabeth and Andrew Beatty. “Musical Style, Psychoaesthetics, and
119
Prospects for Entropy as an Analytic Tool.” Computer Music Journal 32: no. 4 (Winter
2008): 64-78.
Hessert, Norman. “The Use of Information Theory in Musical Analysis.” Ph.D diss.,
Indiana University, 1971.
Hiller, Lejaren and Calvert Bean. “Information Theory Analyses of Four Sonata
Expositions.” Journal of Music Theory 10: no. 1 (1966): 96-137.
Hiller, Lejaren and Ramon Fuller. “Structure and Information in Webern's Symphonie,
op. 21.” Journal of Music Theory 11: no. 1 (Spring 1967): 60-115.
Knopoff, Leon and William Hutchinson. “Information Theory for Musical Continua.”
Journal of Music Theory 25: no. 1 (Spring 1981): 17-44.
-----------. “On the Entropy of Music: The Influence of Sample Length.” Journal of Music
Theory 27: no. 1 (Spring 1983): 75-97.
-----------. “Multiple and Non-Linear Time in Beethoven’s Op. 135.” Perspectives of New
Music 11: no. 2 (Spring/Summer 1973): 122-145.
-----------. “New Temporalities in Music.” Critical Inquiry 7: no. 3 (Spring 1981): 539-
556.
Levinson, Jerrold. Music in the Moment. Ithaca and London: Cornell University Press,
1997.
Lewin, David. Generalized Musical Intervals and Transformations. New Haven and
London: Yale University Press, 1987.
-----------. “A Metrical Problem in Webern’s Op. 27.” Music Analysis 12: no. 3 (1993):
343-354.
120
Lissa, Zofia. “The Temporal Nature of a Musical Work.” Journal of Aesthetics and Art
Criticism 26: no. 4 (Summer 1968): 529-538.
Manzara, Leonard, Ian Witten, and Mark James. “On the Entropy of Music: An
Experiment with Bach Chorale Melodies.” Leonardo Music Journal 2:no. 1 (1992): 81-
88
-----------. “Meaning in Music and Information Theory.” Journal of Aesthetics and Art
Criticism 15: no. 4 (June 1957): 412-424.
Nattiez, Jean-Jacques. The Battle of Chronos and Orpheus. Oxford: Oxford University
Press, 1993.
-----------. Music and Discourse. Princeton, NJ: Princeton University Press, 1990.
Perle, George. Serial Composition and Atonality. Berkeley and Los Angeles: University
121
River Jones, James. “Some Aspects of Rhythm and Meter in Webern’s Op. 27.”
Perspectives of New Music 7: no. 1 (Autumn/Winter 1968): 103-109.
Roeder, John. “A Calculus of Accent.” Journal of Music Theory 39: no. 1 (Spring 1995):
1-46.
Rosenfield, Lawrence. Aristotle and Information Theory. Paris, the Hague: Mouton,
1971.
Shelley, Percy. A Defence of Poetry and Other Essays. 1840; Project Gutenberg, 2005.
http://www.gutenberg.org/etext/5428.
Sherburne, Donald. “Meaning and Music.” Journal of Aesthetics and Art Criticism 24:
no. 4 (Summer 1966): 579-583.
Shreffler, Anne. “Mein Weg geht jetzt vorueber': The Vocal Origins of Webern's Twelve-
Tone Composition.” Journal of the American Musicological Society 47: no. 2 (Summer
1994): 275-339.
Stockhausen, Karlheinz. “Structure and Experiential Time,” Die Reihe 2, trans. Eric
Smith. Bryn Mawr, PA: Presser, 1959: 64-74.
Türcke, Berthold. “Ein Stehenbleiben, das in die Welte geht: Der Gestus der Zeit in
122
Underwood, James. “Time and Activity in Webern's Opus 10.” Indiana Theory Review 3:
no. 2 (Winter 1980): 31-38.
Yuogi, Joji. “Temporality and I: From the Composer’s Workshop.” Perspectives of New
Music 31: no. 2 (Summer 1993): 216-228.