7 views

Uploaded by goni56509

Attribution Non-Commercial (BY-NC)

- Tut+7
- Introduction to Bayesian Image Analysis
- The Hexatonic Systems Under Neo-riemannian Oshita
- grup9
- Permutations to Solve Probability Problems
- K 2 FSchoefs
- Homework 1
- Probability & Mathematical Expectation
- MODULE 1 Counting Techniques
- The Usb Problem
- Apma Guide
- Huang Multi Loop Control
- Bayesian_Learning_Introduction_bayes08.pdf
- Chap 2
- Worksheet Independent Events 2
- Performance Analysis of Selected Classifiers in User Profiling
- Basic Probability
- Problem Set 1
- Presentation
- Slide Pak Ronnie.pdf

You are on page 1of 2

David Temperley Eastman School of Music University of Rochester 26 Gibbs St. Rochester, NY 14604 dtemp@mrl.esm.rochester.edu where s indicates segments and M is a modulation score. For the first segment, M = 1/24 for every key. For each subsequent segment, M = .998 if the key of the segment is the same as the previous one; if not, M = .002 / 23. (These parameters were arrived at through trial-and-error testing.) We now consider the probability of a surface given a structure. We represent the surface simply as a series of pitch-class sets, one in each segment. Let us assume that, in each segment, the composer makes twelve independent decisions as to whether or not to use each pitchclass, depending only on the key of the segment. (Thus we are concerned only with which pitch-classes are present in a segment, disregarding the number of events of that pitch-class in the segment or their duration.) These probabilities are represented in key-profiles [3]. The key-profiles used in the model are shown in Figure 1. The profiles show scale-degrees (pitch-classes in relation to a key); for example, in C major, C is 1 and F# is #4. The key-profiles were derived empirically, using a corpus of excerpts taken from the Kostka and Payne music theory textbook [4], in which keys are explicitly marked. The corpus was divided into segments and analyzed to find the proportion of all major-key segments containing each scale-degree; this yields the value for each scale-degree in the major-key profile. For example, the profile shows that .748 of segments contain scaledegree 1. The process was then repeated for minor-key segments.

ABSTRACT

This model determines the key of a piece (represented as a MIDI file) using key-profiles, in combination with Bayesian principles. The model divides the piece into short segments; it then searches for the most probable key structure, where a key structure is a labeling of each segment with a key. The probability of a key structure is a function of the number of modulations in the structure, and the fit of the key of each segment to the pitch-classes in the segment. The main key for the piece is then defined by the key of the first of segment of this structure. Keywords: key-finding, Bayesian modeling.

1 INTRODUCTION

In this abstract I present a model for key-finding in polyphonic music, using symbolic (MIDI) input. The model uses key-profilesan ideal pitch-class distribution for each keyin combination with Bayesian principles. It is essentially the model presented in [1] (though with different parameters) and [2].

2 THE MODEL

Given a MIDI file as input, the model begins by creating a list of notes with the on-time, off-time and pitch of each note. It then divides the piece into segments of 1.2 seconds in length. (In [1] and [2], the segments were defined by the metrical structure; in this case, no metrical information is available.) The model then searches for the optimal key structure, where a key structure is a labeling of each segment with a key. Here we use Bayesian reasoning. We are confronted by a pattern of pitchesa surfaceand we want to know the underlying key structure that gave rise to itthe structure. Bayes rule tells us that P(structure | surface) P(surface | structure) P(structure) (1)

The structure maximizing the right side of this expression will be the one maximizing the left sidethus telling us the most probable key structure given the surface. The prior probability of a key structurea set of key labels assigned to segmentsis defined as follows: P(structure) =

M s

(2)

Figure 1. Key-profiles for major keys (above) and minor keys (below).

The key-profiles thus indicate the probability of each pitch-class occurring in a segment of a given key. The probability of a pitch-class not occurring in a segment is just 1 minus this value. We can then define the probability of a set of pitch-classes occurring in a segment given a key (we call this a key-segment score): key-segment score =

Using a metric in which partial credit was given for incorrect guesses, the system received a score of 91.4%. On a B 0 computer (see the MIREX hardware specifications), the system ran the 1252 excerpts in a total time of 91 seconds, or .07 seconds per excerpt.

K (1K p ~p

pc

pc

))

(3)

where p refers to pitch-classes present in each segment, ~p refers to pitch-classes not present, and Kpc is the keyprofile value. The probability of a series of pitch-class sets, given a series of key labels for those segments, is then the product of the key-segment scores over all segments. This gives us a measure of the probability of the surface given the structure: P(surface | structure) =

(K (1K s p ~p

pc

pc

))

(4)

The probability of a key structure given a surface is then proportional to P(structure) P(surface | structure), as defined above. Given this formula, the model then searches for the most probable key structure using dynamic programming. Briefly, we go through the segments from left to right; at each segment Sn, for each key K, we calculate the best-so-far key structure ending with that key at Sn; this can be done by adding Sn (with key K) on to each best-so-far analysis at segment Sn-1 (already calculated). At the end of the piece, one key has the best score overall, and that can be traced back to yield the best analysis. Given the optimal key structure (a labeling of each segment with a key), how then does the model compute the main key of the entire piece (bearing in mind that it is only given a short excerpt from the beginning of the piece)? This is a question I have not considered in previous research. After trying several different methods on the MIREX training data, it seemed that the best method was simply to choose the key of the first segment in the optimal key structure, as defined above. It is important to note that this is not the same as simply analyzing the first 1.2-second segment in isolation. Rather, the entire piece is analyzed, with a preference for both minimizing key changes begtween segments and optimizing the fit of each key to the pitches of each segment; the main key is defined by the key of the first segment in the resulting global structure. One further refinement seemed desirable: In the analysis process described above, the initial segment of the piece is divided into four segments of 0.3 seconds in length; in effect this gives the first 1.2-second segment of the piece extra weight in determining the key.

The idea of using key-profiles for key-finding was first proposed by Krumhansl and Schmuckler [3]. Their model creates an input vector representing the total duration of each pitch-class in a passage, correlates this with key-profiles for each key (derived from experimental data), and chooses the key with the highest correlation. In [5], I proposed a modified version of this model, which divides the piece into short segments, represents each pitch-class as present or absent within each segment, calculates the match between input vector and key-profile as the scalar product of the two vectors, and factors in a modulation penalty for key changes. The model presented in [1] is very similar to this, but the Bayesian reasoning proposed there leads to a slightly different mathematical formula (the one presented above). Finally, in [2], I proposed the empirical Kostka-Payne profiles used here.

REFERENCES

[1] D. Temperley. 2002. A Bayesian key-finding model. In C. Anagnostopoulou et al. (eds.), Music and Artificial Intelligence. Berlin: Springer. D. Temperley. Probability. Forthcoming. Music and

C. Krumhansl. 1990. Cognitive Foundations of Musical Pitch. Oxford: Oxford University Press. S. Kostka & D. Payne. 1995. Tonal Harmony. New York: McGraw Hill. D. Temperley. 2001. The Cognition of Basic Musical Structures. Cambridge: MIT Press.

RESULTS

On the MIREX 2005 Evaluation, the Bayesian model described above was tested on its ability to identify the correct key of 1252 excerpts from classical pieces of 10, 20, or 30 seconds in length, represented as midifiles. The model labeled 1127 out of 1252 excerpts correctly.

- Tut+7Uploaded byhoangduc007
- Introduction to Bayesian Image AnalysisUploaded byamit_mehndiratta@sify
- The Hexatonic Systems Under Neo-riemannian OshitaUploaded byubirajara.pires1100
- grup9Uploaded byeni
- Permutations to Solve Probability ProblemsUploaded byItachi_1120
- K 2 FSchoefsUploaded byvictorrbg
- Homework 1Uploaded byAlireza Zahra
- Probability & Mathematical ExpectationUploaded byNoreen Albrando
- MODULE 1 Counting TechniquesUploaded byMini Cabalquinto
- The Usb ProblemUploaded byRasťo Halamiček
- Apma GuideUploaded byJD614
- Huang Multi Loop ControlUploaded bygatzke
- Bayesian_Learning_Introduction_bayes08.pdfUploaded byrcoca_1
- Chap 2Uploaded bytriangularedge
- Worksheet Independent Events 2Uploaded byHazel Clemente Carreon
- Performance Analysis of Selected Classifiers in User ProfilingUploaded byIntegrated Intelligent Research
- Basic ProbabilityUploaded byAbdullah Etizaz
- Problem Set 1Uploaded bySVR07
- PresentationUploaded byShashikiran HP
- Slide Pak Ronnie.pdfUploaded byChoi Choco
- 2017 Spring 600 Midterm TwoUploaded byEd Z
- STA - Part 7Uploaded byHimanshu Gome
- A Icke Lin AUploaded byArie L
- The Right Performance Targets Motivate Executives and the Right Tools Make That PossibleUploaded byManasMekap
- AJC_H2_MATH_P2Uploaded byjimmytanlimlong
- Tugas 2 or 2 Slp, Pdp, Game Theory (1)Uploaded byVeronika Dwika Andhita
- OutputUploaded byIskandar Agung Agung
- NotebookUploaded bymiryalavignesh5796
- 03 Zandieh Soft Computing GAUploaded byArman Vd
- Dougthery Social EffectsUploaded byIsaias Morales

- Platel, H., Baron, J. C., Desgranges, B., Bernard, F., & Eustache, F. (2003). Semantic and Episodic Memory of Music Are Subserved by Distinct Neural Networks. Neuroimage, 20(1), 244-256.Uploaded bygoni56509
- Platt, J. R., & Racine, R. J. (1994). Detection of Implied Harmony Changes in Triadic Melodies. Music Perception, 243-264.Uploaded bygoni56509
- Poulin-Charronnat, B., Bigand, E., & Madurell, F. (2005). the Influence of Voice Leading on Harmonic Priming. Music Perception, 22(4), 613-627.Uploaded bygoni56509
- Poulin-Charronnat, B., Bigand, E., Lalitte, P., Madurell, F., Vieillard, S., & McAdams, S. (2004). Effects of a Change in Instrumentation on the Recognition of Musical Materials. Music Perception, 22(2), 239-263.Uploaded bygoni56509
- Povel, D. J. (2010). Melody Generator- A Device for Algorithmic Music Construction. Journal of Software Engineering & Applications, 3, 683-695.Uploaded bygoni56509
- Pressing, J. (1988). Improvisation- Methods and Models. John a. Sloboda (Hg.)- Generative Processes in Music, Oxford, 129-178.Uploaded bygoni56509
- Pressing, J. (1998). CHAPTER Two Psychological Constraints on Improvisational Expertise and Communication. the Course of Performance. Studies in the World of Musical Improvisation, 47-67Uploaded bygoni56509
- Repp, B. H. (1992). a Review of Carol L. Krumhansl's Cognitive Foundations of Musical Pitch. DOCUMENT RESUME ED 352 694 CS 508 026, 277.Uploaded bygoni56509
- Repp, B. H. (2001). Expressive timing in the mind’s ear. Musical Imagery, 185-200.Uploaded bygoni56509
- Ribeiro-pereira, m. Prolegomena to a Theory of Modulatory Space in Music.Uploaded bygoni56509
- Memory Processes in the Acquisition of Expert Performance- The KP AM Model.Uploaded bygoni56509
- Rohrmeier, M. (2007). Modelling Dynamics of Key Induction in Harmony Progressions. in Proceedings of the 4th Sound and Music Computing Conference (Pp. 82-89).Uploaded bygoni56509
- Rohwer, d. (2006). the Role of Self-efficacy and Modeling in ImprovisationUploaded bygoni56509
- Ronkainen, S., & Kuusi, T. (2009). the Keyboard as a Part of Visual, Auditory and Kinesthetic Processing in Sight-reading at the Piano.Uploaded bygoni56509
- Rosner, B. S., & Narmour, E. (1992). Harmonic Closure- Music Theory and Perception. Music Perception, 383-411.Uploaded bygoni56509
- Russo, F. a., Cuddy, L. L., Galembo, A., & Thompson, W. F. (2007). Sensitivity to Tonality Across the Piano Range. Psychology Publications and Research, 11.Uploaded bygoni56509
- Saintilan, N. (2008). in Search of the Inner Voice. a Qualitative Exploration of the Internalised Use of Aural, Visual, Kinaesthetic, And Other Imagery in the Perception and Performance of Music.Uploaded bygoni56509
- Salamon, J. J. (2008). Chroma-Based Predominant Melody and Bass Line Extraction From Music Audio Signals (Doctoral Dissertation, Universitat Pompeu Fabra Barcelona, Spain).Uploaded bygoni56509
- Samson, S. (2003). Neuropsychological Studies of Musical Timbre. Annals of the New York Academy of Sciences, 999(1), 144-151.Uploaded bygoni56509
- Sanchez, D. J., & Reber, P. J. (2013). Explicit Pre-training Instruction Does Not Improve Implicit Perceptual-motor Sequence Learning. Cognition, 126(3), 341-351.Uploaded bygoni56509
- PhD ProposalUploaded byStatisticka Obrada Podataka
- Schenker, C. (2003). Artistic Research (2003 09-05-25)Uploaded bygoni56509
- Schmuckler, m. a., & Tomovski, r. (2005). Perceptual Tests of an Algorithm for Musical Key-finding. Journal of Experimental Psychology Human Perception and Performance, 31(5), 1124.Uploaded bygoni56509
- Schroeder, D. (2002). Four Approaches to Jazz Improvisation Instruction. Philosophy of Music Education Review, 10(1), 36-40.Uploaded bygoni56509
- Schubert, E., & Stevens, C. (2006). the Effect of Implied Harmony, Contour and Musical Expertise on Judgments of Similarity of Familiar Melodies. Journal of New Music Research, 35(2), 161-174.Uploaded bygoni56509
- Schürmann, M., Raij, T., Fujiki, N., & Hari, R. (2002). Mind's ear in a musician- Where and when in the brain. Neuroimage, 16(2), 434-440.Uploaded bygoni56509
- Shenoy, A., & Wang, Y. (2005). Key, Chord, And Rhythm Tracking of Popular Music Recordings. Computer Music Journal, 29(3), 75-86.Uploaded bygoni56509
- Sherriff, B. N. (2011). Exploring Everyday Musical Imagery. an Experience-sampling Study (Doctoral Dissertation).Uploaded bygoni56509
- Sisley, B. a. (2008). a Comparative Study of Approaches to Teaching Melodic Dictation (Doctoral Dissertation, Kent State University).Uploaded bygoni56509
- Skull, C. (2011). Sustained Excellence- Toward a Model of Factors Sustaining Elite Performance in Opera.Uploaded bygoni56509

- Peasant autonomy, solidarity and revolt.pdfUploaded byArián Laguna
- Lecture2-thirdmicroprocessorCUploaded byxkurt358792
- Spong_Robot Dynamics and Control - SECOND EDITIONUploaded byLuon Chan Dong
- Honeywell BPR CaseUploaded bysanjanasingh29
- Surface TensionUploaded byAditya Bansal
- Pnrc27.pdfUploaded bykyle
- 8141bUploaded bypagumilano
- New vbUploaded bysyedwaliullah
- Afreen Hr ProjectUploaded byRuishabh Runwal
- New Process for Taking Packet Tracer Assessments.pdfUploaded byJuan Carlos Arias
- European Food Law Handbook PDFUploaded byPhilip
- 88680711-jurnal-kinetika-kimia.pdfUploaded byjosuabutarbutar
- 3 UserguideUploaded byalarmming
- lecture 1001 -- the gas lawsUploaded byapi-196433526
- Coding and CompressionUploaded byHermandeep Kaur
- Pmp Sample QuestionsUploaded byadjei
- EE6411 2009 Unit00 Introduction c ChapterUploaded bylila001
- Navi 1 Navi Xt User Guide 1.06Uploaded byAshleigh White
- Bioinformatics & Rational Drug Design (SIT, MLACW).pdfUploaded byKavisa Ghosh
- Perfume Industry AssignmentUploaded byHajar Hadis
- Lesson 1-CHE 358Uploaded bypercydziks
- Operation Research - money in motionUploaded byGeorgia Liu
- Performance TuningUploaded byapi-3701108
- Special Report - Tax Office of Tomorrow - Download (1)Uploaded byJCCompany
- Mantle Programming Guide and API ReferenceUploaded byStanford Adderley
- Sony DSC-W120 W125 Level3 Ver1.1Uploaded byEldergo Marquis
- cellandmollecularbiocatalog.pdfUploaded byphyaravi
- Load Sharing Analysis of Planetary Gear BoxUploaded byesatjournals
- Study the Static Stiffness of a Propeller Shaft of a Car for a Maximum Torque Load to Evaluate the Shear StressUploaded byInternational Journal of Innovative Science and Research Technology
- Essentials of Horary Astrology or Prasnapadavi – R. BhatUploaded byJyotish Freedom