|
|
Evaluation of musical phrasesIn this project an attempt has been undertaken of automatic musical phrases evaluation. Estimation of esthetical values using strictly mathematical formulas doesn't really seem to be easy (if it's possible at all), that's why poor effects of this experiment should not surprise. From the same reason, that aim function is not strictly defined we felt eligible to use solutions (in some cases it were even "tricks"), which validity is hard to prove but which (probably) were leading to some approximated way of phrase estimation, taking on account particular properties. 4 functions of melody estimation were used:
Last stage is computation of user's preference function (eval) on basis of given feature estimation (eval_index). Phrase estimation functions"Richness" of melody(Melo)Function computes Discrete Fourier Transform (DFT); set of input samples is a vector of notes pitches, transformed in a way that sequence of successive, identical values is proportional to note's length which this sequence suits. Additional stage of computation fills gaps with neighbor values. Value eval_index_Melo is in approximation a sum of amplitudes of positive successive DFT constituents divided by constituent number. Additionally function Melo is only estimation function that takes melody tempo on account. This function in its assumption was about to increase its value due to appearance of larger intervals or/and faster fragments of melody. Differentiation of dynamics (Velo)Function computes Discrete Fourier Transform (DFT); set of input samples is a vector of note pitches values prepared using the same method as above.
Mean interval (Inter)
Weight divk,k+1 introduction is justified because large interval between successive notes is less perceptible while these sounds are separated by large pause or first of them is a short pre-note (ornament) of second.
Value of above function is a sum of four constituents. Two of them depend on logarithm from size of interval and distance of interval from size of 6 halftones. tabeval vector contains (arbitrary chosen) values dependent (in approximation) from degree of interval's dissonance. Last ingredient is equal zero only when interval is greater than octave. "Richness" of rhythm(Rthm)set of input samples is a vector of values increasing by 1 which each note.
Both replacing Fourier transform with cosine transform and its computation separately for each bar (together with taking into consideration variety of these values) is justified by willingness to pull out not only the richness' of rhythmic division among whole melody but also differences of local division. Phrases estimation on basis of characteristics estimation.For each inspected characteristic value of preference function equals:
It is quotient scaling to range [0,1]. Usage of constantadd (except of quotient scaling) prevents from incorrect arithmetical operations if best oreval_indexwas equal 0. Constant Best i add values were set arbitrarily. |