
An Introduction to Audio Content Analysis: Applications in Signal Processing and Music Informatics



Overview

With the proliferation of digital audio distribution over digital media, audio content analysis is fast becoming a requirement for designers of intelligent signal-adaptive audio processing systems. Written by a well-known expert in the field, this book provides quick access to different analysis algorithms and allows comparison between different approaches to the same task, making it useful for newcomers to audio signal processing and industry experts alike. A review of relevant fundamentals in audio signal processing, psychoacoustics, and music theory is also included, as are downloadable MATLAB files.

Please visit the companion website: www.AudioContentAnalysis.org

Editorial Reviews

From the Publisher
“The book is simply a treasure for music analysts, and I would strongly recommend it for any scientific library.” (Computing Reviews, 29 May 2013)

“Overall, this is a very practical book. It’s a good source of concise information on many topics in audio analysis, and I recommend it for practitioners of digital audio.” (Computing Reviews, 4 January 2013)

Product Details

  • ISBN-13: 9781118266823
  • Publisher: Wiley
  • Publication date: 8/14/2012
  • Edition number: 1
  • Pages: 272
  • Sales rank: 1,105,294
  • Product dimensions: 7.22 (w) x 10.04 (h) x 1.06 (d)

Meet the Author

ALEXANDER LERCH, PhD, is Managing Director and co-owner of zplane.development, a research and development firm of licensable technology for digital audio signal processing for both the music software industry and major audio content distributors. Dr. Lerch also teaches audio content analysis at the Technical University of Berlin.

Table of Contents

List of Figures xiii

List of Tables xvii

Preface xix

Acronyms xxi

List of Symbols xxv

1 Introduction 1
1.1 Audio Content 3
1.2 A Generalized Audio Content Analysis System 4

2 Fundamentals 7
2.1 Audio Signals 7
2.1.1 Periodic Signals 7
2.1.2 Random Signals 9
2.1.3 Sampling and Quantization 9
2.1.4 Statistical Signal Description 13
2.2 Signal Processing 14
2.2.1 Convolution 14
2.2.2 Block-Based Processing 18
2.2.3 Fourier Transform 20
2.2.4 Constant Q Transform 23
2.2.5 Auditory Filterbanks 24
2.2.6 Correlation Function 24
2.2.7 Linear Prediction 28

3 Instantaneous Features 31
3.1 Audio Pre-Processing 33
3.1.1 Down-Mixing 33
3.1.2 DC Removal 33
3.1.3 Normalization 34
3.1.4 Down-Sampling 34
3.1.5 Other Pre-Processing Options 35
3.2 Statistical Properties 35
3.2.1 Arithmetic Mean 36
3.2.2 Geometric Mean 36
3.2.3 Harmonic Mean 36
3.2.4 Generalized Mean 36
3.2.5 Centroid 37
3.2.6 Variance and Standard Deviation 37
3.2.7 Skewness 38
3.2.8 Kurtosis 39
3.2.9 Generalized Central Moments 40
3.2.10 Quantiles and Quantile Ranges 40
3.3 Spectral Shape 41
3.3.1 Spectral Rolloff 42
3.3.2 Spectral Flux 44
3.3.3 Spectral Centroid 45
3.3.4 Spectral Spread 47
3.3.5 Spectral Decrease 48
3.3.6 Spectral Slope 49
3.3.7 Mel Frequency Cepstral Coefficients 51
3.4 Signal Properties 54
3.4.1 Tonalness 54
3.4.2 Auto Correlation Coefficients 61
3.4.3 Zero Crossing Rate 62
3.5 Feature Post-Processing 63
3.5.1 Derived Features 64
3.5.2 Normalization and Mapping 65
3.5.3 Subfeatures 66
3.5.4 Feature Dimensionality Reduction 66

4 Intensity 71
4.1 Human Perception of Intensity and Loudness 71
4.2 Representation of Dynamics in Music 73
4.3 Features 73
4.3.1 Root Mean Square 73
4.4 Peak Envelope 76
4.5 Psycho-Acoustic Loudness Features 77
4.5.1 EBU R128 78

5 Tonal Analysis 79
5.1 Human Perception of Pitch 79
5.1.1 Pitch Scales 79
5.1.2 Chroma Perception 81
5.2 Representation of Pitch in Music 82
5.2.1 Pitch Classes and Names 82
5.2.2 Intervals 83
5.2.3 Root Note, Mode, and Key 83
5.2.4 Chords and Harmony 86
5.2.5 The Frequency of Musical Pitch 88
5.3 Fundamental Frequency Detection 91
5.3.1 Detection Accuracy 92
5.3.2 Pre-Processing 94
5.3.3 Monophonic Input Signals 97
5.3.4 Polyphonic Input Signals 103
5.4 Tuning Frequency Estimation 106
5.5 Key Detection 108
5.5.1 Pitch Chroma 108
5.5.2 Key Recognition 112
5.6 Chord Recognition 116

6 Temporal Analysis 119
6.1 Human Perception of Temporal Events 119
6.1.1 Onsets 119
6.1.2 Tempo and Meter 122
6.1.3 Rhythm 122
6.1.4 Timing 123
6.2 Representation of Temporal Events in Music 123
6.2.1 Tempo and Time Signature 123
6.2.2 Note Value 124
6.3 Onset Detection 124
6.3.1 Novelty Function 125
6.4 Beat Histogram 133
6.4.1 Beat Histogram Features 134
6.5 Detection of Tempo and Beat Phase 135
6.6 Detection of Meter and Downbeat 136

7 Alignment 139
7.1 Dynamic Time Warping 139
7.1.1 Example 143
7.1.2 Common Variants 144
7.1.3 Optimizations 145
7.2 Audio-to-Audio Alignment 146
7.2.1 Ground Truth Data for Evaluation 147
7.3 Audio-to-Score Alignment 148
7.3.1 Real-Time Systems 148
7.3.2 Non-Real-Time Systems 149

8 Musical Genre, Similarity and Mood 151
8.1 Musical Genre Classification 151
8.1.1 Musical Genre 152
8.1.2 Feature Extraction 154
8.1.3 Classification 155
8.2 Related Research Fields 156
8.2.1 Music Similarity Detection 156
8.2.2 Mood Classification 158
8.2.3 Instrument Recognition 161

9 Audio Fingerprinting 163
9.1 Fingerprint Extraction 164
9.2 Fingerprint Matching 165
9.3 Fingerprinting System: Example 166

10 Music Performance Analysis 169
10.1 Musical Communication 169
10.1.1 Score 169
10.1.2 Music Performance 170
10.1.3 Production 172
10.1.4 Recipient 172
10.2 Music Performance Analysis 172
10.2.1 Analysis Data 174
10.2.2 Research Results 177

A Convolution Properties 181
A.1 Identity 181
A.2 Commutativity 181
A.3 Associativity 182
A.4 Distributivity 183
A.5 Circularity 183

B Fourier Transform 185
B.1 Properties of the Fourier Transformation 186
B.1.1 Inverse Fourier Transform 186
B.1.2 Superposition 186
B.1.3 Convolution and Multiplication 186
B.1.4 Parseval’s Theorem 187
B.1.5 Time and Frequency Shift 188
B.1.6 Symmetry 188
B.1.7 Time and Frequency Scaling 189
B.1.8 Derivatives 190
B.2 Spectrum of Example Time Domain Signals 190
B.2.1 Delta Function 190
B.2.2 Constant 190
B.2.3 Cosine 190
B.2.4 Rectangular Window 191
B.2.5 Delta Pulse 191
B.3 Transformation of Sampled Time Signals 191
B.4 Short Time Fourier Transform of Continuous Signals 192
B.4.1 Window Functions 193
B.5 Discrete Fourier Transform 195
B.5.1 Window Functions 196
B.5.2 Fast Fourier Transform 197

C Principal Component Analysis 199
C.1 Computation of the Transformation Matrix 200
C.2 Interpretation of the Transformation Matrix 200

D Software for Audio Analysis 201
D.1 Software Frameworks & Applications 202
D.1.1 Marsyas 202
D.1.2 CLAM 202
D.1.3 jMIR 203
D.1.4 CoMIRVA 203
D.1.5 Sonic Visualiser 203
D.2 Software Libraries & Toolboxes 204
D.2.1 Feature Extraction 204
D.2.2 Plugin Interfaces 205
D.2.3 Other Software 206

References 207

Index 237

Customer Reviews

Average Rating: 4 (1 review)
Rating Distribution: 5 Star (0), 4 Star (1), 3 Star (0), 2 Star (0), 1 Star (0)

  • Posted July 4, 2013

    The digital music revolution has profoundly changed the way that we listen to, discover, share and appreciate music. Digitization of music has turned music into data, and has enabled a much more systematic and rigorous analysis of the music as such. “An Introduction to Audio Content Analysis” is an excellent resource for the state-of-the-art conceptual and analytic tools that are used these days for the analysis of the audio signal.

    As with many technical books that are called an introduction, the label is a bit of a misnomer. The book is fairly advanced, both in its content and in the mathematics used, and would be more appropriate for a higher-level undergraduate or beginning graduate class.

    Even though the title of this book suggests that this is a general audio analysis resource, the book predominantly focuses on the content analysis of music.

    This book is not the best textbook material out there. It lacks a wide variety of worked-out examples, and there are no exercises at the ends of the chapters. However, it does come with an accompanying website which is a great resource in its own right. The website provides a plethora of MATLAB examples, which are one of the best aspects of this book.

    Overall, this is a pretty good resource that will be handy both for audio analysis professionals and for those who want to learn more about this growing and exciting field.
