Feature Extraction for Musical Genre Classification MUS15

Similar documents
LEARNING TEMPORAL FEATURES USING A DEEP NEURAL NETWORK AND ITS APPLICATION TO MUSIC GENRE CLASSIFICATION

Acoustic Gait-based Person Identification using Hidden Markov Models

Introduction to Pattern Recognition

FAULT DIAGNOSIS IN DEAERATOR USING FUZZY LOGIC

Automated analysis of microscopic images of cellular tissues

Creation and Exploration of Musical Information Spaces

A computer program that improves its performance at some task through experience.

FEATURES. Features. UCI Machine Learning Repository. Admin 9/23/13

Introduction to Pattern Recognition

CS 7641 A (Machine Learning) Sethuraman K, Parameswaran Raman, Vijay Ramakrishnan

Example 1: One Way ANOVA in MINITAB

Bayesian Optimized Random Forest for Movement Classification with Smartphones

E. Agu, M. Kasperski Ruhr-University Bochum Department of Civil and Environmental Engineering Sciences

Modelling Approaches and Results of the FHDO Biomedical Computer Science Group at ImageCLEF 2015 Medical Classification Task

MIXFOOD KNOCK DRUM MACHINE MANUAL

Closure duration analysis of incomplete stop consonants due to stop-stop interaction

BLOOD PRESSURE SENSOR BT17i USER S GUIDE

Tick Tick Step Sequencer

World Leading Traffic Analysis

Investigation of Gait Representations in Lower Knee Gait Recognition

Navigate to the golf data folder and make it your working directory. Load the data by typing

Organizing Quantitative Data

Ball Toss. Vernier Motion Detector

Driv e accu racy. Green s in regul ation

Pitching Performance and Age

MEDIA KIT Spring 2019

Closure duration analysis of incomplete stop consonants due to stop-stop interaction

Chapter 11 Motion. Displacement-. Always includes Shorter than distance

Gait Pattern Classification with Integrated Shoes

PUV Wave Directional Spectra How PUV Wave Analysis Works

Geometric moments for gait description

Two Machine Learning Approaches to Understand the NBA Data

Predicting Horse Racing Results with Machine Learning

Sea State Analysis. Topics. Module 7 Sea State Analysis 2/22/2016. CE A676 Coastal Engineering Orson P. Smith, PE, Ph.D.

July 2015 Sept Cork City Pedestrian Counter Report

AUSTRALIA. 10th-13th July 2019 QUT GARDENS THEATRE 2 GEORGE STREET BRISBANE CITY QLD 4000 OPEN RULES

GEOPHYSICAL RESEARCH LETTERS

Reliable Real-Time Recognition of Motion Related Human Activities using MEMS Inertial Sensors

JPEG-Compatibility Steganalysis Using Block-Histogram of Recompression Artifacts

Title: Modeling Crossing Behavior of Drivers and Pedestrians at Uncontrolled Intersections and Mid-block Crossings

J. Electrical Systems 11-2 (2015): Regular paper. Abnormal gait detection using Hexagonal method on Model based Front view model

Human Performance Evaluation

The role of investor sentiment in the contemporaneous dynamics in energy futures prices: A Discrete Wavelet Transformation

Cricket umpire assistance and ball tracking system using a single smartphone camera

Deakin Research Online

Phys 201A. Lab 6 - Motion with Constant acceleration Kinematic Equations

THe rip currents are very fast moving narrow channels,

Honors/AP Physics 1 Homework Packet #2

Design of a Pedestrian Detection System Based on OpenCV. Ning Xu and Yong Ren*

Wind farm performance

DANCE COMPETITIONS REGISTRATION PACKET

HOW TO GET STARTED DRESSAGE TO MUSIC AND RIDING TO MUSIC

INTERNATIONAL DANCE FEDERATION TECHNICAL RULES

Spatio-temporal analysis of team sports Joachim Gudmundsson

Pedestrian Protection System for ADAS using ARM 9

SoundCast Design Intro

SEISMIC CHARACTERIZATION OF THE NOVEMBER 8, 10, AND 11, 1999 DEAD SEA UNDERWATER CHEMICAL CALIBRATION EXPLOSIONS USING CEPSTRAL MODELING AND INVERSION

PREDICTING the outcomes of sporting events

Performance of Fully Automated 3D Cracking Survey with Pixel Accuracy based on Deep Learning

Table of Contents. Career Overview... 4

Kinematics 1. A. coefficient of friction between the cart and the surface. B. mass of the cart. C. net force acting on the cart

BLOOD PRESSURE SENSOR 0377i

Using the CONVAL software for the petrochemical plant control valves checking. Case study

Person identification from walking sound on wooden floor

Energy Utilisation of Wind

Atqasuk Wind Resource Report

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG

Tying Knots. Approximate time: 1-2 days depending on time spent on calculator instructions.

WIM #37 I-94, MP OTSEGO, MN APRIL 2012 MONTHLY REPORT

CFD Analysis of Giromill Type Vertical Axis Wind Turbine

Open Research Online The Open University s repository of research publications and other research outputs

Uncertainty Estimates in Satellite Derived Bathymetry

CYCLIC WAVEFORM SIGNAL ANALYSIS FOR ONLINE MONITORING OF VALVE SEAT ASSEMBLY PROCESSES

Analysis of the Radar Doppler Signature of a Moving Human

LINEAR MOTION. General Review

Movement-Type Classification Using Acceleration Sensor

Relative Motion. New content!

A study of advection of short wind waves by long waves from surface slope images

Wave Transformation, Prediction, and Analysis at Kaumalapau Harbor, Lanai, Hawaii

Site Description: Tower Site

Generation of See-Through Baseball Movie from Multi-Camera Views

NDA NATIONAL CHAMPIONSHIP

A Big Data approach to understand Central Banks Big Data Spain 2018

(12) Patent Application Publication (10) Pub. No.: US 2009/ A1

A NEW GLOBAL MOTION ESTIMATION ALGORITHM AND ITS APPLICATION TO RETRIEVAL IN SPORTS EVENTS

Prediction of Crossing Driveways of a Distracted Pedestrian from Walking Characteristics

Harmonic Motion: Pendulums Student Version

2261. Pedestal looseness extent recognition method for rotating machinery based on vibration sensitive time-frequency feature and manifold learning

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering

Statistical Analysis of Gait Data to Assist Clinical Decision Making

Meter Data Distribution Market Trials

Trial of a process to estimate depth of cover on buried pipelines

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management

Tutorial: Setting up and importing Splat Maps from World Machine

RHY TH M & GR O OV E S DANC E CLASS ES Ballet, Contemporary Dance, Girls Style, Street Jazz & Private Rhythmic Gymnastics Sessions GIRLS STYLE

Exploring Factors Affecting Metrorail Ridership in Washington D.C.

Classification of Acceleration Data for Biometric Gait Recognition on Mobile Devices

Las Vegas Challenge Course adidas TERREX Mini Caddy Book

Complex terrain wind resource estimation with the wind-atlas method: Prediction errors using linearized and nonlinear CFD micro-scale models

Image compression: ER Mapper 6.0 ECW v2.0 versus MrSID 1.3

Transcription:

Feature Extraction for Musical Genre Classification MUS15 Feature Extraction for Musical Genre Classification MUS15 Kilian Merkelbach June 22, 2015 Kilian Merkelbach June 21, 22, 2015 0/26 1/26

Intro - Contents 1 Intro 2 History 3 Methods Tzanetakis & Cook Correa, Costa & Saito 4 Conclusion Kilian Merkelbach June 22, 2015 2/26

Intro - What are genres? Kilian Merkelbach June 22, 2015 3/26

Intro - Genres provide order Kilian Merkelbach June 22, 2015 4/26

Intro - Pipeline - Feature extraction Kilian Merkelbach June 22, 2015 5/26

Intro - Pipeline - Training and Classification Kilian Merkelbach June 22, 2015 6/26

History - Contents 1 Intro 2 History 3 Methods Tzanetakis & Cook Correa, Costa & Saito 4 Conclusion Kilian Merkelbach June 22, 2015 7/26

History - MP3 revolution 1998: First MP3 players available 1999: Napster, mp3.com go online Commercial and private music collections explode Kilian Merkelbach June 22, 2015 8/26

History - First steps 2001: Tzanetakis and Cook build foundation by publishing first paper from then on: methods evolve from previous ones, always trying to improve performance Kilian Merkelbach June 22, 2015 9/26

Methods - Contents 1 Intro 2 History 3 Methods Tzanetakis & Cook Correa, Costa & Saito 4 Conclusion Kilian Merkelbach June 22, 2015 10/26

Methods - State of the Art Tzanetakis, Cook: "Musical Genre Classification of Audio Signals" Correa, Costa, Saito: "Tracking the Beat: Classification of Music Genres and Synthesis of Rhythms" Kilian Merkelbach June 22, 2015 11/26

Methods - Tzanetakis & Cook Features Overview Timbral Texture Features (19 dim.) Rhythmic Content Features (6 dim.) Pitch Content Features (5 dim.) Kilian Merkelbach June 22, 2015 12/26

Methods - Tzanetakis & Cook Timbral Texture Features I based on short time Fourier transform (STFT) Analysis Window (23ms) vs. Texture Window (1s) [1] Kilian Merkelbach June 22, 2015 13/26

Methods - Tzanetakis & Cook Timbral Texture Features II calculate four features from STFT output and use their mean and variance for classification e.g. Spectral Flux: F t = N (N t [n] N t 1 [n]) 2 n=1 additional features based on Mel-frequency cepstral coefficients (MFCCs) Kilian Merkelbach June 22, 2015 14/26

Methods - Tzanetakis & Cook Rhythmic Content Features Discrete Wavelet Transform (DWT) to analyze signal autocorrelation function recognizes strong beats (example) map peaks into Beat Histogram Kilian Merkelbach June 22, 2015 15/26

Methods - Tzanetakis & Cook Beat Histogram [3] Kilian Merkelbach June 22, 2015 16/26

Methods - Tzanetakis & Cook Pitch Content Features Pitch Histogram: like Beat Histogram (0.5-1.5s) but with shorter time frame (2-50ms) Kilian Merkelbach June 22, 2015 17/26

Methods - Tzanetakis & Cook Results 59% accuracy with 10 different genres 77% among classical genres 61% among jazz genres [3] Kilian Merkelbach June 22, 2015 18/26

Methods - Correa, Costa & Saito Tracking the Beat Songs given as MIDI files Use directed graphs to describe song Kilian Merkelbach June 22, 2015 19/26

Methods - Correa, Costa & Saito Digraphs Weighted directed graphs with note lengths as vertices Weights defined by frequency of note sequence [2] Kilian Merkelbach June 22, 2015 20/26

Methods - Correa, Costa & Saito Features Build digraph for each song: 18 18 = 324 dim. Additional features from digraph: 15 dim. Use PCA to obtain 52 dim. feature vector Kilian Merkelbach June 22, 2015 21/26

Methods - Correa, Costa & Saito Results 85.72% accuracy with 4 different genres (blues, bossa-nova, reggae and rock) Kilian Merkelbach June 22, 2015 22/26

Conclusion - Contents 1 Intro 2 History 3 Methods Tzanetakis & Cook Correa, Costa & Saito 4 Conclusion Kilian Merkelbach June 22, 2015 23/26

Conclusion - Prediction by humans College students found correct genre (out of 10) with 70% accuracy after 3 seconds [3] Machines are on par Human experts still better if there are many genres Kilian Merkelbach June 22, 2015 24/26

Conclusion - Future Research step away from speech recognition features (e.g. MFCCs) fuzzy classification (e.g. 90% Rock, 10% Blues) larger genre sets, higher accuracy Kilian Merkelbach June 22, 2015 25/26

Conclusion - References Wikimedia Commons. Short time fourier transform, 2006. Debora C Correa, Luciano da F Costa, and Jose H Saito. Tracking the beat: Classification of music genres and synthesis of rhythms. G. Tzanetakis and P. Cook. Musical genre classification of audio signals. Speech and Audio Processing, IEEE Transactions on, 10(5):293 302, Jul 2002. Kilian Merkelbach June 22, 2015 26/26