Using Machine Learning for Real-time Activity Recognition and Estimation of Energy Expenditure

Similar documents
Human Performance Evaluation

DATA MINING SAMPLE RESEARCH: ACTIVITY RECOGNITION CLASSIFICATION IN ACTION

Reliable Real-Time Recognition of Motion Related Human Activities using MEMS Inertial Sensors

A computer program that improves its performance at some task through experience.

GET UP AND MOVE: THE SCIENCE BEHIND SITTING DISEASE

An Energy Expenditure Estimation Algorithm for a Wearable System

Bayesian Optimized Random Forest for Movement Classification with Smartphones

Activity Time Intensity code Hours Minutes

measures Second generation accelerometers

Using Accelerometry: Methods Employed in NHANES

MEASURING PHYSICAL ACTIVITY IN OBESE POPULATIONS USING ACCELEROMETRY

Diederik van der Valk. EWI Biomedical Signals and Systems Telemedicine

Energy Expenditure Estimation DEMO Application

Objective Physical Activity Monitoring for Health-Related Research: A Discussion of Methods, Deployments, and Data Presentations

ROSE-HULMAN INSTITUTE OF TECHNOLOGY Department of Mechanical Engineering. Mini-project 3 Tennis ball launcher

CS 528 Mobile and Ubiquitous Computing Lecture 7a: Applications of Activity Recognition + Machine Learning for Ubiquitous Computing.

User Activity Related Data Sets for Context Recognition

Gait Recognition. Yu Liu and Abhishek Verma CONTENTS 16.1 DATASETS Datasets Conclusion 342 References 343

PREDICTING the outcomes of sporting events

ACTIVITY MONITORING SYSTEM

Activity Recognition from User-Annotated Acceleration Data

Smart-Walk: An Intelligent Physiological Monitoring System for Smart Families

Increasing Exercise Adherence through Environmental Interventions. Chapter 8

wesport: Utilising Wrist-Band Sensing to Detect Player Activities in Basketball Games

HHS Public Access Author manuscript Int J Cardiol. Author manuscript; available in PMC 2016 April 15.

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG

Physical Activity. Assessment Why Physical Activity Is Important? Background Information Tips: Walking Steps Tips: Walking Minutes Goals

Physical Activity. Assessment Why Physical Activity Is Important? Background Information Tips: Walking Steps Tips: Walking Minutes Goals

Why tennis? 34 reasons to play tennis

iphone as a Physical Activity Measurement Platform

In addition to reading this assignment, also read Appendices A and B.

W C S D. on the MOVE

... Participant Guide

Fall Prevention Midterm Report. Akram Alsamarae Lindsay Petku 03/09/2014 Dr. Mansoor Nasir

TRAINING WITH! PHYSICLO RESISTANCE GEAR. Testing & Validation

Monitoring Daily Energy Expenditure using a 3-Axis Accelerometer with a Low-Power Microprocessor

Weekly Activity Tracker

Ambulatory monitoring of gait quality with wearable inertial sensors

Experiment HE-9: Resting, Active, and Exercising Metabolic Rates

Experiment HE-9: Resting, Active, and Exercising Metabolic Rates

Building an NFL performance metric

Exercise Energy Expenditure Estimation Based on Acceleration Data Using the Linear Mixed Model

Video Based Accurate Step Counting for Treadmills

Validity of Four Activity Monitors during Controlled and Free-Living Conditions

Available online at ScienceDirect. Procedia Computer Science 98 (2016 )

Analysis of Acceleration Value based on the Location of the Accelerometer

iworx Sample Lab Experiment HE-5: Resting Metabolic Rate (RMR)

... Participant Guide

Towards Human Energy Expenditure Estimation Using Smart Phone Inertial Sensors

Session 8: Step Up Your Physical Activity Plan

References PEDOMETER PULSOMETER

Navigate to the golf data folder and make it your working directory. Load the data by typing

iworx Sample Lab Experiment HE-4: Respiratory Exchange Ratio (RER)

Towards Practical Energy Expenditure Estimation With Mobile Phones

Validation of a Step Test in Children Ages 7-11

iworx Sample Lab Experiment HE-5: Resting Metabolic Rate (RMR)

Vector Rope Trainer Cardiovascular Program Fitness Level: GENERAL

#10 Work Physiology. By : Dewi Hardiningtyas, ST., MT., MBA. Industrial Engineering Dept. University of Brawijaya

AggPro: The Aggregate Projection System

Continuous sweep versus discrete step protocols for studying effects of wearable robot assistance magnitude

Minimal influence of wind and tidal height on underwater noise in Haro Strait

An Engineering Approach to Precision Ammunition Development. Justin Pierce Design Engineer Government and International Contracts ATK Sporting Group

Real-time Posture and Activity Recognition by SmartShoe

Sea Surface Temperature (SST) in South China Sea Retrieved from Chinese Satellite FY-3B VIRR Data

Application of the ActiGraph GT9X IMU for the Assessment of Turning During Walking and Running

LifeBeat should be worn during daytime as well as night time as it can record activity levels as well as sleep patterns.

Thursday, Nov 20, pm 2pm EB 3546 DIET MONITORING THROUGH BREATHING SIGNAL ANALYSIS USING WEARABLE SENSORS. Bo Dong. Advisor: Dr.

The estimation of energy expenditure (EE) is of interest

Health + Track Mobile Application using Accelerometer and Gyroscope

Chapter 6 - ATHLETIC DEVELOPMENT FOR GOLF

Review Article Review of Prediction Models to Estimate Activity-Related Energy Expenditure in Children and Adolescents

Using Spatio-Temporal Data To Create A Shot Probability Model

Accelerometers: An Underutilized Resource in Sports Monitoring

CS 7641 A (Machine Learning) Sethuraman K, Parameswaran Raman, Vijay Ramakrishnan

FEATURES. Features. UCI Machine Learning Repository. Admin 9/23/13

SESSION ONE SESSION 1, P 4

VALIDATION OF HEAT FLUX TECHNOLOGY TO ASSESS ENERGY EXPENDITURE DURING EXERCISE. Erin L. Thomas

What s Health Got to Do With It? Health and Land Use Planning

PhenX Measure: Total Physical Activity - Comprehensive (#150700) PhenX Protocol: Total Physical Activity - Comprehensive - Adult Protocol (#150702)

The Scientific Bulletin of VALAHIA University MATERIALS and MECHANICS Nr. 5 (year 8) 2010

Enhancing Activity Recognition by Fusing Inertial and Biometric Information

VALIDATION OF THE PHYSICAL ACTIVITY INDEX (PAI) AS A MEASURE OF TOTAL ACTIVITY LOAD AND TOTAL KILOCALORIE EXPENDITURE

Define terms and compute basic physics problems related to sprinting

Where am I: Recognizing On-Body Positions of Wearable Sensors

Walkable Communities and Adolescent Weight

MSD RISK ASSESSMENT. Risk Factor "Potential Risk" "High Risk" Contact Stress Repetition Grip Force Lift/Lower Force Awkward Posture Vibration WSPS.

February Funded by NIEHS Grant #P50ES RAND Center for Population Health and Health Disparities

Promoting Health in Low-Wealth Communities: Physical Activity

Evaluation of Pedometer Performance Across Multiple Gait Types Using Video for Ground Truth

Effects of Age and Body Mass Index on Accuracy of Simple Moderate Vigorous Physical Activity Monitor Under Controlled Condition

Global Physical Activity Questionnaire (GPAQ)

Speed Training. Speed Training

An investigation of kinematic and kinetic variables for the description of prosthetic gait using the ENOCH system

Predicting the Total Number of Points Scored in NFL Games

Evaluating and Classifying NBA Free Agents

On11: An Activity Recommendation Application to Mitigate Sedentary Lifestyle

Competitive Performance of Elite Olympic-Distance Triathletes: Reliability and Smallest Worthwhile Enhancement

Fail Operational Controls for an Independent Metering Valve

UNDER 9 AND UNDER 12 PLAYER EVALUATION (fundamental and developmental stages)

The Impact of Policy and Environmental Outcomes on Youth Physical Activity

Transcription:

Using Machine Learning for Real-time Activity Recognition and Estimation of Energy Expenditure Emmanuel Munguia Tapia PhD Thesis Defense House_n Massachusetts Institute of Technology

Do you know How many calories you expend each day? How many calories you need to to stay healthy?

Motivation Obesity is a major health threat: 65% of U.S. adults are overweight 30% of U.S. adults are obese 16% of children are obese National Center for Health statistics

Obesity is a risk factor for Hypertension Type 2 diabetes Coronary heart disease Stroke Gallbladder disease Osteoarthritis Sleep apnea and respiratory problems Some cancers (endometrial, breast, and colon)

Projected prevalence of obesity Data from WHO and IUNS

Projected prevalence of obesity Data from WHO and IUNS

Energy (im)balance Body composition change Energy intake Energy expenditure Three ways to address the problem Magic pill Eat less or healthier Burn more calories

Two ways to help Knowing what people are doing Knowing how many calories are burned

Lying down Washing dishes Cycling Walking moderate Walking light Standing Sitting fidgeting legs Sitting Non-Exercise Physical Activity 0 5 10 15 20 25 30 35 40 Percentage of time If a mobile phone could Non-Exercise Physical Activity Lying down Washing dishes Cycling Walking moderate Walking light Standing Sitting fidgeting legs Sitting 0 5 10 15 20 25 30 35 40 Percentage of time 1. Real-time feedback 2. Just-in-time interventions 3. Non-exercise activity thermogenesis

How is physical activity and energy expenditure presently measured?

In the lab Room indirect calorimetry Portable indirect calorimeter

During free-living Paper diaries Electronic diaries burdensome + time consuming

Electronic monitoring (during free-living ) Actigraph (1axis) Bodybugg (2axis) Problems: Little or no contextual information Low performance on upper body and lower body _activity

Goal of this work Develop algorithms based on wireless wearable sensors that: Recognize activity type, intensity and duration Estimate energy expenditure Achieve reasonable performance Are amenable for real-time performance Work when sensors worn in convenient locations This work explores the trade-offs that need to be made in order to achieve these goals

Activity Recognition Algorithms Experiments

Previous work Kern et al. 2003: 8 activities, 18min, 1 researcher Blum et al. 2005 8 activities, 24hrs, 1 researcher Bao et al. 2005 20 activities, 30hrs, 20 subjects Olguin et al. 2006 8 activities, 3 subjects Ravi et al. 2008 8 activities, 2 subjects Huynh and Schiele 2005 6 activities, 200min, 2 participants Lester et al. 2006 10 activities, 3 subjects

Contributions 52 activities, 120hrs, 20 subjects Collected at a gym and residential home Recognize activity type and intensity Systematic experiments to determine Algorithm parameters Value of accelerometers versus heart rate Location and number of the sensors Proof of viability of real-time system to recognize arbitrary activities

Demo: Activity recognition 1. Wear three wireless accelerometers 2. Select 10 physical activities 3. Provide 2 minutes of data per activity

Activity Recognition Algorithm

Walking treadmill 4mph 0% Dominant Foot Hip Dominant wrist

Walking treadmill 4mph 0% Dominant Foot Hip Dominant wrist Zoom in Accelerometer data

Segmentation: Sliding windows Dominant Foot Hip Dominant wrist 4.2s

Interpolation: Cubic splines 4500 Annotation - Walking - Treadmill 4mph - Treadmill 0% Data window(4.2s) 4500 Annotation - Lying down Data window(4.2s) 4000 4000 3500 3500 3000 3000 2500 2500 2000 2000 1500 1500 1000 500 Before interpolation 1000 500 After interpolation 90 100 110 120 130 140 90 95 100 105 110 115 120 125 130 135 140

Signal processing: Filtering 4500 4000 3500 3000 2500 Annotation - Lying down Data window (4.2s) x y z 2000 1500 1000 500 After interpolation 90 95 100 105 110 115 120 125 130 135 140

Signal processing: Filtering x y z BandPass Filter 0.1 20Hz LowPass Filter 1Hz Motion Information Posture Information

Feature computation For each of the 9 acceleration axis, compute the following features referred as invariant reduced Signal variability Variance Posture information Posture Distances Activity intensity Energy between 0.3-3.5Hz Frequency/periodicity of motion Top 5 peaks of the FFT

Time domain features Variance x x y z xz xy yz Posture distances

Frequency domain features x 5 FFT peaks Freq1,Mag1 Freq2,Mag2, Freq3,Mag3, Freq4,Mag4, Freq5,Mag5 0.3-3.5Hz Band energy

Training of classifier ACVar (9) ACFFTPeaks (90) ACBandEnergy (9) DCPostureDist (9) C4.5 Decision Tree Classifier [val_1 val_2 val_117] Vector size: 117

C4.5 decision tree

Subject independent evaluation Train 1 2 3 4 Total subjects - 1 Test Left out subject Repeat for as many subjects available and average results

Subject dependent evaluation Results 10-Fold crossvalidation Train Test Repeat for as many subjects we have and average results

Target activities (52) Type Intensity Type Intensity Lying down Not applicable Running Treadmill 5mph at 0% grade Standing Not applicable Running Treadmill 6mph at 0% grade Sitting Not applicable Stairs Ascend stairs Not applicable Sitting Fidget feet legs Stairs Descend stairs Not applicable Sitting Fidget hands arms Cycling 80 rpm, light, moderate, hard Kneeling Not applicable Cycling 60 rpm, light Walking Treadmill 2mph 0% grade Cycling 100 rpm, light Walking Treadmill 3mph 0% grade Rowing 30 spm, light, moderate, hard Walking Treadmill 3mph at 3% grade Bicep curls Light, moderate, hard Walking Treadmill 3mph at 6% grade Bench weight lifting Light, moderate, hard Walking Treadmill 3mph at 9% grade Sit-ups Not applicable Running Treadmill 4mph at 0% grade Crunches Not applicable Gymnasium activity subset

Target activities (52) Type Carrying groceries Doing dishes Gardening Ironing Making the bed Mopping Playing videogames Scrubbing a surface Stacking groceries Sweeping Typing Type Vacuuming Walking around block Washing windows Watching TV Weeding Wiping/Dusting Writing taking out trash Household activities subset

Sensing equipment (a) Wireless accelerometers, (b) HR transceiver, (c) Actigraphs, (d) HR monitor, (e) pedometer, (f) Bodybugg armband

Sensor placement 20 Subjects 18 and 42 years old Start/end times of activities annotated 3-4min per activity except physically demanding activities ~1min

Final system design Acceleration only Three sensors: hip, wrist, foot out of seven explored Computes features over each axis Feature set: minimizes dependency on sensor placement Classifier C4.5 classifier Sliding windows of 5.6s in length

Performance: 52 activities Evaluation Method Accuracy (%) TP Range (%) Random guess: 1.96% FP Range (%) Subject dependent 87.9 80-93 0.1 0.2 Subject independent 50.6 34 77 0.5 1.3 Percent change 73%

Performance: 52 activities Evaluation Method Accuracy (%) TP Range (%) Random guess: 1.96% FP Range (%) Subject dependent 87.9 80-93 0.1 0.2 Subject independent 50.6 34 77 0.5 1.3 Percent change 73%

Performance: 52 activities Higher performance: Postures and exercises Lowest performance: household and resistance activities Confused: Intensity levels Household Household with postures and ambulation Activities involving upper body motion

How much training data? Subject dependent performance Training: 75%, Testing: 25% Varied training from 75% to 7.5% 75% training data: Accuracy= 80.6% 60% training data: Accuracy= 76% At 60% of data: 2min for most activities, 1min for physically demanding activities

Performance: Activity subsets Activities to recognize Total Activities Included All 51 All 51 activities All with no intensities 31 No intensity levels for Bicep curls, bench weight lifting, walking, running, cycling, rowing, and sitting Postures, ambulation and two MET intensity categories 11 Lying down, sitting, standing, kneeling, walking (2, 3mph), running (4,5, and 6 mph), Postures and Ambulation with no intensity moderate, vigorous 8 Lying down, sitting, standing, kneeling, walking, running, ascending stairs, descending stairs Postures 4 Lying down, sitting, standing, kneeling

Performance: Activity subsets Subject Dependent Subject Independent Activities to recognize Random Guess (%) Total Accuracy (%) Total Accuracy (%) All (51) 1.9% 87.9 50.6 All with no intensities 3.2% 91.4 72.0 (31) Postures, ambulation 9% 96.5 81.3 and two MET intensity categories (11) Postures and 12.5% 98.4 92.9 Ambulation with no intensity (8) Postures (4) 25% 99.3 98.0

Performance: Activity subsets Subject Dependent Subject Independent Activities to recognize Random Guess (%) Total Accuracy (%) Total Accuracy (%) All (51) 1.9% 87.9 50.6 All with no intensities 3.2% 91.4 72.0 (31) Postures, ambulation 9% 96.5 81.3 and two MET intensity categories (11) Postures and 12.5% 98.4 92.9 Ambulation with no intensity (8) Postures (4) 25% 99.3 98.0 If activity intensities are merged, SI accuracy =72%

Performance: Activity subsets Subject Dependent Subject Independent Activities to recognize Random Guess (%) Total Accuracy (%) Total Accuracy (%) All (51) 1.9% 87.9 50.6 All with no intensities 3.2% 91.4 72.0 (31) Postures, ambulation 9% 96.5 81.3 and two MET intensity categories (11) Postures and 12.5% 98.4 92.9 Ambulation with no intensity (8) Postures (4) 25% 99.3 98.0

Performance: Activity subsets Subject Dependent Subject Independent Activities to recognize Random Guess (%) Total Accuracy (%) Total Accuracy (%) All (51) 1.9% 87.9 50.6 All with no intensities 3.2% 91.4 72.0 (31) Postures, ambulation 9% 96.5 81.3 and two MET intensity categories (11) Postures and 12.5% 98.4 92.9 Ambulation with no intensity (8) Postures (4) 25% 99.3 98.0

Performance: Activity subsets Subject Dependent Subject Independent Activities to recognize Random Guess (%) Total Accuracy (%) Total Accuracy (%) All (51) 1.9% 87.9 50.6 All with no intensities 3.2% 91.4 72.0 (31) Postures, ambulation 9% 96.5 81.3 and two MET intensity categories (11) Postures and 12.5% 98.4 92.9 Ambulation with no intensity (8) Postures (4) 25% 99.3 98.0

52 activities: Sensor subsets Sensor Combination Subject Dependent Accuracy All sensors 87.9 ± 2.0 Hip + DWrist + DFoot -1.8 % Hip + DFoot -3.5 % Hip + DWrist -4.9 % DWrist + DThigh -7.2 % DWrist + DFoot -7.7 % Hip -8.4 % DFoot -14.9 % DThigh -15.1 % DUpperArm -15.4 % DWrist -19.6 % Sensor Combination Subject Independent Accuracy All sensors 50.6 ± 5.2 Hip + DWrist + DFoot -7.9 % DWrist + DThigh -8.1 % DWrist + DFoot -13.0 % Hip + DWrist -15.6 % Hip + DFoot -18.9 % DUpperArm -26.5 % DWrist -27.7 % Hip -28.5 % DFoot -34.8 % DThigh -42.7 %

52 activities: Sensor subsets Sensor Combination Subject Dependent Accuracy All sensors 87.9 ± 2.0 Hip + DWrist + DFoot -1.8 % Hip + DFoot -3.5 % Hip + DWrist -4.9 % DWrist + DThigh -7.2 % DWrist + DFoot -7.7 % Hip -8.4 % DFoot -14.9 % DThigh -15.1 % DUpperArm -15.4 % DWrist -19.6 % Sensor Combination Subject Independent Accuracy All sensors 50.6 ± 5.2 Hip + DWrist + DFoot -7.9 % DWrist + DThigh -8.1 % DWrist + DFoot -13.0 % Hip + DWrist -15.6 % Hip + DFoot -18.9 % DUpperArm -26.5 % DWrist -27.7 % Hip -28.5 % DFoot -34.8 % DThigh -42.7 %

52 activities: Sensor subsets Sensor Combination Subject Dependent Accuracy All sensors 87.9 ± 2.0 Hip + DWrist + DFoot -1.8 % Hip + DFoot -3.5 % Hip + DWrist -4.9 % DWrist + DThigh -7.2 % DWrist + DFoot -7.7 % Hip -8.4 % DFoot -14.9 % DThigh -15.1 % DUpperArm -15.4 % DWrist -19.6 % Sensor Combination Subject Independent Accuracy All sensors 50.6 ± 5.2 Hip + DWrist + DFoot -7.9 % DWrist + DThigh -8.1 % DWrist + DFoot -13.0 % Hip + DWrist -15.6 % Hip + DFoot -18.9 % DUpperArm 26.5 % DWrist -27.7 % Hip -28.5 % DFoot -34.8 % DThigh -42.7 %

52 activities: Sensor subsets Sensor Combination Subject Dependent Accuracy All sensors 87.9 ± 2.0 Hip + DWrist + DFoot -1.8 % Hip + DFoot -3.5 % Hip + DWrist -4.9 % DWrist + DThigh -7.2 % DWrist + DFoot -7.7 % Hip -8.4 % DFoot -14.9 % DThigh -15.1 % DUpperArm -15.4 % DWrist -19.6 % Sensor Combination Subject Independent Accuracy All sensors 50.6 ± 5.2 Hip + DWrist + DFoot -7.9 % DWrist + DThigh -8.1 % DWrist + DFoot -13.0 % Hip + DWrist -15.6 % Hip + DFoot -18.9 % DUpperArm 26.5 % DWrist -27.7 % Hip -28.5 % DFoot -34.8 % DThigh -42.7 %

52 activities: Sensor subsets Sensor Combination Subject Dependent Accuracy All sensors 87.9 ± 2.0 Hip + DWrist + DFoot -1.8 % Hip + DFoot -3.5 % Hip + DWrist -4.9 % DWrist + DThigh -7.2 % DWrist + DFoot -7.7 % Hip -8.4 % DFoot -14.9 % DThigh -15.1 % DUpperArm -15.4 % DWrist -19.6 % Sensor Combination Subject Independent Accuracy All sensors 50.6 ± 5.2 Hip + DWrist + DFoot -7.9 % DWrist + DThigh -8.1 % DWrist + DFoot -13.0 % Hip + DWrist -15.6 % Hip + DFoot -18.9 % DUpperArm 26.5 % DWrist -27.7 % Hip -28.5 % DFoot -34.8 % DThigh -42.7 %

Why 5.6s sliding windows? Measured performance while varying window length from 1.4s to 91s Performance increases with longer windows Improvement ~5% from 5.6s to 45s. Window length depends on activity type but this is computationally expensive Long windows for household activities (e.g. 22-45s) Short windows for postures (e.g. 5.6s ) Long windows: low performance over short duration activities and long real-time delays.

Why not combine HR+ACC data? Subject Independent Evaluation Features subsets Accuracy (%) TP Rate (%) FP Rate (%) ScaledHR 13.8 4 16 1.6 2.3 Invariant Reduced 50 34-77 0.5 1.3 Invariant Reduced + ScaledHR 52 37-76 0.5 1.3 Subject Dependent Evaluation Features subsets Accuracy (%) TP Rate (%) FP Rate (%) ScaledHR 38.4 24 39 1.4 1.6 Invariant Reduced 88 80-97 0.1 0.4 Invariant Reduced + ScaledHR 89.5 82 97 0.1 0.4

Why not combine HR+ACC data? Subject Independent Evaluation Features subsets Accuracy (%) TP Rate (%) FP Rate (%) ScaledHR 13.8 4 16 1.6 2.3 Invariant Reduced 50 34-77 0.5 1.3 Invariant Reduced + ScaledHR 52 37-76 0.5 1.3 Subject Dependent Evaluation Features subsets Accuracy (%) TP Rate (%) FP Rate (%) ScaledHR 38.4 24 39 1.4 1.6 Invariant Reduced 88 80-97 0.1 0.4 Invariant Reduced + ScaledHR 89.5 82 97 0.1 0.4

Why not combine HR+ACC data? Subject Independent Evaluation Features subsets Accuracy (%) TP Rate (%) FP Rate (%) ScaledHR 13.8 4 16 1.6 2.3 Invariant Reduced 50 34-77 0.5 1.3 Invariant Reduced + ScaledHR 52 37-76 0.5 1.3 Subject Dependent Evaluation Features subsets Accuracy (%) TP Rate (%) FP Rate (%) ScaledHR 38.4 24 39 1.4 1.6 Invariant Reduced 88 80-97 0.1 0.4 Invariant Reduced + ScaledHR 89.5 82 97 0.1 0.4 Percent change = 2-4%

Why such a low improvement? Stabilization time Running 5mph 0% Standing still Heart rate lags physical activity and remains altered once activity has ended. Thus, errors concentrated at start - end

Why such a low improvement? Raw data Filtered data Ascending stairs Descending stairs Ascending stairs Descending stairs Errors also occur for activities where heart rate constantly increases or decreases over time (e.g. physically demanding)

Real-time pilot study Five participants were asked to: Wear 3 accelerometers and Type in 10 physical activities, exercises, postures, or activities of their choice Perform activities provided continuously for 2 minutes.

Real-time pilot study Subject Activities performed Total Accuracy (%) True Positive Range (%) False Positive Range (%) 1 Bouncing on a ball Waving hand Shaking my leg Taekwondo Form #1 Side stretch Jumping jacks Punching as I walk forward Lifting dumbbells Riding a bike Playing the drums 89.6 89.3 94.8 0.8-1.0 2 Walking Sitting still Scratching head Carrying box Washing dishes 3 Throwing Bowling Bouncing Typing Stepping 4 Walk Type in computer Washing window Drawing in paper Wiping surface 5 Walk Bicep curls Stretching Applying cream Brushing teeth Shaking hands Tossing ball in air Typing Talking on phone Stretching arm Walking Tennis serve Stretching legs Bending Talking on the phone Sweeping Combing my hair Hammering a nail Eating Wash dish Knitting Wash hands Filing nails Play piano 91.7 84.5 98.2 0.4 0.17 78.9 70.7 93.2 1.3 3.8 89.3 74.1 94.8 0.6 2.1 85.2 77.6 94.8 0.6 2.7

Energy Expenditure Algorithm Experiments

Accelerometer at the hip Hip accelerometer 1min windows Feature: overall _motion Linear regression _ Predict EE in METs kcal/min kj/min.

Compendium of physical activities Activity Label Walking 2mph Lying down: 1.0MET Walking 2mph 2.5MET Sweeping 3.3MET Look-up table Compendium of physical activities Estimated EE 2.5MET

Crouter et al. 2007 Actigraph Data Walking 2mph Activity Recognition Off Off On Sedentary MET =1 Ambulatory Linear regression Lifestyle Exponential Regression No output 2MET No output Estimated EE 2MET 17 activities, 20 subjects, 3hours/subject r = 0.96, RMSE=0.73MET, MAED=0.75MET

Crouter et al. 2007

EE in this work This thesis extends the work of Crouter et al. by: Exploring the use of 51 activity dependent regression models The utilization of 7 accelerometers The exploration of 41 features The use of shorter window lengths The use of linear and non-linear regression models

Activity dependent regression Off Regression Model 1 Lying down Activity Label Walking 2mph On Off Regression Model 2 Walking 2mph Regression Model 3 Sweeping 2MET Estimated EE 2MET Off Regression Model n Washing dishes

EE estimation assumptions Predicting EE in METs 1MET = EE while lying down METs normalize EE with respect to body mass Gross EE prediction Gross=resting + motion energy expenditure Non-steady state EE is not eliminated Might be difficult to reach during free-living More realistic evaluation

MIT EE dataset Reduced version of data collected Removed sessions containing any activity with low EE values (< 40%) Poor mask attachment 13 out of 40 sessions removed 15 gym and 12 household sessions 16 Participants men=7, woman=9 18-40 years old, Body mass 60-103kg

Evaluation measures Correlation coefficient (r) r = [0,1] Root mean squared error (RMSE) 1 N p i a i N i 1 2

Baseline EE experiments Error Measures Total Correlation Coefficient Total root Mean Square Error Maximum absolute Deviation Crouter et al. Actigraph Compendium Comparable Activities (29 activities) 0.4 0.9 (125%) 2.7 1.27 (-53%) Compendium Closest Activities (52 activities) 0.8 (100%) 1.6 (-41%) Linear Regression 0.73 (82%) 1.4 (-48%) One Linear Regression model per activity 0.87 (117%) 1.0 (-63%) One nonlinear regression model per activity 0.91 (127%) 0.88 (-67.4%) 6.9 4.17 5.6 4.1 4.2 3.4 Performance over 52 activities using the ACAbsArea feature computed per sensor over one-minute sliding windows.

Baseline EE results Error Measures Total Correlation Coefficient Total root Mean Square Error Maximum absolute Deviation Crouter et al. Actigraph Compendium Comparable Activities (29 activities) 0.4 0.9 (125%) 2.7 1.27 (-53%) Compendium Closest Activities (52 activities) 0.8 (100%) 1.6 (-41%) Linear Regression 0.73 (82%) 1.4 (-48%) One Linear Regression model per activity 0.87 (117%) 1.0 (-63%) One nonlinear regression model per activity 0.91 (127%) 0.88 (-67.4%) 6.9 4.17 5.6 4.1 4.2 3.4 Performance lower than the obtained over 17 activities r = 0.92, RMSE=0.73MET High maximum error deviation!

Crouter: Lower body activity Subject MIT-018

Crouter: Upper body activity Subject MIT-018

Baseline EE results Error Measures Total Correlation Coefficient Total root Mean Square Error Maximum absolute Deviation Crouter et al. Actigraph Compendium Comparable Activities (29 activities) 0.4 0.9 (125%) 2.7 1.27 (-53%) Compendium Closest Activities (52 activities) 0.8 (100%) 1.6 (-41%) Linear Regression 0.73 (82%) 1.4 (-48%) One Linear Regression model per activity 0.87 (117%) 1.0 (-63%) One nonlinear regression model per activity 0.91 (127%) 0.88 (-67.4%) 6.9 4.17 5.6 4.1 4.2 3.4 This result indicates that knowledge of the activity being _performed is important. Performance depends on mean EE listings in Compendium

Baseline EE results Error Measures Total Correlation Coefficient Total root Mean Square Error Maximum absolute Deviation Crouter et al. Actigraph Compendium Comparable Activities (29 activities) 0.4 0.9 (125%) 2.7 1.27 (-53%) Compendium Closest Activities (52 activities) 0.8 (100%) 1.6 (-41%) Linear Regression 0.73 (82%) 1.4 (-48%) One Linear Regression model per activity 0.87 (117%) 1.0 (-63%) One nonlinear regression model per activity 0.91 (127%) 0.88 (-67.4%) 6.9 4.17 5.6 4.1 4.2 3.4 Performance improves over Crouter s mainly due to the use of six additional accelerometers.

Baseline EE results Error Measures Total Correlation Coefficient Total root Mean Square Error Maximum absolute Deviation Crouter et al. Actigraph Compendium Comparable Activities (29 activities) 0.4 0.9 (125%) 2.7 1.27 (-53%) Compendium Closest Activities (52 activities) 0.8 (100%) 1.6 (-41%) Linear Regression 0.73 (82%) 1.4 (-48%) One Linear Regression model per activity 0.87 (117%) 1.0 (-63%) One nonlinear regression model per activity 0.91 (127%) 0.88 (-67.4%) 6.9 4.17 5.6 4.1 4.2 3.4 Improvement over single linear regression model: R=19%, RMSE=-28.5% Activity dependent models help by allowing regression _coefficients to be tuned for each activity

Baseline EE results Error Measures Total Correlation Coefficient Total root Mean Square Error Maximum absolute Deviation Crouter et al. Actigraph Compendium Comparable Activities (29 activities) 0.4 0.9 (125%) 2.7 1.27 (-53%) Compendium Closest Activities (52 activities) 0.8 (100%) 1.6 (-41%) Linear Regression 0.73 (82%) 1.4 (-48%) One Linear Regression model per activity 0.87 (117%) 1.0 (-63%) One nonlinear regression model per activity 0.91 (127%) 0.88 (-67.4%) 6.9 4.17 5.6 4.1 4.2 3.4 Improvement over activity-dependent linear regression: r=5%, RMSE=-12% Improvement over single linear model is: r=25%, RMSE=-37%

Performance per activity Weakest: activities with resistance/load Best: postures and household activities A single linear regression using 7sensors Overestimates EE for postures Predicts EE well for lower and upper body activities The Compendium of Physical activates Overestimates EE for household activities and short duration short duration activities Estimates EE better for activities that reached steady-state EE

Summary of results EE estimation is improved by Accelerometers at upper and lower body Activity-dependent regression models Questions to answer Fewer accelerometers? Performance of activity recognition algorithm? Heart rate data? For full detail see thesis!

Final system design The final EE estimation algorithm uses the following parameters: Only accelerometer data Three accelerometers: Hip, dominant wrist, and dominant foot. Feature: Top 5 FFT peaks per sensor. 5.6s sliding windows.

Sensor combinations Sensor Combination Correlation RMSE All sensors 0.71 1.28 Hip + DWrist + DFoot -2.8% +2.3% DWrist + DFoot -2.8% +3.0% Hip + DFoot -4.2% +3.0% DWrist + DThigh -12.7% +5.2% Hip + DWrist -5.6% +12.3% DFoot -8.5% +5.2% DThigh -11.3% +9.2% DUpperArm -15.5% +11.1% Hip -33.8% +13.5% DWrist -2.8% +21.5%

Sensor combinations Sensor Combination Correlation RMSE All sensors 0.71 1.28 Hip + DWrist + DFoot -2.8% +2.3% DWrist + DFoot -2.8% +3.0% Hip + DFoot -4.2% +3.0% DWrist + DThigh -12.7% +5.2% Hip + DWrist -5.6% +12.3% DFoot -8.5% +5.2% DThigh -11.3% +9.2% DUpperArm -15.5% +11.1% Hip -33.8% +13.5% DWrist -2.8% +21.5%

Sensor combinations Sensor Combination Correlation RMSE All sensors 0.71 1.28 Hip + DWrist + DFoot -2.8% +2.3% DWrist + DFoot -2.8% +3.0% Hip + DFoot -4.2% +3.0% DWrist + DThigh -12.7% +5.2% Hip + DWrist -5.6% +12.3% DFoot -8.5% +5.2% DThigh -11.3% +9.2% DUpperArm -15.5% +11.1% Hip -33.8% +13.5% DWrist -2.8% +21.5%

Sensor combinations Sensor Combination Correlation RMSE All sensors 0.71 1.28 Hip + DWrist + DFoot -2.8% +2.3% DWrist + DFoot -2.8% +3.0% Hip + DFoot -4.2% +3.0% DWrist + DThigh -12.7% +5.2% Hip + DWrist -5.6% +12.3% DFoot -8.5% +5.2% DThigh -11.3% +9.2% DUpperArm -15.5% +11.1% Hip -33.8% +13.5% DWrist -2.8% +21.5%

Activity dependent regression C4.5 decision tree + Invariant reduced Feature set Activity Recognition Subject Dependent Subject independent Linear regression + ACFFTPeaks Energy Expenditure Estimation Subject independent

51 activity dependent models Method Activity Energy Correlation RMSE Feature set feature set LR - ScaledHR 0.84 1.01 LR - ACFFTPeaks 0.72 1.28 51 activities Invariant ACFFTPeaks 0.77 1.31 ARSI LR reduced 51 activities ARSD LR Invariant reduced ACFFTPeaks 0.88 0.99 Estimation of energy expenditure over 51 activities using three sensors at the hip, dominant wrist and dominant foot.

51 activity dependent models Method Activity Energy Correlation RMSE Feature set feature set LR - ScaledHR 0.84 1.01 LR - ACFFTPeaks 0.72 1.28 51 activities Invariant ACFFTPeaks 0.77 1.31 ARSI LR reduced 51 activities ARSD LR Invariant reduced ACFFTPeaks 0.88 0.99 Results during subject dependent evaluation are very close to the ones obtained when activity is assumed to be known (<2%).

51 activity dependent models Method Activity Energy Correlation RMSE Feature set feature set LR - ScaledHR 0.84 1.01 LR - ACFFTPeaks 0.72 1.28 51 activities Invariant ACFFTPeaks 0.77 1.31 ARSI LR reduced 51 activities ARSD LR Invariant reduced ACFFTPeaks 0.88 0.99 Heart rate data outperforms best accelerometer- _based feature. Activity-dependent models using subject dependent _activity recognition achieve a performance close HR _data.

Interesting findings Features other than overall amount of motion improve performance Use of 5 FFT peaks + energy + mean crossing rate features instead of overall motion feature at hip sensor improves r=+13%, RMSE=-21%

Interesting findings Features other than overall amount of motion improve performance Use of 5 FFT peaks + energy + mean crossing rate features instead of overall motion feature at hip sensor improves r=+13%, RMSE=-21% Addition of heart rate data to best accelerometer feature (5 FFT peaks) improves performance SI: r=+22%, RMSE=-31%

Activity dependent mean values Sensor Data Activity Recognition Lying down: 1.0MET Walking 2mph 2.5MET Estimated EE 2.5MET Walking 2mph Sweeping 3.3MET Look-up table Mean EE values per activity

Activity dependent mean values Method Feature set Correlation RMSE 51 activities Invariant 0.80 ± 0.08 1.15 ± 0.31 ARSI Mean reduced 51 activities ARSD Mean Invariant reduced 0.90 ± 0.04 0.84 ± 0.23 This appears to be the best EE estimation strategy at least _on the dataset explored Improvement with respect to activity-dependent linear _regression models Subject dependent: r=4%, RMSE=-12% Subject independent: r=2%, RMSE=-15%

Problems with AR-based EE Mean EE estimation would overestimate EE for short duration activities (physically intense). Misclassifications could affect EE estimates. Spurious misclassifications need to be filtered.

Alternatives Train on large set of mutually exclusive activities Recognize the unknown activity and use generic EE model for this activity prompt user at the end of day for unknown periods

Contributions: Activity recognition Recognition of 52 activities and subsets on 20 non-researchers Recognition of activity intensity 2min Subject dependent training is a promising strategy Three sensors at hip, wrist, foot Acceptable performance without HR Real-time system than can be trained to recognize arbitrary activities

Contributions: EE estimation Activity-dependent models improve performance Accelerometer and heart rate Performance is close to ACC+HR Estimation of mean EE values outperforms linear regression models when using activity-dependent models EE estimation using HR outperforms EE estimation using accelerometer data Exploration of impact of parameters

Future work directions Create the user interfaces necessary to allow interactive training. Allow users to fix the recognition. algorithm (more data/modify models). Experiments when data is collected over several days for same subjects (n>40). Include activity duration EE estimation. Use activity transition information to improve EE estimates.

Committee members Dr. William L. Haskell, Professor of Medicine at Stanford University Prof. Alex (Sandy) Pentland, Toshiba Professor in Media Arts and Sciences Dr. Joseph A. Paradiso, Associate Professor of Media Arts and Sciences Dr. Stephen S. Intille, Research Scientist Kent Larson, Principal Research Scientist MIT Department of Architecture

Thank you! Any Questions? Contact: Emmanuel Munguia Tapia emunguia@mit.edu