Petacat: Applying ideas from Copycat to image understanding

Similar documents
1.1 The size of the search space Modeling the problem Change over time Constraints... 21

Object Recognition. Selim Aksoy. Bilkent University

Using Perceptual Context to Ground Language

Application of Bayesian Networks to Shopping Assistance

THe rip currents are very fast moving narrow channels,

Pedestrian Dynamics: Models of Pedestrian Behaviour

WELCOME! TOWN CENTRE PARK INFORMATION SESSION WHAT IS HAPPENING? GOALS FOR TODAY. TOWN STAY CONNECTED WITH US! #1 INFORM WE ARE HERE

CS472 Foundations of Artificial Intelligence. Final Exam December 19, :30pm

Cricket umpire assistance and ball tracking system using a single smartphone camera

DOWNTOWN TENINO. Goal: Get more People in Downtown. A Pamphlet of Concepts for

Glossary. Access Ramp: A slope which allows a wheelchair to enter or exit a location.

Design of a Pedestrian Detection System Based on OpenCV. Ning Xu and Yong Ren*

Section 1 Types of Waves. Distinguish between mechanical waves and electromagnetic waves.

Performance of Fully Automated 3D Cracking Survey with Pixel Accuracy based on Deep Learning

Application of Dijkstra s Algorithm in the Evacuation System Utilizing Exit Signs

For IEC use only. Technical Committee TC3: Information structures, documentation and graphical symbols

Centre for Transport Studies

Local-to-Global Algorithms in Biology: Drosophila and Beyond (maybe...)

Terminals Operators Guide

Virtual Breadboarding. John Vangelov Ford Motor Company

Simulation Analysis of Intersection Treatments for Cycle Tracks

The Transport Study 3Land Basel

Introduction to Pattern Recognition

/435 Artificial Intelligence Fall 2015

Modeling Planned and Unplanned Store Stops for the Scenario Based Simulation of Pedestrian Activity in City Centers

Grade: 8. Author(s): Hope Phillips

AN UNDERWATER AUGMENTED REALITY SYSTEM FOR COMMERCIAL DIVING OPERATIONS

The Walkability Indicator. The Walkability Indicator: A Case Study of the City of Boulder, CO. College of Architecture and Planning

IMPROVED SAFETY SURFACE ACCESS AT LOW COST AIRPORTS: KUALA LUMPUR INTERNATIONAL AIRPORT, MALAYSIA CASE STUDY

A Novel Approach to Predicting the Results of NBA Matches

CS 528 Mobile and Ubiquitous Computing Lecture 7a: Applications of Activity Recognition + Machine Learning for Ubiquitous Computing.

Neural Network in Computer Vision for RoboCup Middle Size League

Combined impacts of configurational and compositional properties of street network on vehicular flow

Update on Regional Bicycle, Pedestrian, & Trail Planning. Presented to TCC November 21, 2014

Advanced PMA Capabilities for MCM

Public Workshop #2. September 14, 2016

CAAD CTF 2018 Rules June 21, 2018 Version 1.1

Community Task Force March 14, 2018

Bicycle and Pedestrian Level of Service Evaluation

HIGHWAY 11 CORRIDOR STUDY

Exhibit 1 PLANNING COMMISSION AGENDA ITEM

#19 MONITORING AND PREDICTING PEDESTRIAN BEHAVIOR USING TRAFFIC CAMERAS

Developing a Bike/Pedestrian Plan Using ArcInfo and Public Participation Rob Shumowsky, Madison County Council of Governments

THE UNIVERSITY OF HULL

NCCP Swimming 301 Course Summary

Safety Assessment of Installing Traffic Signals at High-Speed Expressway Intersections

Pedestrian, Bicycle and Traffic Calming Strategic Implementation Plan. January 18, 2011

F u n d a m e n t a l s o f G r a p h i c D e s i g n

Robust Task Execution: Procedural and Model-based. Outline. Desiderata: Robust Task-level Execution

ONTARIO RACING PROGRAM Implementation Criteria

Lesson: Airspeed Control

Title: Modeling Crossing Behavior of Drivers and Pedestrians at Uncontrolled Intersections and Mid-block Crossings

Spreading Activation in Soar: An Update

Appendix C. Corridor Spacing Research

Course Information Version

Tuesday, September 25 th Mount Holly Municipal Complex 400 East Central Avenue 5:30pm to 7:30pm. Activity Stations o Where do you Live?

Predicting Horse Racing Results with Machine Learning

The Application of Pedestrian Microscopic Simulation Technology in Researching the Influenced Realm around Urban Rail Transit Station

UAV-based monitoring of pedestrian groups

Active Travel and Exposure to Air Pollution: Implications for Transportation and Land Use Planning

Rules Modernization FAQs March 2018 Update

Complete Streets for Eastham Prioritization Plan Eastham Library JUNE 13, 2018

TALKING ABOUT HOCKEY. Warm Up. Overview. Materials. Essential Question. Standards

Self-Driving Vehicles That (Fore) See

INTRODUCTION TO PATTERN RECOGNITION

SERIES SIMULATORS SIMULATORS SIMULATORS

Supplementary Material for Bayes Merging of Multiple Vocabularies for Scalable Image Retrieval

DESIGNING PERCEPTUAL MOTOR SKILL DEVELOPMENT PROGRAMS FOR SQUASH

WALKNBIKE DRAFT PLAN NASHVILLE, TENNESSEE EXECUTIVE SUMMARY NASHVILLE, TENNESSEE

Open Research Online The Open University s repository of research publications and other research outputs

5.0 Roadway System Plan

Introduction to Pattern Recognition

APPENDIX A. GIS ANALYSIS FOR THE MRT

Chapter 3: Two-Dimensional Motion and Vectors

VGI for mapping change in bike ridership

Turbulence and How to Avoid It

Lee s Summit Road Improvement Study Public Open House June 7, 2007 Summary of Comment Card Responses

Designing a Bicycle and Pedestrian Count Program in Blacksburg, VA

HANGING ROCK STATE PARK EXPANSION MASTER PLAN PHASE TWO INITIAL CONCEPTS INSTRUCTIONS

Department of Transportation

REPORT District of Maple Ridge

TIPS Planning Form for Infants and Toddlers

ComfRide - A Smartphone based Comfortable Public Route Recommendation

The previous chapters in this plan have established the value of this plan (Chapter 1), the

Vision Zero High Injury Network Methodology

Rules Modernization FAQs March 2018 Update

Cyclists gaze behavior in urban space: an eye-tracking experiment on bicycle facilities.

Rhode Island Moving Forward Long-Range Transportation Plan 2040 Municipal Roundtable Newport County

Universal Style Transfer via Feature Transforms

AN EARTHWORK HILL, RINGED WITH SEATING AND SHAPED TO SLOPE DOWN TO THE RIVER, FORMS THE CENTERPIECE TO A NEW SOCIAL PIER. A WOOD SEATING TERRACE

February 2018 Mixed Topic Survey Results

Web Based Bicycle Trip Planning for Broward County, Florida

The system design must obey these constraints. The system is to have the minimum cost (capital plus operating) while meeting the constraints.

Diver Training Options

TULSA CITY COUNCIL CAPITAL IMPROVEMENT PROGRAM TASK FORCE MEETING JANUARY

Chapter 3: Multi-Modal Circulation and Streetscapes

Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up Warm up

A CHANCE TO RIDE. Lesson 3 Pedestrian Safety & Awareness of Surroundings

Adaptability and Fault Tolerance

explore AerodYnAmics Texas Alliance for Minorities in Engineering, Trailblazer II

Transcription:

Petacat: Applying ideas from Copycat to image understanding

How Streetscenes Works (Bileschi, 2006) 1. Densely tile the image with windows of different sizes. 2. HMAX C2 features are computed in each window. 3. The features in each window are given as input to each of five trained support vector machines ( pedestrian, car, bicycle, building, tree ) 4. If any return a classification with score above a learned threshold, that object is said to be detected.

Object detection (here, car ) with HMAX model (Bileschi, 2006)

Limitations of Streetscenes approach for image understanding

Limitations of Streetscenes approach for image understanding Exhaustive search not scalable Does not recognize spatial and abstract relationships among objects for whole scene understanding Has no prior knowledge about object categories and their place in conceptual space HMAX model is completely feed-forward; no feedback to allow context to aid in scene understanding. Where should feedback come in?

Representation of High-Level Knowledge: A Simple Semantic Network (or Ontology ) walking holds leash attached to Person action action walking

But...

Modified Ontology walking holds leash attached to Group Person action action walking running

Modified Ontology walking holds leash attached to Group Person action walking action running Allowing conceptual slippage

But...

Modified Ontology walking holds leash attached to Group Person action action Cat walking running Iguana

But...

But...

But...

But...

Modified Ontology Helicopter walking Bicycle holds leash attached to Group Person action action Cat Car walking running Iguana

But...

Sidewalk Beach Outside Inside Grass Lawn mower Sky Above Holding Attached to Gasoline Ground Car walking Tree Runway Left of Person Backpack Leash Airplane Standing Far from Walking grooming Close to Fanny pack Running Stick Track Helicopter Army

Need dynamical process of constructing representation.

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes.

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes. Ongoing perception of context brings in appropriate concepts and conceptual slippages, and avoids exhaustive search

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes. Ongoing perception of context brings in appropriate concepts and conceptual slippages, and avoids exhaustive search Prior, higher-level knowledge interacts with lower-level vision in both directions (bottom-up and top-down).

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes. Ongoing perception of context brings in appropriate concepts and conceptual slippages, and avoids exhaustive search Prior, higher-level knowledge interacts with lower-level vision in both directions (bottom-up and top-down). Concepts are fluid, allowed to slip in certain contexts.

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes. Ongoing perception of context brings in appropriate concepts and conceptual slippages, and avoids exhaustive search Prior, higher-level knowledge interacts with lower-level vision in both directions (bottom-up and top-down). Concepts are fluid, allowed to slip in certain contexts. This allows perception of essential similarity in the face of superficial differences i.e., analogy-making.

Active Symbol Architecture (Hofstadter et al., 1995)

Active Symbol Architecture (Hofstadter et al., 1995) Basis for Copycat (analogy-making), Hofstadter & Mitchell Tabletop (anlaogy-making), Hofstadter & French Metacat (analogy-making and self-awareness), Hofstadter & Marshall and many others

Active Symbol Architecture (Hofstadter et al., 1995) Semantic network Workspace Temperature Perceptual agents (codelets)

Petacat: (Descendant of Copycat) Integration of Active Symbol Architecture and HMAX Initial task: Decide if image is an instance of taking a dog for a walk, and if so, how good an instance it is.

indoors has location Semantic Network taking a dog for a walk outdoors Object road a beach a grass a sidewalk is on person is in front of is touching leash dog is touching is on Action Spatial Relation trail has action runs has action stands walks drives sits belt flies swims rope string horse cat is on is behind is touching is next to has location is in front of

indoors has location Semantic Network taking a dog for a walk Property links Slip links road a beach a trail has action grass a sidewalk outdoors is on runs has action stands person walks drives sits is in front of is touching belt flies swims leash is touching rope string horse dog cat is on is on is behind Object Action Spatial Relation is touching is next to has location is in front of

indoors has location Semantic Network taking a dog for a walk Property links Slip links road a beach a trail has action grass a sidewalk outdoors is on runs has action stands person walks drives sits is in front of is touching belt flies swims leash is touching rope string horse dog cat is on is on is behind Object Action Spatial Relation is touching is next to Properties of nodes has location is in front of

Workspace

Workspace Semantic network

Perceptual Agents (Codelets) Semantic network Codelets as active symbols

indoors has location taking a dog for a walk outdoors Object road a beach a grass a sidewalk is on person is in front of is touching leash dog is touching is on Action Spatial Relation trail has action runs has action stands walks drives sits belt flies swims rope string horse cat is on is behind is touching is next to has location is in front of

indoors has location taking a dog for a walk outdoors Object road a beach a grass a sidewalk is on person is in front of is touching leash dog is touching is on Action Spatial Relation trail has action runs has action stands walks drives sits belt flies swims rope string horse cat is on is behind is touching is next to has location is in front of

indoors has location taking a dog for a walk outdoors Object road a beach a grass a sidewalk is on person is in front of is touching leash dog is touching is on Action Spatial Relation trail has action runs has action stands walks drives sits belt flies swims rope string horse cat is on is behind is touching is next to has location is in front of

indoors has location taking a dog for a walk outdoors Object road a beach a grass a sidewalk is on person is in front of is touching leash dog is touching is on Action Spatial Relation trail has action runs has action stands walks drives sits belt flies swims rope string horse cat is on is behind is touching is next to has location is in front of

Illustration of what we plan to have happen not a real run of Petacat?

Illustration of what we plan to have happen not a real run of Petacat Person???

Illustration of what we plan to have happen not a real run of Petacat Person??? Sidewalk?

Illustration of what we plan to have happen not a real run of Petacat Person? Outdoors??? Sidewalk??

Illustration of what we plan to have happen not a real run of Petacat Person? Outdoors??? Sidewalk?? Scout codelets: Send C1 features in window to corresponding SVM. If positive result, post builder codelet with urgency equal to SVM s confidence.

Illustration of what we plan to have happen not a real run of Petacat Person? negative? negative? negative Outdoors? positive: 0.7 Sidewalk? positive: 0.4? positive: 0.8 Scout codelets: Send C1 features in window to corresponding SVM. If positive result, post builder codelet with urgency equal to SVM s confidence.

Illustration of what we plan to have happen not a real run of Petacat Person? negative? negative? negative Outdoors? positive: 0.7 Sidewalk? positive: 0.4? positive: 0.8 Builder codelets: Ask HMAX to compute C2 features using prototypes specific to the object (or scene), and send them to corresponding SVM. If positive, decide to build structure with probability equal to SVM confidence. Break competing structures if necessary.

Illustration of what we plan to have happen not a real run of Petacat Outdoors Builder codelets: Ask HMAX to compute object-/scene-specific C2 features, and send them to corresponding SVM. If positive, decide to build structure with probability equal to SVM confidence. Break competing structures if necessary.

indoors has location taking a dog for a walk outdoors Object road a beach a grass a sidewalk is on person is in front of is touching leash dog is touching is on Action Spatial Relation trail has action runs has action stands walks drives sits belt flies swims rope string horse cat is on is behind is touching is next to has location is in front of

Illustration of what we plan to have happen not a real run of Petacat Person? Outdoors Leash? Leash? Person?? Sidewalk?

Illustration of what we plan to have happen not a real run of Petacat Outdoors Person Strength: 0.6 Sidewalk

Illustration of what we plan to have happen not a real run of Petacat Outdoors Sidewalk

indoors has location taking a dog for a walk outdoors Object road a beach a grass a sidewalk is on person is in front of is touching leash dog is touching is on Action Spatial Relation trail has action runs has action stands walks drives sits belt flies swims rope string horse cat is on is behind is touching is next to has location is in front of

Illustration of what we plan to have happen not a real run of Petacat Leash? Outdoors Leash?? Rope?? Sidewalk? Sidewalk

Illustration of what we plan to have happen not a real run of Petacat Leash Outdoors (weak) Sidewalk

Illustration of what we plan to have happen not a real run of Petacat Leash Outdoors (strong) (weak) Sidewalk

Illustration of what we plan to have happen not a real run of Petacat Leash Outdoors Sidewalk

indoors has location taking a dog for a walk outdoors Object road a beach a grass a sidewalk is on person is in front of is touching leash dog is touching is on Action Spatial Relation trail has action runs has action stands walks drives sits belt flies swims rope string horse cat is on is behind is touching is next to has location is in front of

Illustration of what we plan to have happen not a real run of Petacat is next to is in front of Leash Outdoors is next to is in front of group Sidewalk Once objects begin to be built, relation and grouping codelets can run on them.

Illustration of what we plan to have happen not a real run of Petacat is next to Leash Outdoors is next to group Sidewalk Once objects begin to be built, relation and grouping codelets can run on them.

Illustration of what we plan to have happen not a real run of Petacat is next to Leash Outdoors is next to is next to group Sidewalk

How codelets decide where to look System starts out with weak segmentation (e.g., normalized cuts algorithm)

How codelets decide where to look System starts out with weak segmentation (e.g., normalized cuts algorithm) System creates heat maps for location and scale of objects in general (at each pixel, probability of finding an object at this location and at a particular height/width of bounding box. + + + +

How codelets decide where to look System starts out with weak segmentation (e.g., normalized cuts algorithm) System creates heat maps for location and scale of objects in general (at each pixel, probability of finding an object at this location and at a particular height/width of bounding box. Object scout codelets choose location and + + + + scale probabilisitically from these heat maps.

How codelets decide where to look When codelets look for individual object categories (e.g., dog), objectspecific heat maps are created + + Person heat map

How codelets decide where to look When codelets look for individual object categories (e.g., dog), objectspecific heat maps are created Person? Person? As codelets build structure, heat maps are continually updated to reflect prior (learned) expectations about location and scale as a function of location and scale of built objects (as well as original weak segmentation). + + Person heat map

Illustration of what we plan to have happen not a real run of Petacat How Petacat makes a final decision taking a dog for a walk is next to Leash Outdoors is next to is next to group Sidewalk Temperature

Illustration of what we plan to have happen not a real run of Petacat How Petacat makes a final decision taking a dog for a walk is next to Leash Outdoors is next to is next to group Sidewalk Temperature Situation codelet is more likely to run when temperature is low.

Illustration of what we plan to have happen not a real run of Petacat is next to Leash Outdoors is next to Situation codelet tries to match prototypical situation with existing workspace structures, possibly allowing slippages. is next to group Sidewalk

is next to taking a dog for a walk Leash person is in front of leash has location Outdoors outdoors is next to dog Situation codelet tries to match prototypical situation with existing workspace structures, possibly allowing slippages. is next to group Sidewalk

is next to taking a dog for a walk person Leash is in front of leash has location Outdoors outdoors is next to dog is next to is next to group group Sidewalk

taking a dog for a walk If resulting temperature is low enough, is next to classify scene as positive person Leash is in front of leash has location Outdoors outdoors is next to dog is next to is next to group group Sidewalk

taking a dog for a walk If resulting temperature is low enough, is next to classify scene as positive person Leash is in front of leash is next to has location Outdoors is next to If situation codelet fails enough times is next or to does not run for a long time, group Sidewalk program has increasing chance of ending with negative classification. group outdoors dog

If Petacat classifies the picture as positive, the temperature at the end of the run gives a measure of how good an instance the picture is (e.g., of the dog walking situation).

Summary:

Summary: How does Petacat avoid exhaustive search?

Summary: How does Petacat avoid exhaustive search? Recall Streetscenes system, which, given an image, does exhaustive search over: Window size and location in the image C1, C2 features in windows Object categories (e.g., car, pedestrian, tree, etc.)

Summary: How does Petacat avoid exhaustive search? Recall Streetscenes system, which, given an image, does exhaustive search over: Window size and location in the image In Petacat, codelets choose window size and location based on learned expectations and perceived context, with probabilities continually changing as more information is obtained C1, C2 features in windows Object categories (e.g., car, pedestrian, tree, etc.)

Summary: How does Petacat avoid exhaustive search? Recall Streetscenes system, which, given an image, does exhaustive search over: Window size and location in the image In Petacat, codelets choose window size and location based on learned expectations and perceived context, with probabilities continually changing as more information is obtained C1, C2 features in windows Codelets request C2 features only in relevant windows, and request only C2 features that are relevant to what the codelet is looking for. Object categories (e.g., car, pedestrian, tree, etc.)

Summary: How does Petacat avoid exhaustive search? Recall Streetscenes system, which, given an image, does exhaustive search over: Window size and location in the image In Petacat, codelets choose window size and location based on learned expectations and perceived context, with probabilities continually changing as more information is obtained C1, C2 features in windows Codelets request C2 features only in relevant windows, and request only C2 features that are relevant to what the codelet is looking for. Object categories (e.g., car, pedestrian, tree, etc.) Codelets look for object categories that are activated by context, based on prior expectations and currently perceived information.

Summary: How does Petacat avoid exhaustive search? Petacat effects a parallel terraced scan (Hofstadter, 1995): Codelets build structures at a rate (urgency) based on their perceived promise, which is continually updated as new information is perceived. Temperature allows this (continually changing) rate to depend on the global state of the system.

Relation to neuroscience/psychophysics Gilbert & Sigman (2007): Emphasis of role to top-down processing in vision. V1 and V2 may work as active blackboards that integrate and sustain the result of computations performed in higher areas. Kahneman, Triesman, and Gibbs (1992): Notion of object files : temporary and modifiable perceptual structures, created on the fly in working memory, which interact with a permanent network of concepts. Churchland, Ramachandran, and Sejnowski: Theory of interactive vision Treisman and colleagues: Shift between parallel, random, preattentive bottom-up processing and more deterministic, focused, serial, attentive top-down processing.

Does Petacat understand pictures?

Does Petacat understand pictures? Understanding (MM s defintion): - Ability to appropriately use one s knowledge and make appropriate conceptual slippages in a wide variety of environments/contexts. - Ability to use one s existing concepts to learn new concepts