Neural Control and Learning for Versatile, Adaptive, Autonomous Behavior of Walking Machines

Similar documents
RECOGNIZING that, to date, most research in the domain

Supplementary information accompanying the manuscript. Self-organized adaptation of simple neural circuits enables complex robot behavior

INITIATING NORMAL WALKING OF A DYNAMIC BIPED WITH A BIOLOGICALLY MOTIVATED CONTROL

Dynamically stepping over large obstacle utilizing PSO optimization in the B4LC system

Adaptive Locomotion Controller for a Quadruped Robot

Kungl Tekniska Högskolan

Centre for Autonomous Systems

Decentralized Autonomous Control of a Myriapod Locomotion Robot

Using sensory feedback to improve locomotion performance of the salamander robot in different environments

Improvement of the Cheetah Locomotion Control

arxiv: v1 [cs.ai] 11 Jul 2014

Motion Control of a Bipedal Walking Robot

Reservoir-based Online Adaptive Forward Models with Neural Control for Complex Locomotion in a Hexapod Robot

Contents. Sensor-Driven Neural Control for Omnidirectional Locomotion and Versatile Reactive Behaviors of Walking Machines.

GOLFER. The Golf Putting Robot

Robot motion by simultaneously wheel and leg propulsion

Toward a Human-like Biped Robot with Compliant Legs

From Passive to Active Dynamic 3D Bipedal Walking - An Evolutionary Approach -

EVOLVING HEXAPOD GAITS USING A CYCLIC GENETIC ALGORITHM

Kenzo Nonami Ranjit Kumar Barai Addie Irawan Mohd Razali Daud. Hydraulically Actuated Hexapod Robots. Design, Implementation. and Control.

Efficient Gait Generation using Reinforcement Learning

Body Stabilization of PDW toward Humanoid Walking

REPORT DOCUMENTATION PAGE

Sensing and Modeling of Terrain Features using Crawling Robots

A Bio-inspired Behavior Based Bipedal Locomotion Control B4LC Method for Bipedal Upslope Walking

LOCOMOTION CONTROL CYCLES ADAPTED FOR DISABILITIES IN HEXAPOD ROBOTS

Controlling Walking Behavior of Passive Dynamic Walker utilizing Passive Joint Compliance

Keywords--Bio-Robots, Walking Robots, Locomotion and Stability Controlled Gait.

ZSTT Team Description Paper for Humanoid size League of Robocup 2017

Development and Test of a Model for the Cheetah Robot

The Incremental Evolution of Gaits for Hexapod Robots

Development of Fish type Robot based on the Analysis of Swimming Motion of Bluefin Tuna Comparison between Tuna-type Fin and Rectangular Fin -

Robotics and Autonomous Systems

ABSTRACT 1 INTRODUCTION

Robotics and Autonomous Systems

PERCEPTIVE ROBOT MOVING IN 3D WORLD. D.E- Okhotsimsky, A.K. Platonov USSR

Spring Locomotion Concepts. Roland Siegwart, Margarita Chli, Martin Rufli. ASL Autonomous Systems Lab. Autonomous Mobile Robots

Humanoid Robots and biped locomotion. Contact: Egidio Falotico

Today: the first activity system. Neural Control of Movement LOCOMOTION GAIT DESCRIPTIONS. Review: that amazing spinal cord! What we do: gait patterns

Supplementary Figure 1 An insect model based on Drosophila melanogaster. (a)

The Influence of Friction on Gait and Energy Efficiency of the Walking Robot Based on Rhythmic Control

Robot Phase Entrainment on Quadruped CPG Controller

Open Research Online The Open University s repository of research publications and other research outputs

YAN GU. Assistant Professor, University of Massachusetts Lowell. Frederick N. Andrews Fellowship, Graduate School, Purdue University ( )

Creation of a Fallback Catch Method. Megan Berry Mechanical Engineering Senior MAE 490 (4 credits)

Robot Walking with Genetic Algorithms

ZMP Trajectory Generation for Reduced Trunk Motions of Biped Robots

Control principles for locomotion looking toward biology

The development of a biomechanical leg system and its neural control

Spider Robot for Motion with Quasistatic. Force Constraints

Jurassic Chicken: An Avian Bipedal Robot 2001 Florida Conference on Recent Advances in Robotics May 10-11, 2001, FAMU-FSU College of Engineering

Computer Aided Drafting, Design and Manufacturing Volume 26, Number 2, June 2016, Page 53. The design of exoskeleton lower limbs rehabilitation robot

,WHUDWLYH3URGXFW(QJLQHHULQJ(YROXWLRQDU\5RERW'HVLJQ

Passive compliant quadruped robot using central pattern generators for locomotion control

Walking Experiment of Biped Robot with Antagonistic Actuation Using Non-Linear Spring

SIMULATION AND EXPERIMENTAL STUDIES ON WALKING ROBOT WITH FLEXIBLE LEGS

ITF Coaches Education Programme Biomechanics of the forehand stroke

Programming Self-Recovery in the humanoid Leong Ti Xean 1 Yap Kian Tiong 2

Gait. Kinesiology RHS 341 Lecture 12 Dr. Einas Al-Eisa

In this course you will learn the following

Walking and Running BACKGROUND REVIEW. Planar Pendulum. BIO-39 October 30, From Oct. 25, Equation of motion (for small θ) Solution is

intended velocity ( u k arm movements

LEGGED animals are capable of adjusting their leg stiffness

A Development of Mechanical Arm for Weight Scale Calibration

A Neuromuscular Model of Human Locomotion and its Applications to Robotic Devices

Written Report of Robot Design

An investigation of kinematic and kinetic variables for the description of prosthetic gait using the ENOCH system

Design, Fabrication and Analysis of Microcontroller Based Bipedal Walking Robot Vaidyanathan.V.T 1 and Sivaramakrishnan.R 2

Dynamic locomotion of a biomorphic quadruped Tekken robot using various gaits: walk, trot, free-gait and bound


Gait Analysis of a Little Biped Robot. Received May 2015; accepted July 2015

Transformation of nonfunctional spinal circuits into functional states after the loss of brain input

the world s most advanced humanoid robot

Locomotion Concepts. Autonomous Mobile Robots. Concepts Legged Locomotion Wheeled Locomotion. Autonomous Systems Lab. Zürich. Localization.

Learning Energy Efficient Walking Based on Ballistics

Biomimetic Walking Robot Scorpion: Control and Modeling

Learning Energy Efficient Walking with Ballistic Walking

Biomechanics and Models of Locomotion

INSTANTANEOUS ON-LINE MODIFICATION OF BIPED WALK COMPOSED FROM RECONFIGURABLE ADAPTIVE MOTION PRIMITIVES

Sample Solution for Problem 1.a

BIOLOGICALLY INSPIRED QUADRUPED ROBOT BIOSBOT: MODELING, SIMULATION AND EXPERIMENT

Toward a Human-like Biped Robot with Compliant Legs

Generation of adaptive splitbelt treadmill walking by a biped robot using nonlinear oscillators with phase resetting

Coaching the Triple Jump Boo Schexnayder

Emergent walking stop using 3-D ZMP modification criteria map for humanoid robot

The Itsy-Bitsy Spider: An Analysis of Spider Locomotion

In memory of Dr. Kevin P. Granata, my graduate advisor, who was killed protecting others on the morning of April 16, 2007.

Mobile Robots (Legged) (Take class notes)

Outline. Outline. Recent Popular Legged Robots in Japan. Self-contained! Adaptive? Limit Cycle Based. What is legged locomotion?

Fail Operational Controls for an Independent Metering Valve

Proof Copy. Controlling the Walking Period of a Pneumatic Muscle Walker. Takashi Takuma Koh Hosoda. Abstract. 1. Introduction

Evolving Gaits for the Lynxmotion Hexapod II Robot

The Discus. By Al Fereshetian. Nature of the Event

Stabilizing Walking Gaits using Feedback from Gyroscopes

Toward a human-like biped robot with compliant legs

Gait Evolution for a Hexapod Robot

USA Track & Field Heptathlon Summit- November

Dynamic analysis and motion measurement of ski turns using inertial and force sensors

Chapter 9. Insect observations and hexapod design

A CPG-based Decentralized Control of a Quadruped Robot Inspired by True Slime Mold

Transcription:

Neural Control and Learning for Versatile, Adaptive, Autonomous Behavior of Walking Machines Poramate Manoonpong 1 and Florentin Wörgötter 2 Bernstein Center for Computational Neuroscience (BCCN), University of Göttingen, Germany 1 poramate@nld.ds.mpg.de, 2 worgott@nld.ds.mpg.de Abstract This article presents two different types of walking machines: an insect-like robot and a biped robot which have been developed during last years. Both walking machines are attractive in the way that they now combine three key aspects: versatility, adaptivity, and autonomy. Versatility in this sense means a variety of reactive behaviors, while adaptivity implies to online learning capabilities, and autonomy is an ability to function without continuous human guidance. These three key elements are achieved under neural control and an online learning mechanism. In addition, this contribution will point out that such control technique is shown to be a power method of solving sensor-motor coordination problems of high complexity systems. 1. Introduction Living creatures like walking animals and humans have found fascinating solutions for the problem of legged locomotion in the way that their movements give the impression of elegance and smoothness. They are able to move around not only on flat terrains but also on rough terrains and even to perform a variety of walking behaviors. Furthermore, they can adapt themselves to environmental changes in order to survive. Neurophysiological and ethological studies have revealed that solving such tasks basically results from coupling appropriate biomechanics [1] with neural control [2]. For instance, walking animals (e.g., insects, cats) can walk forward, backward, and in lateral directions and perform self-stabilization to minor disturbances (stumbling) because of their appropriate biomechanical design. Neural control, on the other hand, plays a role in generating different walking behaviors as well as adaptivity. Therefore, during the last few decades several roboticists have begun actively to look to the biological sciences for the constructions (biomechanics) and the controller (neural control) design of robotic systems in particular walking machines to approach the living creatures in their levels of performance. The diverse researches in the domain of biologicallyinspired walking machines have been ongoing for over 20 years [3, 4]. Most of them have focused on the mechanical design to have animal-like properties and perform efficient locomotion [5]. Others have concentrated on the generation of locomotion based on engineering technologies [6] as well as biological principles [7]. While impressive in their own right, the versatility (behavioral repertoire) of these systems is much smaller. Typically they are not adaptive (learning capabilities) and most of them still fail to be autonomous (function without continuous human guidance). From this point of view, during last years we have developed two different types of walking machines (an insect-like robot [8, 9] and a biped robot [10]) where they now combine the three key aspects (versatility, adaptivity, and autonomy) under neural sensor-motor control and an online learning mechanism. The description of the development of each walking machine system and its performance is presented in the following. 2. AMOS: Insect-Like Hexapod AMOS is an advanced mobility sensor-driven walking device (see Fig. 1a). It consists of a two-part body, at which six identical legs and one tail are attached. Each leg has three joints (three degrees of freedom) controlled by analog servo motors: the thoraco-coxal joint enables forward (+) and backward ( ) movements, the coxa-trochanteral joint enables elevation (+) and depression ( ) of the leg, and the femur-tibia joint enables extension (+) and flexion ( ) of the tibia [9]. The morphology of these multi-jointed legs is modeled on the basis of a cockroach leg [8]. Each tibia contains a spring compliant element to absorb impact force as well as to measure ground contact during walking. The body of AMOS consists of two segments: a front segment where two forelegs are installed and a central body seg-

a b Adaptive neural control Sensors Sensor-driven neural control Neural preprocessing Modular neural control Motors Environment Figure 1. a, The physical insect-like hexapod AMOS (see [9] for the details of the location of the sensors on AMOS). b, The diagram of the neural controller of AMOS. The controller acts as an artificial perception-action system; i.e., the sensor signals are passed through the neural preprocessing unit into the modular neural control unit which directly drives the actuators. In addition, the adaptive neural control functions as high level control. It is used for learning capability. As a result, the hexapod s behavior is generated by the interaction with its physical environment in a sensorimotor loop. ment where the two middle and the two hind legs are attached. They are connected by one active backbone joint inspired by the invertebrate morphology of the American cockroach s trunk [8]. This backbone joint driven by a digital servo motor is for up- and downward bending, which allows the walking machine to climb over obstacles. Moreover, a tail with two degrees of freedom, rotating around a horizontal and a vertical axis, is implemented in the center on the back of the trunk. On this actively moveable tail, which can be manually controlled, a mini wireless camera is installed for monitoring the environment while the machine is walking. This walking machine has a multitude of sensors: six foot contact sensors, six reflexive optical sensors, eight infra-red sensors, two light dependent resistor sensors, one upside-down detector sensor, one gyro sensor, one inclinometer sensor, and one auditory-wind detector sensor. The hexapod receives 26 sensory inputs, and controls 19 motors to achieve a broad behavioral repertoire including foothold searching, elevator reflex (swinging a leg over obstacles), self-protective reflex (standing in an upside-down position), obstacle avoidance, auditory- and wind-evoked escape responses, phototaxis (turn towards a light source), climbing over obstacles, and five different gaits. These complex autonomous behaviors are generated through a so-called sensor-driven neural controller consisting of a neural preprocessing unit and a modular neural control unit (Fig. 1b). The neural preprocessing unit filters sensory noise, combines, and shapes sensory data to drive corresponding behaviors. On the other hand, the modular neural control unit is used for locomotive generation. It consists of three subordinate networks or modules: a neural oscillator network, two velocity regulating networks (VRNs), and a phase switching network (PSN). A simple neural oscillator network serves as a central pattern generator [11] (CPG) producing the basic rhythmic leg movements and regulates walking speed. Other modules, like the velocity regulating and the phase switching networks, enhance the walking capability of the machine to walk in omnidirection. Furthermore, adaptive neural control using a correlation based differential Hebbian learning rule (see Section 4 and [12] for details) has been implemented. It allows the hexapod to learn to respond to a conditioned stimulus, e.g., predator-recognition learning. As a result, all reactive and adaptive behaviors of the hexapod are accomplished by interacting with a physical environment through a sensorimotor loop (see [8, 9] for more details). The results of the real robot walking experiments can be seen as video clips at http://www.nld.ds.mpg.de/ poramate/amoswd06.html. 3. RunBot: Planar Dynamic Biped RunBot is a planar dynamic biped robot (see Fig. 2a). It consists of four actuated joints: left hip, right hip, left knee and right knee. Each joint is driven by a modified servo motor where the built-in pulse width modulation (PWM) control circuit is disconnected, while its built-in potentiometer is used to measure the joint angles. RunBot has no actuated ankle joints, resulting in very light feet and efficiency for fast walking. Its feet were designed having a small circular form (4.5 cm long). Each foot is equipped with a switch sensor to detect ground contact events. A mechanical stop-

a b Adaptive reflex neural control Adaptive neural control Reflexive neural control Sensors Bimechanical Setup of RunBot Motors Environment Figure 2. a, The planar dynamic biped robot RunBot (see [10] for the details of the location of the sensors on RunBot). b, The diagram of the neural controller of RunBot where the adaptive network is implemented on top as high level control to modulate the reflexive networks (lower level) through learner neurons (not shown but see [10] for details). Accordingly, the adaptive dynamic walking behavior of RunBot is achieved by the interaction with its physical environment in a sensorimotor loop. per is implemented on each knee joint to prevent it from going into hyperextension. Approximately seventy percent of the robot s weight is concentrated on its trunk and the parts of the trunk are assembled in a way that its center of mass is located forward of the hip axis. In addition, it has an upper body component, which can be actively moved to shift the center of mass backward or forward for walking on different terrains, e.g., level floor versus up or down a ramp. It leans backward during walking on a level floor (see Fig. 2a) and this position is also suitable for walking down a ramp while it will lean forward (reflex action) when RunBot falls backward or after it successfully learned to walk up a ramp. The corresponding reflex is controlled by an accelerometer sensor (AS). The AS is installed on top of the right hip joint. Additionally, one infrared (IR) sensor is implemented at the front part of RunBot pointing downwards to detect a ramp. These IR and AS sensory signals are used for adaptive control (see [10] for details). As described here, the biomechanical design of RunBot has the following special features that distinguish it from other powered biped robots and that facilitate high-speed walking and exploitation of natural dynamics [10]: (a) small, curved feet allowing for rolling action; (b) unactuated, hence light, ankles; (c) lightweight structure; (d) light and fast motors; (e) proper mass distribution of the limbs; and (f) properly positioned mass center of the trunk. Utilizing all these properties, RunBot can perform self-stabilization of gaits [10] and it also exhibits passive walking characteristics [10] reflected by the fact that during one quarter of its step cycle all motor voltages remain zero. RunBot s locomotion is driven by adaptive reflex neural control (see Fig. 2b) which doesn t employ any trajectory control. It consists of two main circuits: adaptive and reflexive neural control circuits including signal preprocessing. The reflexive neural control based on reflex mechanisms uses proprioceptor signals (sensory feedback) coming from ground contact sensors, stretch receptors, and joint angle sensors to generate dynamic stable gaits while its AS sensor is used to trigger the body reflex. In the adaptive neural control where the correlation based differential Hebbian learning rule (see Section 4 and [12] for details) is applied, it serves for gait and posture adaptation during walking up a ramp. As a consequence, through the tight coupling of biomechanics with neural control, RunBot can autonomously walk with a high speed (> 3.0 leg length/s), self-adapting to minor disturbances, and reacting in a robust way to abruptly induced gait changes [10]. At the same time, it can learn walking on different terrains, requiring only few learning experiences [10]. The results of the real robot walking experiments can be seen as video clips at http://www.nld.ds.mpg.de/ poramate/runbot.html. 4. Learning Algorithm It is known that neurons can change their synaptic strength according to the order of the arriving inputs. That

is, if a predictive input u 1 (conditioned stimulus (CS), see Fig. 3) is followed by a reflex input u 0 (unconditioned stimulus (US), see Fig. 3), the plastic synapse of the predictive input gets strengthened ρ 1 but it will get weakened if the order is reversed. Hence, this form of plasticity depends on the timing of correlated neural signals (STDP, spike timingdependent plasticity). This rule will lead to weight stabilization as soon as u 0 = 0 [12], meaning that the reflex has successfully been avoided. As a result, we obtain behavioral and synaptic stability at the same time without any additional weight-control mechanisms. In this learning rule (see Fig. 3), only the plastic synapse ρ 1 is allowed to change while the synapse of the reflex input ρ 0 is set to a positive value, e.g., 1.0. u 1 u 0 CS US ñ 1 X ñ 0 d/dt v Figure 3. Correlation based differential Hebbian learning mechanism. In the terminology of conditioning, CS = conditioned stimulus, US = unconditioned stimulus. Formally, we have v = ρ 0 u 0 + ρ 1 u 1 (1) as the neuron output driven by inputs (u 0, u 1 ). The plastic synapse ρ 1 gets changed by differential Hebbian learning using the cross-correlation between both inputs u 0 and u 1. It is given by: dρ 1 dt = µu du 0 1 dt. (2) µ is the learning rate which will define how fast a system can learn. 5. Conclusions Taken together, this article gives two main contributions. On the one hand, it shows that neural control and learning can be successful for complex sensori-motor control problems in systems with a large numbers of sensors and motors. On the other hand, it guides that the biologically-inspired walking machines are fascinating technology to study with respect to their biomechanical design including sensor and actuator systems as well as the realizations of control concepts. For example, they can serve as scientific tools for better understanding and solving the sensorimotor coordination problems of many degrees of freedom, for performing experiments with neural controllers, and for the development of versatile artificial perception-action systems. In particular, the biped walking robot can serve as an experimental device in order to understand human walking which is a formidable challenge and which has been addressed through physiological studies as well as robotics research. Moreover, this walking machine technology is shown to be a highly interdisciplinary technology, uniting contributions from several areas as diverse as biology, biomechanics, material science, neuroscience, engineering, and computer science. References [1] M. H. Dickinson, C. T. Farley, R. J. Full, M. A. R. Koehl, R. Kram, S. Lehman, How animals move: An integrative view, Science 288: 100 106, 2000. [2] G. N. Orlovsky, T. G. Deliagina, S. Grillner, Neuronal control of locomotion: From mollusk to man, Oxford University Press, 1999. [3] D. J. Todd, Walking Machines: an introduction to legged robots, Kogan Page, 1985. [4] F. Delcomyn, Insect walking and robotics, Annu Rev Entomol. 49: 51 70, 2004. [5] A. J. Ijspeert, A. Crespi, D. Ryczko, J. M. Cabelguen, From swimming to walking with a salamander robot driven by a spinal cord model, Science 315(5817): 1416 1420, 2007. [6] A. Shkolnik, R. Tedrake, Inverse kinematics for a point-foot quadruped robot with dynamic redundancy resolution, Proceedings of the IEEE International Conference on Robotics and Automation (ICRA): 4331 4336, 2007. [7] H. Kimura, Y. Fukuoka, A. H. Cohen, Adaptive dynamic walking of a quadruped robot on natural ground based on biological concepts, International Journal of Robotics Research 26(5): 475 490, 2007. [8] P. Manoonpong, Neural preprocessing and control of reactive walking machines: Towards versatile artificial perception-action systems, Cognitive Technologies, Springer-Verlag, 2007. [9] P. Manoonpong, F. Pasemann, F. Wörgötter, Sensordriven neural control for omnidirectional locomotion

and versatile reactive behaviors of walking machines, Robotics and Autonomous Systems 56(3): 265 288, 2008. [10] P. Manoonpong, T. Geng, T. Kulvicius, P. Bernd, F. Wörgötter, Adaptive, fast walking in a biped robot under neuronal control and learning, PLoS Computational Biology 3(7): e134, 2007. [11] A. J. Ijspeert, Central pattern generators for locomotion control in animals and robots: A review, Neural Networks 21(4): 642 653, 2008. [12] B. Porr and F. Wörgötter, Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only, Neural Computation 18(6): 1380 1412, 2006.