CS 287 Lecture 24 (Fall 2019) Autonomous Helicopter Flight Pieter - - PowerPoint PPT Presentation

cs 287 lecture 24 fall 2019 autonomous helicopter flight
SMART_READER_LITE
LIVE PREVIEW

CS 287 Lecture 24 (Fall 2019) Autonomous Helicopter Flight Pieter - - PowerPoint PPT Presentation

CS 287 Lecture 24 (Fall 2019) Autonomous Helicopter Flight Pieter Abbeel UC Berkeley EECS Challenges in Helicopter Control n Unstable n Nonlinear n Complicated dynamics n Air flow n Coupling n Blade dynamics n Noisy estimates of position,


slide-1
SLIDE 1

CS 287 Lecture 24 (Fall 2019) Autonomous Helicopter Flight

Pieter Abbeel UC Berkeley EECS

slide-2
SLIDE 2
slide-3
SLIDE 3
slide-4
SLIDE 4

n Unstable n Nonlinear n Complicated dynamics

n Air flow n Coupling n Blade dynamics

n Noisy estimates of position, orientation, velocity, angular rate

(and perhaps blade and engine speed)

Challenges in Helicopter Control

slide-5
SLIDE 5

n Just a few examples:

n

Bagnell & Schneider, 2001;

n

LaCivita, Papageorgiou, Messner & Kanade, 2002;

n

Ng, Kim, Jordan & Sastry 2004a (2001); Ng et al., 2004b;

n

Roberts, Corke & Buskey, 2003;

n

Saripalli, Montgomery & Sukhatme, 2003;

n

Shim, Chung, Kim & Sastry, 2003;

n

Doherty et al., 2004;

n

Gavrilets, Martinos, Mettler and Feron, 2002.

n Varying control techniques: inner/outer loop PID with hand or

automatic tuning, H1, LQR, …

Success Stories: Hover and Forward Flight

slide-6
SLIDE 6

[Ng, Coates, Tse, et al, 2004]

slide-7
SLIDE 7

Alan Szabo – Sunday at the Lake

slide-8
SLIDE 8

One of our first attempts at autonomous flips [using similar methods to what worked for ihover]

Target trajectory: meticulously hand-engineered Model: from (commonly used) frequency sweeps data

slide-9
SLIDE 9

n

Hover / stationary flight regimes:

n Restrict attention to specific flight regime n Extensive data collection = collect control inputs, position, orientation,

velocity, angular rate

n Build model + model-based controller

à Successful autonomous flight. n

Aggressive flight maneuvers --- additional challenges:

n Task description: What is the target trajectory? n Dynamics model: How to obtain accurate model?

Stationary vs. Aggressive Flight

slide-10
SLIDE 10

n Gavrilets, Martinos, Mettler and Feron, 2002

n 3 maneuvers: split-S, snap axial roll, stall-turn n Key: Expert engineering of controllers after human pilot demonstrations

Aggressive, Non-Stationary Regimes

slide-11
SLIDE 11

Sunday in Open Loop

slide-12
SLIDE 12

n Our work:

n Key: Automatic engineering of controllers after human pilot

demonstrations through machine learning

n Wide range of aggressive maneuvers n Maneuvers in rapid succession

Aggressive, Non-Stationary Regimes

slide-13
SLIDE 13

n Learning a target trajectory n Learning a dynamics model n Autonomous flight results

Learning Dynamic Maneuvers

slide-14
SLIDE 14

n Difficult to specify by hand:

n Required format: position + orientation over time n Needs to satisfy helicopter dynamics

n Our solution:

n Collect demonstrations of desired maneuvers n Challenge: extract a clean target trajectory from many

suboptimal/noisy demonstrations

Target Trajectory

Abbeel, Coates, Ng, IJRR 2010

slide-15
SLIDE 15

Expert Demonstrations

slide-16
SLIDE 16
  • HMM-like generative model

Dynamics model used as HMM transition model

Demos are observations of hidden trajectory

  • Problem: how do we align observations to hidden trajectory?

Learning a Trajectory

Demo 1 Demo 2 Hidden

Abbeel, Coates, Ng, IJRR 2010

slide-17
SLIDE 17

n Dynamic Time Warping (Needleman&Wunsch 1970,

Sakoe&Chiba, 1978)

n Extended Kalman filter / smoother

Learning a Trajectory

Demo 1 Demo 2 Hidden

Abbeel, Coates, Ng, IJRR 2010

slide-18
SLIDE 18

Results: Time-Aligned Demonstrations

§ White helicopter is inferred “intended” trajectory.

slide-19
SLIDE 19

Results: Loops

Even without prior knowledge, the inferred trajectory is much closer to an ideal loop.

Abbeel, Coates, Ng, IJRR 2010

slide-20
SLIDE 20

n Learning a target trajectory n Learning a dynamics model n Autonomous flight results

Learning Dynamic Maneuvers

slide-21
SLIDE 21

Standard Modeling Approach

Abbeel, Coates, Ng, IJRR 2010

3G error!

slide-22
SLIDE 22

Key Observation

Errors observed in the “baseline” model are clearly consistent after aligning demonstrations.

Abbeel, Coates, Ng, IJRR 2010

slide-23
SLIDE 23

n If we fly the same trajectory repeatedly, errors are consistent

  • ver time once we align the data.

n There are many unmodeled variables that we can’t expect our model to

capture accurately.

n Air (!), actuator delays, etc.

n If we fly the same trajectory repeatedly, the hidden variables tend to be

the same each time. ~ muscle memory for human pilots

Key Observation

Abbeel, Coates, Ng, IJRR 2010

slide-24
SLIDE 24

n Learn locally-weighted model from aligned demonstrations

n Since data is aligned in time, we can weight by time to

exploit repeatability of unmodeled variables.

n For model at time t: n Obtain a model for each time t into the maneuver by

running weighted regression for each time t

Trajectory-Specific Local Models

Abbeel, Coates, Ng, IJRR 2010

slide-25
SLIDE 25

n Learning a target trajectory n Learning a dynamics model n Autonomous flight results

Learning Dynamic Maneuvers

Abbeel, Coates, Ng, IJRR 2010

slide-26
SLIDE 26

Experimental Setup

Microstrain 3DM-GX1 @333Hz RPM sensor @20-30Hz Sonar Offboard Cameras 1280x960@20Hz Extended Kalman Filter RHDDP controller Controls @ 20Hz “Position” 3-axis magnetometer, accelerometer, gyroscope (“Orientation”)

Abbeel, Coates, Quigley, Ng, NIPS 2007

slide-27
SLIDE 27
  • 1. Collect sweeps to build a baseline dynamics model
  • 2. Our expert pilot demonstrates the airshow several times.
  • 3. Learn a target trajectory.
  • 4. Learn a dynamics model.
  • 5. Find the optimal control policy for learned target and

dynamics model.

  • 6. Autonomously fly the airshow
  • 7. Learn an improved dynamics model. Go back to step 4.

à Learn to fly new maneuvers in < 1hour.

Experimental Procedure

Abbeel, Coates, Ng, IJRR 2010

slide-28
SLIDE 28

Results: Autonomous Airshow

slide-29
SLIDE 29

Results: Flight Accuracy

slide-30
SLIDE 30

Autonomous Autorotation Flights

Abbeel, Coates, Hunter, Ng, ISER 2008

slide-31
SLIDE 31

Chaos [“flip/roll” parameterized by yaw rate]

slide-32
SLIDE 32
slide-33
SLIDE 33

Behind the scenes

slide-34
SLIDE 34

Thank you!