CS 287 Lecture 24 (Fall 2019) Autonomous Helicopter Flight Pieter - - PowerPoint PPT Presentation

▶

Jun 12, 2023 144 likes •497 views

CS 287 Lecture 24 (Fall 2019) Autonomous Helicopter Flight Pieter Abbeel UC Berkeley EECS Challenges in Helicopter Control n Unstable n Nonlinear n Complicated dynamics n Air flow n Coupling n Blade dynamics n Noisy estimates of position,

SLIDE 1

CS 287 Lecture 24 (Fall 2019) Autonomous Helicopter Flight

Pieter Abbeel UC Berkeley EECS

SLIDE 2

SLIDE 3

SLIDE 4

n Unstable n Nonlinear n Complicated dynamics

n Air flow n Coupling n Blade dynamics

n Noisy estimates of position, orientation, velocity, angular rate

(and perhaps blade and engine speed)

Challenges in Helicopter Control

SLIDE 5

n Just a few examples:

Bagnell & Schneider, 2001;

LaCivita, Papageorgiou, Messner & Kanade, 2002;

Ng, Kim, Jordan & Sastry 2004a (2001); Ng et al., 2004b;

Roberts, Corke & Buskey, 2003;

Saripalli, Montgomery & Sukhatme, 2003;

Shim, Chung, Kim & Sastry, 2003;

Doherty et al., 2004;

Gavrilets, Martinos, Mettler and Feron, 2002.

n Varying control techniques: inner/outer loop PID with hand or

automatic tuning, H1, LQR, …

Success Stories: Hover and Forward Flight

SLIDE 6

[Ng, Coates, Tse, et al, 2004]

SLIDE 7

Alan Szabo – Sunday at the Lake

SLIDE 8

One of our first attempts at autonomous flips [using similar methods to what worked for ihover]

Target trajectory: meticulously hand-engineered Model: from (commonly used) frequency sweeps data

SLIDE 9

Hover / stationary flight regimes:

n Restrict attention to specific flight regime n Extensive data collection = collect control inputs, position, orientation,

velocity, angular rate

n Build model + model-based controller

à Successful autonomous flight. n

Aggressive flight maneuvers --- additional challenges:

n Task description: What is the target trajectory? n Dynamics model: How to obtain accurate model?

Stationary vs. Aggressive Flight

SLIDE 10

n Gavrilets, Martinos, Mettler and Feron, 2002

n 3 maneuvers: split-S, snap axial roll, stall-turn n Key: Expert engineering of controllers after human pilot demonstrations

Aggressive, Non-Stationary Regimes

SLIDE 11

Sunday in Open Loop

SLIDE 12

n Our work:

n Key: Automatic engineering of controllers after human pilot

demonstrations through machine learning

n Wide range of aggressive maneuvers n Maneuvers in rapid succession

Aggressive, Non-Stationary Regimes

SLIDE 13

n Learning a target trajectory n Learning a dynamics model n Autonomous flight results

Learning Dynamic Maneuvers

SLIDE 14

n Difficult to specify by hand:

n Required format: position + orientation over time n Needs to satisfy helicopter dynamics

n Our solution:

n Collect demonstrations of desired maneuvers n Challenge: extract a clean target trajectory from many

suboptimal/noisy demonstrations

Target Trajectory

Abbeel, Coates, Ng, IJRR 2010

SLIDE 15

Expert Demonstrations

SLIDE 16

HMM-like generative model

–

Dynamics model used as HMM transition model

–

Demos are observations of hidden trajectory

Problem: how do we align observations to hidden trajectory?

Learning a Trajectory

Demo 1 Demo 2 Hidden

Abbeel, Coates, Ng, IJRR 2010

SLIDE 17

n Dynamic Time Warping (Needleman&Wunsch 1970,

Sakoe&Chiba, 1978)

n Extended Kalman filter / smoother

Learning a Trajectory

Demo 1 Demo 2 Hidden

Abbeel, Coates, Ng, IJRR 2010

SLIDE 18

Results: Time-Aligned Demonstrations

§ White helicopter is inferred “intended” trajectory.

SLIDE 19

Results: Loops

Even without prior knowledge, the inferred trajectory is much closer to an ideal loop.

Abbeel, Coates, Ng, IJRR 2010

SLIDE 20

n Learning a target trajectory n Learning a dynamics model n Autonomous flight results

Learning Dynamic Maneuvers

SLIDE 21

Standard Modeling Approach

Abbeel, Coates, Ng, IJRR 2010

3G error!

SLIDE 22

Key Observation

Errors observed in the “baseline” model are clearly consistent after aligning demonstrations.

Abbeel, Coates, Ng, IJRR 2010

SLIDE 23

n If we fly the same trajectory repeatedly, errors are consistent

ver time once we align the data.

n There are many unmodeled variables that we can’t expect our model to

capture accurately.

n Air (!), actuator delays, etc.

n If we fly the same trajectory repeatedly, the hidden variables tend to be

the same each time. ~ muscle memory for human pilots

Key Observation

Abbeel, Coates, Ng, IJRR 2010

SLIDE 24

n Learn locally-weighted model from aligned demonstrations

n Since data is aligned in time, we can weight by time to

exploit repeatability of unmodeled variables.

n For model at time t: n Obtain a model for each time t into the maneuver by

running weighted regression for each time t

Trajectory-Specific Local Models

Abbeel, Coates, Ng, IJRR 2010

SLIDE 25

n Learning a target trajectory n Learning a dynamics model n Autonomous flight results

Learning Dynamic Maneuvers

Abbeel, Coates, Ng, IJRR 2010

SLIDE 26

Experimental Setup

Microstrain 3DM-GX1 @333Hz RPM sensor @20-30Hz Sonar Offboard Cameras 1280x960@20Hz Extended Kalman Filter RHDDP controller Controls @ 20Hz “Position” 3-axis magnetometer, accelerometer, gyroscope (“Orientation”)

Abbeel, Coates, Quigley, Ng, NIPS 2007

SLIDE 27

1. Collect sweeps to build a baseline dynamics model
2. Our expert pilot demonstrates the airshow several times.
3. Learn a target trajectory.
4. Learn a dynamics model.
5. Find the optimal control policy for learned target and

dynamics model.

6. Autonomously fly the airshow
7. Learn an improved dynamics model. Go back to step 4.

à Learn to fly new maneuvers in < 1hour.

Experimental Procedure

Abbeel, Coates, Ng, IJRR 2010

SLIDE 28

Results: Autonomous Airshow

SLIDE 29

Results: Flight Accuracy

SLIDE 30

Autonomous Autorotation Flights

Abbeel, Coates, Hunter, Ng, ISER 2008

SLIDE 31

Chaos [“flip/roll” parameterized by yaw rate]

SLIDE 32

SLIDE 33

Behind the scenes

SLIDE 34

CS 287 Lecture 24 (Fall 2019) Autonomous Helicopter Flight

(and perhaps blade and engine speed)

Challenges in Helicopter Control

automatic tuning, H1, LQR, …

Success Stories: Hover and Forward Flight

[Ng, Coates, Tse, et al, 2004]

Alan Szabo – Sunday at the Lake

One of our first attempts at autonomous flips [using similar methods to what worked for ihover]

Target trajectory: meticulously hand-engineered Model: from (commonly used) frequency sweeps data

Hover / stationary flight regimes:

velocity, angular rate

Aggressive flight maneuvers --- additional challenges:

Stationary vs. Aggressive Flight

Aggressive, Non-Stationary Regimes

Sunday in Open Loop

demonstrations through machine learning

Aggressive, Non-Stationary Regimes

Learning Dynamic Maneuvers

suboptimal/noisy demonstrations

Target Trajectory

Expert Demonstrations

Dynamics model used as HMM transition model

Demos are observations of hidden trajectory

Learning a Trajectory

Demo 1 Demo 2 Hidden

Sakoe&Chiba, 1978)

Learning a Trajectory

Demo 1 Demo 2 Hidden

Results: Time-Aligned Demonstrations

§ White helicopter is inferred “intended” trajectory.

Results: Loops

Even without prior knowledge, the inferred trajectory is much closer to an ideal loop.

Learning Dynamic Maneuvers

Standard Modeling Approach

3G error!

Key Observation

Errors observed in the “baseline” model are clearly consistent after aligning demonstrations.

capture accurately.

the same each time. ~ muscle memory for human pilots

Key Observation

exploit repeatability of unmodeled variables.

running weighted regression for each time t

Trajectory-Specific Local Models

Learning Dynamic Maneuvers

Experimental Setup

dynamics model.

à Learn to fly new maneuvers in < 1hour.

Experimental Procedure

Results: Autonomous Airshow

Results: Flight Accuracy

Autonomous Autorotation Flights

Chaos [“flip/roll” parameterized by yaw rate]

Behind the scenes

Thank you!