Objectives RNNs are trained only for limited timesteps Can they - - PowerPoint PPT Presentation

▶

Aug 29, 2023 125 likes •243 views

Understanding and Controlling Memory in RNN D. Haviv, A. Rivkind, O. Barak Network Biology Research Laboratories Technion Israel Institute of Technology Objectives RNNs are trained only for limited timesteps Can they form long term

SLIDE 1

Understanding and Controlling Memory in RNN

D. Haviv, A. Rivkind, O. Barak

Network Biology Research Laboratories Technion – Israel Institute of Technology

SLIDE 2

Objectives

RNNs are trained only for limited timesteps – Can they form long term

memories?

How are these memories (short or long-term) represented as

dynamical objects?

Can these dynamical objects be manipulated to explicitly demand

long term memorization?

SLIDE 3

Task Definition

SLIDE 4

Can RNN Form Long-Term Memories?

SLIDE 5

Slow Point? Fixed Point 20 Timesteps 1000 Timesteps

Can RNN Form Long-Term Memories?

SLIDE 6

! ℎ# = ! ℎ#%& − ∇𝑇 ℎ, 𝐽, -.

/012

Slow-Points and How to Find Them

𝑇 ℎ3, 𝐽 = ℎ34& − ℎ3

5 5

SLIDE 7

Slow-Point Speed Predicts Memory Robustness

SLIDE 8

Fine-tuning with modified loss:

! 𝑀 = 𝑀78 + 𝜇 ;

<∈>

𝑇(ℎ<, 𝐽)

Regularize Speed for Long-Term Memories

SLIDE 9

RNNs can form long term memories, but not all memories are created

equal

Slow-Point speed is quantitatively correlated to memory robustness
We can explicitly demand long-term memorization by regularizing the

hidden-state speed

Key Findings

SLIDE 10

Understanding and Controlling Memory in RNN

Network Biology Research Laboratories Technion – Israel Institute of Technology

Objectives

memories?

dynamical objects?

long term memorization?

Task Definition

Can RNN Form Long-Term Memories?

Slow Point? Fixed Point 20 Timesteps 1000 Timesteps

Can RNN Form Long-Term Memories?

! ℎ# = ! ℎ#%& − ∇𝑇 ℎ, 𝐽, -.

/012

Slow-Points and How to Find Them

𝑇 ℎ3, 𝐽 = ℎ34& − ℎ3

5 5

Slow-Point Speed Predicts Memory Robustness

Fine-tuning with modified loss:

! 𝑀 = 𝑀78 + 𝜇 ;

<∈>

𝑇(ℎ<, 𝐽)

Regularize Speed for Long-Term Memories

equal

hidden-state speed

Key Findings

Poster #2 #258 at Pacific Ballroom Code: https://github.com/DoronHaviv/MemoryRNN Thanks for Listening!