Hermes-A: An Asynchronous NoC Router with Distributed Routing
Julian Pontes Matheus Moreira Fernando Moraes Ney Calazans
Outline
– Introduction
– Related Work
– Architecture
– Input Port
– Path Calculation
– Output
[Figure: pipeline of Reg and Logic blocks with CD blocks; labels A, B, S, ST, SF]
– Traditional Tools (~45 thousand loop breakers in a 3x3 NoC)
– Asynchronous Synthesis Tools (Balsa, Teak)
» Lack of traditional optimizations (Pin Swapping, Reordering, Retiming, …)
– Liberty File Support (is_async_reg)
– New Set of Constraints (Cycle Time Definition)
– Permit the implementation of fine-grain power control
» MVS
» Power Shut-Off
Characteristics

| NoC | Topology | Routing / Flow Control | Network Interface | Asynchronous Style | Links and encoding | Implementation |
| | 2D Mesh (Irreg/Reg), 8 VCs | Source / wormhole / credit-based with preemption | N.A | 4-phase bundled-data | 10-bit flits | 180 nm, 200 Mflits/s, ASIC |
| RasP | Framework / point-to-point (Irreg/Reg) | Source / bit serial | Ad hoc | QDI | Point-to-point pipelined serial links | 180 nm, 700 Mb/s |
| ASPIN | 2D Mesh (Reg) | Distributed XY / wormhole / EOP | A2S, S2A FIFOs | Bundled-data / QDI | Dual-rail, 4-phase, 34-bit flits | 90 nm, 714 Mflits/s |
| ANoC | 2D Mesh, 2 VCs | Source / Adaptive | | | One of Four | 130 nm, 5 Gb/s (router) |
| Hermes-A | 2D Mesh | Distributed XY / wormhole / BOP-EOP | Dual-Rail SCAFFI, Clock Stretching | QDI | Dual-Rail | 180 nm, 727 Mbits/s (454 Mflits/s, 3.6 Gb/s per router), ASIC |
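The comparison above lists distributed XY routing for Hermes-A. As a rough illustration of that routing scheme in general (not the router's actual circuit), the sketch below decides the output port at each hop from the current and target coordinates only. The names `Port`, `xy_route` and `path`, the `(x, y)` address format, and the convention that north means increasing `y` are all assumptions made for this example.

```python
from enum import Enum


class Port(Enum):
    EAST = "east"
    WEST = "west"
    NORTH = "north"
    SOUTH = "south"
    LOCAL = "local"


# Assumed coordinate convention: east is +x, north is +y.
STEP = {
    Port.EAST: (1, 0),
    Port.WEST: (-1, 0),
    Port.NORTH: (0, 1),
    Port.SOUTH: (0, -1),
}


def xy_route(current, target):
    """Pick the output port at one router: correct X first, then Y."""
    cx, cy = current
    tx, ty = target
    if tx > cx:
        return Port.EAST
    if tx < cx:
        return Port.WEST
    if ty > cy:
        return Port.NORTH
    if ty < cy:
        return Port.SOUTH
    return Port.LOCAL  # already at the target router


def path(source, target):
    """Apply the per-router decision hop by hop until the local port is chosen."""
    hops = []
    position = source
    while True:
        port = xy_route(position, target)
        hops.append(port)
        if port is Port.LOCAL:
            return hops
        dx, dy = STEP[port]
        position = (position[0] + dx, position[1] + dy)
```

Each router needs only its own address and the target address carried by the packet, which is what makes the decision distributed rather than computed at the source.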
S-Control
[Figure: S-Control with an input (Input Ack) and two outputs, Output Port A (Ack A) and Output Port B (Ack B); annotations: LAST FLIT, BOP = EOP = 1]
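The labels above mention BOP, EOP, acknowledges and output ports. A minimal behavioural sketch of wormhole switching with such markers follows, assuming BOP marks the first flit of a packet and EOP the last, and that an output port stays reserved for one input between the two. The `Flit` and `OutputPort` classes and the `offer` method are hypothetical names for this example and do not describe the S-Control circuit itself.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Flit:
    data: int
    bop: bool = False  # assumed: set on the first flit of a packet
    eop: bool = False  # assumed: set on the last flit of a packet


class OutputPort:
    """One output port that is held by a single input for a whole packet."""

    def __init__(self) -> None:
        self.owner: Optional[str] = None
        self.sent: List[int] = []

    def offer(self, source: str, flit: Flit) -> bool:
        """Return True (acknowledge) if the flit is forwarded, False if the source must wait."""
        if self.owner is None:
            if not flit.bop:
                return False  # only a BOP flit may open a connection
            self.owner = source
        elif self.owner != source:
            return False  # port is reserved by another input
        self.sent.append(flit.data)
        if flit.eop:
            self.owner = None  # the last flit releases the port
        return True
```

In this sketch a single-flit packet carries both markers, so it reserves and releases the port in one step.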
[Figure: Async Router with In/Out ports North, South, East, West and Local; Network Interface between the router and the IP, with signals Bop, Eop, Ack and DR Data]
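The interface signals above include DR Data and Ack. Assuming DR stands for dual-rail, the sketch below shows the usual dual-rail convention in general terms: each bit travels on a pair of wires, (1, 0) for logic 1, (0, 1) for logic 0 and (0, 0) as the spacer between data items, so the receiver can detect a complete word from the data wires alone. The function names and the 8-bit width used in the example are illustrative assumptions, not taken from the slides.

```python
SPACER = (0, 0)  # both rails low: no data present


def encode_bit(bit):
    """Map one bit to a (true_rail, false_rail) pair."""
    return (1, 0) if bit else (0, 1)


def encode_word(value, width):
    """Encode an integer as a list of rail pairs, least significant bit first."""
    return [encode_bit((value >> i) & 1) for i in range(width)]


def spacer_word(width):
    """All-spacer word sent between two data words (return-to-zero phase)."""
    return [SPACER] * width


def is_complete(rails):
    """Completion detection: every pair has exactly one rail high."""
    return all(t != f for t, f in rails)


def is_spacer(rails):
    """True when every pair has returned to the spacer."""
    return all(pair == SPACER for pair in rails)


def decode_word(rails):
    """Recover the integer once the word is complete."""
    if not is_complete(rails):
        raise ValueError("word is not complete (spacer or invalid pair present)")
    return sum(t << i for i, (t, _f) in enumerate(rails))
```

For example, `decode_word(encode_word(0xA5, 8))` returns `0xA5`, while a word that still contains a spacer pair is reported as incomplete, which is when a receiver would withhold its acknowledge.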