Erik Wijmans, 10/29/2020
Neural Architecture Search
CS 4803 / 7643 Deep Learning
Neural Architecture Search CS 4803 / 7643 Deep Learning Erik - - PowerPoint PPT Presentation
Neural Architecture Search CS 4803 / 7643 Deep Learning Erik Wijmans, 10/29/2020 Background 2 Background <latexit
Erik Wijmans, 10/29/2020
CS 4803 / 7643 Deep Learning
2
3
<latexit sha1_base64="CaktXdkYARXyqYFqNAIrfVL3XGI=">ACWXicbVFda9swFJXdJc2yj2btY1/EwiCBEeyx0sFeStdCH/bQwdIWYmNk5ToRlWQjXY8G4z/Zh8HYX+lDldiMLd0BcQ/nfkj3KC2ksBgEvzx/51mnu9t73n/x8tXrvcGb/Subl4bDlOcyNzcpsyCFhikKlHBTGAqlXCd3n5Z569/gLEi19xVUCs2EKLTHCGTkoGRaSETqoIl4CspFiuEzT6rxOqtHde7oa08gK1eicyeqsriMJGc7+KF8bYZQ14e5zMysyYrHEsRvRsibEyWAYTIN6FMStmRIWlwmg/tonvNSgUYumbWzMCgwrphBwSXU/ai0UDB+yxYwc1QzBTauNs7U9J1T5jTLjTsa6Ub9u6NiytqVSl3leh+7nVuL/8vNSsw+xZXQRYmgeXNRVkqKOV3bTOfCAEe5coRxI9xbKV8ywzi6z+g7E8LtlZ+Sqw+T8GgSfPs4PDlt7eiRQ/KWjEhIjskJuSCXZEo4+UkevI7X9X7nt/z+02p7U9B+Qf+AePvBy1iA=</latexit>minθ
4
<latexit sha1_base64="CaktXdkYARXyqYFqNAIrfVL3XGI=">ACWXicbVFda9swFJXdJc2yj2btY1/EwiCBEeyx0sFeStdCH/bQwdIWYmNk5ToRlWQjXY8G4z/Zh8HYX+lDldiMLd0BcQ/nfkj3KC2ksBgEvzx/51mnu9t73n/x8tXrvcGb/Subl4bDlOcyNzcpsyCFhikKlHBTGAqlXCd3n5Z569/gLEi19xVUCs2EKLTHCGTkoGRaSETqoIl4CspFiuEzT6rxOqtHde7oa08gK1eicyeqsriMJGc7+KF8bYZQ14e5zMysyYrHEsRvRsibEyWAYTIN6FMStmRIWlwmg/tonvNSgUYumbWzMCgwrphBwSXU/ai0UDB+yxYwc1QzBTauNs7U9J1T5jTLjTsa6Ub9u6NiytqVSl3leh+7nVuL/8vNSsw+xZXQRYmgeXNRVkqKOV3bTOfCAEe5coRxI9xbKV8ywzi6z+g7E8LtlZ+Sqw+T8GgSfPs4PDlt7eiRQ/KWjEhIjskJuSCXZEo4+UkevI7X9X7nt/z+02p7U9B+Qf+AePvBy1iA=</latexit>minθ
5
6
<latexit sha1_base64="kwMgK7bRn8ZwDzGpM64k+2Cfa3M=">ACcnicbVFdaxQxFM2MH61btavFwUbXYQtlGWmKAp9Ka2KDz5UcNvCZhgy2Tu7oUlmSO5Il2F+gH/PN3+FL/0Bze4MfrReCPdwTu69uSdZqaTDKPoZhLdu37m7tn6vt3H/wcPN/qPHJ6orICxKFRhzLuQEkDY5So4Ky0wHWm4DQ7P1rqp9/AOlmYr7goIdF8ZmQuBUdPpf3vTEuT1jl0lCmOc4FV/XHpqGtwHAOyJtWyrL6Q5PWw4tdutihzEn9p+R90zAFOU5+M59bYpi36WK/7cWsnM1x7foUJuStD+IRtEq6E0Qd2BAujhO+z/YtBCVBoNCcecmcVRiUnOLUihoeqxyUHJxzmcw8dBwDS6pV5Y19JVnpjQvrD8G6Yr9u6Lm2rmFzvzN5T7urYk/6dNKszfJbU0ZYVgRDsorxTFgi79p1NpQaBaeMCFlf6tVMy5QL9L/W8CfH1lW+Ck71R/GYUfXk9ODjs7Fgnz8hLMiQxeUsOyCdyTMZEkF/Bk+B5sB1chk/DF2HnXRh0NVvknwh3rwAOiL7n</latexit>minf∈F min θ
7
<latexit sha1_base64="kwMgK7bRn8ZwDzGpM64k+2Cfa3M=">ACcnicbVFdaxQxFM2MH61btavFwUbXYQtlGWmKAp9Ka2KDz5UcNvCZhgy2Tu7oUlmSO5Il2F+gH/PN3+FL/0Bze4MfrReCPdwTu69uSdZqaTDKPoZhLdu37m7tn6vt3H/wcPN/qPHJ6orICxKFRhzLuQEkDY5So4Ky0wHWm4DQ7P1rqp9/AOlmYr7goIdF8ZmQuBUdPpf3vTEuT1jl0lCmOc4FV/XHpqGtwHAOyJtWyrL6Q5PWw4tdutihzEn9p+R90zAFOU5+M59bYpi36WK/7cWsnM1x7foUJuStD+IRtEq6E0Qd2BAujhO+z/YtBCVBoNCcecmcVRiUnOLUihoeqxyUHJxzmcw8dBwDS6pV5Y19JVnpjQvrD8G6Yr9u6Lm2rmFzvzN5T7urYk/6dNKszfJbU0ZYVgRDsorxTFgi79p1NpQaBaeMCFlf6tVMy5QL9L/W8CfH1lW+Ck71R/GYUfXk9ODjs7Fgnz8hLMiQxeUsOyCdyTMZEkF/Bk+B5sB1chk/DF2HnXRh0NVvknwh3rwAOiL7n</latexit>minf∈F min θ
Set of networks
8
High Level Overview
9
High Level Overview
Search Space
10
High Level Overview
Search Space
11
<latexit sha1_base64="kwMgK7bRn8ZwDzGpM64k+2Cfa3M=">ACcnicbVFdaxQxFM2MH61btavFwUbXYQtlGWmKAp9Ka2KDz5UcNvCZhgy2Tu7oUlmSO5Il2F+gH/PN3+FL/0Bze4MfrReCPdwTu69uSdZqaTDKPoZhLdu37m7tn6vt3H/wcPN/qPHJ6orICxKFRhzLuQEkDY5So4Ky0wHWm4DQ7P1rqp9/AOlmYr7goIdF8ZmQuBUdPpf3vTEuT1jl0lCmOc4FV/XHpqGtwHAOyJtWyrL6Q5PWw4tdutihzEn9p+R90zAFOU5+M59bYpi36WK/7cWsnM1x7foUJuStD+IRtEq6E0Qd2BAujhO+z/YtBCVBoNCcecmcVRiUnOLUihoeqxyUHJxzmcw8dBwDS6pV5Y19JVnpjQvrD8G6Yr9u6Lm2rmFzvzN5T7urYk/6dNKszfJbU0ZYVgRDsorxTFgi79p1NpQaBaeMCFlf6tVMy5QL9L/W8CfH1lW+Ck71R/GYUfXk9ODjs7Fgnz8hLMiQxeUsOyCdyTMZEkF/Bk+B5sB1chk/DF2HnXRh0NVvknwh3rwAOiL7n</latexit>minf∈F min θ
Set of networks
High Level Overview
Search Space Search Method
12
High Level Overview
Search Space Search Method Evaluation Method
13
Proposed Architecture
High Level Overview
Search Space Search Method Evaluation Method
14
Proposed Architecture
High Level Overview
Search Space Search Method Evaluation Method
Best Model
15
Proposed Architecture
High Level Overview
Search Space Search Method Evaluation Method
Best Model
16
Proposed Architecture
Evaluation Method
17
Evaluation Method
18
Evaluation Method
performance on held-out data.
19
High Level Overview
20
Search Space Search Method Evaluation Method
Proposed Architecture
High Level Overview
21
Search Space Search Method Evaluation Method
Proposed Architecture
22
NAS-RL
23
string of variable length (i.e. Breadth-first traversal of their DAG)
NAS-RL
24
string of variable length (i.e. Breadth-first traversal of their DAG)
NAS-RL
25
Input Op 1 Op 2 Op N Softmax
NAS-RL
26
Input Op 1 Op 2 Op N Softmax
NAS-RL
27
NAS-RL
28
NAS-RL
29
NAS-RL
30
NAS-RL
31
NAS-RL
32
time) and trained >12,000 candidate architectures
NAS-RL
33
NASNet
34
NASNet
35
NASNet
36
NASNet
37
NASNet
38
NASNet
39
Normal Cell
hi hi-1
...
hi+1
concat sep! 3x3 avg! 3x3 avg! 3x3 sep! 5x5 sep! 3x3 iden! tity iden! tity sep! 3x3 sep! 5x5 avg! 3x3 add add add add add
NASNet
40
NASNet
Reduction Cell
hi hi-1
...
hi+1
concat avg! 3x3 sep! 5x5 sep! 7x7 sep! 5x5 max! 3x3 sep! 7x7 add add add add add sep! 3x3 iden! tity avg! 3x3 max! 3x3
41
NASNet
42
but with less parameters/compute
Efficient Neural Networks (MnasNet)
43
performance need not be the only metric
Efficient Neural Networks (MnasNet)
44
performance need not be the only metric
Efficient Neural Networks (MnasNet)
45
performance need not be the only metric
Efficient Neural Networks (MnasNet)
46
performance need not be the only metric
Efficient Neural Networks (MnasNet)
47
Differentiable Architecture Search (DARTS)
48
Differentiable Architecture Search (DARTS)
49
Differentiable Architecture Search (DARTS)
50
Differentiable Architecture Search (DARTS)
51
Differentiable Architecture Search (DARTS)
52
Differentiable Architecture Search (DARTS)
53
<latexit sha1_base64="M9eU2h7V0GhTc1H2BdmRbasI0=">AChXicbVFNT9tAEF27tND0Ky3HXlZERWnVWjaiHxdU1F564ABSA0jZ1BpvJmTFeu3ujlEjy/+kv4ob/6abxCAIHWmlp/fe6M3OZKVWjuL4KgfrD18tL7xuPk6bPnL7ovXx27orISB7LQhT3NwKFWBgekSONpaRHyTONJdv59rp9coHWqMD9pVuIohzOjJkoCeSrt/t0WuTKpAF1OgYscaCpB1wdNWl+AbvqCpkjw61/6Xj7nreAC9HZFoR/qHYRQ0XvysY81U/3+N8kVAvlWu+uZtFpS5TrvJSLu9OIoXxe+DpAU91tZh2r0U40JWORqSGpwbJnFJoxosKamx6YjKYQnyHM5w6KGBHN2oXmyx4W8M+aTwvpniC/Y2x015M7N8sw756O7VW1O/k8bVjT5MqVKStCI5dBk0pzKvj8JHysLErSMw9AWuVn5XIKFiT5w3X8EpLVL98HxztR8jGKj3Z7+9/adWyw12yL9VnCPrN9oMdsgGTQRj0gyTYCdfD+Fu+GlpDYO2Z5PdqfDrP/Wzwcg=</latexit>minDifferentiable Architecture Search (DARTS)
54
<latexit sha1_base64="M9eU2h7V0GhTc1H2BdmRbasI0=">AChXicbVFNT9tAEF27tND0Ky3HXlZERWnVWjaiHxdU1F564ABSA0jZ1BpvJmTFeu3ujlEjy/+kv4ob/6abxCAIHWmlp/fe6M3OZKVWjuL4KgfrD18tL7xuPk6bPnL7ovXx27orISB7LQhT3NwKFWBgekSONpaRHyTONJdv59rp9coHWqMD9pVuIohzOjJkoCeSrt/t0WuTKpAF1OgYscaCpB1wdNWl+AbvqCpkjw61/6Xj7nreAC9HZFoR/qHYRQ0XvysY81U/3+N8kVAvlWu+uZtFpS5TrvJSLu9OIoXxe+DpAU91tZh2r0U40JWORqSGpwbJnFJoxosKamx6YjKYQnyHM5w6KGBHN2oXmyx4W8M+aTwvpniC/Y2x015M7N8sw756O7VW1O/k8bVjT5MqVKStCI5dBk0pzKvj8JHysLErSMw9AWuVn5XIKFiT5w3X8EpLVL98HxztR8jGKj3Z7+9/adWyw12yL9VnCPrN9oMdsgGTQRj0gyTYCdfD+Fu+GlpDYO2Z5PdqfDrP/Wzwcg=</latexit>minDifferentiable Architecture Search (DARTS)
55
<latexit sha1_base64="M9eU2h7V0GhTc1H2BdmRbasI0=">AChXicbVFNT9tAEF27tND0Ky3HXlZERWnVWjaiHxdU1F564ABSA0jZ1BpvJmTFeu3ujlEjy/+kv4ob/6abxCAIHWmlp/fe6M3OZKVWjuL4KgfrD18tL7xuPk6bPnL7ovXx27orISB7LQhT3NwKFWBgekSONpaRHyTONJdv59rp9coHWqMD9pVuIohzOjJkoCeSrt/t0WuTKpAF1OgYscaCpB1wdNWl+AbvqCpkjw61/6Xj7nreAC9HZFoR/qHYRQ0XvysY81U/3+N8kVAvlWu+uZtFpS5TrvJSLu9OIoXxe+DpAU91tZh2r0U40JWORqSGpwbJnFJoxosKamx6YjKYQnyHM5w6KGBHN2oXmyx4W8M+aTwvpniC/Y2x015M7N8sw756O7VW1O/k8bVjT5MqVKStCI5dBk0pzKvj8JHysLErSMw9AWuVn5XIKFiT5w3X8EpLVL98HxztR8jGKj3Z7+9/adWyw12yL9VnCPrN9oMdsgGTQRj0gyTYCdfD+Fu+GlpDYO2Z5PdqfDrP/Wzwcg=</latexit>minDifferentiable Architecture Search (DARTS)
56
<latexit sha1_base64="liC1gU+cPOANjsyPoQj9p2yLT98=">ACInicbVDLSgMxFM34tr6qLt0Ei1BFyowo6k5048JFBWsLnTrcSdM2mMkMyR2hDP0WN/6KGxeKuhL8GN2Fr4OBA7nMvNPWEihUHX/XAmJqemZ2bn5gsLi0vLK8XVtWsTp5rxGotlrBshGC6F4jUKHkj0RyiUPJ6eHs29Ot3XBsRqyvsJ7wVQVeJjmCAVgqKx76CUELg0x6QP0IsMdAZheDILsDOSj72OMINzvlcWJ7l+YkKJbcijsC/Uu8nJRIjmpQfPbMUsjrpBJMKbpuQm2MtAomOSDgp8angC7hS5vWqog4qaVjU4c0C2rtGkn1vYpCP1+0QGkTH9KLTJ4QnmtzcU/OaKXaOWplQSYpcsfGiTiopxnTYF20LzRnKviXAtLB/pawHGhjaVgu2BO/3yX/J9V7FO6i4l/ulk9O8jmyQTZJmXjkJyQc1IlNcLIPXkz+TFeXCenFfnfRydcPKZdfIDzucX/OGj5A=</latexit>rαLval(θ∗(α), α)Differentiable Architecture Search (DARTS)
57
<latexit sha1_base64="3AeO1Cc/5mWn53lmDIZFEJL7lJU=">ACVHicbVHLSgMxFM2Mr1pfVZdugkVQ0DIji5FNy5cVLAqdMpwJ01taCYTkjvFMvQjdSH4JW5cmD4UXxcCJ+ecm+SeJFoKi0Hw6vkzs3PzC6XF8tLyupaZX3j1ma5YbzBMpmZ+wQsl0LxBgqU/F4bDmki+V3Suxjpd31urMjUDQ40b6XwoERHMEBHxZVeBFqb7JFGChIJcQRSd4FGKWCXgSyuhnHRBzncjbDLEejBl3Gy/2FEA0J9Wvfp5Ky9LxBXqkEtGBf9C8IpqJp1ePKc9TOWJ5yhUyCtc0w0NgqwKBgkg/LUW65BtaDB950UEHKbasYhzKkO45p05m3FJIx+z3jgJSawdp4pyjGexvbUT+pzVz7Jy2CqF0jlyxyUWdXFLM6Ch2haGM5QDB4AZ4d5KWRcMHT/UHYhL9H/gtuD2vhcS24PqenU/jKJEtsk12SUhOyBm5JHXSIw8kTePeJ734r37M/7cxOp705N8qP81Q+JKbPQ</latexit>⇡ rαLval(θ rθLtrain(θ, α), α) <latexit sha1_base64="liC1gU+cPOANjsyPoQj9p2yLT98=">ACInicbVDLSgMxFM34tr6qLt0Ei1BFyowo6k5048JFBWsLnTrcSdM2mMkMyR2hDP0WN/6KGxeKuhL8GN2Fr4OBA7nMvNPWEihUHX/XAmJqemZ2bn5gsLi0vLK8XVtWsTp5rxGotlrBshGC6F4jUKHkj0RyiUPJ6eHs29Ot3XBsRqyvsJ7wVQVeJjmCAVgqKx76CUELg0x6QP0IsMdAZheDILsDOSj72OMINzvlcWJ7l+YkKJbcijsC/Uu8nJRIjmpQfPbMUsjrpBJMKbpuQm2MtAomOSDgp8angC7hS5vWqog4qaVjU4c0C2rtGkn1vYpCP1+0QGkTH9KLTJ4QnmtzcU/OaKXaOWplQSYpcsfGiTiopxnTYF20LzRnKviXAtLB/pawHGhjaVgu2BO/3yX/J9V7FO6i4l/ulk9O8jmyQTZJmXjkJyQc1IlNcLIPXkz+TFeXCenFfnfRydcPKZdfIDzucX/OGj5A=</latexit>rαLval(θ∗(α), α)Differentiable Architecture Search (DARTS)
58
<latexit sha1_base64="1WZXfNZSeRmUCjHqYxmaX0vV32I=">ACIHicbVDLSgMxFM34tr6qLt0Ei6AgZUaUuiy6ceFCwWqhU4Y7aWqDmUxI7hTL0E9x46+4caGI7vRrTB8LrR4IHM45l9x7Yi2FRd/9KamZ2bn5hcWC0vLK6trxfWNa5tmhvEaS2Vq6jFYLoXiNRQoeV0bDks+U18dzrwb7rcWJGqK+xp3kzgVom2YIBOioqVELQ26T0NFcQSohCk7gANE8AOA5mf96O8C7K/G2KHI+zTUWAvKpb8sj8E/UuCMSmRMS6i4kfYSlmWcIVMgrWNwNfYzMGgYJL3C2FmuQZ2B7e84aiChNtmPjywT3ec0qLt1LinkA7VnxM5JNb2ktglB4vbSW8g/uc1MmwfN3OhdIZcsdFH7UxSTOmgLdoShjOUPUeAGeF2pawDBhi6TguhGDy5L/k+qAcHJX9y8NS9WRcxwLZItklwSkQqrkjFyQGmHkgTyRF/LqPXrP3pv3PopOeOZTfIL3tc3Vjyjqw=</latexit>⇡ rαLval(θ, α) <latexit sha1_base64="3AeO1Cc/5mWn53lmDIZFEJL7lJU=">ACVHicbVHLSgMxFM2Mr1pfVZdugkVQ0DIji5FNy5cVLAqdMpwJ01taCYTkjvFMvQjdSH4JW5cmD4UXxcCJ+ecm+SeJFoKi0Hw6vkzs3PzC6XF8tLyupaZX3j1ma5YbzBMpmZ+wQsl0LxBgqU/F4bDmki+V3Suxjpd31urMjUDQ40b6XwoERHMEBHxZVeBFqb7JFGChIJcQRSd4FGKWCXgSyuhnHRBzncjbDLEejBl3Gy/2FEA0J9Wvfp5Ky9LxBXqkEtGBf9C8IpqJp1ePKc9TOWJ5yhUyCtc0w0NgqwKBgkg/LUW65BtaDB950UEHKbasYhzKkO45p05m3FJIx+z3jgJSawdp4pyjGexvbUT+pzVz7Jy2CqF0jlyxyUWdXFLM6Ch2haGM5QDB4AZ4d5KWRcMHT/UHYhL9H/gtuD2vhcS24PqenU/jKJEtsk12SUhOyBm5JHXSIw8kTePeJ734r37M/7cxOp705N8qP81Q+JKbPQ</latexit>⇡ rαLval(θ rθLtrain(θ, α), α) <latexit sha1_base64="liC1gU+cPOANjsyPoQj9p2yLT98=">ACInicbVDLSgMxFM34tr6qLt0Ei1BFyowo6k5048JFBWsLnTrcSdM2mMkMyR2hDP0WN/6KGxeKuhL8GN2Fr4OBA7nMvNPWEihUHX/XAmJqemZ2bn5gsLi0vLK8XVtWsTp5rxGotlrBshGC6F4jUKHkj0RyiUPJ6eHs29Ot3XBsRqyvsJ7wVQVeJjmCAVgqKx76CUELg0x6QP0IsMdAZheDILsDOSj72OMINzvlcWJ7l+YkKJbcijsC/Uu8nJRIjmpQfPbMUsjrpBJMKbpuQm2MtAomOSDgp8angC7hS5vWqog4qaVjU4c0C2rtGkn1vYpCP1+0QGkTH9KLTJ4QnmtzcU/OaKXaOWplQSYpcsfGiTiopxnTYF20LzRnKviXAtLB/pawHGhjaVgu2BO/3yX/J9V7FO6i4l/ulk9O8jmyQTZJmXjkJyQc1IlNcLIPXkz+TFeXCenFfnfRydcPKZdfIDzucX/OGj5A=</latexit>rαLval(θ∗(α), α)Differentiable Architecture Search (DARTS)
59
better or on-par with existing NAS methods
60
Neural Architecture Search without Training
61
Neural Architecture Search without Training
62
how “flexible” the network is.
Neural Architecture Search without Training
63
how “flexible” the network is. This “flexibility” can be determined without training.
Neural Architecture Search without Training
64
how “flexible” the network is. This “flexibility” can be determined without training.
65
performant network architectures
66
performant network architectures
67
performant network architectures
68
performant network architectures