2 code implementations • 13 Feb 2020 • Bharadwaj Pudipeddi, Maral Mesmakhosroshahi, Jinwen Xi, Sujeeth Bharadwaj
By running the optimizer in the host EPS, we show a new form of mixed precision for faster throughput and convergence.
Distributed Optimization Neural Architecture Search