NeuroSpector is a scheduling optimization framework that systematically analyzes the dataflow and mapping possibilities of DNN workloads in accelerators and rapidly identifies optimal execution schemes. NeuroSpector finds scheduling solutions for a variety of DNN accelerators and workloads 7,958x faster than previous work with only 1.5% energy and cycle differences on average to the optimal schemes, whereas the prior techniques produce hit-or-miss results with 100.1% greater energy and cycle results than the optimal solutions and as much as 14.9x in the worst case. In addition, NeuroSpector supports many essential features of DNN accelerators and workloads, including group convolutions, multi-chip accelerators, data bypassing in buffers, unified/separate buffer types, static power modeling, and network-wise scheduling optimization, which were overlooked or only partly supported in related work.
Download and Documentation
The latest release of NeuroSpector is v1.5 (as of May. 2024). For detailed instructions regarding the prerequisite, installation, and execution of NeuroSpector, please visit the GitHub repository: https://github.com/yonseicasl/neurospector, and refer to the README file.
To reference NeuroSpector, please use our TPDS paper.
@article{park_tpds2023, author = {C. Park and and B. Kim and S. Ryu and W. Song}, title = {{NeuroSpector: Systematic Optimization of Dataflow Scheduling in DNN Accelerators}}, journal = {IEEE Transactions on Parallel and Distributed Systems}, volume = {34}, number = {8}, month = {Aug.}, year = {2023}, pages = {2279-2294}, }