Journal Articles

Inferencing on Edge Devices: A Time- and Space-aware Co-scheduling Approach

Danny Pereira, Indian Institute of Technology Kharagpur
Anirban Ghose, Indian Institute of Technology Kharagpur
Sumana Ghosh, Indian Statistical Institute, Kolkata
Soumyajit Dey, Indian Institute of Technology Kharagpur

Article Type

Research Article

Publication Title

ACM Transactions on Design Automation of Electronic Systems

Abstract

Neural Network (NN)-based real-time inferencing tasks are often co-scheduled on GPGPU-style edge platforms. Existing works advocate using different NN parameters for the same detection task in different environments. However, realizing such approaches remains challenging, given accelerator devices' limited on-chip memory capacity. As a solution, we propose a multi-pass, time- and space-aware scheduling infrastructure for embedded platforms with GPU accelerators. The framework manages the residency of NN parameters in the limited on-chip memory while simultaneously dispatching relevant compute operations. The mapping decisions for memory operations and compute operations to the underlying resources of the platform are first determined in an offline manner. For this, we proposed a constraint solver-assisted scheduler that optimizes for schedule makespan. This is followed by memory optimization passes, which take the memory budget into account and accordingly adjust the start times of memory and compute operations. Our approach reports a 74%-90% savings in peak memory utilization with 0%-33% deadline misses for schedules that suffer miss percentage in ranges of 25%-100% when run using existing methods.

DOI

10.1145/3576197

Publication Date

7-10-2022

Recommended Citation

Pereira, Danny; Ghose, Anirban; Ghosh, Sumana; and Dey, Soumyajit, "Inferencing on Edge Devices: A Time- and Space-aware Co-scheduling Approach" (2022). Journal Articles. 3040.
https://digitalcommons.isical.ac.in/journal-articles/3040

This document is currently not available here.

COinS

Journal Articles

Inferencing on Edge Devices: A Time- and Space-aware Co-scheduling Approach

Article Type

Publication Title

Abstract

DOI

Publication Date

Recommended Citation

Browse

Search

Author Corner

Links

Journal Articles

Inferencing on Edge Devices: A Time- and Space-aware Co-scheduling Approach

Authors

Article Type

Publication Title

Abstract

DOI

Publication Date

Recommended Citation

Share

Browse

Search

Author Corner

Links