Skip to main navigation Skip to search Skip to main content

Poster Abstract: Xpi: Real-Time Progressive Inference Serving with Explainable AI in Edge-Cloud Systems

  • Changyao Lin*
  • , Zhenming Chen
  • , Jie Liu
  • *Corresponding author for this work
  • Harbin Institute of Technology
  • Ltd
  • Harbin Institute of Technology Shenzhen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The constrained computing and memory resources at the edge pose challenges for satisfying different service-level objectives (SLOs) of deep learning inference requests. In this paper, we propose a novel edge-cloud progressive inference framework Xpi, which integrates explainable AI technique to facilitate early-exit, and learning-based online execution control to satisfy different SLOs and optimize edge resource overheads. We implement Xpi on an edge-cloud platform, and conduct partial experiments on two datasets. Xpi outperforms several advanced edge-cloud progressive inference frameworks in terms of accuracy and deadline satisfaction rate.

Original languageEnglish
Title of host publicationProceedings - 23rd ACM/IEEE International Conference on Information Processing in Sensor Networks, IPSN 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages273-274
Number of pages2
ISBN (Electronic)9798350362015
DOIs
StatePublished - 2024
Externally publishedYes
Event23rd ACM/IEEE International Conference on Information Processing in Sensor Networks, IPSN 2024 - Hong Kong, China
Duration: 13 May 202416 May 2024

Publication series

NameProceedings - 23rd ACM/IEEE International Conference on Information Processing in Sensor Networks, IPSN 2024

Conference

Conference23rd ACM/IEEE International Conference on Information Processing in Sensor Networks, IPSN 2024
Country/TerritoryChina
CityHong Kong
Period13/05/2416/05/24

Keywords

  • edge computing
  • explainable AI
  • progressive inference
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Poster Abstract: Xpi: Real-Time Progressive Inference Serving with Explainable AI in Edge-Cloud Systems'. Together they form a unique fingerprint.

Cite this