Skip to main navigation Skip to search Skip to main content

Deep learning for multimedia: Science or technology?

  • Jitao Sang
  • , Rainer Lienhart
  • , Jun Yu
  • , Peng Cui
  • , Ramesh Jain
  • , Jiashi Feng
  • Beijing Jiaotong University
  • Augsburg University
  • Hangzhou Dianzi University
  • Tsinghua University
  • University of California at Irvine
  • National University of Singapore

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Deep learning has been successfully explored in addressing different multimedia topics recent years, ranging from object detection, semantic classification, entity annotation, to multimedia captioning, multimedia question answering and storytelling. Open source libraries and platforms such as Tensorflow, Caffe, MXnet significantly help promote the wide deployment of deep learning in solving real-world applications. On one hand, deep learning practitioners, while not necessary to understand the involved math behind, are able to set up and make use of a complex deep network. One recent deep learning tool based on Keras even provides the graphical interface to enable straightforward 'drag and drop' operation for deep learning programming. On the other hand, however, some general theoretical problems of learning such as the interpretation and generalization, have only achieved limited progress. Most deep learning papers published these days follow the pipeline of designing/modifying network structures - tuning parameters - reporting performance improvement in specific applications. We have even seen many deep learning application papers without one single equation. Theoretical interpretation and the science behind the study are largely ignored. While excited about the successful application of deep learning in classical and novel problems, we multimedia researchers are responsible to think and solve the fundamental topics in deep learning science. Prof. Guanrong Chen recently wrote an editorial note titled 'Science and Technology, not SciTech' [1]. This panel falls into similar discussion and aims to invite prestigious multimedia researchers and active deep learning practitioners to discuss the positioning of deep learning research now and in the future.

Original languageEnglish
Title of host publicationMM 2018 - Proceedings of the 2018 ACM Multimedia Conference
PublisherAssociation for Computing Machinery
Pages1354-1355
Number of pages2
ISBN (Electronic)9781450356657
DOIs
StatePublished - 15 Oct 2018
Externally publishedYes
Event26th ACM Multimedia conference, MM 2018 - Seoul, Korea, Republic of
Duration: 22 Oct 201826 Oct 2018

Publication series

NameMM 2018 - Proceedings of the 2018 ACM Multimedia Conference

Conference

Conference26th ACM Multimedia conference, MM 2018
Country/TerritoryKorea, Republic of
CitySeoul
Period22/10/1826/10/18

Keywords

  • Application
  • Deep learning
  • Theory

Fingerprint

Dive into the research topics of 'Deep learning for multimedia: Science or technology?'. Together they form a unique fingerprint.

Cite this