Skip to main navigation Skip to search Skip to main content

Three-Skips CNN for road scene semantic segmentation

  • Jing Tang
  • , Xin Wang*
  • *Corresponding author for this work
  • Harbin Institute of Technology Shenzhen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper we propose a deep learning architecture to make the best use of global and local information for pixel-wise semantic segmentation. The architecture of three-skips CNN is built with convolutional layers in VGG16 network and its mirrored convolutional layers. Our architecture aims to road scene understanding. In order to save memory and computational time, we use unpooling layers to map low resolution feature maps to the input resolution. We introduce three skip architectures which combine local information and global information to produce accurate and detailed segmentations. Besides, we present the median balance method to deal with class unbalance problem in road scene datasets. Thorough evaluations on CamVid dataset demonstrate our approach has state-of-the-art performance and less computational time.

Original languageEnglish
Title of host publication2017 IEEE International Conference on Information and Automation, ICIA 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages858-863
Number of pages6
ISBN (Electronic)9781538631546
DOIs
StatePublished - 20 Oct 2017
Externally publishedYes
Event2017 IEEE International Conference on Information and Automation, ICIA 2017 - Macau, China
Duration: 18 Jul 201720 Jul 2017

Publication series

Name2017 IEEE International Conference on Information and Automation, ICIA 2017

Conference

Conference2017 IEEE International Conference on Information and Automation, ICIA 2017
Country/TerritoryChina
CityMacau
Period18/07/1720/07/17

Keywords

  • Convolutional neural network
  • Pixel-wise Semantic segmentation
  • Road scene

Fingerprint

Dive into the research topics of 'Three-Skips CNN for road scene semantic segmentation'. Together they form a unique fingerprint.

Cite this