Skip to main navigation Skip to search Skip to main content

VVA: Video Values Analysis

  • People’s Daily Online
  • Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

User-generated content videos have attracted increasingly attention due to its dominant role in social platforms. It is crucial to analyze values in videos because the extensive range of video content results in significant variations in the subjective quality of videos. However, the research literature on Video Values Analysis (VVA) is very scarce, which aims to evaluate the compatibility between video content and the social mainstream values. Meanwhile, existing video content analysis methods are mainly based on classification techniques, which can not adequate VVA due to their coarse-grained manners. To tackle this challenge, we propose a framework to generate more fine-grained scores for diverse videos, termed as Video Values Analysis Model (VVAM), which consists of a feature extractor based on R3D, a feature aggregation module based on Transformer and a regression head based on MLP. In addition, considered texts in videos can be key clues to improve VVA, we design a new pipeline, termed as Text-Guided Video Values Analysis Model (TG-VVAM), in which texts in videos are spotted by OCR tools and a cross-modal fusion module is used to combine the vision and text features. To further facilitate the VVA, we construct a large-scale dataset, termed as Video Values Analysis Dataset (VVAD), which contains 53,705 short videos of various types from main social platforms. Experiments demonstrate that our proposed VVAM and TG-VVAM achieves promising results in the VVAD.

Original languageEnglish
Title of host publicationPattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Proceedings
EditorsQingshan Liu, Hanzi Wang, Rongrong Ji, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages346-358
Number of pages13
ISBN (Print)9789819985395
DOIs
StatePublished - 2024
Event6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023 - Xiamen, China
Duration: 13 Oct 202315 Oct 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14431 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023
Country/TerritoryChina
CityXiamen
Period13/10/2315/10/23

Keywords

  • Text-guided video values analysis model
  • Video values analysis
  • Video values analysis dataset
  • Video values analysis model

Fingerprint

Dive into the research topics of 'VVA: Video Values Analysis'. Together they form a unique fingerprint.

Cite this