Video primal sketch: a unified middle-level representation for video
Publication:890113
DOI: 10.1007/S10851-015-0563-2, zbMATH Open: 1343.94010, arXiv: 1502.02965, OpenAlex: W2053380516, MaRDI QID: Q890113, FDO: Q890113
Song-Chun Zhu, Zhi Han, Zongben Xu
Publication date: 9 November 2015
Published in: Journal of Mathematical Imaging and Vision
Abstract: This paper presents a middle-level video representation named Video Primal Sketch (VPS), which integrates two regimes of models: i) a sparse coding model using static or moving primitives to explicitly represent moving corners, lines, feature points, etc.; ii) a FRAME/MRF model reproducing feature statistics extracted from the input video to implicitly represent textured motion, such as water and fire. The feature statistics include histograms of spatio-temporal filters and velocity distributions. This paper makes three contributions to the literature: i) learning a dictionary of video primitives using parametric generative models; ii) proposing the Spatio-Temporal FRAME (ST-FRAME) and Motion-Appearance FRAME (MA-FRAME) models for modeling and synthesizing textured motion; and iii) developing a parsimonious hybrid model for generic video representation. Given an input video, VPS selects the proper models automatically for different motion patterns and is compatible with high-level action representations. In the experiments, we synthesize a number of textured motions; reconstruct real videos using the VPS; report a series of human perception experiments to verify the quality of the reconstructed videos; demonstrate how the VPS changes over the scale transition in videos; and present the close connection between VPS and high-level action models.
Full work available at URL: https://arxiv.org/abs/1502.02965
Image processing (compression, reconstruction, etc.) in information and communication theory (94A08)
Cites Work
- Title not available
- Matching pursuits with time-frequency dictionaries
- A parametric texture model based on joint statistics of complex wavelet coefficients
- Probabilistic detection and tracking of motion boundaries
- A probabilistic exclusion principle for tracking multiple objects
- Dynamic textures
- Intrackability: characterizing video statistics and pursuing video representations
- Mixed-state auto-models and motion texture modeling
- Equivalence of Julesz ensembles and FRAME models
- Video primal sketch: a unified middle-level representation for video
Cited In (2)