Unimore logo AImageLab

Semantic video transcoding using classes of relevance

Abstract: In this work we present a framework for on-the-fly video transcoding that exploits computer vision-based techniques to adapt the Web access to the user requirements. Theproposed transcoding approach aims at coping with both user bandwidth and resources capabilities, and with user interests in the video's content. We propose an object-basedsemantic transcoding that, according to the user-dened classes of relevance, applies different transcoding techniques to the objects segmented in a scene. Object extraction is provided by on-the-fly video processing, without manual annotation. Multiple transcoding policies are reviewed and a performance evaluation metric based on the Weighted Mean Square Error (and corresponding PSNR), that takes into account the perceptual user requirements by means of classes of relevance, is dened. Results are analyzed by varying transcoding techniques, bandwidth requirements and video types (with indoor and outdoor scenes), showing that the use of semantics can dramatically improve the bandwidth to distortion ratio.


Citation:

Cucchiara, Rita; Grana, Costantino; Prati, Andrea "Semantic video transcoding using classes of relevance" INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, vol. 3, pp. 145 -169 , 2003 DOI: 10.1142/S0219467803000956

 not available

Paper download: