SAVAM — Semiautomatic Visual-Attention Modeling

MSU Graphics & Media Lab (Video Group)

Projects, ideas: Dr. Dmitriy Vatolin, Prof. Galina Rozhkova
Implementation: Mikhail Erofeev, Yury Gitman, Andrey Bolshakov, Alexey Fedorov
In cooperation with IITP RAS

The database

Introduction

The maps of attention can be applied in many fields: user interface design, computer graphics, video processing, etc. Many technologies, algorithms and filters can be improved using information about the saliency distribution. During our work we have created the database of human eye-movements captured while viewing various videos (static and dynamic scenes, shots from cinema-like films and scientific databases)

Features/Benefits

High quality

Includes only FullHD and 4K UHDTV video sequences
Includes only stereoscopic video sequences
Eye-movements were captured with high quality eye-tracking device: SMI iViewXTM Hi-Speed 1250, with a 500 Hz frequency (20 fixations per frame)
Additional post-processing was applied to improve records' accuracy

Diversity

43 fragments of motion video from various feature movies, commercial clips and stereo video databases
About 13 minutes of video (19760 frames)
50 observers of different ages (mostly between 18–27 years old)

Please note: while the database contains S3D videos actually, only the left view was demonstrated to observers.

Data post-processing

To improve data's accuracy several levels of verification and correction were applied.

The test sequence was divided into three five-minute parts. Before each part, we carried out the calibration procedure. The observer followed a target that was placed successively at 13 locations across the screen. Next, we validated the calibration by measuring the error of the gaze position at four points. If the estimated error was greater than 0.3 angular degrees, we restarted the calibration.

To reduce inter-video influence we inserted cross-fade by adding a black frame between adjacent scenes. Additionally, to measure observer's fatigue we placed a special pattern after each three-scene part. We asked observers to track a stimulus, enabling us to measure the squared tracking error, which we defined as the fatigue value. On the next step, we improve the accuracy of determining the position of gaze using transformation, which is obtained by averaging of eye tracking data on calibrate pattern.

To understand the influence of an observer's fatigue on fixations at the end of a sequence, we asked eight observers to view the whole sequence a second time with the scenes appearing in reverse order.

Downloads

ICCP Paper (2017)

Accepted version of the paper: Download

Supplementary materials: final compression examples pdf zip

ICIP Paper (2014)

Accepted version of the paper: Download

Published version of the paper: IEEE link

Saliency-aware video encoder

A fork of x264 video encoder supporting custom saliency maps as an additional input to improve quality of salient objects.

View on GitHub

Robust Saliency Map Comparison

Saliency maps comparison method invariant to most common transforms:

Version 2. View on GitHub
Version 1. View on GitHub

The Base of Gaze Map

To download the database, please fill-in the request form.
You will get the download link for all data via e-mail.

Reference

Citation

Y. Gitman, M. Erofeev, D. Vatolin, A. Bolshakov, A. Fedorov. "Semiautomatic Visual-Attention Modeling and Its Application to Video Compression". 2014 IEEE International Conference on Image Processing (ICIP). Paris, France, pp. 1105-1109.

Bibtex

  @INPROCEEDINGS {
    Gitm1410:Semiautomatic,
    AUTHOR    = "Yury Gitman and Mikhail Erofeev and Dmitriy Vatolin
                 and Andrey Bolshakov and Alexey Fedorov",
    TITLE     = "Semiautomatic {Visual-Attention} Modeling and Its 
                 Application to Video Compression",
    BOOKTITLE = "2014 IEEE International Conference on Image Processing
                 (ICIP) (ICIP 2014)",
    ADDRESS   = "Paris, France",
    PAGES     = "1105-1109",
    DAYS      =  27,
    MONTH     =  oct,
    YEAR      =  2014,
    KEYWORDS  = "Saliency;Visual attention;Eye-tracking;Saliencyaware 
                 compression;H.264",
  }

Application to video compression

x264, 1920x1080, 1500 kbps

Acknowledgments

This work was supported by the Intel/Cisco Video Aware Wireless Networking (VAWN) Program. We acknowledge Institute of Information Transmission Problems for help with eye tracking.

Video resources:

3D and stereo video
Projects on 3D and stereo video processing and analysis

MSU Video Quality Measurement tools
Programs with different objective and subjective video quality metrics implementation

Codecs comparisons
Objective and subjective quality evaluation
tests for video and image codecs

MSU Video Codecs Comparison 2025
MSU Video Codecs Comparison 2023-2024
MSU Video Codecs Comparison 2022
MSU Video Codecs Comparison 2021
MSU Video Codecs Comparison 2020
MSU Cloud Benchmark 2020
Cloud Encoding Services Comparison 2019
HEVC/AV1 Codec Comparison 2019
HEVC/AV1 Codec Comparison 2018
HEVC/AV1 Codec Comparison 2017
HEVC Codec Comparison 2016
HEVC Codec Comparison 2015
9-th MPEG4-AVC/H.264 Comparison
8-th MPEG4-AVC/H.264 Comparison
7-th MPEG4-AVC/H.264 Comparison

6-th MPEG4-AVC/H.264 Comparison
5-th MPEG4-AVC/H.264 Comparison
Codec Analysis for Companies:
- Private Video Codec Comparison
- MSU Video Quality Measurement Tool
Options Analysis of Codec x264
4-th MPEG4-AVC/H.264 Comparison
Lossless Video Codecs Comparison 2007
3-rd MPEG4-AVC/H.264 Comparison
HD Photo and JPEG 2000 Comparison
MPEG-2 Decoders Crash Test
Subjective Compar. of 4 Modern Codecs
2-nd MPEG-4 AVC/H.264 Comparison
- FAQ of 2-nd H.264 Comparison
1-st MPEG-4 AVC/H.264 Codecs Comparison
- FAQ of 1-st H.264 Comparison
JPEG 2000 Image Codecs Comparison
Lossless Video Codecs Comparison 2004
MPEG-4 SP/ASP Codecs Comparison
Video codecs comparison (old)

Ext. link: x264 parameters efficiency comparison

Public MSU video filters
Here are available VirtualDub and AviSynth filters. For a given type of digital video filtration we typically develop a family of different algorithms and implementations. Generally there are also versions optimized for PC and hardware implementations (ASIC/FPGA/DSP). These optimized versions can be licensed to companies. Please contact us for details via video(at)graphics.cs.msu_ru.

Filters for companies
We are working with Intel, Samsung, RealNetworks and other companies on adapting our filters other video processing algorithms for specific video streams, applications and hardware like TV-sets, graphics cards, etc. Some of such projects are non-exclusive. Also we have internal researches. Please let us know via video(at)graphics.cs.msu_ru if you are interested in acquiring a license for such filters or making a custom R&D project on video processing, compression, computer vision.

Video codecs projects
Different research and development
projects on video codecs

MSU Lossless Video Codec (Top!)
MSU Screen Capture Lossless Codec (Top!)
MSU MPEG-2 Video Codec
x264 Codec Improvement

Other
Other information

Crazy gallery (filters screams :)
License for commercial usage of MSU VideoGroup Public Software (please be careful: some soft like metrics has another license!)

Last updated: 28-May-2025

Server size: 8069 files, 1215Mb (Server statistics)

Project updated by
Server Team and MSU Video Group

Project sponsored by YUVsoft Corp.

Project supported by MSU Graphics & Media Lab

Everything about the data compression

Видео Video

Сравнения кодеков MSU Video Codecs Comparisons

Video Quality Measurement Tool (MSU VQMT)

Benchmarks

3D-пїЅпїЅпїЅпїЅпїЅ 3D video

Проекты

Projects

Книга «Методы сжатия данных»

Разделы

About О сервере

News:

News:

SAVAM — Semiautomatic Visual-Attention Modeling

MSU Graphics & Media Lab (Video Group)

Take a look at this article on the new site! Follow the link
https://videoprocessing.ai/saliency/savam.html

The database

Introduction

Features/Benefits

High quality

Diversity

Data post-processing

Downloads

ICCP Paper (2017)

ICIP Paper (2014)

Saliency-aware video encoder

Robust Saliency Map Comparison

The Base of Gaze Map

Reference

Citation

Bibtex

Application to video compression

Acknowledgments

Проекты

Projects

Разделы

News:

News:

SAVAM — Semiautomatic Visual-Attention Modeling

Take a look at this article on the new site! Follow the link https://videoprocessing.ai/saliency/savam.html

The database

Introduction

Features/Benefits

High quality

Diversity

Data post-processing

Downloads

ICCP Paper (2017)

ICIP Paper (2014)

Saliency-aware video encoder

Robust Saliency Map Comparison

The Base of Gaze Map

Reference

Citation

Bibtex

Application to video compression

Acknowledgments

Take a look at this article on the new site! Follow the link
https://videoprocessing.ai/saliency/savam.html