BIAS: A Biologically Inspired Algorithm for Video Saliency Detection

2026-04-10Computer Vision and Pattern Recognition

Computer Vision and Pattern Recognition
AI summary

The authors developed BIAS, a quick and brain-inspired method to find important parts of videos by looking at movement and still images together. Their model uses a special way to spot where to pay attention by fitting shapes to highlight key spots. BIAS works very fast and does better than some other methods, especially in videos where sudden changes grab attention. They tested it on traffic videos and showed it can predict accidents slightly before people can mark them. Overall, the authors combined ideas from biology and computer science for efficient and understandable video analysis.

visual saliencyItti-Koch modelmotion detectionsaliency mapwinner-take-allDHF1K datasetmulti-Gaussian peak fittingbottom-up attentiontraffic accident analysisdynamic video stream
Authors
Zhao-ji Zhang, Ya-tang Li
Abstract
We present BIAS, a fast, biologically inspired model for dynamic visual saliency detection in continuous video streams. Building on the Itti--Koch framework, BIAS incorporates a retina-inspired motion detector to extract temporal features, enabling the generation of saliency maps that integrate both static and motion information. Foci of attention (FOAs) are identified using a greedy multi-Gaussian peak-fitting algorithm that balances winner-take-all competition with information maximization. BIAS detects salient regions with millisecond-scale latency and outperforms heuristic-based approaches and several deep-learning models on the DHF1K dataset, particularly in videos dominated by bottom-up attention. Applied to traffic accident analysis, BIAS demonstrates strong real-world utility, achieving state-of-the-art performance in cause-effect recognition and anticipating accidents up to 0.72 seconds before manual annotation with reliable accuracy. Overall, BIAS bridges biological plausibility and computational efficiency to achieve interpretable, high-speed dynamic saliency detection.