Anomaly Detection and Localization in Crowded Scenes by Motion-field Shape Description and Similarity-based Statistical Learning

In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely difficult. We associate the optical flows of multiple frames to capture short-term trajectories and introduce the histogram-based shape descr...

Full description

Saved in:

Bibliographic Details
Main Authors	Zhang, Xinfeng, Yang, Su, Zhang, Xinjian, Zhang, Weishan, Zhang, Jiulong
Format	Journal Article
Language	English
Published	27.05.2018
Subjects	Computer Science - Computer Vision and Pattern Recognition
Online Access	Get full text
DOI	10.48550/arxiv.1805.10620

Cover

Abstract	In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely difficult. We associate the optical flows of multiple frames to capture short-term trajectories and introduce the histogram-based shape descriptor referred to as shape contexts to describe such short-term trajectories. Furthermore, we propose a K-NN similarity-based statistical model to detect anomalies over time and space, which is an unsupervised one-class learning algorithm requiring no clustering nor any prior assumption. Firstly, we retrieve the K-NN samples from the training set in regard to the testing sample, and then use the similarities between every pair of the K-NN samples to construct a Gaussian model. Finally, the probabilities of the similarities from the testing sample to the K-NN samples under the Gaussian model are calculated in the form of a joint probability. Abnormal events can be detected by judging whether the joint probability is below predefined thresholds in terms of time and space, separately. Such a scheme can adapt to the whole scene, since the probability computed as such is not affected by motion distortions arising from perspective distortion. We conduct experiments on real-world surveillance videos, and the results demonstrate that the proposed method can reliably detect and locate the abnormal events in the video sequences, outperforming the state-of-the-art approaches.
AbstractList	In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely difficult. We associate the optical flows of multiple frames to capture short-term trajectories and introduce the histogram-based shape descriptor referred to as shape contexts to describe such short-term trajectories. Furthermore, we propose a K-NN similarity-based statistical model to detect anomalies over time and space, which is an unsupervised one-class learning algorithm requiring no clustering nor any prior assumption. Firstly, we retrieve the K-NN samples from the training set in regard to the testing sample, and then use the similarities between every pair of the K-NN samples to construct a Gaussian model. Finally, the probabilities of the similarities from the testing sample to the K-NN samples under the Gaussian model are calculated in the form of a joint probability. Abnormal events can be detected by judging whether the joint probability is below predefined thresholds in terms of time and space, separately. Such a scheme can adapt to the whole scene, since the probability computed as such is not affected by motion distortions arising from perspective distortion. We conduct experiments on real-world surveillance videos, and the results demonstrate that the proposed method can reliably detect and locate the abnormal events in the video sequences, outperforming the state-of-the-art approaches.
Author	Zhang, Xinjian Zhang, Jiulong Zhang, Xinfeng Yang, Su Zhang, Weishan
Author_xml	– sequence: 1 givenname: Xinfeng surname: Zhang fullname: Zhang, Xinfeng – sequence: 2 givenname: Su surname: Yang fullname: Yang, Su – sequence: 3 givenname: Xinjian surname: Zhang fullname: Zhang, Xinjian – sequence: 4 givenname: Weishan surname: Zhang fullname: Zhang, Weishan – sequence: 5 givenname: Jiulong surname: Zhang fullname: Zhang, Jiulong
BackLink	https://doi.org/10.48550/arXiv.1805.10620$$DView paper in arXiv
BookMark	eNqFjjsOwjAQRF1Awe8AVPgCCQ4QRIv4iAIq6KMlWWAlZx3ZFhAqjk4SIVqqkWZ2n15XtNgwCjGMVDhbxLEag33SPYwWKg4jNZ-ojngv2eSgS7lGj6knwxI4k3uTgqYXNAWxXFnzyDCTxxQZnTyX8mDqLbgQ6qq-QYEVwqWWih_kSDlpsOTL4Ayu_vYV0Hmq2HKPYJn42hftC2iHg2_2xGi7Oa12QeOaFJZysGVSOyeN8_T_xQcGZlA1
ContentType	Journal Article
Copyright	http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml	– notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID	AKY GOX
DOI	10.48550/arxiv.1805.10620
DatabaseName	arXiv Computer Science arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	1805_10620
GroupedDBID	AKY GOX
ID	FETCH-arxiv_primary_1805_106203
IEDL.DBID	GOX
IngestDate	Tue Jul 22 23:39:43 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-arxiv_primary_1805_106203
OpenAccessLink	https://arxiv.org/abs/1805.10620
ParticipantIDs	arxiv_primary_1805_10620
PublicationCentury	2000
PublicationDate	2018-05-27
PublicationDateYYYYMMDD	2018-05-27
PublicationDate_xml	– month: 05 year: 2018 text: 2018-05-27 day: 27
PublicationDecade	2010
PublicationYear	2018
Score	3.3172977
SecondaryResourceType	preprint
Snippet	In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Computer Science - Computer Vision and Pattern Recognition
Title	Anomaly Detection and Localization in Crowded Scenes by Motion-field Shape Description and Similarity-based Statistical Learning
URI	https://arxiv.org/abs/1805.10620
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07T8MwED6VTiyIClB5FG5gtSCN8xpRoaoQhSEgZYuc-gKRqKnagOjGT-fshMfS1bFOsS3n-77cC-Dc01FQ-jMSyk9ISJ14QkkdCS9kOJe2p7Vvs5Gn9-HkSd5mQdYB_MmFUcvP6qOpD1ysLrz4MmB9GQ5ZlG8xUbDJvA9Z45x0pbja-X_zmGO6oX8gMd6FnZbd4VVzHD3okNmDL5bYc_W6xmuqXeCTQZbveGdRpM2CxMrgiAWxJo3pzH5-sFjj1HXYES7IDNMXtSA28XvNnZG0mlesTZlKC4tHPGBd6yv3gxrb2qnP-3A2vnkcTYR753zRFJjI7XJytxz_ALrmzVAfkCjxS6lkom3bMK3jgveTykBpRmwV60Pob7JytPnRMWwzBYitP3wYnUC3Xr7TgGG2Lk7dXn8DLGqEGg
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Anomaly+Detection+and+Localization+in+Crowded+Scenes+by+Motion-field+Shape+Description+and+Similarity-based+Statistical+Learning&rft.au=Zhang%2C+Xinfeng&rft.au=Yang%2C+Su&rft.au=Zhang%2C+Xinjian&rft.au=Zhang%2C+Weishan&rft.date=2018-05-27&rft_id=info:doi/10.48550%2Farxiv.1805.10620&rft.externalDocID=1805_10620