Anomaly Detection and Localization in Crowded Scenes by Motion-field Shape Description and Similarity-based Statistical Learning

In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely difficult. We associate the optical flows of multiple frames to capture short-term trajectories and introduce the histogram-based shape descr...

Full description

Saved in:
Bibliographic Details
Main Authors Zhang, Xinfeng, Yang, Su, Zhang, Xinjian, Zhang, Weishan, Zhang, Jiulong
Format Journal Article
LanguageEnglish
Published 27.05.2018
Subjects
Online AccessGet full text
DOI10.48550/arxiv.1805.10620

Cover

Abstract In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely difficult. We associate the optical flows of multiple frames to capture short-term trajectories and introduce the histogram-based shape descriptor referred to as shape contexts to describe such short-term trajectories. Furthermore, we propose a K-NN similarity-based statistical model to detect anomalies over time and space, which is an unsupervised one-class learning algorithm requiring no clustering nor any prior assumption. Firstly, we retrieve the K-NN samples from the training set in regard to the testing sample, and then use the similarities between every pair of the K-NN samples to construct a Gaussian model. Finally, the probabilities of the similarities from the testing sample to the K-NN samples under the Gaussian model are calculated in the form of a joint probability. Abnormal events can be detected by judging whether the joint probability is below predefined thresholds in terms of time and space, separately. Such a scheme can adapt to the whole scene, since the probability computed as such is not affected by motion distortions arising from perspective distortion. We conduct experiments on real-world surveillance videos, and the results demonstrate that the proposed method can reliably detect and locate the abnormal events in the video sequences, outperforming the state-of-the-art approaches.
AbstractList In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely difficult. We associate the optical flows of multiple frames to capture short-term trajectories and introduce the histogram-based shape descriptor referred to as shape contexts to describe such short-term trajectories. Furthermore, we propose a K-NN similarity-based statistical model to detect anomalies over time and space, which is an unsupervised one-class learning algorithm requiring no clustering nor any prior assumption. Firstly, we retrieve the K-NN samples from the training set in regard to the testing sample, and then use the similarities between every pair of the K-NN samples to construct a Gaussian model. Finally, the probabilities of the similarities from the testing sample to the K-NN samples under the Gaussian model are calculated in the form of a joint probability. Abnormal events can be detected by judging whether the joint probability is below predefined thresholds in terms of time and space, separately. Such a scheme can adapt to the whole scene, since the probability computed as such is not affected by motion distortions arising from perspective distortion. We conduct experiments on real-world surveillance videos, and the results demonstrate that the proposed method can reliably detect and locate the abnormal events in the video sequences, outperforming the state-of-the-art approaches.
Author Zhang, Xinjian
Zhang, Jiulong
Zhang, Xinfeng
Yang, Su
Zhang, Weishan
Author_xml – sequence: 1
  givenname: Xinfeng
  surname: Zhang
  fullname: Zhang, Xinfeng
– sequence: 2
  givenname: Su
  surname: Yang
  fullname: Yang, Su
– sequence: 3
  givenname: Xinjian
  surname: Zhang
  fullname: Zhang, Xinjian
– sequence: 4
  givenname: Weishan
  surname: Zhang
  fullname: Zhang, Weishan
– sequence: 5
  givenname: Jiulong
  surname: Zhang
  fullname: Zhang, Jiulong
BackLink https://doi.org/10.48550/arXiv.1805.10620$$DView paper in arXiv
BookMark eNqFjjsOwjAQRF1Awe8AVPgCCQ4QRIv4iAIq6KMlWWAlZx3ZFhAqjk4SIVqqkWZ2n15XtNgwCjGMVDhbxLEag33SPYwWKg4jNZ-ojngv2eSgS7lGj6knwxI4k3uTgqYXNAWxXFnzyDCTxxQZnTyX8mDqLbgQ6qq-QYEVwqWWih_kSDlpsOTL4Ayu_vYV0Hmq2HKPYJn42hftC2iHg2_2xGi7Oa12QeOaFJZysGVSOyeN8_T_xQcGZlA1
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.1805.10620
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 1805_10620
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_1805_106203
IEDL.DBID GOX
IngestDate Tue Jul 22 23:39:43 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_1805_106203
OpenAccessLink https://arxiv.org/abs/1805.10620
ParticipantIDs arxiv_primary_1805_10620
PublicationCentury 2000
PublicationDate 2018-05-27
PublicationDateYYYYMMDD 2018-05-27
PublicationDate_xml – month: 05
  year: 2018
  text: 2018-05-27
  day: 27
PublicationDecade 2010
PublicationYear 2018
Score 3.3172977
SecondaryResourceType preprint
Snippet In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computer Vision and Pattern Recognition
Title Anomaly Detection and Localization in Crowded Scenes by Motion-field Shape Description and Similarity-based Statistical Learning
URI https://arxiv.org/abs/1805.10620
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07T8MwED6VTiyIClB5FG5gtSCN8xpRoaoQhSEgZYuc-gKRqKnagOjGT-fshMfS1bFOsS3n-77cC-Dc01FQ-jMSyk9ISJ14QkkdCS9kOJe2p7Vvs5Gn9-HkSd5mQdYB_MmFUcvP6qOpD1ysLrz4MmB9GQ5ZlG8xUbDJvA9Z45x0pbja-X_zmGO6oX8gMd6FnZbd4VVzHD3okNmDL5bYc_W6xmuqXeCTQZbveGdRpM2CxMrgiAWxJo3pzH5-sFjj1HXYES7IDNMXtSA28XvNnZG0mlesTZlKC4tHPGBd6yv3gxrb2qnP-3A2vnkcTYR753zRFJjI7XJytxz_ALrmzVAfkCjxS6lkom3bMK3jgveTykBpRmwV60Pob7JytPnRMWwzBYitP3wYnUC3Xr7TgGG2Lk7dXn8DLGqEGg
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Anomaly+Detection+and+Localization+in+Crowded+Scenes+by+Motion-field+Shape+Description+and+Similarity-based+Statistical+Learning&rft.au=Zhang%2C+Xinfeng&rft.au=Yang%2C+Su&rft.au=Zhang%2C+Xinjian&rft.au=Zhang%2C+Weishan&rft.date=2018-05-27&rft_id=info:doi/10.48550%2Farxiv.1805.10620&rft.externalDocID=1805_10620