Anomaly Detection and Localization in Crowded Scenes by Motion-field Shape Description and Similarity-based Statistical Learning
In crowded scenes, detection and localization of abnormal behaviors is challenging in that high-density people make object segmentation and tracking extremely difficult. We associate the optical flows of multiple frames to capture short-term trajectories and introduce the histogram-based shape descr...
Saved in:
| Main Authors | , , , , |
|---|---|
| Format | Journal Article |
| Language | English |
| Published |
27.05.2018
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.48550/arxiv.1805.10620 |
Cover
| Abstract | In crowded scenes, detection and localization of abnormal behaviors is
challenging in that high-density people make object segmentation and tracking
extremely difficult. We associate the optical flows of multiple frames to
capture short-term trajectories and introduce the histogram-based shape
descriptor referred to as shape contexts to describe such short-term
trajectories. Furthermore, we propose a K-NN similarity-based statistical model
to detect anomalies over time and space, which is an unsupervised one-class
learning algorithm requiring no clustering nor any prior assumption. Firstly,
we retrieve the K-NN samples from the training set in regard to the testing
sample, and then use the similarities between every pair of the K-NN samples to
construct a Gaussian model. Finally, the probabilities of the similarities from
the testing sample to the K-NN samples under the Gaussian model are calculated
in the form of a joint probability. Abnormal events can be detected by judging
whether the joint probability is below predefined thresholds in terms of time
and space, separately. Such a scheme can adapt to the whole scene, since the
probability computed as such is not affected by motion distortions arising from
perspective distortion. We conduct experiments on real-world surveillance
videos, and the results demonstrate that the proposed method can reliably
detect and locate the abnormal events in the video sequences, outperforming the
state-of-the-art approaches. |
|---|---|
| AbstractList | In crowded scenes, detection and localization of abnormal behaviors is
challenging in that high-density people make object segmentation and tracking
extremely difficult. We associate the optical flows of multiple frames to
capture short-term trajectories and introduce the histogram-based shape
descriptor referred to as shape contexts to describe such short-term
trajectories. Furthermore, we propose a K-NN similarity-based statistical model
to detect anomalies over time and space, which is an unsupervised one-class
learning algorithm requiring no clustering nor any prior assumption. Firstly,
we retrieve the K-NN samples from the training set in regard to the testing
sample, and then use the similarities between every pair of the K-NN samples to
construct a Gaussian model. Finally, the probabilities of the similarities from
the testing sample to the K-NN samples under the Gaussian model are calculated
in the form of a joint probability. Abnormal events can be detected by judging
whether the joint probability is below predefined thresholds in terms of time
and space, separately. Such a scheme can adapt to the whole scene, since the
probability computed as such is not affected by motion distortions arising from
perspective distortion. We conduct experiments on real-world surveillance
videos, and the results demonstrate that the proposed method can reliably
detect and locate the abnormal events in the video sequences, outperforming the
state-of-the-art approaches. |
| Author | Zhang, Xinjian Zhang, Jiulong Zhang, Xinfeng Yang, Su Zhang, Weishan |
| Author_xml | – sequence: 1 givenname: Xinfeng surname: Zhang fullname: Zhang, Xinfeng – sequence: 2 givenname: Su surname: Yang fullname: Yang, Su – sequence: 3 givenname: Xinjian surname: Zhang fullname: Zhang, Xinjian – sequence: 4 givenname: Weishan surname: Zhang fullname: Zhang, Weishan – sequence: 5 givenname: Jiulong surname: Zhang fullname: Zhang, Jiulong |
| BackLink | https://doi.org/10.48550/arXiv.1805.10620$$DView paper in arXiv |
| BookMark | eNqFjjsOwjAQRF1Awe8AVPgCCQ4QRIv4iAIq6KMlWWAlZx3ZFhAqjk4SIVqqkWZ2n15XtNgwCjGMVDhbxLEag33SPYwWKg4jNZ-ojngv2eSgS7lGj6knwxI4k3uTgqYXNAWxXFnzyDCTxxQZnTyX8mDqLbgQ6qq-QYEVwqWWih_kSDlpsOTL4Ayu_vYV0Hmq2HKPYJn42hftC2iHg2_2xGi7Oa12QeOaFJZysGVSOyeN8_T_xQcGZlA1 |
| ContentType | Journal Article |
| Copyright | http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| Copyright_xml | – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| DBID | AKY GOX |
| DOI | 10.48550/arxiv.1805.10620 |
| DatabaseName | arXiv Computer Science arXiv.org |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| ExternalDocumentID | 1805_10620 |
| GroupedDBID | AKY GOX |
| ID | FETCH-arxiv_primary_1805_106203 |
| IEDL.DBID | GOX |
| IngestDate | Tue Jul 22 23:39:43 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-arxiv_primary_1805_106203 |
| OpenAccessLink | https://arxiv.org/abs/1805.10620 |
| ParticipantIDs | arxiv_primary_1805_10620 |
| PublicationCentury | 2000 |
| PublicationDate | 2018-05-27 |
| PublicationDateYYYYMMDD | 2018-05-27 |
| PublicationDate_xml | – month: 05 year: 2018 text: 2018-05-27 day: 27 |
| PublicationDecade | 2010 |
| PublicationYear | 2018 |
| Score | 3.3172977 |
| SecondaryResourceType | preprint |
| Snippet | In crowded scenes, detection and localization of abnormal behaviors is
challenging in that high-density people make object segmentation and tracking
extremely... |
| SourceID | arxiv |
| SourceType | Open Access Repository |
| SubjectTerms | Computer Science - Computer Vision and Pattern Recognition |
| Title | Anomaly Detection and Localization in Crowded Scenes by Motion-field Shape Description and Similarity-based Statistical Learning |
| URI | https://arxiv.org/abs/1805.10620 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07T8MwED6VTiyIClB5FG5gtSCN8xpRoaoQhSEgZYuc-gKRqKnagOjGT-fshMfS1bFOsS3n-77cC-Dc01FQ-jMSyk9ISJ14QkkdCS9kOJe2p7Vvs5Gn9-HkSd5mQdYB_MmFUcvP6qOpD1ysLrz4MmB9GQ5ZlG8xUbDJvA9Z45x0pbja-X_zmGO6oX8gMd6FnZbd4VVzHD3okNmDL5bYc_W6xmuqXeCTQZbveGdRpM2CxMrgiAWxJo3pzH5-sFjj1HXYES7IDNMXtSA28XvNnZG0mlesTZlKC4tHPGBd6yv3gxrb2qnP-3A2vnkcTYR753zRFJjI7XJytxz_ALrmzVAfkCjxS6lkom3bMK3jgveTykBpRmwV60Pob7JytPnRMWwzBYitP3wYnUC3Xr7TgGG2Lk7dXn8DLGqEGg |
| linkProvider | Cornell University |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Anomaly+Detection+and+Localization+in+Crowded+Scenes+by+Motion-field+Shape+Description+and+Similarity-based+Statistical+Learning&rft.au=Zhang%2C+Xinfeng&rft.au=Yang%2C+Su&rft.au=Zhang%2C+Xinjian&rft.au=Zhang%2C+Weishan&rft.date=2018-05-27&rft_id=info:doi/10.48550%2Farxiv.1805.10620&rft.externalDocID=1805_10620 |