Insights into aerial intelligence: assessing CNN-based algorithms for human action recognition and object detection in diverse environments
Today’s era follows a data-driven decision process for large-scale environment analysis. Aerial view-based decision process plays a key role in various domains including surveillance, disaster responses, city planning, and military operations. One of the important and challenging data-driven process...
Saved in:
| Published in | Multimedia tools and applications Vol. 84; no. 16; pp. 16481 - 16523 |
|---|---|
| Main Authors | , , , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
Springer US
01.05.2025
Springer Nature B.V |
| Subjects | |
| Online Access | Get full text |
| ISSN | 1573-7721 1380-7501 1573-7721 |
| DOI | 10.1007/s11042-024-19611-z |
Cover
| Summary: | Today’s era follows a data-driven decision process for large-scale environment analysis. Aerial view-based decision process plays a key role in various domains including surveillance, disaster responses, city planning, and military operations. One of the important and challenging data-driven processes is human action recognition and object detection from the aerial view. The major challenge in aerial view is its notable variance in the scale and perspective of humans and objects. The size of an individual depends on the distance from the camera making object location and human action recognition difficult. In addition, low resolution or poor visibility, occlusion clutter background, and motion blur limit the accurate object detection and localization. CNN’s ability to learn hierarchical representation, take advantage of spatial relationships, manage translation invariance, and leverage transfer learning make it a powerful tool among all for HAR and object detection from an aerial view. We studied applications and progress of aerial view-based human activity recognition (HAR) and object detection (OD) techniques. Various challenges in different environments have been addressed for both HAR and OD examining recent datasets and algorithms spanning seven years. Initially, a categorization of various CNN-based algorithms and their strength and weaknesses are presented for both HAR and OD respectively. Later, a comparative analysis using evaluation matrices and research challenges are discussed in the paper. The comparative study reveals that the self-attention mechanism in CNN performed better among all models but more extensive research is necessary to raise performance standards. A promising future trajectory is outlined for both HAR and OD from an aerial view perspective. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 1573-7721 1380-7501 1573-7721 |
| DOI: | 10.1007/s11042-024-19611-z |