Automatic Target Recognition using Unmanned Aerial Vehicle Images with Proposed YOLOv8-SR and Enhanced Deep Super-Resolution Network
Modern surveillance necessitates the use of automatic target recognition (ATR) to identify targets or objects quickly and accurately for multiclass classification in unmanned aerial vehicles (UAVs) such as pedestrians, people, bicycles, cars, vans, trucks, tricycles, buses, and motors. The inadequat...
        Saved in:
      
    
          | Published in | Journal of electronics, electromedical engineering, and medical informatics Vol. 7; no. 4; pp. 1240 - 1258 | 
|---|---|
| Main Authors | , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
          
        15.10.2025
     | 
| Online Access | Get full text | 
| ISSN | 2656-8632 2656-8632  | 
| DOI | 10.35882/jeeemi.v7i4.888 | 
Cover
| Abstract | Modern surveillance necessitates the use of automatic target recognition (ATR) to identify targets or objects quickly and accurately for multiclass classification in unmanned aerial vehicles (UAVs) such as pedestrians, people, bicycles, cars, vans, trucks, tricycles, buses, and motors. The inadequate recognition rate in target detection for UAVs could be due to the fundamental issues provided by the poor resolution of photos recorded from the distinct perspective of the UAVs. The VisDrone dataset used for image analysis consists of a total of 10,209 UAV photos. This research work presents a comprehensive framework specifically for multiclass target classification using VisDrone UAV imagery. The YOLOv8-SR, which stands for "You Only Looked Once Version 8 with Super-Resolution," is a developed model that builds on the YOLOv8s model with the Enhanced Deep Super-Resolution Network (EDSR). The YOLOv8-SR uses the EDSR to convert the low-resolution image to a high-resolution image, allowing it to estimate pixel values for better processing better. The high-resolution image was generated by the EDSR model, having a Peak Signal-to-Noise Ratio (PSNR) of 25.32 and a Structural Similarity Index (SSIM) of 0.781. The YOLOv8-SR model's precision is 63.44%, recall is 46.64%, F1-score is 52.69%, mean average precision (mAP@50) is 51.58%, and the mAP@50–95 is 50.67% over the range of confidence thresholds. The investigation fundamentally transforms the precision and effectiveness of ATR, indicating a future in which ingenuity overcomes obstacles that were once considered insurmountable. This development is characterized by the use of an improved deep super-resolution network to produce super-resolution images from low-resolution inputs. The YoLov8-SR model, a sophisticated version of the YoLov8s framework, is key to this breakthrough. By amalgamating the EDSR methodology with the advanced YOLOv8-SR framework, the system generates high-resolution images abundant in detail, markedly exceeding the informational quality of their low-resolution versions. | 
    
|---|---|
| AbstractList | Modern surveillance necessitates the use of automatic target recognition (ATR) to identify targets or objects quickly and accurately for multiclass classification in unmanned aerial vehicles (UAVs) such as pedestrians, people, bicycles, cars, vans, trucks, tricycles, buses, and motors. The inadequate recognition rate in target detection for UAVs could be due to the fundamental issues provided by the poor resolution of photos recorded from the distinct perspective of the UAVs. The VisDrone dataset used for image analysis consists of a total of 10,209 UAV photos. This research work presents a comprehensive framework specifically for multiclass target classification using VisDrone UAV imagery. The YOLOv8-SR, which stands for "You Only Looked Once Version 8 with Super-Resolution," is a developed model that builds on the YOLOv8s model with the Enhanced Deep Super-Resolution Network (EDSR). The YOLOv8-SR uses the EDSR to convert the low-resolution image to a high-resolution image, allowing it to estimate pixel values for better processing better. The high-resolution image was generated by the EDSR model, having a Peak Signal-to-Noise Ratio (PSNR) of 25.32 and a Structural Similarity Index (SSIM) of 0.781. The YOLOv8-SR model's precision is 63.44%, recall is 46.64%, F1-score is 52.69%, mean average precision (mAP@50) is 51.58%, and the mAP@50–95 is 50.67% over the range of confidence thresholds. The investigation fundamentally transforms the precision and effectiveness of ATR, indicating a future in which ingenuity overcomes obstacles that were once considered insurmountable. This development is characterized by the use of an improved deep super-resolution network to produce super-resolution images from low-resolution inputs. The YoLov8-SR model, a sophisticated version of the YoLov8s framework, is key to this breakthrough. By amalgamating the EDSR methodology with the advanced YOLOv8-SR framework, the system generates high-resolution images abundant in detail, markedly exceeding the informational quality of their low-resolution versions. | 
    
| Author | Tanwar, Rohit Mishra, Gangeshwar Gupta, Prinima  | 
    
| Author_xml | – sequence: 1 givenname: Gangeshwar orcidid: 0000-0002-9183-0928 surname: Mishra fullname: Mishra, Gangeshwar – sequence: 2 givenname: Rohit orcidid: 0000-0002-9087-6019 surname: Tanwar fullname: Tanwar, Rohit – sequence: 3 givenname: Prinima orcidid: 0000-0002-8575-6047 surname: Gupta fullname: Gupta, Prinima  | 
    
| BookMark | eNqFkEFPwjAYhhuDiYjcPfYPDLt17eqRICoJEQNo4mnpyrdR3Nql3SDc_eFO8ODN0_vlzfe8h-ca9Yw1gNBtSEaUCRHd7QCg0qN9ouOREOIC9SPOeCA4jXp_7is09H5HCIlEwlhI-uhr3Da2ko1WeC1dAQ1egrKF0Y22BrdemwK_mUoaAxs8Bqdlid9hq1UJeFbJAjw-6GaLX52tre9-PhbzxV4EqyWWZoOnZiuN6uoHgBqv2hpcsARvy_a0_wLNwbrPG3SZy9LD8DcHaP04XU-eg_niaTYZzwMlqAh4kisJgic8iyWJWEJknPFckSwLQ4hBUM4pEE4VsATCOMtzEDGT2T2jLIo5HaDwPNuaWh4PsizT2ulKumMakvQkMj2LTH9Epp3IjiFnRjnrvYP8f-QbKCp8yA | 
    
| ContentType | Journal Article | 
    
| DBID | AAYXX CITATION ADTOC UNPAY  | 
    
| DOI | 10.35882/jeeemi.v7i4.888 | 
    
| DatabaseName | CrossRef Unpaywall for CDI: Periodical Content Unpaywall  | 
    
| DatabaseTitle | CrossRef | 
    
| DatabaseTitleList | CrossRef | 
    
| Database_xml | – sequence: 1 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Engineering | 
    
| EISSN | 2656-8632 | 
    
| EndPage | 1258 | 
    
| ExternalDocumentID | 10.35882/jeeemi.v7i4.888 10_35882_jeeemi_v7i4_888  | 
    
| GroupedDBID | AAYXX ALMA_UNASSIGNED_HOLDINGS CITATION M~E ADTOC UNPAY  | 
    
| ID | FETCH-LOGICAL-c838-67fcae8676b4a02570a4b6fc0bb11e4e83663e063ce57e14bffe845ab95352463 | 
    
| IEDL.DBID | UNPAY | 
    
| ISSN | 2656-8632 | 
    
| IngestDate | Sun Oct 19 05:41:53 EDT 2025 Sat Oct 25 05:10:26 EDT 2025  | 
    
| IsDoiOpenAccess | true | 
    
| IsOpenAccess | true | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 4 | 
    
| Language | English | 
    
| License | https://creativecommons.org/licenses/by-sa/4.0 cc-by-sa  | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c838-67fcae8676b4a02570a4b6fc0bb11e4e83663e063ce57e14bffe845ab95352463 | 
    
| ORCID | 0000-0002-8575-6047 0000-0002-9183-0928 0000-0002-9087-6019  | 
    
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://jeeemi.org/index.php/jeeemi/article/download/888/334 | 
    
| PageCount | 19 | 
    
| ParticipantIDs | unpaywall_primary_10_35882_jeeemi_v7i4_888 crossref_primary_10_35882_jeeemi_v7i4_888  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2025-10-15 | 
    
| PublicationDateYYYYMMDD | 2025-10-15 | 
    
| PublicationDate_xml | – month: 10 year: 2025 text: 2025-10-15 day: 15  | 
    
| PublicationDecade | 2020 | 
    
| PublicationTitle | Journal of electronics, electromedical engineering, and medical informatics | 
    
| PublicationYear | 2025 | 
    
| SSID | ssj0002875510 | 
    
| Score | 2.3087213 | 
    
| Snippet | Modern surveillance necessitates the use of automatic target recognition (ATR) to identify targets or objects quickly and accurately for multiclass... | 
    
| SourceID | unpaywall crossref  | 
    
| SourceType | Open Access Repository Index Database  | 
    
| StartPage | 1240 | 
    
| Title | Automatic Target Recognition using Unmanned Aerial Vehicle Images with Proposed YOLOv8-SR and Enhanced Deep Super-Resolution Network | 
    
| URI | https://jeeemi.org/index.php/jeeemi/article/download/888/334 | 
    
| UnpaywallVersion | publishedVersion | 
    
| Volume | 7 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 2656-8632 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0002875510 issn: 2656-8632 databaseCode: M~E dateStart: 20190101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1NTxsxEB1BOCAOQFsQqQryoRcqLclir9eReokgUUAlIEgQHNDK3p1tSpNNlGRB7YETPxzPfiDKpYLramRZM6Pxm_XzG4CvkeE6jnTouJ6xDUqj3nCMp7TjutJIEXOLYumh8ElXdvri-Mq7WoDv5VuYW0Qc5Vf4mVwgaUQUH2uFL2sRyciPdVSj7o1zsQhL0rNIvAJL_e5Z85rmyVmY4ijJ9_ObSe5ZIFmufef_EjY_1D8n0XKaTPSfez0cvjhe2mtwU24sZ5X83kvnZi_8-0qz8b07X4fVAneyZm7yARYw-QgrL9QIP8FjM52PMwVX1sv44ey8ZBeNE0YE-Z-sn4w0VWbWzDKXXeKA1mNHI1uXZoz-6rIzGrwwszbXpz9O75Rzcc50ErFWMsjYBuwQccIu0glOHbo8yFOfdXM--gb02q3eQccphjQ4obK1UvpxqFFJXxqh6zQSTwt6P1Q3xnVRoOIW0qDFQSF6PrrCxDEq4WnTIF0ZIfkmVJJxglvAiA_g-b4ydR2KiKNSDbuaTaV9JFUwrMJuGa9gkktxBLaFyWIb5I4OKLaBdW4Vvj0H9L_Gn99i_AUq82mK2xaKzM0OLJ48tHaKvHsCBDHoJw | 
    
| linkProvider | Unpaywall | 
    
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LSwMxEA5aD-LBt6io5OBFYW3XZLNZ8FJ8oKJtqa20B1mS3Vnro9uiXUXP_nAz-5DqRfS6DCHMDJNvNl--IWQ71ExFoQos29GmQfEqnqUdqSzbFlrwiBkUiw-FL2vitM3PO05nghwUb2HuAaCfXeGncoGoEZF_LOe-LIcoIz9QYRm7N8b4JJkSjkHiJTLVrjWqXZwnZ2CKJQXbz24mmWOAZLH2i3vHTX7IbyfRdBIP1durenwcO15O5shNsbGMVfKwl4z0XvD-Q7PxvzufJ7M57qTVzGSBTEC8SGbG1AiXyEc1GQ1SBVfaSvnhtFmwiwYxRYL8LW3HfYWVmVbTzKXX0MP16Fnf1KVnin91aQMHLzwbm279ov4irasmVXFIj-NeyjagRwBDepUM4cnCy4Ms9Wkt46Mvk9bJcevw1MqHNFiBNLVSuFGgQApXaK4qOBJPcXw_VNHatoGDZAbSgMFBATgu2FxHEUjuKO2hrgwXbIWU4kEMq4QiH8BxXakrKuAhAyk9s5pJpX1AVTBYIztFvPxhJsXhmxYmja2fOdrH2PrGuWtk9yugvxqv_8V4g5RGTwlsGigy0lt5xn0Cl8Pm9g | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Automatic+Target+Recognition+using+Unmanned+Aerial+Vehicle+Images+with+Proposed+YOLOv8-SR+and+Enhanced+Deep+Super-Resolution+Network&rft.jtitle=Journal+of+electronics%2C+electromedical+engineering%2C+and+medical+informatics&rft.au=Mishra%2C+Gangeshwar&rft.au=Tanwar%2C+Rohit&rft.au=Gupta%2C+Prinima&rft.date=2025-10-15&rft.issn=2656-8632&rft.eissn=2656-8632&rft.volume=7&rft.issue=4&rft.spage=1240&rft.epage=1258&rft_id=info:doi/10.35882%2Fjeeemi.v7i4.888&rft.externalDBID=n%2Fa&rft.externalDocID=10_35882_jeeemi_v7i4_888 | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2656-8632&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2656-8632&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2656-8632&client=summon |