Automatic detection of urban infrastructure elements from terrestrial images using deep learning

Urban infrastructure element detection is important for the domain of public management in large urban centres. The diversity of objects in the urban environment makes object detection and classification a challenging task, requiring fast and accurate methods. Advances in deep learning methods have...

Full description

Saved in:
Bibliographic Details
Published inBoletim de Ciências Geodésicas Vol. 30; pp. 1 - 18
Main Authors Macuácua, Jaime Carlos, Centeno, Jorge António Silva, Firmino, Fernando Alves Barros, Do Crato, Jorgiana Kamila Teixeira, Vestena, Kauê de Moraes, Amisse, Caisse
Format Journal Article
LanguageEnglish
Published Curitiba Universidade Federal do Paraná, Centro Politécnico 01.01.2024
Subjects
Online AccessGet full text
ISSN1413-4853
1982-2170
1982-2170
DOI10.1590/s1982-21702024000100011

Cover

More Information
Summary:Urban infrastructure element detection is important for the domain of public management in large urban centres. The diversity of objects in the urban environment makes object detection and classification a challenging task, requiring fast and accurate methods. Advances in deep learning methods have driven improvement in detection techniques (processing, speed, accuracy) that do not rely on manually crafted models, but, instead, use learning approaches with corresponding large training sets to detect and classify objects in images. We applied an object detection model to identify and classify four urban infrastructure elements in the Mappilary dataset. We use YOLOv5, one of the top-performing object detection models, a recent release of the YOLO family, pre-trained on the COCO dataset but fine-tuned on Mappilary dataset. Experimental results from the dataset show that YOLOv5 can make qualitative predictions, for example, the power grid pole class presented the mean Average Precision (mAP) of 78% and the crosswalk class showed mAP around 79%. A lower degree of certainty was verified in the detection of public lighting (mAP=64%) and accessibility (mAP=61%) classes due to the low resolution of certain objects. However, the proposed method showed the capability of automatically detection and location of urban infrastructure elements in real-time, which could contribute to improve decision-making.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1413-4853
1982-2170
1982-2170
DOI:10.1590/s1982-21702024000100011