Scene text localization using keypoints

Bibliographic Details
Published in Signal Processing and Communications Applications Conference, pp. 1917-1920
Main Authors Erdogmus, Nesli; Ozuysal, Mustafa
Format Conference Proceeding
Language English, Turkish
Published IEEE, 01.05.2015
ISSN 2165-0608
DOI 10.1109/SIU.2015.7130235

Abstract Scene text localization and recognition (also known as text localization and recognition in real-world images, natural scene OCR, or the text-in-the-wild problem) is an open problem that is attracting increasing interest from researchers. This paper addresses localization only; recognition is outside its scope. For scene text localization, Scale-Invariant Feature Transform (SIFT) keypoints are extracted from the images and classified as text or non-text. The text keypoints are then used to compute bounding boxes around text regions. The proposed technique is tested on the database of the ICDAR 2013 Robust Reading Competition, Challenge 2, and the experimental results are reported in detail. Although the idea is still in its infancy, it is observed to achieve remarkable results, and because there is considerable room for improvement, it appears promising.
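
The abstract does not say how the bounding boxes are derived from the text keypoints. As a rough illustration only, the sketch below assumes one plausible approach: keypoints labelled as text are linked when they lie within a pixel distance of each other, and each linked group is enclosed in an axis-aligned box. The function name `text_bounding_boxes`, the `max_gap` parameter, and the grouping rule are all hypothetical and are not taken from the paper; the keypoint locations and text/non-text labels are assumed to come from a SIFT detector and a classifier that are not shown here.

```python
from collections import deque
from math import hypot


def text_bounding_boxes(keypoints, is_text, max_gap=20.0):
    """Group text-labelled keypoints by proximity and box each group.

    keypoints: list of (x, y) locations, e.g. SIFT keypoint centres.
    is_text:   list of booleans produced by some text/non-text classifier.
    max_gap:   hypothetical linking distance in pixels (an assumption,
               not a value from the paper).
    Returns a sorted list of (x_min, y_min, x_max, y_max) tuples.
    """
    # Keep only the keypoints that the classifier labelled as text.
    pts = [p for p, t in zip(keypoints, is_text) if t]
    unvisited = set(range(len(pts)))
    boxes = []
    while unvisited:
        # Start a new group and grow it breadth-first.
        seed = unvisited.pop()
        queue = deque([seed])
        group = [seed]
        while queue:
            i = queue.popleft()
            near = [
                j for j in unvisited
                if hypot(pts[i][0] - pts[j][0], pts[i][1] - pts[j][1]) <= max_gap
            ]
            for j in near:
                unvisited.remove(j)
                queue.append(j)
                group.append(j)
        # Enclose the finished group in an axis-aligned bounding box.
        xs = [pts[i][0] for i in group]
        ys = [pts[i][1] for i in group]
        boxes.append((min(xs), min(ys), max(xs), max(ys)))
    return sorted(boxes)


# Made-up keypoints: two clusters of text points and one non-text point.
example_points = [(10, 10), (20, 12), (30, 11), (200, 200), (210, 205), (100, 100)]
example_labels = [True, True, True, True, True, False]
example_boxes = text_bounding_boxes(example_points, example_labels)
```

With these made-up inputs the two clusters of text keypoints yield two boxes and the non-text keypoint is ignored. A real system would likely need a scale-aware linking distance and some filtering of isolated keypoints, neither of which is attempted here.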
Author Erdogmus, Nesli (neslierdogmus@iyte.edu.tr), Department of Computer Engineering, Izmir Institute of Technology (Izmir Yuksek Teknoloji Enstitusu), Izmir, Turkey
Author Ozuysal, Mustafa (mustafaozuysal@iyte.edu.tr), Department of Computer Engineering, Izmir Institute of Technology (Izmir Yuksek Teknoloji Enstitusu), Izmir, Turkey
EISBN 1467373869, 9781467373869
Subjects Computer vision; Conferences; Feature extraction; keypoint; Optical character recognition software; scene text localization; SIFT; Text analysis; Text recognition
URI https://ieeexplore.ieee.org/document/7130235