Auditory augmented reality: Object sonification for the visually impaired

Augmented reality applications have focused on visually integrating virtual objects into real environments. In this paper, we propose an auditory augmented reality, where we integrate acoustic virtual objects into the real world. We sonify objects that do not intrinsically produce sound, with the pu...

Full description

Saved in:
Bibliographic Details
Published in2012 IEEE 14th International Workshop on Multimedia Signal Processing pp. 319 - 324
Main Authors Ribeiro, Flavio, Florencio, Dinei, Chou, Philip A., Zhang, Zhengyou
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.09.2012
Subjects
Online AccessGet full text
ISBN9781467345705
1467345709
DOI10.1109/MMSP.2012.6343462

Cover

Abstract Augmented reality applications have focused on visually integrating virtual objects into real environments. In this paper, we propose an auditory augmented reality, where we integrate acoustic virtual objects into the real world. We sonify objects that do not intrinsically produce sound, with the purpose of revealing additional information about them. Using spatialized (3D) audio synthesis, acoustic virtual objects are placed at specific real-world coordinates, obviating the need to explicitly tell the user where they are. Thus, by leveraging the innate human capacity for 3D sound source localization and source separation, we create an audio natural user interface. In contrast with previous work, we do not create acoustic scenes by transducing low-level (for instance, pixel-based) visual information. Instead, we use computer vision methods to identify high-level features of interest in an RGB-D stream, which are then sonified as virtual objects at their respective real-world coordinates. Since our visual and auditory senses are inherently spatial, this technique naturally maps between these two modalities, creating intuitive representations. We evaluate this concept with a head-mounted device, featuring modes that sonify flat surfaces, navigable paths and human faces.
AbstractList Augmented reality applications have focused on visually integrating virtual objects into real environments. In this paper, we propose an auditory augmented reality, where we integrate acoustic virtual objects into the real world. We sonify objects that do not intrinsically produce sound, with the purpose of revealing additional information about them. Using spatialized (3D) audio synthesis, acoustic virtual objects are placed at specific real-world coordinates, obviating the need to explicitly tell the user where they are. Thus, by leveraging the innate human capacity for 3D sound source localization and source separation, we create an audio natural user interface. In contrast with previous work, we do not create acoustic scenes by transducing low-level (for instance, pixel-based) visual information. Instead, we use computer vision methods to identify high-level features of interest in an RGB-D stream, which are then sonified as virtual objects at their respective real-world coordinates. Since our visual and auditory senses are inherently spatial, this technique naturally maps between these two modalities, creating intuitive representations. We evaluate this concept with a head-mounted device, featuring modes that sonify flat surfaces, navigable paths and human faces.
Author Chou, Philip A.
Ribeiro, Flavio
Florencio, Dinei
Zhang, Zhengyou
Author_xml – sequence: 1
  givenname: Flavio
  surname: Ribeiro
  fullname: Ribeiro, Flavio
  email: fr@lps.usp.br
  organization: Electronic Systems Eng. Dept., Universidade de São Paulo, Brazil
– sequence: 2
  givenname: Dinei
  surname: Florencio
  fullname: Florencio, Dinei
  email: dinei@microsoft.com
  organization: Microsoft Research, One Microsoft Way, Redmond, WA
– sequence: 3
  givenname: Philip A.
  surname: Chou
  fullname: Chou, Philip A.
  email: pachou@microsoft.com
  organization: Microsoft Research, One Microsoft Way, Redmond, WA
– sequence: 4
  givenname: Zhengyou
  surname: Zhang
  fullname: Zhang, Zhengyou
  email: zhang@microsoft.com
  organization: Microsoft Research, One Microsoft Way, Redmond, WA
BookMark eNo1j9tKw0AURUdU0NZ8gPgyP9CYOWcuiW-leCm0VLDv5SQ50Sm5lGQi5O8VrPtlsV4W7Jm4aruWhbhXSaxUkj1utx_vMSQKYosatYULMVPaOtTGgbkUUebSf0_MjYiG4Zj8LgWFGd6K9XIsfej6SdL42XAbuJQ9U-3D9CR3-ZGLIIeu9ZUvKPiulVXXy_DF8tsPI9X1JH1zIt9zeSeuK6oHjs6ci_3L8371ttjsXter5WbhQauwsCZ3TgFhYVEx50BOO0KtIYfCFIiqUtaQLS1jYlIAhDRjLDWVBSvCuXj4y3pmPpx631A_Hc7n8QfRBU9T
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/MMSP.2012.6343462
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1467345725
9781467345712
1467345717
9781467345729
EndPage 324
ExternalDocumentID 6343462
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ADFMO
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
ID FETCH-LOGICAL-i241t-65b7712a3c631eeb2a747a3442b2c5c331f165a6d6e3058223289e3d4adce1a3
IEDL.DBID RIE
ISBN 9781467345705
1467345709
IngestDate Wed Aug 27 02:52:33 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i241t-65b7712a3c631eeb2a747a3442b2c5c331f165a6d6e3058223289e3d4adce1a3
PageCount 6
ParticipantIDs ieee_primary_6343462
PublicationCentury 2000
PublicationDate 2012-09
PublicationDateYYYYMMDD 2012-09-01
PublicationDate_xml – month: 09
  year: 2012
  text: 2012-09
PublicationDecade 2010
PublicationTitle 2012 IEEE 14th International Workshop on Multimedia Signal Processing
PublicationTitleAbbrev MMSP
PublicationYear 2012
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0000821393
Score 1.6808057
Snippet Augmented reality applications have focused on visually integrating virtual objects into real environments. In this paper, we propose an auditory augmented...
SourceID ieee
SourceType Publisher
StartPage 319
SubjectTerms Acoustics
augmented reality
blind
Cameras
Encoding
Face recognition
natural user interface
Rendering (computer graphics)
sonification
spatialization
Training
Visualization
visually impaired
Title Auditory augmented reality: Object sonification for the visually impaired
URI https://ieeexplore.ieee.org/document/6343462
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA5zJ08qU_xNDh5tt_QlaetNxDGF6sAJu438qgzHJrMV5l_vS9tNFA_e0hxCyKPvfe_l-14IubDaqESnUeCkgoDHicB_jkWBzVUepTpPjfJ1yOxBDp75_ViMW-Ryo4VxzlXkMxf6YXWXbxem9KWyrgQO3DvcrThOa63Wpp6CoQzBDFTaLRkDF3EvXbd0ar5Fc6vJemk3y56GntgVhc2iP15XqYJLf4dk623VnJLXsCx0aD5_dWz87753yf63jI8ONwFqj7TcvEPurr0OY7FcUVW-VB05LUXg6NH4FX3UvixDEYN7BlFlNIqoliJKpB_T91LNZivqhZXoKO0-GfVvRzeDoHlPIZhinC4CKXSMdlBgJDCHKbXCXEIB55GOjDAALGdSKGmlQy-AyAEwG3NgubLGMQUHpD1fzN0hoQm4iPGcaQUxV7iuVJgIuV5imU1ya49Ix5_C5K3umDFpDuD47-kTsu0tUTO3Tkm7WJbuDEN9oc8rG38B4y-lzw
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT8JAEN4QPOhJDRjf7sGjLWz30dabMRJQiiRiwo3sq4ZIwGBrgr_e2bZgNB68bfew2eykM9_Mft8sQpdGaRmpOPCskNRjYcThnyOBZ1KZBrFKYy1dHTIZiO4zux_zcQ1dbbQw1tqCfGZ9Nyzu8s1C565U1hKUUeYc7haHrCIs1VqbigoEM4AztFBviZAyHrbjdVOn6ptX95qkHbeS5GnoqF2BXy37432VIrx0dlGy3ljJKnn180z5-vNXz8b_7nwPNb-FfHi4CVH7qGbnDdS7cUqMxXKFZf5S9OQ0GKCjw-PX-FG5wgwGFO44RIXZMOBaDDgRf0zfczmbrbCTVoKrNE006tyNbrte9aKCN4VInXmCqxAsIakWlFhIqiVkE5IyFqhAc00pSYngUhhhwQ8AdqCQj1lqmDTaEkkPUH2-mNtDhCNqA8JSoiQNmYR1hYRUyLYjQ0yUGnOEGu4UJm9lz4xJdQDHf09foO3uKOlP-r3BwwnacVYpeVynqJ4tc3sGgT9T54W9vwDPPqkg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2012+IEEE+14th+International+Workshop+on+Multimedia+Signal+Processing&rft.atitle=Auditory+augmented+reality%3A+Object+sonification+for+the+visually+impaired&rft.au=Ribeiro%2C+Flavio&rft.au=Florencio%2C+Dinei&rft.au=Chou%2C+Philip+A.&rft.au=Zhang%2C+Zhengyou&rft.date=2012-09-01&rft.pub=IEEE&rft.isbn=9781467345705&rft.spage=319&rft.epage=324&rft_id=info:doi/10.1109%2FMMSP.2012.6343462&rft.externalDocID=6343462
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467345705/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467345705/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467345705/sc.gif&client=summon&freeimage=true