Joint optimization of scale factors and Huffman code books for MPEG-4 AAC

This paper addresses the optimization problem of minimizing the distortion subject to a rate constraint for an MPEG-4 Advanced Audio Coding (AAC) encoder. We first develop a mathematical model of the AAC encoding process. In previous work, the joint optimization problem is modeled as a Viterbi searc...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on signal processing Vol. 54; no. 1; pp. 177 - 189
Main Authors Bauer, C., Vinton, M.
Format Journal Article
LanguageEnglish
Published New York, NY IEEE 01.01.2006
Institute of Electrical and Electronics Engineers
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN1053-587X
1941-0476
DOI10.1109/TSP.2005.861090

Cover

Abstract This paper addresses the optimization problem of minimizing the distortion subject to a rate constraint for an MPEG-4 Advanced Audio Coding (AAC) encoder. We first develop a mathematical model of the AAC encoding process. In previous work, the joint optimization problem is modeled as a Viterbi search for a cheapest path through a trellis. This method involves an iteration over a Lagrangian multiplier. We improve on this method by deriving a very accurate guess for the value of the final Lagrangian multiplier of the iteration as a function of the Perceptual Entropy of the signal and the given rate constraint. This reduces the complexity of the Trellis Search significantly. Whereas previous methods including the Trellis Search did not provide optimal solutions to the problem of minimizing the distortion subject to a rate constraint, we establish two methods that for the first time solve this problem optimally. Our first method is based on the formulation and solution of a Mixed Integer Linear Program, whereas our second method uses a Dynamic Programming solution that does not rely on the iteration over a Lagrangian multiplier. Based on our optimal methods, we evaluate the performance of the heuristic Two Loop Search (TLS), which is used in most commercial AAC implementations to solve the problem under consideration, and the performance of the Trellis Search.
AbstractList This paper addresses the optimization problem of minimizing the distortion subject to a rate constraint for an MPEG-4 Advanced Audio Coding (AAC) encoder. We first develop a mathematical model of the AAC encoding process. In previous work, the joint optimization problem is modeled as a Viterbi search for a cheapest path through a trellis. This method involves an iteration over a Lagrangian multiplier. We improve on this method by deriving a very accurate guess for the value of the final Lagrangian multiplier of the iteration as a function of the Perceptual Entropy of the signal and the given rate constraint. This reduces the complexity of the Trellis Search significantly. Whereas previous methods including the Trellis Search did not provide optimal solutions to the problem of minimizing the distortion subject to a rate constraint, we establish two methods that for the first time solve this problem optimally. Our first method is based on the formulation and solution of a Mixed Integer Linear Program, whereas our second method uses a Dynamic Programming solution that does not rely on the iteration over a Lagrangian multiplier. Based on our optimal methods, we evaluate the performance of the heuristic Two Loop Search (TLS), which is used in most commercial AAC implementations to solve the problem under consideration, and the performance of the Trellis Search.
Whereas previous methods including the Trellis Search did not provide optimal solutions to the problem of minimizing the distortion subject to a rate constraint, we establish two methods that for the first time solve this problem optimally.
Author Bauer, C.
Vinton, M.
Author_xml – sequence: 1
  givenname: C.
  surname: Bauer
  fullname: Bauer, C.
  organization: Dolby Labs., San Francisco, CA, USA
– sequence: 2
  givenname: M.
  surname: Vinton
  fullname: Vinton, M.
  organization: Dolby Labs., San Francisco, CA, USA
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17401154$$DView record in Pascal Francis
BookMark eNp9kU1LJDEQhoO4sOp63oOXILieeqx0J-nOcRj8WhQFXdhbyKQrEO1JxqTnsP560zuC4MFcUoHnrUry7JPdEAMS8pPBjDFQZ48P97MaQMw6WY6wQ_aY4qwC3srdUoNoKtG1f7-T_ZyfABjnSu6R69_Rh5HG9ehX_tWMPgYaHc3WDEidsWNMmZrQ06uNcysTqI090mWMz5m6mOjt_fllxel8vvhBvjkzZDx83w_In4vzx8VVdXN3eb2Y31SW83asmr51CKZDEBwVX2IvykLmJKimtuVipm9L0YAUS-gUs8pyJ2yPrmlc2zcH5HTbd53iywbzqFc-WxwGEzBusu6UrHmtJC_kry_JugPGBLQFPP4EPsVNCuUVupNSKqHqCTp5h8z0Oy6ZYH3W6-RXJv3TrOVTt2nq2ZazKeac0H0goCdTupjSkym9NVUS4lPC-vG_ijEZP3yRO9rmPCJ-TBGSiU40b1rQnwU
CODEN ITPRED
CitedBy_id crossref_primary_10_1250_ast_29_362
Cites_doi 10.1007/978-3-662-09562-1
10.1109/MMSP.2004.1436432
10.1109/49.608
10.1109/5.842996
10.1109/ICASSP.2001.940356
10.1109/29.17498
10.1121/1.383662
10.1109/ICIP.2001.958171
10.1287/opre.11.3.399
10.1109/29.90373
10.1137/1013001
10.1109/SCFT.2000.878430
ContentType Journal Article
Copyright 2006 INIST-CNRS
Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2006
Copyright_xml – notice: 2006 INIST-CNRS
– notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2006
DBID 97E
RIA
RIE
AAYXX
CITATION
IQODW
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
F28
FR3
DOI 10.1109/TSP.2005.861090
DatabaseName IEEE All-Society Periodicals Package (ASPP) 2005–Present
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Electronic Library (IEL)
CrossRef
Pascal-Francis
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
ANTE: Abstracts in New Technology & Engineering
Engineering Research Database
DatabaseTitle CrossRef
Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
Engineering Research Database
ANTE: Abstracts in New Technology & Engineering
DatabaseTitleList Technology Research Database

Technology Research Database
Technology Research Database
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Applied Sciences
EISSN 1941-0476
EndPage 189
ExternalDocumentID 2348479431
17401154
10_1109_TSP_2005_861090
1561585
Genre orig-research
GroupedDBID -~X
.DC
0R~
29I
3EH
4.4
53G
5GY
5VS
6IK
85S
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABFSI
ABQJQ
ABVLG
ACGFO
ACIWK
ACKIV
ACNCT
AENEX
AETIX
AGQYO
AGSQL
AHBIQ
AI.
AIBXA
AJQPL
AKJIK
AKQYR
ALLEH
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
E.L
EBS
EJD
F5P
HZ~
H~9
ICLAB
IFIPE
IFJZH
IPLJI
JAVBF
LAI
MS~
O9-
OCL
P2P
RIA
RIE
RNS
TAE
TN5
VH1
AAYXX
CITATION
IQODW
RIG
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
F28
FR3
ID FETCH-LOGICAL-c447t-3d7fe0a8e054e94bed5555e1f60932c001ad732c3065b0891c9c4f5cdef33f7d3
IEDL.DBID RIE
ISSN 1053-587X
IngestDate Sat Sep 27 20:37:02 EDT 2025
Mon Sep 29 06:33:05 EDT 2025
Sun Jun 29 16:14:30 EDT 2025
Mon Jul 21 09:16:34 EDT 2025
Thu Apr 24 23:04:09 EDT 2025
Wed Oct 01 06:30:19 EDT 2025
Tue Aug 26 16:36:36 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Lagrange multiplier
Audio signal processing
Limits
Coding
Huffman code
Performance analysis
Scale factor
signal processing for communications
Optimization
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
CC BY 4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c447t-3d7fe0a8e054e94bed5555e1f60932c001ad732c3065b0891c9c4f5cdef33f7d3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Article-2
ObjectType-Feature-1
content type line 23
PQID 866695927
PQPubID 23500
PageCount 13
ParticipantIDs proquest_miscellaneous_896242964
crossref_primary_10_1109_TSP_2005_861090
proquest_miscellaneous_28011507
ieee_primary_1561585
pascalfrancis_primary_17401154
proquest_journals_866695927
crossref_citationtrail_10_1109_TSP_2005_861090
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2006-Jan.
2006-01-00
2006
20060101
PublicationDateYYYYMMDD 2006-01-01
PublicationDate_xml – month: 01
  year: 2006
  text: 2006-Jan.
PublicationDecade 2000
PublicationPlace New York, NY
PublicationPlace_xml – name: New York, NY
– name: New York
PublicationTitle IEEE transactions on signal processing
PublicationTitleAbbrev TSP
PublicationYear 2006
Publisher IEEE
Institute of Electrical and Electronics Engineers
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: Institute of Electrical and Electronics Engineers
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References ref12
ref15
Najafzadeh (ref14)
Beaton (ref11) 1996
ref2
ref17
Bosi (ref5) 1997; 45
ref16
ref18
ref8
Yang (ref19)
ref7
Gordon (ref9)
ref4
ref3
ref6
Najafzadeh (ref13)
Aggarwal (ref1) 2002
Zwicker (ref21) c1999
Zoelzer (ref20) 1997
References_xml – volume-title: Psychoacoustics: Facts and Models
  year: c1999
  ident: ref21
  doi: 10.1007/978-3-662-09562-1
– volume: 45
  start-page: 791
  issue: 10
  year: 1997
  ident: ref5
  article-title: ISO/IEC MPEG-2 advanced audio coding
  publication-title: J. Audio Eng. Soc.
– start-page: 126
  year: 1996
  ident: ref11
  article-title: Collected papers of audio bit-rate reduction
  publication-title: Objective Perceptual Measurement of Audio Quality
– ident: ref4
  doi: 10.1109/MMSP.2004.1436432
– ident: ref12
  doi: 10.1109/49.608
– ident: ref15
  doi: 10.1109/5.842996
– volume-title: Toward weighted mean-squared optimality of scalable audio coding
  year: 2002
  ident: ref1
– ident: ref3
  doi: 10.1109/ICASSP.2001.940356
– ident: ref6
  doi: 10.1109/29.17498
– volume-title: 115th AES Convention
  ident: ref19
  article-title: Cascaded trellis-based optimization for MPEG-4 advanced audio coding
– ident: ref16
  doi: 10.1121/1.383662
– ident: ref18
  doi: 10.1109/ICIP.2001.958171
– ident: ref7
  doi: 10.1287/opre.11.3.399
– ident: ref17
  doi: 10.1109/29.90373
– ident: ref8
  doi: 10.1137/1013001
– volume-title: Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
  ident: ref14
  article-title: Improving perceptual coding of narrow-band audio signals at low rates
– volume-title: Graphs and Algorithms
  ident: ref9
– volume-title: Digital Audio Signal Processing
  year: 1997
  ident: ref20
– ident: ref2
  doi: 10.1109/SCFT.2000.878430
– volume-title: Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
  ident: ref13
  article-title: Perceptual bit allocation for low rate coding of narrow-band audio
SSID ssj0014496
Score 1.8505397
Snippet This paper addresses the optimization problem of minimizing the distortion subject to a rate constraint for an MPEG-4 Advanced Audio Coding (AAC) encoder. We...
Whereas previous methods including the Trellis Search did not provide optimal solutions to the problem of minimizing the distortion subject to a rate...
SourceID proquest
pascalfrancis
crossref
ieee
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 177
SubjectTerms Applied sciences
Audio coding
Coding, codes
Constraint optimization
Distortion
Dynamic programming
Encoders
Encoding
Entropy
Exact sciences and technology
Information, signal and communications theory
Iterative methods
Lagrangian functions
Limits
Mathematical model
Mathematical models
Methods
MPEG 4 Standard
Multipliers
Optimization
performance analysis
Searching
Signal and communications theory
signal processing for communications
Studies
Telecommunications and information theory
Viterbi algorithm
Title Joint optimization of scale factors and Huffman code books for MPEG-4 AAC
URI https://ieeexplore.ieee.org/document/1561585
https://www.proquest.com/docview/866695927
https://www.proquest.com/docview/28011507
https://www.proquest.com/docview/896242964
Volume 54
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE Xplore
  customDbUrl:
  eissn: 1941-0476
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0014496
  issn: 1053-587X
  databaseCode: RIE
  dateStart: 19910101
  isFulltext: true
  titleUrlDefault: https://ieeexplore.ieee.org/
  providerName: IEEE
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9QwEB61PcGBV0GEQvGBAwe89W4cP46rqmWptKgSrbS3KLHHEqJNEJtc-PWM4-wuBSqRkyVPosn49Y3tmQ_gHSqBUgTHna8sl8ogr2vjuVCVzulxftjvWH5Wi2t5sSpWe_BhGwuDiMPlM5zE4nCW71vXx62yE_I1pgRv92FfG5VitbYnBlIOXFwEF3JeGL0a0_hMhT25-nKZNk9MzC0u7qxAA6VKvBBZrckmIZFZ_DUvD4vN-WNYbtRMd0y-Tfqunriff2Rw_N__eAKPRtTJ5qmbPIU9bJ7Bw99yER7Cp4v2a9OxlqaQ2zE2k7WBRX2Rjaw8rGo8W_Qh3FYNi8HwLGL0NSPgy5aXZx-5ZPP56XO4Pj-7Ol3wkWeBOyl1x3OvA4rKIME3tLJGX9CD06AEoTtHxq28pkIkma-FsVNnnQyF8xjyPGifv4CDpm3wJbBKFYR3ZlSnSaQmT13IEJ1CzKXHXGcw2di-dGMS8siFcVMOzoiwJTVWpMYsytRYGbzfvvA95d-4X_QwmnonlqycwfGdxt3VRzJCQpAZHG1auxwH8Jo-qZQt7Iw0frutpZEXj1OqBtt-Xc5MgtMZsHskjI3RN1bJV_9W7Qge7HZ0XsNB96PHN4Rxuvp46Ny_AEOS91o
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9QwEB6VcgAOvAoiFFofOHDAW-_GduLjqmrZlm5Via20t8ixxxKCJohNLvx6xkl2lwKVyMmSJ9Fk_PrG9swH8A61QCmC485bw6XOkZdl7rnQNkvpcb7b75hf6tm1PF-q5Q582MTCIGJ3-QxHsdid5fvatXGr7Ih8jTHB23twX0kpVR-ttTkzkLJj4yLAkHKVZ8shkc9YmKPF56t--ySP2cXFrTWoI1WJVyLtiqwSejqLv2bmbrk5fQLztaL9LZOvo7YpR-7nHzkc__dPnsLjAXeyad9RnsEOVs_h0W_ZCPfg7Lz-UjWspknkZojOZHVgUV9kAy8Ps5VnszaEG1uxGA7PIkpfMYK-bH518pFLNp0ev4Dr05PF8YwPTAvcSZk1PPVZQGFzJACHRpboFT04DloQvnNkXOszKkSa-VLkZuyMk0E5jyFNQ-bTl7Bb1RW-Ama1IsQzobqMREry1YUM0S3EVHpMswRGa9sXbkhDHtkwvhWdOyJMQY0VyTFV0TdWAu83L3zvM3DcLboXTb0V662cwMGtxt3WRzpCwpAJ7K9buxiG8Io-qbVRZkIaH25qaezFAxVbYd2uikneA-oE2B0SuYnxN0bL1_9W7RAezBbzi-Li7PLTPjzc7u-8gd3mR4tvCfE05UHX0X8BWo36pw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Joint+optimization+of+scale+factors+and+huffman+code+books+for+MPEG-4+AAC&rft.jtitle=IEEE+transactions+on+signal+processing&rft.au=BAUER%2C+Claus&rft.au=VINTON%2C+Mark&rft.date=2006&rft.pub=Institute+of+Electrical+and+Electronics+Engineers&rft.issn=1053-587X&rft.volume=54&rft.issue=1&rft.spage=177&rft.epage=189&rft_id=info:doi/10.1109%2Ftsp.2005.861090&rft.externalDBID=n%2Fa&rft.externalDocID=17401154
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1053-587X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1053-587X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1053-587X&client=summon