Joint optimization of scale factors and Huffman code books for MPEG-4 AAC
This paper addresses the optimization problem of minimizing the distortion subject to a rate constraint for an MPEG-4 Advanced Audio Coding (AAC) encoder. We first develop a mathematical model of the AAC encoding process. In previous work, the joint optimization problem is modeled as a Viterbi searc...
        Saved in:
      
    
          | Published in | IEEE transactions on signal processing Vol. 54; no. 1; pp. 177 - 189 | 
|---|---|
| Main Authors | , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        New York, NY
          IEEE
    
        01.01.2006
     Institute of Electrical and Electronics Engineers The Institute of Electrical and Electronics Engineers, Inc. (IEEE)  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1053-587X 1941-0476  | 
| DOI | 10.1109/TSP.2005.861090 | 
Cover
| Abstract | This paper addresses the optimization problem of minimizing the distortion subject to a rate constraint for an MPEG-4 Advanced Audio Coding (AAC) encoder. We first develop a mathematical model of the AAC encoding process. In previous work, the joint optimization problem is modeled as a Viterbi search for a cheapest path through a trellis. This method involves an iteration over a Lagrangian multiplier. We improve on this method by deriving a very accurate guess for the value of the final Lagrangian multiplier of the iteration as a function of the Perceptual Entropy of the signal and the given rate constraint. This reduces the complexity of the Trellis Search significantly. Whereas previous methods including the Trellis Search did not provide optimal solutions to the problem of minimizing the distortion subject to a rate constraint, we establish two methods that for the first time solve this problem optimally. Our first method is based on the formulation and solution of a Mixed Integer Linear Program, whereas our second method uses a Dynamic Programming solution that does not rely on the iteration over a Lagrangian multiplier. Based on our optimal methods, we evaluate the performance of the heuristic Two Loop Search (TLS), which is used in most commercial AAC implementations to solve the problem under consideration, and the performance of the Trellis Search. | 
    
|---|---|
| AbstractList | This paper addresses the optimization problem of minimizing the distortion subject to a rate constraint for an MPEG-4 Advanced Audio Coding (AAC) encoder. We first develop a mathematical model of the AAC encoding process. In previous work, the joint optimization problem is modeled as a Viterbi search for a cheapest path through a trellis. This method involves an iteration over a Lagrangian multiplier. We improve on this method by deriving a very accurate guess for the value of the final Lagrangian multiplier of the iteration as a function of the Perceptual Entropy of the signal and the given rate constraint. This reduces the complexity of the Trellis Search significantly. Whereas previous methods including the Trellis Search did not provide optimal solutions to the problem of minimizing the distortion subject to a rate constraint, we establish two methods that for the first time solve this problem optimally. Our first method is based on the formulation and solution of a Mixed Integer Linear Program, whereas our second method uses a Dynamic Programming solution that does not rely on the iteration over a Lagrangian multiplier. Based on our optimal methods, we evaluate the performance of the heuristic Two Loop Search (TLS), which is used in most commercial AAC implementations to solve the problem under consideration, and the performance of the Trellis Search. Whereas previous methods including the Trellis Search did not provide optimal solutions to the problem of minimizing the distortion subject to a rate constraint, we establish two methods that for the first time solve this problem optimally.  | 
    
| Author | Bauer, C. Vinton, M.  | 
    
| Author_xml | – sequence: 1 givenname: C. surname: Bauer fullname: Bauer, C. organization: Dolby Labs., San Francisco, CA, USA – sequence: 2 givenname: M. surname: Vinton fullname: Vinton, M. organization: Dolby Labs., San Francisco, CA, USA  | 
    
| BackLink | http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17401154$$DView record in Pascal Francis | 
    
| BookMark | eNp9kU1LJDEQhoO4sOp63oOXILieeqx0J-nOcRj8WhQFXdhbyKQrEO1JxqTnsP560zuC4MFcUoHnrUry7JPdEAMS8pPBjDFQZ48P97MaQMw6WY6wQ_aY4qwC3srdUoNoKtG1f7-T_ZyfABjnSu6R69_Rh5HG9ehX_tWMPgYaHc3WDEidsWNMmZrQ06uNcysTqI090mWMz5m6mOjt_fllxel8vvhBvjkzZDx83w_In4vzx8VVdXN3eb2Y31SW83asmr51CKZDEBwVX2IvykLmJKimtuVipm9L0YAUS-gUs8pyJ2yPrmlc2zcH5HTbd53iywbzqFc-WxwGEzBusu6UrHmtJC_kry_JugPGBLQFPP4EPsVNCuUVupNSKqHqCTp5h8z0Oy6ZYH3W6-RXJv3TrOVTt2nq2ZazKeac0H0goCdTupjSkym9NVUS4lPC-vG_ijEZP3yRO9rmPCJ-TBGSiU40b1rQnwU | 
    
| CODEN | ITPRED | 
    
| CitedBy_id | crossref_primary_10_1250_ast_29_362 | 
    
| Cites_doi | 10.1007/978-3-662-09562-1 10.1109/MMSP.2004.1436432 10.1109/49.608 10.1109/5.842996 10.1109/ICASSP.2001.940356 10.1109/29.17498 10.1121/1.383662 10.1109/ICIP.2001.958171 10.1287/opre.11.3.399 10.1109/29.90373 10.1137/1013001 10.1109/SCFT.2000.878430  | 
    
| ContentType | Journal Article | 
    
| Copyright | 2006 INIST-CNRS Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2006  | 
    
| Copyright_xml | – notice: 2006 INIST-CNRS – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2006  | 
    
| DBID | 97E RIA RIE AAYXX CITATION IQODW 7SC 7SP 8FD JQ2 L7M L~C L~D F28 FR3  | 
    
| DOI | 10.1109/TSP.2005.861090 | 
    
| DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005–Present IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef Pascal-Francis Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts  Academic Computer and Information Systems Abstracts Professional ANTE: Abstracts in New Technology & Engineering Engineering Research Database  | 
    
| DatabaseTitle | CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional Engineering Research Database ANTE: Abstracts in New Technology & Engineering  | 
    
| DatabaseTitleList | Technology Research Database Technology Research Database Technology Research Database  | 
    
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Engineering Applied Sciences  | 
    
| EISSN | 1941-0476 | 
    
| EndPage | 189 | 
    
| ExternalDocumentID | 2348479431 17401154 10_1109_TSP_2005_861090 1561585  | 
    
| Genre | orig-research | 
    
| GroupedDBID | -~X .DC 0R~ 29I 3EH 4.4 53G 5GY 5VS 6IK 85S 97E AAJGR AARMG AASAJ AAWTH ABAZT ABFSI ABQJQ ABVLG ACGFO ACIWK ACKIV ACNCT AENEX AETIX AGQYO AGSQL AHBIQ AI. AIBXA AJQPL AKJIK AKQYR ALLEH ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 E.L EBS EJD F5P HZ~ H~9 ICLAB IFIPE IFJZH IPLJI JAVBF LAI MS~ O9- OCL P2P RIA RIE RNS TAE TN5 VH1 AAYXX CITATION IQODW RIG 7SC 7SP 8FD JQ2 L7M L~C L~D F28 FR3  | 
    
| ID | FETCH-LOGICAL-c447t-3d7fe0a8e054e94bed5555e1f60932c001ad732c3065b0891c9c4f5cdef33f7d3 | 
    
| IEDL.DBID | RIE | 
    
| ISSN | 1053-587X | 
    
| IngestDate | Sat Sep 27 20:37:02 EDT 2025 Mon Sep 29 06:33:05 EDT 2025 Sun Jun 29 16:14:30 EDT 2025 Mon Jul 21 09:16:34 EDT 2025 Thu Apr 24 23:04:09 EDT 2025 Wed Oct 01 06:30:19 EDT 2025 Tue Aug 26 16:36:36 EDT 2025  | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 1 | 
    
| Keywords | Lagrange multiplier Audio signal processing Limits Coding Huffman code Performance analysis Scale factor signal processing for communications Optimization  | 
    
| Language | English | 
    
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html CC BY 4.0  | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c447t-3d7fe0a8e054e94bed5555e1f60932c001ad732c3065b0891c9c4f5cdef33f7d3 | 
    
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Article-2 ObjectType-Feature-1 content type line 23  | 
    
| PQID | 866695927 | 
    
| PQPubID | 23500 | 
    
| PageCount | 13 | 
    
| ParticipantIDs | proquest_miscellaneous_896242964 crossref_primary_10_1109_TSP_2005_861090 proquest_miscellaneous_28011507 ieee_primary_1561585 pascalfrancis_primary_17401154 proquest_journals_866695927 crossref_citationtrail_10_1109_TSP_2005_861090  | 
    
| ProviderPackageCode | CITATION AAYXX  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2006-Jan. 2006-01-00 2006 20060101  | 
    
| PublicationDateYYYYMMDD | 2006-01-01 | 
    
| PublicationDate_xml | – month: 01 year: 2006 text: 2006-Jan.  | 
    
| PublicationDecade | 2000 | 
    
| PublicationPlace | New York, NY | 
    
| PublicationPlace_xml | – name: New York, NY – name: New York  | 
    
| PublicationTitle | IEEE transactions on signal processing | 
    
| PublicationTitleAbbrev | TSP | 
    
| PublicationYear | 2006 | 
    
| Publisher | IEEE Institute of Electrical and Electronics Engineers The Institute of Electrical and Electronics Engineers, Inc. (IEEE)  | 
    
| Publisher_xml | – name: IEEE – name: Institute of Electrical and Electronics Engineers – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)  | 
    
| References | ref12 ref15 Najafzadeh (ref14) Beaton (ref11) 1996 ref2 ref17 Bosi (ref5) 1997; 45 ref16 ref18 ref8 Yang (ref19) ref7 Gordon (ref9) ref4 ref3 ref6 Najafzadeh (ref13) Aggarwal (ref1) 2002 Zwicker (ref21) c1999 Zoelzer (ref20) 1997  | 
    
| References_xml | – volume-title: Psychoacoustics: Facts and Models year: c1999 ident: ref21 doi: 10.1007/978-3-662-09562-1 – volume: 45 start-page: 791 issue: 10 year: 1997 ident: ref5 article-title: ISO/IEC MPEG-2 advanced audio coding publication-title: J. Audio Eng. Soc. – start-page: 126 year: 1996 ident: ref11 article-title: Collected papers of audio bit-rate reduction publication-title: Objective Perceptual Measurement of Audio Quality – ident: ref4 doi: 10.1109/MMSP.2004.1436432 – ident: ref12 doi: 10.1109/49.608 – ident: ref15 doi: 10.1109/5.842996 – volume-title: Toward weighted mean-squared optimality of scalable audio coding year: 2002 ident: ref1 – ident: ref3 doi: 10.1109/ICASSP.2001.940356 – ident: ref6 doi: 10.1109/29.17498 – volume-title: 115th AES Convention ident: ref19 article-title: Cascaded trellis-based optimization for MPEG-4 advanced audio coding – ident: ref16 doi: 10.1121/1.383662 – ident: ref18 doi: 10.1109/ICIP.2001.958171 – ident: ref7 doi: 10.1287/opre.11.3.399 – ident: ref17 doi: 10.1109/29.90373 – ident: ref8 doi: 10.1137/1013001 – volume-title: Int. Conf. Acoustics, Speech, Signal Processing (ICASSP) ident: ref14 article-title: Improving perceptual coding of narrow-band audio signals at low rates – volume-title: Graphs and Algorithms ident: ref9 – volume-title: Digital Audio Signal Processing year: 1997 ident: ref20 – ident: ref2 doi: 10.1109/SCFT.2000.878430 – volume-title: Int. Conf. Acoustics, Speech, Signal Processing (ICASSP) ident: ref13 article-title: Perceptual bit allocation for low rate coding of narrow-band audio  | 
    
| SSID | ssj0014496 | 
    
| Score | 1.8505397 | 
    
| Snippet | This paper addresses the optimization problem of minimizing the distortion subject to a rate constraint for an MPEG-4 Advanced Audio Coding (AAC) encoder. We... Whereas previous methods including the Trellis Search did not provide optimal solutions to the problem of minimizing the distortion subject to a rate...  | 
    
| SourceID | proquest pascalfrancis crossref ieee  | 
    
| SourceType | Aggregation Database Index Database Enrichment Source Publisher  | 
    
| StartPage | 177 | 
    
| SubjectTerms | Applied sciences Audio coding Coding, codes Constraint optimization Distortion Dynamic programming Encoders Encoding Entropy Exact sciences and technology Information, signal and communications theory Iterative methods Lagrangian functions Limits Mathematical model Mathematical models Methods MPEG 4 Standard Multipliers Optimization performance analysis Searching Signal and communications theory signal processing for communications Studies Telecommunications and information theory Viterbi algorithm  | 
    
| Title | Joint optimization of scale factors and Huffman code books for MPEG-4 AAC | 
    
| URI | https://ieeexplore.ieee.org/document/1561585 https://www.proquest.com/docview/866695927 https://www.proquest.com/docview/28011507 https://www.proquest.com/docview/896242964  | 
    
| Volume | 54 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE Xplore customDbUrl: eissn: 1941-0476 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0014496 issn: 1053-587X databaseCode: RIE dateStart: 19910101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9QwEB61PcGBV0GEQvGBAwe89W4cP46rqmWptKgSrbS3KLHHEqJNEJtc-PWM4-wuBSqRkyVPosn49Y3tmQ_gHSqBUgTHna8sl8ogr2vjuVCVzulxftjvWH5Wi2t5sSpWe_BhGwuDiMPlM5zE4nCW71vXx62yE_I1pgRv92FfG5VitbYnBlIOXFwEF3JeGL0a0_hMhT25-nKZNk9MzC0u7qxAA6VKvBBZrckmIZFZ_DUvD4vN-WNYbtRMd0y-Tfqunriff2Rw_N__eAKPRtTJ5qmbPIU9bJ7Bw99yER7Cp4v2a9OxlqaQ2zE2k7WBRX2Rjaw8rGo8W_Qh3FYNi8HwLGL0NSPgy5aXZx-5ZPP56XO4Pj-7Ol3wkWeBOyl1x3OvA4rKIME3tLJGX9CD06AEoTtHxq28pkIkma-FsVNnnQyF8xjyPGifv4CDpm3wJbBKFYR3ZlSnSaQmT13IEJ1CzKXHXGcw2di-dGMS8siFcVMOzoiwJTVWpMYsytRYGbzfvvA95d-4X_QwmnonlqycwfGdxt3VRzJCQpAZHG1auxwH8Jo-qZQt7Iw0frutpZEXj1OqBtt-Xc5MgtMZsHskjI3RN1bJV_9W7Qge7HZ0XsNB96PHN4Rxuvp46Ny_AEOS91o | 
    
| linkProvider | IEEE | 
    
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9QwEB6VcgAOvAoiFFofOHDAW-_GduLjqmrZlm5Via20t8ixxxKCJohNLvx6xkl2lwKVyMmSJ9Fk_PrG9swH8A61QCmC485bw6XOkZdl7rnQNkvpcb7b75hf6tm1PF-q5Q582MTCIGJ3-QxHsdid5fvatXGr7Ih8jTHB23twX0kpVR-ttTkzkLJj4yLAkHKVZ8shkc9YmKPF56t--ySP2cXFrTWoI1WJVyLtiqwSejqLv2bmbrk5fQLztaL9LZOvo7YpR-7nHzkc__dPnsLjAXeyad9RnsEOVs_h0W_ZCPfg7Lz-UjWspknkZojOZHVgUV9kAy8Ps5VnszaEG1uxGA7PIkpfMYK-bH518pFLNp0ev4Dr05PF8YwPTAvcSZk1PPVZQGFzJACHRpboFT04DloQvnNkXOszKkSa-VLkZuyMk0E5jyFNQ-bTl7Bb1RW-Ama1IsQzobqMREry1YUM0S3EVHpMswRGa9sXbkhDHtkwvhWdOyJMQY0VyTFV0TdWAu83L3zvM3DcLboXTb0V662cwMGtxt3WRzpCwpAJ7K9buxiG8Io-qbVRZkIaH25qaezFAxVbYd2uikneA-oE2B0SuYnxN0bL1_9W7RAezBbzi-Li7PLTPjzc7u-8gd3mR4tvCfE05UHX0X8BWo36pw | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Joint+optimization+of+scale+factors+and+huffman+code+books+for+MPEG-4+AAC&rft.jtitle=IEEE+transactions+on+signal+processing&rft.au=BAUER%2C+Claus&rft.au=VINTON%2C+Mark&rft.date=2006&rft.pub=Institute+of+Electrical+and+Electronics+Engineers&rft.issn=1053-587X&rft.volume=54&rft.issue=1&rft.spage=177&rft.epage=189&rft_id=info:doi/10.1109%2Ftsp.2005.861090&rft.externalDBID=n%2Fa&rft.externalDocID=17401154 | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1053-587X&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1053-587X&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1053-587X&client=summon |