Exact full-RSB SAT/UNSAT transition in infinitely wide two-layer neural networks
We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with negative margin and an infinite-width two-layer neural network with non-overlapping receptive fields and generic activation function. Using a full...
        Saved in:
      
    
          | Published in | SciPost physics Vol. 18; no. 4; p. 118 | 
|---|---|
| Main Authors | , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
            SciPost
    
        01.04.2025
     | 
| Online Access | Get full text | 
| ISSN | 2542-4653 2542-4653  | 
| DOI | 10.21468/SciPostPhys.18.4.118 | 
Cover
| Abstract | We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with negative margin and an infinite-width two-layer neural network with non-overlapping receptive fields and generic activation function. Using a full-RSB Ansatz we compute the exact value of the SAT/UNSAT transition. Furthermore, in the case of the negative perceptron we show that the overlap distribution of typical states displays an overlap gap (a disconnected support) in certain regions of the phase diagram defined by the value of the margin and the density of patterns to be stored. This implies that some recent theorems that ensure convergence of Approximate Message Passing (AMP) based algorithms to capacity are not applicable. Finally, we show that Gradient Descent is not able to reach the maximal capacity, irrespectively of the presence of an overlap gap for typical states. This finding, similarly to what occurs in binary weight models, suggests that gradient-based algorithms are biased towards highly atypical states, whose inaccessibility determines the algorithmic threshold. | 
    
|---|---|
| AbstractList | We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with negative margin and an infinite-width two-layer neural network with non-overlapping receptive fields and generic activation function. Using a full-RSB Ansatz we compute the exact value of the SAT/UNSAT transition. Furthermore, in the case of the negative perceptron we show that the overlap distribution of typical states displays an overlap gap (a disconnected support) in certain regions of the phase diagram defined by the value of the margin and the density of patterns to be stored. This implies that some recent theorems that ensure convergence of Approximate Message Passing (AMP) based algorithms to capacity are not applicable. Finally, we show that Gradient Descent is not able to reach the maximal capacity, irrespectively of the presence of an overlap gap for typical states. This finding, similarly to what occurs in binary weight models, suggests that gradient-based algorithms are biased towards highly atypical states, whose inaccessibility determines the algorithmic threshold. | 
    
| ArticleNumber | 118 | 
    
| Author | Malatesta, Enrico Annesi, Brandon L. Zamponi, Francesco  | 
    
| Author_xml | – sequence: 1 givenname: Brandon L. surname: Annesi fullname: Annesi, Brandon L. – sequence: 2 givenname: Enrico orcidid: 0000-0001-8558-6175 surname: Malatesta fullname: Malatesta, Enrico – sequence: 3 givenname: Francesco orcidid: 0000-0001-9260-1951 surname: Zamponi fullname: Zamponi, Francesco  | 
    
| BookMark | eNplkd1KAzEQhYNUUGsfQdgX2DaT391LFX8KosXqdchmE02NuyXZUvftXVtRQRjmDAfOd3HmBI2atrEInQGeEmCimC2NX7SpW7z2aQrFlE0BigN0TDgjOROcjv7cR2iS0gpjTABKEPwYLa4-tOkytwkhf1xeZMvzp9nz_bCzLuom-c63Tea_xvnGdzb02dbXNuu2bR50b2PW2E3UYZDBim_pFB06HZKdfOsYPV9fPV3e5ncPN_PL87vcUBBdTqEUEiTTFcOVxjU4STXIwmBZG8kILWzFoTIlrQgWNS9KzE1ZGioIMbwq6RjN99y61Su1jv5dx1612qud0cYXpWPnTbCKSEmJw4AFpYw6XVlJREEs4xics3xgiT1r06x1v9Uh_AABq13NKhm_HmpeDzUrKBRTQ81DkO-DJrYpRev-5_685zf3CeD2hmo | 
    
| Cites_doi | 10.1038/ncomms4725 10.1103/PhysRevA.45.4146 10.1137/20M132016X 10.1103/PhysRevX.11.031059 10.7566/JPSJ.94.014802 10.1016/0375-9601(79)90708-4 10.1073/pnas.2108492118 10.1103/PhysRevA.45.7590 10.1103/PhysRevLett.127.278301 10.1016/0550-3213(85)90374-8 10.1021/jp402235d 10.1162/neco_a_01494 10.1073/pnas.1908636117 10.1007/s00440-023-01248-y 10.1088/0305-4470/21/1/031 10.1109/FOCS52979.2021.00041 10.1088/0305-4470/21/1/030 10.1088/1751-8113/49/14/145001 10.1088/1742-5468/2016/02/023301 10.1088/0305-4470/22/12/004 10.1088/0305-4470/11/5/028 10.1103/PhysRevLett.123.170602 10.48550/arXiv.2309.09240 10.1103/PhysRevLett.115.128101 10.1103/PhysRevE.109.034305 10.1103/PhysRevLett.131.227301 10.1088/0305-4470/14/1/027 10.1146/annurev-conmatphys-070909-104045 10.1017/9781108120494 10.48550/arXiv.2402.05719 10.1103/PhysRevE.65.046137 10.1007/s10955-022-02976-6 10.1103/PhysRevE.103.L020301 10.1103/PhysRevE.106.014116 10.1103/PhysRevE.105.024134 10.1088/0022-3719/17/32/012 10.1088/0305-4470/13/4/009 10.1103/PhysRevLett.65.2312 10.1103/PhysRevE.90.052813 10.1103/PhysRevLett.43.1754 10.1103/PhysRevLett.123.115702 10.1142/0271 10.1088/0305-4470/13/3/042 10.21468/SciPostPhys.2.3.019 10.1103/PhysRevLett.123.160602 10.1051/jphys:0198900500200305700 10.1037/h0042519 10.1103/PhysRevE.88.032135 10.1103/PhysRevE.108.024310 10.1088/1742-5468/abdc16 10.1007/978-1-4612-0745-0_2 10.48550/arXiv.2402.05696  | 
    
| ContentType | Journal Article | 
    
| DBID | AAYXX CITATION ADTOC UNPAY DOA  | 
    
| DOI | 10.21468/SciPostPhys.18.4.118 | 
    
| DatabaseName | CrossRef Unpaywall for CDI: Periodical Content Unpaywall DOAJ Directory of Open Access Journals  | 
    
| DatabaseTitle | CrossRef | 
    
| DatabaseTitleList | CrossRef | 
    
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Physics | 
    
| EISSN | 2542-4653 | 
    
| ExternalDocumentID | oai_doaj_org_article_27732f01063343fabe72682e4501ffe5 10.21468/scipostphys.18.4.118 10_21468_SciPostPhys_18_4_118  | 
    
| GroupedDBID | 5VS AAFWJ AAYXX ADBBV AFPKN ALMA_UNASSIGNED_HOLDINGS BCNDV CITATION GROUPED_DOAJ M~E OK1 ADTOC UNPAY  | 
    
| ID | FETCH-LOGICAL-c316t-31967174ab40ba0d1f73a178c07dc74238eb51bc93b206d58905c99c3622c5b93 | 
    
| IEDL.DBID | UNPAY | 
    
| ISSN | 2542-4653 | 
    
| IngestDate | Fri Oct 03 12:43:40 EDT 2025 Sun Sep 07 11:15:40 EDT 2025 Tue Jul 01 05:11:22 EDT 2025  | 
    
| IsDoiOpenAccess | true | 
    
| IsOpenAccess | true | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 4 | 
    
| Language | English | 
    
| License | cc-by | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c316t-31967174ab40ba0d1f73a178c07dc74238eb51bc93b206d58905c99c3622c5b93 | 
    
| ORCID | 0000-0001-9260-1951 0000-0001-8558-6175  | 
    
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://doi.org/10.21468/scipostphys.18.4.118 | 
    
| ParticipantIDs | doaj_primary_oai_doaj_org_article_27732f01063343fabe72682e4501ffe5 unpaywall_primary_10_21468_scipostphys_18_4_118 crossref_primary_10_21468_SciPostPhys_18_4_118  | 
    
| ProviderPackageCode | CITATION AAYXX  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2025-04-01 | 
    
| PublicationDateYYYYMMDD | 2025-04-01 | 
    
| PublicationDate_xml | – month: 04 year: 2025 text: 2025-04-01 day: 01  | 
    
| PublicationDecade | 2020 | 
    
| PublicationTitle | SciPost physics | 
    
| PublicationYear | 2025 | 
    
| Publisher | SciPost | 
    
| Publisher_xml | – name: SciPost | 
    
| References | ref13 ref12 ref15 ref14 ref53 ref52 ref11 ref10 ref17 ref16 ref19 ref18 ref51 ref50 ref46 ref45 ref48 ref47 ref42 ref41 ref44 ref43 ref49 ref8 ref7 ref9 ref4 ref3 ref6 ref5 ref40 ref35 ref34 ref37 ref36 ref31 ref30 ref33 ref32 ref2 ref1 ref39 ref38 ref24 ref23 ref26 ref25 ref20 ref22 ref21 ref28 ref27 ref29  | 
    
| References_xml | – ident: ref8 doi: 10.1038/ncomms4725 – ident: ref28 doi: 10.1103/PhysRevA.45.4146 – ident: ref27 doi: 10.1137/20M132016X – ident: ref16 doi: 10.1103/PhysRevX.11.031059 – ident: ref14 doi: 10.7566/JPSJ.94.014802 – ident: ref42 doi: 10.1016/0375-9601(79)90708-4 – ident: ref25 doi: 10.1073/pnas.2108492118 – ident: ref13 doi: 10.1103/PhysRevA.45.7590 – ident: ref21 doi: 10.1103/PhysRevLett.127.278301 – ident: ref49 doi: 10.1016/0550-3213(85)90374-8 – ident: ref31 doi: 10.1021/jp402235d – ident: ref40 doi: 10.1162/neco_a_01494 – ident: ref18 doi: 10.1073/pnas.1908636117 – ident: ref36 doi: 10.1007/s00440-023-01248-y – ident: ref2 doi: 10.1088/0305-4470/21/1/031 – ident: ref38 doi: 10.1109/FOCS52979.2021.00041 – ident: ref1 doi: 10.1088/0305-4470/21/1/030 – ident: ref6 doi: 10.1088/1751-8113/49/14/145001 – ident: ref22 doi: 10.1088/1742-5468/2016/02/023301 – ident: ref3 doi: 10.1088/0305-4470/22/12/004 – ident: ref48 doi: 10.1088/0305-4470/11/5/028 – ident: ref24 doi: 10.1103/PhysRevLett.123.170602 – ident: ref35 doi: 10.48550/arXiv.2309.09240 – ident: ref39 doi: 10.1103/PhysRevLett.115.128101 – ident: ref17 doi: 10.1103/PhysRevE.109.034305 – ident: ref20 doi: 10.1103/PhysRevLett.131.227301 – ident: ref46 doi: 10.1088/0305-4470/14/1/027 – ident: ref7 doi: 10.1146/annurev-conmatphys-070909-104045 – ident: ref51 doi: 10.1017/9781108120494 – ident: ref30 doi: 10.48550/arXiv.2402.05719 – ident: ref50 doi: 10.1103/PhysRevE.65.046137 – ident: ref26 doi: 10.1007/s10955-022-02976-6 – ident: ref15 doi: 10.1103/PhysRevE.103.L020301 – ident: ref23 doi: 10.1103/PhysRevE.106.014116 – ident: ref33 doi: 10.1103/PhysRevE.105.024134 – ident: ref47 doi: 10.1088/0022-3719/17/32/012 – ident: ref52 doi: 10.1088/0305-4470/13/4/009 – ident: ref12 doi: 10.1103/PhysRevLett.65.2312 – ident: ref37 doi: 10.1103/PhysRevE.90.052813 – ident: ref43 doi: 10.1103/PhysRevLett.43.1754 – ident: ref32 doi: 10.1103/PhysRevLett.123.115702 – ident: ref44 doi: 10.1142/0271 – ident: ref45 doi: 10.1088/0305-4470/13/3/042 – ident: ref9 doi: 10.21468/SciPostPhys.2.3.019 – ident: ref10 doi: 10.1103/PhysRevLett.123.160602 – ident: ref4 doi: 10.1051/jphys:0198900500200305700 – ident: ref11 doi: 10.1103/PhysRevE.105.024134 – ident: ref5 doi: 10.1037/h0042519 – ident: ref53 doi: 10.1103/PhysRevE.88.032135 – ident: ref19 doi: 10.1103/PhysRevE.108.024310 – ident: ref34 doi: 10.1088/1742-5468/abdc16 – ident: ref41 doi: 10.1007/978-1-4612-0745-0_2 – ident: ref29 doi: 10.48550/arXiv.2402.05696  | 
    
| SSID | ssj0002119165 | 
    
| Score | 2.3098748 | 
    
| Snippet | We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with... | 
    
| SourceID | doaj unpaywall crossref  | 
    
| SourceType | Open Website Open Access Repository Index Database  | 
    
| StartPage | 118 | 
    
| SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8QwEA4iiHoQn7i-yMFrdptXmxxdUURwEXXBW0nSBBaWuriV1X9vJl2l4sGLUHoohSkzTWa-9ptvEDpX1FnnpSBOF5aIKihiJFNEykArVmgZNDQK343ym7G4fZbPnVFfwAlr5YFbxw1YUXAWALlwLngw1hcsV8wLmdEQfFIvzZTugCnYg5NuWS7blh2YXa0Gca3A_FsgVvap6ou4WagfyShp9m-i9bd6Zj4WZjrtJJrrbbS1rBDxRftkO2jF17toLTE13XwP3V-9G9dg-GxOHh6H-PHiaTAexTNuIO0kBhaewBEmUE9OP_BiUnncLF7I1MQCG4OEZTRQtwTw-T4aX189Xd6Q5VgE4jjNGwKLJoIwYazIrMkqGgpuaKFcVlQOfrwqbyW1TnPLsrySSmfSae1iqmJOWs0P0Gr9UvtDhHXwgkMLE9MRJytvVM6rzIncRKAhmeih_pd_ylmrflFG1JAcWnYcWlJViggmVA8NwYvfN4N4dboQQ1ouQ1r-FdIeGnzH4LdZ6FWOZmdds0f_YfYYbTAY7ptoOSdotXl986ex4mjsWXq5PgHNhdHM priority: 102 providerName: Directory of Open Access Journals  | 
    
| Title | Exact full-RSB SAT/UNSAT transition in infinitely wide two-layer neural networks | 
    
| URI | https://doi.org/10.21468/scipostphys.18.4.118 https://doaj.org/article/27732f01063343fabe72682e4501ffe5  | 
    
| UnpaywallVersion | publishedVersion | 
    
| Volume | 18 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 2542-4653 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0002119165 issn: 2542-4653 databaseCode: DOA dateStart: 20160101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources (selected full-text only) customDbUrl: eissn: 2542-4653 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0002119165 issn: 2542-4653 databaseCode: M~E dateStart: 20160101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3da9swED-2lLHuYd9j2UfRw17lWl-29JiOljJYKGsD3ZORZAnCghMWh7T766ez3ZBtDDYwfjAWJ9_pfPeT7gPgg2be-aAk9aZ0VNZRU6u4pkpFVvPSqGgwUfjztDifyU_X6npIVsdcmL3ze2w5rTG7ZbVct4jzM6YzmXRc34eDQiXXewQHs-nF5Cs2kFOSU6wV1mfp_H3sL_anK9P_CB5umpW93drFYs-2nD2B6d2s-pCSb9mmdZn_8VvBxn-e9lN4PHiZZNIvi2dwLzTP4UEX7enXL-Di9Mb6luDWO_1yeUIuJ1fHs2m6kxZNVxfFReZ4xTn6pItbsp3XgbTbJV3Y5KQTLIOZCDR9EPn6JczOTq8-ntOhtQL1ghUtRcVLQE5aJ3Nn85rFUlhWap-XtcfDWx2cYs4b4Xhe1EqbXHljfDJ33CtnxCsYNcsmvAZiYpAC06C4SVhbB6sLUedeFjaBFcXlGLI7hlervoJGlZBHx6Yq_cCwKTF-f8V0JRMg0WM4QbHsXsYC2N2DxOBq0KeKl6XgEQGtEFJE60LJC82DVDmLMagxHO-E-ifZPensyL757xFv4ZBjN-AujucdjNrvm_A-uSitO-qg_dGwPH8CKzjlSw | 
    
| linkProvider | Unpaywall | 
    
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3da9swED-2lLHtYd-j2Rd62Ktc68uWHtPRUgYLZW2gezKSLEFocMLikHZ__XS2G7KNwQbGD8bi5Dud737SfQB81Mw7H5Sk3pSOyjpqahXXVKnIal4aFQ0mCn-ZFmcz-flKXQ3J6pgLs3d-jy2nNWa3rJbrFnF-xnQmk47r-3BQqOR6j-BgNj2ffMMGckpyirXC-iydv4_9xf50Zfofw8NNs7K3W7tY7NmW06cwvZtVH1JynW1al_kfvxVs_OdpP4Mng5dJJv2yeA73QvMCHnTRnn79Es5PbqxvCW69068Xx-Ricnk0m6Y7adF0dVFcZI5XnKNPurgl23kdSLtd0oVNTjrBMpiJQNMHka9fwez05PLTGR1aK1AvWNFSVLwE5KR1Mnc2r1kshWWl9nlZezy81cEp5rwRjudFrbTJlTfGJ3PHvXJGvIZRs2zCIRATgxSYBsVNwto6WF2IOveysAmsKC7HkN0xvFr1FTSqhDw6NlXpB4ZNifH7K6YrmQCJHsMximX3MhbA7h4kBleDPlW8LAWPCGiFkCJaF0peaB6kylmMQY3haCfUP8nuSWdH9s1_j3gLjzh2A-7ieN7BqP2-Ce-Ti9K6D8PC_AnQ5-RW | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Exact+full-RSB+SAT%2FUNSAT+transition+in+infinitely+wide+two-layer+neural+networks&rft.jtitle=SciPost+physics&rft.au=Annesi%2C+Brandon+L.&rft.au=Malatesta%2C+Enrico&rft.au=Zamponi%2C+Francesco&rft.date=2025-04-01&rft.issn=2542-4653&rft.eissn=2542-4653&rft.volume=18&rft.issue=4&rft_id=info:doi/10.21468%2FSciPostPhys.18.4.118&rft.externalDBID=n%2Fa&rft.externalDocID=10_21468_SciPostPhys_18_4_118 | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2542-4653&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2542-4653&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2542-4653&client=summon |