Exact full-RSB SAT/UNSAT transition in infinitely wide two-layer neural networks

We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with negative margin and an infinite-width two-layer neural network with non-overlapping receptive fields and generic activation function. Using a full...

Full description

Saved in:
Bibliographic Details
Published inSciPost physics Vol. 18; no. 4; p. 118
Main Authors Annesi, Brandon L., Malatesta, Enrico, Zamponi, Francesco
Format Journal Article
LanguageEnglish
Published SciPost 01.04.2025
Online AccessGet full text
ISSN2542-4653
2542-4653
DOI10.21468/SciPostPhys.18.4.118

Cover

Abstract We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with negative margin and an infinite-width two-layer neural network with non-overlapping receptive fields and generic activation function. Using a full-RSB Ansatz we compute the exact value of the SAT/UNSAT transition. Furthermore, in the case of the negative perceptron we show that the overlap distribution of typical states displays an overlap gap (a disconnected support) in certain regions of the phase diagram defined by the value of the margin and the density of patterns to be stored. This implies that some recent theorems that ensure convergence of Approximate Message Passing (AMP) based algorithms to capacity are not applicable. Finally, we show that Gradient Descent is not able to reach the maximal capacity, irrespectively of the presence of an overlap gap for typical states. This finding, similarly to what occurs in binary weight models, suggests that gradient-based algorithms are biased towards highly atypical states, whose inaccessibility determines the algorithmic threshold.
AbstractList We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with negative margin and an infinite-width two-layer neural network with non-overlapping receptive fields and generic activation function. Using a full-RSB Ansatz we compute the exact value of the SAT/UNSAT transition. Furthermore, in the case of the negative perceptron we show that the overlap distribution of typical states displays an overlap gap (a disconnected support) in certain regions of the phase diagram defined by the value of the margin and the density of patterns to be stored. This implies that some recent theorems that ensure convergence of Approximate Message Passing (AMP) based algorithms to capacity are not applicable. Finally, we show that Gradient Descent is not able to reach the maximal capacity, irrespectively of the presence of an overlap gap for typical states. This finding, similarly to what occurs in binary weight models, suggests that gradient-based algorithms are biased towards highly atypical states, whose inaccessibility determines the algorithmic threshold.
ArticleNumber 118
Author Malatesta, Enrico
Annesi, Brandon L.
Zamponi, Francesco
Author_xml – sequence: 1
  givenname: Brandon L.
  surname: Annesi
  fullname: Annesi, Brandon L.
– sequence: 2
  givenname: Enrico
  orcidid: 0000-0001-8558-6175
  surname: Malatesta
  fullname: Malatesta, Enrico
– sequence: 3
  givenname: Francesco
  orcidid: 0000-0001-9260-1951
  surname: Zamponi
  fullname: Zamponi, Francesco
BookMark eNplkd1KAzEQhYNUUGsfQdgX2DaT391LFX8KosXqdchmE02NuyXZUvftXVtRQRjmDAfOd3HmBI2atrEInQGeEmCimC2NX7SpW7z2aQrFlE0BigN0TDgjOROcjv7cR2iS0gpjTABKEPwYLa4-tOkytwkhf1xeZMvzp9nz_bCzLuom-c63Tea_xvnGdzb02dbXNuu2bR50b2PW2E3UYZDBim_pFB06HZKdfOsYPV9fPV3e5ncPN_PL87vcUBBdTqEUEiTTFcOVxjU4STXIwmBZG8kILWzFoTIlrQgWNS9KzE1ZGioIMbwq6RjN99y61Su1jv5dx1612qud0cYXpWPnTbCKSEmJw4AFpYw6XVlJREEs4xics3xgiT1r06x1v9Uh_AABq13NKhm_HmpeDzUrKBRTQ81DkO-DJrYpRev-5_685zf3CeD2hmo
Cites_doi 10.1038/ncomms4725
10.1103/PhysRevA.45.4146
10.1137/20M132016X
10.1103/PhysRevX.11.031059
10.7566/JPSJ.94.014802
10.1016/0375-9601(79)90708-4
10.1073/pnas.2108492118
10.1103/PhysRevA.45.7590
10.1103/PhysRevLett.127.278301
10.1016/0550-3213(85)90374-8
10.1021/jp402235d
10.1162/neco_a_01494
10.1073/pnas.1908636117
10.1007/s00440-023-01248-y
10.1088/0305-4470/21/1/031
10.1109/FOCS52979.2021.00041
10.1088/0305-4470/21/1/030
10.1088/1751-8113/49/14/145001
10.1088/1742-5468/2016/02/023301
10.1088/0305-4470/22/12/004
10.1088/0305-4470/11/5/028
10.1103/PhysRevLett.123.170602
10.48550/arXiv.2309.09240
10.1103/PhysRevLett.115.128101
10.1103/PhysRevE.109.034305
10.1103/PhysRevLett.131.227301
10.1088/0305-4470/14/1/027
10.1146/annurev-conmatphys-070909-104045
10.1017/9781108120494
10.48550/arXiv.2402.05719
10.1103/PhysRevE.65.046137
10.1007/s10955-022-02976-6
10.1103/PhysRevE.103.L020301
10.1103/PhysRevE.106.014116
10.1103/PhysRevE.105.024134
10.1088/0022-3719/17/32/012
10.1088/0305-4470/13/4/009
10.1103/PhysRevLett.65.2312
10.1103/PhysRevE.90.052813
10.1103/PhysRevLett.43.1754
10.1103/PhysRevLett.123.115702
10.1142/0271
10.1088/0305-4470/13/3/042
10.21468/SciPostPhys.2.3.019
10.1103/PhysRevLett.123.160602
10.1051/jphys:0198900500200305700
10.1037/h0042519
10.1103/PhysRevE.88.032135
10.1103/PhysRevE.108.024310
10.1088/1742-5468/abdc16
10.1007/978-1-4612-0745-0_2
10.48550/arXiv.2402.05696
ContentType Journal Article
DBID AAYXX
CITATION
ADTOC
UNPAY
DOA
DOI 10.21468/SciPostPhys.18.4.118
DatabaseName CrossRef
Unpaywall for CDI: Periodical Content
Unpaywall
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList CrossRef

Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Physics
EISSN 2542-4653
ExternalDocumentID oai_doaj_org_article_27732f01063343fabe72682e4501ffe5
10.21468/scipostphys.18.4.118
10_21468_SciPostPhys_18_4_118
GroupedDBID 5VS
AAFWJ
AAYXX
ADBBV
AFPKN
ALMA_UNASSIGNED_HOLDINGS
BCNDV
CITATION
GROUPED_DOAJ
M~E
OK1
ADTOC
UNPAY
ID FETCH-LOGICAL-c316t-31967174ab40ba0d1f73a178c07dc74238eb51bc93b206d58905c99c3622c5b93
IEDL.DBID UNPAY
ISSN 2542-4653
IngestDate Fri Oct 03 12:43:40 EDT 2025
Sun Sep 07 11:15:40 EDT 2025
Tue Jul 01 05:11:22 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 4
Language English
License cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c316t-31967174ab40ba0d1f73a178c07dc74238eb51bc93b206d58905c99c3622c5b93
ORCID 0000-0001-9260-1951
0000-0001-8558-6175
OpenAccessLink https://proxy.k.utb.cz/login?url=https://doi.org/10.21468/scipostphys.18.4.118
ParticipantIDs doaj_primary_oai_doaj_org_article_27732f01063343fabe72682e4501ffe5
unpaywall_primary_10_21468_scipostphys_18_4_118
crossref_primary_10_21468_SciPostPhys_18_4_118
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2025-04-01
PublicationDateYYYYMMDD 2025-04-01
PublicationDate_xml – month: 04
  year: 2025
  text: 2025-04-01
  day: 01
PublicationDecade 2020
PublicationTitle SciPost physics
PublicationYear 2025
Publisher SciPost
Publisher_xml – name: SciPost
References ref13
ref12
ref15
ref14
ref53
ref52
ref11
ref10
ref17
ref16
ref19
ref18
ref51
ref50
ref46
ref45
ref48
ref47
ref42
ref41
ref44
ref43
ref49
ref8
ref7
ref9
ref4
ref3
ref6
ref5
ref40
ref35
ref34
ref37
ref36
ref31
ref30
ref33
ref32
ref2
ref1
ref39
ref38
ref24
ref23
ref26
ref25
ref20
ref22
ref21
ref28
ref27
ref29
References_xml – ident: ref8
  doi: 10.1038/ncomms4725
– ident: ref28
  doi: 10.1103/PhysRevA.45.4146
– ident: ref27
  doi: 10.1137/20M132016X
– ident: ref16
  doi: 10.1103/PhysRevX.11.031059
– ident: ref14
  doi: 10.7566/JPSJ.94.014802
– ident: ref42
  doi: 10.1016/0375-9601(79)90708-4
– ident: ref25
  doi: 10.1073/pnas.2108492118
– ident: ref13
  doi: 10.1103/PhysRevA.45.7590
– ident: ref21
  doi: 10.1103/PhysRevLett.127.278301
– ident: ref49
  doi: 10.1016/0550-3213(85)90374-8
– ident: ref31
  doi: 10.1021/jp402235d
– ident: ref40
  doi: 10.1162/neco_a_01494
– ident: ref18
  doi: 10.1073/pnas.1908636117
– ident: ref36
  doi: 10.1007/s00440-023-01248-y
– ident: ref2
  doi: 10.1088/0305-4470/21/1/031
– ident: ref38
  doi: 10.1109/FOCS52979.2021.00041
– ident: ref1
  doi: 10.1088/0305-4470/21/1/030
– ident: ref6
  doi: 10.1088/1751-8113/49/14/145001
– ident: ref22
  doi: 10.1088/1742-5468/2016/02/023301
– ident: ref3
  doi: 10.1088/0305-4470/22/12/004
– ident: ref48
  doi: 10.1088/0305-4470/11/5/028
– ident: ref24
  doi: 10.1103/PhysRevLett.123.170602
– ident: ref35
  doi: 10.48550/arXiv.2309.09240
– ident: ref39
  doi: 10.1103/PhysRevLett.115.128101
– ident: ref17
  doi: 10.1103/PhysRevE.109.034305
– ident: ref20
  doi: 10.1103/PhysRevLett.131.227301
– ident: ref46
  doi: 10.1088/0305-4470/14/1/027
– ident: ref7
  doi: 10.1146/annurev-conmatphys-070909-104045
– ident: ref51
  doi: 10.1017/9781108120494
– ident: ref30
  doi: 10.48550/arXiv.2402.05719
– ident: ref50
  doi: 10.1103/PhysRevE.65.046137
– ident: ref26
  doi: 10.1007/s10955-022-02976-6
– ident: ref15
  doi: 10.1103/PhysRevE.103.L020301
– ident: ref23
  doi: 10.1103/PhysRevE.106.014116
– ident: ref33
  doi: 10.1103/PhysRevE.105.024134
– ident: ref47
  doi: 10.1088/0022-3719/17/32/012
– ident: ref52
  doi: 10.1088/0305-4470/13/4/009
– ident: ref12
  doi: 10.1103/PhysRevLett.65.2312
– ident: ref37
  doi: 10.1103/PhysRevE.90.052813
– ident: ref43
  doi: 10.1103/PhysRevLett.43.1754
– ident: ref32
  doi: 10.1103/PhysRevLett.123.115702
– ident: ref44
  doi: 10.1142/0271
– ident: ref45
  doi: 10.1088/0305-4470/13/3/042
– ident: ref9
  doi: 10.21468/SciPostPhys.2.3.019
– ident: ref10
  doi: 10.1103/PhysRevLett.123.160602
– ident: ref4
  doi: 10.1051/jphys:0198900500200305700
– ident: ref11
  doi: 10.1103/PhysRevE.105.024134
– ident: ref5
  doi: 10.1037/h0042519
– ident: ref53
  doi: 10.1103/PhysRevE.88.032135
– ident: ref19
  doi: 10.1103/PhysRevE.108.024310
– ident: ref34
  doi: 10.1088/1742-5468/abdc16
– ident: ref41
  doi: 10.1007/978-1-4612-0745-0_2
– ident: ref29
  doi: 10.48550/arXiv.2402.05696
SSID ssj0002119165
Score 2.3098748
Snippet We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with...
SourceID doaj
unpaywall
crossref
SourceType Open Website
Open Access Repository
Index Database
StartPage 118
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8QwEA4iiHoQn7i-yMFrdptXmxxdUURwEXXBW0nSBBaWuriV1X9vJl2l4sGLUHoohSkzTWa-9ptvEDpX1FnnpSBOF5aIKihiJFNEykArVmgZNDQK343ym7G4fZbPnVFfwAlr5YFbxw1YUXAWALlwLngw1hcsV8wLmdEQfFIvzZTugCnYg5NuWS7blh2YXa0Gca3A_FsgVvap6ou4WagfyShp9m-i9bd6Zj4WZjrtJJrrbbS1rBDxRftkO2jF17toLTE13XwP3V-9G9dg-GxOHh6H-PHiaTAexTNuIO0kBhaewBEmUE9OP_BiUnncLF7I1MQCG4OEZTRQtwTw-T4aX189Xd6Q5VgE4jjNGwKLJoIwYazIrMkqGgpuaKFcVlQOfrwqbyW1TnPLsrySSmfSae1iqmJOWs0P0Gr9UvtDhHXwgkMLE9MRJytvVM6rzIncRKAhmeih_pd_ylmrflFG1JAcWnYcWlJViggmVA8NwYvfN4N4dboQQ1ouQ1r-FdIeGnzH4LdZ6FWOZmdds0f_YfYYbTAY7ptoOSdotXl986ex4mjsWXq5PgHNhdHM
  priority: 102
  providerName: Directory of Open Access Journals
Title Exact full-RSB SAT/UNSAT transition in infinitely wide two-layer neural networks
URI https://doi.org/10.21468/scipostphys.18.4.118
https://doaj.org/article/27732f01063343fabe72682e4501ffe5
UnpaywallVersion publishedVersion
Volume 18
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 2542-4653
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0002119165
  issn: 2542-4653
  databaseCode: DOA
  dateStart: 20160101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources (selected full-text only)
  customDbUrl:
  eissn: 2542-4653
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0002119165
  issn: 2542-4653
  databaseCode: M~E
  dateStart: 20160101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3da9swED-2lLHuYd9j2UfRw17lWl-29JiOljJYKGsD3ZORZAnCghMWh7T766ez3ZBtDDYwfjAWJ9_pfPeT7gPgg2be-aAk9aZ0VNZRU6u4pkpFVvPSqGgwUfjztDifyU_X6npIVsdcmL3ze2w5rTG7ZbVct4jzM6YzmXRc34eDQiXXewQHs-nF5Cs2kFOSU6wV1mfp_H3sL_anK9P_CB5umpW93drFYs-2nD2B6d2s-pCSb9mmdZn_8VvBxn-e9lN4PHiZZNIvi2dwLzTP4UEX7enXL-Di9Mb6luDWO_1yeUIuJ1fHs2m6kxZNVxfFReZ4xTn6pItbsp3XgbTbJV3Y5KQTLIOZCDR9EPn6JczOTq8-ntOhtQL1ghUtRcVLQE5aJ3Nn85rFUlhWap-XtcfDWx2cYs4b4Xhe1EqbXHljfDJ33CtnxCsYNcsmvAZiYpAC06C4SVhbB6sLUedeFjaBFcXlGLI7hlervoJGlZBHx6Yq_cCwKTF-f8V0JRMg0WM4QbHsXsYC2N2DxOBq0KeKl6XgEQGtEFJE60LJC82DVDmLMagxHO-E-ifZPensyL757xFv4ZBjN-AujucdjNrvm_A-uSitO-qg_dGwPH8CKzjlSw
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3da9swED-2lLHtYd-j2Rd62Ktc68uWHtPRUgYLZW2gezKSLEFocMLikHZ__XS2G7KNwQbGD8bi5Dud737SfQB81Mw7H5Sk3pSOyjpqahXXVKnIal4aFQ0mCn-ZFmcz-flKXQ3J6pgLs3d-jy2nNWa3rJbrFnF-xnQmk47r-3BQqOR6j-BgNj2ffMMGckpyirXC-iydv4_9xf50Zfofw8NNs7K3W7tY7NmW06cwvZtVH1JynW1al_kfvxVs_OdpP4Mng5dJJv2yeA73QvMCHnTRnn79Es5PbqxvCW69068Xx-Ricnk0m6Y7adF0dVFcZI5XnKNPurgl23kdSLtd0oVNTjrBMpiJQNMHka9fwez05PLTGR1aK1AvWNFSVLwE5KR1Mnc2r1kshWWl9nlZezy81cEp5rwRjudFrbTJlTfGJ3PHvXJGvIZRs2zCIRATgxSYBsVNwto6WF2IOveysAmsKC7HkN0xvFr1FTSqhDw6NlXpB4ZNifH7K6YrmQCJHsMximX3MhbA7h4kBleDPlW8LAWPCGiFkCJaF0peaB6kylmMQY3haCfUP8nuSWdH9s1_j3gLjzh2A-7ieN7BqP2-Ce-Ti9K6D8PC_AnQ5-RW
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Exact+full-RSB+SAT%2FUNSAT+transition+in+infinitely+wide+two-layer+neural+networks&rft.jtitle=SciPost+physics&rft.au=Annesi%2C+Brandon+L.&rft.au=Malatesta%2C+Enrico&rft.au=Zamponi%2C+Francesco&rft.date=2025-04-01&rft.issn=2542-4653&rft.eissn=2542-4653&rft.volume=18&rft.issue=4&rft_id=info:doi/10.21468%2FSciPostPhys.18.4.118&rft.externalDBID=n%2Fa&rft.externalDocID=10_21468_SciPostPhys_18_4_118
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2542-4653&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2542-4653&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2542-4653&client=summon