HashTrie: a space-efficient multiple string matching algorithm

Bibliographic Details
Published in: Tongxin Xuebao, Vol. 36, no. 10, pp. 2015215-2015219
Main Authors: Zhang, Ping; Liu, Yan-Bing; Yu, Jing; Tan, Jian-Long
Format: Journal Article
Language: Chinese
Published: Editorial Department of Journal on Communications, 2015-10-01
ISSN: 1000-436X
DOI: 10.11959/j.issn.1000-436x.2015215

Abstract: The well-known multiple string matching algorithm AC (Aho-Corasick) consumes a large amount of memory when the set of string signatures is massive, and therefore cannot process high-speed network traffic efficiently. To solve this problem, a space-efficient multiple string matching algorithm, HashTrie, is proposed. The algorithm uses a recursive hash function to store the patterns in bit-vectors in place of the state transition table, which reduces space consumption. Furthermore, it uses the rank operation for fast verification. Theoretical analysis shows that the space complexity of HashTrie is O(|P|), which is linear in the size of the pattern set |P| and independent of the alphabet size σ. This is better than the O(|P| σ log|P|) space complexity of AC. Experiments on synthetic and real-world datasets (such as Snort, ClamAV and URL) show that HashTrie saves up to 99.6% of the storage cost compared with AC, while running at a matching speed about half that of AC. HashTrie is a space-efficient multiple string matching algorithm t...
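
The abstract names two ingredients without giving implementation details: a recursive hash that records pattern prefixes in bit-vectors, and a rank operation used during verification. The Python sketch below is only a toy illustration of how those two ideas can fit together. It is not the paper's data layout or algorithm: the class names, the hash function `step`, the table size `M`, and the multiplier `BASE` are all assumptions made for this example, and the scan shown here is a naive one that restarts at every text position.

```python
M = 1 << 16   # bit-vector size (assumed for illustration)
BASE = 257    # hash multiplier (assumed for illustration)


def step(h, byte):
    """Recursive hash: derive hash(prefix + byte) from hash(prefix)."""
    return (h * BASE + byte + 1) % M


class RankBitVector:
    """Bit-vector supporting rank(i) = number of set bits in positions [0, i)."""

    def __init__(self, size):
        self.bits = [0] * size
        self.prefix = None  # prefix sums, filled in by build()

    def set(self, i):
        self.bits[i] = 1

    def get(self, i):
        return self.bits[i] == 1

    def build(self):
        # A plain prefix-sum array; real succinct structures use far less space.
        self.prefix = [0]
        for b in self.bits:
            self.prefix.append(self.prefix[-1] + b)

    def rank(self, i):
        return self.prefix[i]


class ToyHashTrie:
    """Hypothetical matcher: hashed prefixes in bit-vectors, rank for lookup."""

    def __init__(self, patterns):
        self.prefix_bits = RankBitVector(M)  # hashes of every pattern prefix
        self.final_bits = RankBitVector(M)   # hashes of complete patterns
        by_hash = {}
        for p in patterns:
            h = 0
            for b in p:
                h = step(h, b)
                self.prefix_bits.set(h)
            self.final_bits.set(h)
            by_hash.setdefault(h, []).append(p)
        self.final_bits.build()
        # Dense array ordered by hash value, so rank(h) is the array index.
        self.candidates = [by_hash[h] for h in sorted(by_hash)]

    def search(self, text):
        hits = []
        for start in range(len(text)):
            h = 0
            for end in range(start, len(text)):
                h = step(h, text[end])
                if not self.prefix_bits.get(h):
                    break  # no pattern has a prefix with this hash
                if self.final_bits.get(h):
                    # rank maps the set bit to its slot in the dense array;
                    # exact comparison filters out hash collisions.
                    slot = self.final_bits.rank(h)
                    for p in self.candidates[slot]:
                        if text[start:end + 1] == p:
                            hits.append((start, p))
        return hits


matcher = ToyHashTrie([b"he", b"she", b"his", b"hers"])
matches = matcher.search(b"ushers")
```

In this sketch a hash collision can only cause extra work, never a wrong result, because every candidate is compared byte for byte before it is reported. The storage per prefix is a single bit plus the dense candidate array, which is the general flavor of the alphabet-independent space bound the abstract claims, although the sketch makes no attempt to reproduce the paper's O(|P|) analysis.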
Open Access Link: https://doaj.org/article/3df6993c9c0e42dfb5f7f3c753ade622
Subject Terms: Algorithms; bit-vector; Complexity; Consumption; intrusion detection; multiple string matching; recursive hash function; Searching; Signatures; space-efficient; Stores; String matching; Strings