XSLC:分层编码并面向查询的XML数据压缩算法

TP393; XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream layerecl-coding compression)算法,通过预先扫描DTD对数据模式进行分析,继而根据元素的父子关系进行子元素层面的编码;同时根据数据类型进行数据压缩,能够在压缩之后的文档上进行查询,因为仅需一遍压缩扫描所以可以应用于数据流环境.实验表明:XSLC算法的压缩比率和压缩时间均优于传统算法....

Full description

Saved in:
Bibliographic Details
Published in计算机科学与探索 Vol. 4; no. 2; pp. 145 - 152
Main Authors 付强, 王腾蛟, 李红燕, 杨冬青, 唐世渭
Format Journal Article
LanguageChinese
Published 北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871 2010
北京大学,机器感知与智能教育部重点实验室,北京,100871
北京大学,信息科学技术学院,北京,100871
Subjects
Online AccessGet full text
ISSN1673-9418
DOI10.3778/j.issn.1673-9418.2010.02.006

Cover

Abstract TP393; XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream layerecl-coding compression)算法,通过预先扫描DTD对数据模式进行分析,继而根据元素的父子关系进行子元素层面的编码;同时根据数据类型进行数据压缩,能够在压缩之后的文档上进行查询,因为仅需一遍压缩扫描所以可以应用于数据流环境.实验表明:XSLC算法的压缩比率和压缩时间均优于传统算法.
AbstractList TP393; XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream layerecl-coding compression)算法,通过预先扫描DTD对数据模式进行分析,继而根据元素的父子关系进行子元素层面的编码;同时根据数据类型进行数据压缩,能够在压缩之后的文档上进行查询,因为仅需一遍压缩扫描所以可以应用于数据流环境.实验表明:XSLC算法的压缩比率和压缩时间均优于传统算法.
Abstract_FL XML documents have been widely used as a data exchange format. XML (extensible markup language) data compression technology has become a new field of research. A compression methyl called XSLC (XML stream layered-coding compression) is proposed to compress and decompress XML stream in real time. When DTD (document type definition) is available, XSLC can analyze the data model and encode elements according to the relationship of father node and son node, compress data part according to its type, and support query operations applied on compressed files, as for only one time of scanning data is needed, all the processes can be implemented in XML data stream environment. Experimental results show that XSLC outperforms other methods in compression ratio and compression efficiency.
Author 杨冬青
李红燕
付强
王腾蛟
唐世渭
AuthorAffiliation 北京大学,信息科学技术学院,北京,100871;北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871;北京大学,机器感知与智能教育部重点实验室,北京,100871
AuthorAffiliation_xml – name: 北京大学,信息科学技术学院,北京,100871;北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871;北京大学,机器感知与智能教育部重点实验室,北京,100871
Author_FL YANG Dongqing
WANG Tengjiao
LI Hongyan
TANG Shiwei
FU Qiang
Author_FL_xml – sequence: 1
  fullname: FU Qiang
– sequence: 2
  fullname: WANG Tengjiao
– sequence: 3
  fullname: LI Hongyan
– sequence: 4
  fullname: YANG Dongqing
– sequence: 5
  fullname: TANG Shiwei
Author_xml – sequence: 1
  fullname: 付强
– sequence: 2
  fullname: 王腾蛟
– sequence: 3
  fullname: 李红燕
– sequence: 4
  fullname: 杨冬青
– sequence: 5
  fullname: 唐世渭
BookMark eNrjYmDJy89LZWBQMTTQMzY3t9DP0sssLs7TMzQzN9a1NDG00DMyAEoZGOkZGJixMHDCxTkYeIuLM5MMTE1MjAzNzSw4GSwign2crZ52tD3d2PR8z7TnCxqf7tz2cu6ipxMmPpu_9MX6Rc9ntUT4-jybuuFZ77qnfd3P96x8vm76s81TeRhY0xJzilN5oTQ3Q4iba4izh66Pv7uns6OPbrKZoRmQMDIyME-ztLBINDdNTElNNEtLtUiySDZINDA0M0hKTDFPTTNMSjZKS0xNMzU3TTM2TzY2SDZINU2xMLJIsUgx5mbQhBhbnpiXlpiXHp-VX1qUB7QwPqs4K7uisqQY5FUDI6BHjQGcP18M
ClassificationCodes TP393
ContentType Journal Article
Copyright Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
Copyright_xml – notice: Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
DBID 2B.
4A8
92I
93N
PSX
TCJ
DOI 10.3778/j.issn.1673-9418.2010.02.006
DatabaseName Wanfang Data Journals - Hong Kong
WANFANG Data Centre
Wanfang Data Journals
万方数据期刊 - 香港版
China Online Journals (COJ)
China Online Journals (COJ)
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
DocumentTitle_FL XSLC: Layered Coding and Query-Oriented XML Data Compression Algorithm
EndPage 152
ExternalDocumentID jsjkxyts201002006
GrantInformation_xml – fundername: 国家自然科学基金; 国家高技术研究发展计划(2007AA01Z191; 2009AA01Z150); 教育部科技创新工程重大项目培育基金
  funderid: (60673113); (863计划)(2007AA01Z191; 2009AA01Z150); (708001)
GroupedDBID 2B.
4A8
92I
93N
ALMA_UNASSIGNED_HOLDINGS
M~E
PSX
TCJ
ID FETCH-LOGICAL-c616-c62207f988a75adea6fe8b8c0a0160bad7ef1bc2faef575f37c30c0e5d828d8d3
ISSN 1673-9418
IngestDate Thu May 29 04:00:16 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords 压缩
文档类型定义
document type definition(DTD)
extensible markup language(XML)
数据流
compression
data stream
可扩展标记语言
Language Chinese
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c616-c62207f988a75adea6fe8b8c0a0160bad7ef1bc2faef575f37c30c0e5d828d8d3
PageCount 8
ParticipantIDs wanfang_journals_jsjkxyts201002006
PublicationCentury 2000
PublicationDate 2010
PublicationDateYYYYMMDD 2010-01-01
PublicationDate_xml – year: 2010
  text: 2010
PublicationDecade 2010
PublicationTitle 计算机科学与探索
PublicationTitle_FL JOURNAL OF FRONTIERS OF COMPUTER SCIENCE & TECHNOLOGY
PublicationYear 2010
Publisher 北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871
北京大学,机器感知与智能教育部重点实验室,北京,100871
北京大学,信息科学技术学院,北京,100871
Publisher_xml – name: 北京大学,信息科学技术学院,北京,100871
– name: 北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871
– name: 北京大学,机器感知与智能教育部重点实验室,北京,100871
SSID ssib054421768
ssib002040941
ssib002423894
ssib051375751
ssib023646573
ssib036438069
ssib002040926
Score 1.8144118
Snippet TP393; XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream...
SourceID wanfang
SourceType Aggregation Database
StartPage 145
Title XSLC:分层编码并面向查询的XML数据压缩算法
URI https://d.wanfangdata.com.cn/periodical/jsjkxyts201002006
Volume 4
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources (selected full-text only)
  issn: 1673-9418
  databaseCode: M~E
  dateStart: 20070101
  customDbUrl:
  isFulltext: true
  dateEnd: 99991231
  titleUrlDefault: https://road.issn.org
  omitProxy: true
  ssIdentifier: ssib054421768
  providerName: ISSN International Centre
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LaxRBEB5CBPEiiopvgqSPG2d7erqrvfVMZgmSeDHK3sI8FYUVzAY0B0EREUTRQxQEUQIevERFEIT8nc3qv7Cq57GbByF6GXqqaqrr6xq6qmf64TjTmVRC8kS1FE_clhAJtDAMy1Ys3CQFP848e9zbwjU5d0Nc7frdicmbY7OWVvrJTLq657qS__Eq0tCvtEr2HzzbKEUCltG_eEUP4_VAPu5enw9RlkU-A2AgqRC0GXAWKRaETEsqGBfTRcvSLECKZnqWGU4U7TKNLMl0hxmfRcBMx7IU04aB6C7MWy4-61IB8MHIVhcxCKpajLa1REwrkgk8Vh5oWee8Vm3ETHubmA5ZYCxFWRt8ZlA5midYAKSfqosqYwKBhfrdsCIB02AxkZoRR9WWoQpkRlTQKNsZiUiCT_oVGVDqB1XbPBIxVj82qgmrNiuP06u_kVTzZOl9toKhBSas0tDiEYStAcatGlMbDh1qa5Lv2KYBFszWlMD6CfEbmohSNhaxrCe0tug4CzgpBM9aqgkDuofAcPI01YttHdlKDZlXUtAqHu5hMA9pAyaFgc0_MJzGeElvGFm42527UTQsTVjQAfvaMxY1pfJaWlSBtAqrYqz34GMhsl1uH1plW-1y_-KdgdxTCmwgJ_0zjf5qKibtsbtj_3Sbkd1ZvnP3wcP-Mkm55U78h7jCDJSm9D6KRmkpRi49Pqyme7FtfTfm8U2cojMWpD9K8_HWA1c2wwC_7Sn6fdncC4ED_XIVbm35YWe6gnV5P1B26WCviHu3xrLcxWPO0Wp4OmXKvua4M7F6-4QD1M9cGTx_Nvj-ZLj5dvjp8eDXzz8f1gev32x9_Pz76_rw_VPsKbbWvm293Bi8ejHc_DLceLf1Y-2ks9iJFsO5VnXiSiuVbYkXzl1VaIBYYTedx7LIIYHUjWkfyiTOVF60k5QXcV4g4MJTqeembu5nwCGDzDvlTPbu9fLTzlSObVLESc5BpcLLQRcF97w0T3TiZ34Wn3EuVUCXqg51eWmX-84eROicc6Sc0UOfRc87k_37K_kFHCj0k4vW638Br33BkQ
linkProvider ISSN International Centre
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=XSLC%3A%E5%88%86%E5%B1%82%E7%BC%96%E7%A0%81%E5%B9%B6%E9%9D%A2%E5%90%91%E6%9F%A5%E8%AF%A2%E7%9A%84XML%E6%95%B0%E6%8D%AE%E5%8E%8B%E7%BC%A9%E7%AE%97%E6%B3%95&rft.jtitle=%E8%AE%A1%E7%AE%97%E6%9C%BA%E7%A7%91%E5%AD%A6%E4%B8%8E%E6%8E%A2%E7%B4%A2&rft.au=%E4%BB%98%E5%BC%BA&rft.au=%E7%8E%8B%E8%85%BE%E8%9B%9F&rft.au=%E6%9D%8E%E7%BA%A2%E7%87%95&rft.au=%E6%9D%A8%E5%86%AC%E9%9D%92&rft.date=2010&rft.pub=%E5%8C%97%E4%BA%AC%E5%A4%A7%E5%AD%A6%2C%E9%AB%98%E5%8F%AF%E4%BF%A1%E8%BD%AF%E4%BB%B6%E6%8A%80%E6%9C%AF%E6%95%99%E8%82%B2%E9%83%A8%E9%87%8D%E7%82%B9%E5%AE%9E%E9%AA%8C%E5%AE%A4%2C%E5%8C%97%E4%BA%AC%2C100871%25%E5%8C%97%E4%BA%AC%E5%A4%A7%E5%AD%A6%2C%E4%BF%A1%E6%81%AF%E7%A7%91%E5%AD%A6%E6%8A%80%E6%9C%AF%E5%AD%A6%E9%99%A2%2C%E5%8C%97%E4%BA%AC%2C100871&rft.issn=1673-9418&rft.volume=4&rft.issue=2&rft.spage=145&rft.epage=152&rft_id=info:doi/10.3778%2Fj.issn.1673-9418.2010.02.006&rft.externalDocID=jsjkxyts201002006
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=http%3A%2F%2Fwww.wanfangdata.com.cn%2Fimages%2FPeriodicalImages%2Fjsjkxyts%2Fjsjkxyts.jpg