XSLC:分层编码并面向查询的XML数据压缩算法
TP393; XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream layerecl-coding compression)算法,通过预先扫描DTD对数据模式进行分析,继而根据元素的父子关系进行子元素层面的编码;同时根据数据类型进行数据压缩,能够在压缩之后的文档上进行查询,因为仅需一遍压缩扫描所以可以应用于数据流环境.实验表明:XSLC算法的压缩比率和压缩时间均优于传统算法....
Saved in:
Published in | 计算机科学与探索 Vol. 4; no. 2; pp. 145 - 152 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | Chinese |
Published |
北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871
2010
北京大学,机器感知与智能教育部重点实验室,北京,100871 北京大学,信息科学技术学院,北京,100871 |
Subjects | |
Online Access | Get full text |
ISSN | 1673-9418 |
DOI | 10.3778/j.issn.1673-9418.2010.02.006 |
Cover
Abstract | TP393; XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream layerecl-coding compression)算法,通过预先扫描DTD对数据模式进行分析,继而根据元素的父子关系进行子元素层面的编码;同时根据数据类型进行数据压缩,能够在压缩之后的文档上进行查询,因为仅需一遍压缩扫描所以可以应用于数据流环境.实验表明:XSLC算法的压缩比率和压缩时间均优于传统算法. |
---|---|
AbstractList | TP393; XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream layerecl-coding compression)算法,通过预先扫描DTD对数据模式进行分析,继而根据元素的父子关系进行子元素层面的编码;同时根据数据类型进行数据压缩,能够在压缩之后的文档上进行查询,因为仅需一遍压缩扫描所以可以应用于数据流环境.实验表明:XSLC算法的压缩比率和压缩时间均优于传统算法. |
Abstract_FL | XML documents have been widely used as a data exchange format. XML (extensible markup language) data compression technology has become a new field of research. A compression methyl called XSLC (XML stream layered-coding compression) is proposed to compress and decompress XML stream in real time. When DTD (document type definition) is available, XSLC can analyze the data model and encode elements according to the relationship of father node and son node, compress data part according to its type, and support query operations applied on compressed files, as for only one time of scanning data is needed, all the processes can be implemented in XML data stream environment. Experimental results show that XSLC outperforms other methods in compression ratio and compression efficiency. |
Author | 杨冬青 李红燕 付强 王腾蛟 唐世渭 |
AuthorAffiliation | 北京大学,信息科学技术学院,北京,100871;北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871;北京大学,机器感知与智能教育部重点实验室,北京,100871 |
AuthorAffiliation_xml | – name: 北京大学,信息科学技术学院,北京,100871;北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871;北京大学,机器感知与智能教育部重点实验室,北京,100871 |
Author_FL | YANG Dongqing WANG Tengjiao LI Hongyan TANG Shiwei FU Qiang |
Author_FL_xml | – sequence: 1 fullname: FU Qiang – sequence: 2 fullname: WANG Tengjiao – sequence: 3 fullname: LI Hongyan – sequence: 4 fullname: YANG Dongqing – sequence: 5 fullname: TANG Shiwei |
Author_xml | – sequence: 1 fullname: 付强 – sequence: 2 fullname: 王腾蛟 – sequence: 3 fullname: 李红燕 – sequence: 4 fullname: 杨冬青 – sequence: 5 fullname: 唐世渭 |
BookMark | eNrjYmDJy89LZWBQMTTQMzY3t9DP0sssLs7TMzQzN9a1NDG00DMyAEoZGOkZGJixMHDCxTkYeIuLM5MMTE1MjAzNzSw4GSwign2crZ52tD3d2PR8z7TnCxqf7tz2cu6ipxMmPpu_9MX6Rc9ntUT4-jybuuFZ77qnfd3P96x8vm76s81TeRhY0xJzilN5oTQ3Q4iba4izh66Pv7uns6OPbrKZoRmQMDIyME-ztLBINDdNTElNNEtLtUiySDZINDA0M0hKTDFPTTNMSjZKS0xNMzU3TTM2TzY2SDZINU2xMLJIsUgx5mbQhBhbnpiXlpiXHp-VX1qUB7QwPqs4K7uisqQY5FUDI6BHjQGcP18M |
ClassificationCodes | TP393 |
ContentType | Journal Article |
Copyright | Copyright © Wanfang Data Co. Ltd. All Rights Reserved. |
Copyright_xml | – notice: Copyright © Wanfang Data Co. Ltd. All Rights Reserved. |
DBID | 2B. 4A8 92I 93N PSX TCJ |
DOI | 10.3778/j.issn.1673-9418.2010.02.006 |
DatabaseName | Wanfang Data Journals - Hong Kong WANFANG Data Centre Wanfang Data Journals 万方数据期刊 - 香港版 China Online Journals (COJ) China Online Journals (COJ) |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
DocumentTitle_FL | XSLC: Layered Coding and Query-Oriented XML Data Compression Algorithm |
EndPage | 152 |
ExternalDocumentID | jsjkxyts201002006 |
GrantInformation_xml | – fundername: 国家自然科学基金; 国家高技术研究发展计划(2007AA01Z191; 2009AA01Z150); 教育部科技创新工程重大项目培育基金 funderid: (60673113); (863计划)(2007AA01Z191; 2009AA01Z150); (708001) |
GroupedDBID | 2B. 4A8 92I 93N ALMA_UNASSIGNED_HOLDINGS M~E PSX TCJ |
ID | FETCH-LOGICAL-c616-c62207f988a75adea6fe8b8c0a0160bad7ef1bc2faef575f37c30c0e5d828d8d3 |
ISSN | 1673-9418 |
IngestDate | Thu May 29 04:00:16 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 2 |
Keywords | 压缩 文档类型定义 document type definition(DTD) extensible markup language(XML) 数据流 compression data stream 可扩展标记语言 |
Language | Chinese |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c616-c62207f988a75adea6fe8b8c0a0160bad7ef1bc2faef575f37c30c0e5d828d8d3 |
PageCount | 8 |
ParticipantIDs | wanfang_journals_jsjkxyts201002006 |
PublicationCentury | 2000 |
PublicationDate | 2010 |
PublicationDateYYYYMMDD | 2010-01-01 |
PublicationDate_xml | – year: 2010 text: 2010 |
PublicationDecade | 2010 |
PublicationTitle | 计算机科学与探索 |
PublicationTitle_FL | JOURNAL OF FRONTIERS OF COMPUTER SCIENCE & TECHNOLOGY |
PublicationYear | 2010 |
Publisher | 北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871 北京大学,机器感知与智能教育部重点实验室,北京,100871 北京大学,信息科学技术学院,北京,100871 |
Publisher_xml | – name: 北京大学,信息科学技术学院,北京,100871 – name: 北京大学,高可信软件技术教育部重点实验室,北京,100871%北京大学,信息科学技术学院,北京,100871 – name: 北京大学,机器感知与智能教育部重点实验室,北京,100871 |
SSID | ssib054421768 ssib002040941 ssib002423894 ssib051375751 ssib023646573 ssib036438069 ssib002040926 |
Score | 1.8144118 |
Snippet | TP393; XML(extensible mallkup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域.提出XSLC(XML stream... |
SourceID | wanfang |
SourceType | Aggregation Database |
StartPage | 145 |
Title | XSLC:分层编码并面向查询的XML数据压缩算法 |
URI | https://d.wanfangdata.com.cn/periodical/jsjkxyts201002006 |
Volume | 4 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
journalDatabaseRights | – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources (selected full-text only) issn: 1673-9418 databaseCode: M~E dateStart: 20070101 customDbUrl: isFulltext: true dateEnd: 99991231 titleUrlDefault: https://road.issn.org omitProxy: true ssIdentifier: ssib054421768 providerName: ISSN International Centre |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LaxRBEB5CBPEiiopvgqSPG2d7erqrvfVMZgmSeDHK3sI8FYUVzAY0B0EREUTRQxQEUQIevERFEIT8nc3qv7Cq57GbByF6GXqqaqrr6xq6qmf64TjTmVRC8kS1FE_clhAJtDAMy1Ys3CQFP848e9zbwjU5d0Nc7frdicmbY7OWVvrJTLq657qS__Eq0tCvtEr2HzzbKEUCltG_eEUP4_VAPu5enw9RlkU-A2AgqRC0GXAWKRaETEsqGBfTRcvSLECKZnqWGU4U7TKNLMl0hxmfRcBMx7IU04aB6C7MWy4-61IB8MHIVhcxCKpajLa1REwrkgk8Vh5oWee8Vm3ETHubmA5ZYCxFWRt8ZlA5midYAKSfqosqYwKBhfrdsCIB02AxkZoRR9WWoQpkRlTQKNsZiUiCT_oVGVDqB1XbPBIxVj82qgmrNiuP06u_kVTzZOl9toKhBSas0tDiEYStAcatGlMbDh1qa5Lv2KYBFszWlMD6CfEbmohSNhaxrCe0tug4CzgpBM9aqgkDuofAcPI01YttHdlKDZlXUtAqHu5hMA9pAyaFgc0_MJzGeElvGFm42527UTQsTVjQAfvaMxY1pfJaWlSBtAqrYqz34GMhsl1uH1plW-1y_-KdgdxTCmwgJ_0zjf5qKibtsbtj_3Sbkd1ZvnP3wcP-Mkm55U78h7jCDJSm9D6KRmkpRi49Pqyme7FtfTfm8U2cojMWpD9K8_HWA1c2wwC_7Sn6fdncC4ED_XIVbm35YWe6gnV5P1B26WCviHu3xrLcxWPO0Wp4OmXKvua4M7F6-4QD1M9cGTx_Nvj-ZLj5dvjp8eDXzz8f1gev32x9_Pz76_rw_VPsKbbWvm293Bi8ejHc_DLceLf1Y-2ks9iJFsO5VnXiSiuVbYkXzl1VaIBYYTedx7LIIYHUjWkfyiTOVF60k5QXcV4g4MJTqeembu5nwCGDzDvlTPbu9fLTzlSObVLESc5BpcLLQRcF97w0T3TiZ34Wn3EuVUCXqg51eWmX-84eROicc6Sc0UOfRc87k_37K_kFHCj0k4vW638Br33BkQ |
linkProvider | ISSN International Centre |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=XSLC%3A%E5%88%86%E5%B1%82%E7%BC%96%E7%A0%81%E5%B9%B6%E9%9D%A2%E5%90%91%E6%9F%A5%E8%AF%A2%E7%9A%84XML%E6%95%B0%E6%8D%AE%E5%8E%8B%E7%BC%A9%E7%AE%97%E6%B3%95&rft.jtitle=%E8%AE%A1%E7%AE%97%E6%9C%BA%E7%A7%91%E5%AD%A6%E4%B8%8E%E6%8E%A2%E7%B4%A2&rft.au=%E4%BB%98%E5%BC%BA&rft.au=%E7%8E%8B%E8%85%BE%E8%9B%9F&rft.au=%E6%9D%8E%E7%BA%A2%E7%87%95&rft.au=%E6%9D%A8%E5%86%AC%E9%9D%92&rft.date=2010&rft.pub=%E5%8C%97%E4%BA%AC%E5%A4%A7%E5%AD%A6%2C%E9%AB%98%E5%8F%AF%E4%BF%A1%E8%BD%AF%E4%BB%B6%E6%8A%80%E6%9C%AF%E6%95%99%E8%82%B2%E9%83%A8%E9%87%8D%E7%82%B9%E5%AE%9E%E9%AA%8C%E5%AE%A4%2C%E5%8C%97%E4%BA%AC%2C100871%25%E5%8C%97%E4%BA%AC%E5%A4%A7%E5%AD%A6%2C%E4%BF%A1%E6%81%AF%E7%A7%91%E5%AD%A6%E6%8A%80%E6%9C%AF%E5%AD%A6%E9%99%A2%2C%E5%8C%97%E4%BA%AC%2C100871&rft.issn=1673-9418&rft.volume=4&rft.issue=2&rft.spage=145&rft.epage=152&rft_id=info:doi/10.3778%2Fj.issn.1673-9418.2010.02.006&rft.externalDocID=jsjkxyts201002006 |
thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=http%3A%2F%2Fwww.wanfangdata.com.cn%2Fimages%2FPeriodicalImages%2Fjsjkxyts%2Fjsjkxyts.jpg |