农产品价格主题搜索引擎的研究与实现

当前农业垂直搜索引擎无法预测农产品价格趋势,难以满足农业生产者行情分析需要。文章设计农产品价格主题搜索引擎。首先网络爬虫从农业综合网站搜集网页,对网页进行转码、去重、提取内容等处理;使用主题相关度算法计算网页的主题相关度,用分类器对网页分类,将与主题相关的网页解析、存储;最后提取农产品价格及其影响因素信息。结果表明,系统可搜集农产品价格信息及影响农产品价格因素信息,为后续农产品价格预测提供数据支持。...

Full description

Saved in:
Bibliographic Details
Published in东北农业大学学报 Vol. 47; no. 9; pp. 64 - 71
Main Author 孟繁疆 姬祥 袁琦 刘东 侯哲鹏
Format Journal Article
LanguageChinese
Published 东北农业大学电气与信息学院,哈尔滨,150030 2016
Subjects
Online AccessGet full text
ISSN1005-9369

Cover

More Information
Summary:当前农业垂直搜索引擎无法预测农产品价格趋势,难以满足农业生产者行情分析需要。文章设计农产品价格主题搜索引擎。首先网络爬虫从农业综合网站搜集网页,对网页进行转码、去重、提取内容等处理;使用主题相关度算法计算网页的主题相关度,用分类器对网页分类,将与主题相关的网页解析、存储;最后提取农产品价格及其影响因素信息。结果表明,系统可搜集农产品价格信息及影响农产品价格因素信息,为后续农产品价格预测提供数据支持。
Bibliography:MENG Fanjiang, JI Xiang, YUAN Qi, LIU Dong, HOU Zhepeng(School of Electrical and Information, Northeast Agricultural University, Harbin 150030, China)
23-1391/S
Current agricultural vertical search engine can't assess the price trend of agricultural products, not suitable for agricultural producers to analysis the market quotations, in view of the present situation, paper researched and designed the agricultural pdces subject search engine, First, web crawler collected web pages from the agricultural comprehensive web site, and the web page for transcoding, de-duplication, extract content and so on; then, use the Topic similarity algorithm of this paper to judge the correlation degree of web pages, and used the classifier to classify the web pages, topic related web pages would be parsed, stored; finally, extracted the information of the price of agricultural products and the information of its influencing factors. Experimental results showed that the system could collect the information of agricultural product
ISSN:1005-9369