Streaming Algorithms for News and Scientific Literature Recommendation: Monotone Submodular Maximization With a d -Knapsack Constraint

Submodular optimization plays a significant role in combinatorial problems, since it captures the structure of the edge cuts in graphs, the coverage of sets, and so on. Many data mining and machine learning problems can be cast as submodular maximization problems with applications in recommendation...

Full description

Saved in:
Bibliographic Details
Published inIEEE access Vol. 6; pp. 53736 - 53747
Main Authors Yu, Qilian, Xu, Li, Cui, Shuguang
Format Journal Article
LanguageEnglish
Published IEEE 2018
Subjects
Online AccessGet full text
ISSN2169-3536
2169-3536
DOI10.1109/ACCESS.2018.2871668

Cover

More Information
Summary:Submodular optimization plays a significant role in combinatorial problems, since it captures the structure of the edge cuts in graphs, the coverage of sets, and so on. Many data mining and machine learning problems can be cast as submodular maximization problems with applications in recommendation systems and data diversification. In this paper, we focus on the problem of maximizing a monotone submodular function subject to a <inline-formula> <tex-math notation="LaTeX">d </tex-math></inline-formula>-knapsack constraint, for which we propose a streaming algorithm that achieves a <inline-formula> <tex-math notation="LaTeX">(({1}/{1+2d})-\epsilon) </tex-math></inline-formula>-approximation of the optimal value, while it only needs one single pass through the data set without storing all the data in the memory. In our experiments, we extensively evaluate the effectiveness of our proposed algorithm via two applications: news recommendation and scientific literature recommendation. It is observed that the proposed streaming algorithm achieves both execution speedup and memory saving by several orders of magnitude, compared with existing approaches.
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2018.2871668