Bootstrapping both Product Properties and Opinion Words from Chinese Reviews with Cross-Training
We investigate the problem of identifying both product properties and opinion words for sentences in a unified process when only a much small labeled corpus is available. Naive Bayesian method is used in this process. Specifically, considering the fact that product properties and opinion words usual...
Saved in:
| Published in | Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence pp. 259 - 262 |
|---|---|
| Main Authors | , |
| Format | Conference Proceeding |
| Language | English |
| Published |
Washington, DC, USA
IEEE Computer Society
02.11.2007
|
| Series | ACM Conferences |
| Subjects | |
| Online Access | Get full text |
| ISBN | 0769530265 9780769530260 |
| DOI | 10.1109/WI.2007.32 |
Cover
| Summary: | We investigate the problem of identifying both product properties and opinion words for sentences in a unified process when only a much small labeled corpus is available. Naive Bayesian method is used in this process. Specifically, considering the fact that product properties and opinion words usually co-occur with high frequency in product review articles, a crosstraining method is proposed to bootstrap both of them, in which the two sub-tasks are boosted by each other iteratively. Experiment results show that with a much small labeled corpus cross-training could produce both product properties and opinion words which are very close to what Naive Bayesian Classifiers could do with a large labeled corpus.. |
|---|---|
| ISBN: | 0769530265 9780769530260 |
| DOI: | 10.1109/WI.2007.32 |