提供者:卢梦依
下载地址:http://help.sentiment140.com/for-students/
简介
数据集概述
Sentiment140是一个可用于情感分析的数据集。
数据集具有以下6个特征:
- 推文的感情色彩(polarity)
- 推文的ID
- 推文的日期
- 查看记录
- 推特(tweeter)的用户名
- 推文的文本内容
文件大小
大小:80 MB(压缩包)
数量
160,000条推文
相关论文
1.Zhang X, Zhao J, Lecun Y. Character-level Convolutional Networks for Text Classification[J]. 2015:649-657.
2.Severyn, A., & Moschitti, A. UNITN: Training Deep Convolutional Neural Network for TwitterSentiment Classification.
3.Xu, J., Wang, P., Tian, G., Xu, B., Zhao, J., Wang, F., & Hao, H. (2015,June). Short TextClustering via Convolutional Neural Networks. In Proceedings of NAACL-HLT (pp.62-69).
4.Wang, P., Xu, J., Xu, B., Liu, C. L., Zhang, H., Wang, F., & Hao, H.(2015). SemanticClustering and Convolutional Neural Network for Short Text Categorization.In Proceedings of the 53rd Annual Meeting of the Association forComputational Linguistics and the 7th International Joint Conference on NaturalLanguage Processing (Vol.2, pp. 352-357).
5.Liu, Y., Liu, Z., Chua, T. S., & Sun, M. (2015, February). Topical Word Embeddings.In Twenty-Ninth AAAI Conference on Artificial Intelligence.