Posted by a user. Posted: 2022-04-23 13:59
1 answer
Answer from a helpful user. Time: 2022-05-01 20:29
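You can pick any of the datasets whose Default Task is Recommender-Systems.
For example: Anonymous Microsoft Web Data Data Set
This dataset was sampled from the access logs of Microsoft's official website for one week in February 1998.
It contains the visits of 38000 anonymous users to certain pages of www.microsoft.com over that week.
Some people have used this dataset to build collaborative-filtering recommender systems, and the Apriori algorithm can of course be applied to it as well.
I have also uploaded the data file for you.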
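To make the Apriori suggestion concrete, here is a minimal sketch of frequent-itemset mining in plain Python. It is only an illustration under stated assumptions: the function name `apriori`, the `min_support` parameter (an absolute count), and the toy `sessions` data with made-up page IDs are all hypothetical and are not taken from the dataset or from any particular library.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset(itemset): support_count} for every frequent itemset.

    min_support is an absolute count (hypothetical parameter for this sketch).
    """
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count how many transactions contain each single item.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        # Join step: merge frequent (k-1)-itemsets into candidate k-itemsets.
        candidates = set()
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset of a candidate must be frequent.
            if all(frozenset(sub) in current
                   for sub in combinations(union, k - 1)):
                candidates.add(union)

        # Count the support of the surviving candidates.
        counts = {c: sum(1 for t in transactions if c <= t)
                  for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_support}
        frequent.update(current)
        k += 1

    return frequent


# Toy data (made up): each set holds the page IDs one user visited.
sessions = [
    {1000, 1001, 1002},
    {1001, 1002},
    {1000, 1002},
    {1001, 1002, 1003},
]

frequent = apriori(sessions, min_support=2)
for itemset, support in sorted(frequent.items(),
                               key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), support)
```

On the real data, each user's set of visited pages would play the role of one transaction. With 38000 users, a relative support threshold (a fraction of all users) is probably more convenient than an absolute count.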
Source:
Creators:
Jack S. Breese, David Heckerman, Carl M. Kadie
Microsoft Research, Redmond WA, 98052-6399, USA
breese '@' microsoft.com, heckerma '@' microsoft.com, carlk '@' microsoft.com
Donors:
Breese, Heckerman, & Kadie
Data Set Information:
We created the data by sampling and processing the www.microsoft.com logs. The data records the use of www.microsoft.com by 38000 anonymous, randomly-selected users. For each user, the data lists all the areas of the web site (Vroots) that user visited in a one week timeframe.
Users are identified only by a sequential number, for example, User #14988, User #14989, etc. The file contains no personally identifiable information. The 294 Vroots are identified by their title (e.g. "NetShow for PowerPoint") and URL (e.g. "/stream"). The data comes from one week in February, 1998.
Attribute Information:
Each attribute is an area ("vroot") of the www.microsoft.com web site.
The datasets record which Vroots each user visited in a one-week timeframe in February 1998.
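As a rough sketch of how the data file might be loaded, the snippet below parses it into a Vroot table and a per-user set of visited Vroots. The line layout is an assumption based on my recollection of the dataset's .info file, so check it against that file before relying on it: `A,<id>,1,"<title>","<url>"` declares a Vroot, `C,"<id>",<id>` starts a user, and each following `V,<vroot id>,1` records one visit by that user. The function name `parse_msweb` and the sample lines (including the numeric IDs) are hypothetical.

```python
import csv


def parse_msweb(lines):
    """Parse lines in the assumed anonymous-msweb layout.

    Returns (vroots, visits) where
      vroots: {vroot_id: (title, url)}
      visits: {user_id: set of vroot_ids}
    """
    vroots = {}
    visits = {}
    current_user = None
    for row in csv.reader(lines):
        if not row:
            continue
        kind = row[0]
        if kind == "A":
            # Attribute line: Vroot id, ignored flag, title, URL.
            vroots[int(row[1])] = (row[3], row[4])
        elif kind == "C":
            # Case line: starts the record of a new user.
            current_user = int(row[2])
            visits.setdefault(current_user, set())
        elif kind == "V" and current_user is not None:
            # Vote line: the current user visited this Vroot.
            visits[current_user].add(int(row[1]))
        # Any other line type (header or comment lines) is skipped.
    return vroots, visits


# Illustrative sample in the assumed layout; the IDs are made up.
sample = '''A,1277,1,"NetShow for PowerPoint","/stream"
A,1000,1,"Example Area","/example"
C,"14988",14988
V,1277,1
V,1000,1
C,"14989",14989
V,1000,1
'''.splitlines()

vroots, visits = parse_msweb(sample)
print(vroots)
print(visits)
```

For the real file, something like `with open("anonymous-msweb.data", newline="") as f: vroots, visits = parse_msweb(f)` should work under the same layout assumption. The resulting per-user sets can be used directly as transactions for association-rule mining or as rows of a user-item matrix for collaborative filtering.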
Relevant Papers:
J. Breese, D. Heckerman, C. Kadie. _Empirical Analysis of Predictive Algorithms for Collaborative Filtering_. Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, Madison, WI, July 1998.
Also expanded as Microsoft Research Technical Report MSR-TR-98-12. The papers are available online at: [Web Link]
References:
http://archive.ics.uci.e/ml/datasets/Anonymous+Microsoft+Web+Data
http://archive.ics.uci.e/ml/machine-learning-databases/anonymous/
Download link:
http://archive.ics.uci.e/ml/machine-learning-databases/anonymous/anonymous-msweb.data
Detailed description:
http://archive.ics.uci.e/ml/machine-learning-databases/anonymous/anonymous-msweb.info