Data
-
BioCreative III PPI ACT dataset - submitted by krallinger 0 views, 11353 downloads, 0 comments
last edited by krallinger - Dec 31, 2012, 12:22 CET
- Summary:
Dataset used for the article classification task of BioCreative III (see http://www.biocreative.org/tasks/biocreative-iii/ppi/)
- License: unknown
- Tags: article Biocreative categorization Classification ranking text
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: zip (38.5 MB)
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
-
DMOZ Web Directory Topics - submitted by jeanbaptiste 6483 views, 11471 downloads, 0 comments
last edited by jeanbaptiste - Mar 29, 2012, 16:47 CET
- Summary:
Contains parsed web pages along with their topics, extracted from the DMOZ web directory
- Data Shape: 10630 attributes, 2658 instances
- License: unknown
- Tags: bag-of-words Classification DMOZ libsvm multi-class text web-pages
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (4.1 MB) XML CSV ARFF LibSVM Matlab Octave
-
Yahoo! Web Directory Topics - submitted by jeanbaptiste 2331 views, 13246 downloads, 0 comments
last edited by jeanbaptiste - Mar 13, 2012, 15:16 CET
- Summary:
Contains parsed web pages along with their topics, extracted from the Yahoo! web directory
- Data Shape: 10630 attributes, 2212 instances
- License: unknown
- Tags: bag-of-words Classification multi-class text web-pages Yahoo!
- Tasks / Methods / Challenges: 1 task, 0 methods, 1 challenge
- Download: HDF5 (3.6 MB) XML CSV ARFF LibSVM Matlab Octave
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody. If you are a copyright holder and would like us to remove a dataset, please inform us and we will do so as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning), http://www.pascal-network.org/.