View BioCreative III PPI ACT dataset (public)
























- Summary
Dataset used for the article classification task of BioCreative III (see http://www.biocreative.org/tasks/biocreative-iii/ppi/)
- License
- unknown
- Dependencies
- Tags
- article Biocreative categorization Classification ranking text
- Attribute Types
- Download
-
zip (38.5 MB)
Files are converted on demand and the process can take up to a minute. Please wait until download begins.
You can edit this item to add more meta information and make use of the site's premium features.
- Original Data Format
- zip
- Name
- Version mldata
- Comment
- Names
- Data (first 10 data points)
- Description
Dataset used for the article classification task of BioCreative III (see http://www.biocreative.org/tasks/biocreative-iii/ppi/).
Detecting articles describing complex biological events like Protein-protein interaction was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BioCreative III-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times.
- URLs
- http://www.biocreative.org/tasks/biocreative-iii/ppi
- Publications
- Data Source
- PubMed records (abstracts)
- Measurement Details
Evaluation will be based on comparison between automatically generated results and manual examination of a set of PubMed records. Evaluation was done using the evaluation library for the ACT task: http://www.biocreative.org/resources/biocreative-ii5/evaluation-library/
- Usage Scenario
Literature retrieval by biomedical researches, and database curators.
- revision 1
- by krallinger on 2012-12-31 12:22
No one has posted any comments yet. Perhaps you would like to be the first?
Leave a comment
To post a comment, please sign in.This item was downloaded 11353 times and viewed 1 times.
No Tasks yet on dataset BioCreative III PPI ACT dataset
Submit a new Task for this Data itemDisclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.