View regression-datasets cpu_small (public)
























- Summary
(No information yet)
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,Floating Point
- Download
-
# Instances: 8192 / # Attributes: 13
HDF5 (526.3 KB) XML CSV ARFF LibSVM Matlab OctaveFiles are converted on demand and the process can take up to a minute. Please wait until download begins.
You can edit this item to add more meta information and make use of the site's premium features.
- Original Data Format
- arff
- Name
- cpu_small
- Version mldata
- 0
- Comment
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user university department. Users would typically be doing a large variety of tasks ranging from accessing the internet, editing files or running very cpu-bound programs. The data was collected continuously on two separate occasions. On both occassions, system activity was gathered every 5 seconds. The final dataset is taken from both occasions with equal numbers of observations coming from each collection epoch.
System measures used: 1. lread - Reads (transfers per second ) between system memory and user memory. 2. lwrite - writes (transfers per second) between system memory and user memory. 3. scall - Number of system calls of all types per second. 4. sread - Number of system read calls per second. 5. swrite - Number of system write calls per second . 6. fork - Number of system fork calls per second. 7. exec - Number of system exec calls per second. 8. rchar - Number of characters transferred per second by system read calls. 9. wchar - Number of characters transfreed per second by system write calls. 10. pgout - Number of page out requests per second. 11. ppgout - Number of pages, paged out per second. 12. pgfree - Number of pages per second placed on the free list. 13. pgscan - Number of pages checked if they can be freed per second. 14. atch - Number of page attaches (satisfying a page fault by reclaiming a page in memory) per second. 15. pgin - Number of page-in requests per second. 16. ppgin - Number of pages paged in per second. 17. pflt - Number of page faults caused by protection errors (copy-on-writes). 18. vflt - Number of page faults caused by address translation. 19. runqsz - Process run queue size. 20. freemem - Number of memory pages available to user processes. 21. freeswap - Number of disk blocks available for page swapping. 22. usr - Portion of time (%) that cpus run in user mode. 23. sys - Portion of time (%) that cpus run in system mode. 24. wio - Portion of time (%) that cpus are idle waiting for block IO. 25. idle - Portion of time (%) that cpus are otherwise idle.
The two different regression tasks obtained from these databases are:
CompAct Predict usr, the portion of time that cpus run in user mode from all attributes 1-21.
CompAct(s) Predict usr using a restricted number (excluding the paging information (10-18)
Original source: DELVE repository of data. Source: collection of regression datasets by Luis Torgo (ltorgo@ncc.up.pt) at http://www.ncc.up.pt/~ltorgo/Regression/DataSets.html Characteristics: 8192 cases, 13 continuous attributes
- Names
- lread,lwrite,scall,sread,swrite,fork,exec,rchar,wchar,runqsz,
- Types
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- Data (first 10 data points)
lread lwrite scall sread swrite fork exec rchar wchar runqsz ... 6 2 1036 103 114 1.0 1.0 172076 355965 2.0 ... 1 0 2165 205 101 0.4 1.2 43107 44139 3.0 ... 62 77 3806 258 166 1.4 1.4 492142 268706 5.2 ... 5 0 4721 256 177 0.99 2.58 524787 174964 1.0 ... 42 55 3949 249 244 2.6 4.6 197289 529200 3.4 ... 5 1 1692 132 87 0.4 1.8 220194 107031 2.2 ... 3 0 635 65 47 3.0 3.0 87465 40740 1.0 ... 7 5 1341 240 120 0.4 0.6 718437 672290 4.8 ... 159 40 2443 299 262 1.0 1.0 240375 209450 2.8 ... 1 0 3322 271 170 1.0 3.2 399277 128680 1.0 ... ... ... ... ... ... ... ... ... ... ... ...
- Description
A jarfile containing 30 regression datasets collected by Luis Torgo (regression-datasets.jar, 10,090,266 Bytes).
- URLs
- (No information yet)
- Publications
- Data Source
- Measurement Details
- Usage Scenario
- revision 1
- by mldata on 2010-11-06 09:58
No one has posted any comments yet. Perhaps you would like to be the first?
Leave a comment
To post a comment, please sign in.This item was downloaded 6359 times and viewed 2115 times.
No Tasks yet on dataset regression-datasets cpu_small
Submit a new Task for this Data itemDisclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.