uci-20070111 mfeat-factors (public)

- Summary
(No information yet)
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer
- Download
- # Instances: 2000 / # Attributes: 217
- HDF5 (1.7 MB), XML, CSV, ARFF, LibSVM, Matlab, Octave
- Original Data Format
- arff
- Name
- mfeat
- Version mldata
- 0
- Comment
The multi-feature digit dataset
Owned and donated by:
Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box 5046, 2600 GA Delft, The Netherlands
email: duin@ph.tn.tudelft.nl, http://www.ph.tn.tudelft.nl/~duin, tel +31 15 2786143
Usage
A slightly different version of the database is used in
M. van Breukelen, R.P.W. Duin, D.M.J. Tax, and J.E. den Hartog, Handwritten digit recognition by combined classifiers, Kybernetika, vol. 34, no. 4, 1998, 381-386.
M. van Breukelen and R.P.W. Duin, Neural Network Initialization by Combined Classifiers, in: A.K. Jain, S. Venkatesh, B.C. Lovell (eds.), ICPR'98, Proc. 14th Int. Conference on Pattern Recognition (Brisbane, Aug. 16-20),
The database in its present form is used in:
A.K. Jain, R.P.W. Duin, J. Mao, Statistical Pattern Recognition: A Review, in preparation
Description
This dataset consists of features of handwritten numerals ('0' to '9') extracted from a collection of Dutch utility maps. 200 patterns per class (for a total of 2,000 patterns) have been digitized in binary images. These digits are represented in terms of the following six feature sets (files):
- mfeat-fou: 76 Fourier coefficients of the character shapes;
- mfeat-fac: 216 profile correlations;
- mfeat-kar: 64 Karhunen-Loève coefficients;
- mfeat-pix: 240 pixel averages in 2 x 3 windows;
- mfeat-zer: 47 Zernike moments;
- mfeat-mor: 6 morphological features.
In each file the 2000 patterns are stored in ASCII on 2000 lines. The first 200 patterns are of class '0', followed by sets of 200 patterns for each of the classes '1' to '9'. Corresponding patterns in different feature sets (files) correspond to the same original character.
The source image dataset is lost. Sampled versions of the original images (15 x 16 pixels) may be obtained from the pixel dataset (mfeat-pix).
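The row ordering and the pixel layout described above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the image orientation (16 rows of 15 pixels, stored row by row) is an assumption, since the description only gives the size as 15 x 16.

```python
PATTERNS_PER_CLASS = 200  # from the description: 200 patterns per class
NUM_CLASSES = 10          # digits 0 through 9


def label_for_row(row_index):
    """Return the digit class for a 0-based row index.

    Relies on the file order stated in the description: the first 200
    rows are class 0, the next 200 are class 1, and so on.
    """
    if not 0 <= row_index < PATTERNS_PER_CLASS * NUM_CLASSES:
        raise IndexError("row index outside the 2000 patterns")
    return row_index // PATTERNS_PER_CLASS


def to_image(pixels, rows=16, cols=15):
    """Reshape one mfeat-pix row (240 values) into a nested list.

    ASSUMPTION: the image has 16 rows and 15 columns in row-major
    order. Swap the defaults if the images come out transposed.
    """
    if len(pixels) != rows * cols:
        raise ValueError("expected %d values, got %d" % (rows * cols, len(pixels)))
    return [list(pixels[r * cols:(r + 1) * cols]) for r in range(rows)]
```

If the digits look transposed when displayed, calling `to_image(pixels, rows=15, cols=16)` is the first thing to try.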
Total number of instances:
2000 (200 instances per class)
Total number of attributes:
649 (distributed over 6 datasets, see above)
No missing attributes
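The total of 649 attributes follows from the six feature-set sizes listed earlier. A small sketch, with a hypothetical `combine` helper, checks the arithmetic and shows how per-file vectors for the same pattern could be joined into one 649-value vector, assuming the rows are already aligned across files as the description states.

```python
# Feature-set sizes as listed in the description.
FEATURE_SETS = {
    "mfeat-fou": 76,   # Fourier coefficients
    "mfeat-fac": 216,  # profile correlations
    "mfeat-kar": 64,   # Karhunen-Loeve coefficients
    "mfeat-pix": 240,  # pixel averages
    "mfeat-zer": 47,   # Zernike moments
    "mfeat-mor": 6,    # morphological features
}

TOTAL_ATTRIBUTES = sum(FEATURE_SETS.values())  # 649


def combine(*feature_vectors):
    """Concatenate the feature vectors of one pattern, in the order given."""
    combined = []
    for vector in feature_vectors:
        combined.extend(vector)
    return combined
```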
Total number of classes:
10
Format:
6 files, see above. Each file contains 2000 lines, one for each instance. Attributes are SPACE separated and can be loaded by Matlab as
> load filename
No missing attributes. Some are integer, others are real.
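Outside Matlab, the same space-separated layout can be read with a few lines of standard-library Python. The sketch below is a minimal parser under the format stated above; `parse_mfeat` is a hypothetical name, and the check for a constant row width is an added safeguard rather than part of the original format description.

```python
def parse_mfeat(text):
    """Parse whitespace-separated numeric rows into a list of float lists."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue  # tolerate blank lines
        rows.append([float(value) for value in fields])
    # Every instance should have the same number of attributes.
    if len({len(row) for row in rows}) > 1:
        raise ValueError("rows have differing numbers of attributes")
    return rows
```

Assuming one of the original files has been saved locally under its feature-set name (for example `mfeat-fac`), it could be read with `parse_mfeat(open("mfeat-fac").read())`.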
Information about the dataset
CLASSTYPE: nominal
CLASSINDEX: last
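With CLASSINDEX: last, each converted row carries its class as the final attribute. A minimal sketch of separating the two, assuming a row has already been parsed into a list (the helper name and the example label are hypothetical):

```python
def split_features_and_class(row):
    """Split one row into (features, class_label), class being the last entry."""
    if len(row) < 2:
        raise ValueError("a row needs at least one feature and a class")
    return list(row[:-1]), row[-1]
```

For instance, `split_features_and_class([98, 236, 531, "label"])` yields the three numeric features and the placeholder string `"label"`; the actual label values used in the converted files are not shown on this page.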
- Names
- att1,att2,att3,att4,att5,att6,att7,att8,att9,att10,
- Types
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- Data (first 10 data points)
att1  att2  att3  att4  att5  att6  att7  att8  att9  att10  ...
98    236   531   673   607   647   2     9     3     6      ...
121   193   607   611   585   665   7     9     2     4      ...
115   141   590   605   557   627   12    6     3     3      ...
90    122   627   692   607   642   0     6     4     5      ...
157   167   681   666   587   666   8     6     1     4      ...
128   224   799   690   653   620   16    22    8     9      ...
185   259   575   615   609   673   2     8     5     7      ...
133   173   591   665   594   651   1     7     3     5      ...
206   332   561   588   635   693   3     9     8     7      ...
183   177   583   606   627   676   1     8     1     4      ...
...   ...   ...   ...   ...   ...   ...   ...   ...   ...    ...
- Description
A gzip'ed tar containing UCI and UCI KDD datasets (uci-20070111.tar.gz, 17,952,832 Bytes)
- URLs
- (No information yet)
- Publications
- Data Source
- http://www.ics.uci.edu/~mlearn/MLRepository.html http://kdd.ics.uci.edu/
- Measurement Details
- Usage Scenario
- revision 1
- by mldata on 2010-11-06 09:58
This item was downloaded 4234 times and viewed 2521 times.
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.