View datasets-UCI credit-a (public)
























- Summary
(No information yet)
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,Floating Point,String
- Download
-
# Instances: 690 / # Attributes: 16
HDF5 (308.3 KB) XML CSV ARFF LibSVM Matlab OctaveFiles are converted on demand and the process can take up to a minute. Please wait until download begins.
You can edit this item to add more meta information and make use of the site's premium features.
- Original Data Format
- arff
- Name
- credit-rating
- Version mldata
- 0
- Comment
Title: Credit Approval
Sources: (confidential) Submitted by quinlan@cs.su.oz.au
Past Usage:
See Quinlan, "Simplifying decision trees", Int J Man-Machine Studies 27, Dec 1987, pp. 221-234. "C4.5: Programs for Machine Learning", Morgan Kaufmann, Oct 1992
Relevant Information:
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect confidentiality of the data.
This dataset is interesting because there is a good mix of attributes -- continuous, nominal with small numbers of values, and nominal with larger numbers of values. There are also a few missing values.
Number of Instances: 690
Number of Attributes: 15 + class attribute
Attribute Information:
A1: b, a. A2: continuous. A3: continuous. A4: u, y, l, t. A5: g, p, gg. A6: c, d, cc, i, j, k, m, r, q, w, x, e, aa, ff. A7: v, h, bb, j, n, z, dd, ff, o. A8: continuous. A9: t, f. A10: t, f. A11: continuous. A12: t, f. A13: g, p, s. A14: continuous. A15: continuous. A16: +,- (class attribute)
Missing Attribute Values: 37 cases (5%) have one or more missing values. The missing values from particular attributes are:
A1: 12 A2: 12 A4: 6 A5: 6 A6: 9 A7: 9 A14: 13
Class Distribution
+: 307 (44.5%) -: 383 (55.5%)
- Names
- A1,A2,A3,A4,A5,A6,A7,A8,A9,A10,
- Types
- nominal:b,a
- numeric
- numeric
- nominal:u,y,l,t
- nominal:g,p,gg
- nominal:c,d,cc,i,j,k,m,r,q,w,x,e,aa,ff
- nominal:v,h,bb,j,n,z,dd,ff,o
- numeric
- nominal:t,f
- nominal:t,f
- Data (first 10 data points)
A1 A2 A3 A4 A5 A6 A7 A8 A9 A10 ... b 30.83 0.0 u g w v 1.25 t t ... a 58.67 4.46 u g q h 3.04 t t ... a 24.5 0.5 u g q h 1.5 t f ... b 27.83 1.54 u g w v 3.75 t t ... b 20.17 5.625 u g w v 1.71 t f ... b 32.08 4.0 u g m v 2.5 t f ... b 33.17 1.04 u g r h 6.5 t f ... a 22.92 11.585 u g cc v 0.04 t f ... b 54.42 0.5 y p k h 3.96 t f ... b 42.5 4.915 y p w v 3.165 t f ... ... ... ... ... ... ... ... ... ... ... ...
- Description
A jarfile containing 37 classification problems, originally obtained from the UCI repository (datasets-UCI.jar, 1,190,961 Bytes).
- URLs
- (No information yet)
- Publications
- Data Source
- http://www.ics.uci.edu/~mlearn/MLRepository.html
- Measurement Details
- Usage Scenario
- revision 1
- by mldata on 2010-11-06 09:57
No one has posted any comments yet. Perhaps you would like to be the first?
Leave a comment
To post a comment, please sign in.This item was downloaded 5395 times and viewed 6406 times.
No Tasks yet on dataset datasets-UCI credit-a
Submit a new Task for this Data itemDisclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.