View regression-datasets housing (public)
























- Summary
UCI boston housing data
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,Floating Point
- Download
-
# Instances: 506 / # Attributes: 14
HDF5 (59.7 KB) XML CSV ARFF LibSVM Matlab OctaveFiles are converted on demand and the process can take up to a minute. Please wait until download begins.
- Original Data Format
- arff
- Name
- 'housing'
- Version mldata
- 0
- Comment
Title: Boston Housing Data
Sources: (a) Origin: This dataset was taken from the StatLib library which is maintained at Carnegie Mellon University. (b) Creator: Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978. (c) Date: July 7, 1993
Past Usage:
Used in Belsley, Kuh & Welsch, 'Regression diagnostics ...', Wiley,
- N.B. Various transformations are used in the table on pages 244-261.
- Quinlan,R. (1993). Combining Instance-Based and Model-Based Learning. In Proceedings on the Tenth International Conference of Machine Learning, 236-243, University of Massachusetts, Amherst. Morgan Kaufmann.
Relevant Information:
Concerns housing values in suburbs of Boston.
Number of Instances: 506
Number of Attributes: 13 continuous attributes (including "class" attribute "MEDV"), 1 binary-valued attribute.
Attribute Information:
- CRIM per capita crime rate by town
- ZN proportion of residential land zoned for lots over 25,000 sq.ft.
- INDUS proportion of non-retail business acres per town
- CHAS Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)
- NOX nitric oxides concentration (parts per 10 million)
- RM average number of rooms per dwelling
- AGE proportion of owner-occupied units built prior to 1940
- DIS weighted distances to five Boston employment centres
- RAD index of accessibility to radial highways
- TAX full-value property-tax rate per $10,000
- PTRATIO pupil-teacher ratio by town
- B 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town
- LSTAT % lower status of the population
- MEDV Median value of owner-occupied homes in $1000's
Missing Attribute Values: None.
- Names
- CRIM,ZN,INDUS,CHAS,NOX,RM,AGE,DIS,RAD,TAX,
- Types
- numeric
- numeric
- numeric
- nominal:0,1
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- Data (first 10 data points)
CRIM ZN INDUS CHAS NOX RM AGE DIS RAD TAX ... 0.00... 18 2.31 0 0.538 6.575 65.2 4.09 1 296 ... 0.02... 0 7.07 0 0.469 6.421 78.9 4.9671 2 242 ... 0.02... 0 7.07 0 0.469 7.185 61.1 4.9671 2 242 ... 0.03... 0 2.18 0 0.458 6.998 45.8 6.0622 3 222 ... 0.06... 0 2.18 0 0.458 7.147 54.2 6.0622 3 222 ... 0.02... 0 2.18 0 0.458 6.43 58.7 6.0622 3 222 ... 0.08... 12 7.87 0 0.524 6.012 66.6 5.5605 5 311 ... 0.14... 12 7.87 0 0.524 6.172 96.1 5.9505 5 311 ... 0.21... 12 7.87 0 0.524 5.631 100.0 6.0821 5 311 ... 0.17... 12 7.87 0 0.524 6.004 85.9 6.5921 5 311 ... ... ... ... ... ... ... ... ... ... ... ...
- Description
A jarfile containing 30 regression datasets collected by Luis Torgo (regression-datasets.jar, 10,090,266 Bytes).
- URLs
- http://archive.ics.uci.edu/ml/datasets/Housing
- Publications
- Data Source
- Originally from the UCI machine learning repository.
- Measurement Details
From the UCI repository:
Title: Boston Housing Data
Sources: (a) Origin: This dataset was taken from the StatLib library which is maintained at Carnegie Mellon University. (b) Creator: Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978. (c) Date: July 7, 1993
Past Usage:
Used in Belsley, Kuh & Welsch, 'Regression diagnostics ...', Wiley,
- N.B. Various transformations are used in the table on pages 244-261.
- Quinlan,R. (1993). Combining Instance-Based and Model-Based Learning. In Proceedings on the Tenth International Conference of Machine Learning, 236-243, University of Massachusetts, Amherst. Morgan Kaufmann.
Relevant Information:
Concerns housing values in suburbs of Boston.
Number of Instances: 506
Number of Attributes: 13 continuous attributes (including "class" attribute "MEDV"), 1 binary-valued attribute.
Attribute Information:
- CRIM per capita crime rate by town
- ZN proportion of residential land zoned for lots over 25,000 sq.ft.
- INDUS proportion of non-retail business acres per town
- CHAS Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)
- NOX nitric oxides concentration (parts per 10 million)
- RM average number of rooms per dwelling
- AGE proportion of owner-occupied units built prior to 1940
- DIS weighted distances to five Boston employment centres
- RAD index of accessibility to radial highways
- TAX full-value property-tax rate per $10,000
- PTRATIO pupil-teacher ratio by town
- B 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town
- LSTAT % lower status of the population
- MEDV Median value of owner-occupied homes in $1000's
Missing Attribute Values: None.
- Usage Scenario
Predict the median value of home from all other variables.
- revision 1
- by mldata on 2010-11-06 09:58
- revision 2
- by cong on 2011-09-14 15:17
No one has posted any comments yet. Perhaps you would like to be the first?
Leave a comment
To post a comment, please sign in.This item was downloaded 13126 times and viewed 17644 times.
Tasks defined on dataset regression-datasets housing
- boston housing regression 2010-11-28 16:04
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.