View datasets-numeric auto93 (public)
- Summary
(No information yet)
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,Floating Point,String
- Download
-
# Instances: 93 / # Attributes: 23
HDF5 (31.8 KB), XML, CSV, ARFF, LibSVM, Matlab, Octave
Files are converted on demand and the process can take up to a minute.
- Original Data Format
- arff
- Name
- 'auto93.names'
- Version mldata
- 0
- Comment
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
Attributes 2,4, and 6 deleted. Midrange price treated as the class attribute.
As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in Connectionist-Based Information Systems. Singapore: Springer-Verlag.
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
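The header note above can be checked against the attribute count reported for this item. The following is a minimal sketch, assuming that the deleted attribute numbers are 1-based and follow the order of the VARIABLE DESCRIPTIONS in the original file; the list and variable names are illustrative, not taken from the ARFF file.

```python
# Variable order as given under VARIABLE DESCRIPTIONS (26 variables in the original file).
ORIGINAL = [
    "Manufacturer", "Model", "Type", "Minimum Price", "Midrange Price",
    "Maximum Price", "City MPG", "Highway MPG", "Air Bags standard",
    "Drive train type", "Number of cylinders", "Engine size", "Horsepower",
    "RPM", "Engine revolutions per mile", "Manual transmission available",
    "Fuel tank capacity", "Passenger capacity", "Length", "Wheelbase",
    "Width", "U-turn space", "Rear seat room", "Luggage capacity",
    "Weight", "Domestic",
]

DELETED = {2, 4, 6}            # assumption: 1-based positions in the list above
CLASS_NAME = "Midrange Price"  # treated as the class attribute

# Attributes that survive the deletion, in their original order.
kept = [name for i, name in enumerate(ORIGINAL, start=1) if i not in DELETED]
# Everything except the class attribute is a predictor.
predictors = [name for name in kept if name != CLASS_NAME]

print(len(kept), "attributes kept,", len(predictors), "predictors")
```

Under this reading the deleted attributes are Model, Minimum Price, and Maximum Price, which leaves 23 attributes and agrees with the attribute count shown for this item.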
NAME: 1993 New Car Data TYPE: Sample
SIZE: 93 observations, 26 variables
DESCRIPTIVE ABSTRACT: Specifications are given for 93 new car models for the 1993 year. Several measures are given to evaluate price, mpg ratings, engine size, body size, and features.
SOURCES: Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union. PACE New Car & Truck 1993 Buying Guide (1993), Milwaukee, WI: Pace Publications Inc.
VARIABLE DESCRIPTIONS:
Line 1
Columns
 1 - 14  Manufacturer
15 - 29  Model
30 - 36  Type: Small, Sporty, Compact, Midsize, Large - as defined in the Consumer Reports article
38 - 41  Minimum Price (in $1,000) - Price for basic version of this model
43 - 46  Midrange Price (in $1,000) - Average of Min and Max prices
48 - 51  Maximum Price (in $1,000) - Price for a premium version
53 - 54  City MPG (miles per gallon by EPA rating)
56 - 57  Highway MPG
59 - 59  Air Bags standard: 0 = none, 1 = driver only, 2 = driver & passenger
61 - 61  Drive train type: 0 = rear wheel drive, 1 = front wheel drive, 2 = all wheel drive
63 - 63  Number of cylinders
65 - 67  Engine size (liters)
69 - 71  Horsepower (maximum)
73 - 76  RPM (revs per minute at maximum horsepower)
Line 2
Columns
 1 -  4  Engine revolutions per mile (in highest gear)
 6 -  6  Manual transmission available: 0 = No, 1 = Yes
 8 - 11  Fuel tank capacity (gallons)
13 - 13  Passenger capacity (persons)
15 - 17  Length (inches)
19 - 21  Wheelbase (inches)
23 - 24  Width (inches)
26 - 27  U-turn space (feet)
29 - 32  Rear seat room (inches)
34 - 35  Luggage capacity (cu. ft.)
37 - 40  Weight (pounds)
42 - 42  Domestic?: 0 = non-U.S. manufacturer, 1 = U.S. manufacturer
Values are aligned and delimited by blanks. Missing values are denoted with *. There are two data lines for each case.
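The two-line, fixed-column layout described above could be read along the following lines. This is a minimal sketch, not the code used to produce the ARFF file: the field names, the choice of int or float for each field, and the helper functions are assumptions, and the sample record is made up purely to exercise the parser (it is not a row of the dataset).

```python
# (name, first column, last column, type); columns are 1-based and inclusive.
LINE1 = [
    ("manufacturer", 1, 14, str), ("model", 15, 29, str), ("type", 30, 36, str),
    ("min_price", 38, 41, float), ("midrange_price", 43, 46, float),
    ("max_price", 48, 51, float), ("city_mpg", 53, 54, int),
    ("highway_mpg", 56, 57, int), ("air_bags", 59, 59, int),
    ("drive_train", 61, 61, int), ("cylinders", 63, 63, int),
    ("engine_size", 65, 67, float), ("horsepower", 69, 71, int),
    ("rpm", 73, 76, int),
]
LINE2 = [
    ("revs_per_mile", 1, 4, int), ("manual_available", 6, 6, int),
    ("fuel_tank", 8, 11, float), ("passengers", 13, 13, int),
    ("length", 15, 17, int), ("wheelbase", 19, 21, int), ("width", 23, 24, int),
    ("u_turn", 26, 27, int), ("rear_seat_room", 29, 32, float),
    ("luggage", 34, 35, int), ("weight", 37, 40, int), ("domestic", 42, 42, int),
]


def parse_field(text, kind):
    """Convert one fixed-width field; '*' (or an empty field) means missing."""
    text = text.strip()
    if text in ("*", ""):
        return None
    return kind(text)


def parse_case(line1, line2):
    """Combine the two data lines of one case into a single dict."""
    record = {}
    for line, spec in ((line1, LINE1), (line2, LINE2)):
        for name, start, end, kind in spec:
            record[name] = parse_field(line[start - 1:end], kind)
    return record


def render(values, spec):
    """Helper for the demo only: place text values at their column positions."""
    buf = [" "] * spec[-1][2]
    for (_, start, end, _), text in zip(spec, values):
        assert len(text) <= end - start + 1, "value too wide for its field"
        buf[start - 1:start - 1 + len(text)] = text
    return "".join(buf)


# Made-up record (NOT from the dataset), with two missing values marked by '*'.
line1 = render(["Testmaker", "Testmodel", "Small", "10.0", "12.5", "15.0",
                "25", "31", "0", "1", "*", "1.8", "140", "6300"], LINE1)
line2 = render(["2500", "1", "12.0", "5", "170", "100", "66", "35",
                "*", "10", "2500", "1"], LINE2)
case = parse_case(line1, line2)
print(case["manufacturer"], case["midrange_price"], case["cylinders"])
```

Slicing by column position rather than splitting on whitespace keeps the parser correct even if a text field such as the model name contains a blank.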
SPECIAL NOTES: The only missing values are for CYLINDERS in the rotary engine Mazda RX-7, REAR SEAT room for the two-seaters (Corvette and RX-7), and LUGGAGE capacity for the vans and two-seaters.
WEIGHT is taken from the Consumer Reports data and includes a full fuel tank, automatic transmission (if available), and air conditioning.
STORY BEHIND THE DATA: Cars were selected at random from among 1993 passenger car models that were listed in both the Consumer Reports issue and the PACE Buying Guide. Pickup trucks and Sport/Utility vehicles were eliminated due to incomplete information in the Consumer Reports source. Duplicate models (e.g., Dodge Shadow and Plymouth Sundance) were listed at most once.
A similar dataset for 1989 model cars appeared as one of the sample datasets shipped with the Student Edition of Execustat (PWS-KENT 1990).
Further description can be found in the "Datasets and Stories" article "1993 New Car Data" in the Journal of Statistics Education (Lock 1993). Send the message
send jse/v1n1/datasets.lock
to the address archive@jse.stat.ncsu.edu
PEDAGOGICAL NOTES: This is a multi-purpose dataset that can be used at many points in an introductory course. It includes many good numeric variables and several options for dividing the cars up into groups. Students tend to be familiar with most of the variables (and specific car models). They can anticipate and pose explanations for many of the relationships to be found in the data, although some surprises may be encountered. One can easily find examples of pairs of variables that demonstrate strong or weak, positive or negative associations. PRICE and MPG variables tend to be popular choices as "dependent" variables. Basic graphs will often reveal unusual data values (like the price for a Mercedes-Benz).
REFERENCES:
Lock, R. H. (1993), "1993 New Car Data," Journal of Statistics Education, 1, No. 1.
Student Edition of Execustat (1990), Boston, MA: PWS-KENT Publishing Co.
SUBMITTED BY: Robin H. Lock Mathematics Department St. Lawrence University Canton, NY 13617 (315) 379-5960 rlock@stlawu.bitnet
- Names
- Manufacturer,Type,City_MPG,Highway_MPG,Air_Bags_standard,Drive_train_type,Number_of_cylinders,Engine_size,Horsepower,RPM,
- Types
- nominal:Acura,Audi,BMW,Buick,Cadillac,Chevrolet,Chrysler,Dodge,Eagle,Ford,Geo,Honda,Hyundai,Infiniti,Lexus,Lincoln,Mazda,Mercedes-Benz,Mercury,Mitsubishi,Nissan,Oldsmobile,Plymouth,Pontiac,Saab,Saturn,Subaru,Suzuki,Toyota,Volkswagen,Volvo
- nominal:Small,Midsize,Compact,Large,Sporty,Van
- numeric
- numeric
- nominal:0,2,1
- nominal:1,0,2
- numeric
- numeric
- numeric
- numeric
- Data (first 10 data points)
Manu...  Type     City...  High...  Air_...  Driv...  Numb...  Engi...  Hors...  RPM   ...
Acura    Small    25       31       0        1        4        1.8      140      6300  ...
Acura    Mids...  18       25       2        1        6        3.2      200      5500  ...
Audi     Comp...  20       26       1        1        6        2.8      172      5500  ...
Audi     Mids...  19       26       2        1        6        2.8      172      5500  ...
BMW      Mids...  22       30       1        0        4        3.5      208      5700  ...
Buick    Mids...  22       31       1        1        4        2.2      110      5200  ...
Buick    Large    19       28       1        1        6        3.8      170      4800  ...
Buick    Large    16       25       1        0        6        5.7      180      4000  ...
Buick    Mids...  19       27       1        1        6        3.8      170      4800  ...
Cadi...  Large    16       25       1        1        8        4.9      200      4100  ...
...      ...      ...      ...      ...      ...      ...      ...      ...      ...   ...
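The pedagogical notes mention pairs of variables with positive or negative associations. As a rough illustration, the sketch below computes Pearson correlations from the ten rows shown above only; the function and variable names are illustrative, and ten rows are far too few to say anything about the full 93-car dataset.

```python
from math import sqrt

# (City_MPG, Highway_MPG, Engine_size, Horsepower) for the ten rows shown above.
ROWS = [
    (25, 31, 1.8, 140),
    (18, 25, 3.2, 200),
    (20, 26, 2.8, 172),
    (19, 26, 2.8, 172),
    (22, 30, 3.5, 208),
    (22, 31, 2.2, 110),
    (19, 28, 3.8, 170),
    (16, 25, 5.7, 180),
    (19, 27, 3.8, 170),
    (16, 25, 4.9, 200),
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


city = [row[0] for row in ROWS]
highway = [row[1] for row in ROWS]
engine = [row[2] for row in ROWS]

r_city_highway = pearson(city, highway)  # positive in these ten rows
r_engine_city = pearson(engine, city)    # negative in these ten rows
print(f"city vs highway MPG: {r_city_highway:+.2f}")
print(f"engine size vs city MPG: {r_engine_city:+.2f}")
```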
- Description
A jarfile containing 37 regression problems, obtained from various sources (datasets-numeric.jar, 169,344 Bytes).
- URLs
- (No information yet)
- Publications
- Data Source
- Measurement Details
- Usage Scenario
- revision 1
- by mldata on 2010-11-06 09:57
This item was downloaded 4123 times and viewed 2753 times.
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.