View uci-20070111 auto93 (public)

2010-11-06 09:59 by mldata | Version 1
Summary

(No information yet)

License
unknown (from Weka repository)
Dependencies
Tags
arff slurped Weka
Attribute Types
Integer,Floating Point,String
Download
# Instances: 93 / # Attributes: 23
HDF5 (31.8 KB) XML CSV ARFF LibSVM Matlab Octave

Original Data Format
arff
Name
'auto93.names'
Version mldata
0
Comment

!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Attributes 2, 4, and 6 were deleted. The midrange price is treated as the class attribute.

As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in Connectionist-Based Information Systems. Singapore: Springer-Verlag.
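The counts on this page are consistent with this note: the original description lists 26 variables, and removing three leaves the 23 attributes reported under Download. A minimal sketch of that bookkeeping follows, assuming that "attributes 2, 4, and 6" are 1-based positions in the variable order of the original description (the note does not say so explicitly). The names `ORIGINAL_VARIABLES`, `DELETED_POSITIONS`, and `split_attributes` are hypothetical and chosen only for illustration.

```python
# Variable order as listed in the VARIABLE DESCRIPTIONS of the original file.
ORIGINAL_VARIABLES = [
    "Manufacturer", "Model", "Type", "Minimum Price", "Midrange Price",
    "Maximum Price", "City MPG", "Highway MPG", "Air Bags standard",
    "Drive train type", "Number of cylinders", "Engine size", "Horsepower",
    "RPM", "Engine revolutions per mile", "Manual transmission available",
    "Fuel tank capacity", "Passenger capacity", "Length", "Wheelbase",
    "Width", "U-turn space", "Rear seat room", "Luggage capacity", "Weight",
    "Domestic",
]

# Assumption: the deleted attribute numbers are 1-based positions in that list.
DELETED_POSITIONS = {2, 4, 6}
CLASS_NAME = "Midrange Price"


def split_attributes(variables, deleted, class_name):
    """Return (kept attributes, predictors, deleted names)."""
    kept = [n for i, n in enumerate(variables, start=1) if i not in deleted]
    removed = [n for i, n in enumerate(variables, start=1) if i in deleted]
    predictors = [n for n in kept if n != class_name]
    return kept, predictors, removed


kept, predictors, removed = split_attributes(
    ORIGINAL_VARIABLES, DELETED_POSITIONS, CLASS_NAME
)
print(len(ORIGINAL_VARIABLES), len(kept), len(predictors))
print(removed)
```

Under this assumption the deleted attributes are Model, Minimum Price, and Maximum Price, which agrees with the attribute names listed on this page beginning Manufacturer, Type, City_MPG.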

!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

NAME: 1993 New Car Data
TYPE: Sample
SIZE: 93 observations, 26 variables

DESCRIPTIVE ABSTRACT: Specifications are given for 93 new car models for the 1993 year. Several measures are given to evaluate price, mpg ratings, engine size, body size, and features.

SOURCES:
Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union.
PACE New Car & Truck 1993 Buying Guide (1993), Milwaukee, WI: Pace Publications Inc.

VARIABLE DESCRIPTIONS:

Line 1
  Columns
   1 - 14   Manufacturer
  15 - 29   Model
  30 - 36   Type: Small, Sporty, Compact, Midsize, Large - as defined in the Consumer Reports article
  38 - 41   Minimum Price (in $1,000) - Price for basic version of this model
  43 - 46   Midrange Price (in $1,000) - Average of Min and Max prices
  48 - 51   Maximum Price (in $1,000) - Price for a premium version
  53 - 54   City MPG (miles per gallon by EPA rating)
  56 - 57   Highway MPG
  59 - 59   Air Bags standard: 0 = none, 1 = driver only, 2 = driver & passenger
  61 - 61   Drive train type: 0 = rear wheel drive, 1 = front wheel drive, 2 = all wheel drive
  63 - 63   Number of cylinders
  65 - 67   Engine size (liters)
  69 - 71   Horsepower (maximum)
  73 - 76   RPM (revs per minute at maximum horsepower)

Line 2
  Columns
   1 -  4   Engine revolutions per mile (in highest gear)
   6 -  6   Manual transmission available: 0 = No, 1 = Yes
   8 - 11   Fuel tank capacity (gallons)
  13 - 13   Passenger capacity (persons)
  15 - 17   Length (inches)
  19 - 21   Wheelbase (inches)
  23 - 24   Width (inches)
  26 - 27   U-turn space (feet)
  29 - 32   Rear seat room (inches)
  34 - 35   Luggage capacity (cu. ft.)
  37 - 40   Weight (pounds)
  42 - 42   Domestic?: 0 = non-U.S. manufacturer, 1 = U.S. manufacturer

Values are aligned and delimited by blanks. Missing values are denoted with *. There are two data lines for each case.
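The layout described above (fixed columns, two data lines per case, `*` for missing values) can be read with plain string slicing. The following is a minimal sketch rather than a reference reader: the column positions are copied from the variable descriptions, while the function names (`parse_field`, `parse_case`, `render`) are hypothetical and the sample record consists of made-up placeholder values, not of values from the dataset. Treating an empty field as missing is also an assumption.

```python
# (name, first column, last column), 1-based and inclusive, copied from the
# VARIABLE DESCRIPTIONS of the original file.
LINE1_LAYOUT = [
    ("Manufacturer", 1, 14), ("Model", 15, 29), ("Type", 30, 36),
    ("Minimum Price", 38, 41), ("Midrange Price", 43, 46),
    ("Maximum Price", 48, 51), ("City MPG", 53, 54), ("Highway MPG", 56, 57),
    ("Air Bags standard", 59, 59), ("Drive train type", 61, 61),
    ("Number of cylinders", 63, 63), ("Engine size", 65, 67),
    ("Horsepower", 69, 71), ("RPM", 73, 76),
]
LINE2_LAYOUT = [
    ("Engine revolutions per mile", 1, 4),
    ("Manual transmission available", 6, 6),
    ("Fuel tank capacity", 8, 11), ("Passenger capacity", 13, 13),
    ("Length", 15, 17), ("Wheelbase", 19, 21), ("Width", 23, 24),
    ("U-turn space", 26, 27), ("Rear seat room", 29, 32),
    ("Luggage capacity", 34, 35), ("Weight", 37, 40), ("Domestic", 42, 42),
]


def parse_field(text):
    """Convert one fixed-width field; '*' (or an empty field) means missing."""
    text = text.strip()
    if text in ("", "*"):
        return None
    for convert in (int, float):
        try:
            return convert(text)
        except ValueError:
            pass
    return text  # not numeric, keep as a string


def parse_case(line1, line2):
    """Combine the two data lines of one case into a single dict."""
    record = {}
    for layout, line in ((LINE1_LAYOUT, line1), (LINE2_LAYOUT, line2)):
        for name, first, last in layout:
            record[name] = parse_field(line[first - 1:last])
    return record


def render(layout, values):
    """Demo helper only: place values into their columns, blank-padded."""
    chars = [" "] * layout[-1][2]
    for (_, first, last), value in zip(layout, values):
        chars[first - 1:last] = str(value).ljust(last - first + 1)
    return "".join(chars)


# Made-up placeholder values, NOT taken from the dataset.
line1 = render(LINE1_LAYOUT, ["ExampleMake", "ExampleModel", "Small", 10.0,
                              12.5, 15.0, 25, 31, 0, 1, "*", 1.8, 140, 6300])
line2 = render(LINE2_LAYOUT, [2500, 1, 13.0, 5, 170, 100, 68, 37, 26.5, "*",
                              2700, 0])
record = parse_case(line1, line2)
print(record["Manufacturer"], record["Midrange Price"],
      record["Number of cylinders"])
```

Slicing by column position rather than splitting on blanks keeps manufacturer or model names that contain spaces in one field.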

SPECIAL NOTES: The only missing values are for CYLINDERS in the rotary engine Mazda RX-7, REAR SEAT room for the two-seaters (Corvette and RX-7), and LUGGAGE capacity for the vans and two-seaters.

WEIGHT is taken from the Consumer Reports data and includes a full fuel tank, automatic transmission (if available), and air conditioning.

STORY BEHIND THE DATA: Cars were selected at random from among 1993 passenger car models that were listed in both the Consumer Reports issue and the PACE Buying Guide. Pickup trucks and Sport/Utility vehicles were eliminated due to incomplete information in the Consumer Reports source. Duplicate models (e.g., Dodge Shadow and Plymouth Sundance) were listed at most once.

A similar dataset for 1989 model cars appeared as one of the sample datasets shipped with the Student Edition of Execustat (PWS-KENT 1990).

Further description can be found in the "Datasets and Stories" article "1993 New Car Data" in the Journal of Statistics Education (Lock 1993). Send the message

 send jse/v1n1/datasets.lock

to the address archive@jse.stat.ncsu.edu

PEDAGOGICAL NOTES: This is a multi-purpose dataset that can be used at many points in an introductory course. It includes many good numeric variables and several options for dividing the cars up into groups. Students tend to be familiar with most of the variables (and specific car models). They can anticipate and pose explanations for many of the relationships to be found in the data, although some surprises may be encountered. One can easily find examples of pairs of variables that demonstrate strong or weak, positive or negative associations. PRICE and MPG variables tend to be popular choices as "dependent" variables. Basic graphs will often reveal unusual data values (like the price for a Mercedes-Benz).

REFERENCES:
Lock, R. H. (1993), "1993 New Car Data," Journal of Statistics Education, 1, No. 1.
Student Edition of Execustat (1990), Boston, MA: PWS-KENT Publishing Co.

SUBMITTED BY:
Robin H. Lock
Mathematics Department
St. Lawrence University
Canton, NY 13617
(315) 379-5960
rlock@stlawu.bitnet

Names
Manufacturer,Type,City_MPG,Highway_MPG,Air_Bags_standard,Drive_train_type,Number_of_cylinders,Engine_size,Horsepower,RPM,
Types
  1. nominal:Acura,Audi,BMW,Buick,Cadillac,Chevrolet,Chrysler,Dodge,Eagle,Ford,Geo,Honda,Hyundai,Infiniti,Lexus,Lincoln,Mazda,Mercedes-Benz,Mercury,Mitsubishi,Nissan,Oldsmobile,Plymouth,Pontiac,Saab,Saturn,Subaru,Suzuki,Toyota,Volkswagen,Volvo
  2. nominal:Small,Midsize,Compact,Large,Sporty,Van
  3. numeric
  4. numeric
  5. nominal:0,2,1
  6. nominal:1,0,2
  7. numeric
  8. numeric
  9. numeric
  10. numeric
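The type descriptors above follow a simple pattern: either `numeric`, or `nominal:` followed by a comma-separated list of allowed values. As a rough sketch of how such descriptors could be used to check a value, assuming that pattern holds for every attribute (the helper names `parse_type` and `is_valid` are hypothetical):

```python
def parse_type(spec):
    """Split a descriptor such as 'nominal:0,2,1' into (kind, allowed values)."""
    kind, _, rest = spec.partition(":")
    values = [v.strip() for v in rest.split(",")] if rest else []
    return kind.strip(), values


def is_valid(value, spec):
    """Check one raw string value against one type descriptor."""
    kind, values = parse_type(spec)
    if kind == "nominal":
        return value in values
    if kind == "numeric":
        try:
            float(value)
        except ValueError:
            return False
        return True
    raise ValueError("unknown type descriptor: " + spec)


# Descriptors copied from the Types list (attributes 2, 3, and 5).
TYPE_SPEC = "nominal:Small,Midsize,Compact,Large,Sporty,Van"
MPG_SPEC = "numeric"
AIR_BAG_SPEC = "nominal:0,2,1"

print(is_valid("Van", TYPE_SPEC))      # value is in the nominal list
print(is_valid("Truck", TYPE_SPEC))    # value is not in the nominal list
print(is_valid("25", MPG_SPEC))        # parses as a number
print(is_valid("*", MPG_SPEC))         # missing-value marker is not a number
```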
Data (first 10 data points)
    Manu...  Type     City...  High...  Air_...  Driv...  Numb...  Engi...  Hors...  RPM   ...
    Acura    Small    25       31       0        1        4        1.8      140      6300  ...
    Acura    Mids...  18       25       2        1        6        3.2      200      5500  ...
    Audi     Comp...  20       26       1        1        6        2.8      172      5500  ...
    Audi     Mids...  19       26       2        1        6        2.8      172      5500  ...
    BMW      Mids...  22       30       1        0        4        3.5      208      5700  ...
    Buick    Mids...  22       31       1        1        4        2.2      110      5200  ...
    Buick    Large    19       28       1        1        6        3.8      170      4800  ...
    Buick    Large    16       25       1        0        6        5.7      180      4000  ...
    Buick    Mids...  19       27       1        1        6        3.8      170      4800  ...
    Cadi...  Large    16       25       1        1        8        4.9      200      4100  ...
    ...      ...      ...      ...      ...      ...      ...      ...      ...      ...   ...
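The pedagogical notes mention that pairs of variables in this dataset show positive or negative associations. A small sketch using only the ten preview rows listed above illustrates the idea with a hand-written Pearson correlation. With ten rows out of 93 the numbers are not representative of the full dataset, and the function name `pearson` and the column tuples are choices made for this example.

```python
import math

# City MPG, Highway MPG, and Engine size for the ten preview rows above.
CITY_MPG = [25, 18, 20, 19, 22, 22, 19, 16, 19, 16]
HIGHWAY_MPG = [31, 25, 26, 26, 30, 31, 28, 25, 27, 25]
ENGINE_SIZE = [1.8, 3.2, 2.8, 2.8, 3.5, 2.2, 3.8, 5.7, 3.8, 4.9]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


r_city_highway = pearson(CITY_MPG, HIGHWAY_MPG)
r_city_engine = pearson(CITY_MPG, ENGINE_SIZE)
print(round(r_city_highway, 3), round(r_city_engine, 3))
```

In these ten rows, city and highway MPG move together (a positive coefficient), while city MPG falls as engine size grows (a negative coefficient).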
Description

A gzip'ed tar containing UCI and UCI KDD datasets (uci-20070111.tar.gz, 17,952,832 Bytes)

URLs
(No information yet)
Publications
Data Source
http://www.ics.uci.edu/~mlearn/MLRepository.html
http://kdd.ics.uci.edu/
Measurement Details
Usage Scenario
