View statlib-20050214 hutsof99_logis (public)

2010-11-06 10:00 by mldata | Version 1 | Rating Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star
Rating
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Overall (based on 0 votes)
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Interesting
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Documentation
Summary

(No information yet)

License
unknown (from Weka repository)
Dependencies
Tags
arff slurped Weka
Attribute Types
Integer,Floating Point
Download
# Instances: 70 / # Attributes: 8
HDF5 (17.0 KB) XML CSV ARFF LibSVM Matlab Octave
Completeness of this item currently: 55%.
You can edit this item to add more meta information and make use of the site's premium features.
Original Data Format
arff
Name
hutsof99-logis
Version mldata
0
Comment

Graeme D. Hutcheson and Nick Sofroniou 1999

The Multivariate Social Scientist: Introductory Statistics Using Generalized Linear Models.

SAGE Publications.

Copyright: Graeme D. Hutcheson & Nick Sofroniou, 1999

This software can be freely used for non-commercial purposes and can be freely distributed.

Readme file

The data sets in this directory are taken from the above book. The data are presented in two formats, .dat (ascii) and .por (SPSS portable). The GLIM code and macros are provided in files .glm and .mac. Please read the errata file which indicates some minor differences between these data sets and those reported in the book.

DATA FILE SOURCE IN BOOK DESCRIPTION

Chapter 1 tab1_01.* Table 1.1 Video Games and Hostility

Chapter 2 tab2_01.* Table 2.1 Normal Errors tab2_02. Table 2.2 Skewed Errors tab2_03. Table 2.3 Curvilinearity

Chapter 3 tab3_01.* Table 3.1 Two Simple Models tab3_05. Table 3.5 Cost and Sound Quality tab3_07. Table 3.7 Exam marks and College Offers tab3_11. Table 3.11 Quality of Children's Testimonies Age: 5-6 = 0; 8-9 = 1 Gender: female = 0; male = 1 Location: 1 = home; 2 = school; 3 = police interview 4 = special interview tab3_11d. Table 3.11 Data in Table 3.11 with indicator dummy codes added

Chapter 4 tab4_01.* Table 4.1 Infection Severity and Treatment Outcome Treatment Outcome: 0 = survived 1 = died tab4_14. Table 4.14 Infection severity, Treatment outcome and Hospital Attended Hospital: 1 = hospital A 2 = hospital B 3 = hospital C tab4.14d. Table 4.14 Infection severity, Treatment outcome and Hospital Attended including dummy codes

logis. Child witness data: copy of tab3_11, but includes prosecution logis_d. Child witness data: copy of tab3_11d, but includes prosecution

                             logis.por and logis_d.por provide the data to
                             obtain the parameters calculated in the book
                             (pages 147 to 152). It should be noted that
                             these differ slightly to the parameters
                             obtained using the data sets logis.dat and
                             logis_d.dat, as the *.dat files only record
                             the variable 'coherence' to 2 decimal places.

Chapter 5

tab5_01.* Table 5.1 Job Satisfaction for doctors and dentists tab5_04. Table 5.4 Race, Housing and Illness tab5_07. Table 5.7 Dopamine and psychosis: integer scoring tab5_08. Table 5.8 Dopamine and psychosis: mid-ranks scoring tab5_10. Table 5.10 Treatment and Depression: integer scoring tab5_11. Table 5.11 Treatment and depression: mid-ranks scoring tab5_13. Table 5.13 Alcohol consumption and Libido: integer scores tab5_16. Table 5.16 Alcohol consumption and libido: low vs medium or high tab5_17. Table 5.17 Alcohol consumption and libido: medium vs high

Chapter 6

tab6_11.* Table 6.11 Child witness example data set

File: ../data/hutsof99/logis.dat

Note: changes from Errata.txt where not included!

Information about the dataset CLASSTYPE: numeric CLASSINDEX: none specific

Names
Age,Gender,Location,Coherence,Maturity,Delay,Prosecute,Quality,
Types
  1. nominal:0,1
  2. nominal:0,1
  3. nominal:1,2,3,4
  4. numeric
  5. numeric
  6. numeric
  7. nominal:0,1
  8. numeric
Data (first 10 data points)
    Age Gender Loca... Cohe... Matu... Delay Pros... Qual...
    0.0 0.0 3.0 3.81 3.62 45.0 0.0 34.11
    0.0 1.0 2.0 1.63 1.61 27.0 0.0 36.59
    0.0 0.0 1.0 3.54 3.63 102.0 0.0 37.23
    0.0 1.0 2.0 4.21 4.11 39.0 0.0 39.65
    0.0 0.0 3.0 3.3 3.12 41.0 0.0 42.07
    0.0 1.0 3.0 2.32 2.13 70.0 1.0 44.91
    0.0 1.0 4.0 4.51 4.31 72.0 0.0 45.23
    0.0 1.0 2.0 3.18 3.08 41.0 0.0 47.53
    0.0 0.0 1.0 2.66 2.72 13.0 0.0 45.81
    0.0 1.0 3.0 4.7 4.98 39.0 0.0 49.38
    ... ... ... ... ... ... ... ...
Description

A gzip'ed tar containing StatLib datasets (statlib-20050214.tar.gz, 12,785,582 Bytes)

URLs
(No information yet)
Publications
    Data Source
    http://lib.stat.cmu.edu/datasets/
    Measurement Details
    Usage Scenario
    revision 1
    by mldata on 2010-11-06 10:00

    No one has posted any comments yet. Perhaps you would like to be the first?

    Leave a comment

    To post a comment, please sign in.

    This item was downloaded 2875 times and viewed 1911 times.

    No Tasks yet on dataset statlib-20050214 hutsof99_logis

    Submit a new Task for this Data item

    Data

    Sort by

    Disclaimer

    We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.

    Data | Task | Method | Challenge

    Acknowledgements

    This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
    PASCAL Logo
    http://www.pascal-network.org/.