View uci-20070111 ionosphere (public)

2010-11-06 09:58 by mldata | Version 1 | Rating Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star
Rating
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Overall (based on 0 votes)
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Interesting
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Documentation
Summary

(No information yet)

License
unknown (from Weka repository)
Dependencies
Tags
arff slurped Weka
Attribute Types
Integer,Floating Point,String
Download
# Instances: 351 / # Attributes: 35
HDF5 (122.0 KB) XML CSV ARFF LibSVM Matlab Octave

Files are converted on demand and the process can take up to a minute. Please wait until download begins.

Completeness of this item currently: 55%.
You can edit this item to add more meta information and make use of the site's premium features.
Original Data Format
arff
Name
ionosphere
Version mldata
0
Comment
  1. Title: Johns Hopkins University Ionosphere database

  2. Source Information: -- Donor: Vince Sigillito (vgs@aplcen.apl.jhu.edu) -- Date: 1989 -- Source: Space Physics Group Applied Physics Laboratory Johns Hopkins University Johns Hopkins Road Laurel, MD 20723

  3. Past Usage: -- Sigillito, V. G., Wing, S. P., Hutton, L. V., & Baker, K. B. (1989). Classification of radar returns from the ionosphere using neural networks. Johns Hopkins APL Technical Digest, 10, 262-266.

    They investigated using backprop and the perceptron training algorithm on this database. Using the first 200 instances for training, which were carefully split almost 50% positive and 50% negative, they found that a "linear" perceptron attained 90.7%, a "non-linear" perceptron attained 92%, and backprop an average of over 96% accuracy on the remaining 150 test instances, consisting of 123 "good" and only 24 "bad" instances. (There was a counting error or some mistake somewhere; there are a total of 351 rather than 350 instances in this domain.) Accuracy on "good" instances was much higher than for "bad" instances. Backprop was tested with several different numbers of hidden units (in [0,15]) and incremental results were also reported (corresponding to how well the different variants of backprop did after a periodic number of epochs).

    David Aha (aha@ics.uci.edu) briefly investigated this database. He found that nearest neighbor attains an accuracy of 92.1%, that Ross Quinlan's C4 algorithm attains 94.0% (no windowing), and that IB3 (Aha & Kibler, IJCAI-1989) attained 96.7% (parameter settings: 70% and 80% for acceptance and dropping respectively).

  4. Relevant Information: This radar data was collected by a system in Goose Bay, Labrador. This system consists of a phased array of 16 high-frequency antennas with a total transmitted power on the order of 6.4 kilowatts. See the paper for more details. The targets were free electrons in the ionosphere. "Good" radar returns are those showing evidence of some type of structure in the ionosphere. "Bad" returns are those that do not; their signals pass through the ionosphere.

Received signals were processed using an autocorrelation function whose arguments are the time of a pulse and the pulse number. There were 17 pulse numbers for the Goose Bay system. Instances in this databse are described by 2 attributes per pulse number, corresponding to the complex values returned by the function resulting from the complex electromagnetic signal.

  1. Number of Instances: 351

  2. Number of Attributes: 34 plus the class attribute -- All 34 predictor attributes are continuous

  3. Attribute Information:
    -- All 34 are continuous, as described above -- The 35th attribute is either "good" or "bad" according to the definition summarized above. This is a binary classification task.

  4. Missing Values: None

Names
a01,a02,a03,a04,a05,a06,a07,a08,a09,a10,
Types
  1. numeric
  2. numeric
  3. numeric
  4. numeric
  5. numeric
  6. numeric
  7. numeric
  8. numeric
  9. numeric
  10. numeric
Data (first 10 data points)
    a01 a02 a03 a04 a05 a06 a07 a08 a09 a10 ...
    1 0 0.99... -0.05... 0.85... 0.02... 0.83... -0.37... 1.0 0.0376 ...
    1 0 1.0 -0.18... 0.93... -0.36... -0.10... -0.93... 1.0 -0.04... ...
    1 0 1.0 -0.03... 1.0 0.00... 1.0 -0.12... 0.88... 0.01... ...
    1 0 1.0 -0.45... 1.0 1.0 0.71... -1.0 0.0 0.0 ...
    1 0 1.0 -0.02... 0.9414 0.06... 0.92... -0.23... 0.77... -0.16... ...
    1 0 0.02... -0.00... -0.09... -0.11... -0.00... -0.11... 0.14... 0.06... ...
    1 0 0.97... -0.10... 0.94... -0.208 0.92... -0.28... 0.85... -0.27... ...
    0 0 0.0 0.0 0.0 0.0 1.0 -1.0 0.0 0.0 ...
    1 0 0.96... -0.07... 1.0 -0.14... 1.0 -0.21... 1.0 -0.36... ...
    1 0 -0.01... -0.08... 0.0 0.0 0.0 0.0 0.1147 -0.26... ...
    ... ... ... ... ... ... ... ... ... ... ...
Description

A gzip'ed tar containing UCI and UCI KDD datasets (uci-20070111.tar.gz, 17,952,832 Bytes)

URLs
(No information yet)
Publications
    Data Source
    http://www.ics.uci.edu/~mlearn/MLRepository.html http://kdd.ics.uci.edu/
    Measurement Details
    Usage Scenario
    revision 1
    by mldata on 2010-11-06 09:58

    No one has posted any comments yet. Perhaps you would like to be the first?

    Leave a comment

    To post a comment, please sign in.

    This item was downloaded 2762 times and viewed 2046 times.

    No Tasks yet on dataset uci-20070111 ionosphere

    Submit a new Task for this Data item

    Data

    Sort by

    Disclaimer

    We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.

    Data | Task | Method | Challenge

    Acknowledgements

    This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
    PASCAL Logo
    http://www.pascal-network.org/.