Downloads: Data Sets ( Renewed on 07/17/2009 )

1. Classification by Domain

Domain Dataset

Social Science

AAUP, AMEX A, Census income,
Coal disasters, Detroit, Energy, NASDAQ G, Parents, Poverty, Subway, US Money, US Population
Natural Science

/Engineering

Botany/Agriculture Acorns, Cereal, Iris, Niche-ll, Scanbio, Ohsumed
Computer Science Netperf, Webstats, Synt2k
Mechanics Cars, Rubber, Uvw
Space Science
Others or Unknown Htong, Swanson

2. Classification by Dimensions:

 

Dimensions Dataset

Small: 1-10

Acorns(04), Htong(04), Iris(04), Coal disasters(05), Out5d(05), US Population(05), Rubber(06), Uvw(06), Detroit(07), Netperf(07), Cars(07), Swanson(07), Venus(07), Parents(07), Scanbio(08), Synt2k(08) AMEX A(08), NASDAQ G(08)

Medium: 11- 50

Cereal(11), Poverty(11), Astronomy(12), Voyager(12), Energy(12), AAUP(14), Webstats(22), Niche-ll(25),Census income(42)

Large: 51+

Superpose(57), US Money(79), Subway(104), Ohsumed(215), SkyServer(361)

3. Classification by Records:

 

Records Dataset

Small: 1-500

Parents (12), Detroit (13), Rubber (30), Webstats (30), Acorns (39), Energy(51) US Money(64) US Population (75), Cereal (77), Iris (150), Netperf (179), Coal disasters (191), Ohsumed (298), Htong (365), Cars (392), Subway (423), Poverty (492), Astronomy (500)

Medium: 501- 5001

Voyager (744), Superpose (1000), AAUP (1161), Scanbio (1356),Swanson (1875)

Large: 5001+

Venus (8784), Out5d (16384), Synt2k (16384), Niche-ll (49324), Census income (95130), AMEX A (130236), Uvw (149769), SkyServer (158426), NASDAQ G (219070)