Dataset Classification

1. Classification by Domain

Domain                                  Datasets

Social Science                          AAUP, Census income, Coal disasters, Detroit

Natural Science/Engineering
  Botany/Agriculture/Biology/Medical    Acorns, Cereal, Iris, Niche-ll, Scanbio, Ohsumed
  Computer Science                      Netperf, Webstats, Synt2k
  Mechanics                             Cars, Rubber, Uvw
  Space Science                         Astronomy, Out5d, SkyServer, Superpose, Venus, Voyager

Others or Unknown                       Swanson

 

2. Classification by Dimensions

Dimensions            Datasets (number of dimensions)

Small: 1-10           Acorns (4), Iris (4), Coal disasters (5), Out5d (5), Rubber (6), Uvw (6), Detroit (7), Netperf (7), Cars (7), Swanson (7), Venus (7), Scanbio (8), Synt2k (8)

Medium: 11-50         Cereal (11), Astronomy (12), Voyager (12), AAUP (14), Webstats (22), Niche-ll (25), Census income (42)

Large: 51 and above   Superpose (57), Ohsumed (215), SkyServer (361)
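
The dimension bands above amount to a simple threshold lookup. The following is a minimal Python sketch of that rule, assuming the bands are inclusive (1-10, 11-50, 51 and above). The function name `dimension_class` is hypothetical, and the sample counts are copied from the table.

```python
def dimension_class(n_dims: int) -> str:
    """Return the size band for a dataset with n_dims dimensions.

    Assumed inclusive bands: Small 1-10, Medium 11-50, Large 51 and above.
    """
    if n_dims < 1:
        raise ValueError("a dataset needs at least one dimension")
    if n_dims <= 10:
        return "Small"
    if n_dims <= 50:
        return "Medium"
    return "Large"


# A few dimension counts copied from the table above.
samples = {"Iris": 4, "Cereal": 11, "Census income": 42, "SkyServer": 361}
for name, dims in samples.items():
    print(f"{name}: {dimension_class(dims)}")
```

Keeping the thresholds in one function makes it easy to re-band the datasets if the cut-offs are ever revised.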

 

 

3. Classification by Records

Records                 Datasets (number of records)

Small: 1-500            Detroit (13), Rubber (30), Webstats (30), Acorns (39), Cereal (77), Iris (150), Netperf (179), Coal disasters (191), Ohsumed (298), Cars (392), Astronomy (500)

Medium: 501-5000        Voyager (744), Superpose (1000), AAUP (1161), Scanbio (1356), Swanson (1875)

Large: 5001 and above   Venus (8784), Out5d (16384), Synt2k (16384), Niche-ll (49324), Census income (95130), Uvw (149769), SkyServer (158426)
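
As a consistency check, the record counts can be grouped into bands programmatically. This is a hedged Python sketch, assuming the Medium band covers 501 through 5000 and Large starts at 5001. The names `record_class` and `group_by_band` are hypothetical, and the counts are copied from the table above.

```python
from bisect import bisect_left
from collections import defaultdict

# Inclusive upper bounds of the Small and Medium bands; anything above is Large.
# Assumption: Medium ends at 5000 so that Large can start at 5001.
UPPER_BOUNDS = [500, 5000]
BANDS = ["Small", "Medium", "Large"]

# Record counts copied from the table above.
RECORDS = {
    "Detroit": 13, "Rubber": 30, "Webstats": 30, "Acorns": 39,
    "Cereal": 77, "Iris": 150, "Netperf": 179, "Coal disasters": 191,
    "Ohsumed": 298, "Cars": 392, "Astronomy": 500, "Voyager": 744,
    "Superpose": 1000, "AAUP": 1161, "Scanbio": 1356, "Swanson": 1875,
    "Venus": 8784, "Out5d": 16384, "Synt2k": 16384, "Niche-ll": 49324,
    "Census income": 95130, "Uvw": 149769, "SkyServer": 158426,
}


def record_class(n_records: int) -> str:
    """Map a record count to its band via binary search on the upper bounds."""
    return BANDS[bisect_left(UPPER_BOUNDS, n_records)]


def group_by_band(records: dict) -> dict:
    """Group dataset names by record-count band, preserving input order."""
    groups = defaultdict(list)
    for name, count in records.items():
        groups[record_class(count)].append(name)
    return dict(groups)


groups = group_by_band(RECORDS)
for band in BANDS:
    print(f"{band} ({len(groups[band])}): {', '.join(groups[band])}")
```

Using `bisect_left` on the inclusive upper bounds puts a count equal to a bound (for example Astronomy at 500) in the lower band, which matches how the table lists it.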