Introduction
Weka is a collection
of machine learning algorithms for data mining tasks. Weka contains tools
for data preprocessing, classification, regression, clustering, association
rules and visualization. Since the Weka system is open source (convered
by the GNU General Public License), people can modify the Weka system for
their use, as seen in the large list of Weka related projects on their website.
This site provides a modified version of the Weka system, which has some
more filters for preprocessing, an integrated multivariate visualization
system and tools for similarity analysis of time series datasets.
The project started as a way to learn the Weka system for the Data
Mining class at WPI (Worcester
Polytechnic Institute). As the different topics of Data Mining were covered
in the class, the need to understand the system in terms of the implementation
of the algorithms and the complete framework as a whole arised. Initially,
the aim was to implement filters to aid in the preprocessing of the dataset.
But, as the course progressed and more internals of the system was learnt,
it became sort of a passion to add more utilities to the Weka system.
[Back to top]
-
Binary Distribution - weka.zip (approx. 1.76 MB)
- Source Distribution - weka-src.zip (approx. 2.0 MB)
[Back to top]
The Weka environment lacks
a standard module registration procedure. Hence, the distribution packages
the modified modules with the Weka system.
- Install Jausoft's Java binding for OpenGL (GL4Java) (Note that this
might be slightly involved).
- Unzip the distribution file.
- If you have downloaded the binary package, you can use the RunWeka
batch file to run the system. If you have downloaded the source package,
look at the build instructions to build and run the system.
[Back to top]
Unzip the source zip file to a separate directory. This will create the
src directory which has the weka, xmdv and files created by me in separate
directories. The build.xml ant file would also have been unzipped to the
directory. If you have ant installed and the system paths set, then you
should be able to compile the complete directory tree by running ant from
the command prompt. The class files will be created in the bin directory.
You can then run the Modifies Weka system by specifying java weka.gui.GUIChooser
from the bin directory on the command line.
[Back to top]