In a follow-up to this answer I want to ask if any of you know any good (and more importantly easy to understand) tutorials and / or examples of data mining with the Weka toolkit.
I've been very interested in Data Mining ever since I've first heard of it and the things it can do, I've also have some experiments I'd like to do with some of my data and I've already bought four books and I found specially interesting the following two:
Data Mining http://ecx.images-amazon.com/images/I/61DhYb1Z6QL._BO2,204,203,200_PIsitb-sticker-arrow-click,TopRight,35,-76_AA240_SH20_OU01_.jpg
The last one is written by the same authors of Weka and contains a lot of examples but still, I found it a little hard to understand the logic and specially the math. My math skills are currently very rough, I plan to go to the University this year and hopefully I'll learn and be able to better understand the math involved, but until then I want to gain some practice in Data Mining.
Is there any step-by-step tutorial with example data I can read to get me started with the Weka toolkit?
weka is the best software to do data mining. We can use this as very easily for designe data mining algorithms. This software gives very clearly graphical out puts. Very easy to work with software because there is no anything have to do very hard.
II. WEKA: Weka (Waikato Environment for Knowledge Analysis) is a popular suite of machine learning software written in Java, developed at the University of Waikato, New Zealand. Weka is free software available under the GNU General Public License.
To build a classification model select the 'classify' tab on the top in the explorer dashboard. As you see there are multiple algorithms that are available here. I have decided to use the decision tree classification model. Select the decision tree option from here and you can see the results immediately.
When it comes to "applied" data mining, for the starters, you may not need to think about math at all. Weka is product of a university machine-learning project and offers 100+ algorithms. Contrast that with Microsoft SQL server SSAS which offers nine algorithms -- and they do not even bother to explain the math.
They both offer association, clustering, attribute selection, some kind of neural network. So, the trick is to understand what you are trying to achieve, not necessarily the math below. Try reading about Microsoft algorithms (good documentation) and see if you can figure out principles that SSAS and Weka have in common -- this should help you focus on basic principles and get you started.
There is a list of a few Weka tutorials here.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With