Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What are some good machine learning programming exercises? [closed]

Ideally, they would have the following characteristics:

  1. They can be completed in just an evening of coding. It will not require a week or more to get interesting results. That way, I can feel like I've learned and accomplished something in just one (possibly several hour long) sitting.

  2. The problems are from the real world, or they are at least toy versions of a real world problems.

  3. If the problem requires data to test the solution, there are real-world datasets readily available, or it is trivial to generate interesting test data myself.

  4. It is easy to evaluate how good of a job I've done. When I test my solution, it will be clear from the results that I've accomplished something nontrivial, either by simple inspection, or by a quantifiable measure of the quality of the results.

like image 219
jonderry Avatar asked Nov 17 '10 23:11

jonderry


People also ask

Which is the best to learn machine learning?

Best 7 Machine Learning Courses in 2022:Machine Learning Crash Course — Google AI. Machine Learning with Python — Coursera. Advanced Machine Learning Specialization — Coursera* Machine Learning — EdX.

What is machine learning with real time example?

Image recognition is a well-known and widespread example of machine learning in the real world. It can identify an object as a digital image, based on the intensity of the pixels in black and white images or colour images. Real-world examples of image recognition: Label an x-ray as cancerous or not.

Is Python good for machine learning?

To reiterate, Machine Learning is simply recognizing patterns in your data to be able to make improvements and intelligent decisions on its own. Python is the most suitable programming language for this because it is easy to understand and you can read it for yourself.


2 Answers

Implement the following algorithms:

  • Perceptron, margin perceptron: you can try to detect images of faces (classify images of faces and non-faces) using any face database. Try for example the MIT CBCL face database. You can also try the MNIST data and write a poor man's OCR system.
  • LVQ, Kohonen map: you can try to compress images. You can download large images from any wallpaper site.
  • Naive bayes classifier: you can classify spam and not spam. There are also more scientific datasets, such as Reuters and Newsgroups, etc. which you have to determine the topic, given the article.
  • Backpropagation, multi layer perceptron: you can try this with the faces, or with the spam, or with the text/histogram data.
  • Primal SVM linear learning using SGD: you can try this with MNIST digits, for example.

There are a bunch of projects, some of them take a couple hours, some a couple of days, but you will definitely learn a lot.

like image 165
carlosdc Avatar answered Oct 09 '22 02:10

carlosdc


Check the UCI machine learning repository out for real datasets.

The Breast Cancer Wisconsin (Diagnostic) Data Set for example. Check the data set description for more information about it.

Even the Naive Bayes classifier will give great results on this dataset (over 95% cross-validated accuracy). With some variable selection you can even get to 100%, if I remember correctly.

like image 23
George Avatar answered Oct 09 '22 01:10

George