I need some artificial datasets, namely "Two-Spiral", "Cluster-inside-Cluster", "Half-Kernel", "crescent-full-moon", and "outlier", for machine learning purposes.
Is there any guide/package/source-code in MATLAB?
Because I thought it would be useful to have these kinds of datasets available, and because it would be a fun exercise, I wrote some functions to generate random datasets that are very similar to the ones shown in your picture. There are a bunch of options to control the number of instances, the amount of noise, etc. The output of each function is an N-by-3 matrix, where each row contains the X and Y coordinates and the class of an instance.
This is what the output looks like:
I did it in 6 scripts of 30-40 lines each. I uploaded the scripts to the MATLAB File Exchange, but they haven't been reviewed yet. For now, you can get the files here. There are barely any comments in this first version, but I hope the code is self-explanatory. There is also a demo script (datasetsdemo.m) that runs all the scripts and produces the image shown above.
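To give an idea of the output format without downloading anything, here is a minimal sketch of a two-spiral generator that returns the same kind of N-by-3 matrix (X, Y, class). This is not the code from the uploaded scripts: the function name `twoSpiralsSketch`, its two parameters, and their default values are assumptions made up for this illustration.

```matlab
function data = twoSpiralsSketch(n, noise)
% twoSpiralsSketch  Hypothetical helper, NOT one of the uploaded scripts.
%   data = twoSpiralsSketch(n, noise) returns a 2n-by-3 matrix whose rows
%   are [x, y, class], with n points on each spiral arm.
if nargin < 1, n = 500; end       % points per class (assumed default)
if nargin < 2, noise = 0.5; end   % std. dev. of Gaussian jitter (assumed default)

% Take the square root of a uniform variate so the points are spread
% roughly evenly along the arm instead of piling up near the center.
theta = sqrt(rand(n, 1)) * 3 * pi;    % angles covering up to 1.5 turns

% Archimedean spiral: the radius grows linearly with the angle.
arm = [theta .* cos(theta), theta .* sin(theta)];

% The second arm is the first one rotated by 180 degrees (negated
% coordinates); Gaussian noise is then added to every point.
xy = [arm; -arm] + noise * randn(2 * n, 2);

labels = [zeros(n, 1); ones(n, 1)];   % class 0 = first arm, class 1 = second
data = [xy, labels];
end
```

Saved as `twoSpiralsSketch.m`, it could be called and plotted like this:

```matlab
data = twoSpiralsSketch(500, 0.5);
scatter(data(:,1), data(:,2), 10, data(:,3), 'filled');  % color by class
axis equal
```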