Below are many downloadable free machine learning datasets. They cover click data, air traffic control data, surveys, temporal datasets of various types, crime data, employee pay data, map data, law data, and many other types.
I am a huge fan of SSPS, scikit-learn, opennlp, and other mainstream libraries but for quick analysis and visualization don’t forget about Pentaho data mining (
https://wiki.pentaho.com/display/DATAMINING/Pentaho+Data+Mining+Community+Documentation ) based on University of Waikato’s Weka (
https://www.cs.waikato.ac.nz/ml/index.html ). It can also be used with Pentaho Kettle to submit to a hadoop cluster and perform advanced multi-step analysis.
[insert page=’searchable-list-of-public-data-mining-datasets’ display=’content’]