Approachable Data Mining Tutorials for the Non Data Miner

A list of several sources to learn data science in a hands-on format:
https://www.coursera.org/course/ml – The most approachable machine learning course available. And it’s free.
https://www.kaggle.com/wiki/Tutorials – Provides data sources, forums, scenarios, and real-world competitions to teach data mining
http://deeplearning.net/tutorial/ – Tutorial on Deep Learning – introduction to machine learning image analysis algorithms
http://tryr.codeschool.com/ – Interactive introduction to…

Dictionary of Data Mining Terms

The elements of big data analytics have roots in statistics, knowledge management, and computer science. Many of the data mining terms below appear in these disciplines but may carry different connotations or specialized meanings when applied to our problems. The problems of massive parallel processing and the specialized algorithms employed to perform analysis in a…

2.5TB, 53.5 Billion Clicks Dataset Available for Clickstream Analysis

To foster the study of the structure and dynamics of Web traffic networks, Indiana University has made available a large dataset (‘Click Dataset’) of about 53.5 billion HTTP requests made by users at Indiana University. Gathering anonymized requests directly from the network rather than relying on server logs and browser instrumentation allows one to examine large volumes of…