The elements of big data analytics has roots in statistics, knowledge management, and computer science. Many of the data mining terms below appear in these disciplines but may have different connotation or specialized meaning when applied to our problems. The problems of massive parallel processing and the specialized algorithms employed to perform analysis in a… Continue reading Dictionary of Data Mining Terms
An agglomeration of datasources, resources, books, links, videos, tutorials, and introductory information for data mining across all domains–machine learning, natural language, semantic analysis, artificial intelligence, image processing, sound processing, video analysis, and others.
2.5TB, 53.5 Billion Clicks Dataset Available for Clickstream Analysis
To foster the study of the structure and dynamics of Web traffic networks, Indiana University has made available a large dataset (‘Click Dataset’) of about 53.5 billion HTTP requests made by users at Indiana University. Gathering anonymized requests directly from the network rather than relying on server logs and browser instrumentation allows one to examine large volumes of… Continue reading 2.5TB, 53.5 Billion Clicks Dataset Available for Clickstream Analysis