NIST Big Data Working Group

The US National Institute of Standards and Technology (NIST) kicked off their Big Data Working Group on June 19th 2013.  The sessions have now been broken down into subgroups for Definitions, Taxonomies, Reference Architecture, and Technology Roadmap.  The charter for the working group: NIST is leading the development of a Big Data Technology Roadmap. This… Continue reading NIST Big Data Working Group

US Government Business Capture Data Mining in Microsoft Excel

View agency activity clustering on geography in Excel using Excel Data Mining Add-ins By Don Krapohl 1.       Ensure you have downloaded the Excel Data Mining Add-ins from Microsoft at http://www.microsoft.com/en-us/download/details.aspx?id=35578 .  The article assumes you have a working version of the DM Addins and a default Analysis Services (SSAS) instance defined.  Search for getting started… Continue reading US Government Business Capture Data Mining in Microsoft Excel

Predicting the Best Parameters for Federal Business Capture using WEKA

Which contract parameters should I choose? What combination of features might I pursue to raise my probability of contract award? Open WEKA explorer On pre-process tab find the government_contracts.arff file. Perform pre-processing Escape non-enclosure single- and double-quotes (\’, \”) if using a delimited text version. Check ‘UniqueTransactionID’ and click ‘Remove’.  Stating the obvious, there is… Continue reading Predicting the Best Parameters for Federal Business Capture using WEKA