Working with Categorical Variables with Multiple Levels: Python, Scikit-Learn, Multiple Correspondence Analysis

Posted on Mon 31 December 2018 in posts • Tagged with Data Cleaning, Python, Scikit-Learn, MCA

Working with categorical variables that have a small number of classes (levels) can be a pleasant surprise from a data cleaning aspect for the data scientist/analyst just trying to get to next phase of their analysis. But sooner or later that one column with an unwieldy amount of classes will come along and slap you upside the head.

Continue reading

Data Visualization: dc.js/d3.js

Posted on Wed 11 July 2018 in posts • Tagged with Data Visualization, d3.js, dc.js

d3.js has been the data visualization gold standard for good reason since it's creation in 2011 by Mike Bostock and company. It's allowed practitioners to make elegant and stunning designs all the while proving to be flexible and adaptable to future technologies.

With the growing popularity of portable notebook designs like jupyter, Mike Bostock and d3 have answered the call with Observable
Continue reading