Pragmatic Programming Techniques
Pages
(Move to ...)
Home
▼
Sunday, August 17, 2014
Lambda Architecture Principles
›
"Lambda Architecture" (introduced by Nathan Marz) has gained a lot of traction recently. Fundamentally, it is a set of design pat...
3 comments:
Sunday, July 27, 2014
Incorporate domain knowledge into predictive model
›
As a data scientist / consultant, in many cases we are being called in to work with domain experts who has in-depth business knowledge of in...
Saturday, June 28, 2014
Interactive Data Visualization
›
Recently, "interactive report" is becoming a hot topic in data visualization. I believe it is becoming the next generation UI par...
Wednesday, March 12, 2014
Common Text Mining workflow
›
In this post, I want to summarize a common pattern that I have used in my previous text mining projects. Text mining is different in that ...
1 comment:
Monday, March 3, 2014
Estimating statistics via Bootstrapping and Monte Carlo simulation
›
We want to estimate some "statistics" (e.g. average income, 95 percentile height, variance of weight ... etc.) from a population. ...
Friday, December 27, 2013
Spark: Low latency, massively parallel processing framework
›
While Hadoop fits well in most batch processing workload, and is the primary choice of big data processing today, it is not optimized for ot...
1 comment:
Thursday, December 12, 2013
Escape local optimum trap
›
Optimization is a very common technique in computer science and machine learning to search for the best (or good enough) solution. Optimiza...
‹
›
Home
View web version