Quote:
Originally Posted by jjshabado
Thanks jj, yes I'm familiar with Hilary Mason, having watched her excellent intro to machine learning on O'Reilly probably a year and a half a ago. I've also read Drew Conway's Machine Learning for Hackers. They're smart coders.
That being said we're being pulled, based on the data and what's proving to more effective in recommendation tests, into an event-driven system. This means that for the most part hadoop doesn't really fit well, and the recommender is actually a small part of delivering recommendations ( a flexible and time aware data model, concurrent program execution, fast response to incoming events are equally important).
Imagine the difference as two different jobs.
In one job you are trying to design a poker bot to decide what to do at the poker table, and are tracking recent history to decide actions (is someone on tilt?). This system should be designed to be event driven.
In another scenario you are tracking monthly sales for a restaurant to help decide food order size. There are some things (like sporting events) that cause fluctuations, but there's a schedule ahead of time, so you know about those things. The sales fluctuate monthly in predictable long term patterns. This system is perfect for Hadoop.