
Meetup: Bay Area Hadoop User Group (HUG) - July 2010
What: Bay Area Hadoop User Group (HUG) July Meetup
When: Wednesday, July 21, 2010 6:00 PM
Where:
Yahoo! Campus - Building C, 2nd Floor, Classroom 5
701 First Avenue
Sunnyvale CA 94089
Agenda:
* 6:00 - 6:30 - Socializing and Beers
* 6:30 - 7:00 - Online Content Optimization with Hadoop, Nitin Motgi, Yahoo!
* 7:00 - 7:30 - Hadoop at eBay, Anil Madan, eBay
* 7:30 - 8:00 - Introduction to Avro, Doug Cutting, Cloudera
* Q&A, Open Discussion
Sign up here:
http://www.meetup.com/hadoop/calendar/13546804
________________________________________
Online Content Optimization with Hadoop, Nitin Motgi, Yahoo!
We make extensive use of the Hadoop technology stack in our content optimization systems. Using Hadoop, we are able to scale to build models for millions of items and users in near real time. We leverage HBase for point lookups and stores of these models. We also use Pig to express our workflows, so that the MapReduce parallelism is abstracted out of the core processing.
Hadoop at eBay, Anil Madan, eBay
This talk will illustrate how eBay is leveraging its data assets to derive advanced insights and analytics.
Learn how eBay is sourcing huge volumes of data into the cluster and running clickstream and transactional data analysis for user behavior, search quality, and research use cases.
Anil Madan is the Director of Engineering at eBay, responsible for the Hadoop cluster build-out.
Introduction to Avro, Doug Cutting, Cloudera
Avro is a serialization system. It supports interoperable, efficient, dynamic data storage and RPC.
It's currently implemented in C, C++, Java, Python and Ruby.
Support for MapReduce over Avro data is being developed, and we expect Hadoop to eventually move to Avro for its RPC.