eBay Tech Blog

From the category archives:

Data Infrastructure and Services

HDFS Storage Efficiency using Tiered Storage

by Benoy Antony January 12, 2015 Data Infrastructure and Services

At eBay, we run Hadoop clusters comprised of thousands of nodes that are shared by thousands of users. We store hundreds of petabytes of data in our Hadoop clusters. In this post, we look at how to optimize big data storage based on frequency of data usage. This method helps reduce the cost in an […]

2 comments Read the full article →

Announcing Kylin: Extreme OLAP Engine for Big Data

by eBay Inc. Kylin Team October 20, 2014 Cloud

We are very excited to announce that eBay has released to the open-source community our distributed analytics engine: Kylin (http://kylin.io). Designed to accelerate analytics on Hadoop and allow the use of SQL-compatible tools, Kylin provides a SQL interface and multi-dimensional analysis (OLAP) on Hadoop to support extremely large datasets. Kylin is currently used in production […]

8 comments Read the full article →

NoSQL Data Modeling

by Donovan Hsieh October 10, 2014 Data Infrastructure and Services

Data modeling for RDBMS has been a well-defined discipline for many years. Techniques like logical to physical mapping and normalization / de-normalization have been widely practiced by professionals, including novice users. However, with the recent emergence of NoSQL databases, data modeling is facing new challenges to its relevance. Generally speaking, NoSQL practitioners focus on physical […]

5 comments Read the full article →
Copyright © 2011-2015 eBay Inc. All Rights Reserved - User Agreement - Privacy Policy - Comment Policy