Category Archives: Open Source

eBay’s open-source program

Enhancing the User Experience of the Hadoop Ecosystem

  At eBay, we have multiple large, multi-tenant clusters. Each of these clusters stores hundreds of petabytes of data. These clusters offer tens of thousands of cores to run computations on the data. We have thousands of internal users who use Hadoop in their roles, including data analysts, data scientists, engineers, and product managers. These
Continue Reading »

A Creative Visualization of OLAP Cuboids

Background eBay is one of the world’s largest and most vibrant marketplaces with 1.1 billion live listings every day, 169 million active buyers, and trillions of rows of datasets ranging from terabytes to petabytes. Analyzing such volumes of data required eBay’s Analytics Data Infrastructure (ADI) team to create a fast data analytics platform for this
Continue Reading »

Ready-to-use Virtual-machine Pool Store via warm-cache

Problem overview Conventional on-demand Virtual Machine (VM) provisioning methods on a cloud platform can be time-consuming and error-prone, especially when we need to provision VMs in large numbers quickly. The following list captures different issues that we often encounter while trying to provision a new VM instance on the fly: Insufficient availability of compute resources
Continue Reading »