Category Archives: Open Source

eBay’s open-source program

A Creative Visualization of OLAP Cuboids

Background eBay is one of the world’s largest and most vibrant marketplaces with 1.1 billion live listings every day, 169 million active buyers, and trillions of rows of datasets ranging from terabytes to petabytes. Analyzing such volumes of data required eBay’s Analytics Data Infrastructure (ADI) team to create a fast data analytics platform for this
Continue Reading »

Ready-to-use Virtual-machine Pool Store via warm-cache

Problem overview Conventional on-demand Virtual Machine (VM) provisioning methods on a cloud platform can be time-consuming and error-prone, especially when we need to provision VMs in large numbers quickly. The following list captures different issues that we often encounter while trying to provision a new VM instance on the fly: Insufficient availability of compute resources
Continue Reading »

Secure Communication in Hadoop without Hurting Performance

  Apache Hadoop is used for processing big data at many enterprises. A Hadoop cluster is formed by assembling a large number of commodity machines, and it enables the distributed processing of data. Enterprises store lots of important data on the cluster. Different users and teams process this data to obtain summary information, generate insights,
Continue Reading »