Scuba: Diving into Data at Facebook

Speaker: Janet Wiener , Facebook

Date: Wednesday, April 23, 2014

Time: 4:00 PM to 5:00 PM Note: all times are in the Eastern Time Zone

Refreshments: 3:45 AM

Public: Yes

Location: 32-G449

Event Type:

Room Description:

Host: MIT Big Data Initiative at CSAIL, MIT, CSAIL

Contact: Susana Kevorkova, 617-324-8424, skevorkova@csail.mit.edu

Relevant URL: http://bigdata.csail.mit.edu/talks

Speaker URL: None

Speaker Photo:
None

Reminders to: csail-related@lists.csail.mit.edu, seminars@csail.mit.edu

Reminder Subject: TALK: Scuba: Diving into Data at Facebook

ABSTRACT: Facebook engineers query multiple databases to monitor and analyze Facebook products and services. The fastest of these databases is Scuba, which achieves sub second query response time and latencies of under a minute from events occurring (a client request on a phone, a bug report filed, a code change checked in) to graphs showing those events on engineers’ monitors.
Scuba is a fast, scalable, distributed, in-memory database built at Facebook. It currently ingests millions of rows (events) per second and expires data at the same rate. Scuba stores data completely in memory on hundreds of servers each with 144 GB RAM. To process each query, Scuba aggregates data from all servers. Scuba processes almost a million queries per day. Scuba is used extensively for interactive, ad hoc, analysis queries that run in under a second over live data. In addition, Scuba is the workhorse behind Facebook’s code regression analysis, bug report monitoring, ads revenue monitoring, and performance debugging.
This talk will include content from papers in VLDB 2013 and Sigmod 2014.

BIO: Janet Wiener is a software engineer at Facebook, where she works on Scuba and other data analysis tools. She also teaches Facebook employees how to make product decisions and trouble shoot live systems issues by asking questions, running experiments, and using data tools to collect and analyze the data. Her previous work includes database algorithms, distributed systems performance, and web exploration at
Stanford University, DEC, Compaq, and HP. She earned a PhD in databases from the U. of Wisconsin-Madison in 1995 and a BA from Williams College in 1989.

Research Areas:

Impact Areas:

See other events that are part of the Big Data Lecture Series 2013/2014.

Created by Susana Kevorkova Email at Tuesday, April 01, 2014 at 12:55 PM.