Going Big on Big Data
The hot topic in computer science this year is Big Data. At MIT, I've recently started two "big data" initiatives: the Intel Science and Technology Center (ISTC) in Big Data, and BigData@CSAIL, and industry- sponsored consortium. In this talk, I'll give a bit of background about what big data is, why industry is so excited about it, and my own perspective on the challenging research problems it brings up. I'll also describe a couple of "big data" research projects we've been doing, including BlinkDB, an approximate query processor built on Apache Hive, and DataHub, a new system helping teams of researchers collaborate on data sets.
Samuel Madden is a Professor of Electrical Engineering and Computer Science in MIT's Computer Science and Artificial Intelligence Laboratory. His research interests include databases, distributed computing, and networking. Research projects include the C-Store column-oriented database system, the CarTel mobile sensor network system, and the Relational Cloud "database-as-a-service". Madden is a leader in the emerging field of "Big Data", heading the Intel Science and Technology Center (ISTC) for Big Data, a multi-university collaboration on developing new tools for processing massive quantities of data. He also leads BigData@CSAIL, an industry-backed initiative to unite researchers at MIT and leaders from industry to investigate the issues related to systems and algorithms for data that is high rate, massive, or very complex.
Madden received his Ph.D. from the University of California at Berkeley in 2003 where he worked on the TinyDB system for data collection from sensor networks. Madden was named one of Technology Review's Top 35 Under 35 in 2005, and is the recipient of several awards, including an NSF CAREER Award in 2004, a Sloan Foundation Fellowship in 2007, best paper awards in VLDB 2004 and 2007, MobiCom 2006, SenSys 2009, CIDR 2013, and EuroSys 2013.