Querying Big, Dynamic, Distributed Data

Seminar
Monday, February 18, 2013

Effective Big Data management and analysis poses several difficult challenges for modern database architectures. One key such challenge arises from the naturally streaming nature of big data, which mandates efficient algorithms for querying and analyzing massive, continuous data streams (that is, data that is seen only once and in a fixed order) with limited memory and CPU-time resources. Such streams arise naturally in emerging large-scale event monitoring applications; for instance, network-operations monitoring in large ISPs, where usage information from numerous sites needs to be continuously collected and analyzed for interesting trends. In addition to memory- and time-efficiency concerns, the inherently distributed nature of such applications also raises important communication-efficiency issues, making it critical to carefully optimize the use of the underlying network infrastructure. In this talk, we introduce the distributed data streaming model, and discuss some of our recent results on tracking complex queries over massive distributed streams, as well as new research directions in this space.

Speaker

Minos Garofalakis received the Diploma degree in Computer Engineering and Informatics (School of Engineering Valedictorian) from the University of Patras, Greece in 1992, and the MSc and PhD degrees in Computer Science from the University of Wisconsin-Madison in 1994 and 1998, respectively. He worked as a Member of Technical Staff at Bell Labs, Lucent Technologies in Murray Hill, NJ (1998-2005), as a Senior Researcher at Intel Research Berkeley in Berkeley, CA (2005-2007), and as a Principal Research Scientist at Yahoo! Research in Santa Clara, CA (2007-2008). In parallel, he also held an Adjunct Associate Professor position at the EECS Department of the University of California, Berkeley (2006-2008).As of October 2008, he is a Professor of Computer Science at the Department of Electronic & Computer Engineering of the Technical University of Crete, and the Director of the Software Technology and Network Applications Laboratory (SoftNet); he is also the current ECE Department Chair (2011-2013). Finally, he is a Member of the Board of Directors of Information Society, S.A., a Greek government organization for advancing large-scale, national-level informatics and automation projects. Prof. Garofalakis currently serves as an Editorial Board Member for Foundations and Trends in Databases and the Proceedings of the VLDB Endowment (PVLDB), and as an Associate Editor for Statistical Analysis and Data Mining (SAM) and Knowledge and Information Systems (KAIS). He was the Core Database Technology PC Chair for VLDB'2007, and recently completed terms as Associate Editor for ACM Transactions on Database Systems (ACM TODS, 2008-2011) and IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE, 2007-2011). He has also repeatedly served on the program committees of all major data-management conferences (including ACM SIGMOD, VLDB, ACM PODS, IEEE ICDE, and ACM SIGKDD).