Previous Chapter: Johannes Gehrke Processing Aggregate Queries over Continuous Data Streams
Suggested Citation: "ABSTRACT OF PRESENTATION." National Research Council. 2004. Statistical Analysis of Massive Data Streams: Proceedings of a Workshop. Washington, DC: The National Academies Press. doi: 10.17226/11098.

Abstract of Presentation

Processing Aggregate Queries over Continuous Data Streams Johannes Gehrke, Cornell University

In this talk, I will describe techniques for giving approximate answers for aggregate queries over data streams using probabilistic “sketches” of the data streams that give approximate query answers with provable error guarantees. I will introduce sketches and then talk about two recent technical advances, sketch partitioning and sketch sharing. In sketch partitioning, existing statistical information about the stream is used to significantly decrease error bounds. Sketch sharing allows one to improve the overall space utilization among multiple queries. I will conclude with some open research problems and challenges in data stream processing.

Part of this talk describes joint work with Al Demers, Alin Dobra, and Mirek Riedewald at Cornell and Minos Garofalakis and Rajeev Rastogi at Lucent Bell Labs.

Suggested Citation: "ABSTRACT OF PRESENTATION." National Research Council. 2004. Statistical Analysis of Massive Data Streams: Proceedings of a Workshop. Washington, DC: The National Academies Press. doi: 10.17226/11098.
Page 251
Next Chapter: TRANSCRIPT OF PRESENTATION
Subscribe to Email from the National Academies
Keep up with all of the activities, publications, and events by subscribing to free updates by email.