Argonne National Laboratory Mathematics and Computer Science Division
Seminars & Events

Department of Computer Science
"Large-scale Data Management for the Sciences"

DATE: May 11, 2007
TIME: 2:30pm
SPEAKER: Tanu Malik, Johns Hopkins University
LOCATION: Ryerson 251, University of Chicago
HOST: Ian Foster

Description:
Traditional enterprises and novel scientific applications are accumulating petabyte-scale datasets, which makes the need for large-scale data management more pressing than ever. The geographic distribution of these datasets, together with complex demands on the data, makes large-scale data management challenging. This is especially true for sciences that model complex physical and biological phenomena using data from multiple sources.

In this talk I will address two critical problems in scientific data management: combining a large number of diverse data sources to execute scientific queries, and executing data-intensive scientific queries on these data sources efficiently in terms of both network and I/O. I will present SkyQuery, a system that federates data from several petabyte-size, autonomous, and heterogeneous astronomy databases scattered worldwide. Using SkyQuery, scientists can write declarative queries that compare and merge multiple astronomical datasets. For efficient query execution and scalability, I will present Bypass-Yield Caching, a novel caching framework for database systems that dramatically reduces the network bandwidth requirements of data-intensive federations such as SkyQuery, making them good network citizens. Distributed applications such as the Bypass-Yield Cache often rely on a priori knowledge of query cardinalities to make optimization decisions. In this context, I will present a black-box approach to selectivity estimation that is suitable for distributed applications.

The success of SkyQuery and its adoption by the National Virtual Observatory show how data management systems can enable scientific endeavors.

More Information:
The talk will be followed by refreshments in Ryerson 255. People in need of assistance should call 773-834-8977 in advance.

For information on future CS talks: http://www.cs.uchicago.edu/events



