NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

National Research Council (US) Committee on Applied and Theoretical Statistics. Steps Toward Large-Scale Data Integration in the Sciences: Summary of a Workshop. Washington (DC): National Academies Press (US); 2010.

Cover of Steps Toward Large-Scale Data Integration in the Sciences

Steps Toward Large-Scale Data Integration in the Sciences: Summary of a Workshop.

Show details

AWorkshop Agenda

NATIONAL ACADEMIES

WASHINGTON, D.C.

Wednesday, August 19, 2009

8:00 amContinental breakfast available
8:30Chair’s opening remarks
 Michael Stonebraker, MIT
Data Integration Stretch Goals, Technical Needs, and Policy Issues—Views From Various Domains
9:00Geospatial data
 Keith Clarke, University of California, Santa Barbara
9:20Life sciences
 Carl Kesselman, University of Southern California
9:40Physics
 Tim Frazier, Lawrence Livermore National Laboratory
10:00Astronomy
 Alex Szalay, Johns Hopkins University
10:20Earth sciences
 Tom Karl, National Oceanic and Atmospheric Administration
10:40Research libraries
 Clifford Lynch, Coalition for Networked Information
11:00Break
11:20Open discussion
Working Lunch: Agency Perspectives
Noon10–15 minutes apiece from sponsors and other agencies
 James St. Pierre, National Institute of Standards and Technology
 Michael Marron, National Institutes of Health
 Ed Seidel, National Science Foundation
State of the Art in Data Integration—Structured Data
1:20 pmData federations
 Laura Haas, IBM Almaden Research Center
1:40Data type conversion and ETL technology
 Lee Scheffler, IBM Information Integration Solutions
2:00Automatic conversion
 Michael Siegel, MIT
2:20The SciDB approach
 David Maier, Portland State University
2:40Linked open data
 Orri Erling, OpenLink
3:00Microsoft approach to data conversion
 Phil Bernstein, Microsoft
03:20Break
State of the Art Data Integration Solutions—Semistructured Data
3:40Google Approach
 Alon Halevy, Google
4:00Yahoo! Approach
 Raghu Ramakrishnan, Yahoo!
4:20MapReduce/Hive/Pig paradigm
 Amr Awadallah, Cloudera
Policy Perspectives
4:40Policy perspective
 Michael Nelson, Georgetown University
5:00Policy perspective
 Christopher Greer, National Coordination Office for Networking and InformationTechnology R&D
5:20Lessons from a large-scale information integration ecosystem
 Michael Brodie, Verizon
5:40Business perspective
 Josephine Cheng, IBM Almaden Research Center
6:00Develop organization for Day Two 6:30
6:30Working dinner

Thursday, August 20, 2009

Open Brainstorming
8:30 amReflections on Day One
9:00Identify three areas for detailed and moderated discussion
Topic Area One
9:15Open discussion
10:00Break
Topic Area Two
10:30Open discussion
Topic Area Three
11:15Open discussion
NoonAdjourn public workshop
12:30 pmPlanning committee lunch/executive session
3:00Planning committee adjourns
Copyright © 2010, National Academy of Sciences.
Bookshelf ID: NBK45668

Views

  • PubReader
  • Print View
  • Cite this Page
  • PDF version of this title (313K)

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...