October 11, 2010 – October 13, 2010

eScience Workshop 2010

Location: Berkeley, California, US

Monday, October 11

Keynote Presentations

Malcolm Atkinson, e-Science Institute

The global digital revolution provides a fertile and turbulent ecological environment in which e-Science is a small but vital element. There is a deep history of e-Science, but coining the term and injecting leadership and modest funds had a huge impact. A veritable explosion of activity has led to a global burst of new e-Science species. Our challenge is to understand what will enable them to thrive and yield maximum benefit as the digital revolution continues to be driven by commerce and media.

Webcast
This year, Microsoft Research presents the Jim Gray eScience Award to a researcher who has made an outstanding contribution to the field of data-intensive computing. The award—named for Jim Gray, a Technical Fellow for Microsoft Research and a Turing Award winner who disappeared at sea in 2007—recognizes innovators whose work truly makes science easier for scientists.

Webcast
Adam Bly, Seed

The future of science is open, not because it ought to be but because it needs to be. Today, science’s potential is hindered by the disconnected nature of the world’s scientific information and the closed architecture of science itself. So how do we get from here to there? How can technology make open science real?

Webcast

Tutorials

Microsoft Biology Foundation: An Open-Source Library of Re-usable Bioinformatics Functions and Algorithms Built on the .NET Platform

Webcast
Scientific Data Visualization using WorldWide Telescope

Webcast
Data-Intensive Research: Dataset Lifecycle Management for Scientific Workflow, Collaboration, Sharing, and Archiving

Webcast
Parallel Computing with Visual Studio 2010 and the .NET Framework 4

Webcast

Sessions

Webcast

Exploration of Real-Time Provenance-Aware Virtual Sensors Across Scales for Studying Complex Environmental Systems

Yong Liu, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign

Development and Application of Network of Geosensors for Environmental Monitoring

Rafael Santos, INPE – Brazilian National Institute for Space Research
Webcast

BLAST Atlas: A Function-Based Multiple Genome Browser

Lawrence Buckingham, Queensland University of Technology

DIVE: A Data Intensive Visualization Engine

Dennis Bromley, University of Washington
Webcast

Simplifying Oligonucleotide Primer Design Software to Keep Pace with an Ever Increasing Demand for Assay Formats

Kenneth “Kirby” Bloom, Illumina Corporation

Integration of Sequence Analysis into Third Dimension Explorer Leveraging the Microsoft Biology Framework

Jeremy Kolpak, Janssen Pharmaceutical Companies of Johnson & Johnson
Webcast

Achieving an Ecosystem Based Approach to Planning in the Puget Sound

Stephen Stanley, Washington Department of Ecology

Adapting Environmental Science Methods to Public Policy and Decision Support

Rob Fatland, Microsoft Research
Webcast

An Interactive Modeling Environment for Systems Biology of Aging

Pat Langley, Arizona State University
Webcast

Analyzing the Process of Knowledge Dynamics in Sustainability Innovation: Towards a Data-Intensive Approach to Sustainability Science

Masaru Yarime, University of Tokyo

Data-Intensive Science for Safety, Trust, and Sustainability

Shuichi Iwata, The University of Tokyo
Webcast

BL!P: A Tool to Automate NCBI BLAST Searches and Customize the Results for Exploration in Live Labs Pivot

Vince Forgetta, McGill University

GenoZoom: Browsing the genome with Microsoft Biology Foundation, Deep Zoom, and Silverlight

Xin-Yi Chua, Queensland University of Technology

Tuesday, October 12

Keynote Presentation

Philip Bourne, University of California, San Diego

Anyone can punch a hole in a piece of metal, but a reamer is needed to accurately size and finish that hole. Digital computers are the reamers of life, bringing together a vast array of disparate bits of data to provide an accurate picture of life that can be smoothly transcended across scales–from molecules to populations. Sounds heady, so why do we not fully understand the molecular basis of cancer? Why can’t we accurately model the impact of an oil spill on marine life? Why can’t we decide whether there is a tree of life or a network of life? “Well tonight we are going to sort it all out, for tonight it’s the reaming of life.”

Webcast

Sessions

Webcast

Data, Data, Everywhere, nor Any Drop to Drink: New Approaches to Finding Events of Interest in High Bandwidth Data Streams

Mark Abbott, Oregon State University

Extreme Database-centric Computing in Science

Alex Szalay, Johns Hopkins University
Webcast

Model-Driven Cloud Services for Cancer Research

Marty Humphrey, University of Virginia

Cloud-Based Map-Reduce Architecture for Nuclear Magnetic Resonance-Based Metabolomics

Paul Anderson, Wright State University
Webcast

MyExperimentalScience, Extending the “Workﬂow”

Jeremy Frey, University of Southampton

The Conversion Software Registry

Michal Ondrejcek, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign
Webcast

oreChem: Planning and Enacting Chemistry on the Semantic Web

Mark Borkum, University of Southampton

Accelerating Chemical Property Prediction with Cloud Computing

Hugo Hiden, Newcastle University
Webcast

Remote Computed Tomography Reconstruction Service on GPU-Equipped Computer Clusters Running Microsoft HPC Server 2008

Timur Gureyev, Commonwealth Scientific and Industrial Research Organisation (CSIRO)

e-LICO: Delivering Data Mining to the Life Science Community

Simon Jupp, University of Manchester
Webcast

SQL is Dead; Long Live SQL: Lightweight Query Services for Ad Hoc Research Data

Bill Howe, University of Washington

SinBiota 2.0 – Planning a New Generation Environmental Information System

João Meidanis, University of Campinas
Webcast

Enhancing the Quality and Trust of Citizen Science Data

Jane Hunter, The University of Queensland

Scientist-Computer Interfaces for Data-Intensive Science

Cecilia Aragon, Lawrence Berkeley National Laboratory

Enabling Scientific Discovery with Microsoft SharePoint

Kenji Takeda, University of Southampton
Webcast

Genome-Wide Association of ALS in Finland

Bryan Traynor, National Institute on Aging, National Institutes of Health

A Framework for Large-Scale Modelling of Population Health

John Ainsworth, University of Manchester

GREAT.stanford.edu: Generating Functional Hypotheses from Genome-Wide Measurements of Mammalian Cis-Regulation

Gill Bejerano, Stanford University
Webcast

Medici: A Scalable Multimedia Environment for Research

Joe Futrelle, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign

BlogMyData: A Virtual Research Environment for Collaborative Visualization of Environmental Data

Andrew Milsted, University of Southampton

RightField: Rich Annotation of Experimental Biology Through Stealth Using Spreadsheets

Matthew Horridge, University of Manchester
Webcast

musicSpace: Improving Access to Musicological Data

mc schraefel, University of Southampton

Quantifying Historical Geographic Knowledge from Digital Maps

Tenzing Shaw, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign

Data Intensive Research in Computational Musicology

David De Roure, Oxford e-Research Centre
Webcast

Scaling Information on ‘Biosphere Breathing’ from Chloroplast to the Globe

Dennis Baldocchi, University of California-Berkeley

Agrodatamine: Integrating Analysis of Climate Time Series and Remote Sensing Images

Humberto Razente, UFABC
Webcast

Correction for Hidden Confounders in Genetic Analyses

Jennifer Listgarten, Microsoft Research

BioPatML.NET and Its Pattern Editor: Moving into the Next Era of Biology Software

James Hogan, Queensland University of Technology
Webcast

GRAS Support Network, Its Implementation, Operation, and Use

Fritz Wollenweber, EUMETSAT

Data Intensive Frameworks for Astronomy

Jeffrey Gardner, University of Washington
Dennis Baldocchi, University of California-Berkeley

Experiences and Visions on Archaeo InformaticsChristiaan Hendrikus van der Meijden, IT group, Veterinary Faculty, Ludwig Maximilians University; Peer Kröger, Hans-Peter Kriegel, Department of Computer Science Database Systems Group, Ludwig Maximilians University

Wednesday, October 13

Keynote Presentations

Sam Ramji, Apigee

Microsoft’s open source strategy has shifted over the years, from ignore to fight to interoperate. Recently they have changed course to use open source as an engine of innovation and growth for core businesses. This talk will cover details of projects that showcase the shifts in strategy and expose the underlying dynamics of open source in the software industry.

Webcast
Garrison Sposito, U.C. Berkeley; Mark Stacey, U.C. Berkeley; Stephanie Carlson, U.C. Berkeley; Charlotte Ambrose, NOAA’s National Marine Fisheries Service; James Hunt, U.C. Berkeley

The current opportunities in the physical and biological sciences and their technological applications require the means to fundamentally understand processes at the molecular scale and to extend those processes to predict performance at larger scales. As examples, material science is using resolution at the scale of an atom to predict and design devices that are orders of magnitude larger, and biological processes are dictated by interactions at molecular, cellular, organismal, population, and ecosystem levels. Spatial and temporal scaling across orders of magnitude requires analysis tools that are available for computation, aggregation, and visualization. eScience is developing approaches for conducting this scaling and has been essential in addressing fundamental questions in biology and astronomy. While additional applications remain in the basic sciences, these fields have demonstrated pathways for advances in the applied environmental and social sciences, where the linkages between scales and disciplines require focused contributions from the eScience community. This workshop provides opportunities to observe how eScience has provided the scaling across various fields and to explore some of the challenges that remain.

Webcast

Sessions

Webcast

Panel: Challenges of Data Standards and Tools

Deb Agarwal, LBNL/UCB; Bill Howe, University of Washington; Alex James, Microsoft; Yong Liu, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign; Maryann Martone, UCSD; Yan Xu, Microsoft Research
Webcast

Scientific Data Sharing and Archiving at UC3/CDL: the Excel Add-in Project and More

John Kunze, California Digital Library/California Curation Center; Tricia Cruse, California Digital Library/California Curation Center

Visualizing All of History with Chronozoom

David Shimabukuro, University of California-Berkeley; Roland Saekow, University of California-Berkeley
Webcast

Proteome-Scale Protein Isoform Characterization with High Performance Computing

Jake Chen, Indiana University

Answering Biological Questions by Querying k-Mer Databases

Paul Greenfield, CSIRO Mathematics, Informatics and Statistics

Tutorials

CoSBiLab: Enabling Simulation-Based Science

Webcast
Scientific Data Visualization using WorldWide Telescope

Webcast
Data-Intensive Research: Dataset Lifecycle Management for Scientific Workflow, Collaboration, Sharing, and Archiving

Webcast
OData – Open Data for the Open Web

Webcast