AGU Session Announcement: New Approaches to Analyze Big Geoscientific Datasets

Posted by Joe Hamman on July 20, 2017

Just a quick note to announce a new AGU session that Charlie Zender, Ryan Abernathey, and myself are putting together. We’re hoping to see a nice collection of presentations covering cutting edge data analysis tool. Final deadline for submitting abstracts is August 2, 2017.

Session ID#: 26340

Session Description:

Rapid analysis and interpretation of large model and measurement datasets is increasingly undertaken as a sequence of institution-supported pre-processing and user-devised post-processing (e.g., scripting of specialized statistics and visualization). By providing a pre-agreed format for data and metadata, the first stage ensures dataset utility and interoperability. In the second stage the user community employs diverse software practices and specialized toolkits to pursue their data analysis. Users now routinely attempt to ingest entire satellite records or MIP archives to complete their analysis. This often requires interactive and batch workflows to scale from the desktop to distributed-HPC systems. Such workflows must adjust to available memory constraints, provide access to CPU and cluster-level parallelism, while remaining flexible and easy to customize. How ought researchers utilize the unprecedented volume of data with metadata-aware analysis tools to answer tomorrow’s data-intensive questions? This session will demonstrate state-of-the-art approaches to gigabyte- through petabyte-scale geoscientific data analysis.

Primary Convener:

  • Charles S Zender, University of California Irvine, Departments of Earth System Science and Computer Science, Irvine, CA, United States


  • Joseph Hamman, National Center for Atmospheric Research, Hydrometeorological Applications Program, Research Applications Laboratory, Boulder, CO, United States
  • Ryan Abernathey, Columbia University of New York, Palisades, NY, United States


  • A - Atmospheric Sciences
  • GC - Global Environmental Change
  • H - Hydrology
  • OS - Ocean Sciences

Index Terms:

  • 0520 Data analysis: algorithms and implementation [COMPUTATIONAL GEOPHYSICS]
  • 1902 Community modeling frameworks [INFORMATICS]
  • 1976 Software tools and services [INFORMATICS]
  • 1998 Workflow [INFORMATICS]