2nd NSF PI Workshop

Data Infrastructure Building Blocks

July 9 - 10, 2018 | Crystal City Marriott Hotel, Arlington VA

The two-day workshop will feature a series of panel discussions organized by the Program Committee around the white papers and posters submitted by PIs from all DIBBs awards active at the time of the workshop. After each panel discussion, the individuals at each table will share their project experiences with one another, and a volunteer at each table will summarize that table’s discussion. We will assign attendees to a different table for each of the four small-group discussions to mix up the groups. This will give attendees the opportunity to meet more people, share project experiences, and potentially spark future collaborations.

The National Science Foundation Cyberinfrastructure Framework for 21st Century Science and Engineering (CIF21) considers an integrated, scalable, and sustainable cyberinfrastructure to be crucial for the advancement of new research practices and transformative advances across all fields of science and engineering. The Data Infrastructure Building Blocks (DIBBs) program supported that vision by encouraging the development of robust and shared data-centric cyberinfrastructure capabilities.

Five DIBBs program solicitations (2012, 2014, 2015, 2016, and 2017) resulted in 56 awards ranging in funding from approximately $100,000 to approximately $10,000,000. Approximately 50 of these awards are active. A Special Award Condition contained in the DIBBs solicitations noted that awardees were expected to participate in annual meetings, with travel costs supported by the award.

Following on from the first DIBBs17 PI Workshop developed by Dr. David Lifka and Paul Redfern of Cornell University, and closely adapting its rationale and structure, a second NSF Data Infrastructure Building Blocks PI Workshop (DIBBs18) will be convened in Arlington, VA on July 9-10, 2018. Its purpose is to exchange results and lessons learned from the projects and to consider the implications of project results for advances in the vision and goals for data cyberinfrastructure, with a focus on Harnessing the Data Revolution, one of the NSF Big Ideas.

Prior to the workshop, each PI will submit a PDF of a poster on their DIBBs successes and a short white paper describing current and future challenges. A Program Committee composed of a representative set of PIs will use these project-specific materials to organize the workshop panels and small discussion groups. Panels will discuss significant and innovative DIBBs results, current DIBBs challenges and solutions, and future DIBBs challenges, including sustainability issues. Each panel will be immediately followed by small group discussions and report-outs to increase PI/Co-PI participation and facilitate DIBBs community building. A 2018 DIBBs PI Workshop report will summarize the progress and challenges of the DIBBs projects and describe potential gaps and future opportunities that were discussed during the 1.5-day workshop. Links to PDFs of all PI white papers and posters will be included in the report and posted on the DIBBs18 workshop website.

- Sunday July 8 -

West Coast Attendees Arrive at Marriott Hotel

- Monday July 9 -

Morning Attendees Arrive at Marriott Hotel (no scheduled activities)


Welcome: Workshop Goals & Agenda Review, Introduce the Program Committee - Larry Smarr, Director of the California Institute for Telecommunications and Information Technology (Calit2), PI of CC*DNI DIBBs award #ACI-1541349. DIBBs18 Opening Slides - Larry Smarr
Keynote: DIBBs Successes & Future Challenges: Harnessing the Data Revolution - Manish Parashar, Office Director for the Office of Advanced Cyberinfrastructure (OAC)

Panel Discussion 1: Significant/Innovative DIBBs Results

moderator: Hsinchun Chen 2014 University of Arizona 1443019 DIBBs for Intelligence and Security Informatics Research and Community

panelist: Victor Pankratius 2014 MIT 1442997 An Infrastructure for Computer Aided Discovery in Geoscience

panelist: Camille Crittenden 2015 UC Berkeley 1541349 Pacific Research Platform

panelist: Benjeman Meekhof 2015 University of Michigan 1541335 OSiRIS: Open Storage Research Infrastructure

3:15-4:00 Table Discussions 1: Significant/Innovative DIBBs Results
Highlights from each Table

Panel Discussion 2: Significant DIBBs Challenges/Solutions

moderator: Ilya Baldin 2016 University of North Carolina Chapel Hill 1659367 ImPACT - Infrastructure for Privacy-Assured compuTations

panelist: Kyle Chard 2015 University of Chicago 1541450 Merging Science and Cyberinfrastructure Pathways: The Whole Tale

panelist: Linda Schadler 2016 Rensselaer Polytechnic Institute 1640840 PD: Ontology-enabled Polymer Nanocomposite Open Community Data Resource

panelist: kc claffy 2017 SDSC 1724853 Integrated Platform for Applied Network Data Analysis (PANDA)

panelist: Chris Paolini 2016 SDSU 1659169 Implementation of a Distributed, Shareable, and Parallel Storage Resource at San Diego State University to Facilitate High-Performance Computing for Climate Science

- Tuesday July 10 -

8:00-8:30 Discussion of Main Takeaways from Day 1 - Larry Smarr & Program Committee
Table Discussions 2: Significant DIBBs Challenges/Solutions
Highlights from each Table

Panel Discussion 3: Future DIBBs Challenges/Sustainability

moderator: Ann Christine Catlin 2017 Purdue University 1724728 Creating a Digital Environment for Enabling Data-driven Science (DEEDS)

panelist: Frank Feltus 2016 Clemson University 1659300 National Cyberinfrastructure for Scientific Data Analysis at Scale (SciDAS)

panelist: David Lifka 2015 Cornell University 1541215 Data Analysis and Management Building Blocks for Multi-Campus Cyberinfrastructure through Cloud Federation

panelist: Rob Gardner 2017 University of Chicago 1724821 SLATE and the Mobility of Capability

panelist: Michela Taufer 2017 University of Tennessee 1724843 Cyberinfrastructure Tools for Precision Agriculture in the 21st Century

10:30-11:15 Table Discussions 3: Future DIBBs Challenges/Sustainability
11:15-11:30 Highlights from each Table
Panel Discussion 4: Implications for Harnessing the Data Revolution

moderator: Tevfik Kosar 2017 Buffalo 1724898 OneDataShare: A Universal Data Sharing Building Block for Data-Intensive Applications

panelist: Ilkay Altintas 2015 SDSC Invited Speaker

panelist: Santosh Kumar 2016 Memphis 1640813 mProv: Provenance-based Data Analytics Cyberinfrastructure for High-frequency Mobile Sensor Data

panelist: Kate Keahey 2014 Argonne 1443080 Scalable Capabilities for Spatial Data Synthesis

panelist: Geoffrey Fox 2014 Indiana 1443054 Middleware and High-Performance Analytics Libraries for Scalable Data Science

Table Discussions 4: Implications for Harnessing the Data Revolution
Highlights from each Table: Final Thoughts
3:00-3:30 Summary Discussion & Wrap Up - Larry Smarr & Program Committee
3:30-4:00 Closing Comments - Amy Walton, Program Director, NSF