NERSC Powering Scientific Discovery Since 1974

Big Data Summit 2018

The Big Data Summit 2018, hosted by NERSC on July 18, 2018, presented results from advanced Data Analytics and Management projects on the NERSC Cori system. The talks covered ongoing projects led by NERSC, Intel, Cray, and five Intel® Parallel Computing Centers (represented by Oxford University, University of Liverpool, University of California – Berkeley, University of California – Davis, and New York University).

Welcome and Agenda Review; NERSC Update – Analytics and Management (Prabhat, NERSC) (BDC-00-Prabhat-Big-Data-Center.pdf)

HPC and ML – The New Frontier of Compute (Victor Lee, Intel) (BDC-02-BigDataSummit-VictorLee.pdf)

Data Science and Machine Learning at Supercomputer Scale (Mike Ringenburg, Cray) (BDC-03-Ringenburg-BDC-Summit-2018.pdf)

Distributed Training of Generative Adversarial Networks for Fast Detector Simulation on Intel® Xeon® HPC Cluster (Vikram Saletore and Hans Pabst, Intel; Damian Podareanu and Valeriu Codreanu, SURFsara B.V.; Sofia Vallecorsa and Federico Carminati, CERN) (BDC-04-Deep-Learning-Fast-Simulation-Big-Data-Summit-NERSC-18July2018-Final.pdf)


Deep Probabilistic Models for Astronomy (Jeffrey Regier, University of California – Berkeley) (slides not available)

Project DisCo: Physics-based Discovery of Coherent Structures in Spatiotemporal Systems (Adam Rupe, James P. Crutchfield, and Ryan G. James, University of California – Davis) (BDC-06-Adam-Rupe-Project-Disco.pdf)

Working Towards Distributed Inference Compilation at Scale (Frank Wood, University of British Columbia [formerly Oxford University]) (BDC-07-NERSC-BDC-Intel-PCC.pdf)

Neutrino Detection with Graph Neural Networks (Nicholas Choma and Joan Bruna, New York University) (BDC-08-Nicholas-Choma-Big-Data-Summit-slides.pdf)

Machine Learning and Topological Data Analysis: Applications to Pattern Classification in Fluid and Climate Simulations (Vitaliy Kurlin and Grzegorz Muszynski, University of Liverpool) (BDC-09-Vitaliy-Kurlin-Big-Data-Summit-talk.pdf)

Deep Dive into TensorFlow: Single and Multi-Node Optimizations on Intel Xeon (Ashraf Bhuiyan, Intel; Steven Farrell, NERSC) (BDC-10-Ashraf-Steve-Tensorflow-deep-dive.pptx)

High Productivity Languages – Python (Heidi Pan, Intel; Rollin Thomas, NERSC) (BDC-11-BigDataSummit-Python-Pan-Thomas-v3.pdf)

CosmoFlow: Using Deep Learning to Learn the Universe at Scale (Debbie Bard, NERSC) (BDC-12-CosmoFlow-BDC18.pptx)


Data Management and I/O Deep Dive (Quincey Koziol, NERSC) (BDC-13-BDC-Data-Management.pdf)