NERSCPowering Scientific Discovery Since 1974

Data Analytics and Machine Learning

Analytics is key to gaining insights from massive, complex datasets. NERSC provides general purpose analytics (iPython, Spark, MATLAB, IDL, Mathematica, ROOT), statistics (R), machine learning (Tensorflow, SciKitLearn, Caffe etc) , and imaging tools.


Python is an interpreted, general-purpose high-level programming language. Various versions of Python are installed on NERSC systems with a number of scientific computing libraries like numpy and scipy, and visualization libraries like matplotlib. Read More »

Jupyter and RStudio

This includes web-based integrated development environments like RStudio and notebooks like IPython/Jupyter. Read More »


Julia is a high-level, high-performance dynamic programming language for technical computing, with syntax that is familiar to users of other technical computing environments. It is available at NERSC. Read More »

Spark Distributed Analytic Framework

Apache Spark™ is a fast and general engine for large-scale data processing. Read More »


MATLAB is a high-performance language for technical computing. It integrates computation, visualization, and programming in an easy-to-use environment where problems and solutions are expressed in familiar mathematical notation. Read More »


Mathematica is a fully integrated environment for technical computing. Performs symbolic manipulation of equations, integrals, differential equations and almost any mathematical expression. Numeric results can be evaluated also. Read More »


R is a language and environment for statistical computing and graphics. It provides a wide variety of statistical tools, such as linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, graphics, and it is highly extensible. Read More »


The ROOT system provides a set of OO frameworks with all the functionality needed to handle and analyze large amounts of data in a very efficient way. Read More »


IDL, the Interactive Data Language, is an interactive application used for for data analysis, visualization, and cross-platform application development. Read More »