NERSCPowering Scientific Discovery Since 1974

Charlene Yang

FullSizeRender2
Charlene Yang Ph.D.
Application Performance Specialist
Phone: (510) 486-5230
National Energy Research Scientific Computing Center
Lawrence Berkeley National Laboratory
1 Cyclotron Rd, MS 59R4010A
Berkeley, CA 94720 US

Biographical Sketch

Charlene Yang is an Application Performance Specialist in the Application Performance Group at NERSC. Prior to joining NERSC, Charlene worked as a Supercomputing Application Specialist at the Pawsey Supercomputing Centre in Perth, Australia. Her work was focused on assisting Bioinformatics and Astrophysics researchers in their use of supercomputers to accelerate large-scale scientific simulation, modeling and data analysis workflows. 

Charlene holds a Ph.D. in Electrical, Electronic and Computer Engineering from the University of Western Australia. 

Charlene's current research is focused on performance analysis, performance characterization, and performance optimization for large-scale high-performance computing (HPC) applications. She is the primary liaison of two projects, BerkeleyGW and WDMApp, in the NERSC Exascale Science Applications Program (NESAP). Charlene collaborates with domain scientists from Material Science (BerkeleyGW) and High-Energy Physics (WDMApp) and provides guidance and support on their GPU programming, code development and optimization, to help ensure these projects can smoothly transition to and efficiently utilize the cutting-edge supercomputing resources on Perlmutter

Charlene is also the lead at NERSC on performance tools and performance modeling. She collaborates with vendors such as NVIDIA and Intel, the Computational Research Division (CRD) at LBNL, and the wider HPC community such as researchers in the Department of Energy Exascale Computing Project (ECP), to bring more automated, accurate and efficient performance tools and methodologies to users for their performance analysis and optimization needs. Charlene is an active developer of the Roofline performance model. She has published numerous papers on Roofline-related research, and is the main maintainer of the Empirical Roofline Toolkit (ERT), and the Roofline data collection methodology for NVIDIA GPUs repository. She also hosts the bi-annual Roofline Hackathons, and contributes to/leads the Roofline Performance Tuning tutorials at the annual Supercomputing Conference (SC) and International Supercomputing Conference (ISC).

 

Journal Articles

C. Yang, T. Kurth, and S. Williams, "Hierarchical Roofline Analysis for GPUs: Accelerating Performance Optimization for the NERSC-9 Perlmutter System", Concurrency and Computation: Practice and Experience, DOI: 10.1002/cpe.5547, November 2019,

L. Xu, C. J. Yang, D. Huang, and A. Cantoni, "Exploiting Cyclic Prefix for Turbo-OFDM Receiver Design", IEEE Access, vol. 5, pp. 15762-15775, June 2017,

C. J. Yang, Q. Guo, D. Huang, and S. Nordholm, "A Factor Graph Approach to Exploiting Cyclic Prefix for Equalization in OFDM systems", IEEE Transactions on Communications, vol. 61, no. 12, pp. 4972-4983, June 2013,

Conference Papers

Y. Wang, C. Yang, S. Farrel, Y. Zhang, T. Kurth, and S. Williams, "Time-Based Roofline for Deep Learning Performance Analysis", IEEE/ACM Deep Learning on Supercomputers Workshop 2020, November 2020,

M. Del Ben, C. Yang, Z. Li, F. H. da Jornada, S. G. Louie, and J. Deslippe, "Accelerating Large-Scale Excited-State GW Calculations on Leadership HPC Systems", ACM Gordon Bell Finalist, Supercomputing Conference (SC’20), November 2020,

Y. Wang, C. Yang, S. Farrel, T. Kurth, and S. Williams, "Hierarchical Roofline Analysis for Deep Learning Applications", IEEE International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems Workshop (PMBS'20), November 2020,

C. Yang, "8 Steps to 3.7 TFLOP/s on NVIDIA V100 GPU: Roofline Analysis and Other Tricks", arXiv preprint arXiv:2008.11326, August 2020,

J. R. Madsen, M. G. Awan, H. Brunie, J. Deslippe, R. Gayatri, L. Oliker, Y. Wang,
C. Yang, and S. Williams,
"Timemory: Modular Performance Analysis for HPC", International Supercomputing Conference (ISC’20), June 2020,

C. Yang, R. Gayatri, T. Kurth, P. Basu, Z. Ronaghi, A. Adetokunbo, B. Friesen, B.
Cook, D. Doerfler, L. Oliker, J. Deslippe, and S. Williams,
"An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability", IEEE International Workshop on Performance, Portability and Productivity in HPC (P3HPC'18), November 2018,

B. Austin, C. Daley, D. Doerfler, J. Deslippe, B. Cook, B. Friesen, T. Kurth, C. Yang,
and N. Wright,
"A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity", 9th IEEE International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS'18), November 2018,

R. Gayatri, C. Yang, T. Kurth, and J. Deslippe, "A Case Study for Performance Portability Using OpenMP 4.5", IEEE International Workshop on Accelerator Programming Using Directives (WACCPD'18), November 2018,

T. Koskela, Z. Matveev, C. Yang, A. Adetokunbo, R. Belenov, P. Thierry, Z. Zhao,
R. Gayatri, H. Shan, L. Oliker, J. Deslippe, R. Green, and S. Williams,
"A Novel Multi-Level Integrated Roofline Model Approach for Performance Characterization", International Supercomputing Conference (ISC'18), June 2018,

B. Cook, C. Yang, B. Friesen, T. Kurth and J. Deslippe, "Sparse CSB_Coo Matrix-Vector and Matrix-Matrix Performance on Intel Xeon Architectures", Intel eXtreme Performance Users Group (IXPUG) at International Supercomputing Conference (ISC'18), June 2018,

C. Yang, B. Friesen, T. Kurth, B. Cook, S. Williams, "Toward Automated Application Profiling on Cray Systems", Cray User Group conference (CUG'18), May 2018,

C. J. Yang, Q. Guo, D. Huang, and S. Nordholm, "Exploiting Cyclic Prefix for Joint Detection, Decoding and Channel Estimation in OFDM via EM Algorithm and Message Passing", IEEE International Conference on Communications (ICC'14), pp. 4674-4679, June 2014,

C. J. Yang, Q. Guo, D. Huang, and S. Nordholm, "Enhanced Data Detection in OFDM Systems using Factor Graph", IEEE International Conference on Wireless Communications and Signal Processing (WCSP'13), pp. 1-5, October 2013,

C. J. Yang, Q. Guo, D. Huang, and S. Nordholm, "Exploiting Cyclic Prefix in Turbo FDE Systems using Factor Graph", IEEE Wireless Communications and Networking Conference (WCNC'13), pp. 2536-2541, April 2013,

Presentation/Talks

M. Del Ben, C. Yang, et al, Achieving Performance Portability on Leadership Class HPC Systems for Large Scale GW Calculations, Pacifichem Congress, December 2020,

S. Williams, A. Ilic, Z. Matveev, M. Katz, J. Kwack, C. Yang, and C. Bertoni, Performance Tuning with the Roofline Model on GPUs and CPUs, Half-Day Tutorial, Supercomputing Conference (SC’20), November 2020,

M. Del Ben, C. Yang, and J. Deslippe, Achieving Performance Portability on Hybrid GPU-CPU Architectures for a Large-Scale Material Science Code: The BerkeleyGW Case Study, IEEE International Workshop on Performance, Portability and Productivity in HPC (P3HPC), September 2020,

C. Yang, GW Calculations at Scale, NERSC GPU for Science Day 2020, July 2020,

C. Yang, S. Williams, Y. Wang, Roofline Performance Model for HPC and Deep Learning Applications, NVIDIA GPU Technology Conference (GTC’20), March 2020,

C. Yang, M. Del Ben, Accelerating Large-Scale GW Calculations in Material Science, NVIDIA GPU Technology Conference (GTC’20), March 2020,

C. Yang, and J. Deslippe, Accelerate Science on Perlmutter with NERSC, American Physical Society (APS) March Meeting 2020, March 2020,

M. Del Ben, C. Yang, S. G. Louie, and J. Deslippe, Accelerating Large-Scale GW Calculations on Hybrid GPU-CPU Systems, American Physical Society (APS) March Meeting 2020, March 2020,

S. Williams, C. Yang, and J. Deslippe, Performance Tuning with the Roofline Model on GPUs and CPUs, Half-Day Tutorial, Exascale Computing Program (ECP) Annual Meeting, February 2020,

J. Srinivasan, C. Yang, et al, Perlmutter - a Waypoint for ECP Teams, Birds of a Feather (BoF), Exascale Computing Program (ECP) Annual Meeting, February 2020,

J. Doerfert, C. Yang, et al, OpenMP Roadmap for Accelerators Across DOE Pre- Exascale/Exascale Machines, Birds of a Feather (BoF), Exascale Computing Program (ECP) Annual Meeting, February 2020,

O. Hernandez, C. Yang, et al, Early Experience of Application Developers with OpenMP Offloading at ALCF, NERSC, and OLCF, Birds of a Feather (BoF), Exascale Computing Program (ECP) Annual Meeting, February 2020,

S. Williams, C. Yang, A. Ilic, and K. Rogozhin, Performance Tuning with the Roofline Model on GPUs and CPUs, Half-Day Tutorial, Supercomputing Conference (SC’19), November 2019,

C. Yang, Performance-Related Activities at NERSC/CRD, RRZE Thomas Gruber Visit NERSC, July 2019,

C. Yang, The Current and Future of Roofline, LBNL Brown Bag Seminar, July 2019,

C. Yang, NERSC Application Readiness Process and Strategy, NERSC GPU for Science Day 2019, July 2019,

C. Yang, Z. Matveev, A. Ilic, and D. Marques, Performance Optimization of Scientific Codes with the Roofline Model, Half-Day Tutorial, International Supercomputing Conference (ISC’19), June 2019,

C. Yang, Using Intel Tools at NERSC, Intel KNL Training, May 2019,

C. Yang, Performance Analysis of GPU-Accelerated Applications using the Roofline Model, Cray Center of Excellence (COE) Webinar, April 2019,

C. Yang, Preparing NERSC Applications for Perlmutter as an Exascale Waypoint, Meet the Mentors, Pawsey Supercomputing Centre, March 2019,

C. Yang, S. Williams, Performance Analysis of GPU-Accelerated Applications using the Roofline Model, NVIDIA GPU Technology Conference (GTC’19), March 2019,

R. Gayatri, and C. Yang, Optimizing Large Reductions in BerkeleyGW with CUDA, OpenACC, OpenMP 4.5 and Kokkos, NVIDIA GPU Technology Conference (GTC’19), March 2019,

C. Yang, Preparing NERSC Applications for Perlmutter as an Exascale Waypoint, Tsukuba/LBNL Meeting, March 2019,

C. Yang, OpenACC Updates, NERSC Application Readiness Seminar, February 2019,

C. Yang, Roofline Performance Analysis with nvprof, NERSC-NVIDIA Face-to-Face Meeting, February 2019,

J. Pennycook, C. Yang, and J. Deslippe, Quantitatively Assessing Performance Portability with Roofline, Exascale Computing Project (ECP) Interoperable Design of Extreme-scale Application Software (IDEAS) Webinar, January 2019,

S. Williams, J. Deslippe, C. Yang, Performance Tuning of Scientific Codes with the Roofline Model, Half-Day Tutorial, Exascale Computing Project (ECP) Annual Meeting, January 2019,

S. Williams, A. Ilic, Z. Matveev, C. Yang, Performance Tuning of Scientific Codes with the Roofline Model,, Half-Day Tutorial, Supercomputing Conference (SC’18), November 2018,

C. Yang, Introduction to Performance Scalability Tools, Department of Energy (DOE) Computational Science Graduate Fellowship (CSGF) Annual Review, July 2018,

T. Koskela, A. Ilic, Z. Matveev, R. Belenov, C. Yang, and L. Sousa, A Practical Approach to Application Performance Tuning with the Roofline Model, Half-Day Tutorial, International Supercomputing Conference (ISC’18), June 2018,

C. Yang, T. Kurth, Roofline Performance Model and Intel Advisor, Performance Analysis and Modeling (PAM) Workshop 2018, February 2018,

S. Williams, J. Deslippe, C. Yang, P. Basu, Performance Tuning of Scientific Codes with the Roofline Model, Half-Day Tutorial, Exascale Computing Project (ECP) Annual Meeting, February 2018,

T. Koskela, A. Ilic, Z. Matveev, S. Williams, P. Thierry, and C. Yang, Performance Tuning of Scientific Codes with the Roofline Model, Half-Day Tutorial, Supercomputing Conference (SC’17), November 2017,

C. Yang, Programming for Bioinformatics on the Supercomputer Magnus at Pawsey, 11th GeneMappers Conference, November 2015,

C. Yang, Managing Bioinformatics Software Stack on Magnus, Pawsey Supercomputing Centre Bioinformatics Expo, April 2015,

Posters

T. Kurth, C. Yang, et al, "Roofline Performance Analysis of Deep Learning Kernels and Applications", The Platform for Advanced Scientific Computing Conference (PASC’21) (accepted), July 2021,

M. Del Ben, C. Yang, F. H. da Jornada, S. G. Louie, and J. Deslippe, "Accelerating Large-Scale GW Calculations on Hybrid CPU-GPU Architectures", Supercomputing Conference (SC’19), November 2019,

S. Williams, C. Yang, K. Ibrahim, T. Kurth, N. Ding, J. Deslippe, L. Oliker, "Performance Analysis using the Roofline Model", SciDAC PIs Meeting, 2019,

C. Yang, R. C. Bording, D. Price, and R. Nealon, "Optimizing Smoothed Particle Hydrodynamics Code Phantom on Haswell and KNL", International Supercomputing Conference (ISC'17), June 2017,

C. Yang, S. Yazar, G. Gooden, and A. Hewitt, "Data-Driven Workflows on Crays with Hybrid Scheduling: A Case Study of Celera on Magnus", Supercomputing Conference (SC'16), November 2016,

C. J. Yang, C. Harris, S. Young, and G. Morahan, "Adapting Genome-Wide Association Workflows for HPC Processing at Pawsey", Supercomputing Conference (SC'15), November 2015,