Charlene Yang

Charlene Yang Ph.D.
Application Performance Specialist
National Energy Research Scientific Computing Center (NERSC)
Lawrence Berkeley National Laboratory (LBNL)
1 Cyclotron Rd, MS 59R4010A
Berkeley, CA 94720 US

Biographical Sketch

Charlene Yang is an Application Performance Specialist in the Application Performance Group at NERSC. Prior to joining NERSC, Charlene worked as a Supercomputing Application Specialist at the Pawsey Supercomputing Centre in Perth, Australia. Her work was focused on assisting Bioinformatics and Astrophysics researchers in their use of supercomputers to accelerate large-scale scientific simulation, modeling and data analytics. 

Charlene holds a Ph.D. in Electrical, Electronic and Computer Engineering from the University of Western Australia. 

Charlene's current research focuses on performance analysis, characterization, and optimization for large-scale high-performance computing (HPC) applications. She is the primary liaison for two projects in the NERSC Exascale Science Applications Program (NESAP): BerkeleyGW and WDMApp (Whole Device Modeling Application). Charlene collaborates with domain scientists from Material Science (BerkeleyGW) and High-Energy Physics (WDMApp) and provides technical guidance and assistance in GPU programming, code profiling, performance analysis and optimization. Her goal is to help these projects transition smoothly to, and make efficient use of, the cutting-edge supercomputing resources on Perlmutter.

Charlene is also the lead on performance tools and performance modeling at NERSC. She collaborates with vendors such as NVIDIA and Intel, the Computational Research Division (CRD) at LBNL, and the wider HPC community, including researchers in the Department of Energy Exascale Computing Project (ECP), to bring accurate, robust and efficient performance tools and methodologies to HPC users. Charlene is heavily involved in the development of the Roofline performance model, and has published numerous papers and given numerous talks on Roofline-related research. She manages the repositories for the Empirical Roofline Toolkit (ERT) and the Roofline data collection methodology for NVIDIA GPUs, and hosts the bi-annual Roofline Hackathons for the HPC community. She also leads and/or contributes to the long-running Roofline Performance Tuning tutorials at the annual Supercomputing Conference (SC) and International Supercomputing Conference (ISC).

 

Journal Articles

C. Yang, T. Kurth, and S. Williams, "Hierarchical Roofline Analysis for GPUs: Accelerating Performance Optimization for the NERSC-9 Perlmutter System", Concurrency and Computation: Practice and Experience, DOI: 10.1002/cpe.5547, November 2019,

L. Xu, C. J. Yang, D. Huang, and A. Cantoni, "Exploiting Cyclic Prefix for Turbo-OFDM Receiver Design", IEEE Access, vol. 5, pp. 15762-15775, June 2017,

C. J. Yang, Q. Guo, D. Huang, and S. Nordholm, "A Factor Graph Approach to Exploiting Cyclic Prefix for Equalization in OFDM systems", IEEE Transactions on Communications, vol. 61, no. 12, pp. 4972-4983, June 2013,

Conference Papers

C. Yang, Y. Wang, S. Farrell, T. Kurth, and S. Williams, "Hierarchical Roofline Performance Analysis for Deep Learning Applications", Science and Information (SAI) Conference series - Computing Conference 2021, July 2021,

Y. Wang, C. Yang, S. Farrell, Y. Zhang, T. Kurth, and S. Williams, "Time-Based Roofline for Deep Learning Performance Analysis", IEEE/ACM Deep Learning on Supercomputers Workshop (DLS'20), November 2020,

M. Del Ben, C. Yang, Z. Li, F. H. da Jornada, S. G. Louie, and J. Deslippe, "Accelerating Large-Scale Excited-State GW Calculations on Leadership HPC Systems", ACM Gordon Bell Finalist, Supercomputing Conference (SC’20), November 2020,

C. Yang, "8 Steps to 3.7 TFLOP/s on NVIDIA V100 GPU: Roofline Analysis and Other Tricks", arXiv preprint arXiv:2008.11326, August 2020,

J. R. Madsen, M. G. Awan, H. Brunie, J. Deslippe, R. Gayatri, L. Oliker, Y. Wang, C. Yang, and S. Williams, "Timemory: Modular Performance Analysis for HPC", International Supercomputing Conference (ISC’20), June 2020,

C. Yang, R. Gayatri, T. Kurth, P. Basu, Z. Ronaghi, A. Adetokunbo, B. Friesen, B. Cook, D. Doerfler, L. Oliker, J. Deslippe, and S. Williams, "An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability", IEEE International Workshop on Performance, Portability and Productivity in HPC (P3HPC'18), November 2018,

B. Austin, C. Daley, D. Doerfler, J. Deslippe, B. Cook, B. Friesen, T. Kurth, C. Yang, and N. Wright, "A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity", 9th IEEE International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS'18), November 2018,

R. Gayatri, C. Yang, T. Kurth, and J. Deslippe, "A Case Study for Performance Portability Using OpenMP 4.5", IEEE International Workshop on Accelerator Programming Using Directives (WACCPD'18), November 2018,

T. Koskela, Z. Matveev, C. Yang, A. Adetokunbo, R. Belenov, P. Thierry, Z. Zhao, R. Gayatri, H. Shan, L. Oliker, J. Deslippe, R. Green, and S. Williams, "A Novel Multi-Level Integrated Roofline Model Approach for Performance Characterization", International Supercomputing Conference (ISC'18), June 2018,

B. Cook, C. Yang, B. Friesen, T. Kurth and J. Deslippe, "Sparse CSB_Coo Matrix-Vector and Matrix-Matrix Performance on Intel Xeon Architectures", Intel eXtreme Performance Users Group (IXPUG) at International Supercomputing Conference (ISC'18), June 2018,

C. Yang, B. Friesen, T. Kurth, B. Cook, S. Williams, "Toward Automated Application Profiling on Cray Systems", Cray User Group conference (CUG'18), May 2018,

C. J. Yang, Q. Guo, D. Huang, and S. Nordholm, "Exploiting Cyclic Prefix for Joint Detection, Decoding and Channel Estimation in OFDM via EM Algorithm and Message Passing", IEEE International Conference on Communications (ICC'14), pp. 4674-4679, June 2014,

C. J. Yang, Q. Guo, D. Huang, and S. Nordholm, "Enhanced Data Detection in OFDM Systems using Factor Graph", IEEE International Conference on Wireless Communications and Signal Processing (WCSP'13), pp. 1-5, October 2013,

C. J. Yang, Q. Guo, D. Huang, and S. Nordholm, "Exploiting Cyclic Prefix in Turbo FDE Systems using Factor Graph", IEEE Wireless Communications and Networking Conference (WCNC'13), pp. 2536-2541, April 2013,

Presentations/Talks

C. Yang, Accelerating Large-Scale Excited-State GW Calculations in Material Science, 2nd Berkeley Excited States Conference (BESC2021), January 2021,

M. Del Ben, C. Yang, et al., Achieving Performance Portability on Leadership Class HPC Systems for Large Scale GW Calculations, Pacifichem Congress, December 2020,

S. Williams, C. Yang, et al., Performance Tuning with the Roofline Model on GPUs and CPUs, Half-Day Tutorial, Supercomputing Conference (SC’20), November 2020,

M. Del Ben, C. Yang, and J. Deslippe, Achieving Performance Portability on Hybrid GPU-CPU Architectures for a Large-Scale Material Science Code: The BerkeleyGW Case Study, IEEE International Workshop on Performance, Portability and Productivity in HPC (P3HPC), September 2020,

C. Yang, GW Calculations at Scale, NERSC GPU for Science Day 2020, July 2020,

C. Yang, S. Williams, Y. Wang, Roofline Performance Model for HPC and Deep Learning Applications, NVIDIA GPU Technology Conference (GTC’20), March 2020,

C. Yang, M. Del Ben, Accelerating Large-Scale GW Calculations in Material Science, NVIDIA GPU Technology Conference (GTC’20), March 2020,

C. Yang, and J. Deslippe, Accelerate Science on Perlmutter with NERSC, American Physical Society (APS) March Meeting 2020, March 2020,

M. Del Ben, C. Yang, S. G. Louie, and J. Deslippe, Accelerating Large-Scale GW Calculations on Hybrid GPU-CPU Systems, American Physical Society (APS) March Meeting 2020, March 2020,

S. Williams, C. Yang, and J. Deslippe, Performance Tuning with the Roofline Model on GPUs and CPUs, Half-Day Tutorial, Exascale Computing Project (ECP) Annual Meeting, February 2020,

J. Srinivasan, C. Yang, et al., Perlmutter - a Waypoint for ECP Teams, Birds of a Feather (BoF), Exascale Computing Project (ECP) Annual Meeting, February 2020,

J. Doerfert, C. Yang, et al., OpenMP Roadmap for Accelerators Across DOE Pre-Exascale/Exascale Machines, Birds of a Feather (BoF), Exascale Computing Project (ECP) Annual Meeting, February 2020,

O. Hernandez, C. Yang, et al., Early Experience of Application Developers with OpenMP Offloading at ALCF, NERSC, and OLCF, Birds of a Feather (BoF), Exascale Computing Project (ECP) Annual Meeting, February 2020,

S. Williams, C. Yang, A. Ilic, and K. Rogozhin, Performance Tuning with the Roofline Model on GPUs and CPUs, Half-Day Tutorial, Supercomputing Conference (SC’19), November 2019,

C. Yang, Performance-Related Activities at NERSC/CRD, RRZE Thomas Gruber Visit to NERSC, July 2019,

C. Yang, The Current and Future of Roofline, LBNL Brown Bag Seminar, July 2019,

C. Yang, NERSC Application Readiness Process and Strategy, NERSC GPU for Science Day 2019, July 2019,

C. Yang, Z. Matveev, A. Ilic, and D. Marques, Performance Optimization of Scientific Codes with the Roofline Model, Half-Day Tutorial, International Supercomputing Conference (ISC’19), June 2019,

C. Yang, Using Intel Tools at NERSC, Intel KNL Training, May 2019,

C. Yang, Performance Analysis of GPU-Accelerated Applications using the Roofline Model, Cray Center of Excellence (COE) Webinar, April 2019,

C. Yang, Preparing NERSC Applications for Perlmutter as an Exascale Waypoint, Meet the Mentors, Pawsey Supercomputing Centre, March 2019,

C. Yang, S. Williams, Performance Analysis of GPU-Accelerated Applications using the Roofline Model, NVIDIA GPU Technology Conference (GTC’19), March 2019,

R. Gayatri, and C. Yang, Optimizing Large Reductions in BerkeleyGW with CUDA, OpenACC, OpenMP 4.5 and Kokkos, NVIDIA GPU Technology Conference (GTC’19), March 2019,

C. Yang, Preparing NERSC Applications for Perlmutter as an Exascale Waypoint, Tsukuba/LBNL Meeting, March 2019,

C. Yang, OpenACC Updates, NERSC Application Readiness Seminar, February 2019,

C. Yang, Roofline Performance Analysis with nvprof, NERSC-NVIDIA Face-to-Face Meeting, February 2019,

J. Pennycook, C. Yang, and J. Deslippe, Quantitatively Assessing Performance Portability with Roofline, Exascale Computing Project (ECP) Interoperable Design of Extreme-scale Application Software (IDEAS) Webinar, January 2019,

S. Williams, J. Deslippe, C. Yang, Performance Tuning of Scientific Codes with the Roofline Model, Half-Day Tutorial, Exascale Computing Project (ECP) Annual Meeting, January 2019,

S. Williams, A. Ilic, Z. Matveev, C. Yang, Performance Tuning of Scientific Codes with the Roofline Model, Half-Day Tutorial, Supercomputing Conference (SC’18), November 2018,

C. Yang, Introduction to Performance Scalability Tools, Department of Energy (DOE) Computational Science Graduate Fellowship (CSGF) Annual Review, July 2018,

T. Koskela, A. Ilic, Z. Matveev, R. Belenov, C. Yang, and L. Sousa, A Practical Approach to Application Performance Tuning with the Roofline Model, Half-Day Tutorial, International Supercomputing Conference (ISC’18), June 2018,

C. Yang, T. Kurth, Roofline Performance Model and Intel Advisor, Performance Analysis and Modeling (PAM) Workshop 2018, February 2018,

S. Williams, J. Deslippe, C. Yang, P. Basu, Performance Tuning of Scientific Codes with the Roofline Model, Half-Day Tutorial, Exascale Computing Project (ECP) Annual Meeting, February 2018,

T. Koskela, A. Ilic, Z. Matveev, S. Williams, P. Thierry, and C. Yang, Performance Tuning of Scientific Codes with the Roofline Model, Half-Day Tutorial, Supercomputing Conference (SC’17), November 2017,

C. Yang, Programming for Bioinformatics on the Supercomputer Magnus at Pawsey, 11th GeneMappers Conference, November 2015,

C. Yang, Managing Bioinformatics Software Stack on Magnus, Pawsey Supercomputing Centre Bioinformatics Expo, April 2015,

Posters

T. Kurth, C. Yang, et al., "Roofline Performance Analysis of Deep Learning Kernels and Applications", The Platform for Advanced Scientific Computing Conference (PASC’21) (accepted), July 2021,

M. Del Ben, C. Yang, F. H. da Jornada, S. G. Louie, and J. Deslippe, "Accelerating Large-Scale GW Calculations on Hybrid CPU-GPU Architectures", Supercomputing Conference (SC’19), November 2019,

S. Williams, C. Yang, K. Ibrahim, T. Kurth, N. Ding, J. Deslippe, L. Oliker, "Performance Analysis using the Roofline Model", SciDAC PIs Meeting, 2019,

C. Yang, R. C. Bording, D. Price, and R. Nealon, "Optimizing Smoothed Particle Hydrodynamics Code Phantom on Haswell and KNL", International Supercomputing Conference (ISC'17), June 2017,

C. Yang, S. Yazar, G. Gooden, and A. Hewitt, "Data-Driven Workflows on Crays with Hybrid Scheduling: A Case Study of Celera on Magnus", Supercomputing Conference (SC'16), November 2016,

C. J. Yang, C. Harris, S. Young, and G. Morahan, "Adapting Genome-Wide Association Workflows for HPC Processing at Pawsey", Supercomputing Conference (SC'15), November 2015,