scispace - formally typeset
F

Fangfang Liu

Researcher at Chinese Academy of Sciences

Publications -  19
Citations -  659

Fangfang Liu is an academic researcher from Chinese Academy of Sciences. The author has contributed to research in topics: Supercomputer & Computer science. The author has an hindex of 8, co-authored 14 publications receiving 542 citations.

Papers
More filters
Journal ArticleDOI

The Sunway TaihuLight supercomputer: system and applications

TL;DR: Preliminary efforts on developing and optimizing applications on the TaihuLight system are reported, focusing on key application domains, such as earth system modeling, ocean surface wave modeling, atomistic simulation, and phase-field simulation.
Proceedings ArticleDOI

10M-core scalable fully-implicit solver for nonhydrostatic atmospheric dynamics

TL;DR: An ultra-scalable fully-implicit solver is developed for stiff time-dependent problems arising from the hyperbolic conservation laws in nonhydrostatic atmospheric dynamics and a highly efficient hybrid domain-decomposed multigrid preconditioner is proposed that can greatly accelerate the convergence rate at the extreme scale.
Proceedings ArticleDOI

Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor

TL;DR: This paper presents a detailed methodology of implementing and optimizing the double-precision general format matrix-matrix multiplication (DGEMM) kernel on the emerging SW26010 processor, which is used to build the Sunway TaihuLight supercomputer.
Proceedings ArticleDOI

26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight

TL;DR: This work presents a computation-communication overlapping technique to reduce the inter-process communication overhead, a locality- aware blocking method to fully exploit on-chip parallelism with enhanced data locality, and a collaborative data accessing scheme for sharing data among different threads.
Journal ArticleDOI

Performance Optimization of the HPCG Benchmark on the Sunway TaihuLight Supercomputer

TL;DR: This article utilizes a block multicoloring approach for parallelization and proposes methods such as requirement-based data mapping and customized gather collective to enhance the effective memory bandwidth in HPCG.