Researcher profile

Huaqing Zheng

Huaqing Zheng contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 13 - Baseline
2 works
0 followers
3 topics
4 close collaborators

Published work

2 published items

preprint, 2026, arXiv

Post-Training Quantization of OpenPangu Models for Efficient Deployment on Atlas A2

Huawei's openPangu-Embedded-1B and openPangu-Embedded-7B are variants of the openPangu large language model, designed for efficient deployment on Ascend NPUs. The 7B variant supports three distinct Chain-of-Thought (CoT) reasoning paradigms, namely slow_think, auto_think, and no_think, while the 1B variant operates exclusively in the no_think mode, which employs condensed reasoning for higher efficiency. Although CoT reasoning enhances capability, the generation of extended reasoning traces introduces substantial memory and latency overheads, posing challenges for practical deployment on Ascend NPUs. This paper addresses these computational constraints by leveraging low-bit quantization, which transforms FP16 computations into more efficient integer arithmetic. We introduce a unified low-bit inference framework, supporting INT8 (W8A8) and W4A8 quantization, specifically optimized for openPangu-Embedded models on the Atlas A2. Our comprehensive evaluation on code generation benchmarks (HumanEval and MBPP) demonstrates the efficacy of this approach. INT8 quantization consistently preserves over 90\% of the FP16 baseline accuracy and achieves a 1.5x prefill speedup on the Atlas A2. Furthermore, W4A8 quantization significantly reduces memory consumption, albeit with a moderate trade-off in accuracy. These findings collectively indicate that low-bit quantization effectively facilitates efficient CoT reasoning on Ascend NPUs, maintaining high model fidelity.
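The abstract does not say which quantization scheme the framework uses, so the following is only a minimal sketch of the general idea behind W8A8: floating-point weights and activations are mapped to INT8 codes, the dot product is accumulated in integers, and the result is rescaled once at the end. The symmetric per-tensor scaling, the function names, and the sample values are assumptions made for illustration. None of them comes from the paper.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization (assumed scheme, not the paper's)."""
    max_abs = max(abs(v) for v in values)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def int8_dot(w_codes, w_scale, a_codes, a_scale):
    """W8A8 dot product: integer accumulation, one float rescale at the end."""
    acc = sum(w * a for w, a in zip(w_codes, a_codes))
    return acc * w_scale * a_scale


# Hypothetical sample data, chosen only to exercise the functions.
weights = [0.42, -1.27, 0.05, 0.90]
activations = [1.0, -0.25, 3.0, 0.75]

w_codes, w_scale = quantize_int8(weights)
a_codes, a_scale = quantize_int8(activations)

exact = sum(w * a for w, a in zip(weights, activations))
approx = int8_dot(w_codes, w_scale, a_codes, a_scale)
print(f"float result: {exact:.4f}, INT8 result: {approx:.4f}")
```

The two results differ slightly because each value is rounded to one of 255 levels. That rounding error is the source of the accuracy trade-off the abstract describes, and it grows when the codes are narrowed to 4 bits.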

preprint, 2016, arXiv

Investigation of Dosimetric Parameters of $^{192}$Ir MicroSelectron v2 HDR Brachytherapy Source Using EGSnrc Monte Carlo Code

$^{192}$Ir sources are widely used for high dose rate (HDR) brachytherapy treatments. The aim of this study is to simulate the $^{192}$Ir MicroSelectron v2 HDR brachytherapy source and calculate the air kerma strength, dose rate constant, radial dose function and anisotropy function established in the updated AAPM Task Group 43 protocol. The EGSnrc Monte Carlo (MC) code package is used to calculate these dosimetric parameters, including the dose contribution from the secondary electron source and the contribution of bremsstrahlung photons to the air kerma strength. The air kerma strength, dose rate constant and radial dose function, as well as the anisotropy function at distances greater than 0.5 cm from the source center, are in good agreement with previously published studies. The value obtained from the MC simulation for the air kerma strength is $9.762\times 10^{-8}\ \textrm{U}\,\textrm{Bq}^{-1}$ and the dose rate constant is $1.108\pm 0.13\%\ \textrm{cGy}\,\textrm{h}^{-1}\,\textrm{U}^{-1}$.
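For context, the four quantities named in the abstract are tied together by the two-dimensional dose-rate equation of the updated TG-43 formalism. The symbols below follow the usual TG-43 notation and are not taken from the abstract itself: $S_K$ is the air kerma strength, $\Lambda$ the dose rate constant, $G_L$ the line-source geometry function, $g_L$ the radial dose function, and $F$ the anisotropy function, with the reference point at $r_0 = 1$ cm and $\theta_0 = \pi/2$.

$$\dot{D}(r,\theta) = S_K \, \Lambda \, \frac{G_L(r,\theta)}{G_L(r_0,\theta_0)} \, g_L(r) \, F(r,\theta), \qquad \Lambda = \frac{\dot{D}(r_0,\theta_0)}{S_K}$$

Under this definition, the reported dose rate constant is the dose rate to water at the reference point per unit air kerma strength, which is why it carries units of $\textrm{cGy}\,\textrm{h}^{-1}\,\textrm{U}^{-1}$.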