Researcher profile

Qin Zhou

Qin Zhou contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust: 21 (Emerging)
Verification: L1
Unclaimed author
9 works
0 followers
11 topics
4 close collaborators

Published work

9 published item(s)

Preprint, 2026, arXiv

AIConfigurator: Lightning-Fast Configuration Optimization for Multi-Framework LLM Serving

Optimizing Large Language Model (LLM) inference in production systems is increasingly difficult due to dynamic workloads, stringent latency/throughput targets, and a rapidly expanding configuration space. This complexity spans not only distributed parallelism strategies (tensor/pipeline/expert) but also framework-specific runtime parameters, such as whether CUDA graphs are enabled, the KV-cache memory fraction, and the maximum token capacity, all of which strongly affect performance. The diversity of modern inference frameworks (e.g., TRT-LLM, vLLM, SGLang), each employing distinct kernels and execution policies, makes manual tuning both framework-specific and computationally prohibitive. We present AIConfigurator, a unified performance-modeling system that enables rapid, framework-agnostic inference configuration search without requiring GPU-based profiling. AIConfigurator combines (1) a methodology that decomposes inference into analytically modelable primitives (GEMM, attention, communication, and memory operations) while capturing framework-specific scheduling dynamics; (2) a calibrated kernel-level performance database for these primitives across a wide range of hardware platforms and popular open-weights models (GPT-OSS, Qwen, DeepSeek, Llama, Mistral); and (3) an abstraction layer that automatically resolves optimal launch parameters for the target backend, integrating into production-grade orchestration systems. Evaluation on production LLM serving workloads demonstrates that AIConfigurator identifies serving configurations that improve performance by up to 40% for dense models (e.g., Qwen3-32B) and 50% for MoE architectures (e.g., DeepSeek-V3), while completing searches within 30 seconds on average. This enables rapid exploration of vast design spaces, from cluster topology down to engine-specific flags.

Preprint, 2022, arXiv

KiPA22 Report: U-Net with Contour Regularization for Renal Structures Segmentation

Three-dimensional (3D) integrated renal structures (IRS) segmentation is important in clinical practice. With the advancement of deep learning techniques, many powerful frameworks for medical image segmentation have been proposed. In this challenge, we used the nnU-Net framework, the state-of-the-art method for medical image segmentation. To reduce outlier predictions for the tumor label, we combine a contour regularization (CR) loss on the tumor label with Dice loss and cross-entropy loss.
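As a rough illustration of how such a combined objective can be assembled, here is a minimal sketch in plain Python. It is not the authors' implementation: it is a binary, single-class simplification, the function names are hypothetical, and the contour regularization term is left as an externally supplied number (`extra_term`) because the abstract does not define it.

```python
import math

def dice_loss(probs, targets, eps=1e-6):
    """Soft Dice loss for one class: 1 - 2*sum(p*t) / (sum(p) + sum(t))."""
    intersection = sum(p * t for p, t in zip(probs, targets))
    denom = sum(probs) + sum(targets)
    return 1.0 - (2.0 * intersection + eps) / (denom + eps)

def binary_cross_entropy(probs, targets, eps=1e-7):
    """Mean binary cross-entropy over all voxels."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
    return total / len(probs)

def combined_loss(probs, targets, extra_term=0.0, extra_weight=1.0):
    """Dice + cross-entropy, plus an optional precomputed regularizer.

    `extra_term` is a placeholder for a contour regularization value;
    how that value is computed is NOT specified here (assumption).
    """
    return (dice_loss(probs, targets)
            + binary_cross_entropy(probs, targets)
            + extra_weight * extra_term)

# Toy usage on four voxels (flattened predictions and ground truth).
predicted = [0.9, 0.1, 0.9, 0.1]
ground_truth = [1, 0, 1, 0]
loss = combined_loss(predicted, ground_truth)
```

Summing the terms with a scalar weight is one common way to combine segmentation losses; the weighting actually used in the report is not stated in the abstract.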

Preprint, 2022, arXiv

Numerical analysis of a Neumann boundary control problem with a stochastic parabolic equation

This paper analyzes the discretization of a Neumann boundary control problem with a stochastic parabolic equation, where an additive noise occurs in the Neumann boundary condition. The convergence is established for general filtrations, and the convergence rate $O(\tau^{1/4-\epsilon} + h^{1/2-\epsilon})$ is derived for the natural filtration of the Q-Wiener process.

Preprint, 2021, arXiv

Random vector functional link neural network based ensemble deep learning for short-term load forecasting

Electricity load forecasting is crucial for the planning and maintenance of power systems. However, the non-stationary and non-linear characteristics of load data make future demand difficult to anticipate. This paper proposes a novel ensemble deep Random Vector Functional Link (edRVFL) network for electricity load forecasting. The weights of the hidden layers are randomly initialized and kept fixed during training. The hidden layers are stacked to enforce deep representation learning. The model then generates forecasts by ensembling the outputs of each layer. We also propose augmenting the random enhancement features with the empirical wavelet transformation (EWT). The raw load data is decomposed by EWT in a walk-forward fashion, so that no future data leaks into the decomposition. Finally, all the sub-series generated by the EWT, together with the raw data, are fed into the edRVFL for forecasting. The proposed model is evaluated on twenty publicly available time series from the Australian Energy Market Operator for the year 2020. The simulation results demonstrate the proposed model's superior performance over eleven forecasting methods in three error metrics and in statistical tests on electricity load forecasting tasks.
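The layer-stacking and ensembling idea described above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the paper's code: the function names are hypothetical, each layer's output weights are fit by closed-form ridge regression (an assumption; the abstract does not say how they are trained), the original inputs are concatenated to every layer's hidden features as a direct link, the ensemble is a simple mean, and the EWT step is omitted.

```python
import math
import random

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x

def ridge_fit(F, y, lam):
    """Closed-form ridge regression: (F^T F + lam I) beta = F^T y."""
    d = len(F[0])
    A = [[sum(row[i] * row[j] for row in F) + (lam if i == j else 0.0)
          for j in range(d)] for i in range(d)]
    b = [sum(row[i] * t for row, t in zip(F, y)) for i in range(d)]
    return solve(A, b)

def hidden(W, inputs):
    """Apply fixed random weights W followed by tanh to every input row."""
    return [[math.tanh(sum(w * v for w, v in zip(unit, row))) for unit in W]
            for row in inputs]

def fit_edrvfl(X, y, n_layers=3, n_hidden=8, lam=1e-3, seed=0):
    """Stack random hidden layers; only the per-layer output weights are fit."""
    rng = random.Random(seed)
    layers, layer_in = [], X
    for _ in range(n_layers):
        W = [[rng.uniform(-1, 1) for _ in range(len(layer_in[0]))]
             for _ in range(n_hidden)]              # random, never updated
        F = [h + x for h, x in zip(hidden(W, layer_in), X)]  # direct link to X
        layers.append((W, ridge_fit(F, y, lam)))
        layer_in = F                                # feeds the next layer
    return layers

def predict_edrvfl(layers, X):
    """Average the forecasts produced by every layer's output weights."""
    layer_in, per_layer = X, []
    for W, beta in layers:
        F = [h + x for h, x in zip(hidden(W, layer_in), X)]
        per_layer.append([sum(b * f for b, f in zip(beta, row)) for row in F])
        layer_in = F
    return [sum(vals) / len(vals) for vals in zip(*per_layer)]

# Toy usage: two lagged inputs plus a constant bias feature per sample.
X_toy = [[i / 20, (i % 5) / 5, 1.0] for i in range(40)]
y_toy = [2 * a - b + 0.5 for a, b, _ in X_toy]
model = fit_edrvfl(X_toy, y_toy)
forecast = predict_edrvfl(model, X_toy)
```

Because the hidden weights stay fixed, training reduces to one small linear solve per layer, which is what makes this family of models cheap to fit.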

Preprint, 2021, arXiv

The Sariçiçek howardite fall in Turkey: Source crater of HED meteorites on Vesta and impact risk of Vestoids

The Sariçiçek howardite meteorite shower, consisting of 343 documented stones, occurred on 2 September 2015 in Turkey and is the first documented howardite fall. Cosmogenic isotopes show that Sariçiçek experienced a complex cosmic ray exposure history, exposed for ~12-14 Ma in a regolith near the surface of a parent asteroid, and that a ca. 1 m sized meteoroid was launched toward Earth by an impact 22 ± 2 Ma ago (as was one third of all HED meteorites). SIMS dating of zircon and baddeleyite yielded 4550.4 ± 2.5 Ma and 4553 ± 8.8 Ma crystallization ages for the basaltic magma clasts. The apatite U-Pb age of 4525 ± 17 Ma, the K-Ar age of ~3.9 Ga, and the U,Th-He ages of 1.8 ± 0.7 and 2.6 ± 0.3 Ga are interpreted to represent thermal metamorphic and impact-related resetting ages, respectively. Petrographic, geochemical and O-, Cr- and Ti-isotopic studies confirm that Sariçiçek belongs to the normal clan of HED meteorites. Petrographic observations and analysis of organic material indicate a small portion of carbonaceous chondrite material in the Sariçiçek regolith and organic contamination of the meteorite after a few days on soil. Video observations of the fall show an atmospheric entry at 17.3 ± 0.8 km/s from the NW, fragmentations at 37, 33, 31 and 27 km altitude, and provide a pre-atmospheric orbit that is the first dynamical link between the normal HED meteorite clan and the inner Main Belt. Spectral data indicate that Sariçiçek is similar to the spectra of the Vesta asteroid family, a group of asteroids stretching to delivery resonances, which includes (4) Vesta. Dynamical modeling of meteoroid delivery to Earth shows that the disruption of a ca. 1 km sized Vesta family asteroid or a ~10 km sized impact crater on Vesta is required to provide sufficient meteoroids <4 m in size to account for the influx of meteorites from this HED clan.

Preprint, 2020, arXiv

Signature of multilayer graphene strain-controlled domain walls in quantum Hall effect

Domain walls, topological defects that define the frontier between regions of different stacking in multilayer graphene, have proved to host exciting physics. The ability to tune these topological defects in situ in an electronic transport experiment opens a wealth of possibilities, both for the fundamental understanding of domain walls and for electronic applications. Here, using a MEMS (micro-electromechanical system) actuator and magnetoresistance measurements, we demonstrate the effect of domain walls on the quantum Hall effect in multilayer graphene. Reversible and controlled uniaxial strain triggers these topological defects, which manifest as new quantum Hall effect plateaus and as a discrete and reversible modulation of the current across the device. Our findings are supported by theoretical calculations and constitute the first indication of in-situ tuning of topological defects in multilayer graphene probed through electronic transport, opening the way to the use of reversible topological defects in electronic applications.

Preprint, 2020, arXiv

Temporally semidiscrete approximation of a Dirichlet boundary control for a fractional/normal evolution equation with a final observation

Optimal Dirichlet boundary control for a fractional/normal evolution equation with a final observation is considered. The existence and uniqueness of the solution and the first-order optimality condition of the optimal control problem are derived. The convergence of a temporally semidiscrete approximation is rigorously established, where the control is not explicitly discretized and the state equation is discretized by a discontinuous Galerkin method in time. Numerical results are provided to verify the theoretical results.

Preprint, 2020, arXiv

Weighted Bilinear Coding over Salient Body Parts for Person Re-identification

Deep convolutional neural networks (CNNs) have demonstrated dominant performance in person re-identification (Re-ID). Existing CNN-based methods use global average pooling (GAP) to aggregate intermediate convolutional features for Re-ID. However, this strategy considers only the first-order statistics of local features and treats local features at different locations as equally important, leading to sub-optimal feature representation. To address these issues, we propose a novel weighted bilinear coding (WBC) framework for local feature aggregation in CNNs to obtain more representative and discriminative feature representations; the framework can be adapted to other state-of-the-art methods to improve their performance. Specifically, bilinear coding is used to encode the channel-wise feature correlations and capture richer feature interactions. Meanwhile, a weighting scheme is applied to the bilinear coding to adaptively adjust the weights of local features at different locations based on their importance in recognition, further improving the discriminability of feature aggregation. To handle the spatial misalignment issue, we use a salient part net (spatial attention module) to derive salient body parts, and apply the WBC model to each part. The final representation, formed by concatenating the WBC-encoded features of each part, is both discriminative and resistant to spatial misalignment. Experiments on three benchmarks, Market-1501, DukeMTMC-reID and CUHK03, demonstrate the favorable performance of our method against other leading methods.
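The aggregation step described above can be sketched in plain Python as a weighted sum of outer products of local descriptors, computed per body part and then concatenated. This is a minimal sketch under assumptions, not the paper's implementation: the function names are hypothetical, the per-location weights are passed in rather than learned, and the final L2 normalization is a common convention for bilinear features that the abstract does not mention.

```python
import math

def weighted_bilinear(features, weights):
    """Aggregate local descriptors with second-order statistics.

    features: list of N local descriptors, each a list of C floats.
    weights:  list of N non-negative importance weights (assumed given).
    Returns the flattened C*C matrix sum_i w_i * x_i x_i^T, L2-normalized.
    """
    c = len(features[0])
    coded = [0.0] * (c * c)
    for x, w in zip(features, weights):
        for a in range(c):
            for b in range(c):
                coded[a * c + b] += w * x[a] * x[b]  # channel-wise correlation
    norm = math.sqrt(sum(v * v for v in coded)) or 1.0  # guard all-zero input
    return [v / norm for v in coded]

def encode_parts(parts):
    """Concatenate the weighted bilinear codes of every body part.

    parts: list of (features, weights) pairs, one pair per part.
    """
    representation = []
    for features, weights in parts:
        representation.extend(weighted_bilinear(features, weights))
    return representation

# Toy usage: two parts, each with two 2-channel local descriptors.
head = ([[1.0, 0.0], [0.0, 1.0]], [1.0, 0.0])   # only the first location counts
torso = ([[1.0, 0.0], [0.0, 1.0]], [1.0, 1.0])  # uniform weights
descriptor = encode_parts([head, torso])
```

With uniform weights this reduces to ordinary bilinear pooling, whereas global average pooling would keep only the mean of each channel and discard the cross-channel products.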