Source author record

Zhizhong Zheng

Zhizhong Zheng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision physics.med-ph

Catalog footprint

What is connected

2works

2topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A one-step generation model with a Single-Layer Transformer: Layer number re-distillation of FreeFlow

Currently, Flow matching methods aim to compress the iterative generation process of diffusion models into a few or even a single step, with MeanFlow and FreeFlow being representative achievements of one-step generation based on Ordinary Differential Equations (ODEs). We observe that the 28-layer Transformer architecture of FreeFlow can be characterized as an Euler discretization scheme for an ODE along the depth axis, where the layer index serves as the discrete time step. Therefore, we distill the number of layers of the FreeFlow model, following the same derivation logic as FreeFlow, and propose SLT (Single-Layer Transformer), which uses a single shared DiT block to approximate the depth-wise feature evolution of the 28-layer teacher. During training, it matches the teacher's intermediate features at several depth patches, fuses those patch-level representations, and simultaneously aligns the teacher's final velocity prediction. Through distillation training, we compress the 28 independent Transformer Blocks of the teacher model DiT-XL/2 into a single Transformer Block, reducing the parameter count from 675M to 4.3M. Furthermore, leveraging its minimal parameters and rapid sampling speed, SLT can screen more candidate points in the noise space within the same timeframe, thereby selecting higher-quality initial points for the teacher model FreeFlow and ultimately enhancing the quality of generated images. Experimental results demonstrate that within a time budget comparable to two random samplings of the teacher model, our method performs over 100 noise screenings and produces a high-quality sample through the teacher model using the selected points. Quality fluctuations caused by low-quality initial noise under a limited number of FreeFlow sampling calls are effectively avoided, substantially improving the stability and average generation quality of one-step generation.

preprint2016arXiv

Efficient Image Reconstruction and Practical Decomposition for Dual-energy Computed Tomography

Dual-energy computed tomography (DECT) has shown great potential and promising applications in advanced imaging fields for its capabilities of material decomposition. However, image reconstructions and decompositions under sparse views dataset suffers severely from multi factors, such as insufficiencies of data, appearances of noise, and inconsistencies of observations. Under sparse views, conventional filtered back-projection type reconstruction methods fails to provide CT images with satisfying quality. Moreover, direct image decomposition is unstable and meet with noise boost even with full views dataset. This paper proposes an iterative image reconstruction algorithm and a practical image domain decomposition method for DECT. On one hand, the reconstruction algorithm is formulated as an optimization problem, which containing total variation regularization term and data fidelity term. The alternating direction method is utilized to design the corresponding algorithm which shows faster convergence speed compared with the existing ones. On the other hand, the image domain decomposition applies the penalized least square (PLS) estimation on decomposing the material mappings. The PLS includes linear combination term and the regularization term which enforces the smoothness on estimation images. The authors implement and evaluate the proposed joint method on real DECT projections and compare the method with typical and state-of-the-art reconstruction and decomposition methods. The experiments on dataset of an anthropomorphic head phantom show that our methods have advantages on noise suppression and edge reservation, without blurring the fine structures in the sinus area in the phantom. Compared to the existing approaches, our method achieves a superior performance on DECT imaging with respect to reconstruction accuracy and decomposition quality.