Researcher profile

Zeyu Liu

Zeyu Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models

Unified multimodal models (UMMs) integrate visual understanding and generation within a single framework. For text-to-image (T2I) tasks, this unified capability allows UMMs to refine outputs after their initial generation, potentially extending the performance upper bound. Current UMM-based refinement methods primarily follow a refinement-via-editing (RvE) paradigm, where UMMs produce editing instructions to modify misaligned regions while preserving aligned content. However, editing instructions often describe prompt-image misalignment only coarsely, leading to incomplete refinement. Moreover, pixel-level preservation, though necessary for editing, unnecessarily restricts the effective modification space for refinement. To address these limitations, we propose Refinement via Regeneration (RvR), a novel framework that reformulates refinement as conditional image regeneration rather than editing. Instead of relying on editing instructions and enforcing strict content preservation, RvR regenerates images conditioned on the target prompt and the semantic tokens of the initial image, enabling more complete semantic alignment with a larger modification space. Extensive experiments demonstrate the effectiveness of RvR, improving Geneval from 0.78 to 0.91, DPGBench from 84.02 to 87.21, and UniGenBench++ from 61.53 to 77.41.

preprint2022arXiv

A multi view multi stage and multi window framework for pulmonary artery segmentation from CT scans

This is the technical report of the 9th place in the final result of PARSE2022 Challenge. We solve the segmentation problem of the pulmonary artery by using a two-stage method based on a 3D CNN network. The coarse model is used to locate the ROI, and the fine model is used to refine the segmentation result. In addition, in order to improve the segmentation performance, we adopt multi-view and multi-window level method, at the same time we employ a fine-tune strategy to mitigate the impact of inconsistent labeling.

preprint2022arXiv

Ballistic Thermal Transport at Sub-10 nm Laser-Induced Hot Spots in GaN Crystal

Gallium nitride (GaN) is a typical wide-bandgap semiconductor with a critical role in a wide range of electronic applications. Ballistic thermal transport at nanoscale hotspots will greatly reduce the performance of a device when its characteristic length reaches the nanometer scale, due to heat dissipation. In this work, we developed a tip-enhanced Raman thermometry approach to study ballistic thermal transport within the range of 10 nm in GaN, simultaneously achieving laser heating and measuring the local temperature. The Raman results showed that the temperature increase from an Au-coated tip-focused hotspot was up to two times higher (40 K) than that in a bare tip-focused region (20 K). To further investigate the possible mechanisms behind this temperature difference, we performed electromagnetic simulations to generate a highly focused heating field, and observed a highly localized optical penetration, within a range of 10 nm. The phonon mean free path (MFP) of the GaN substrate could thus be determined by comparing the numerical simulation results with the experimentally measured temperature increase which was in good agreement with the average MFP weighted by the mode-specific thermal conductivity, as calculated from first-principles simulations. Our results demonstrate that the phonon MFP of a material can be rapidly predicted through a combination of experiments and simulations, which can find wide application in the thermal management of GaN-based electronics.

preprint2022arXiv

De Rham prismatic crystals over $\mathcal{O}_K$

We study de Rham prismatic crystals on $(\mathcal{O}_K)_{\bboldΔ}$. We show that a de Rham crystal is controlled by a sequence of matrices $\{A_{m,1}\}_{m \geq 0}$ with $A_{0,1}$ "nilpotent". Using this, we prove that the natural functor from de Rham crystals over $(\mathcal{O}_K)_{\bboldΔ}$ to the category of nearly de Rham representations is fully faithful. The key ingredient is a Sen style decompletion theorem for de Rham representations of $G_K$.

preprint2022arXiv

Leveraging Low-Fidelity Data to Improve Machine Learning of Sparse High-Fidelity Thermal Conductivity Data via Transfer Learning

Lattice thermal conductivity (TC) of semiconductors is crucial for various applications, ranging from microelectronics to thermoelectrics. Data-driven approach can potentially establish the critical composition-property relationship needed for fast screening of candidates with desirable TC, but the small number of available data remains the main challenge. TC can be efficiently calculated using empirical models, but they have inferior accuracy compared to the more resource-demanding first-principles calculations. Here, we demonstrate the use of transfer learning (TL) to improve the machine learning models trained on small but high-fidelity TC data from experiments and first-principles calculations, by leveraging a large but low-fidelity data generated from empirical TC models, where the trainings on high- and low-fidelity TC data are treated as different but related tasks. TL improves the model accuracy by as much as 23% in R2 and reduces the average factor difference by as much as 30%. Using the TL model, a large semiconductor database is screened, and several candidates with room temperature TC > 350 W/mK are identified and further verified using first-principles simulations. This study demonstrates that TL can leverage big low-fidelity data as a proxy task to improve models for the target task with high-fidelity but small data. Such a capability of TL may have important implications to materials informatics in general.

preprint2022arXiv

P2M-DeTrack: Processing-in-Pixel-in-Memory for Energy-efficient and Real-Time Multi-Object Detection and Tracking

Today's high resolution, high frame rate cameras in autonomous vehicles generate a large volume of data that needs to be transferred and processed by a downstream processor or machine learning (ML) accelerator to enable intelligent computing tasks, such as multi-object detection and tracking. The massive amount of data transfer incurs significant energy, latency, and bandwidth bottlenecks, which hinders real-time processing. To mitigate this problem, we propose an algorithm-hardware co-design framework called Processing-in-Pixel-in-Memory-based object Detection and Tracking (P2M-DeTrack). P2M-DeTrack is based on a custom faster R-CNN-based model that is distributed partly inside the pixel array (front-end) and partly in a separate FPGA/ASIC (back-end). The proposed front-end in-pixel processing down-samples the input feature maps significantly with judiciously optimized strided convolution and pooling. Compared to a conventional baseline design that transfers frames of RGB pixels to the back-end, the resulting P2M-DeTrack designs reduce the data bandwidth between sensor and back-end by up to 24x. The designs also reduce the sensor and total energy (obtained from in-house circuit simulations at Globalfoundries 22nm technology node) per frame by 5.7x and 1.14x, respectively. Lastly, they reduce the sensing and total frame latency by an estimated 1.7x and 3x, respectively. We evaluate our approach on the multi-object object detection (tracking) task of the large-scale BDD100K dataset and observe only a 0.5% reduction in the mean average precision (0.8% reduction in the identification F1 score) compared to the state-of-the-art.

preprint2022arXiv

Synthesizing Diverse and Physically Stable Grasps with Arbitrary Hand Structures using Differentiable Force Closure Estimator

Existing grasp synthesis methods are either analytical or data-driven. The former one is oftentimes limited to specific application scope. The latter one depends heavily on demonstrations, thus suffers from generalization issues; e.g., models trained with human grasp data would be difficult to transfer to 3-finger grippers. To tackle these deficiencies, we formulate a fast and differentiable force closure estimation method, capable of producing diverse and physically stable grasps with arbitrary hand structures, without any training data. Although force closure has commonly served as a measure of grasp quality, it has not been widely adopted as an optimization objective for grasp synthesis primarily due to its high computational complexity; in comparison, the proposed differentiable method can test a force closure within milliseconds. In experiments, we validate the proposed method's efficacy in 6 different settings.

preprint2019arXiv

Experimental Observation of High Intrinsic Thermal Conductivity of AlN

AlN is an ultra-wide bandgap semiconductor which has been developed for applications including power electronics and optoelectronics. Thermal management of these applications is the key for stable device performance and allowing for long lifetimes. AlN, with its potentially high thermal conductivity, can play an important role serving as a dielectric layer, growth substrate, and heat spreader to improve device performance. However, the intrinsic high thermal conductivity of bulk AlN predicted by theoretical calculations has not been experimentally observed because of the difficulty in producing materials with low vacancy and impurity levels, and other associated defect complexes in AlN which can decrease the thermal conductivity. This work reports the growth of thick AlN layers by MOCVD with an air-pocketed AlN layer and the first experimental observation of intrinsic thermal conductivity from 130 K to 480 K that matches density-function-theory calculations for single crystal AlN, producing some of the highest values ever measured. Detailed material characterizations confirm the high quality of these AlN samples with one or two orders of magnitude lower impurity concentrations than seen in commercially available bulk AlN. Measurements of these commercially available bulk AlN substrates from 80 K to 480 K demonstrated a lower thermal conductivity, as expected. A theoretical thermal model is built to interpret the measured temperature dependent thermal conductivity. Our work demonstrates that it is possible to obtain theoretically high values of thermal conductivity in AlN and such films may impact the thermal management and reliability of future electronic and optoelectronics devices.