Source author record

Yufan Zhang

Yufan Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.mtrl-sci physics.chem-ph Robotics Artificial Intelligence eess.IV Human-Computer Interaction Machine Learning

Catalog footprint

What is connected

7works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A phase field model of the effects of dislocation microstructure on grain boundary motion during recrystallization

The internal energy associated with the defect microstructure of strongly deformed crystals provides an important driving force for grain boundary motion during recrystallization. Typical dislocation microstructures are strongly heterogeneous and this heterogeneity affects the motion of recrystallization boundaries. In this study, a phase field model for microstructure evolution encompassing the evolution of both dislocation densities and grain order parameters is formulated. The model is employed to generate typical dislocation microstructures exhibiting multiscale features such as incidental and geometrically necessary dislocation walls. It is then used to study the motion of recrystallization boundaries in the associated complex defect energy 'landscape'. Results are compared to experimental observations.

preprint2026arXiv

DiLA: Disentangled Latent Action World Models

Latent Action Models (LAMs) enable the learning of world models from unlabeled video by inferring abstract actions between consecutive frames. However, LAMs face a fundamental trade-off between action abstraction and generation fidelity. Existing methods typically circumvent this issue by using two-stage training with pre-trained world models or by limiting predictions to optical flow. In this paper, we introduce DiLA, a novel Disentangled Latent Action world model that aims to resolve this trade-off via content-structure disentanglement. Our key insight is that disentanglement and latent action learning are co-evolving: the predictive bottleneck inherent in latent action learning serves as a driving force for disentanglement, compelling the model to distill spatial layouts into the structure pathway while offloading visual details to a separate content pathway for generation. This synergy yields a continuous, semantically structured latent action space without compromising generative quality. DiLA achieves superior results in video generation quality, action transfer, visual planning, and manifold interpretability. These findings establish DiLA as a unified framework that simultaneously achieves high-level action abstraction and high-fidelity generation, advancing the frontier of self-supervised world model learning.

preprint2026arXiv

P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty

This paper presents P2U-SLAM, a visual Simultaneous Localization And Mapping (SLAM) system with a wide Field of View (FoV) camera, which utilizes pose uncertainty and point uncertainty. While the wide FoV enables considerable repetitive observations of historical map points for matching cross-view features, the data properties of the historical map points and the poses of historical keyframes have changed during the optimization process. The neglect of data property changes results in the lack of partial information matrices in optimization, increasing the risk of long-term positioning performance degradation. The purpose of our research is to mitigate the risks posed by wide-FoV visual input to the SLAM system. Based on the conditional probability model, this work reveals the definite impacts of the above data properties changes on the optimization process, concretizes these impacts as point uncertainty and pose uncertainty, and gives their specific mathematical form. P2U-SLAM embeds point uncertainty into the tracking module and pose uncertainty into the local mapping module respectively, and updates these uncertainties after each optimization operation including local mapping, map merging, and loop closing. We present an exhaustive evaluation on 27 sequences from two popular public datasets with wide-FoV visual input. P2U-SLAM shows excellent performance compared with other state-of-the-art methods. The source code will be made publicly available at https://github.com/BambValley/P2U-SLAM.

preprint2022arXiv

DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation

The further development of deep neural networks is hampered by the limited GPU memory resource. Therefore, the optimization of GPU memory resources is highly demanded. Swapping and recomputation are commonly applied to make better use of GPU memory in deep learning. However, as an emerging domain, several challenges remain:1)The efficiency of recomputation is limited for both static and dynamic methods. 2)Swapping requires offloading parameters manually, which incurs a great time cost. 3) There is no such dynamic and fine-grained method that involves tensor swapping together with tensor recomputation nowadays. To remedy the above issues, we propose a novel scheduler manager named DELTA(Dynamic tEnsor offLoad and recompuTAtion). To the best of our knowledge, we are the first to make a reasonable dynamic runtime scheduler on the combination of tensor swapping and tensor recomputation without user oversight. In DELTA, we propose a filter algorithm to select the optimal tensors to be released out of GPU memory and present a director algorithm to select a proper action for each of these tensors. Furthermore, prefetching and overlapping are deliberately considered to overcome the time cost caused by swapping and recomputing tensors. Experimental results show that DELTA not only saves 40%-70% of GPU memory, surpassing the state-of-the-art method to a great extent but also gets comparable convergence results as the baseline with acceptable time delay. Also, DELTA gains 2.04$\times$ maximum batchsize when training ResNet-50 and 2.25$\times$ when training ResNet-101 compared with the baseline. Besides, comparisons between the swapping cost and recomputation cost in our experiments demonstrate the importance of making a reasonable dynamic scheduler on tensor swapping and tensor recomputation, which refutes the arguments in some related work that swapping should be the first and best choice.

preprint2022arXiv

Visualizing Non-Fungible Token Ethics: A Case Study On CryptoPunks

As a blockchain-based application, Non-Fungible Token (NFT) has received worldwide attention over the past few years. Digital artwork is the main form of NFT that can be stored on different blockchains. Although the NFT market is rapidly developing, we observed potential ethical and racial fairness issues in the design of NFT artworks due to a lack of ethical guidelines or censorship. Therefore, we investigated CryptoPunks, the most famous collection in the NFT market, to explore and visualize its potential ethical issues. We explored the ethical issues from three aspects: design, trading transactions, and related topics on Twitter. We scraped data from Twitter and Dune Analytics using python libraries, Twitter crawler, and sentiment analysis tools. Our five visualizations implied that 1.6 times more male punks were created in the initial design process than the female ones. And the male ones have a higher average selling price than females; lighter-skinned punks tend to sell for higher prices. The results of our study and visualizations provide a preliminary exploration of CryptoPunks and further inspire future ethical-related investigation and research in the NFT domain.

preprint2020arXiv

Enforced freedom: electric-field-induced declustering of ionic-liquid ions in the double layer

Whereas the majority of ions in the bulk of a solvent-free ionic liquid is bound into clusters, this is expected to change in the electrical double layer (EDL), in which the resulting electric field 'prefers' to interact with electrical monopoles-free, unclustered ions. The competition between the propensity of ions to stay in a clustered state and the reduction of the energy of ions in electric field in the free state determines the resulting portion of free ions in the EDL. We present a study of this effect, based on the simplest possible mean-field theory. 'Cracking' of ion clusters into individual ions in electric field is accompanied by the change of the dielectric response of ionic liquid which is different in clustered and unclustered states. The predictions of the theory are verified and further explored by specially performed molecular dynamics simulations. A particular finding of the theory is that the differential capacitance vs potential curve displays a bell shape despite low concentration of free charge carriers, because the dielectric response of bound ions reduces the threshold concentration of the bell- to camel-shape transition. Whereas qualitatively these findings make perfect sense, in reality the exact numbers and criteria might be different as the presented simple theory does not take into account overscreening and oscillating charge and electrostatic potential distributions near the electrode. This is why testing the theory with computer simulations is essential, but the latter basically reproduce the qualitative conclusions of the theory.

preprint2016arXiv

The Anion Effect on Li+ Ion Coordination Structure in Ethylene Carbonate Solutions

Rechargeable lithium ion batteries are an attractive alternative power source for a wide variety of applications. To optimize their performances, a complete description of the solvation properties of the ion in the electrolyte is crucial. A comprehensive understanding at the nanoscale of the solvation structure of lithium ions in nonaqueous carbonate electrolytes is, however, still unclear. We have measured by femtosecond vibrational spectroscopy the orientational correlation time of the CO stretching mode of Li+-bound and Li+-unbound ethylene carbonate molecules, in LiBF4, LiPF6, and LiClO4 ethylene carbonate solutions with different concentrations. Surprisingly, we have found that the coordination number of ethylene carbonate in the first solvation shell of Li+ is only two, in all solutions with concentrations higher than 0.5 M. Density functional theory calculations indicate that the presence of anions in the first coordination shell modifies the generally accepted tetrahedral structure of the complex, allowing only two EC molecules to coordinate to Li+ directly. Our results demonstrate for the first time, to the best of our knowledge, the anion influence on the overall structure of the first solvation shell of the Li+ ion. The formation of such a cation/solvent/anion complex provides a rational explanation for the ionic conductivity drop of lithium/carbonate electrolyte solutions at high concentrations.

Yufan Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

A phase field model of the effects of dislocation microstructure on grain boundary motion during recrystallization

DiLA: Disentangled Latent Action World Models

P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty

DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation

Visualizing Non-Fungible Token Ethics: A Case Study On CryptoPunks

Enforced freedom: electric-field-induced declustering of ionic-liquid ions in the double layer

The Anion Effect on Li+ Ion Coordination Structure in Ethylene Carbonate Solutions