Source author record

Zhiyuan Zhang

Zhiyuan Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

35works

24topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Generating synthetic computed tomography for radiotherapy: SynthRAD2025 challenge report

Radiation therapy (RT) requires precise dose delivery over multiple fractions, with CT fundamental for treatment planning due to its electron density information. Repeated CT acquisitions impose radiation exposure and logistical burdens, MRI lacks electron density, and cone-beam CT (CBCT) requires correction for dose calculation. Synthetic CT (sCT) generation addresses these by converting MRI or CBCT into CT-equivalent images with accurate Hounsfield Unit (HU) values, enabling MRI-only RT and CBCT-based adaptive workflows. Building on SynthRAD2023, SynthRAD2025 benchmarked sCT methods on 2,362 patients from five European centers across head and neck, thorax, and abdomen. Two tasks: MRI-to-CT (890 cases) and CBCT-to-CT (1,472 cases), evaluated via image similarity (MAE, PSNR, MS-SSIM), segmentation (Dice, HD95), and dosimetric metrics from photon and proton plans. With 803 participants and 12/13 valid submissions, Task 1 top performance reached MAE $64.8\pm21.3$ HU, PSNR $\sim$30 dB, MS-SSIM $\sim$0.936, Dice 0.79, photon $γ_{2\%/2\text{mm}}>98\%$, proton $γ\approx85\%$. Task 2 improved: MAE $48.3\pm13.4$ HU, PSNR 32.6 dB, MS-SSIM 0.968, Dice 0.86, photon $γ>99\%$, proton $γ\approx89\%$. Strong image--segmentation correlations ($ρ=0.78$--$0.79$) but moderate dose correlations confirmed image quality is insufficient as a dosimetric surrogate. Head-and-neck cases were most consistent; thoracic and abdominal cases showed greater variability. Residual errors at tissue interfaces propagate along beam paths, affecting proton dose more than photon. SynthRAD2025 demonstrates that deep learning yields clinically relevant sCTs, especially for CBCT-to-CT, while identifying persistent MRI-to-CT challenges and underscoring dose-based evaluation as essential for clinical validation.

preprint2026arXiv

ManiFeel: Benchmarking and Understanding Visuotactile Manipulation Policy Learning

Supervised visuomotor policies have shown strong performance in robotic manipulation but often struggle in tasks with limited visual inputs, such as operations in confined spaces and dimly lit environments, or tasks requiring precise perception of object properties and environmental interactions. In such cases, tactile feedback becomes essential for manipulation. While the rapid progress of supervised visuomotor policies has benefited greatly from high-quality, reproducible simulation benchmarks in visual imitation, the visuotactile domain still lacks a similarly comprehensive and reliable benchmark for large-scale and rigorous evaluation. To address this, we introduce ManiFeel, a reproducible and scalable simulation benchmark designed to systematically study supervised visuotactile policy learning. ManiFeel offers a diverse suite of contact-rich and visually challenging manipulation tasks, a modular evaluation pipeline spanning sensing modalities, tactile representations, and policy architectures, as well as real-world validation. Through extensive experiments, ManiFeel demonstrates how tactile sensing enhances policy performance across diverse manipulation scenarios, ranging from precise contact-driven operations to visually constrained settings. In addition, the results reveal task-dependent strengths of different tactile modalities and identify key design principles and open challenges for robust visuotactile policy learning. Real-world evaluations further confirm that ManiFeel provides a reliable and meaningful foundation for benchmarking and future visuotactile policy development. To foster reproducibility and future research, we will release our codebase, datasets, training logs, and pretrained checkpoints, aiming to accelerate progress toward generalizable visuotactile policy learning and manipulation.

preprint2026arXiv

Uncertainty-Aware Trip Purpose Inference from GPS Trajectories via POI Semantic Zones and Pareto Calibration

Large-scale GPS trajectory data offer rich observations of human mobility, yet assigning trip purposes to detected stops remains challenging due to the absence of individual-level ground truth, spatial uncertainty from GPS noise and incomplete points of interest (POIs) coverage, and fundamental behavioral differences across trip purposes. We propose a weakly supervised framework integrating neighborhood-level POI semantic zones with distance-weighted spatial likelihoods, differentiated inference strategies for mandatory and non-mandatory activities, and a multi-phase Pareto optimization that jointly minimizes distributional divergence from household travel survey statistics and maximizes inference reliability without requiring annotated labels. Evaluated on over 81 million staypoints in Los Angeles, the framework reduces activity type frequency Jensen-Shannon distance (JSD) by 23%, start time JSD by 48%, and duration JSD by 12% respectively relative to a comparable baseline. The proposed approach provides a scalable and uncertainty-aware path from raw GPS trajectories to semantically annotated mobility data for travel demand modeling and transportation policy analysis.

preprint2024arXiv

Clique counts for network similarity

Counts of small subgraphs, or graphlet counts, are widely applicable to measure graph similarity. Computing graphlet counts can be computationally expensive and may pose obstacles in network analysis. We study the role of cliques in graphlet counts as a method for graph similarity in social networks. Higher-order clustering coefficients and the Pivoter algorithm for exact clique counts are employed

preprint2022arXiv

$C^1$ actions on manifolds by lattices in Lie groups

In this paper we study Zimmer's conjecture for $C^1$ actions of lattice subgroup of a higher-rank simple Lie group with finite center on compact manifolds. We show that when the rank of an uniform lattice is larger than the dimension of the manifold, then the action factors through a finite group. For lattices in $SL(n, \R)$, the dimensional bound is sharp.

preprint2022arXiv

A Representation Separation Perspective to Correspondences-free Unsupervised 3D Point Cloud Registration

3D point cloud registration in remote sensing field has been greatly advanced by deep learning based methods, where the rigid transformation is either directly regressed from the two point clouds (correspondences-free approaches) or computed from the learned correspondences (correspondences-based approaches). Existing correspondences-free methods generally learn the holistic representation of the entire point cloud, which is fragile for partial and noisy point clouds. In this paper, we propose a correspondences-free unsupervised point cloud registration (UPCR) method from the representation separation perspective. First, we model the input point cloud as a combination of pose-invariant representation and pose-related representation. Second, the pose-related representation is used to learn the relative pose wrt a "latent canonical shape" for the source and target point clouds respectively. Third, the rigid transformation is obtained from the above two learned relative poses. Our method not only filters out the disturbance in pose-invariant representation but also is robust to partial-to-partial point clouds or noise. Experiments on benchmark datasets demonstrate that our unsupervised method achieves comparable if not better performance than state-of-the-art supervised registration methods.

preprint2022arXiv

A spectral condition for the existence of cycles with consecutive odd lengths in non-bipartite graphs

A graph $G$ is called $H$-free, if it does not contain $H$ as a subgraph. In 2010, Nikiforov proposed a Brualdi-Solheid-Turán type problem: what is the maximum spectral radius of an $H$-free graph of order $n$? In this paper, we consider the Brualdi-Solheid-Turán type problem for non-bipartite graphs. Let $K_{a, b}\bullet K_3$ denote the graph obtained by identifying a vertex of $K_{a,b}$ in the part of size $b$ and a vertex of $K_3$. We prove that if $G$ is a non-bipartite graph of order $n$ satisfying $ρ(G)\geq ρ(K_{\lceil\frac{n-2}{2}\rceil, \lfloor\frac{n-2}{2}\rfloor}\bullet K_3)$, then $G$ contains all odd cycles $C_{2l+1}$ for each integer $l\in[2,k]$ unless $G\cong K_{\lceil\frac{n-2}{2}\rceil, \lfloor\frac{n-2}{2}\rfloor}\bullet K_3$, provided that $n$ is sufficiently large with respect to $k$. This resolves the problem posed by Guo, Lin and Zhao (2021).

preprint2022arXiv

A Ta-TaS2 monolithic catalyst with robust and metallic interface for superior hydrogen evolution

The use of highly active and robust catalysts is crucial for producing green hydrogen by water electrolysis as we strive to achieve global carbon neutrality. Noble metals like platinum are currently used in industry for the hydrogen evolution reaction (HER), but suffer from scarcity, high price and unsatisfied performance and stability at large current density, restricting their large scale implementations. Here we report the synthesis of a new type of monolithic catalyst (MC) consisting of a metal disulfide (e.g., TaS2) catalyst vertically bonded to a conductive substrate of the same metal by strong covalent bonds. These features give the MC a mechanically robust and electrically near zero resistance interface, leading to an outstanding HER performance including rapid charge transfer and excellent durability, together with a low overpotential of 398 mV to achieve a current density of 2,000 mA cm-2 as required by industry. The Ta TaS2 MC has a negligible performance decay after 200 h operation at large current densities. In light of its unique interface and the various choice of metal elements giving the same structure, such monolithic materials may have broad uses besides catalysis.

preprint2022arXiv

Adversarial Parameter Defense by Multi-Step Risk Minimization

Previous studies demonstrate DNNs' vulnerability to adversarial examples and adversarial training can establish a defense to adversarial examples. In addition, recent studies show that deep neural networks also exhibit vulnerability to parameter corruptions. The vulnerability of model parameters is of crucial value to the study of model robustness and generalization. In this work, we introduce the concept of parameter corruption and propose to leverage the loss change indicators for measuring the flatness of the loss basin and the parameter robustness of neural network parameters. On such basis, we analyze parameter corruptions and propose the multi-step adversarial corruption algorithm. To enhance neural networks, we propose the adversarial parameter defense algorithm that minimizes the average risk of multiple adversarial parameter corruptions. Experimental results show that the proposed algorithm can improve both the parameter robustness and accuracy of neural networks.

preprint2022arXiv

Context-Aware Video Reconstruction for Rolling Shutter Cameras

With the ubiquity of rolling shutter (RS) cameras, it is becoming increasingly attractive to recover the latent global shutter (GS) video from two consecutive RS frames, which also places a higher demand on realism. Existing solutions, using deep neural networks or optimization, achieve promising performance. However, these methods generate intermediate GS frames through image warping based on the RS model, which inevitably result in black holes and noticeable motion artifacts. In this paper, we alleviate these issues by proposing a context-aware GS video reconstruction architecture. It facilitates the advantages such as occlusion reasoning, motion compensation, and temporal abstraction. Specifically, we first estimate the bilateral motion field so that the pixels of the two RS frames are warped to a common GS frame accordingly. Then, a refinement scheme is proposed to guide the GS frame synthesis along with bilateral occlusion masks to produce high-fidelity GS video frames at arbitrary times. Furthermore, we derive an approximated bilateral motion field model, which can serve as an alternative to provide a simple but effective GS frame initialization for related tasks. Experiments on synthetic and real data show that our approach achieves superior performance over state-of-the-art methods in terms of objective metrics and subjective visual quality. Code is available at \url{https://github.com/GitCVfb/CVR}.

preprint2022arXiv

CVFNet: Real-time 3D Object Detection by Learning Cross View Features

In recent years 3D object detection from LiDAR point clouds has made great progress thanks to the development of deep learning technologies. Although voxel or point based methods are popular in 3D object detection, they usually involve time-consuming operations such as 3D convolutions on voxels or ball query among points, making the resulting network inappropriate for time critical applications. On the other hand, 2D view-based methods feature high computing efficiency while usually obtaining inferior performance than the voxel or point based methods. In this work, we present a real-time view-based single stage 3D object detector, namely CVFNet to fulfill this task. To strengthen the cross-view feature learning under the condition of demanding efficiency, our framework extracts the features of different views and fuses them in an efficient progressive way. We first propose a novel Point-Range feature fusion module that deeply integrates point and range view features in multiple stages. Then, a special Slice Pillar is designed to well maintain the 3D geometry when transforming the obtained deep point-view features into bird's eye view. To better balance the ratio of samples, a sparse pillar detection head is presented to focus the detection on the nonempty grids. We conduct experiments on the popular KITTI and NuScenes benchmark, and state-of-the-art performances are achieved in terms of both accuracy and speed.

preprint2022arXiv

Distributional Correlation--Aware Knowledge Distillation for Stock Trading Volume Prediction

Traditional knowledge distillation in classification problems transfers the knowledge via class correlations in the soft label produced by teacher models, which are not available in regression problems like stock trading volume prediction. To remedy this, we present a novel distillation framework for training a light-weight student model to perform trading volume prediction given historical transaction data. Specifically, we turn the regression model into a probabilistic forecasting model, by training models to predict a Gaussian distribution to which the trading volume belongs. The student model can thus learn from the teacher at a more informative distributional level, by matching its predicted distributions to that of the teacher. Two correlational distillation objectives are further introduced to encourage the student to produce consistent pair-wise relationships with the teacher model. We evaluate the framework on a real-world stock volume dataset with two different time window settings. Experiments demonstrate that our framework is superior to strong baseline models, compressing the model size by $5\times$ while maintaining $99.6\%$ prediction accuracy. The extensive analysis further reveals that our framework is more effective than vanilla distillation methods under low-resource scenarios.

preprint2022arXiv

End-to-end Learning the Partial Permutation Matrix for Robust 3D Point Cloud Registration

Even though considerable progress has been made in deep learning-based 3D point cloud processing, how to obtain accurate correspondences for robust registration remains a major challenge because existing hard assignment methods cannot deal with outliers naturally. Alternatively, the soft matching-based methods have been proposed to learn the matching probability rather than hard assignment. However, in this paper, we prove that these methods have an inherent ambiguity causing many deceptive correspondences. To address the above challenges, we propose to learn a partial permutation matching matrix, which does not assign corresponding points to outliers, and implements hard assignment to prevent ambiguity. However, this proposal poses two new problems, i.e., existing hard assignment algorithms can only solve a full rank permutation matrix rather than a partial permutation matrix, and this desired matrix is defined in the discrete space, which is non-differentiable. In response, we design a dedicated soft-to-hard (S2H) matching procedure within the registration pipeline consisting of two steps: solving the soft matching matrix (S-step) and projecting this soft matrix to the partial permutation matrix (H-step). Specifically, we augment the profit matrix before the hard assignment to solve an augmented permutation matrix, which is cropped to achieve the final partial permutation matrix. Moreover, to guarantee end-to-end learning, we supervise the learned partial permutation matrix but propagate the gradient to the soft matrix instead. Our S2H matching procedure can be easily integrated with existing registration frameworks, which has been verified in representative frameworks including DCP, RPMNet, and DGR. Extensive experiments have validated our method, which creates a new state-of-the-art performance for robust 3D point cloud registration. The code will be made public.

preprint2022arXiv

First implementation of full-workflow automation in radiotherapy: the All-in-One solution on rectal cancer

The aim of this work is to describe the technical characteristics of an AI-powered radiotherapy workflow that enables full-process automation (All-in-One), evaluate its performance implemented for on-couch initial treatment of rectal cancer, and provide insight into the behavior of full-workflow automation in the specialty of radiotherapy. The All-in-One workflow was developed based on a CT-integrated linear accelerator. It incorporates routine radiotherapy procedures from simulation, autosegmentation, autoplanning, image guidance, beam delivery, and in vivo quality assurance (QA) into one scheme, with critical decision points involved, while the patient is on the treatment couch during the whole process. For the enrolled ten patients with rectal cancer, minor modifications of the autosegmented target volumes were required, and the Dice similarity coefficient and 95% Hausdorff distance before and after modifications were 0.892{\pm}0.061 and 18.2{\pm}13.0 mm, respectively. The autosegmented normal tissues and automatic plans were clinically acceptable without any modifications or reoptimization. The pretreatment IGRT corrections were within 2 mm in all directions, and the EPID-based in vivo QA showed a γ passing rate better than 97{\%} (3{\%}/3 mm/10{\%} threshold). The duration of the whole process was 23.2{\pm}3.5 minutes, depending mostly on the time required for manual modification and plan evaluation. The All-in-One workflow enables full automation of the entire radiotherapy process by seamlessly integrating multiple routine procedures. The one-stop solution shortens the time scale it takes to ready the first treatment from days to minutes, significantly improving the patient experience and the efficiency of the workflow, and shows potential to facilitate the clinical application of online adaptive replanning.

preprint2022arXiv

How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data

Since training a large-scale backdoored model from scratch requires a large training dataset, several recent attacks have considered to inject backdoors into a trained clean model without altering model behaviors on the clean data. Previous work finds that backdoors can be injected into a trained clean model with Adversarial Weight Perturbation (AWP). Here AWPs refers to the variations of parameters that are small in backdoor learning. In this work, we observe an interesting phenomenon that the variations of parameters are always AWPs when tuning the trained clean model to inject backdoors. We further provide theoretical analysis to explain this phenomenon. We formulate the behavior of maintaining accuracy on clean data as the consistency of backdoored models, which includes both global consistency and instance-wise consistency. We extensively analyze the effects of AWPs on the consistency of backdoored models. In order to achieve better consistency, we propose a novel anchoring loss to anchor or freeze the model behaviors on the clean data, with a theoretical guarantee. Both the analytical and the empirical results validate the effectiveness of the anchoring loss in improving the consistency, especially the instance-wise consistency.

preprint2022arXiv

Iterative models for complex networks formed by extending cliques

We consider a new model for complex networks whose underlying mechanism is extending dense subgraphs. In the frustum model, we iteratively extend cliques over discrete-time steps. For many choices of the underlying parameters, graphs generated by the model densify over time. In the special case of the cone model, generated graphs provably satisfy properties observed in real-world complex networks such as the small world property and bad spectral expansion. We finish with a set of open problems and next steps for the frustum model.

preprint2022arXiv

On bifurcation of statistical properties of partially hyperbolic endomorphisms

We give an example of a path-wise connected open set of $C^\infty$ partially hyperbolic endomorphisms on the $2$-torus, on which the SRB measure exists for each system and varies smoothly depending on the system, while the sign of its central Lyapunov exponent does change.

preprint2022arXiv

PKUSEG: A Toolkit for Multi-Domain Chinese Word Segmentation

Chinese word segmentation (CWS) is a fundamental step of Chinese natural language processing. In this paper, we build a new toolkit, named PKUSEG, for multi-domain word segmentation. Unlike existing single-model toolkits, PKUSEG targets multi-domain word segmentation and provides separate models for different domains, such as web, medicine, and tourism. Besides, due to the lack of labeled data in many domains, we propose a domain adaptation paradigm to introduce cross-domain semantic knowledge via a translation system. Through this method, we generate synthetic data using a large amount of unlabeled data in the target domain and then obtain a word segmentation model for the target domain. We also further refine the performance of the default model with the help of synthetic data. Experiments show that PKUSEG achieves high performance on multiple domains. The new toolkit also supports POS tagging and model training to adapt to various application scenarios. The toolkit is now freely and publicly available for the usage of research and industry.

preprint2022arXiv

Recent Advances in Design of Electrocatalysts for High-Current-Density Water Splitting

Electrochemical water splitting technology for producing "green hydrogen" is important for the global mission of carbon neutrality. Electrocatalysts with decent performance at high current densities play a central role in the industrial implementation of this technology. The field has advanced immensely in recent years, as witnessed by many types of catalysts have been designed and synthesized which work at industrially-relevant current densities (> 200 mA cm-2). Note that the activity and stability of catalysts can be influenced by their local reaction environment, which are closely related to the current density. By discussing recent advances in this field, we summarize several key aspects that affect the catalytic performance for high-current-density electrocatalysis, including dimensionality of catalysts, surface chemistry, electron transport path, morphology, and catalyst-electrolyte interplay. We highlight the multiscale design strategy that considers these aspects comprehensively for developing high-current-density catalysts. We also put forward out perspectives on the future directions in this emerging field.

preprint2022arXiv

RIConv++: Effective Rotation Invariant Convolutions for 3D Point Clouds Deep Learning

3D point clouds deep learning is a promising field of research that allows a neural network to learn features of point clouds directly, making it a robust tool for solving 3D scene understanding tasks. While recent works show that point cloud convolutions can be invariant to translation and point permutation, investigations of the rotation invariance property for point cloud convolution has been so far scarce. Some existing methods perform point cloud convolutions with rotation-invariant features, existing methods generally do not perform as well as translation-invariant only counterpart. In this work, we argue that a key reason is that compared to point coordinates, rotation-invariant features consumed by point cloud convolution are not as distinctive. To address this problem, we propose a simple yet effective convolution operator that enhances feature distinction by designing powerful rotation invariant features from the local regions. We consider the relationship between the point of interest and its neighbors as well as the internal relationship of the neighbors to largely improve the feature descriptiveness. Our network architecture can capture both local and global context by simply tuning the neighborhood size in each convolution layer. We conduct several experiments on synthetic and real-world point cloud classifications, part segmentation, and shape retrieval to evaluate our method, which achieves the state-of-the-art accuracy under challenging rotations.

preprint2022arXiv

VRNet: Learning the Rectified Virtual Corresponding Points for 3D Point Cloud Registration

3D point cloud registration is fragile to outliers, which are labeled as the points without corresponding points. To handle this problem, a widely adopted strategy is to estimate the relative pose based only on some accurate correspondences, which is achieved by building correspondences on the identified inliers or by selecting reliable ones. However, these approaches are usually complicated and time-consuming. By contrast, the virtual point-based methods learn the virtual corresponding points (VCPs) for all source points uniformly without distinguishing the outliers and the inliers. Although this strategy is time-efficient, the learned VCPs usually exhibit serious collapse degeneration due to insufficient supervision and the inherent distribution limitation. In this paper, we propose to exploit the best of both worlds and present a novel robust 3D point cloud registration framework. We follow the idea of the virtual point-based methods but learn a new type of virtual points called rectified virtual corresponding points (RCPs), which are defined as the point set with the same shape as the source and with the same pose as the target. Hence, a pair of consistent point clouds, i.e. source and RCPs, is formed by rectifying VCPs to RCPs (VRNet), through which reliable correspondences between source and RCPs can be accurately obtained. Since the relative pose between source and RCPs is the same as the relative pose between source and target, the input point clouds can be registered naturally. Specifically, we first construct the initial VCPs by using an estimated soft matching matrix to perform a weighted average on the target points. Then, we design a correction-walk module to learn an offset to rectify VCPs to RCPs, which effectively breaks the distribution limitation of VCPs. Finally, we develop a hybrid loss function to enforce the shape and geometry structure consistency ...

preprint2021arXiv

Spatial Structure Engineering in Enhancing Performance of Mosaic Electrocatalysts

Understanding the mechanism and developing strategies toward efficient electrocatalysis at gas-liquidsolid interfaces are important yet challenging. In the past decades, researchers have devoted many efforts to improving catalyst activity by modulating electronic properties of catalysts in terms of chemical components and physical features. Here we develop a mosaic catalyst strategy to improve activity of electrocatalysts by engineering their spatial structures. Taking Pt catalyst as an example, the mosaic Pt leads to high catalytic performance, showing a specific activity 11 times higher than uniform Pt films for hydrogen evolution reaction (HER), as well as higher current densities than commercial Pt/C and uniform Pt films. Such a strategy is found to be general to other catalysts (e.g., twodimensional PtS) and other reactions (e.g., oxygen evolution reaction). The improved catalytic performance of the mosaic catalysts is attributed to enhanced mass transferability and local electric field, both are determined by the occupation ratio of catalysts. Our work shines new light on manipulating electrocatalysis from the perspective of the spatial structure of catalyst, which would guide the design of efficient catalysts for heterogeneous reactions.

preprint2020arXiv

$C^{r}-$prevalence of stable ergodicity for a class of partially hyperbolic systems

We prove that for $r \in \mathbb{N}_{\geq 2} \cup \{\infty\}$, for any dynamically coherent, center bunched and strongly pinched volume preserving $C^r$ partially hyperbolic diffeomorphism $f \colon X \to X$, if either (1) its center foliation is uniformly compact, or (2) its center-stable and center-unstable foliations are of class $C^1$, then there exists a $C^1$-open neighbourhood of $f$ in ${\rm Diff}^r(X,\mathrm{Vol})$, in which stable ergodicity is $C^r$-prevalent in Kolmogorov's sense. In particular, we verify Pugh-Shub's stable ergodicity conjecture in this region. This also provides the first result that verifies the prevalence of stable ergodicity in the measure-theoretical sense. Our theorem applies to a large class of algebraic systems. As applications, we give affirmative answers in the strongly pinched region to: 1. an open question of Pugh-Shub in \cite{PS}; 2. a generic version of an open question of Hirsch-Pugh-Shub in \cite{HPS}; and 3. a generic version of an open question of Pugh-Shub in \cite{HPS}.

preprint2020arXiv

A short-range metastable defect in the double layer ice

Although the phase of water has extensively investigated whether there exists a defect distorting only locally the structure still under debate. Here we report a localized 5775 defect phase presented in the double layer ice on the Au (111) surface, which is a metastable structure with 5- and 7-membered rings compared with a perfect hexagonal one. Without altering the total number of the hydrogen bonds of the ice, the defect only introduces 0.08 Å molecular displacement and 3.27% interaction energy change outside the defected area. Such defect also exists without Au support but causes a larger lattice relaxation or smaller interaction energy change. The excessively high barrier as well as the low quantum tunneling and thermodynamic probabilities hinder the formation of the defect by post-grown isomerization from the perfect to the defected structure. This finding indicates that the defected ice is stable, and the defect can be formed during the ice growth stage.

preprint2020arXiv

Global Context Aware Convolutions for 3D Point Cloud Understanding

Recent advances in deep learning for 3D point clouds have shown great promises in scene understanding tasks thanks to the introduction of convolution operators to consume 3D point clouds directly in a neural network. Point cloud data, however, could have arbitrary rotations, especially those acquired from 3D scanning. Recent works show that it is possible to design point cloud convolutions with rotation invariance property, but such methods generally do not perform as well as translation-invariant only convolution. We found that a key reason is that compared to point coordinates, rotation-invariant features consumed by point cloud convolution are not as distinctive. To address this problem, we propose a novel convolution operator that enhances feature distinction by integrating global context information from the input point cloud to the convolution. To this end, a globally weighted local reference frame is constructed in each point neighborhood in which the local point set is decomposed into bins. Anchor points are generated in each bin to represent global shape features. A convolution can then be performed to transform the points and anchor features into final rotation-invariant features. We conduct several experiments on point cloud classification, part segmentation, shape retrieval, and normals estimation to evaluate our convolution, which achieves state-of-the-art accuracy under challenging rotations.

preprint2020arXiv

High-Throughput Production of Cheap Mineral-Based 2D Electrocatalysts for High-Current-Density Hydrogen Evolution

The high-throughput scalable production of cheap, efficient and durable electrocatalysts that work well at high current densities demanded by industry is a great challenge for the large-scale implementation of electrochemical technologies. Here we report the production of a 2D MoS2-based ink-type electrocatalyst by a scalable top-down exfoliation technique followed by a simple heat treatment. The catalyst shows a high current density of 1000 mA cm^-2 at an overpotential of 454 mV for the hydrogen evolution reaction (HER) without the need of iR correction, as well as good stability over 24 hours. Using the same method, we have, for the first time, produced a cheap MoS2 mineral-based catalyst and found that it had an excellent performance for high-current-density HER. Noteworthy, production rate of this MoS2-based catalyst is one to two orders of magnitude higher than those previously reported. In addition, the price of the MoS2 mineral is five orders of magnitude lower than commercial Pt catalysts, making the MoS2 mineral-based catalyst cheap, and the ink-type catalyst dispersions can be easily integrated with other technologies for large-scale catalyst electrode preparation. These advantages indicate the huge potentials of this method and mineral-based cheap and abundant natural resources as catalysts in the electrochemical technologies.

preprint2020arXiv

Is the Skip Connection Provable to Reform the Neural Network Loss Landscape?

The residual network is now one of the most effective structures in deep learning, which utilizes the skip connections to ``guarantee" the performance will not get worse. However, the non-convexity of the neural network makes it unclear whether the skip connections do provably improve the learning ability since the nonlinearity may create many local minima. In some previous works \cite{freeman2016topology}, it is shown that despite the non-convexity, the loss landscape of the two-layer ReLU network has good properties when the number $m$ of hidden nodes is very large. In this paper, we follow this line to study the topology (sub-level sets) of the loss landscape of deep ReLU neural networks with a skip connection and theoretically prove that the skip connection network inherits the good properties of the two-layer network and skip connections can help to control the connectedness of the sub-level sets, such that any local minima worse than the global minima of some two-layer ReLU network will be very ``shallow". The ``depth" of these local minima are at most $O(m^{(η-1)/n})$, where $n$ is the input dimension, $η<1$. This provides a theoretical explanation for the effectiveness of the skip connection in deep learning.

preprint2020arXiv

Shop The Look: Building a Large Scale Visual Shopping System at Pinterest

As online content becomes ever more visual, the demand for searching by visual queries grows correspondingly stronger. Shop The Look is an online shopping discovery service at Pinterest, leveraging visual search to enable users to find and buy products within an image. In this work, we provide a holistic view of how we built Shop The Look, a shopping oriented visual search system, along with lessons learned from addressing shopping needs. We discuss topics including core technology across object detection and visual embeddings, serving infrastructure for realtime inference, and data labeling methodology for training/evaluation data collection and human evaluation. The user-facing impacts of our system design choices are measured through offline evaluations, human relevance judgements, and online A/B experiments. The collective improvements amount to cumulative relative gains of over 160% in end-to-end human relevance judgements and over 80% in engagement. Shop The Look is deployed in production at Pinterest.

preprint2020arXiv

Smooth mixing Anosov flows in dimension three are exponential mixing

We show that a topologically mixing $C^\infty$ Anosov flow on a 3 dimensional compact manifold is exponential mixing with respect to any equilibrium measure with Holder potential.

preprint2016arXiv

Ballistic Transport and Absolute Continuity of One-Frequency Schrödinger Operators

For the solution $u(t)$ to the discrete Schrödinger equation $${\rm i}\frac{d}{dt}u_n(t)=-(u_{n+1}(t)+u_{n-1}(t))+V(θ+ nα)u_n(t), \quad n\in\Z,$$ with $α\in\R\setminus\Q$ and $V\in C^ω(\T,\R)$, we consider the growth rate with $t$ of its diffusion norm $\langle u(t)\rangle_{p}:=\left(\sum_{n\in\Z}(n^{p}+1) |u_n(t)|^2\right)^\frac12$, and the (non-averaged) transport exponents $$β_u^{+}(p) := \limsup_{t \to \infty} \frac{2\log \langle u(t)\rangle_{p}}{p\log t}, \quad β_u^{-}(p):= \liminf_{t \to \infty} \frac{2\log \langle u(t)\rangle_{p}}{p\log t}.$$ We will show that, if the corresponding Schrödinger operator has purely absolutely continuous spectrum, then $β_{u}^{\pm}(p)=1$, provided that $u(0)$ is well localized.

preprint2016arXiv

Electronic structure contribution to hydrogen bonding interaction of a water dimer

Hydrogen bond (H-bond) covalency has recently been observed in ice and liquid water, while the penetrating molecular orbitals (MOs) in the H-bond region of most typical water dimer system, (H2O)2, have also been discovered. However, obtaining the quantitative contribution of these MOs to the H-bond interaction is still problematic. In this work, we introduced the orbital-resolved electron density projected integral (EDPI) along the H-bond to approach this problem. The calculations show that, surprisingly, the electronic occupied orbital (HOMO-4) of (H2O)2 accounts for about 40% of the electron density at the bond critical point. Moreover, the charge transfer analysis visualizes the electron accumulating effect of the orbital interaction within the H-bond between water molecules, supporting its covalent-like character. Our work expands the classical understanding of H-bond with specific contributions from certain MOs, and will also advance further research into such covalency and offer quantitative electronic structure insights into intermolecular systems.

preprint2015arXiv

A new method of energy calibration of position-sensitive silicon detector

An improved method of energy calibration of position-sensitive silicon detector is presented. Instead of the parabolic function used in traditional method, a new function describing the relation of position and energy is introduced and achieves better energy resolution. For the 8.088 MeV alpha decay of 213Rn calibrated by this new method, the energy resolution is determined to be about 87 keV (FWHM), which is better than the result of the traditional method, 104 keV (FWHM). In addition, different functions can be tried in the new method, which makes the calibration of various detectors with different performances possible.

preprint2015arXiv

On the spectrums of ergodic Schrodinger operators with finitely valued potentials

We show that the measure of the spectrum of Schrödinger operator with potential defined by non-constant function over any minimal aperiodic finite subshift tends to zero, as the coupling constant tends to infinity. We also obtained a quantitative upper bound for the measure of the spectrum. This follows from a result we proved for ergodic Schrödinger operators with finitely valued potentials under two conditions on the recurrence property of the shift. We also show that one of these conditions is necessary for such result.

preprint2013arXiv

Volatility Inference in the Presence of Both Endogenous Time and Microstructure Noise

In this article we consider the volatility inference in the presence of both market microstructure noise and endogenous time. Estimators of the integrated volatility in such a setting are proposed, and their asymptotic properties are studied. Our proposed estimator is compared with the existing popular volatility estimators via numerical studies. The results show that our estimator can have substantially better performance when time endogeneity exists.

preprint2011arXiv

Quantile clocks

Quantile clocks are defined as convolutions of subordinators $L$, with quantile functions of positive random variables. We show that quantile clocks can be chosen to be strictly increasing and continuous and discuss their practical modeling advantages as business activity times in models for asset prices. We show that the marginal distributions of a quantile clock, at each fixed time, equate with the marginal distribution of a single subordinator. Moreover, we show that there are many quantile clocks where one can specify $L$, such that their marginal distributions have a desired law in the class of generalized $s$-self decomposable distributions, and in particular the class of self-decomposable distributions. The development of these results involves elements of distribution theory for specific classes of infinitely divisible random variables and also decompositions of a gamma subordinator, that is of independent interest. As applications, we construct many price models that have continuous trajectories, exhibit volatility clustering and have marginal distributions that are equivalent to those of quite general exponential Lévy price models. In particular, we provide explicit details for continuous processes whose marginals equate with the popular VG, CGMY and NIG price models. We also show how to perfectly sample the marginal distributions of more general classes of convoluted subordinators when $L$ is in a sub-class of generalized gamma convolutions, which is relevant for pricing of European style options.

Zhiyuan Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

35 published item(s)

Generating synthetic computed tomography for radiotherapy: SynthRAD2025 challenge report

ManiFeel: Benchmarking and Understanding Visuotactile Manipulation Policy Learning

Uncertainty-Aware Trip Purpose Inference from GPS Trajectories via POI Semantic Zones and Pareto Calibration

Clique counts for network similarity

$C^1$ actions on manifolds by lattices in Lie groups

A Representation Separation Perspective to Correspondences-free Unsupervised 3D Point Cloud Registration

A spectral condition for the existence of cycles with consecutive odd lengths in non-bipartite graphs

A Ta-TaS2 monolithic catalyst with robust and metallic interface for superior hydrogen evolution

Adversarial Parameter Defense by Multi-Step Risk Minimization

Context-Aware Video Reconstruction for Rolling Shutter Cameras

CVFNet: Real-time 3D Object Detection by Learning Cross View Features

Distributional Correlation--Aware Knowledge Distillation for Stock Trading Volume Prediction

End-to-end Learning the Partial Permutation Matrix for Robust 3D Point Cloud Registration

First implementation of full-workflow automation in radiotherapy: the All-in-One solution on rectal cancer

How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data

Iterative models for complex networks formed by extending cliques

On bifurcation of statistical properties of partially hyperbolic endomorphisms

PKUSEG: A Toolkit for Multi-Domain Chinese Word Segmentation

Recent Advances in Design of Electrocatalysts for High-Current-Density Water Splitting

RIConv++: Effective Rotation Invariant Convolutions for 3D Point Clouds Deep Learning

VRNet: Learning the Rectified Virtual Corresponding Points for 3D Point Cloud Registration

Spatial Structure Engineering in Enhancing Performance of Mosaic Electrocatalysts

$C^{r}-$prevalence of stable ergodicity for a class of partially hyperbolic systems

A short-range metastable defect in the double layer ice

Global Context Aware Convolutions for 3D Point Cloud Understanding

High-Throughput Production of Cheap Mineral-Based 2D Electrocatalysts for High-Current-Density Hydrogen Evolution

Is the Skip Connection Provable to Reform the Neural Network Loss Landscape?

Shop The Look: Building a Large Scale Visual Shopping System at Pinterest

Smooth mixing Anosov flows in dimension three are exponential mixing

Ballistic Transport and Absolute Continuity of One-Frequency Schrödinger Operators

Electronic structure contribution to hydrogen bonding interaction of a water dimer

A new method of energy calibration of position-sensitive silicon detector

On the spectrums of ergodic Schrodinger operators with finitely valued potentials

Volatility Inference in the Presence of Both Endogenous Time and Microstructure Noise

Quantile clocks