Source author record

Wei Zeng

Wei Zeng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

21works

21topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning

Text-to-Visualization (Text2VIS) enables users to create visualizations from natural language queries, making data insights more accessible. However, Text2VIS faces challenges in interpreting ambiguous queries, as users often express their visualization needs in imprecise language. To address this challenge, we introduce nBench 2.0, a new benchmark designed to evaluate Text2VIS systems in scenarios involving ambiguous queries. nvBench 2.0 includes 7,878 natural language queries and 24,076 corresponding visualizations, derived from 780 tables across 153 domains. It is built using a controlled ambiguity-injection pipeline that generates ambiguous queries through a reverse-generation workflow. By starting with unambiguous seed visualizations and selectively injecting ambiguities, the pipeline yields multiple valid interpretations for each query, with each ambiguous query traceable to its corresponding visualization through step-wise reasoning paths. We evaluate various Large Language Models (LLMs) on their ability to perform ambiguous Text2VIS tasks using nBench 2.0. We also propose Step-Text2Vis, an LLM-based model trained on nvBench 2.0, which enhances performance in ambiguous scenarios through step-wise preference optimization. Our results show that Step-Text2Vis outperforms all baselines, setting a new state-of-the-art for ambiguous Text2VIS tasks. Our source code and data are available at https://nvbench2.github.io/

preprint2026arXiv

Uncertainty-Guided Dual-Domain Learning for Reliable Skin Lesion Segmentation

Accurate skin lesion segmentation is vital for dermoscopic Computer-Aided Diagnosis. However, visual ambiguity and morphological irregularity often defeat spatial modeling, necessitating multi-domain architectures. Existing paradigms frequently overlook the active use of prediction uncertainty, leading to deterministic frameworks that suffer from blind cross-domain fusion and overfit to label noise. To address these issues, we propose the Uncertainty-Guided Dual-Domain Network (UGDD-Net). UGDD-Net introduces a novel "Glance-and-Gaze" mechanism to transform uncertainty into an active guiding signal. Specifically, the Uncertainty-Guided Bi-directional Feature Fusion (UGBFF) module uses pixel-level uncertainty to modulate spatial-spectral interactions. The Uncertainty-Guided Graph Refinement (UGGR) module constructs a topology-aware graph to propagate reliable semantic consensus and refine uncertain nodes. Finally, the Uncertainty-Guided Margin-Adaptive Loss (UGML) enforces strict constraints on confident pixels while relaxing penalties on uncertain ones to improve statistical calibration. Extensive experiments on ISIC2017, ISIC2018, PH2, and HAM10000 datasets demonstrate that UGDD-Net achieves state-of-the-art performance, especially on "Hard Samples". Our uncertainty maps align with expert inter-observer variability, providing robust interpretability for human-machine collaborative diagnosis.

preprint2025arXiv

Ultrahigh-Energy Gamma-ray Emission Associated with Black Hole-Jet Systems

Black holes (BH), one of the most intriguing objects in the universe, can manifest themselves through electromagnetic radiation initiated by the accretion flow. Some stellar-mass BHs drive relativistic jets when accreting matter from their companion stars, forming microquasars. Non-thermal emission from the radio to tera-electronvolt (TeV) gamma-ray band has been observed from microquasars, indicating the acceleration of relativistic particles. Here we report detection of four microquasars (SS 433, V4641 Sgr, GRS 1915+105, MAXI J1820+070) of spectrum extending to the ultrahigh-energy (UHE; photon energy $E>100$ TeV) band and one microquasar (Cygnus X-1) of spectrum approaching 100 TeV, using the Large High Altitude Air Shower Observatory (LHAASO). Notably, the total emission associated with SS 433 cannot be interpreted with a single leptonic component. In the UHE band, its emission is in spatial coincidence with a giant atomic cloud, which is consistent with a hadronic origin. An elongated source is discovered from V4641 Sgr with the spectrum continuing up to 800 TeV. The detection of UHE gamma rays demonstrates that accreting BHs and their environments can operate as extremely efficient accelerators of particles out of 1 peta-electronvolt (PeV), suggesting microquasars to be important contributors to Galactic cosmic rays especially around the `knee' region.

preprint2022arXiv

A novel spectral method for the subdiffusion equation

In this paper, we design and analyze a novel spectral method for the subdiffusion equation. As it has been known, the solutions of this equation are usually singular near the initial time. Consequently, direct application of the traditional high-order numerical methods is inefficient. We try to overcome this difficulty in a novel approach by combining variable transformation techniques with spectral methods. The idea is to first use suitable variable transformation to re-scale the underlying equation, then construct spectral methods for the re-scaled equation. We establish a new variational framework based on the $ψ$-fractional Sobolev spaces. This allows us to prove the well-posedness of the associated variational problem. The proposed spectral method is based on the variational problem and generalized Jacobi polynomials to approximate the re-scaled fractional differential equation. Our theoretical and numerical investigation show that the proposed method is exponentially convergent for general right hand side functions, even though the exact solution has very limited regularity. Implementation details are also provided, along with a series of numerical examples to show the efficiency of the proposed method.

preprint2022arXiv

Fault Detection and Isolation of Uncertain Nonlinear Parabolic PDE Systems

This paper proposes a novel fault detection and isolation (FDI) scheme for distributed parameter systems modeled by a class of parabolic partial differential equations (PDEs) with nonlinear uncertain dynamics. A key feature of the proposed FDI scheme is its capability of dealing with the effects of system uncertainties for accurate FDI. Specifically, an approximate ordinary differential equation (ODE) system is first derived to capture the dominant dynamics of the original PDE system. An adaptive dynamics identification approach using radial basis function neural network is then proposed based on this ODE system, so as to achieve locally-accurate identification of the uncertain system dynamics under normal and faulty modes. A bank of FDI estimators with associated adaptive thresholds are finally designed for real-time FDI decision making. Rigorous analysis on the FDI performance in terms of fault detectability and isolatability is provided. Simulation study on a representative transport-reaction process is conducted to demonstrate the effectiveness and advantage of the proposed approach.

preprint2021arXiv

Deep Colormap Extraction from Visualizations

This work presents a new approach based on deep learning to automatically extract colormaps from visualizations. After summarizing colors in an input visualization image as a Lab color histogram, we pass the histogram to a pre-trained deep neural network, which learns to predict the colormap that produces the visualization. To train the network, we create a new dataset of 64K visualizations that cover a wide variety of data distributions, chart types, and colormaps. The network adopts an atrous spatial pyramid pooling module to capture color features at multiple scales in the input color histograms. We then classify the predicted colormap as discrete or continuous and refine the predicted colormap based on its color histogram. Quantitative comparisons to existing methods show the superior performance of our approach on both synthetic and real-world visualizations. We further demonstrate the utility of our method with two use cases,i.e., color transfer and color remapping.

preprint2021arXiv

The distance between the weights of the neural network is meaningful

In the application of neural networks, we need to select a suitable model based on the problem complexity and the dataset scale. To analyze the network's capacity, quantifying the information learned by the network is necessary. This paper proves that the distance between the neural network weights in different training stages can be used to estimate the information accumulated by the network in the training process directly. The experiment results verify the utility of this method. An application of this method related to the label corruption is shown at the end.

preprint2021arXiv

Topology Density Map for Urban Data Visualization and Analysis

Density map is an effective visualization technique for depicting the scalar field distribution in 2D space. Conventional methods for constructing density maps are mainly based on Euclidean distance, limiting their applicability in urban analysis that shall consider road network and urban traffic. In this work, we propose a new method named Topology Density Map, targeting for accurate and intuitive density maps in the context of urban environment. Based on the various constraints of road connections and traffic conditions, the method first constructs a directed acyclic graph (DAG) that propagates nonlinear scalar fields along 1D road networks. Next, the method extends the scalar fields to a 2D space by identifying key intersecting points in the DAG, dividing the underlying territory into planar regions using a weighted Voronoi diagram, and calculating the scalar fields for every point. Two case studies demonstrate that the Topology Density Map supplies accurate information to users and provides an intuitive visualization for decision making. An interview with domain experts demonstrates the feasibility, usability, and effectiveness of our method.

preprint2020arXiv

Composition and Configuration Patterns in Multiple-View Visualizations

Multiple-view visualization (MV) is a layout design technique often employed to help users see a large number of data attributes and values in a single cohesive representation. Because of its generalizability, the MV design has been widely adopted by the visualization community to help users examine and interact with large, complex, and high-dimensional data. However, although ubiquitous, there has been little work to categorize and analyze MVs in order to better understand its design space. As a result, there has been little to no guideline in how to use the MV design effectively. In this paper, we present an in-depth study of how MVs are designed in practice. We focus on two fundamental measures of multiple-view patterns: composition, which quantifies what view types and how many are there; and configuration, which characterizes spatial arrangement of view layouts in the display space. We build a new dataset containing 360 images of MVs collected from IEEE VIS, EuroVis, and PacificVis publications 2011 to 2019, and make fine-grained annotations of view types and layouts for these visualization images. From this data we conduct composition and configuration analyses using quantitative metrics of term frequency and layout topology. We identify common practices around MVs, including relationship of view types, popular view layouts, and correlation between view types and layouts. We combine the findings into a MV recommendation system, providing interactive tools to explore the design space, and support example-based design.

preprint2020arXiv

Cooperative Adaptive Learning Control for A Group of Nonholonomic UGVs by Output Feedback

A high-gain observer-based cooperative deterministic learning (CDL) control algorithm is proposed in this chapter for a group of identical unicycle-type unmanned ground vehicles (UGVs) to track over desired reference trajectories. For the vehicle states, the positions of the vehicles can be measured, while the velocities are estimated using the high-gain observer. For the trajectory tracking controller, the radial basis function (RBF) neural network (NN) is used to online estimate the unknown dynamics of the vehicle, and the NN weight convergence and estimation accuracy is guaranteed by CDL. The major challenge and novelty of this chapter is to track the reference trajectory using this observer-based CDL algorithm without the full knowledge of the vehicle state and vehicle model. In addition, any vehicle in the system is able to learn the knowledge of unmodeled dynamics along the union of trajectories experienced by all vehicle agents, such that the learned knowledge can be re-used to follow any reference trajectory defined in the learning phase. The learning-based tracking convergence and consensus learning results, as well as using learned knowledge for tracking experienced trajectories, are shown using the Lyapunov method. Simulation is given to show the effectiveness of this algorithm.

preprint2020arXiv

Exemplar-based Layout Fine-tuning for Node-link Diagrams

We design and evaluate a novel layout fine-tuning technique for node-link diagrams that facilitates exemplar-based adjustment of a group of substructures in batching mode. The key idea is to transfer user modifications on a local substructure to other substructures in the whole graph that are topologically similar to the exemplar. We first precompute a canonical representation for each substructure with node embedding techniques and then use it for on-the-fly substructure retrieval. We design and develop a light-weight interactive system to enable intuitive adjustment, modification transfer, and visual graph exploration. We also report some results of quantitative comparisons, three case studies, and a within-participant user study.

preprint2020arXiv

On the Injection of Relativistic Electrons in the Jet of 3C 279

The acceleration of electrons in 3C 279 is investigated through analyzing the injected electron energy distribution (EED) in a time-dependent synchrotron self-Compton + external Compton emission model. In this model, it is assumed that relativistic electrons are continuously injected into the emission region, and the injected EED [$Q_e^\prime(γ^\prime)$] follows a single power-law form with low- and high-energy cutoffs $\rm γ_{min}'$ and $\rm γ_{max}'$, respectively, and the spectral index $n$, i.e, $Q_e^\prime(γ^\prime)\proptoγ^{\prime-n}$. This model is applied to 14 quasi-simultaneous spectral energy distributions (SEDs) of 3C 279. The Markov Chain Monte Carlo fitting technique is performed to obtain the best-fitting parameters and the uncertainties on the parameters. The results show that the injected EED is well constrained in each state. The value of $n$ is in the range of 2.5 to 3.8, which is larger than that expected by the classic non-relativistic shock acceleration. However, the large value of $n$ can be explained by the relativistic oblique shock acceleration. The flaring activity seems to be related to an increased acceleration efficiency, reflected in an increased $γ'_{\rm min}$ and electron injection power.

preprint2020arXiv

Revisiting the Modifiable Areal Unit Problem in Deep Traffic Prediction with Visual Analytics

Deep learning methods are being increasingly used for urban traffic prediction where spatiotemporal traffic data is aggregated into sequentially organized matrices that are then fed into convolution-based residual neural networks. However, the widely known modifiable areal unit problem within such aggregation processes can lead to perturbations in the network inputs. This issue can significantly destabilize the feature embeddings and the predictions, rendering deep networks much less useful for the experts. This paper approaches this challenge by leveraging unit visualization techniques that enable the investigation of many-to-many relationships between dynamically varied multi-scalar aggregations of urban traffic data and neural network predictions. Through regular exchanges with a domain expert, we design and develop a visual analytics solution that integrates 1) a Bivariate Map equipped with an advanced bivariate colormap to simultaneously depict input traffic and prediction errors across space, 2) a Morans I Scatterplot that provides local indicators of spatial association analysis, and 3) a Multi-scale Attribution View that arranges non-linear dot plots in a tree layout to promote model analysis and comparison across scales. We evaluate our approach through a series of case studies involving a real-world dataset of Shenzhen taxi trips, and through interviews with domain experts. We observe that geographical scale variations have important impact on prediction performances, and interactive visual exploration of dynamically varying inputs and outputs benefit experts in the development of deep traffic prediction models.

preprint2020arXiv

Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient

Deep Q-learning algorithms often suffer from poor gradient estimations with an excessive variance, resulting in unstable training and poor sampling efficiency. Stochastic variance-reduced gradient methods such as SVRG have been applied to reduce the estimation variance (Zhao et al. 2019). However, due to the online instance generation nature of reinforcement learning, directly applying SVRG to deep Q-learning is facing the problem of the inaccurate estimation of the anchor points, which dramatically limits the potentials of SVRG. To address this issue and inspired by the recursive gradient variance reduction algorithm SARAH (Nguyen et al. 2017), this paper proposes to introduce the recursive framework for updating the stochastic gradient estimates in deep Q-learning, achieving a novel algorithm called SRG-DQN. Unlike the SVRG-based algorithms, SRG-DQN designs a recursive update of the stochastic gradient estimate. The parameter update is along an accumulated direction using the past stochastic gradient information, and therefore can get rid of the estimation of the full gradients as the anchors. Additionally, SRG-DQN involves the Adam process for further accelerating the training process. Theoretical analysis and the experimental results on well-known reinforcement learning tasks demonstrate the efficiency and effectiveness of the proposed SRG-DQN algorithm.

preprint2016arXiv

Registration of Volumetric Prostate Scans using Curvature Flow

Radiological imaging of the prostate is becoming more popular among researchers and clinicians in searching for diseases, primarily cancer. Scans might be acquired with different equipment or at different times for prognosis monitoring, with patient movement between scans, resulting in multiple datasets that need to be registered. For these cases, we introduce a method for volumetric registration using curvature flow. Multiple prostate datasets are mapped to canonical solid spheres, which are in turn aligned and registered through the use of identified landmarks on or within the gland. Theoretical proof and experimental results show that our method produces homeomorphisms with feature constraints. We provide thorough validation of our method by registering prostate scans of the same patient in different orientations, from different days and using different modes of MRI. Our method also provides the foundation for a general group-wise registration using a standard reference, defined on the complex plane, for any input. In the present context, this can be used for registering as many scans as needed for a single patient or different patients on the basis of age, weight or even malignant and non-malignant attributes to study the differences in general population. Though we present this technique with a specific application to the prostate, it is generally applicable for volumetric registration problems.

preprint2014arXiv

The Unified Surface Ricci Flow

Ricci flow deforms the Riemannian metric proportionally to the curvature, such that the curvature evolves according to a heat diffusion process and eventually becomes constant everywhere. Ricci flow has demonstrated its great potential by solving various problems in many fields, which can be hardly handled by alternative methods so far. This work introduces the unified theoretic framework for discrete Surface Ricci Flow, including all common schemes: Thurston's Circle Packing, Tangential Circle Packing, Inversive Distance Circle Packing and Discrete Yamabe. Furthermore, this work also introduces a novel scheme, virtual radius circle packing, under the unified framework. This work gives explicit geometric interpretation to the discrete Ricci energy for all the schemes, and Hessian of the discrete Ricci energy for schemes with Euclidean back ground geometry. The unified frame work deepen our understanding to the the discrete surface Ricci flow theory, and inspired us to discover the new schemes, improved the flexibility and robustness of the algorithms, greatly simplified the implementation and improved the debugging efficiency. Experimental results shows the unified surface Ricci flow algorithms can handle general surfaces with different topologies, and is robust to meshes with different qualities, and effective for solving real problems.

preprint2014arXiv

Uncovering the information core in recommender systems

With the rapid growth of the Internet and overwhelming amount of information that people are confronted with, recommender systems have been developed to effiectively support users' decision-making process in online systems. So far, much attention has been paid to designing new recommendation algorithms and improving existent ones. However, few works considered the different contributions from different users to the performance of a recommender system. Such studies can help us improve the recommendation efficiency by excluding irrelevant users. In this paper, we argue that in each online system there exists a group of core users who carry most of the information for recommendation. With them, the recommender systems can already generate satisfactory recommendation. Our core user extraction method enables the recommender systems to achieve 90% of the accuracy by taking only 20% of the data into account.

preprint2013arXiv

Information filtering in sparse online systems: recommendation via semi-local diffusion

With the rapid growth of the Internet and overwhelming amount of information and choices that people are confronted with, recommender systems have been developed to effectively support users' decision-making process in the online systems. However, many recommendation algorithms suffer from the data sparsity problem, i.e. the user-object bipartite networks are so sparse that algorithms cannot accurately recommend objects for users. This data sparsity problem makes many well-known recommendation algorithms perform poorly. To solve the problem, we propose a recommendation algorithm based on the semi-local diffusion process on a user-object bipartite network. The numerical simulation on two sparse datasets, Amazon and Bookcross, show that our method significantly outperforms the state-of-the-art methods especially for those small-degree users. Two personalized semi-local diffusion methods are proposed which further improve the recommendation accuracy. Finally, our work indicates that sparse online systems are essentially different from the dense online systems, all the algorithms and conclusions based on dense data should be rechecked again in sparse data.

preprint2013arXiv

Membership in social networks and the application in information filtering

During the past a few years, users' membership in the online system (i.e. the social groups that online users joined) are wildly investigated. Most of these works focus on the detection, formulation and growth of online communities. In this paper, we study users' membership in a coupled system which contains user-group and user-object bipartite networks. By linking users' membership information and their object selection, we find that the users who have collected only a few objects are more likely to be "influenced" by the membership when choosing objects. Moreover, we observe that some users may join many online communities though they collected few objects. Based on these findings, we design a social diffusion recommendation algorithm which can effectively solve the user cold-start problem. Finally, we propose a personalized combination of our method and the hybrid method in [PNAS 107, 4511 (2010)], which leads to a further improvement in the overall recommendation performance.

preprint2012arXiv

Multi-Output Recommender: Items, Groups and Friends, and Their Mutual Contributing Effects

Due to the development of social media technology, it becomes easier for users to gather together to form groups. Take the Last.fm for example, users can join groups they may be interested where they can share their loved songs and discuss topics about songs and singers. However, the number of groups grows over time, users need effective groups recommendations in order to meet more like-minded users.

preprint2010arXiv

Discrete Laplace-Beltrami Operator Determines Discrete Riemannian Metric

The Laplace-Beltrami operator of a smooth Riemannian manifold is determined by the Riemannian metric. Conversely, the heat kernel constructed from its eigenvalues and eigenfunctions determines the Riemannian metric. This work proves the analogy on Euclidean polyhedral surfaces (triangle meshes), that the discrete Laplace-Beltrami operator and the discrete Riemannian metric (unique up to a scaling) are mutually determined by each other. Given an Euclidean polyhedral surface, its Riemannian metric is represented as edge lengths, satisfying triangle inequalities on all faces. The Laplace-Beltrami operator is formulated using the cotangent formula, where the edge weight is defined as the sum of the cotangent of angles against the edge. We prove that the edge lengths can be determined by the edge weights unique up to a scaling using the variational approach. First, we show that the space of all possible metrics of a polyhedral surface is convex. Then, we construct a special energy defined on the metric space, such that the gradient of the energy equals to the edge weights. Third, we show the Hessian matrix of the energy is positive definite, restricted on the tangent space of the metric space, therefore the energy is convex. Finally, by the fact that the parameter on a convex domain and the gradient of a convex function defined on the domain have one-to-one correspondence, we show the edge weights determines the polyhedral metric unique up to a scaling. The constructive proof leads to a computational algorithm that finds the unique metric on a topological triangle mesh from a discrete Laplace-Beltrami operator matrix.

Wei Zeng

What is connected

Connect this record

See the researcher in context

Building this map preview

21 published item(s)

nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning

Uncertainty-Guided Dual-Domain Learning for Reliable Skin Lesion Segmentation

Ultrahigh-Energy Gamma-ray Emission Associated with Black Hole-Jet Systems

A novel spectral method for the subdiffusion equation

Fault Detection and Isolation of Uncertain Nonlinear Parabolic PDE Systems

Deep Colormap Extraction from Visualizations

The distance between the weights of the neural network is meaningful

Topology Density Map for Urban Data Visualization and Analysis

Composition and Configuration Patterns in Multiple-View Visualizations

Cooperative Adaptive Learning Control for A Group of Nonholonomic UGVs by Output Feedback

Exemplar-based Layout Fine-tuning for Node-link Diagrams

On the Injection of Relativistic Electrons in the Jet of 3C 279

Revisiting the Modifiable Areal Unit Problem in Deep Traffic Prediction with Visual Analytics

Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient

Registration of Volumetric Prostate Scans using Curvature Flow

The Unified Surface Ricci Flow

Uncovering the information core in recommender systems

Information filtering in sparse online systems: recommendation via semi-local diffusion

Membership in social networks and the application in information filtering

Multi-Output Recommender: Items, Groups and Friends, and Their Mutual Contributing Effects

Discrete Laplace-Beltrami Operator Determines Discrete Riemannian Metric