Source author record

Dan Zhang

Dan Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

48 works

31 topics

4 close collaborators


preprint 2026 arXiv

Do Agents Need to Plan Step-by-Step? Rethinking Planning Horizon in Data-Centric Tool Calling

Explicit planning is a critical capability for LLM-based agents solving complex data-centric tasks, which require precise tool calling over external data sources. Existing strategies fall into two paradigms based on planning horizon: (1) full-horizon (FH), which generates a complete plan before execution, and (2) single-step horizon (SH), which interleaves each action (tool call) with incremental reasoning and observation. While step-by-step execution is a common default under the assumption that eager execution monitoring is necessary for adaptability, we revisit this assumption for well-defined data-centric tasks. Our controlled empirical study isolates planning horizon as the key architectural feature and systematically analyzes the effects of topological complexity and tool robustness on both paradigms. Our experiments across Knowledge Base Question Answering and Multi-hop QA show that FH planning with lazy replanning achieves accuracy parity with SH across varying depths, breadths, and robustness levels, while using 2-3x fewer tokens. These findings suggest that for well-defined data-centric tasks, eager step-wise monitoring is often unnecessary, and full-horizon planning with on-demand replanning can offer a more efficient default.

preprint 2026 arXiv

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

We present GLM-5V-Turbo, a step toward native foundation models for multimodal agents. As foundation models are increasingly deployed in real environments, agentic capability depends not only on language reasoning, but also on the ability to perceive, interpret, and act over heterogeneous contexts such as images, videos, webpages, documents, and GUIs. GLM-5V-Turbo is built around this objective: multimodal perception is integrated as a core component of reasoning, planning, tool use, and execution, rather than as an auxiliary interface to a language model. This report summarizes the main improvements behind GLM-5V-Turbo across model design, multimodal training, reinforcement learning, toolchain expansion, and integration with agent frameworks. These developments lead to strong performance in multimodal coding, visual tool use, and framework-based agentic tasks, while preserving competitive text-only coding capability. More importantly, our development process offers practical insights for building multimodal agents, highlighting the central role of multimodal perception, hierarchical optimization, and reliable end-to-end verification.

preprint 2026 arXiv

MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free

Extending the input modality of Large Language Models (LLMs) to the audio domain is essential for achieving comprehensive multimodal perception. However, it is well-known that acoustic information is intrinsically heterogeneous, entangling attributes such as speech, music, and environmental context. Existing research is limited to a dense, parameter-shared adapter to model these diverse patterns, which induces gradient conflict during optimization, as parameter updates required for distinct attributes contradict each other. To address this limitation, we introduce the MoE-Adapter, a sparse Mixture-of-Experts (MoE) architecture designed to decouple acoustic information. Specifically, it employs a dynamic gating mechanism that routes audio tokens to specialized experts capturing complementary feature subspaces while retaining shared experts for global context, thereby mitigating gradient conflicts and enabling fine-grained feature learning. Comprehensive experiments show that the MoE-Adapter achieves superior performance on both audio semantic and paralinguistic tasks, consistently outperforming dense linear baselines with comparable computational costs. Furthermore, we will release the related code and models to facilitate future research.
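
The routing idea in this abstract (a gate picks a few specialized experts per token, while shared experts always run) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the top-k choice, the softmax gate, and the way outputs are summed are all assumptions made for illustration.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_token(token, gate_weights, routed_experts, shared_experts, top_k=2):
    """Hypothetical sparse routing for one token (illustrative only).

    token: list of floats.
    gate_weights: one weight row per routed expert (assumed linear gate).
    routed_experts / shared_experts: callables mapping a vector to a vector
    of the same length.
    """
    # One gate logit per routed expert.
    logits = [sum(w * x for w, x in zip(row, token)) for row in gate_weights]
    probs = softmax(logits)
    # Keep only the top_k experts for this token (the sparse part).
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    out = [0.0] * len(token)
    for i in chosen:
        y = routed_experts[i](token)
        out = [o + (probs[i] / norm) * v for o, v in zip(out, y)]
    # Shared experts see every token (assumed to be added unweighted).
    for expert in shared_experts:
        y = expert(token)
        out = [o + v for o, v in zip(out, y)]
    return out, chosen

def scale(c):
    # Toy expert: multiplies every feature by a constant.
    return lambda t: [c * v for v in t]

output, chosen_experts = route_token(
    [1.0, 0.0],
    gate_weights=[[2.0, 0.0], [0.0, 2.0], [1.0, 0.0]],
    routed_experts=[scale(1.0), scale(2.0), scale(3.0)],
    shared_experts=[scale(1.0)],
)
```

Because only the selected experts receive a given token, their parameters are updated by different subsets of the data, which is the intuition behind reducing gradient conflict; the sketch shows the forward pass only.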

preprint 2026 arXiv

Rubric-based On-policy Distillation

On-policy distillation (OPD) is a powerful paradigm for model alignment, yet its reliance on teacher logits restricts its application to white-box scenarios. We contend that structured semantic rubrics can serve as a scalable alternative to teacher logits, enabling OPD using only teacher-generated responses. To demonstrate this, we introduce ROPD, a simple yet foundational framework for rubric-based OPD. Specifically, ROPD induces prompt-specific rubrics from teacher-student contrasts, and then utilizes these rubrics to score the student rollouts for on-policy optimization. Empirically, ROPD outperforms the advanced logit-based OPD methods across most scenarios, achieving up to a 10x gain in sample efficiency. These results position rubric-based OPD as a flexible, black-box-compatible alternative to the prevailing logit-based OPD, offering a simple yet strong baseline for scalable distillation across proprietary and open-source LLMs. Code is available at https://github.com/Peregrine123/ROPD_official.

preprint 2026 arXiv

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Visual generative models have achieved remarkable progress in synthesizing photorealistic images and videos, yet aligning their outputs with human preferences across critical dimensions remains a persistent challenge. Though reinforcement learning from human feedback offers promise for preference alignment, existing reward models for visual generation face limitations, including black-box scoring without interpretability and the unexpected biases that can result. We present VisionReward, a general framework for learning human visual preferences in both image and video generation. Specifically, we employ a hierarchical visual assessment framework to capture fine-grained human preferences, and leverage linear weighting to enable interpretable preference learning. Furthermore, we propose a multi-dimensional consistent strategy when using VisionReward as a reward model during preference optimization for visual generation. Experiments show that VisionReward can significantly outperform existing image and video reward models on both machine metrics and human evaluation. Notably, VisionReward surpasses VideoScore by 17.2% in preference prediction accuracy, and text-to-video models with VisionReward achieve a 31.6% higher pairwise win rate compared to the same models using VideoScore. All code and datasets are provided at https://github.com/THUDM/VisionReward.
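
As a rough illustration of why linear weighting keeps a reward interpretable, the sketch below scores an output as a weighted sum of yes/no judgments and returns the per-question contributions. The question names, the weights, and the 1/0 encoding of answers are hypothetical and are not taken from VisionReward itself.

```python
def score_with_breakdown(answers, weights):
    """Weighted sum of binary judgments (illustrative assumption).

    answers: dict mapping question name -> bool (True means "yes").
    weights: dict mapping question name -> float.
    Returns the total score and each question's contribution to it.
    """
    contributions = {
        q: weights[q] * (1.0 if answers[q] else 0.0) for q in weights
    }
    return sum(contributions.values()), contributions

def prefers_first(answers_a, answers_b, weights):
    # Pairwise preference: A is preferred when its linear score is higher.
    score_a, _ = score_with_breakdown(answers_a, weights)
    score_b, _ = score_with_breakdown(answers_b, weights)
    return score_a > score_b

# Hypothetical checklist and weights, chosen only for the example.
weights = {"sharp": 0.5, "artifact_free": 1.5, "matches_prompt": 2.0}
image_a = {"sharp": True, "artifact_free": True, "matches_prompt": False}
image_b = {"sharp": True, "artifact_free": False, "matches_prompt": True}

total_a, parts_a = score_with_breakdown(image_a, weights)
total_b, parts_b = score_with_breakdown(image_b, weights)
a_wins = prefers_first(image_a, image_b, weights)
```

Since the score is a plain sum, each entry in the breakdown states how much a single judgment moved the final preference, which is what a black-box scalar reward cannot show.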

preprint 2024 arXiv

Quasinormal modes of quantum-corrected black holes

In this paper, we investigate the quasinormal mode (QNM) spectra for scalar perturbation over a quantum-corrected black hole (BH). The fundamental modes of this quantum-corrected BH exhibit two key properties. Firstly, there is a non-monotonic behavior concerning the quantum-corrected parameter for zero multipole number. Secondly, the quantum gravity effects result in slower decay modes. For higher overtones, a significant deviation becomes evident between the quasinormal frequencies (QNFs) of the quantum-corrected and Schwarzschild BHs. The intervention of quantum gravity corrections induces a significant outburst of overtones. This outburst of these overtones can be attributed to the distinctions near the event horizons between the Schwarzschild and quantum-corrected BHs. Therefore, overtones can serve as a means to probe physical phenomena or disparities in the vicinity of the event horizon.

preprint 2023 arXiv

A First Search for Solar $^8$B Neutrino in the PandaX-4T Experiment using Neutrino-Nucleus Coherent Scattering

A search for interactions from solar $^8$B neutrinos elastically scattering off xenon nuclei using PandaX-4T commissioning data is reported. The energy threshold of this search is further lowered compared with the previous search for dark matter, with various techniques utilized to suppress the background that emerges from data with the lowered threshold. A blind analysis is performed on the data with an effective exposure of 0.48 tonne$\cdot$year, and no significant excess of events is observed. Among results obtained using the neutrino-nucleus coherent scattering, our results give the best constraint on the solar $^8$B neutrino flux. We further provide a more stringent limit on the cross section between dark matter and nucleon in the mass range from 3 to 9 GeV/c$^2$.

preprint 2023 arXiv

MEGAnno: Exploratory Labeling for NLP in Computational Notebooks

We present MEGAnno, a novel exploratory annotation framework designed for NLP researchers and practitioners. Unlike existing labeling tools that focus on data labeling only, our framework aims to support a broader, iterative ML workflow including data exploration and model development. With MEGAnno's API, users can programmatically explore the data through sophisticated search and automated suggestion functions and incrementally update the task schema as their project evolves. Combined with our widget, users can interactively sort, filter, and assign labels to multiple items simultaneously in the same notebook where the rest of the NLP project resides. We demonstrate MEGAnno's flexible, exploratory, efficient, and seamless labeling experience through a sentiment analysis use case.

preprint 2023 arXiv

Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning

Dataset discovery from data lakes is essential in many real application scenarios. In this paper, we propose Starmie, an end-to-end framework for dataset discovery from data lakes (with table union search as the main use case). Our proposed framework features a contrastive learning method to train column encoders from pre-trained language models in a fully unsupervised manner. The column encoder of Starmie captures the rich contextual semantic information within tables by leveraging a contrastive multi-column pre-training strategy. We utilize the cosine similarity between column embedding vectors as the column unionability score and propose a filter-and-verification framework that allows exploring a variety of design choices to compute the unionability score between two tables accordingly. Empirical evaluation results on real table benchmark datasets show that Starmie outperforms the best-known solutions in the effectiveness of table union search by 6.8 in MAP and recall. Moreover, Starmie is the first to employ the HNSW (Hierarchical Navigable Small World) index to accelerate query processing of table union search, which provides a 3,000X performance gain over the linear scan baseline and a 400X performance gain over an LSH index (the state-of-the-art solution for data lake indexing).
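
The column-level score described here is plain cosine similarity between embedding vectors. The sketch below shows that computation and one simple way to lift it to a table-level score. The aggregation (averaging each query column's best match) and the brute-force ranking are assumptions for illustration; the abstract leaves the table-level scoring open to several design choices and uses an HNSW index rather than a linear scan.

```python
import math

def cosine(u, v):
    # Cosine similarity between two equal-length vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

def table_unionability(query_cols, candidate_cols):
    """Assumed aggregation: mean over query columns of the best-matching
    candidate column's cosine similarity."""
    if not query_cols or not candidate_cols:
        return 0.0
    best = [max(cosine(q, c) for c in candidate_cols) for q in query_cols]
    return sum(best) / len(best)

def rank_tables(query_cols, lake, k):
    # Linear scan over the lake; returns the k highest-scoring table names.
    scored = sorted(
        ((table_unionability(query_cols, cols), name) for name, cols in lake.items()),
        reverse=True,
    )
    return [name for _, name in scored[:k]]

# Toy 2-dimensional "column embeddings" standing in for encoder outputs.
query = [[1.0, 0.0], [0.0, 1.0]]
lake = {
    "a": [[1.0, 0.0], [0.0, 1.0]],
    "b": [[1.0, 1.0]],
    "c": [[-1.0, 0.0]],
}
top_two = rank_tables(query, lake, k=2)
```

In a real system the vectors would come from the trained column encoder, and an approximate nearest-neighbor index would replace the linear scan in `rank_tables`.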

preprint 2023 arXiv

Towards Multifaceted Human-Centered AI

Human-centered AI workflows involve stakeholders with multiple roles interacting with each other and automated agents to accomplish diverse tasks. In this paper, we call for a holistic view when designing support mechanisms, such as interaction paradigms, interfaces, and systems, for these multifaceted workflows.

preprint 2022 arXiv

$\rm ^{83}Rb$/$\rm ^{83m}Kr$ production and cross-section measurement with 3.4 MeV and 20 MeV proton beams

$\rm ^{83m}Kr$, with a short lifetime, is an ideal calibration source for liquid xenon or liquid argon detectors. The $\rm ^{83m}Kr$ isomer can be generated through the decay of $\rm ^{83} Rb$ isotope which is usually produced by proton beams bombarding natural krypton atoms. In this paper, we report a successful production of $\rm ^{83}Rb/^{83m}Kr$ with a proton beam energy of 3.4 MeV, and the first measurement of the production rate with such low energy proton beams. Another production attempt is performed using the newly available 20 MeV proton beam in China, and the measured production rate is consistent with previous measurements. The produced $\rm ^{83m}Kr$ source has been successfully injected into the PandaX-II liquid xenon detector, yielding enough statistics for detector calibration.

preprint 2022 arXiv

A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators

The rapidly-changing deep learning landscape presents a unique opportunity for building inference accelerators optimized for specific datacenter-scale workloads. We propose Full-stack Accelerator Search Technique (FAST), a hardware accelerator search framework that defines a broad optimization environment covering key design decisions within the hardware-software stack, including hardware datapath, software scheduling, and compiler passes such as operation fusion and tensor padding. In this paper, we analyze bottlenecks in state-of-the-art vision and natural language processing (NLP) models, including EfficientNet and BERT, and use FAST to design accelerators capable of addressing these bottlenecks. FAST-generated accelerators optimized for single workloads improve Perf/TDP by 3.7x on average across all benchmarks compared to TPU-v3. A FAST-generated accelerator optimized for serving a suite of workloads improves Perf/TDP by 2.4x on average compared to TPU-v3. Our return on investment analysis shows that FAST-generated accelerators can potentially be practical for moderate-sized datacenter deployments.

preprint 2022 arXiv

A novel holographic quantum phase transition and butterfly velocity

In this paper, we carry out a systematic and in-depth exploration of the phase structure and the behavior of the butterfly velocity in an Einstein-Maxwell-dilaton-axions (EMDA) model. Depending on the model parameter, there are two kinds of mechanisms driving the quantum phase transition (QPT) in this model. One is that the infrared (IR) geometry becomes renormalization group (RG) unstable, and the other is that the strength of lattice deformation leads to some kind of bifurcating solution. We also find a novel QPT in the metal phases. The study of the behavior of the butterfly velocity across the QPT indicates that whether the butterfly velocity or its first derivative exhibits a local extremum depends on the QPT mechanism. Further, the scaling behaviors of the butterfly velocity in the zero-temperature limit confirm that different phases are controlled by different IR geometries. Therefore, the butterfly velocity is a good probe of QPT and it also provides a possible way to study QPT beyond holography.

preprint 2022 arXiv

A Search for the Cosmic Ray Boosted Sub-GeV Dark Matter at the PandaX-II Experiment

We report a novel search for cosmic ray boosted dark matter using the 100 tonne$\cdot$day full data set of the PandaX-II detector located at the China Jinping Underground Laboratory. With the extra energy gained from the cosmic rays, sub-GeV dark matter particles can produce visible recoil signals in the detector. The diurnal modulations in rate and energy spectrum are utilized to further enhance the signal sensitivity. Our result excludes the dark matter-nucleon elastic scattering cross section between 10$^{-31}$ cm$^{2}$ and 10$^{-28}$ cm$^{2}$ for dark matter masses from 0.1 MeV/$c^2$ to 0.1 GeV/$c^2$, with a large parameter space previously unexplored by experimental collaborations.

preprint 2022 arXiv

A search for two-component Majorana dark matter in a simplified model using the full exposure data of PandaX-II experiment

In the two-component Majorana dark matter model, one dark matter particle can scatter off the target nuclei and turn into a slightly heavier component. In the framework of a simplified model with a vector boson mediator, both the tree-level and loop-level processes contribute to the signal in direct detection experiments. In this paper, we report the search results for such dark matter from the PandaX-II experiment, using the total data of the full 100.7 tonne$\cdot$day exposure. No significant excess is observed, so strong constraints on the combined parameter space of mediator mass and dark matter mass are derived. With the complementary search results from collider experiments, a large range of parameter space can be excluded.

preprint 2022 arXiv

Annotating Columns with Pre-trained Language Models

Inferring meta information about tables, such as column headers or relationships between columns, is an active research topic in data management as we find many tables are missing some of this information. In this paper, we study the problem of annotating table columns (i.e., predicting column types and the relationships between columns) using only information from the table itself. We develop a multi-task learning framework (called Doduo) based on pre-trained language models, which takes the entire table as input and predicts column types/relations using a single model. Experimental results show that Doduo establishes new state-of-the-art performance on two benchmarks for the column type prediction and column relation prediction tasks with up to 4.0% and 11.9% improvements, respectively. We report that Doduo can already outperform the previous state-of-the-art performance with a minimal number of tokens, only 8 tokens per column. We release a toolbox (https://github.com/megagonlabs/doduo) and confirm the effectiveness of Doduo on a real-world data science problem through a case study.

preprint 2022 arXiv

Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition

EEG signals have been reported to be informative and reliable for emotion recognition in recent years. However, the inter-subject variability of emotion-related EEG signals still poses a great challenge for the practical applications of EEG-based emotion recognition. Inspired by recent neuroscience studies on inter-subject correlation, we proposed a Contrastive Learning method for Inter-Subject Alignment (CLISA) to tackle the cross-subject emotion recognition problem. Contrastive learning was employed to minimize the inter-subject differences by maximizing the similarity in EEG signal representations across subjects when they received the same emotional stimuli in contrast to different ones. Specifically, a convolutional neural network was applied to learn inter-subject aligned spatiotemporal representations from EEG time series in contrastive learning. The aligned representations were subsequently used to extract differential entropy features for emotion classification. CLISA achieved state-of-the-art cross-subject emotion recognition performance on our THU-EP dataset with 80 subjects and the publicly available SEED dataset with 15 subjects. It could generalize to unseen subjects or unseen emotional stimuli in testing. Furthermore, the spatiotemporal representations learned by CLISA could provide insights into the neural mechanisms of human emotion processing.

preprint 2022 arXiv

Impact of Economic Constraints on the Projected Timeframe for Human-Crewed Deep Space Exploration

Deep space exploration offers the most profound opportunity for the expansion of humanity and our understanding of the Universe, but remains extremely challenging. Progress will continue to be paced by uncrewed missions followed up by crewed missions to ever further destinations. Major space powers continue to invest in crewed deep space exploration as an important national strategy. An improved model based on previous work is developed, which projects the earliest possible launch dates for human-crewed missions from cis-lunar space to selected destinations in the Solar System and beyond based on NASA's historic budget trend and overall development trends of deep space exploration research. The purpose of the analysis is to provide a projected timeframe for crewed missions beyond Mars. Our findings suggest the first human missions from a spacefaring nation or international collaboration to the Asteroid Belt and Jovian System could be scheduled as soon as ~2071 to ~2087 and ~2101 to ~2121, respectively, while a launch to the Saturn System may occur by the year ~2132, with an uncertainty window of ~2129 to ~2153.

preprint 2022 arXiv

Low Radioactive Material Screening and Background Control for the PandaX-4T Experiment

PandaX-4T is a ton-scale dark matter direct detection experiment using a dual-phase TPC technique at the China Jinping Underground Laboratory. Various ultra-low background technologies have been developed and applied to material screening for PandaX-4T, including HPGe gamma spectroscopy, ICP-MS, NAA, radon emanation measurement system, krypton assay station, and alpha detection system. Low background materials were selected to assemble the detector. Surface treatment procedures were investigated to further suppress radioactive background. Combining measured results and Monte Carlo simulation, the total material background rates of PandaX-4T in the energy region of 1-25 keV$\rm{}_{ee}$ are estimated to be (9.9 $\pm$ 1.9) $\times \ 10^{-3}$ mDRU for electron recoil and (2.8 $\pm$ 0.6) $\times \ 10^{-4}$ mDRU for nuclear recoil. In addition, $^{nat}$Kr in the detector is estimated to be <8 ppt.

preprint 2022 arXiv

Neutron-induced nuclear recoil background in the PandaX-4T experiment

Neutron-induced nuclear recoil background is critical to the dark matter searches in the PandaX-4T liquid xenon experiment. This paper studies the features of the neutron background in liquid xenon and evaluates its contribution to the single scattering nuclear recoil events through three methods. The first method is fully Monte Carlo simulation based. The last two are data-driven methods that also use the multiple scattering signals and high energy signals in the data, respectively. In the PandaX-4T commissioning data with an exposure of 0.63 tonne-year, all these methods give a consistent result that there are $1.15\pm0.57$ neutron-induced background events in the dark matter signal region within an approximated nuclear recoil energy window between 5 and 100 keV.

preprint 2022 arXiv

Retinal Structure Detection in OCTA Image via Voting-based Multi-task Learning

Automated detection of retinal structures, such as retinal vessels (RV), the foveal avascular zone (FAZ), and retinal vascular junctions (RVJ), is of great importance for understanding diseases of the eye and clinical decision-making. In this paper, we propose a novel Voting-based Adaptive Feature Fusion multi-task network (VAFF-Net) for joint segmentation, detection, and classification of RV, FAZ, and RVJ in optical coherence tomography angiography (OCTA). A task-specific voting gate module is proposed to adaptively extract and fuse different features for specific tasks at two levels: features at different spatial positions from a single encoder, and features from multiple encoders. In particular, since the complexity of the microvasculature in OCTA images makes simultaneous precise localization and classification of retinal vascular junctions into bifurcation/crossing a challenging task, we specifically design a task head by combining the heatmap regression and grid classification. We take advantage of three different en face angiograms from various retinal layers, rather than following existing methods that use only a single en face angiogram. To facilitate further research, part of these datasets with the source code and evaluation benchmark have been released for public access: https://github.com/iMED-Lab/VAFF-Net.

preprint 2022 arXiv

Study of background from accidental coincidence signals in the PandaX-II experiment

The PandaX-II experiment employed a 580 kg liquid xenon detector to search for the interactions between dark matter particles and the target xenon atoms. The accidental coincidences of isolated signals result in a dangerous background that mimics the signature of dark matter. We performed a detailed study of the accidental coincidence background in PandaX-II, including the possible origin of the isolated signals, the background level, and the corresponding background suppression method. With a boosted-decision-tree algorithm, the accidental coincidence background is reduced by 70% in the dark matter signal region, and thus the sensitivity of the dark matter search at PandaX-II is improved.

preprint 2021 arXiv

Dark Matter Search Results from the PandaX-4T Commissioning Run

We report the first dark matter search results using the commissioning data from PandaX-4T. Using a time projection chamber with a 3.7-tonne liquid xenon target and an exposure of 0.63 tonne$\cdot$year, 1058 candidate events are identified within an approximate nuclear recoil energy window between 5 and 100 keV. No significant excess over background is observed. Our data set a stringent limit on the dark matter-nucleon spin-independent interactions, with a lowest excluded cross section (90% C.L.) of $3.8\times10^{-47}$ cm$^2$ at a dark matter mass of 30 GeV/$c^2$.

preprint 2021 arXiv

Internal Calibration of the PandaX-II Detector with Radon Gaseous Sources

We have developed a low-energy electron recoil (ER) calibration method with $^{220}$Rn for the PandaX-II detector. $^{220}$Rn, emanated from natural thorium compounds, was fed into the detector through the xenon purification system. From 2017 to 2019, we performed three dedicated calibration campaigns with different radon sources. We studied the detector response to $α$, $β$, and $γ$ particles with a focus on low energy ER events. During the runs in 2017 and 2018, the amount of radioactivity of $^{222}$Rn was on the order of 1% of that of $^{220}$Rn and thorium particulate contamination was negligible, especially in 2018. We also measured the background contribution from $^{214}$Pb for the first time in PandaX-II with the help of a $^{222}$Rn injection. The calibration strategy with $^{220}$Rn and $^{222}$Rn will be implemented in the upcoming PandaX-4T experiment and can be useful for other xenon-based detectors as well.

preprint 2021 arXiv

Light yield and field dependence measurement in PandaX-II dual-phase xenon detector

The dual-phase xenon time projection chamber (TPC) is one of the most sensitive detector technologies for dark matter direct search, where the energy deposition of an incoming particle can be converted into photons and electrons through xenon excitation and ionization. The detector response to signal energy deposition varies significantly with the electric field in liquid xenon. We study the detector's light yield and its dependence on the electric field in the PandaX-II dual-phase detector containing 580 kg liquid xenon in the sensitive volume. From our measurements, the light yield at electric fields from 0 V/cm to 317 V/cm is obtained for energy depositions up to 236 keV.

preprint 2021 arXiv

MITNet: GAN Enhanced Magnetic Induction Tomography Based on Complex CNN

Magnetic induction tomography (MIT) is an efficient solution for long-term brain disease monitoring, which focuses on reconstructing bio-impedance distribution inside the human brain using non-intrusive electromagnetic fields. However, high-quality brain image reconstruction remains challenging since reconstructing images from the measured weak signals is a highly non-linear and ill-conditioned problem. In this work, we propose a generative adversarial network (GAN) enhanced MIT technique, named MITNet, based on a complex convolutional neural network (CNN). The experimental results on the real-world dataset validate the performance of our technique, which outperforms the state-of-the-art method by 25.27%.

preprint 2021 arXiv

On-manifold Adversarial Data Augmentation Improves Uncertainty Calibration

Uncertainty estimates help to identify ambiguous, novel, or anomalous inputs, but the reliable quantification of uncertainty has proven to be challenging for modern deep networks. In order to improve uncertainty estimation, we propose On-Manifold Adversarial Data Augmentation or OMADA, which specifically attempts to generate the most challenging examples by following an on-manifold adversarial attack path in the latent space of an autoencoder-based generative model that closely approximates decision boundaries between two or more classes. On a variety of datasets as well as on multiple diverse network architectures, OMADA consistently yields more accurate and better calibrated classifiers than baseline models, and outperforms competing approaches such as Mixup, as well as achieving similar performance to (at times better than) post-processing calibration methods such as temperature scaling. Variants of OMADA can employ different sampling schemes for ambiguous on-manifold examples based on the entropy of their estimated soft labels, which exhibit specific strengths for generalization, calibration of predicted uncertainty, or detection of out-of-distribution inputs.

preprint 2021 arXiv

Results of Dark Matter Search using the Full PandaX-II Exposure

We report the dark matter search results obtained using the full 132 ton$\cdot$day exposure of the PandaX-II experiment, including all data from March 2016 to August 2018. No significant excess of events is identified above the expected background. Upper limits are set on the spin-independent dark matter-nucleon interactions. The lowest 90% confidence level exclusion on the spin-independent cross section is $2.2\times 10^{-46}$ cm$^2$ at a WIMP mass of 30 GeV/$c^2$.

preprint 2021 arXiv

Unifying Message Passing Algorithms Under the Framework of Constrained Bethe Free Energy Minimization

Variational message passing (VMP), belief propagation (BP) and expectation propagation (EP) have found wide application in complex statistical signal processing problems. In addition to viewing them as a class of algorithms operating on graphical models, this paper unifies them under an optimization framework, namely, Bethe free energy minimization with differently and appropriately imposed constraints. This new perspective in terms of constraint manipulation can offer additional insights on the connection between different message passing algorithms and is valid for a generic statistical model. It also establishes a theoretical framework to systematically derive message passing variants. Taking the sparse signal recovery (SSR) problem as an example, a low-complexity EP variant can be obtained by simple constraint reformulation, delivering better estimation performance with lower complexity than the standard EP algorithm. Furthermore, we can resort to the framework for the systematic derivation of hybrid message passing for complex inference tasks. Notably, a hybrid message passing algorithm is exemplarily derived for joint SSR and statistical model learning with near-optimal inference performance and scalable complexity.

preprint 2020 arXiv

Analytical study of the holographic superconductor from higher derivative theory

In this paper, we analytically study the holographic superconductor models with the high derivative (HD) coupling terms. Using the Sturm-Liouville (SL) eigenvalue method, we perturbatively calculate the critical temperature. The analytical results are in good agreement with the numerical results. It confirms that the perturbative method in terms of the HD coupling parameters is available. Along the same line, we analytically calculate the value of the condensation near the critical temperature. We find that the phase transition is second order with mean field behavior, which is independent of the HD coupling parameters. Then in the low temperature limit, we also calculate the conductivity, which is qualitatively consistent with the numerical one. We find that the superconducting energy gap is proportional to the value of the condensation. But we note that since the condensation changes with the HD coupling parameters, as the function of the HD coupling parameters, the superconducting energy gap follows the same change trend as that of the condensation.

preprint2020arXiv

Chaotic dynamics of string around charged black brane with hyperscaling violation

Using the fast Lyapunov indicator (FLI), we study the chaotic dynamics of a closed string around a charged black brane with hyperscaling violation (HV). The Hawking temperature, Lifshitz dynamical exponent and HV exponent together affect the chaotic dynamics of this system. The temperature plays the role of driving the closed string to escape to infinity. There is a threshold value $z_{\ast}=2$, below which the string is captured by the black brane no matter where the string is initially placed. However, when $z>2$, the string escapes to infinity if it is initially placed near the black brane, but if the initial position of the string is far away from the black brane, it oscillates around the black brane indefinitely, which is a quasi-periodic motion. The HV exponent plays the role of driving the string to fall into the black brane. As the HV exponent $θ$ increases, the string falls faster. We find that when we heat the system with a large HV exponent, the chaotic system does not essentially change. This indicates that the HV exponent plays a very important role in determining the state of the chaotic system. We also study the effect of the winding number of the string. The study indicates that the chaotic dynamics of the string is insensitive to the winding number.

preprint2020arXiv

Doped holographic fermionic system

We construct a two-current model. It includes two gauge fields, which introduce the doping effect, and a neutral scalar field. We then numerically construct an AdS black brane geometry with scalar hair. Over this background, we study the fermionic system with the pseudoscalar Yukawa coupling. Some universal properties from the pseudoscalar Yukawa coupling are revealed. In particular, as the coupling increases, there is a transfer of the spectral weight from the low energy band to the high energy band. The transfer is over low energy scales but not over all energy scales. The peculiar properties are also explored. The study shows that as the doping increases, the gap becomes more difficult to open. This indicates that there is a competition between the pseudoscalar Yukawa coupling and the doping.

preprint2020arXiv

Holographic two-currents model with coupling and its conductivities

We implement a holographic gravity model of two gauge fields with a coupling between them, which is dual to a two-currents model. An analytical black brane solution is obtained. In particular, we work out the expressions of the conductivities with coupling and find that they are directly related to the coupling parameter $θ$. This is the main topic of our present work. As an application, we calculate the conductivities by the scheme outlined here and briefly discuss their properties. An interesting property is that as the coupling $θ$ increases, the dip at low frequency in $Re[σ_A]$/$Re[σ_B]$ deepens and then turns into a hard-gap-like feature when $θ=1$, which is independent of the doping $χ$. Some monotonic behaviors of the conductivities are also discussed.

preprint2020arXiv

Sato: Contextual Semantic Type Detection in Tables

Detecting the semantic types of data columns in relational tables is important for various data preparation and information retrieval tasks such as data cleaning, schema matching, data discovery, and semantic search. However, existing detection approaches either perform poorly with dirty data, support only a limited number of semantic types, fail to incorporate the table context of columns or rely on large sample sizes for training data. We introduce Sato, a hybrid machine learning model to automatically detect the semantic types of columns in tables, exploiting the signals from the context as well as the column values. Sato combines a deep learning model trained on a large-scale table corpus with topic modeling and structured prediction to achieve support-weighted and macro average F1 scores of 0.925 and 0.735, respectively, exceeding the state-of-the-art performance by a significant margin. We extensively analyze the overall and per-type performance of Sato, discussing how individual modeling components, as well as feature categories, contribute to its performance.

preprint2019arXiv

Searching for Neutrino-less Double Beta Decay of $^{136}$Xe with PandaX-II Liquid Xenon Detector

We report the Neutrino-less Double Beta Decay (NLDBD) search results from the PandaX-II dual-phase liquid xenon time projection chamber. The total live time used in this analysis is 403.1 days from June 2016 to August 2018. With NLDBD-optimized event selection criteria, we obtain a fiducial mass of 219 kg of natural xenon. The accumulated xenon exposure is 242 kg$\cdot$yr, or equivalently 22.2 kg$\cdot$yr of $^{136}$Xe exposure. In the region around the $^{136}$Xe decay Q-value of 2458 keV, the energy resolution of PandaX-II is 4.2%. We find no evidence of NLDBD in PandaX-II and establish a lower limit for the decay half-life of $2.4 \times 10^{23}$ yr at the 90% confidence level, which corresponds to an effective Majorana neutrino mass $m_{ββ} < (1.3 - 3.5)$ eV. This is the first NLDBD result reported from a dual-phase xenon experiment.

preprint2016arXiv

PandaX-III: Searching for Neutrinoless Double Beta Decay with High Pressure $^{136}$Xe Gas Time Projection Chambers

Searching for the Neutrinoless Double Beta Decay (NLDBD) is now regarded as the most promising technique to explore the nature of neutrinos after the discovery of neutrino masses in oscillation experiments. PandaX-III (Particle And Astrophysical Xenon Experiment III) will search for the NLDBD of $^{136}$Xe at the China Jin Ping underground Laboratory (CJPL). In the first phase of the experiment, a high pressure gas Time Projection Chamber (TPC) will contain 200 kg, 90% $^{136}$Xe enriched gas operated at 10 bar. Fine pitch micro-pattern gas detectors (Microbulk Micromegas) will be used at both ends of the TPC for the charge readout with a cathode in the middle. Charge signals can be used to reconstruct tracks of NLDBD events and provide good energy and spatial resolution. The detector will be immersed in a large water tank to ensure $\sim$5 m of water shielding in all directions. The second phase, a ton-scale experiment, will consist of five TPCs in the same water tank, with improved energy resolution and better control over backgrounds.

preprint2015arXiv

Comparison between GFDM and VOFDM

This document provides a comparison of the transmission techniques used in Generalized Frequency Division Multiplexing (GFDM) and Vector-OFDM (VOFDM). Within the document both systems are coarsely described and common and distinct properties are highlighted.

preprint2015arXiv

GFDM - A Framework for Virtual PHY Services in 5G Networks

The next generation of wireless networks will face different challenges from new scenarios. The main contribution of this paper is to show that Generalized Frequency Division Multiplexing (GFDM), as a baseline of flexible circular filtered multicarrier systems, can be used as a framework to virtualize the PHY service for the upper layers of 5G networks. This framework opens the possibility to apply software-defined network principles to produce software-defined waveforms capable of addressing the requirements of future mobile networks. Hence, a block oriented concept will be used to provide the modulation service, emulating different flavors of waveforms designed to go beyond the well-established Orthogonal Frequency Division Multiplexing (OFDM) principles, in scenarios where they perform best. The virtual physical layer (PHY) service opens the opportunity to have a fast and dynamic evolution of the infrastructure, as applications change over time. The presented unified modulation concept contributes with future research directions to address burst and continuous transmissions, referencing basic approaches for synchronization and advanced receiver design that can be exploited in future for the whole frame structure design and channel estimation strategies.

preprint2015arXiv

GFDM Transceiver using Precoded Data and Low-complexity Multiplication in Time Domain

Future wireless communication systems are demanding a more flexible physical layer. GFDM is a block filtered multicarrier modulation scheme proposed to add multiple degrees of freedom and cover other waveforms in a single framework. In this paper, GFDM modulation and demodulation will be presented as a frequency-domain circular convolution, allowing for a reduction of the implementation complexity when MF, ZF and MMSE filters are employed in the inner and outer receiver operation. Also, precoding is introduced to further increase GFDM flexibility, addressing a wider set of applications.

preprint2015arXiv

ND^(*) and NB^(*) interactions in a chiral quark model

ND and ND^* interactions have become a hot topic after the observation of the new charmed hadrons Σ_c(2800) and Λ_c(2940)^+. In this letter, we have preliminarily investigated S-wave ND and ND^* interactions with possible quantum numbers in the chiral SU(3) quark model and the extended chiral SU(3) quark model by solving the resonating group method equation. The numerical results show that the interactions between N and D or N and D^* are both attractive, which mainly comes from σ exchanges between light quarks. Further bound-state studies indicate the attractions are strong enough to form ND or ND^* molecules, except for (ND)_{J=3/2} and (ND^*)_{J=3/2} in the chiral SU(3) quark model. Consequently, the ND system with J=1/2 and the ND^* system with J=3/2 in the extended SU(3) quark model could correspond to the observed Σ_c(2800) and Λ_c(2940)^+, respectively. Naturally, the same method can be applied to study NB and NB^* interactions, and similar conclusions are obtained, i.e. NB and NB^* attractive forces may produce bound states, except for (NB^*)_{J=3/2} in the chiral SU(3) quark model. Meanwhile, S partial wave phase shifts of ND^{(*)} and NB^{(*)} elastic scattering are illustrated, which are qualitatively consistent with the results from the bound-state problem.

preprint2015arXiv

Principled Evaluation of Differentially Private Algorithms using DPBench

Differential privacy has become the dominant standard in the research community for strong privacy protection. There has been a flood of research into query answering algorithms that meet this standard. Algorithms are becoming increasingly complex, and in particular, the performance of many emerging algorithms is data dependent, meaning the distribution of the noise added to query answers may change depending on the input data. Theoretical analysis typically only considers the worst case, making empirical study of average case performance increasingly important. In this paper we propose a set of evaluation principles which we argue are essential for sound evaluation. Based on these principles we propose DPBench, a novel evaluation framework for standardized evaluation of privacy algorithms. We then apply our benchmark to evaluate algorithms for answering 1- and 2-dimensional range queries. The result is a thorough empirical study of 15 published algorithms on a total of 27 datasets that offers new insights into algorithm behavior (in particular the influence of dataset scale and shape) and a more complete characterization of the state of the art. Our methodology is able to resolve inconsistencies in prior empirical studies and place algorithm performance in context through comparison to simple baselines. Finally, we pose open research questions which we hope will guide future algorithm design.

preprint2015arXiv

Reduced Complexity Calculation of LMMSE Filter Coefficients for GFDM

A low-complexity algorithm for calculation of the LMMSE filter coefficients for GFDM in a block-fading multipath environment is derived in this letter. The simplification is based on the block circularity of the involved matrices. The proposal reduces complexity from cubic to squared order. The proposed approach can be generalized to other waveforms with circular pulse shaping.

preprint2013arXiv

Dynamic Radio Resource Management for Random Network Coding: Power Control and CSMA Backoff Control

Resource allocation in wireless networks typically occurs at PHY/MAC layers, while random network coding (RNC) is a network layer strategy. An interesting question is how resource allocation mechanisms can be tuned to improve RNC performance. By means of a differential equation framework which models RNC throughput in terms of lower layer parameters, we propose a gradient based approach that can dynamically allocate MAC and PHY layer resources with the goal of maximizing the minimum network coding throughput among all the destination nodes in an RNC multicast. We exemplify this general approach with two resource allocation problems: (i) power control to improve network coding throughput, and (ii) CSMA mean backoff delay control to improve network coding throughput. We design both centralized algorithms and online algorithms for power control and CSMA backoff control. Our evaluations, including numerically solving the differential equations in the centralized algorithm and an event-driven simulation for the online algorithm, show that such gradient based dynamic resource allocation yields significant throughput improvement for the destination nodes in RNC. Further, our numerical results reveal that network coding aware power control can regain the broadcast advantage of wireless transmissions to improve the throughput.

preprint2012arXiv

Analyzing Random Network Coding with Differential Equations and Differential Inclusions

We develop a framework based on differential equations (DE) and differential inclusions (DI) for analyzing Random Network Coding (RNC), as well as a nonlinear variant referred to as Random Coupon (RC), in a wireless network. The DEDI framework serves as a powerful numerical and analytical tool to study RNC. We demonstrate its versatility by proving theoretical results on multicast information flows in a wireless network using RNC or RC. We also demonstrate the accuracy and flexibility of the performance analysis enabled by this framework via illustrative examples of networks with multiple multicast sessions, user cooperation and arbitrary topologies.

preprint2012arXiv

Complexity-Efficient Enumeration Techniques for Soft-Input, Soft-Output Sphere Decoding

In this paper two complexity efficient soft sphere-decoder modifications are proposed for computing the max-log LLR values in iterative MIMO systems, which avoid the costly, typically needed, full enumeration and sorting (FES) procedure during the tree traversal without compromising the max-log performance. It is shown that despite the resulting increase in the number of expanded nodes, they can be more computationally efficient than the typical soft sphere decoders by avoiding the unnecessary complexity of FES.

preprint2011arXiv

A Primary Study of Heavy Baryons Lambda_Q, Xi_Q, Sigma_Q and Omega_Q

We perform a preliminary study of the 1/2+ and 3/2+ ground-state baryons containing a heavy quark in the framework of the chiral SU(3) quark model. By using the calculus of variations, masses of Lambda_Q, Sigma_Q, Xi_Q, Omega_Q, Sigma_Q^*, Xi_Q^* and Omega_Q^*, where Q means c or b quark, are calculated. With reasonable model parameters, the numerical results for established heavy baryons are generally in agreement with the available experimental data, except that those of Xi_Q are somewhat heavier. For Omega_b, whose experimental mass is undetermined, and the unobserved Xi_b^* and Omega_b^*, reasonable theoretical predictions are obtained. Interactions inside baryons are also discussed.

preprint2011arXiv

Breaking a chaotic image encryption algorithm based on perceptron model

Recently, a chaotic image encryption algorithm based on a perceptron model was proposed. The present paper analyzes the security of the algorithm and finds that the equivalent secret key can be reconstructed with only one known plaintext/ciphertext pair, which is supported by both mathematical proof and experimental results. In addition, some other security defects are also reported.

preprint2009arXiv

S-wave DK interactions in the chiral SU(3) quark model

The $DK$ interaction is relevant to the interpretation of the $D_{sJ}(2317)$. We dynamically investigate $S$-wave $DK$ interactions in the chiral SU(3) quark model by solving the resonating group method equation. The numerical results show an attraction between $D$ and $K$, which is from boson exchanges between light quarks. However, such an attraction is not strong enough to form a $DK$ molecule. Meanwhile, $S$ partial wave phase shifts of $DK$ elastic scattering are obtained. The case of $S$-wave $D^*K$ is rather similar to that of $DK$. To draw a definite conclusion whether a molecular state exists in $DK$ or $D^*K$ system, more details of dynamics should be considered in further study.

48 published item(s)

Do Agents Need to Plan Step-by-Step? Rethinking Planning Horizon in Data-Centric Tool Calling

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free

Rubric-based On-policy Distillation

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Quasinormal modes of quantum-corrected black holes

A First Search for Solar $^8$B Neutrino in the PandaX-4T Experiment using Neutrino-Nucleus Coherent Scattering

MEGAnno: Exploratory Labeling for NLP in Computational Notebooks

Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning

Towards Multifaceted Human-Centered AI

$\rm ^{83}Rb$/$\rm ^{83m}Kr$ production and cross-section measurement with 3.4 MeV and 20 MeV proton beams

A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators

A novel holographic quantum phase transition and butterfly velocity

A Search for the Cosmic Ray Boosted Sub-GeV Dark Matter at the PandaX-II Experiment

A search for two-component Majorana dark matter in a simplified model using the full exposure data of PandaX-II experiment

Annotating Columns with Pre-trained Language Models

Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition

Impact of Economic Constraints on the Projected Timeframe for Human-Crewed Deep Space Exploration

Low Radioactive Material Screening and Background Control for the PandaX-4T Experiment

Neutron-induced nuclear recoil background in the PandaX-4T experiment

Retinal Structure Detection in OCTA Image via Voting-based Multi-task Learning

Study of background from accidental coincidence signals in the PandaX-II experiment

Dark Matter Search Results from the PandaX-4T Commissioning Run

Internal Calibration of the PandaX-II Detector with Radon Gaseous Sources

Light yield and field dependence measurement in PandaX-II dual-phase xenon detector

MITNet: GAN Enhanced Magnetic Induction Tomography Based on Complex CNN

On-manifold Adversarial Data Augmentation Improves Uncertainty Calibration

Results of Dark Matter Search using the Full PandaX-II Exposure

Unifying Message Passing Algorithms Under the Framework of Constrained Bethe Free Energy Minimization

Analytical study of the holographic superconductor from higher derivative theory

Chaotic dynamics of string around charged black brane with hyperscaling violation

Doped holographic fermionic system

Holographic two-currents model with coupling and its conductivities

Sato: Contextual Semantic Type Detection in Tables

Searching for Neutrino-less Double Beta Decay of $^{136}$Xe with PandaX-II Liquid Xenon Detector

PandaX-III: Searching for Neutrinoless Double Beta Decay with High Pressure $^{136}$Xe Gas Time Projection Chambers

Comparison between GFDM and VOFDM

GFDM - A Framework for Virtual PHY Services in 5G Networks

GFDM Transceiver using Precoded Data and Low-complexity Multiplication in Time Domain

ND^(*) and NB^(*) interactions in a chiral quark model

Principled Evaluation of Differentially Private Algorithms using DPBench

Reduced Complexity Calculation of LMMSE Filter Coefficients for GFDM

Dynamic Radio Resource Management for Random Network Coding: Power Control and CSMA Backoff Control

Analyzing Random Network Coding with Differential Equations and Differential Inclusions

Complexity-Efficient Enumeration Techniques for Soft-Input, Soft-Output Sphere Decoding

A Primary Study of Heavy Baryons Lambda_Q, Xi_Q, Sigma_Q and Omega_Q

Breaking a chaotic image encryption algorithm based on perceptron model

S-wave DK interactions in the chiral SU(3) quark model
