Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
48works
0followers
27topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

48 published item(s)

preprint2026arXiv

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

The evolution of Large Language Models (LLMs) into autonomous agents has expanded the scope of AI coding from localized code generation to complex, repository-level, and execution-driven problem solving. However, current benchmarks predominantly evaluate code logic in static contexts, neglecting the dynamic, full-process requirements of real-world engineering, particularly in backend development which demands rigorous environment configuration and service deployment. To address this gap, we introduce ABC-Bench, a benchmark explicitly designed to evaluate agentic backend coding within a realistic, executable workflow. Using a scalable automated pipeline, we curated 224 practical tasks spanning 8 languages and 19 frameworks from open-source repositories. Distinct from previous evaluations, ABC-Bench require the agents to manage the entire development lifecycle from repository exploration to instantiating containerized services and pass the external end-to-end API tests. Our extensive evaluation reveals that even state-of-the-art models struggle to deliver reliable performance on these holistic tasks, highlighting a substantial disparity between current model capabilities and the demands of practical backend engineering. Our code is available at https://github.com/OpenMOSS/ABC-Bench.

preprint2026arXiv

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Financial agents powered by large language models (LLMs) are increasingly deployed for investment analysis, risk assessment, and automated decision-making, where their abilities to plan, invoke tools, and manipulate mutable state introduce new security risks in high-stakes and highly regulated financial environments. However, existing safety evaluations largely focus on language-model-level content compliance or abstract agent settings, failing to capture execution-grounded risks arising from real operational workflows and state-changing actions. To bridge this gap, we propose FinVault, the first execution-grounded security benchmark for financial agents, comprising 31 regulatory case-driven sandbox scenarios with state-writable databases and explicit compliance constraints, together with 107 real-world vulnerabilities and 963 test cases that systematically cover prompt injection, jailbreaking, financially adapted attacks, as well as benign inputs for false-positive evaluation. Experimental results reveal that existing defense mechanisms remain ineffective in realistic financial agent settings, with average attack success rates (ASR) still reaching up to 50.0\% on state-of-the-art models and remaining non-negligible even for the most robust systems (ASR 6.7\%), highlighting the limited transferability of current safety designs and the need for stronger financial-specific defenses. Our code can be found at https://github.com/aifinlab/FinVault.

preprint2026arXiv

Hierarchical Contrastive Learning for Multi-Domain Protein-Ligand Binding

Predicting protein-ligand binding affinity remains intractable for multi-domain proteins, where inter-domain dynamics govern molecular recognition. Existing geometric deep learning methods typically treat proteins as monolithic static graphs, suffering from rigid-body assumptions and aleatoric noise in flexible regions. To address this, we introduced HCLBind, a self-supervised framework that decouples geometric representation learning from affinity regression. HCLBind leverages a general-to-specific pre-training paradigm on the Q-BioLiP database to learn a robust physical grammar of binding. We propose a novel hierarchical decoy strategy: the model learns local physicochemical constraints through protein coordinate perturbation in single-domain proteins and global conformational geometry through inter-domain rotation in multi-domain complexes. Our hybrid architecture integrates a domain-gated graph attention network and cross-modal attention to explicitly prioritize domain interfaces. Furthermore, we employ LoRA on protein and ligand foundation models, ensuring efficient optimization while preserving evolutionary knowledge. Experiments on PDBBind demonstrate that HCLBind effectively learns discriminative interface features and provides robust uncertainty estimation, overcoming the limitations of standard supervised learning. The code is available at https://github.com/jiankliu/HCLBind.

preprint2026arXiv

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

While autonomous software engineering (SWE) agents are reshaping programming paradigms, they currently suffer from a "closed-world" limitation: they attempt to fix bugs from scratch or solely using local context, ignoring the immense historical human experience available on platforms like GitHub. Accessing this open-world experience is hindered by the unstructured and fragmented nature of real-world issue-tracking data. In this paper, we introduce MemGovern, a framework designed to govern and transform raw GitHub data into actionable experiential memory for agents. MemGovern employs experience governance to convert human experience into agent-friendly experience cards and introduces an agentic experience search strategy that enables logic-driven retrieval of human expertise. By producing 135K governed experience cards, MemGovern achieves a significant performance boost, improving resolution rates on the SWE-bench Verified by 4.65%. As a plug-in approach, MemGovern provides a solution for agent-friendly memory infrastructure.

preprint2026arXiv

Res$^2$CLIP: Few-Shot Generalist Anomaly Detection with Residual-to-Residual Alignment

Few-shot Generalist Anomaly Detection requires models to generalize to novel categories without retraining, posing significant challenges in real-world scenarios with scarce samples and rapidly changing categories. Existing CLIP-based methods face two major challenges: coarse-grained unified text prompts struggle to adapt to fine-grained foreground-background differences, causing cross-granularity mismatch; and fine-tuning on auxiliary datasets disrupts CLIP's inherent open-world generalization due to domain shift, leading to cross-category generalization degradation. To address these, we propose to shift multimodal alignment entirely into a unified residual space, where residual representations naturally eliminate fine-grained normal feature differences across regions and class-specific biases, simultaneously resolving both problems. Based on this insight, Res$^2$CLIP, the first residual-to-residual alignment framework that symmetrically bridges visual and text modalities within CLIP's residual space, is designed. The framework is developed from a residual perspective into three branches: a text prompt-based branch, a visual prompt-based branch, and a novel residual-to-residual alignment branch. All learnable optimizations are constrained within the residual domain, and the residual alignment optimization objectives are designed to force the model to focus on relative anomaly deviations rather than optimizing class-specific features. Experiments on multiple datasets demonstrate the effectiveness of our architecture. The code is available at https://github.com/hito2448/Res2CLIP.

preprint2026arXiv

Resolving the Fe K$α$ Doublet of the Galactic Center Molecular Cloud G0.11-0.11 with XRISM

Fe K$α$ line emission from Galactic center molecular clouds can be produced either via fluorescence after illumination by an X-ray source or by cosmic ray ionization. Unparalleled high-resolution X-ray spectroscopy obtained by XRISM-Resolve for the galactic center molecular cloud G0.11-0.11 resolves its Fe K$α$ line complex for the first time, and points to a new method for discrimination between the X-ray reflection and cosmic ray ionization models. The Fe K$α$ line complex is resolved into Fe K$α_1$ at $E_{1} = 6.4040 \: \rm{keV}$ and Fe K$α_2$ at $E_{2}= 6.3910 \:\rm{keV}$. Both lines have non-instrumental FWHM of $\approx 3 \:\rm{eV}$, close to the predicted quantum mechanical width of the lines, suggesting scant other sources of line broadening other than instrumental and quantum effects. We measure a radial velocity of $v_{\rm{LSR}} = 50 \pm 12_{fit} \pm 14_{scale} \:\rm{km/s}$ for G0.11-0.11, achieving the same precision reached by radio observations of such clouds. The high-resolution spectrum tests for the presence of secondary Fe K$α$ lines, expected as a signature of cosmic ray proton/ion ionization. The absence of the secondary lines argues against the cosmic ray ionization model for G0.11-0.11. In the preferred X-ray reflection model, if the illuminating source is Sgr A$^{\star}$, the required luminosity for an X-ray outburst about 200 years ago is $L_8 \approx 10^{38} \:\rm{erg/s}$ in an $8\:\rm{keV}$-wide band at $8\:\rm{keV}$.

preprint2026arXiv

SkillGenBench: Benchmarking Skill Generation Pipelines for LLM Agents

As LLM agents are increasingly built around reusable skills, a central challenge is no longer only whether agents can use provided skills, but whether they can generate correct, reusable, and executable skills from repositories and documents. Existing benchmarks primarily evaluate the efficacy of given skills or the ability of agents to solve downstream tasks from raw context, but they do not isolate skill generation itself as the object of study. We introduce SkillGenBench, a benchmark for evaluating skill generation pipelines under a unified and controlled protocol. In SkillGenBench, a generator receives raw corpora and produces standardized skill artifacts, which are then executed under fixed harnesses and assessed with unified evaluation procedures. The benchmark covers two generation regimes: task-conditioned generation, where a task-specific skill is synthesized after the task is revealed, and task-agnostic generation, where a reusable skill library must be distilled before downstream tasks are known. It also spans two complementary procedural sources: repository-grounded instances, where procedures are distributed across code, configuration, and scripts, and document-grounded instances, where procedures and constraints must be distilled from long-form text. We provide standardized task specifications, pinned environments, and evaluation protocols centered on deterministic execution-based checks, supplemented by auxiliary signals for diagnosis. Experiments across a range of skill-generation methods and backbones show substantial performance variation, highlight the difficulty of reusable skill distillation, and reveal distinct failure modes in skill generation from software repositories versus long-form documents. SkillGenBench establishes a reproducible testbed for studying skill generation as an independent research problem in agent systems.

preprint2026arXiv

Sphinx: Benchmarking and Modeling for LLM-Driven Pull Request Review

Pull request (PR) review is essential for ensuring software quality, yet automating this task remains challenging due to noisy supervision, limited contextual understanding, and inadequate evaluation metrics. We present Sphinx, a unified framework for LLM-based PR review that addresses these limitations through three key components: (1) a structured data generation pipeline that produces context-rich, semantically grounded review comments by comparing pseudo-modified and merged code; (2) a checklist-based evaluation benchmark that assesses review quality based on structured coverage of actionable verification points, moving beyond surface-level metrics like BLEU; and (3) Checklist Reward Policy Optimization (CRPO), a novel training paradigm that uses rule-based, interpretable rewards to align model behavior with real-world review practices. Extensive experiments show that models trained with Sphinx achieve state-of-the-art performance on review completeness and precision, outperforming both proprietary and open-source baselines by up to 40\% in checklist coverage. Together, Sphinx enables the development of PR review models that are not only fluent but also context-aware, technically precise, and practically deployable in real-world development workflows. The data will be released after review.

preprint2026arXiv

StoicLLM: Preference Optimization for Philosophical Alignment in Small Language Models

While large language models excel at factual adaptation, their ability to internalize nuanced philosophical frameworks under severe data constraints remains underexplored. We investigate this by specializing small LLMs on micro-datasets of foundational Stoic texts using preference optimization (ORPO, AlphaPO). Evaluated via a multi-model critic bank, our results show that just 300 high-fidelity examples can induce strong alignment with inward-facing Stoic virtues, closely approaching few-shot prompting while freeing the context window. Critically, however, all models, including few-shot baselines, exhibit a persistent failure on Stoicism's outward-facing cosmopolitan duties, pointing to a representational limitation of small models that micro-dataset adaptation alone cannot overcome.

preprint2026arXiv

The Advanced X-ray Imaging Satellite (AXIS) Community Science Book

The AXIS Community Science Book represents the collective effort of 592 scientists worldwide to define the transformative science enabled by the Advanced X-ray Imaging Satellite (AXIS), a next-generation X-ray mission selected by NASA's Astrophysics Probe Program for Phase A study. AXIS will advance the legacy of high-angular-resolution X-ray astronomy with ~1.5'' imaging over a wide 24' field of view and an order of magnitude greater collecting area than Chandra in the 0.3-12 keV band. Combining sharp imaging, high throughput, and rapid response capabilities, AXIS will open new windows on virtually every aspect of modern astrophysics, exploring the birth and growth of supermassive black holes, the feedback processes that shape galaxies, the life cycles of stars and exoplanet environments, and the nature of compact stellar remnants, supernova remnants, and explosive transients. This book compiles 138 community-contributed science cases developed by five Science Working Groups focused on AGN and supermassive black holes, galaxy evolution and feedback, compact objects and supernova remnants, stellar physics and exoplanets, and time-domain and multi-messenger astrophysics. Together, these studies establish the scientific foundation for next-generation X-ray exploration in the 2030s and highlight strong synergies with facilities of the 2030s, such as JWST, Roman, Rubin/LSST, SKA, ALMA, ngVLA, and next-generation gravitational-wave and neutrino networks.

preprint2022arXiv

"Think Before You Speak": Improving Multi-Action Dialog Policy by Planning Single-Action Dialogs

Multi-action dialog policy (MADP), which generates multiple atomic dialog actions per turn, has been widely applied in task-oriented dialog systems to provide expressive and efficient system responses. Existing MADP models usually imitate action combinations from the labeled multi-action dialog samples. Due to data limitations, they generalize poorly toward unseen dialog flows. While interactive learning and reinforcement learning algorithms can be applied to incorporate external data sources of real users and user simulators, they take significant manual effort to build and suffer from instability. To address these issues, we propose Planning Enhanced Dialog Policy (PEDP), a novel multi-task learning framework that learns single-action dialog dynamics to enhance multi-action prediction. Our PEDP method employs model-based planning for conceiving what to express before deciding the current response through simulating single-action dialogs. Experimental results on the MultiWOZ dataset demonstrate that our fully supervised learning-based method achieves a solid task success rate of 90.6%, improving 3% compared to the state-of-the-art methods.

preprint2022arXiv

A primal finite element scheme of the Hodge Laplace problem

In this paper, a unified family, for any $n\geqslant 2$ and $1\leqslant k\leqslant n-1$, of nonconforming finite element schemes are presented for the primal weak formulation of the $n$-dimensional Hodge-Laplace equation on $HΛ^k\cap H^*_0Λ^k$ and on the simplicial subdivisions of the domain. The finite element scheme possesses an $\mathcal{O}(h)$-order convergence rate for sufficiently regular data, and an $\mathcal{O}(h^s)$-order rate on any $s$-regular domain, $0<s\leqslant 1$, no matter what topology the domain has.

preprint2022arXiv

A Scheme to fabricate magnetic graphene-like cobalt nitride CoN4monolayer proposed by first-principles calculations

We propose a scheme to fabricate the cobalt nitride CoN4 monolayer, a magnetic graphene-like two-dimensional material, in which all Co and N atoms are in a plane. Under the pressure above 40 GPa, the bulk CoN4 is stabilized in a triclinic phase. With the pressure decreasing, the triclinic phase of CoN4 is transformed into an orthorhombic phase, and the latter is a layered compound with large interlayer spacing. At ambient condition, the weak interlayer couplings are so small that single CoN4 layer can be exfoliated by the mechanical method.

preprint2022arXiv

Active Learning on a Programmable Photonic Quantum Processor

Training a quantum machine learning model generally requires a large labeled dataset, which incurs high labeling and computational costs. To reduce such costs, a selective training strategy, called active learning (AL), chooses only a subset of the original dataset to learn while maintaining the trained model&#39;s performance. Here, we design and implement two AL-enpowered variational quantum classifiers, to investigate the potential applications and effectiveness of AL in quantum machine learning. Firstly, we build a programmable free-space photonic quantum processor, which enables the programmed implementation of various hybrid quantum-classical computing algorithms. Then, we code the designed variational quantum classifier with AL into the quantum processor, and execute comparative tests for the classifiers with and without the AL strategy. The results validate the great advantage of AL in quantum machine learning, as it saves at most $85\%$ labeling efforts and $91.6\%$ percent computational efforts compared to the training without AL on a data classification task. Our results inspire AL&#39;s further applications in large-scale quantum machine learning to drastically reduce training data and speed up training, underpinning the exploration of practical quantum advantages in quantum physics or real-world applications.

preprint2022arXiv

Analyzing and Simulating User Utterance Reformulation in Conversational Recommender Systems

User simulation has been a cost-effective technique for evaluating conversational recommender systems. However, building a human-like simulator is still an open challenge. In this work, we focus on how users reformulate their utterances when a conversational agent fails to understand them. First, we perform a user study, involving five conversational agents across different domains, to identify common reformulation types and their transition relationships. A common pattern that emerges is that persistent users would first try to rephrase, then simplify, before giving up. Next, to incorporate the observed reformulation behavior in a user simulator, we introduce the task of reformulation sequence generation: to generate a sequence of reformulated utterances with a given intent (rephrase or simplify). We develop methods by extending transformer models guided by the reformulation type and perform further filtering based on estimated reading difficulty. We demonstrate the effectiveness of our approach using both automatic and human evaluation.

preprint2022arXiv

DuMLP-Pin: A Dual-MLP-dot-product Permutation-invariant Network for Set Feature Extraction

Existing permutation-invariant methods can be divided into two categories according to the aggregation scope, i.e. global aggregation and local one. Although the global aggregation methods, e. g., PointNet and Deep Sets, get involved in simpler structures, their performance is poorer than the local aggregation ones like PointNet++ and Point Transformer. It remains an open problem whether there exists a global aggregation method with a simple structure, competitive performance, and even much fewer parameters. In this paper, we propose a novel global aggregation permutation-invariant network based on dual MLP dot-product, called DuMLP-Pin, which is capable of being employed to extract features for set inputs, including unordered or unstructured pixel, attribute, and point cloud data sets. We strictly prove that any permutation-invariant function implemented by DuMLP-Pin can be decomposed into two or more permutation-equivariant ones in a dot-product way as the cardinality of the given input set is greater than a threshold. We also show that the DuMLP-Pin can be viewed as Deep Sets with strong constraints under certain conditions. The performance of DuMLP-Pin is evaluated on several different tasks with diverse data sets. The experimental results demonstrate that our DuMLP-Pin achieves the best results on the two classification problems for pixel sets and attribute sets. On both the point cloud classification and the part segmentation, the accuracy of DuMLP-Pin is very close to the so-far best-performing local aggregation method with only a 1-2% difference, while the number of required parameters is significantly reduced by more than 85% in classification and 69% in segmentation, respectively. The code is publicly available on https://github.com/JaronTHU/DuMLP-Pin.

preprint2022arXiv

Hard X-ray emission from the eastern jet of SS 433 powering the W50 `Manatee&#39; nebula: Evidence for particle re-acceleration

We present a broadband X-ray study of W50 (`the Manatee nebula&#39;), the complex region powered by the microquasar SS 433, that provides a test-bed for several important astrophysical processes. The W50 nebula, a Galactic PeVatron candidate, is classified as a supernova remnant but has an unusual double-lobed morphology likely associated with the jets from SS 433. Using NuSTAR, XMM-Newton, and Chandra observations of the inner eastern lobe of W50, we have detected hard non-thermal X-ray emission up to $\sim$30 keV, originating from a few-arcminute size knotty region (`Head&#39;) located $\lesssim$ 18$^{\prime}$ (29 pc for a distance of 5.5 kpc) east of SS 433, and constrain its photon index to 1.58$\pm$0.05 (0.5-30 keV band). The index gradually steepens eastward out to the radio `ear&#39; where thermal soft X-ray emission with a temperature $kT$$\sim$0.2 keV dominates. The hard X-ray knots mark the location of acceleration sites within the jet and require an equipartition magnetic field of the order of $\gtrsim$12$μ$G. The unusually hard spectral index from the `Head&#39; region challenges classical particle acceleration processes and points to particle injection and re-acceleration in the sub-relativistic SS 433 jet, as seen in blazars and pulsar wind nebulae.

preprint2022arXiv

Light Field Raindrop Removal via 4D Re-sampling

The Light Field Raindrop Removal (LFRR) aims to restore the background areas obscured by raindrops in the Light Field (LF). Compared with single image, the LF provides more abundant information by regularly and densely sampling the scene. Since raindrops have larger disparities than the background in the LF, the majority of texture details occluded by raindrops are visible in other views. In this paper, we propose a novel LFRR network by directly utilizing the complementary pixel information of raindrop-free areas in the input raindrop LF, which consists of the re-sampling module and the refinement module. Specifically, the re-sampling module generates a new LF which is less polluted by raindrops through re-sampling position predictions and the proposed 4D interpolation. The refinement module improves the restoration of the completely occluded background areas and corrects the pixel error caused by 4D interpolation. Furthermore, we carefully build the first real scene LFRR dataset for model training and validation. Experiments demonstrate that the proposed method can effectively remove raindrops and achieves state-of-the-art performance in both background restoration and view consistency maintenance.

preprint2022arXiv

M Subdwarf Research III. Spectroscopic Diagnostics for Breaking Parameter Degeneracy

To understand the parameter degeneracy of M subdwarf spectra at low resolution, we assemble a large number of spectral features in the wavelength range of 0.6-2.5 μm with band strength quantified by narrowband indices. Based on the index trends of BT-Settl model sequences, we illustrate how the main atmospheric parameters (Teff, log g, [M/H], and [alpha/Fe]) affect each spectral feature differently. Furthermore, we propose a four-step process to determine the four parameters sequentially, which extends the basic idea proposed by Jao et al. Each step contains several spectral features that break the degeneracy effect when determining a specific stellar parameter. Finally, the feasibility of each spectroscopic diagnostic with different spectral qualities is investigated. The result is resolution-independent down to R~200.

preprint2022arXiv

Multi-RIS Aided 3D Secure Precise Wireless Transmission

In this paper, multiple reconfigurable intelligent surfaces (RIS) aided secure precise wireless transmission (SPWT) schemes are proposed in the three-dimensional (3D) wireless communication scenario. Unavailable direct path channels from transmitter to receivers are considered when the direct paths are obstructed by obstacles. Then, multiple RISs are utilized to achieve SPWT through the reflection path among transmitter, RISs and receivers in order to enhance the communication performance and energy efficiency simultaneously. First, a maximum-signal-to-interference-and-noise ratio (MSINR) scheme is proposed in a single user scenario. Then, the multi-user scenario is considered where the illegitimate users are regarded as eavesdroppers. A maximum-secrecy-rate (MSR) scheme and a maximum-signal-to-leakage-and-noise ratio (MSLNR) are proposed. The former achieves a better secrecy rate (SR) performance but incurs a higher complexity. The latter has a lower complexity than the MSR scheme with an SR performance loss. Simulation results show that both single-user scheme and multi-user scheme can achieve SPWT which transmits confidential message precisely to location of desired users. Moreover, MSLNR scheme has a lower complexity than the MSR scheme, while the SR performance is close to that of the MSR scheme.

preprint2022arXiv

Ring That Bell: A Corpus and Method for Multimodal Metaphor Detection in Videos

We present the first openly available multimodal metaphor annotated corpus. The corpus consists of videos including audio and subtitles that have been annotated by experts. Furthermore, we present a method for detecting metaphors in the new dataset based on the textual content of the videos. The method achieves a high F1-score (62\%) for metaphorical labels. We also experiment with other modalities and multimodal methods; however, these methods did not out-perform the text-based model. In our error analysis, we do identify that there are cases where video could help in disambiguating metaphors, however, the visual cues are too subtle for our model to capture. The data is available on Zenodo.

preprint2022arXiv

StruBERT: Structure-aware BERT for Table Search and Matching

A large amount of information is stored in data tables. Users can search for data tables using a keyword-based query. A table is composed primarily of data values that are organized in rows and columns providing implicit structural information. A table is usually accompanied by secondary information such as the caption, page title, etc., that form the textual information. Understanding the connection between the textual and structural information is an important yet neglected aspect in table retrieval as previous methods treat each source of information independently. In addition, users can search for data tables that are similar to an existing table, and this setting can be seen as a content-based table retrieval. In this paper, we propose StruBERT, a structure-aware BERT model that fuses the textual and structural information of a data table to produce context-aware representations for both textual and tabular content of a data table. StruBERT features are integrated in a new end-to-end neural ranking model to solve three table-related downstream tasks: keyword- and content-based table retrieval, and table similarity. We evaluate our approach using three datasets, and we demonstrate substantial improvements in terms of retrieval and classification metrics over state-of-the-art methods.

preprint2022arXiv

The Photon Ring in M87*

We report measurements of the gravitationally lensed secondary image -- the first in an infinite series of so-called &#34;photon rings&#34; -- around the supermassive black hole M87* via simultaneous modeling and imaging of the 2017 Event Horizon Telescope (EHT) observations. The inferred ring size remains constant across the seven days of the 2017 EHT observing campaign and is consistent with theoretical expectations, providing clear evidence that such measurements probe spacetime and a striking confirmation of the models underlying the first set of EHT results. The residual diffuse emission evolves on timescales comparable to one week. We are able to detect with high significance a southwestern extension consistent with that expected from the base of a jet that is rapidly rotating in the clockwise direction. This result adds further support to the identification of the jet in M87* with a black hole spin-driven outflow, launched via the Blandford-Znajek process. We present three revised estimates for the mass of M87* based on identifying the modeled thin ring component with the bright ringlike features seen in simulated images, one of which is only weakly sensitive to the astrophysics of the emission region. All three estimates agree with each other and previously reported values. Our strongest mass constraint combines information from both the ring and the diffuse emission region, which together imply a mass-to-distance ratio of $4.20^{+0.12}_{-0.06}~μ{\rm as}$ and a corresponding black hole mass of $(7.13\pm0.39)\times10^9M_\odot$, where the error on the latter is now dominated by the systematic uncertainty arising from the uncertain distance to M87*.

preprint2022arXiv

Transition edge sensor based detector: from X-ray to $γ$-ray

The Transition Edge Sensor is extremely sensitive to the change of temperature, combined with the high-Z metal of a certain thickness, it can realize the high energy resolution measurement of particles such as X-rays. X-rays with energies below 10 keV have very weak penetrating ability, so only a few microns thick of gold or bismuth can obtain quantum efficiency higher than 70\%. Therefore, the entire structure of the TES X-ray detector in this energy range can be realized in the microfabrication process. However, for X-rays or gamma rays from 10 keV to 200 keV, sub-millimeter absorber layers are required, which cannot be realized by microfabrication process. This paper first briefly introduces a set of TES X-ray detectors and their auxiliary systems built by ShanghaiTech University, then focus on the introduction of the TES $γ$-ray detector, with absorber based on an sub-millimeter lead-tin alloy sphere. The detector has a quantum efficiency above 70\% near 100 keV, and an energy resolution of about 161.5eV@59.5keV.

preprint2022arXiv

TRIM: A Design Space Exploration Model for Deep Neural Networks Inference and Training Accelerators

There is increasing demand for specialized hardware for training deep neural networks, both in edge/IoT environments and in high-performance computing systems. The design space of such hardware is very large due to the wide range of processing architectures, deep neural network configurations, and dataflow options. This makes developing deep neural network processors quite complex, especially for training. We present TRIM, an infrastructure to help hardware architects explore the design space of deep neural network accelerators for both inference and training in the early design stages. The model evaluates at the whole network level, considering both inter-layer and intra-layer activities. Given applications, essential hardware specifications, and a design goal, TRIM can quickly explore different hardware design options, select the optimal dataflow and guide new hardware architecture design. We validated TRIM with FPGA-based implementation of deep neural network accelerators and ASIC-based architectures. We also show how to use TRIM to explore the design space through several case studies. TRIM is a powerful tool to help architects evaluate different hardware choices to develop efficient inference and training architecture design.

preprint2022arXiv

Ultracool dwarfs identified using spectra in LAMOST DR7

In this work, we identify 734 ultracool dwarfs with a spectral type of M6 or later, including one L0. Of this sample, 625 were studied spectroscopically for the first time. All of these ultracool dwarfs are within 360~pc, with a \textit{Gaia} G magnitude brighter than ~19.2 mag. By studying the spectra and checking their stellar parameters (Teff, logg, and [FeH] derived with the LAMOST pipeline, we found their cool red nature and their metallicity to be consistent with the nature of Galactic thin-disk objects. Furthermore, 77 of them show lithium absorption lines at 6708A, further indicating their young ages and substellar nature. Kinematics obtained through LAMOST radial velocities, along with the proper motion and parallax data from Gaia EDR3, also suggest that the majority of our targets are thin-disk objects. Kinematic ages were estimated through the relationship between the velocity dispersion and the average age for a certain population. Moreover, we identified 35 binaries, with 6 of them reported as binaries for the first time.

preprint2021arXiv

A simple low-degree optimal finite element scheme for the elastic transmission eigenvalue problem

The paper presents a finite element scheme for the elastic transmission eigenvalue problem written as a fourth order eigenvalue problem. The scheme uses piecewise cubic polynomials and obtains optimal convergence rate. Compared with other low-degree and nonconforming finite element schemes, the scheme inherits the continuous bilinear form which does not need extra stabilizations and is thus simple to implement.

preprint2020arXiv

3D Lidar Mapping Relative Accuracy Automatic Evaluation Algorithm

HD (High Definition) map based on 3D lidar plays a vital role in autonomous vehicle localization, planning, decision-making, perception, etc. Many 3D lidar mapping technologies related to SLAM (Simultaneous Localization and Mapping) are used in HD map construction to ensure its high accuracy. To evaluate the accuracy of 3D lidar mapping, the most common methods use ground truth of poses to calculate the error between estimated poses and ground truth, however it&#39;s usually so difficult to get the ground truth of poses in the actual lidar mapping for autonomous vehicle. In this paper, we proposed a relative accuracy evaluation algorithm that can automatically evaluate the accuracy of HD map built by 3D lidar mapping without ground truth. A method for detecting the degree of ghosting in point cloud map quantitatively is designed to reflect the accuracy indirectly, which takes advantage of the principle of light traveling in a straight line and the fact that light can not penetrate opaque objects. Our experimental results confirm that the proposed evaluation algorithm can automatically and efficiently detect the bad poses whose accuracy are less than the set threshold such as 0.1m, then calculate the bad poses percentage P_bad in all estimated poses to obtain the final accuracy metric P_acc = 1 - P_bad.

preprint2020arXiv

Accelerating Neural Network Inference by Overflow Aware Quantization

The inherent heavy computation of deep neural networks prevents their widespread applications. A widely used method for accelerating model inference is quantization, by replacing the input operands of a network using fixed-point values. Then the majority of computation costs focus on the integer matrix multiplication accumulation. In fact, high-bit accumulator leads to partially wasted computation and low-bit one typically suffers from numerical overflow. To address this problem, we propose an overflow aware quantization method by designing trainable adaptive fixed-point representation, to optimize the number of bits for each input tensor while prohibiting numeric overflow during the computation. With the proposed method, we are able to fully utilize the computing power to minimize the quantization loss and obtain optimized inference performance. To verify the effectiveness of our method, we conduct image classification, object detection, and semantic segmentation tasks on ImageNet, Pascal VOC, and COCO datasets, respectively. Experimental results demonstrate that the proposed method can achieve comparable performance with state-of-the-art quantization methods while accelerating the inference process by about 2 times.

preprint2020arXiv

An optimal piecewise cubic nonconforming finite element scheme for the planar biharmonic equation on general triangulations

This paper presents a nonconforming finite element scheme for the planar biharmonic equation which applis piecewise cubic polynomials ($P_3$) and possesses $\mathcal{O}(h^2)$ convergence rate in energy norm on general shape-regular triangulations. Both Dirichlet and Navier type boundary value problems are studied. The basis for the scheme is a piecewise cubic polynomial space, which can approximate the $H^4$ functions with $\mathcal{O}(h^2)$ accuracy in broken $H^2$ norm. Besides, an equivalence $(\nabla_h^2\ \cdot,\nabla_h^2\ \cdot)=(Δ_h\ \cdot,Δ_h\ \cdot)$, which is usually not true for nonconforming finite element spaces, is proved on the newly designed spaces. The finite element space does not correspond to a finite element defined with Ciarlet&#39;s triple; however, a set of locally supported basis functions of the finite element space is still figured out. The notion of the finite element Stokes complex plays an important role in the analysis and also the construction of the basis functions.

preprint2020arXiv

Auto-completion for Data Cells in Relational Tables

We address the task of auto-completing data cells in relational tables. Such tables describe entities (in rows) with their attributes (in columns). We present the CellAutoComplete framework to tackle several novel aspects of this problem, including: (i) enabling a cell to have multiple, possibly conflicting values, (ii) supplementing the predicted values with supporting evidence, (iii) combining evidence from multiple sources, and (iv) handling the case where a cell should be left empty. Our framework makes use of a large table corpus and a knowledge base as data sources, and consists of preprocessing, candidate value finding, and value ranking components. Using a purpose-built test collection, we show that our approach is 40\% more effective than the best baseline.

preprint2020arXiv

Evaluating Conversational Recommender Systems via User Simulation

Conversational information access is an emerging research area. Currently, human evaluation is used for end-to-end system evaluation, which is both very time and resource intensive at scale, and thus becomes a bottleneck of progress. As an alternative, we propose automated evaluation by means of simulating users. Our user simulator aims to generate responses that a real human would give by considering both individual preferences and the general flow of interaction with the system. We evaluate our simulation approach on an item recommendation task by comparing three existing conversational recommender systems. We show that preference modeling and task-specific interaction models both contribute to more realistic simulations, and can help achieve high correlation between automatic evaluation measures and manual human assessments.

preprint2020arXiv

Generating Categories for Sets of Entities

Category systems are central components of knowledge bases, as they provide a hierarchical grouping of semantically related concepts and entities. They are a unique and valuable resource that is utilized in a broad range of information access tasks. To aid knowledge editors in the manual process of expanding a category system, this paper presents a method of generating categories for sets of entities. First, we employ neural abstractive summarization models to generate candidate categories. Next, the location within the hierarchy is identified for each candidate. Finally, structure-, content-, and hierarchy-based features are used to rank candidates to identify by the most promising ones (measured in terms of specificity, hierarchy, and importance). We develop a test collection based on Wikipedia categories and demonstrate the effectiveness of the proposed approach.

preprint2020arXiv

IAI MovieBot: A Conversational Movie Recommender System

Conversational recommender systems support users in accomplishing recommendation-related goals via multi-turn conversations. To better model dynamically changing user preferences and provide the community with a reusable development framework, we introduce IAI MovieBot, a conversational recommender system for movies. It features a task-specific dialogue flow, a multi-modal chat interface, and an effective way to deal with dynamically changing user preferences. The system is made available open source and is operated as a channel on Telegram.

preprint2020arXiv

Improving Attention Mechanism in Graph Neural Networks via Cardinality Preservation

Graph Neural Networks (GNNs) are powerful to learn the representation of graph-structured data. Most of the GNNs use the message-passing scheme, where the embedding of a node is iteratively updated by aggregating the information of its neighbors. To achieve a better expressive capability of node influences, attention mechanism has grown to be popular to assign trainable weights to the nodes in aggregation. Though the attention-based GNNs have achieved remarkable results in various tasks, a clear understanding of their discriminative capacities is missing. In this work, we present a theoretical analysis of the representational properties of the GNN that adopts the attention mechanism as an aggregator. Our analysis determines all cases when those attention-based GNNs can always fail to distinguish certain distinct structures. Those cases appear due to the ignorance of cardinality information in attention-based aggregation. To improve the performance of attention-based GNNs, we propose cardinality preserved attention (CPA) models that can be applied to any kind of attention mechanisms. Our experiments on node and graph classification confirm our theoretical analysis and show the competitive performance of our CPA models.

preprint2020arXiv

Lowest-degree piecewise polynomial de Rham complex on general quadrilateral grids

This paper is devoted to the construction of finite elements on grids that consist of general quadrilaterals not limited in parallelograms. Two finite elements defined as Ciarlet&#39;s triple are established for the $H^1$ and $H(\rm rot)$ elliptic problems, respectively. An $\mathcal{O}(h)$ order convergence rate in energy norm for both of them and an $\mathcal{O}(h^2)$ order convergence in $L^2$ norm for the $H^1$ scheme are proved under the asymptotic-parallelogram assumption on the grids. Further, the two finite element spaces on general quadrilateral grids, together with the space of piecewise constant functions, formulate a discretized de Rham complex. The finite element spaces consist of piecewise polynomial functions, and, thus, are nonconforming on general quadrilateral grids. Indeed, a rigorous analysis is given in this paper that it is impossible to construct a practically useful finite element defined as Ciarlet&#39;s triple that can formulate a finite element space which consists of continuous piecewise polynomial functions on a grid that may include arbitrary quadrilaterals.

preprint2020arXiv

Lowest-degree robust finite element scheme for a fourth-order elliptic singular perturbation problem on rectangular grids

In this paper, a piecewise quadratic nonconforming finite element method on rectangular grids for a fourth-order elliptic singular perturbation problem is presented. This proposed method is robustly convergent with respect to the perturbation parameter. Numerical results are presented to verify the theoretical findings. The new method uses piecewise quadratic polynomials, and is of the lowest degree possible. Optimal order approximation property of the finite element space is proved by means of a locally-averaged interpolation operator newly constructed. This interpolator, however, is not a projection. Indeed, we establish a general theory and show that no locally defined interpolation associated with the locally supported basis functions can be projective for the finite element space in use. Particularly, the general theory gives an answer to a long-standing open problem presented in [Demko, J. Approx. Theory, $\bf{43}$(2):151--156, 1985].

preprint2020arXiv

M Subdwarf Research. II. Atmospheric Parameters and Kinematics

Applying the revised M subdwarf classification criteria discussed in Paper I to LAMOST DR7, combining the M subdwarf sample from Savcheva et al, a new M subdwarf sample was constructed for further study. The atmospheric parameters for each object were derived fitting with the PHOENIX grid, combining with Gaia DR2, the relationship between the gravity and metallicity were explored according to the locus both in the color-absolute magnitude diagram and the reduced proper motion diagram. Objects that have both the largest gravity and the lowest metallicity are located away from the main-sequence cloud and may be considered as the intrinsic M subdwarfs, which can be classified as luminosity class VI. Another group of objects whose spectra show typical M subdwarf characters have lower gravity and relatively moderate metal deficiency and occupy part of the ordinary M dwarf region in both diagrams. The Galactic U , V , W space velocity components and their dispersion show that the local Galactic halo population sampled in the solar neighborhood is represented by objects of high gravity and an inconspicuous bimodal metallicity distribution, with a fraction of prograde orbits. The other M subdwarfs seem to partly belong to the thick disk component with a significant fraction of thin disk moderately metal-poor objects intricately mixed with them. However, the selection effects, especially the favored anti-center direction of investigation in the LAMOST sub-sample, but also contamination by multiplicity and parameter coupling could play important roles and need to be further investigated.

preprint2020arXiv

Multi-Dimension Fusion Network for Light Field Spatial Super-Resolution using Dynamic Filters

Light field cameras have been proved to be powerful tools for 3D reconstruction and virtual reality applications. However, the limited resolution of light field images brings a lot of difficulties for further information display and extraction. In this paper, we introduce a novel learning-based framework to improve the spatial resolution of light fields. First, features from different dimensions are parallelly extracted and fused together in our multi-dimension fusion architecture. These features are then used to generate dynamic filters, which extract subpixel information from micro-lens images and also implicitly consider the disparity information. Finally, more high-frequency details learned in the residual branch are added to the upsampled images and the final super-resolved light fields are obtained. Experimental results show that the proposed method uses fewer parameters but achieves better performances than other state-of-the-art methods in various kinds of datasets. Our reconstructed images also show sharp details and distinct lines in both sub-aperture images and epipolar plane images.

preprint2020arXiv

Novel Entity Discovery from Web Tables

When working with any sort of knowledge base (KB) one has to make sure it is as complete and also as up-to-date as possible. Both tasks are non-trivial as they require recall-oriented efforts to determine which entities and relationships are missing from the KB. As such they require a significant amount of labor. Tables on the Web, on the other hand, are abundant and have the distinct potential to assist with these tasks. In particular, we can leverage the content in such tables to discover new entities, properties, and relationships. Because web tables typically only contain raw textual content we first need to determine which cells refer to which known entities---a task we dub table-to-KB matching. This first task aims to infer table semantics by linking table cells and heading columns to elements of a KB. Then second task builds upon these linked entities and properties to not only identify novel ones in the same table but also to bootstrap their type and additional relationships. We refer to this process as novel entity discovery and, to the best of our knowledge, it is the first endeavor on mining the unlinked cells in web tables. Our method identifies not only out-of-KB (``novel&#39;&#39;) information but also novel aliases for in-KB (``known&#39;&#39;) entities. When evaluated using three purpose-built test collections, we find that our proposed approaches obtain a marked improvement in terms of precision over our baselines whilst keeping recall stable.

preprint2020arXiv

On the Secrecy of UAV Systems With Linear Trajectory

By observing the fact that moving in a straight line is a common flying behavior of unmanned aerial vehicles (UAVs) in normal applications, e.g., power line inspections, and air patrols along with highway/streets/borders, in this paper we investigate the secrecy outage performance of a UAV system with linear trajectory, where a UAV ($S$) flies in a straight line and transmits its information over the downlink to a legitimate receiver ($D$) on the ground while an eavesdropping UAV ($E$) trying to overhear the information delivery between $S$ and $D$. Meanwhile, some information is delivered to $S$ over the uplink from $D$, such as commanding messages to control $S$&#39;s detecting operations, which can also be eavesdropped by $E$. The locations of $S$, $D$, and $E$ are randomly distributed. We first characterize the statistical characteristics (including cumulative distribution functions and probability density function) of the received signal-to-noise ratio over both downlink and uplink, and then the closed-form analytical expressions for the lower boundary of the secrecy outage probability of both downlink and uplink have also been derived accordingly. Finally, Monte-Carlo simulations are given to testify our proposed analytical models.

preprint2020arXiv

Optimal quadratic element on rectangular grids for $H^1$ problems

In this paper, a piecewise quadratic finite element method on rectangular grids for the $H^1$ problems is presented. The proposed method can be viewed as a reduced rectangular Morley element. For the source problem, the convergence rate of this scheme is $O(h^2)$ in the energy norm on uniform grids. Besides, a lower bound of the $L^2$-norm error is also proved, which makes the capacity analysis of this scheme more clear. On the other hand, for the eigenvalue problem, the numerical eigenvalues by this element are shown to be the lower bounds of the exact ones. Some numerical results are presented, which show the potential of the proposed finite element.

preprint2020arXiv

Scatting theory for reflective Fourier ptychographic diffraction tomography

A forward model is presented to an inverse scattering problem that arises in the application of reflective Fourier ptychographic microscopy. The model allows us to determine the 3D distributions of refractive index for weakly scattering semi-transparent objects using Fourier ptychographic tomography. The derivation results show that both transmission types reported previously and reflective type present in this article are corresponding to a specific application in Wolf&#39;s work (Emil Wolf, Opt. Commun. 1969 1(4) pp. 153-156). The transmission type measures the forward scattering field while the reflective type measures the backward scattering field. This model is also available for holographic method

preprint2020arXiv

SPCANet: Stellar Parameters and Chemical Abundances Network for LAMOST-II Medium Resolution Survey

The fundamental stellar atmospheric parameters T_eff and log g and 13 chemical abundances are derived for medium-resolution spectroscopy from LAMOST Medium-Resolution Survey (MRS) data sets with a deep-learning method. The neural networks we designed, named as SPCANet, precisely map LAMOST MRS spectra to stellar parameters and chemical abundances. The stellar labels derived by SPCANet are with precisions of 119 K for T_eff and 0.17 dex for log g. The abundance precision of 11 elements including [C/H], [N/H], [O/H], [Mg/H], [Al/H], [Si/H], [S/H], [Ca/H], [Ti/H], [Cr/H], [Fe/H], and [Ni/H] are 0.06~0.12 dex, while of [Cu/H] is 0.19 dex. These precisions can be reached even for spectra with signal-to-noise as low as 10. The results of SPCANet are consistent with those from other surveys such as APOGEE, GALAH and RAVE, and are also validated with the previous literature values including clusters and field stars. The catalog of the estimated parameters is available at \url{http://paperdata.china-vo.org/LAMOST/MRS_parameters_elements.csv}.

preprint2020arXiv

Summarizing and Exploring Tabular Data in Conversational Search

Tabular data provide answers to a significant portion of search queries. However, reciting an entire result table is impractical in conversational search systems. We propose to generate natural language summaries as answers to describe the complex information contained in a table. Through crowdsourcing experiments, we build a new conversation-oriented, open-domain table summarization dataset. It includes annotated table summaries, which not only answer questions but also help people explore other information in the table. We utilize this dataset to develop automatic table summarization systems as SOTA baselines. Based on the experimental results, we identify challenges and point out future research directions that this resource will support.

preprint2020arXiv

Web Table Extraction, Retrieval and Augmentation: A Survey

Tables are a powerful and popular tool for organizing and manipulating data. A vast number of tables can be found on the Web, which represents a valuable knowledge resource. The objective of this survey is to synthesize and present two decades of research on web tables. In particular, we organize existing literature into six main categories of information access tasks: table extraction, table interpretation, table search, question answering, knowledge base augmentation, and table augmentation. For each of these tasks, we identify and describe seminal approaches, present relevant resources, and point out interdependencies among the different tasks.

preprint2019arXiv

Quantum-enhanced least-square support vector machine: simplified quantum algorithm and sparse solutions

Quantum algorithms can enhance machine learning in different aspects. Here, we study quantum-enhanced least-square support vector machine (LS-SVM). Firstly, a novel quantum algorithm that uses continuous variable to assist matrix inversion is introduced to simplify the algorithm for quantum LS-SVM, while retaining exponential speed-up. Secondly, we propose a hybrid quantum-classical version for sparse solutions of LS-SVM. By encoding a large dataset into a quantum state, a much smaller transformed dataset can be extracted using quantum matrix toolbox, which is further processed in classical SVM. We also incorporate kernel methods into the above quantum algorithms, which uses both exponential growth Hilbert space of qubits and infinite dimensionality of continuous variable for quantum feature maps. The quantum LS-SVM exploits quantum properties to explore important themes for SVM such as sparsity and kernel methods, and stresses its quantum advantages ranging from speed-up to the potential capacity to solve classically difficult machine learning tasks.