Source author record

Jian Chen

Jian Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

77works

40topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Belief-Guided Inference Control for Large Language Model Services via Verifiable Observations

In black-box large language model (LLM) services, response reliability is often only partially observable at decision time, while stronger inference pathways incur substantial computational cost, inducing a budgeted sequential decision problem: for each request, the system should decide whether the default low-cost response is sufficiently reliable or whether additional computation should be allocated to improve response quality. In this paper, we propose \textbf{Ver}ifiable \textbf{O}bservations for Risk-aware \textbf{I}nference \textbf{C}ontrol (\textsc{Veroic}), a framework for adaptive inference control in black-box LLM settings, which formulates request-time control as a \textit{partially observable Markov decision process} to capture partial observability and sequential budget coupling. It constructs a lightweight verifiable observation channel from the input-output pair by aggregating heterogeneous quality signals into a belief state over latent response reliability, which is then used by a budget-aware policy to decide whether to return the default output or trigger a higher-cost inference pathway. Experiments on diverse tasks show that \textsc{Veroic} achieves improved quality-cost trade-offs, stronger risk estimation and calibration, and more robust long-horizon inference control than competitive baselines.

preprint2026arXiv

ClimateIQA: A New Dataset and Benchmark to Advance Vision-Language Models in Meteorology Anomalies Analysis

Meteorological heatmaps play a vital role in deciphering extreme weather phenomena, yet their inherent complexities marked by irregular contours, unstructured patterns, and complex color variations present unique analytical hurdles for state-of-the-art Vision-Language Models (VLMs). Current state-of-the-art models like GPT-4o, Qwen-VL, and LLaVA 1.6 struggle with tasks such as precise color identification and spatial localization, resulting in inaccurate or incomplete interpretations. To address these challenges, we introduce Sparse Position and Outline Tracking (SPOT), a novel algorithm specifically designed to process irregularly shaped colored regions in visual data. SPOT identifies and localizes these regions by extracting their spatial coordinates, enabling structured representations of irregular shapes. Building on SPOT, we construct ClimateIQA, a novel meteorological visual question answering (VQA) dataset, comprising 26,280 high-resolution heatmaps and 762,120 instruction samples for wind gust, total precipitation, wind chill index and heat index analysis. ClimateIQA enhances VLM training by incorporating spatial cues, geographic metadata, and reanalysis data, improving model accuracy in interpreting and describing extreme weather features. Furthermore, we develop Climate-Zoo, a suite of fine-tuned VLMs based on SPOT-empowered ClimateIQA, which significantly outperforms existing models in meteorological heatmap tasks.

preprint2026arXiv

Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation

Nowadays, regulatory compliance has become a cornerstone of corporate governance, ensuring adherence to systematic legal frameworks. At its core, financial regulations often comprise highly intricate provisions, layered logical structures, and numerous exceptions, which inevitably result in labor-intensive or comprehension challenges. To mitigate this, recent Regulatory Technology (RegTech) and Large Language Models (LLMs) have gained significant attention in automating the conversion of regulatory text into executable compliance logic. However, their performance remains suboptimal particularly when applied to Chinese-language financial regulations, due to three key limitations: (1) incomplete domain-specific knowledge representation, (2) insufficient hierarchical reasoning capabilities, and (3) failure to maintain temporal and logical coherence. One promising solution is to develop a domain specific and code-oriented datasets for model training. Existing datasets such as LexGLUE, LegalBench, and CODE-ACCORD are often English-focused, domain-mismatched, or lack fine-grained granularity for compliance code generation. To fill these gaps, we present Compliance-to-Code, the first large-scale Chinese dataset dedicated to financial regulatory compliance. Covering 1,159 annotated clauses from 361 regulations across ten categories, each clause is modularly structured with four logical elements-subject, condition, constraint, and contextual information-along with regulation relations. We provide deterministic Python code mappings, detailed code reasoning, and code explanations to facilitate automated auditing. To demonstrate utility, we present FinCheck: a pipeline for regulation structuring, code generation, and report generation.

preprint2026arXiv

DeKeyNLU: Enhancing Natural Language to SQL Generation through Task Decomposition and Keyword Extraction

Natural Language to SQL (NL2SQL) provides a new model-centric paradigm that simplifies database access for non-technical users by converting natural language queries into SQL commands. Recent advancements, particularly those integrating Retrieval-Augmented Generation (RAG) and Chain-of-Thought (CoT) reasoning, have made significant strides in enhancing NL2SQL performance. However, challenges such as inaccurate task decomposition and keyword extraction by LLMs remain major bottlenecks, often leading to errors in SQL generation. While existing datasets aim to mitigate these issues by fine-tuning models, they struggle with over-fragmentation of tasks and lack of domain-specific keyword annotations, limiting their effectiveness. To address these limitations, we present DeKeyNLU, a novel dataset which contains 1,500 meticulously annotated QA pairs aimed at refining task decomposition and enhancing keyword extraction precision for the RAG pipeline. Fine-tuned with DeKeyNLU, we propose DeKeySQL, a RAG-based NL2SQL pipeline that employs three distinct modules for user question understanding, entity retrieval, and generation to improve SQL generation accuracy. We benchmarked multiple model configurations within DeKeySQL RAG pipeline. Experimental results demonstrate that fine-tuning with DeKeyNLU significantly improves SQL generation accuracy on both BIRD (62.31% to 69.10%) and Spider (84.2% to 88.7%) dev datasets.

preprint2026arXiv

Echo-α: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation

Ultrasound interpretation requires both precise lesion localization and holistic clinical reasoning, yet existing methods typically excel at only one of these capabilities: specialized detectors offer strong localization but limited reasoning, whereas multimodal large language models (MLLMs) provide flexible reasoning but weak grounding in specialized medical domains. We present Echo-α, an agentic multimodal reasoning model for ultrasound interpretation that unifies these strengths within an invoke-and-reason framework. Echo-α is trained to coordinate organ-specific detector outputs, integrate them with global visual context, and convert the resulting evidence into grounded diagnostic decisions beyond detector-only inference. This behavior is established through a nine-task supervised curriculum and then refined by sequential reinforcement learning under different reward trade-offs, yielding Echo-α-Grounding for lesion anchoring and Echo-α-Diagnosis for final diagnosis. On multi-center renal and breast ultrasound benchmarks, Echo-α outperforms competitive baselines on both grounding and diagnosis. In particular, on cross-center test sets, Echo-α-Grounding attains 56.73%/43.78% F1@0.5 and Echo- α-Diagnosis reaches 74.90%/49.20% overall accuracy on renal/breast ultrasound. These results suggest that agentic multimodal reasoning can turn specialized detectors into verifiable clinical evidence, offering a practical route toward ultrasound AI systems that are more accurate, interpretable, and transferable. The repository is at https://github.com/MiliLab/Echo-Alpha.

preprint2026arXiv

FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training

Recent advancements in text-to-image (T2I) generation have led to the emergence of highly expressive models such as diffusion transformers (DiTs), exemplified by FLUX. However, their massive parameter sizes lead to slow inference, high memory usage, and poor deployability. Existing acceleration methods (e.g., single-step distillation and attention pruning) often suffer from significant performance degradation and incur substantial training costs. To address these limitations, we propose FastFLUX, an architecture-level pruning framework designed to enhance the inference efficiency of FLUX. At its core is the Block-wise Replacement with Linear Layers (BRLL) method, which replaces structurally complex residual branches in ResBlocks with lightweight linear layers while preserving the original shortcut connections for stability. Furthermore, we introduce Sandwich Training (ST), a localized fine-tuning strategy that leverages LoRA to supervise neighboring blocks, mitigating performance drops caused by structural replacement. Experiments show that our FastFLUX maintains high image quality under both qualitative and quantitative evaluations, while significantly improving inference speed, even with 20\% of the hierarchy pruned. Our code will be available soon.

preprint2026arXiv

First-order Methods for Unconstrained Vector Optimization Problems: A Unified Majorization-Minimization Perspective

In this paper, we develop a unified majorization-minimization scheme and convergence analysis with first-order surrogate functions for unconstrained vector optimization problems (VOPs). By selecting different surrogate functions, the unified method can be reduced to various existing first-order methods. The unified convergence analysis reveals that the slow convergence of the steepest descent method is primarily attributed to the significant gap between the surrogate and objective functions. Consequently, narrowing this surrogate gap can enhance the performance of first-order methods for VOPs. To strike a better trade-off in terms of surrogate gap and per-iteration cost, we reformulate the direction-finding subproblem and elucidate that selecting a tighter surrogate function is equivalent to using an appropriate base of the dual cone in the direction-finding subproblem. Building on this insight, we employ the Barzilai-Borwein method to narrow the surrogate gap and propose a Barzilai-Borwein descent method for VOPs (BBDVO) with polyhedral cones. By reformulating the corresponding subproblem, we provide a novel perspective on the Barzilai-Borwein descent method, bridging the gap between this method and the steepest descent method. Finally, several numerical experiments are presented to validate the efficiency of the BBDVO.

preprint2026arXiv

InsHuman: Towards Natural and Identity-Preserving Human Insertion

Human insertion aims to naturally place specific individuals into a target background. Although existing image editing models may have such ability, they often produce failure cases, including inappropriate human pose in new background, inconsistent number of people, and modified facial identity. Moreover, publicly available human datasets often lack full-body portraits and realistic physical interaction between humans and their background. To address these challenges, we propose InsHuman for natural and identity-preserving human insertion. Specifically, we propose Human-Background Adaptive Fusion (HBAF), which detects foreground humans to obtain a binary mask and applies region-aware weighting to align the human regions between predicted and ground-truth latents, ensuring the person's pose, count, and overall appearance are coherently adapted to the target background.We further propose Face-to-Face ID-Preserving (FFIP), which detects and matches faces between the generated image and the source image in terms of face recognition features to enforce identity consistency for each face.In addition, we propose Bidirectional Data Pairing (BDP) strategy to construct BDP-InsHuman, a high-quality dataset with realistic human-background interactions. Experiments demonstrate that InsHuman achieves significant improvements in generating plausible images while keeping human identity unchanged.

preprint2026arXiv

OpenEM: Large-scale multi-structural 3D datasets for electromagnetic methods

Electromagnetic methods have become one of the most widely used techniques in geological exploration. With the remarkable success of deep learning, applying such techniques to EM methods has emerged as a promising research direction to overcome the limitations of conventional approaches. The effectiveness of deep learning methods depends heavily on the quality of datasets, which directly influences model performance and generalization ability. Existing application studies often construct datasets from random one-dimensional or structurally simple three-dimensional models, which fail to represent the real geological environments. Furthermore, the absence of standardized, publicly 3D geoelectric datasets continues to hinder progress in deep learning based EM exploration. To address these limitations, we present OpenEM, a large-scale, multi-structural three-dimensional geoelectric dataset that encompasses a broad range of geologically plausible subsurface structures. OpenEM consists of nine categories of geoelectric models, spanning from simple configurations with anomalous bodies in half-space to more complex structures such as flat layers, folded layers, flat faults, curved faults, and their corresponding variants with anomalous bodies. Since three-dimensional forward modeling in electromagnetics is extremely time-consuming, we further developed a deep learning based fast forward modeling approach for OpenEM, enabling efficient and reliable forward modeling across the entire dataset. This capability allows OpenEM to be rapidly deployed for a wide range of tasks. OpenEM provides a unified, comprehensive, and large-scale dataset for common EM exploration systems to accelerate the application of deep learning in electromagnetic methods.The complete dataset is publicly available at https://doi.org/10.5281/zenodo.17141981.

preprint2023arXiv

STARS-ISAC: How Many Sensors Do We Need?

A simultaneously transmitting and reflecting surface (STARS) enabled integrated sensing and communications (ISAC) framework is proposed, where a novel bi-directional sensing-STARS architecture is devised to facilitate the full-space communication and sensing. Based on the proposed framework, a joint optimization problem is formulated, where the Cramer-Rao bound (CRB) for estimating the 2-dimension direction-of-arrival of the sensing target is minimized. Two cases are considered for sensing performance enhancement. 1) For the two-user case, an alternating optimization algorithm is proposed. In particular, the maximum number of deployable sensors is obtained in the closed-form expressions. 2) For the multi-user case, an extended CRB (ECRB) metric is proposed to characterize the impact of the number of sensors on the sensing performance. Based on the proposed metric, a novel penalty-based double-loop (PDL) algorithm is proposed to solve the ECRB minimization problem. To tackle the coupling of the ECRB, a general decoupling approach is proposed to convert it to a tractable weighted linear summation form. Simulation results reveal that 1) the proposed PDL algorithm can achieve a near-optimal performance with consideration of sensor deployment; 2) without violating the communication under the quality of service requirements, reducing the receive antennas at the BS does not deteriorate the sensing performance; and 3) it is preferable to deploy more passive elements than sensors in terms of achieving optimal sensing performance

preprint2023arXiv

Timed Model-Based Mutation Operators for Simulink Models

Model-based mutation analysis is a recent research area, and real-time system testing can benefit from using model mutants. Model-based mutation testing (MBMT) is a particular branch of model-based testing. It generates faulty versions of a model using mutation operators to evaluate and improve test cases. Mutation testing is an effective way to ensure software correctness and has been applied to various application areas. Simulink is a vital modeling language for real-time systems. This paper introduces Simulink model mutation analysis to improve Model-in-the-loop (MIL) testing. We propose a set of Simulink mutation operators based on AUTOSAR, which reflects the temporal correctness when a Simulink model is mapped to Operating System tasks. We implement a mutation framework that generates mutants for implicit clock Simulink models. Finally, we demonstrate how this framework generates mutants to reveal task interference issues in the simulation. Our work integrates the Simulink model with the timed systems to better support mutation testing automation.

preprint2022arXiv

A Barzilai-Borwein Descent Method for Multiobjective Optimization Problems

The steepest descent method proposed by Fliege et al. motivates the research on descent methods for multiobjective optimization, which has received increasing attention in recent years. However, empirical results show that the Armijo line search often gives a very small stepsize along the steepest direction, which decelerates the convergence seriously. This paper points out that the issue is mainly due to the imbalances among objective functions. To address this issue, we propose a Barzilai-Borwein descent method for multiobjective optimization (BBDMO) that dynamically tunes gradient magnitudes using Barzilai-Borwein's rule in direction-finding subproblem. With monotone and nonmonotone line search techniques, it is proved that accumulation points generated by BBDMO are Pareto critical points, respectively. Furthermore, theoretical results indicate the Armijo line search can achieve a better stepsize in BBDMO. Finally, comparative results of numerical experiments are reported to illustrate the efficiency of BBDMO and verify the theoretical results.

preprint2022arXiv

Caching and Computation Offloading in High Altitude Platform Station (HAPS) Assisted Intelligent Transportation Systems

Edge intelligence, a new paradigm to accelerate artificial intelligence (AI) applications by leveraging computing resources on the network edge, can be used to improve intelligent transportation systems (ITS). However, due to physical limitations and energy-supply constraints, the computing powers of edge equipment are usually limited. High altitude platform station (HAPS) computing can be considered as a promising extension of edge computing. HAPS is deployed in the stratosphere to provide wide coverage and strong computational capabilities. It is suitable to coordinate terrestrial resources and store the fundamental data associated with ITS-based applications. In this work, three computing layers,i.e., vehicles, terrestrial network edges, and HAPS, are integrated to build a computation framework for ITS, where the HAPS data library stores the fundamental data needed for the applications. In addition, the caching technique is introduced for network edges to store some of the fundamental data from the HAPS so that large propagation delays can be reduced. We aim to minimize the delay of the system by optimizing computation offloading and caching decisions as well as bandwidth and computing resource allocations. The simulation results highlight the benefits of HAPS computing for mitigating delays and the significance of caching at network edges.

preprint2022arXiv

Charge transport in monolayers of metal nanoparticles

Two-dimensional (2D) nanoparticle films are a new class of materials with interesting physical properties and applications ranging from nanoelectronics to sensing and photonics. The importance of conducting nanoparticle films makes the fundamental understanding of their charge transport extremely important for materials and process design. Various hopping and transport mechanisms have been proposed and the nanoparticle monolayer is consistent with the electrical equivalent RC circuit, but their theoretical methods are limited to the model of the single electron tunneling between capacitively coupled nanoparticles with a characteristic time constant RC and the conductivity of thin film is the experimental conductivity, which cannot be deduced from these theoretical models. It is also unclear that how the specific process of electron transpot is affected by temperature. So, nowadays the electron dynamics of thin film cannot be understood fundamentally. Here, we develop an analytical theory based on the model of Sommerfeld, backed up by Monte-Carlo simulations, that predicts the process of charge transport and the effect of temperature on the electron transport in the thin film. In this paper two different nanoparticle models were built to cope with different types of morphology: triangular array and rectangular array. The transport properties of these different kinds of arrays including 2D ordered nanoparticle arrays with/without local structural disorder and 2D gradient nanoparticle arrays were investigated at different temperatures. For 2D well-ordered nanoparticle array without local structural disorder, the I-V curves are non-linear and highly symmetric.

preprint2022arXiv

Convergence rates analysis of Interior Bregman Gradient Method for Vector Optimization Problems

In recent years, by using Bregman distance, the Lipschitz gradient continuity and strong convexity were lifted and replaced by relative smoothness and relative strong convexity. Under the mild assumptions, it was proved that gradient methods with Bregman regularity converge linearly for single-objective optimization problems (SOPs). In this paper, we extend the relative smoothness and relative strong convexity to vector-valued functions and analyze the convergence of an interior Bregman gradient method for vector optimization problems (VOPs). Specifically, the global convergence rates are $\mathcal{O}(\frac{1}{k})$ and $\mathcal{O}(r^{k})(0<r<1)$ for convex and relative strongly convex VOPs, respectively. Moreover, the proposed method converges linearly for VOPs that satisfy a vector Bregman-PL inequality.

preprint2022arXiv

Electron heating in a current-driven turbulence as a result of nonlinear interaction of electron- and ion-acoustic waves

We study electron heating in collisionless current-driven turbulence due to the nonlinear interactions between electron- and ion-acoustic waves. PIC simulation results show that due to a large difference between the electron and ion mean velocities the Buneman instability excites large-amplitude ion-acoustic waves, which strongly modifies the electron velocity distribution function, leading to a secondary instability that generates fast electron-acoustic waves; and in this process, a giant electron hole is ultimately created. This giant electron hole is responsible for strong electron heating due to phase mixing. The numerical simulation results are consistent with the previous observations and provide insight into the key processes responsible for electron heating and the generation of nonlinear waves in a collisionless current-driven instability.

preprint2022arXiv

Electron transport in the single-layer semiconductor

Two-dimensional (2D) materials are a new class of materials with interesting physical properties and applications ranging from nanoelectronics to sensing and photonics. In addition to graphene, the most studied 2D material, monolayers of other layered materials such as semiconducting dichalcogenides MoS2 or WSe2 are gaining in importance as promising channel materials for field-effect transistors (FETs) and phototransistors. However, it is unclear that how the specific process of electron transport is affected by temperature. So, nowadays the electron dynamics of single-layer semiconductor cannot be understood fundamentally. Here, we develop an analytical theory distinguishing from traditional energy band theory, backed up by Monte-Carlo simulations, that predicts the process of electron transport and the effect of temperature on the electron transport in the single-layer semiconductor. In this paper, A new model is built to deal with electron transporting in the sing-layer semiconductor. The resistance is decided by the barrier rather than the electron scattering in the single-layer semiconductor, which is macroscopic quantum effect. Electron transport in FETs with different dielectric configurations are investigated at different temperatures and a new control factor that is decided by top-gate voltage or bottom-gate voltage is introduced to describe the effect of gate voltage on the electron transport in 2D semiconductor. The results of simulation show the drain current is mainly determined by some elements, such as temperature, top-gate voltage, bottom-gate voltage and source-drain voltage.

preprint2022arXiv

Evaluating Alternative Glyph Design for Showing Large-Magnitude-Range Quantum Spins

We present experimental results to explore a form of bivariate glyphs for representing large-magnitude-range vectors. The glyphs meet two conditions: (1) two visual dimensions are separable; and (2) one of the two visual dimensions uses a categorical representation (e.g., a categorical colormap). We evaluate how much these two conditions determine the bivariate glyphs' effectiveness. The first experiment asks participants to perform three local tasks requiring reading no more than two glyphs. The second experiment scales up the search space in global tasks when participants must look at the entire scene of hundreds of vector glyphs to get an answer. Our results support that the first condition is necessary for local tasks when a few items are compared. But it is not enough to understand a large amount of data. The second condition is necessary for perceiving global structures of examining very complex datasets. Participants' comments reveal that the categorical features in the bivariate glyphs trigger emergent optimal viewers' behaviors. This work contributes to perceptually accurate glyph representations for revealing patterns from large scientific results. We release source code, quantum physics data, training documents, participants' answers, and statistical analyses for reproducible science https://osf.io/4xcf5/?view_only=94123139df9c4ac984a1e0df811cd580.

preprint2022arXiv

Fano Interference in a Single-Molecule Junction

Trends of miniaturized devices and quantum interference electronics lead to the long desire of Fano interference in single-molecule junctions, here, which is successfully demonstrated using the 2,7-di(4-pyridyl)-9,9'-spirobifluorene molecule with a long backbone group and a short side group. Experimentally, the two electrically coupled groups are found to contribute to two blurred degenerate points in the differential conductance mapping. This forms a characteristic non-centrosymmetric double-crossing feature, with distinct temperature response for each crossing. Theoretically, we describe the practical in-junction electron transmission using a new two-tunnelling-channel coupling model and obtain a working formula with a Fano term and a Breit-Wigner term. The formula is shown to provide a good fit for all the mapping data and their temperature dependence in three dimensions, identifying the Fano component. Our work thus forms a complete set of evidence of the Fano interference in a single-molecule junction induced by two-tunnelling-channel coupling transport. Density functional theory calculations are used to corroborate this new physics.

preprint2022arXiv

Giant viscoelasticity near Mott criticality in PbCrO3 with large lattice anomalies

Coupling of charge and lattice degrees of freedom in materials can produce intriguing electronic phenomena, such as conventional superconductivity where the electrons are mediated by lattice for creating supercurrent. The Mott transition, which is a source for many fascinating emergent behaviors, is originally thought to be driven solely by correlated electrons with an Ising criticality. Recent studies on the known Mott systems have shown that the lattice degree of freedom is also at play, giving rise to either Landau or unconventional criticality. However, the underlying coupling mechanism of charge and lattice degrees of freedom around the Mott critical endpoint remains elusive, leading to difficulties in understanding the associated Mott physics. Here we report a study of Mott transition in cubic PbCrO3 by measuring the lattice parameter, using high-pressure x-ray diffraction techniques. The Mott criticality in this material is revealed with large lattice anomalies, which is governed by giant viscoelasticity that presumably results from a combination of lattice elasticity and electron viscosity. Because of the viscoelastic effect, the lattice of this material behaves peculiarly near the critical endpoint, inconsistent with any existing university classes. We argue that the viscoelasticity may play as a hidden degree of freedom behind the Mott criticality.

preprint2022arXiv

Improving Fine-tuning of Self-supervised Models with Contrastive Initialization

Self-supervised learning (SSL) has achieved remarkable performance in pretraining the models that can be further used in downstream tasks via fine-tuning. However, these self-supervised models may not capture meaningful semantic information since the images belonging to the same class are always regarded as negative pairs in the contrastive loss. Consequently, the images of the same class are often located far away from each other in learned feature space, which would inevitably hamper the fine-tuning process. To address this issue, we seek to provide a better initialization for the self-supervised models by enhancing the semantic information. To this end, we propose a Contrastive Initialization (COIN) method that breaks the standard fine-tuning pipeline by introducing an extra initialization stage before fine-tuning. Extensive experiments show that, with the enriched semantics, our COIN significantly outperforms existing methods without introducing extra training cost and sets new state-of-the-arts on multiple downstream tasks.

preprint2022arXiv

New Massive Contact Twin Binary in a Radio-quiet HII Region Associated with the M17 Complex

Early-B stars may create an HII region that appears as radio-quiet. We report the identification of new early-B stars associated with the radio-quiet HII region G014.645--00.606 in the M17 complex. The ratio-quiet HII region G014.645--00.606 is adjacent to three radio-quiet WISE HII region candidates. The ionizing sources of the radio-quiet HII regions are expected to later than B1V, given the sensitivity about 1-2 mJy of the MAGPIS 20 cm survey. The stars were first selected if their parallaxes of GAIA EDR3 match that of the 22 GHz H$_2$O maser source within the same region. We used the color-magnitude diagram made from the ZTF photometric catalog to select the candidates for massive stars because the intrinsic $g-r$ colors of massive stars change little from B-type to O-type stars. Five stars lie in the areas of the color-magnitude diagram where either reddened massive stars or evolved post-main sequence stars of lower masses are commonly found. Three of the five stars, sources 1, 2, and 3, are located at the cavities of the three IR bubbles, and extended H$α$ emission is detected around the three IR bubbles. We suggest that sources 1, 2, and 3 are candidates for early-B stars associated with the radio-quiet region G014.645--00.606. Particularly, source 1 is an EW type eclipsing binary with a short period of 0.825 day, while source 2 is an EA type eclipsing binary with a short period of 0.919 day. The physical parameters of the two binary systems have been derived through the PHOEBE model. Source 1 is a twin binary of two stars with T~23,500 K, and source 2 contains a hotter component (T~20,100 K) and a cooler one (T~15,500 K). The $O-C$ values of source 1 show a trend of decline, implying that the period of the source is deceasing. Source 1 is likely a contacting early-B twin binary, for which mass transfer might cause its orbit to shrink.

preprint2022arXiv

On the Ohmic-dominant heating mode of capacitively-coupled plasma inverted by boundary electron emission

Electron emission from the boundary is ubiquitous in capacitively coupled plasma (CCP) and precipitates nonnegligible influences on the discharge properties. Here we present the PIC-MCC simulation of an Ohmic-dominant heating mode of capacitively coupled plasma where the stochastic heating vanishes and only Ohmic heating sustains the discharge, due to sheath inversion by boundary electron emission. The inverted CCP features negative sheath potential without Bohm presheath, hence excluding plasma heating due to sheath edge oscillation. The particle and energy transport of the proposed heating mode is analyzed. The influences of boundary electron emission flux, source voltage, and neutral pressure on the transition between classic and Ohmic-dominant CCP heating modes are shown with designated simulation scans. A modified inverse sheath-plasma coupling due to excessive ionization is discovered. In the end, key indicators of the proposed heating mode in plasma diagnostics are provided for future experimental verifications.

preprint2022arXiv

Security Enhancement for Coupled Phase-Shift STAR-RIS Networks

The secure transmission of the simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) aided communication system is investigated. Considering the coupled phase shifts of STAR-RISs and the fair secrecy requirement of users, a novel secure beamforming design is proposed for addressing the unique full-space mutual eavesdropping of STAR-RIS aided communication. In particular, a penalty based secrecy beamforming algorithm is developed to solve the resulting non-convex optimization problem, where the closed-form solutions of the coupled transmission/reflection coefficients are obtained in each iteration. Numerical results demonstrate that 1) the proposed scheme achieves higher secrecy capacity than conventional RIS; 2) 4-bit discrete phase shifters are sufficient for secrecy guarantee.

preprint2022arXiv

Variable Metric Method for Unconstrained Multiobjective Optimization Problems

In this paper, we propose a variable metric method for unconstrained multiobjective optimization problems (MOPs). First, a sequence of points is generated using different positive definite matrices in the generic framework. It is proved that accumulation points of the sequence are Pareto critical points. Then, without convexity assumption, strong convergence is established for the proposed method. Moreover, we use a common matrix to approximate the Hessian matrices of all objective functions, along which, a new nonmonotone line search technique is proposed to achieve a local superlinear convergence rate. Finally, several numerical results demonstrate the effectiveness of the proposed method.

preprint2021arXiv

$L^2$ extension of $\bar\partial$-closed forms on weakly pseudoconvex Kähler manifolds

Combining V. Koziarz's observation about the regularity of some modified section related to the initial extension with J. McNeal--D. Varolin's regularity argument, we generalize two theorems of McNeal--Varolin for the $L^2$ extension of $\bar\partial$-closed high-degree forms on a Stein manifold to the weakly pseudoconvex Kähler case under mixed positivity conditions.

preprint2021arXiv

Digital Interference Mitigation in Space Division Multiplexing Self-Homodyne Coherent Detection

We propose a digital interference mitigation scheme to reduce the impact of mode coupling in space division multiplexing self-homodyne coherent detection and experimentally verify its effectiveness in 240-Gbps mode-multiplexed transmission over 3-mode multimode fiber.

preprint2021arXiv

Document Domain Randomization for Deep Learning Document Layout Extraction

We present document domain randomization (DDR), the first successful transfer of convolutional neural networks (CNNs) trained only on graphically rendered pseudo-paper pages to real-world document segmentation. DDR renders pseudo-document pages by modeling randomized textual and non-textual contents of interest, with user-defined layout and font styles to support joint learning of fine-grained classes. We demonstrate competitive results using our DDR approach to extract nine document classes from the benchmark CS-150 and papers published in two domains, namely annual meetings of Association for Computational Linguistics (ACL) and IEEE Visualization (VIS). We compare DDR to conditions of style mismatch, fewer or more noisy samples that are more easily obtained in the real world. We show that high-fidelity semantic information is not necessary to label semantic classes but style mismatch between train and test can lower model accuracy. Using smaller training samples had a slightly detrimental effect. Finally, network models still achieved high test accuracy when correct labels are diluted towards confusing labels; this behavior hold across several classes.

preprint2021arXiv

Experimental demonstration of cylindrical vector spatiotemporal optical vortex

We experimentally generate cylindrically polarized wavepackets with transverse orbital angular momentum, demonstrating the coexistence of spatiotemporal optical vortex with spatial polarization singularity. The results in this paper extend the study of spatiotemporal wavepackets to a broader scope, paving the way for its applications in various areas such as light-matter interaction, optical tweezers, spatiotemporal spin-orbit angular momentum coupling, etc.

preprint2021arXiv

Pareto-Frontier-aware Neural Architecture Generation for Diverse Budgets

Designing feasible and effective architectures under diverse computation budgets incurred by different applications/devices is essential for deploying deep models in practice. Existing methods often perform an independent architecture search for each target budget, which is very inefficient yet unnecessary. Moreover, the repeated independent search manner would inevitably ignore the common knowledge among different search processes and hamper the search performance. To address these issues, we seek to train a general architecture generator that automatically produces effective architectures for an arbitrary budget merely via model inference. To this end, we propose a Pareto-Frontier-aware Neural Architecture Generator (NAG) which takes an arbitrary budget as input and produces the Pareto optimal architecture for the target budget. We train NAG by learning the Pareto frontier (i.e., the set of Pareto optimal architectures) over model performance and computational cost (e.g., latency). Extensive experiments on three platforms (i.e., mobile, CPU, and GPU) show the superiority of the proposed method over existing NAS methods.

preprint2021arXiv

Photonic toroidal vortex

Toroidal vortices are whirling disturbances rotating about a ring-shaped core while advancing in the direction normal to the ring orifice. Toroidal vortices are commonly found in nature and being studied in a wide range of disciplines. Here we report the experimental observation of photonic toroidal vortex as a new solution to Maxwell's equations with the use of conformal mapping. The helical phase twists around a closed loop leading to an azimuthal local orbital angular momentum density. The preparation of such intriguing light field may offer insights of extending toroidal vortex to other disciplines and find important applications in light-matter interaction, optical manipulation, photonic symmetry and topology, and quantum information.

preprint2021arXiv

Towards Accurate and Compact Architectures via Neural Architecture Transformer

Designing effective architectures is one of the key factors behind the success of deep neural networks. Existing deep architectures are either manually designed or automatically searched by some Neural Architecture Search (NAS) methods. However, even a well-designed/searched architecture may still contain many nonsignificant or redundant modules/operations. Thus, it is necessary to optimize the operations inside an architecture to improve the performance without introducing extra computational cost. To this end, we have proposed a Neural Architecture Transformer (NAT) method which casts the optimization problem into a Markov Decision Process (MDP) and seeks to replace the redundant operations with more efficient operations, such as skip or null connection. Note that NAT only considers a small number of possible transitions and thus comes with a limited search/transition space. As a result, such a small search space may hamper the performance of architecture optimization. To address this issue, we propose a Neural Architecture Transformer++ (NAT++) method which further enlarges the set of candidate transitions to improve the performance of architecture optimization. Specifically, we present a two-level transition rule to obtain valid transitions, i.e., allowing operations to have more efficient types (e.g., convolution->separable convolution) or smaller kernel sizes (e.g., 5x5->3x3). Note that different operations may have different valid transitions. We further propose a Binary-Masked Softmax (BMSoftmax) layer to omit the possible invalid transitions. Extensive experiments on several benchmark datasets show that the transformed architecture significantly outperforms both its original counterpart and the architectures optimized by existing methods.

Jian Chen

What is connected

Connect this record

See the researcher in context

Building this map preview

77 published item(s)

Belief-Guided Inference Control for Large Language Model Services via Verifiable Observations

ClimateIQA: A New Dataset and Benchmark to Advance Vision-Language Models in Meteorology Anomalies Analysis

Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation

DeKeyNLU: Enhancing Natural Language to SQL Generation through Task Decomposition and Keyword Extraction

Echo-α: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation

FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training

First-order Methods for Unconstrained Vector Optimization Problems: A Unified Majorization-Minimization Perspective

InsHuman: Towards Natural and Identity-Preserving Human Insertion

OpenEM: Large-scale multi-structural 3D datasets for electromagnetic methods

STARS-ISAC: How Many Sensors Do We Need?

Timed Model-Based Mutation Operators for Simulink Models

A Barzilai-Borwein Descent Method for Multiobjective Optimization Problems

Caching and Computation Offloading in High Altitude Platform Station (HAPS) Assisted Intelligent Transportation Systems

Charge transport in monolayers of metal nanoparticles

Convergence rates analysis of Interior Bregman Gradient Method for Vector Optimization Problems

Electron heating in a current-driven turbulence as a result of nonlinear interaction of electron- and ion-acoustic waves

Electron transport in the single-layer semiconductor

Evaluating Alternative Glyph Design for Showing Large-Magnitude-Range Quantum Spins

Fano Interference in a Single-Molecule Junction

Giant viscoelasticity near Mott criticality in PbCrO3 with large lattice anomalies

Improving Fine-tuning of Self-supervised Models with Contrastive Initialization

New Massive Contact Twin Binary in a Radio-quiet HII Region Associated with the M17 Complex

On the Ohmic-dominant heating mode of capacitively-coupled plasma inverted by boundary electron emission

Security Enhancement for Coupled Phase-Shift STAR-RIS Networks

Variable Metric Method for Unconstrained Multiobjective Optimization Problems

$L^2$ extension of $\bar\partial$-closed forms on weakly pseudoconvex Kähler manifolds

Digital Interference Mitigation in Space Division Multiplexing Self-Homodyne Coherent Detection

Document Domain Randomization for Deep Learning Document Layout Extraction

Experimental demonstration of cylindrical vector spatiotemporal optical vortex

Pareto-Frontier-aware Neural Architecture Generation for Diverse Budgets

Photonic toroidal vortex

Towards Accurate and Compact Architectures via Neural Architecture Transformer

A Gd@C82-based single molecular electret device with switchable electrical polarization

An Application-Driven Non-Orthogonal Multiple Access Enabled Computation Offloading Scheme

BigGAN-based Bayesian reconstruction of natural images from human brain activity

Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search

Closed-loop Matters: Dual Regression Networks for Single Image Super-Resolution

Collisionless Adiabatic Afterglow

Core-level x-ray photoemission and Raman spectroscopy studies on electronic structures in Mott-Hubbard type nickelate oxide NdNiO$_2$

Deep Learning for Image Super-resolution: A Survey

Exemplar-based Layout Fine-tuning for Node-link Diagrams

Hierarchical Neural Architecture Search for Single Image Super-Resolution

Investigating Task-driven Latent Feasibility for Nonconvex Image Modeling

NAT: Neural Architecture Transformer for Accurate and Compact Architectures

Neural encoding and interpretation for high-level visual cortices based on fMRI using image caption features

One-Shot Parameter Identification of the Thevenin's Model for Batteries: Methods and Validation

Privacy-Preserving Dynamic Average Consensus via State Decomposition: Case Study on Multi-Robot Formation Control

Robust and Secure Communications in Intelligent Reflecting Surface Assisted NOMA networks

Supporting Real-Time COVID-19 Medical Management Decisions: The Transition Matrix Model Approach

The Shallow End: Empowering Shallower Deep-Convolutional Networks through Auxiliary Outputs

Photonic Cyclone: spatiotemporal optical vortex with controllable transverse orbital angular momentum

Beam Test Results of High Q CBPM prototype for SXFEL

Bio-Inspired Resource Allocation for Relay-Aided Device-to-Device Communications

Giant Linear Magneto-resistance in Nonmagnetic PtBi2

High-Bandwidth and Large Coupling Tolerance Graded-Index Multimode Polymer Waveguides for On-board High-Speed Optical Interconnects

Landau-Zener-Stuckelberg-Majorana interference in a 3D transmon driven by a chirped microwave

Detection of small single-cycle signals by stochastic resonance using a bistable superconducting quantum interference device

LDAExplore: Visualizing Topic Models Generated Using Latent Dirichlet Allocation

Nonlinear X-ray Compton Scattering

Observation of coherent oscillation in single-passage Landau-Zener transitions

Optimal Power Allocation for A Massive MIMO Relay Aided Secure Communication

Optimal Power Allocation for Secure Communications in Large-Scale MIMO Relaying Systems

Point defects in epitaxial silicene on Ag(111) surface

DGFIndex for Smart Grid: Enhancing Hive with a Cost-Effective Multidimensional Range Index

Persistent Dirac Fermion State on Bulk-like Si(111) Surface

Secure Wireless Information and Power Transfer in Large-Scale MIMO Relaying Systems with Imperfect CSI

Composing DTI Visualizations with End-user Programming

Noncollinear parametric fluorescence by chirped quasi-phase matching for monocycle temporal entanglement

Landau-Zener-Stückelberg Interference of Microwave Dressed States of a Superconducting Phase Qubit

Optical codeword demodulation with error rates below standard quantum limit using a conditional nulling receiver

Fe-based high temperature superconductivity with Tc=31K bordering an insulating antiferromagnet in (Tl,K)FexSe2 Crystals

Landau-Zener-Stuckelberg interferometry in multilevel superconducting flux qubit

Population Inversion Induced by Landau-Zener Transition in a Strongly Driven rf-SQUID

Revised Phase Diagram for FeTe1-xSex system with less excess Fe atoms

Landau-Zener-Stuckelberg-Majorana interference in a 3D transmon driven by a chirped microwave