Source author record

Hong Chen

Hong Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

67works

34topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Fast Semidefinite Convex Relaxation for Optimal Control Problems With Spatio-Temporal Constraints

Solving optimal control problems (OCPs) of autonomous agents operating under spatial and temporal constraints fast and accurately is essential in applications ranging from eco-driving of autonomous vehicles to quadrotor navigation. However, the nonlinear programs approximating the OCPs are inherently nonconvex due to the coupling between the dynamics and the event timing, and therefore, they are challenging to solve. Most approaches address this challenge by predefining waypoint times or just using nonconvex trajectory optimization, which simplifies the problem but often yields suboptimal solutions. To significantly improve the numerical properties, we propose a formulation with a time-scaling direct multiple shooting scheme that partitions the prediction horizon into segments aligned with characteristic time constraints. Moreover, we develop a fast semidefinite-programming-based convex relaxation that exploits the sparsity pattern of the lifted formulation. Comprehensive simulation studies demonstrate the solution optimality and computational efficiency. Furthermore, real-world experiments on a quadrotor waypoint flight task with constrained open time windows validate the practical applicability of the approach in complex environments.

preprint2026arXiv

Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models

Mixture-of-Experts (MoE) architectures decouple model capacity from per-token computation, enabling scaling beyond the computational limits imposed by dense scaling laws. Yet how MoE architectures shape knowledge acquisition during pre-training, and how this process differs from dense architectures, remains unknown. To address this issue, we introduce Gated-LPI (Log-Probability Increase), a neuron-level attribution metric that decomposes log-probability increase across neurons. We present a time-resolved comparison of knowledge acquisition dynamics in MoE and dense architectures, tracking checkpoints over 1.2M training steps (~ 5.0T tokens) and 600K training steps (~ 2.5T tokens), respectively. Our experiments uncover three patterns: (1) Low-entropy backbone. The top approximately 1% of MoE neurons capture over 45% of positive updates, forming a high-utility core, which is absent in the dense baseline. (2) Early consolidation. The MoE model locks into a stable importance profile within < 100K steps, whereas the dense model remains volatile throughout training. (3) Functional robustness. Masking the ten most important MoE attention heads reduces relational HIT@10 by < 10%, compared with > 50% for the dense model, showing that sparsity fosters distributed -- rather than brittle -- knowledge storage. These patterns collectively demonstrate that sparsity fosters an intrinsically stable and distributed computational backbone from early in training, helping bridge the gap between sparse architectures and training-time interpretability.

preprint2026arXiv

DP-SelFT: Differentially Private Selective Fine-Tuning for Large Language Models

Large language models (LLMs) are commonly adapted to downstream tasks through fine-tuning, but fine-tuning data often contains sensitive information that may be leaked by the resulting model. Differential privacy (DP) offers formal protection against such leakage, yet DP fine-tuning of LLMs still suffers from substantial utility degradation due to gradient clipping and noise injection. Existing work improves this trade-off by combining DP with parameter-efficient fine-tuning methods such as LoRA, which constrain the form of updates. In this work, we study a complementary direction: selective fine-tuning, which constrains where updates are applied. We propose DP-SelFT, a framework for differentially private selective fine-tuning of LLMs. DP-SelFT addresses three DP-specific challenges in parameter selection: avoiding repeated privacy cost, improving stability under noisy estimates, and selecting parameters that remain useful under clipped and noisy updates. It first constructs a lightweight DP synthetic dataset and performs selection only on this synthetic data, so the selection stage incurs no additional privacy cost. It then conducts layer-level selection by temporarily training candidate layer subsets on a synthetic training split and evaluating them on a synthetic validation split. Crucially, this temporary training is performed under a perturbation regime matched to downstream DP fine-tuning, with worst-case perturbations of the same scale as DP noise. This favors layer subsets that are not only learnable but also robust to noisy private updates. Experiments on benchmark tasks show that DP-SelFT consistently improves the privacy--utility trade-off over existing DP fine-tuning baselines under the same privacy guarantees.

preprint2026arXiv

Federated Nested Learning: Collaborative Training of Self-Referential Memories for Test-Time Adaptation

We rethink Federated Learning (FL) from a nested learning perspective, framing the core challenge as how to collaboratively learn optimization rules, not just static models, to tackle Non-IID client data. To address this, we propose Federated Nested Learning (FedNL), a novel framework that reformulates FL as a three-level nested optimization system. FedNL embeds Titans-based linear attention into FL, enabling clients to perform lightweight, zero-shot test-time adaptation by treating a delta rule as an online gradient step. Experiments on Non-IID MMLU and long-context benchmarks show that FedNL achieves competitive performance in short-context reasoning, enhances the performance of long-context retrieval and streaming Cross-Entropy, and maintains constant inference memory.

preprint2026arXiv

LASSA Architecture-Based Autonomous Fault-Tolerant Control of Unmanned Underwater Vehicles

Unmanned underwater vehicles (UUVs) operate persistently in communication-constrained environments, thus requiring high-level autonomous fault-tolerant control under faulty operating conditions. Existing approaches rely heavily on predefined hard-coded rules and struggle to achieve effective fault-tolerant control against unforeseen faults. Although large language models (LLMs) possess powerful cognitive and reasoning capabilities, their inherent hallucinations remain a major obstacle to their application in UUV control systems. This paper proposes an intelligent control method based on the LASSA (LLM-based Agent with Solver, Sensor and Actuator) architecture. Within this architecture, an LLM identifies unknown faults and accomplishes task replanning via autonomous reasoning without hard-coded rules; the intelligent agent undertakes perception, scheduling and decision evaluation; the solver verifies physical boundary feasibility constraints prior to command transmission to the actuators. This architecture suppresses physically infeasible LLM hallucinations and ensures interpretable, verifiable decision-making. Moreover, it enables fast-slow dual closed-loop collaborative control, where the slow loop undertakes high-level dynamic decision-making and the fast loop guarantees high-frequency real-time control, simultaneously balancing decision intelligence and control timeliness. Lake experiments under normal and lower-rudder-fault conditions show that the framework detects trajectory tracking abnormalities, replans the route by adjusting the turning radius from 4m to 12m and reducing speed from 2kn to 1kn, passes all three solver constraints on the first invocation, and guides the UUV to complete the full mission; under normal conditions no false fault alarms are raised throughout the run.

preprint2026arXiv

TableCache: Primary Foreign Key Guided KV Cache Precomputation for Low Latency Text-to-SQL

In Text-to-SQL tasks, existing LLM-based methods often include extensive database schemas in prompts, leading to long context lengths and increased prefilling latency. While user queries typically focus on recurrent table sets-offering an opportunity for KV cache sharing across queries-current inference engines, such as SGLang and vLLM, generate redundant prefix cache copies when processing user queries with varying table orders. To address this inefficiency, we propose precomputing table representations as KV caches offline and querying the required ones online. A key aspect of our approach is the computation of table caches while preserving primary foreign key relationships between tables. Additionally, we construct a Table Trie structure to facilitate efficient KV cache lookups during inference. To enhance cache performance, we introduce a cache management system with a query reranking strategy to improve cache hit rates and a computation loading pipeline for parallelizing model inference and cache loading. Experimental results show that our proposed TableCache achieves up to a 3.62x speedup in Time to First Token (TTFT) with negligible performance degradation.

preprint2023arXiv

Global Weighted Tensor Nuclear Norm for Tensor Robust Principal Component Analysis

Tensor Robust Principal Component Analysis (TRPCA), which aims to recover a low-rank tensor corrupted by sparse noise, has attracted much attention in many real applications. This paper develops a new Global Weighted TRPCA method (GWTRPCA), which is the first approach simultaneously considers the significance of intra-frontal slice and inter-frontal slice singular values in the Fourier domain. Exploiting this global information, GWTRPCA penalizes the larger singular values less and assigns smaller weights to them. Hence, our method can recover the low-tubal-rank components more exactly. Moreover, we propose an effective adaptive weight learning strategy by a Modified Cauchy Estimator (MCE) since the weight setting plays a crucial role in the success of GWTRPCA. To implement the GWTRPCA method, we devise an optimization algorithm using an Alternating Direction Method of Multipliers (ADMM) method. Experiments on real-world datasets validate the effectiveness of our proposed method.

preprint2022arXiv

A Multi-Strategy based Pre-Training Method for Cold-Start Recommendation

Cold-start problem is a fundamental challenge for recommendation tasks. The recent self-supervised learning (SSL) on Graph Neural Networks (GNNs) model, PT-GNN, pre-trains the GNN model to reconstruct the cold-start embeddings and has shown great potential for cold-start recommendation. However, due to the over-smoothing problem, PT-GNN can only capture up to 3-order relation, which can not provide much useful auxiliary information to depict the target cold-start user or item. Besides, the embedding reconstruction task only considers the intra-correlations within the subgraph of users and items, while ignoring the inter-correlations across different subgraphs. To solve the above challenges, we propose a multi-strategy based pre-training method for cold-start recommendation (MPT), which extends PT-GNN from the perspective of model architecture and pretext tasks to improve the cold-start recommendation performance. Specifically, in terms of the model architecture, in addition to the short-range dependencies of users and items captured by the GNN encoder, we introduce a Transformer encoder to capture long-range dependencies. In terms of the pretext task, in addition to considering the intra-correlations of users and items by the embedding reconstruction task, we add embedding contrastive learning task to capture inter-correlations of users and items. We train the GNN and Transformer encoders on these pretext tasks under the meta-learning setting to simulate the real cold-start scenario, making the model easily and rapidly being adapted to new cold-start users and items. Experiments on three public recommendation datasets show the superiority of the proposed MPT model against the vanilla GNN models, the pre-training GNN model on user/item embedding inference and the recommendation task.

preprint2022arXiv

An Accelerator for Rule Induction in Fuzzy Rough Theory

Rule-based classifier, that extract a subset of induced rules to efficiently learn/mine while preserving the discernibility information, plays a crucial role in human-explainable artificial intelligence. However, in this era of big data, rule induction on the whole datasets is computationally intensive. So far, to the best of our knowledge, no known method focusing on accelerating rule induction has been reported. This is first study to consider the acceleration technique to reduce the scale of computation in rule induction. We propose an accelerator for rule induction based on fuzzy rough theory; the accelerator can avoid redundant computation and accelerate the building of a rule classifier. First, a rule induction method based on consistence degree, called Consistence-based Value Reduction (CVR), is proposed and used as basis to accelerate. Second, we introduce a compacted search space termed Key Set, which only contains the key instances required to update the induced rule, to conduct value reduction. The monotonicity of Key Set ensures the feasibility of our accelerator. Third, a rule-induction accelerator is designed based on Key Set, and it is theoretically guaranteed to display the same results as the unaccelerated version. Specifically, the rank preservation property of Key Set ensures consistency between the rule induction achieved by the accelerator and the unaccelerated method. Finally, extensive experiments demonstrate that the proposed accelerator can perform remarkably faster than the unaccelerated rule-based classifier methods, especially on datasets with numerous instances.

preprint2022arXiv

CODE: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking

Expert finding, a popular service provided by many online websites such as Expertise Finder, LinkedIn, and AMiner, is beneficial to seeking candidate qualifications, consultants, and collaborators. However, its quality is suffered from lack of ample sources of expert information. This paper employs AMiner as the basis with an aim at linking any external experts to the counterparts on AMiner. As it is infeasible to acquire sufficient linkages from arbitrary external sources, we explore the problem of zero-shot expert linking. In this paper, we propose CODE, which first pre-trains an expert linking model by contrastive learning on AMiner such that it can capture the representation and matching patterns of experts without supervised signals, then it is fine-tuned between AMiner and external sources to enhance the models transferability in an adversarial manner. For evaluation, we first design two intrinsic tasks, author identification and paper clustering, to validate the representation and matching capability endowed by contrastive learning. Then the final external expert linking performance on two genres of external sources also implies the superiority of the adversarial fine-tuning method. Additionally, we show the online deployment of CODE, and continuously improve its online performance via active learning.

preprint2022arXiv

Elastic Valley Spin Controlled Chiral Coupling in Topological Valley Phononic Crystals

Distinct from the phononic valley pseudo-spin, the real physical spin of elastic waves adds a novel tool-kit capable of envisaging the valley-spin physics of topological valley phononic crystals from a local viewpoint. Here, we report the observation of local elastic valley spin as well as the hidden elastic spin-valley locking mechanism overlooked before. We demonstrate that the selective one-way routing of valley phonon states along the topological interface can be reversed by imposing the elastic spin meta-source at different interface locations with opposite valley-spin correspondence. We unveil the physical mechanism of selective directionality as the elastic spin controlled chiral coupling of valley phonon states, through both analytical theory and experimental measurement of the opposite local elastic spin density at different interface locations for different transport directions. The elastic spin of valley topological edge phonons can be extended to other topological states and offers new tool to explore topological metamaterials.

preprint2022arXiv

Equilibrium Defaultable Corporate Debt and Investment

In dynamic capital structure models with an investor break-even condition, the firm's Bellman equation may not generate a contraction mapping, so the standard existence and uniqueness conditions do not apply. First, we provide an example showing the problem in a classical trade-off model. The firm can issue one-period defaultable debt, invest in capital and pay a dividend. If the firm cannot meet the required debt payment, it is liquidated. Second, we show how to use a dual to the original problem and a change of measure, such that existence and uniqueness can be proved. In the unique Markov-perfect equilibrium, firm decisions reflect state-dependent capital and debt targets. Our approach may be useful for other dynamic firm models that have an investor break-even condition.

preprint2022arXiv

Error-based Knockoffs Inference for Controlled Feature Selection

Recently, the scheme of model-X knockoffs was proposed as a promising solution to address controlled feature selection under high-dimensional finite-sample settings. However, the procedure of model-X knockoffs depends heavily on the coefficient-based feature importance and only concerns the control of false discovery rate (FDR). To further improve its adaptivity and flexibility, in this paper, we propose an error-based knockoff inference method by integrating the knockoff features, the error-based feature importance statistics, and the stepdown procedure together. The proposed inference procedure does not require specifying a regression model and can handle feature selection with theoretical guarantees on controlling false discovery proportion (FDP), FDR, or k-familywise error rate (k-FWER). Empirical evaluations demonstrate the competitive performance of our approach on both simulated and real data.

preprint2022arXiv

FLDP: Flexible strategy for local differential privacy

Local differential privacy (LDP), a technique applying unbiased statistical estimations instead of real data, is often adopted in data collection. In particular, this technique is used with frequency oracles (FO) because it can protect each user's privacy and prevent leakage of sensitive information. However, the definition of LDP is so conservative that it requires all inputs to be indistinguishable after perturbation. Indeed, LDP protects each value; however, it is rarely used in practical scenarios owing to its cost in terms of accuracy. In this paper, we address the challenge of providing weakened but flexible protection where each value only needs to be indistinguishable from part of the domain after perturbation. First, we present this weakened but flexible LDP (FLDP) notion. We then prove the association with LDP and DP. Second, we design an FHR approach for the common FO issue while satisfying FLDP. The proposed approach balances communication cost, computational complexity, and estimation accuracy. Finally, experimental results using practical and synthetic datasets verify the effectiveness and efficiency of our approach.

preprint2022arXiv

Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts

Stories or narratives are comprised of a sequence of events. To compose interesting stories, professional writers often leverage a creative writing technique called flashback that inserts past events into current storylines as we commonly observe in novels and plays. However, it is challenging for machines to generate flashback as it requires a solid understanding of event temporal order (e.g. "feeling hungry" before "eat," not vice versa), and the creativity to arrange storylines so that earlier events do not always appear first in narrative order. Two major issues in existing systems that exacerbate the challenges: 1) temporal bias in pertaining and story datasets that leads to monotonic event temporal orders; 2) lack of explicit guidance that helps machines decide where to insert flashbacks. We propose to address these issues using structured storylines to encode events and their pair-wise temporal relations (before, after and vague) as temporal prompts that guide how stories should unfold temporally. We leverage a Plan-and-Write framework enhanced by reinforcement learning to generate storylines and stories end-to-end. Evaluation results show that the proposed method can generate more interesting stories with flashbacks while maintaining textual diversity, fluency, and temporal coherence.

preprint2022arXiv

How zealots affect the energy cost for controlling complex social networks

The controllability of complex networks may be applicable for understanding how to control a complex social network, where members share their opinions and influence one another. Previous works in this area have focused on controllability, energy cost, or optimization under the assumption that all nodes are compliant, passing on information neutrally without any preferences. However, the assumption on nodal neutrality should be reassessed, given that in networked social systems, some people may hold fast to their personal beliefs. By introducing some stubborn agents, or zealots, who hold steadfast to their beliefs and seek to influence others, the control energy is computed and compared against those without zealots. It was found that the presence of zealots alters the energy cost at a quadratic rate with respect to their own fixed beliefs. However, whether or not the zealots' presence increases or decreases the energy cost is affected by the interplay between different parameters such as the zealots' beliefs, number of drivers, final control time regimes, network effects, network dynamics, and number and configurations of neutral nodes influenced by the zealots. For example, when a network dynamics is linear but does not have conformity behavior, it could be possible for a contrarian zealot to assist in reducing control energy. With conformity behavior, a contrarian zealot always negatively affects network control by increasing energy cost. The results of this paper suggest caution when modeling real networked social systems with the controllability of networked linear dynamics since the system dynamical behavior is sensitive to parameter change.

preprint2022arXiv

Injecting Numerical Reasoning Skills into Knowledge Base Question Answering Models

Embedding-based methods are popular for Knowledge Base Question Answering (KBQA), but few current models have numerical reasoning skills and thus struggle to answer ordinal constrained questions. This paper proposes a new embedding-based KBQA framework which particularly takes numerical reasoning into account. We present NumericalTransformer on top of NSM, a state-of-the-art embedding-based KBQA model, to create NT-NSM. To enable better training, we propose two pre-training tasks with explicit numerical-oriented loss functions on two generated training datasets and a template-based data augmentation method for enriching ordinal constrained QA dataset. Extensive experiments on KBQA benchmarks demonstrate that with the help of our training algorithm, NT-NSM is empowered with numerical reasoning skills and substantially outperforms the baselines in answering ordinal constrained questions.

preprint2022arXiv

Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

Although deep neural networks are capable of achieving performance superior to humans on various tasks, they are notorious for requiring large amounts of data and computing resources, restricting their success to domains where such resources are available. Metalearning methods can address this problem by transferring knowledge from related tasks, thus reducing the amount of data and computing resources needed to learn new tasks. We organize the MetaDL competition series, which provide opportunities for research groups all over the world to create and experimentally assess new meta-(deep)learning solutions for real problems. In this paper, authored collaboratively between the competition organizers and the top-ranked participants, we describe the design of the competition, the datasets, the best experimental results, as well as the top-ranked methods in the NeurIPS 2021 challenge, which attracted 15 active teams who made it to the final phase (by outperforming the baseline), making over 100 code submissions during the feedback phase. The solutions of the top participants have been open-sourced. The lessons learned include that learning good representations is essential for effective transfer learning.

preprint2022arXiv

Local well-posedness to the 2D Cauchy problem of full compressible magnetohydrodynamic equations with vacuum at infinity

This paper concerns the Cauchy problem of two-dimensional (2D) full compressible magnetohydrodynamic (MHD) equations in the whole plane $\mathbb{R}^2$ with zero density at infinity. By spatial weighted energy method, we derive the local existence and uniqueness of strong solutions provided that the initial density and the initial magnetic field decay not too slowly at infinity. Note that the initial temperature does not need to decay slowly at infinity. In particular, vacuum states at both the interior domain and the far field are allowed.

preprint2022arXiv

Markov subsampling based Huber Criterion

Subsampling is an important technique to tackle the computational challenges brought by big data. Many subsampling procedures fall within the framework of importance sampling, which assigns high sampling probabilities to the samples appearing to have big impacts. When the noise level is high, those sampling procedures tend to pick many outliers and thus often do not perform satisfactorily in practice. To tackle this issue, we design a new Markov subsampling strategy based on Huber criterion (HMS) to construct an informative subset from the noisy full data; the constructed subset then serves as a refined working data for efficient processing. HMS is built upon a Metropolis-Hasting procedure, where the inclusion probability of each sampling unit is determined using the Huber criterion to prevent over scoring the outliers. Under mild conditions, we show that the estimator based on the subsamples selected by HMS is statistically consistent with a sub-Gaussian deviation bound. The promising performance of HMS is demonstrated by extensive studies on large scale simulations and real data examples.

preprint2022arXiv

NeurIPS'22 Cross-Domain MetaDL competition: Design and baseline results

We present the design and baseline results for a new challenge in the ChaLearn meta-learning series, accepted at NeurIPS'22, focusing on "cross-domain" meta-learning. Meta-learning aims to leverage experience gained from previous tasks to solve new tasks efficiently (i.e., with better performance, little training data, and/or modest computational resources). While previous challenges in the series focused on within-domain few-shot learning problems, with the aim of learning efficiently N-way k-shot tasks (i.e., N class classification problems with k training examples), this competition challenges the participants to solve "any-way" and "any-shot" problems drawn from various domains (healthcare, ecology, biology, manufacturing, and others), chosen for their humanitarian and societal impact. To that end, we created Meta-Album, a meta-dataset of 40 image classification datasets from 10 domains, from which we carve out tasks with any number of "ways" (within the range 2-20) and any number of "shots" (within the range 1-20). The competition is with code submission, fully blind-tested on the CodaLab challenge platform. The code of the winners will be open-sourced, enabling the deployment of automated machine learning solutions for few-shot image classification across several domains.

preprint2022arXiv

No One Left Behind: Inclusive Federated Learning over Heterogeneous Devices

Federated learning (FL) is an important paradigm for training global models from decentralized data in a privacy-preserving way. Existing FL methods usually assume the global model can be trained on any participating client. However, in real applications, the devices of clients are usually heterogeneous, and have different computing power. Although big models like BERT have achieved huge success in AI, it is difficult to apply them to heterogeneous FL with weak clients. The straightforward solutions like removing the weak clients or using a small model to fit all clients will lead to some problems, such as under-representation of dropped clients and inferior accuracy due to data loss or limited model representation ability. In this work, we propose InclusiveFL, a client-inclusive federated learning method to handle this problem. The core idea of InclusiveFL is to assign models of different sizes to clients with different computing capabilities, bigger models for powerful clients and smaller ones for weak clients. We also propose an effective method to share the knowledge among multiple local models with different sizes. In this way, all the clients can participate in the model learning in FL, and the final model can be big and powerful enough. Besides, we propose a momentum knowledge distillation method to better transfer knowledge in big models on powerful clients to the small models on weak clients. Extensive experiments on many real-world benchmark datasets demonstrate the effectiveness of the proposed method in learning accurate models from clients with heterogeneous devices under the FL framework.

preprint2022arXiv

NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge

Novel object captioning aims at describing objects absent from training data, with the key ingredient being the provision of object vocabulary to the model. Although existing methods heavily rely on an object detection model, we view the detection step as vocabulary retrieval from an external knowledge in the form of embeddings for any object's definition from Wiktionary, where we use in the retrieval image region features learned from a transformers model. We propose an end-to-end Novel Object Captioning with Retrieved vocabulary from External Knowledge method (NOC-REK), which simultaneously learns vocabulary retrieval and caption generation, successfully describing novel objects outside of the training dataset. Furthermore, our model eliminates the requirement for model retraining by simply updating the external knowledge whenever a novel object appears. Our comprehensive experiments on held-out COCO and Nocaps datasets show that our NOC-REK is considerably effective against SOTAs.

preprint2022arXiv

OMG: Observe Multiple Granularities for Natural Language-Based Vehicle Retrieval

Retrieving tracked-vehicles by natural language descriptions plays a critical role in smart city construction. It aims to find the best match for the given texts from a set of tracked vehicles in surveillance videos. Existing works generally solve it by a dual-stream framework, which consists of a text encoder, a visual encoder and a cross-modal loss function. Although some progress has been made, they failed to fully exploit the information at various levels of granularity. To tackle this issue, we propose a novel framework for the natural language-based vehicle retrieval task, OMG, which Observes Multiple Granularities with respect to visual representation, textual representation and objective functions. For the visual representation, target features, context features and motion features are encoded separately. For the textual representation, one global embedding, three local embeddings and a color-type prompt embedding are extracted to represent various granularities of semantic features. Finally, the overall framework is optimized by a cross-modal multi-granularity contrastive loss function. Experiments demonstrate the effectiveness of our method. Our OMG significantly outperforms all previous methods and ranks the 9th on the 6th AI City Challenge Track2. The codes are available at https://github.com/dyhBUPT/OMG.

preprint2022arXiv

Photonic Dirac Nodal Line Semimetals Realized by Hyper-crystal

Recently, the gapless Dirac/Weyl nodal semimetals with linear dispersion and topologically protected modes degeneracy are rapidly growing frontiers of topological physics. Especially, type-I, type-II, and critical type-III nodal semimetals are discovered according to the tilt angles of the Dirac/Weyl cones. Here, by introducing hyperbolic metamaterials into one-dimensional photonic crystals, we design the "hyper-crystal" and study the photonic four-fold degenerate Dirac nodal line semimetals (DNLSs) with two types of perpendicularly polarized waves. Moreover, the flexibly controlled photonic DNLSs using the phase compensation effect of hyperbolic dispersion are studied. Our results not only demonstrate a new platform to realize the various photonic DNLSs, where the optical polarization plays the role of electron spin in electronic DNLSs, but also may pave a novel way to explore the abundant Dirac/Weyl physics in the simple classical wave systems.

preprint2022arXiv

Regularized Modal Regression on Markov-dependent Observations: A Theoretical Assessment

Modal regression, a widely used regression protocol, has been extensively investigated in statistical and machine learning communities due to its robustness to outliers and heavy-tailed noises. Understanding modal regression's theoretical behavior can be fundamental in learning theory. Despite significant progress in characterizing its statistical property, the majority of the results are based on the assumption that samples are independent and identical distributed (i.i.d.), which is too restrictive for real-world applications. This paper concerns the statistical property of regularized modal regression (RMR) within an important dependence structure - Markov dependent. Specifically, we establish the upper bound for RMR estimator under moderate conditions and give an explicit learning rate. Our results show that the Markov dependence impacts on the generalization error in the way that sample size would be discounted by a multiplicative factor depending on the spectral gap of underlying Markov chain. This result shed a new light on characterizing the theoretical underpinning for robust regression.

preprint2022arXiv

Robust Learning-based Predictive Control for Discrete-time Nonlinear Systems with Unknown Dynamics and State Constraints

Robust model predictive control (MPC) is a well-known control technique for model-based control with constraints and uncertainties. In classic robust tube-based MPC approaches, an open-loop control sequence is computed via periodically solving an online nominal MPC problem, which requires prior model information and frequent access to onboard computational resources. In this paper, we propose an efficient robust MPC solution based on receding horizon reinforcement learning, called r-LPC, for unknown nonlinear systems with state constraints and disturbances. The proposed r-LPC utilizes a Koopman operator-based prediction model obtained off-line from pre-collected input-output datasets. Unlike classic tube-based MPC, in each prediction time interval of r-LPC, we use an actor-critic structure to learn a near-optimal feedback control policy rather than a control sequence. The resulting closed-loop control policy can be learned off-line and deployed online or learned online in an asynchronous way. In the latter case, online learning can be activated whenever necessary; for instance, the safety constraint is violated with the deployed policy. The closed-loop recursive feasibility, robustness, and asymptotic stability are proven under function approximation errors of the actor-critic networks. Simulation and experimental results on two nonlinear systems with unknown dynamics and disturbances have demonstrated that our approach has better or comparable performance when compared with tube-based MPC and LQR, and outperforms a recently developed actor-critic learning approach.

preprint2022arXiv

Self-supervised Graph Learning for Occasional Group Recommendation

As an important branch in Recommender System, occasional group recommendation has received more and more attention. In this scenario, each occasional group (cold-start group) has no or few historical interacted items. As each occasional group has extremely sparse interactions with items, traditional group recommendation methods can not learn high-quality group representations. The recent proposed Graph Neural Networks (GNNs), which incorporate the high-order neighbors of the target occasional group, can alleviate the above problem in some extent. However, these GNNs still can not explicitly strengthen the embedding quality of the high-order neighbors with few interactions. Motivated by the Self-supervised Learning technique, which is able to find the correlations within the data itself, we propose a self-supervised graph learning framework, which takes the user/item/group embedding reconstruction as the pretext task to enhance the embeddings of the cold-start users/items/groups. In order to explicitly enhance the high-order cold-start neighbors' embedding quality, we further introduce an embedding enhancer, which leverages the self-attention mechanism to improve the embedding quality for them. Comprehensive experiments show the advantages of our proposed framework than the state-of-the-art methods.

preprint2022arXiv

Spin and polarization analysis of $Z_{cs}$ state

A polarization analysis is performed for the recent observation of $Z_{cs}$ exotic state in the $e^+e^-$ annihilation experiment with motivation for measuring its spin quantum number in the future. Starting with the unpolarized electron and positron beam, the polarization transfer to the $Z_{cs}$ state and its decay angular distribution patterns are investigated. Some observables are suggested for determination of the spin parity quantum numbers. An ensemble of Monte-Carlo events are used to show some moment distributions special for manifestation of the different $Z_{cs}$ spin scenarios.

preprint2022arXiv

Subgraph Retrieval Enhanced Model for Multi-hop Knowledge Base Question Answering

Recent works on knowledge base question answering (KBQA) retrieve subgraphs for easier reasoning. A desired subgraph is crucial as a small one may exclude the answer but a large one might introduce more noises. However, the existing retrieval is either heuristic or interwoven with the reasoning, causing reasoning on the partial subgraphs, which increases the reasoning bias when the intermediate supervision is missing. This paper proposes a trainable subgraph retriever (SR) decoupled from the subsequent reasoning process, which enables a plug-and-play framework to enhance any subgraph-oriented KBQA model. Extensive experiments demonstrate SR achieves significantly better retrieval and QA performance than existing retrieval methods. Via weakly supervised pre-training as well as the end-to-end fine-tuning, SRl achieves new state-of-the-art performance when combined with NSM, a subgraph-oriented reasoner, for embedding-based KBQA methods.

preprint2022arXiv

The existence of multi-peak positive solutions for nonlinear Kirchhoff equations

In this work, we study the following Kirchhoff equation $$\begin{cases}-\left(\varepsilon^2 a+\varepsilon b\int_{\mathbb R^3}|\nabla u|^2\right)Δu +u =Q(x)u^{q-1},\quad u>0,\quad x\in {\mathbb{R}^{3}},\\u\to 0,\quad \text{as}\ |x|\to +\infty,\end{cases}$$ where $a,b>0$ are constants, $2<q<6$, and $\varepsilon>0$ is a parameter. Under some suitable assumptions on the function $Q(x)$, we obtain that the equation above has positive multi-peak solutions concentrating at a critical point of $Q(x)$ for $\varepsilon>0$ sufficiently small, by using the Lyapunov-Schmidt reduction method. We extend the result in (Discrete Contin. Dynam. Systems 6(2000), 39--50) to the nonlinear Kirchhoff equation.

preprint2021arXiv

Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling

Visual storytelling is a task of generating relevant and interesting stories for given image sequences. In this work we aim at increasing the diversity of the generated stories while preserving the informative content from the images. We propose to foster the diversity and informativeness of a generated story by using a concept selection module that suggests a set of concept candidates. Then, we utilize a large scale pre-trained model to convert concepts and images into full stories. To enrich the candidate concepts, a commonsense knowledge graph is created for each image sequence from which the concept candidates are proposed. To obtain appropriate concepts from the graph, we propose two novel modules that consider the correlation among candidate concepts and the image-concept correlation. Extensive automatic and human evaluation results demonstrate that our model can produce reasonable concepts. This enables our model to outperform the previous models by a large margin on the diversity and informativeness of the story, while retaining the relevance of the story to the image sequence.

preprint2021arXiv

GraphPlan: Story Generation by Planning with Event Graph

Story generation is a task that aims to automatically produce multiple sentences to make up a meaningful story. This task is challenging because it requires high-level understanding of semantic meaning of sentences and causality of story events. Naive sequence-to-sequence models generally fail to acquire such knowledge, as the logical correctness can hardly be guaranteed in a text generation model without the strategic planning. In this paper, we focus on planning a sequence of events assisted by event graphs, and use the events to guide the generator. Instead of using a sequence-to-sequence model to output a storyline as in some existing works, we propose to generate an event sequence by walking on an event graph. The event graphs are built automatically based on the corpus. To evaluate the proposed approach, we conduct human evaluation both on event planning and story generation. Based on large-scale human annotation results, our proposed approach is shown to produce more logically correct event sequences and stories.

preprint2021arXiv

Omnidirectional nonreciprocal absorber realized by the magneto-optical hypercrystal

Photonic bandgap design is one of the most basic ways to effectively control the interaction between light and matter. However, the traditional photonic bandgap is always dispersive (blueshift with the increase of the incident angle), which is disadvantageous to the construction of wide-angle optical devices. Hypercrystal, that the photonic crystal with layered hyperbolic metamaterials (HMMs), can strongly modify the bandgap properties based on the anomalous wavevector dispersion of the HMM. Here, based on phase variation compensation between HMM and isotropic dielectric layers, we propose for the first time to design nonreciprocal and flexible photonic bandgaps using magneto-optical HMMs in one-dimensional photonic crystals. Especially for the forward and backward incident light, the blueshift and dispersionless of the forward and backward cavity modes are designed respectively to realize the interesting omnidirectional nonreciprocal absorber. Our results show high (low) absorption about 0.99 (0.25) in an angle range of 20-75 degrees for the forward (backward) incident light at the wavelength of 367 nm. The nonreciprocal omnidirectional cavity mode not only facilitates the design of perfect unidirectional optical absorbers working in a wide-angle range, but also possesses significant applications for all-angle reflectors and filters.

preprint2021arXiv

On a Conjecture of Bahri-Xu

In order to study the Yamabe changing-sign problem, Bahri and Xu proposed a conjecture which is a universal inequality for $p$ points in $\mathbb R^m$. They have verified the conjecture for $p\leq3$. In this paper, we first simplify this conjecture by giving two sufficient and necessary conditions inductively. Then we prove the conjecture for the basic case $m=1$ with arbitrary $p$. In addition, for the cases when $p=4,5$ and $m\geq2$, we manage to reduce them to the basic case $m=1$ and thus prove them as well.

preprint2021arXiv

Optimizing target nodes selection for the control energy of directed complex networks

The energy needed in controlling a complex network is a problem of practical importance. Recent works have focused on the reduction of control energy either via strategic placement of driver nodes, or by decreasing the cardinality of nodes to be controlled. However, optimizing control energy with respect to target nodes selection has yet been considered. In this work, we propose an iterative method based on Stiefel manifold optimization of selectable target node matrix to reduce control energy. We derive the matrix derivative gradient needed for the search algorithm in a general way, and search for target nodes which result in reduced control energy, assuming that driver nodes placement is fixed. Our findings reveal that the control energy is optimal when the path distances from driver nodes to target nodes are minimized. We corroborate our algorithm with extensive simulations on elementary network topologies, random and scale-free networks, as well as various real networks. The simulation results show that the control energy found using our algorithm outperforms heuristic selection strategies for choosing target nodes by a few orders of magnitude. Our work may be applicable to opinion networks, where one is interested in identifying the optimal group of individuals that the driver nodes can influence.

preprint2021arXiv

Power-Efficient Silicon Nitride Thermo-Optic Phase Shifters for Visible Light

We demonstrate power-efficient, thermo-optic, silicon nitride waveguide phase shifters for blue, green, and yellow wavelengths. The phase shifters operated with low power consumption due to a suspended structure and multi-pass waveguide design. The devices were fabricated on 200-mm silicon wafers using deep ultraviolet lithography as part of an active visible-light integrated photonics platform. The measured power consumption to achieve a $π$ phase shift (averaged over multiple devices) was 0.78, 0.93, 1.09, and 1.20 mW at wavelengths of 445, 488, 532, and 561 nm, respectively. The phase shifters were integrated into Mach-Zehnder interferometer switches, and $10- 90$\% rise(fall) times of about 570(590) $μ$s were measured.

preprint2020arXiv

Abnormal wave propagation of high-k modes in tilted linear-crossing metamaterials

In this work, we treat the rotation of the optical axis as a new degree of freedom and theoretically propose a tilted linear-crossing metamaterials (TLCMM). Specifically, the conical dispersions of the NLCMM and TLCMM have the same shapes as type-I and type-II Dirac cones, respectively, in condensed matter physics. Upon rotating the optical axis angle such that it is equal to the cone angle, we find that this special TLCMM has the shape of a type-III Dirac cone. This critical TLCMM can have many unique properties and is of a fundamentally different nature than neighboring phases. When EM waves with large wave vectors are incident to a metamaterial with an open IFC to free space, the incident EM wave is strongly reflected due to wave-vector mismatch. Here, we use boundary conditions and the causality law to reveal that TLCMM high-k modes can achieve abnormal refraction without reflection and filtering. Moreover, these phenomenon are observed experimentally in a planar circuit-based system. The circuit-based TLCMM not only provides a versatile platform for the study of robust negative refraction phenomena in metamaterials, but also has a planar structure that is easier to integrate. Our results regarding the manipulation of EM waves may enable their use in planar-integrated photonics including for directional propagation, cloaking, and switching.

preprint2020arXiv

CONNA: Addressing Name Disambiguation on The Fly

Name disambiguation is a key and also a very tough problem in many online systems such as social search and academic search. Despite considerable research, a critical issue that has not been systematically studied is disambiguation on the fly -- to complete the disambiguation in the real-time. This is very challenging, as the disambiguation algorithm must be accurate, efficient, and error tolerance. In this paper, we propose a novel framework -- CONNA -- to train a matching component and a decision component jointly via reinforcement learning. The matching component is responsible for finding the top matched candidate for the given paper, and the decision component is responsible for deciding on assigning the top matched person or creating a new person. The two components are intertwined and can be bootstrapped via jointly training. Empirically, we evaluate CONNA on two name disambiguation datasets. Experimental results show that the proposed framework can achieve a 1.21%-19.84% improvement on F1-score using joint training of the matching and the decision components. The proposed CONNA has been successfully deployed on AMiner -- a large online academic search system.

preprint2020arXiv

Experimental demonstration of an anomalous Floquet topological insulator based on negative-index media

Photonic and acoustic topological insulators exhibiting one-way transportation that is robust against defects and impurities are typically realized in coupled arrays of two-dimensional ring resonators. These systems have produced a series of applications, including optical isolators, delay lines, and lasers. However, the structures are complicated because an additional coupler ring between neighboring rings is needed to construct photonic pseudo-spin. In this work, a photonic analogue of the quantum spin Hall effect is proposed and experimentally demonstrated in an anomalous Floquet topological insulator in the microwave regime. This improved design takes advantage of the efficient and backward coupling of negative-index media. The results contribute to the understanding of topological structures in metamaterials and point toward a new direction for constructing useful topological photonic devices.

preprint2020arXiv

FedSel: Federated SGD under Local Differential Privacy with Top-k Dimension Selection

As massive data are produced from small gadgets, federated learning on mobile devices has become an emerging trend. In the federated setting, Stochastic Gradient Descent (SGD) has been widely used in federated learning for various machine learning models. To prevent privacy leakages from gradients that are calculated on users' sensitive data, local differential privacy (LDP) has been considered as a privacy guarantee in federated SGD recently. However, the existing solutions have a dimension dependency problem: the injected noise is substantially proportional to the dimension $d$. In this work, we propose a two-stage framework FedSel for federated SGD under LDP to relieve this problem. Our key idea is that not all dimensions are equally important so that we privately select Top-k dimensions according to their contributions in each iteration of federated SGD. Specifically, we propose three private dimension selection mechanisms and adapt the gradient accumulation technique to stabilize the learning process with noisy updates. We also theoretically analyze privacy, accuracy and time complexity of FedSel, which outperforms the state-of-the-art solutions. Experiments on real-world and synthetic datasets verify the effectiveness and efficiency of our framework.

preprint2020arXiv

JarKA: Modeling Attribute Interactions for Cross-lingual Knowledge Alignment

Abstract. Cross-lingual knowledge alignment is the cornerstone in building a comprehensive knowledge graph (KG), which can benefit various knowledge-driven applications. As the structures of KGs are usually sparse, attributes of entities may play an important role in aligning the entities. However, the heterogeneity of the attributes across KGs prevents from accurately embedding and comparing entities. To deal with the issue, we propose to model the interactions between attributes, instead of globally embedding an entity with all the attributes. We further propose a joint framework to merge the alignments inferred from the attributes and the structures. Experimental results show that the proposed model outperforms the state-of-art baselines by up to 38.48% HitRatio@1. The results also demonstrate that our model can infer the alignments between attributes, relationships and values, in addition to entities.

preprint2020arXiv

Nonlinear nanophotonic devices in the Ultraviolet to Visible wavelength range

Although the first lasers invented operated in the visible, the first on-chip devices were optimized for near-infrared (IR) performance driven by demand in telecommunications. However, as the applications of integrated photonics has broadened, the wavelength demand has as well, and we are now returning to the visible (Vis) and pushing into the ultraviolet (UV). This shift has required innovations in device design and in materials as well as leveraging nonlinear behavior to reach these wavelengths. This review discusses the key nonlinear phenomena that can be used as well as presents several emerging material systems and devices that have reached the UV-Vis wavelength range.

preprint2020arXiv

One-dimensional topological quasiperiodic chain for directional wireless power transfer

As an important class of systems with unique topological effects beyond the periodic lattices, quasiperiodic topological structures have attracted much attention in recent years. Due to the quasiperiodic modulation, the topological states in the quasiperiodic topological structures have the characteristics of self-similarity, which can be used to observe the charming Hofstadter butterfly. In addition, because of the asymmetric distribution, the edge states in quasiperiodic chain can be used to realize the adiabatic pumping. When the topological parameters in quasiperiodic topological lattices are considered as synthetic dimensions, they can also be used to study the topological properties with higher dimensions. Here, by using ultra-subwavelength resonators, we design and fabricate a type of one-dimensional quasiperiodic Harper chain with asymmetric topological edge states for the directional wireless power transfer (WPT). By further introducing a power source into the system, we selectively light up two Chinese characters which is composed of LED lamps at both ends of the chain. Moreover, the directional WPT implemented by the topological quasiperiodic chain has the property of topological protection, which is immune to the internal disorder perturbation of the structure. Not only do we apply the asymmetric edge state for directional WPT, but also may further actively control the directional WPT by using the external voltage. In addition, this work provides a flexible platform for designing new WPT devices, such as using the corner states in high-order topological structures or the skin effect in the non-Hermitian topological lattices.

preprint2020arXiv

Sensitivity of topological edge states in a non-Hermitian dimer chain

Photonic topological edge states in one-dimensional dimer chains have long been thought to be robust to structural perturbations by mapping the topological Su-Schrieffer-Heeger model of a solid-state system. However, the edge states at the two ends of a finite topological dimer chain will interact as a result of near-field coupling. This leads to deviation from topological protection by the chiral symmetry from the exact zero energy, weakening the robustness of the topological edge state. With the aid of non-Hermitian physics, the splitting frequencies of edge states can be degenerated again and topological protection recovered by altering the gain or loss strength of the structure. This point of coalescence is known as the exceptional point (EP). The intriguing physical properties of EPs in topological structures give rise to many fascinating and counterintuitive phenomena. In this work, based on a finite non-Hermitian dimer chain composed of ultra-subwavelength resonators, we propose theoretically and verify experimentally that the sensitivity of topological edge states is greatly affected when the system passes through the EP. Using the EP of a non-Hermitian dimer chain, we realize a new sensor that is sensitive to perturbation at the end of the structure and yet topologically protected from internal perturbation. Our demonstration of a non-Hermitian topological structure with an EP paves the way for the development of novel sensors that are not sensitive to internal manufacturing errors but are highly sensitive to changes in the external environment.

preprint2020arXiv

Towards the understanding of fully-heavy tetraquark states from various models

We use a color-magnetic interaction model (CMIM), a traditional constituent quark model (CQM) and a multiquark color flux-tube model (MCFTM) to systematically investigate the properties of the states $[Q_1Q_2][\bar{Q}_3\bar{Q}_4]$ ($Q=c,b$). The dynamical investigation indicates that the CMIM can not completely absorb QCD dynamical effects through the effective constituent quark mass and overestimates the color-magnetic interaction in the states under the assumption of the same spatial configurations. The Coulomb interaction plays a critical role in the dynamical model calculations on the heavy hadrons, which induces the fact that none of bound states $[Q_1Q_2][\bar{Q}_3\bar{Q}_4]$ can be found in the dynamical models. The color configuration $\left[[Q_1Q_2]_{\mathbf{6}_c}[\bar{Q}_3\bar{Q}_4]_{\bar{\mathbf{6}}_c}\right]_{\mathbf{1}}$ should be taken seriously in the ground states due to the strong Coulomb attraction between the $[Q_1Q_2]_{\mathbf{6}_c}$ and $[\bar{Q}_3\bar{Q}_4]_{\bar{\mathbf{6}}_c}$. The color configuration $\left[[Q_1Q_2]_{\bar{\mathbf{3}}_c}[\bar{Q}_2\bar{Q}_4]_{\mathbf{3}_c}\right]_{\mathbf{1}}$ is absolutely dominant in the excited states because of the strong Coulomb attraction within the $[Q_1Q_2]_{\bar{\mathbf{3}}_c}$ and $[\bar{Q}_2\bar{Q}_4]_{\mathbf{3}_c}$. The $J/Ψ$-pair resonances recently observed by LHCb are difficult to be accommodated in the CMIM. The broad structure ranging from 6.2 to 6.8 GeV can be described as the ground tetraquark state $[cc][\bar{c}\bar{c}]$ in the various dynamical models. The narrow structure $X(6900)$ can be identified as the excited state $[cc][\bar{c}\bar{c}]$ with $L=1$ ($L=2$) in the CQM (MCFTM).

preprint2020arXiv

Unsupervised Manifold Clustering of Topological Phononics

Classification of topological phononics is challenging due to the lack of universal topological invariants and the randomness of structure patterns. Here, we show the unsupervised manifold learning for clustering topological phononics without any priori knowledge, neither topological invariants nor supervised trainings, even when systems are imperfect or disordered. This is achieved by exploiting the real-space projection operator about finite phononic lattices to describe the correlation between oscillators. We exemplify the efficient unsupervised manifold clustering in typical phononic systems, including one-dimensional Su-Schrieffer-Heeger-type phononic chain with random couplings, amorphous phononic topological insulators, higher-order phononic topological states and non-Hermitian phononic chain with random dissipations. The results would inspire more efforts on applications of unsupervised machine learning for topological phononic devices and beyond.

preprint2020arXiv

Wireless power transfer via topological modes in dimer chains

The topological characteristics, including invariant topological orders, band inversion, and the topological edge mode (TEM) in the photonic insulators, have been widely studied. Whether people can take advantage of intriguing topological modes in simple one-dimensional systems to implement some practical applications is an issue which people are increasingly concerned about. In this work, based on a photonic dimer chain composed of ultra-subwavelength resonators, we verify experimentally that the TEM in the effective second-order parity-time (PT) system is immune to the inner disorder perturbation, and can be used to realize the long-range wireless power transfer (WPT) with high transmission efficiency. To intuitively show the TEM can be used for WPT, a power signal source is used to excite the TEM. It can be clearly seen that two LED lamps with 0.5-W at both ends of the structure are lighted up with the aid of TEMs. In addition, in order to solve the special technical problems of standby power loss and frequency tracking, we further propose that a WPT system with effective third-order PT symmetry can be constructed by using one topological interface mode and two TEMs. Inspired by the long-range WPT with TEMs in this work, it is expected to use more complex topological structures to achieve energy transmission with more functions, such as the WPT devices whose direction can be selected flexibly in the quasiperiodic or trimer topological chains.

preprint2019arXiv

Can the state $Y(4626)$ be a $P$-wave tetraquark state $[cs][\bar{c}\bar{s}]$?

Stimulated by the state $Y(4626)$ recently reported by Belle Collaboration, we utilize a multiquark color flux-tube model with a multibody confinement potential and one-glue-exchange interaction to make an exhaustive investigation on the diquark-antidiquark state $[cs][\bar{c}\bar{s}]$. Numerical results indicate that the appearance of the states $[cs][\bar{c}\bar{s}]$ like a dumb-bell, the larger the orbital excitation $L$, the more distinguished the shape. The mixing of the color configurations $\left[[cs]_{\bar{\mathbf{3}}_c}[\bar{c}\bar{s}]_{\mathbf{3}_c}\right]_{\mathbf{1}}$ and $\left[[cs]_{\mathbf{6}_c}[\bar{c}\bar{s}]_{\bar{\mathbf{6}}_c}\right]_{\mathbf{1}}$ in the ground states is strong while the color configuration $\left[[cs]_{\bar{\mathbf{3}}_c}[\bar{c}\bar{s}]_{\mathbf{3}_c}\right]_{\mathbf{1}}$ is absolutely predominant in the excited states. The main component of the state $Y(4626)$ can be interpreted as a $P$-wave state $[cs][\bar{c}\bar{s}]$. Its hidden-bottom partner is predicted in the model calculation. The states $X(4140)$, $X(4274)$, $X(4350)$, $X(4500)$ and $X(4700)$ are also discussed.

preprint2019arXiv

Hyperbolic Metamaterials: From Dispersion Manipulation to Application

Manipulating the property of iso-frequency contour (IFC) will provide a powerful control for the interaction between light and matter. Importantly, hyperbolic metamaterials (HMMs), a class of artificial anisotropic materials with hyperbolic IFC have been intensively investigated. Because of the open dispersion curves, HMMs support propagating high-k modes and possess enhanced photonic density of states. As a result, HMMs can be utilized to realize hyper-lens breaking the diffraction limit, meta-cavity laser with subwavelength scale, high sensitivity sensor, long-range energy transfer and so on. In order to make it easier for people who are about to enter this burgeoning and rapidly developing research field, this tutorial article not only introduces the basic physical properties of HMMs, but also discusses the dispersion manipulation of HMMs and HMM-based structures such as hypercrystals. The theoretical methods and experimental platforms are given in this tutorial. Finally, some potential applications associated with hyperbolic dispersion are also introduced

preprint2019arXiv

Liquid-Like Interfaces Mediate Structural Phase Transitions in Lead Halide Perovskites

Microscopic pathways of structural phase transitions are difficult to probe because they occur over multiple, disparate time and length scales. Using $in$ $situ$ nanoscale cathodoluminescence microscopy, we visualize the thermally-driven transition to the perovskite phase in hundreds of non-perovskite phase nanowires, resolving the initial nanoscale nucleation and subsequent mesoscale growth and quantifying the activation energy for phase propagation. In combination with molecular dynamics computer simulations, we reveal that the transformation does not follow a simple martensitic mechanism, and proceeds via ion diffusion through a liquid-like interface between the two structures. While cations are disordered in this liquid-like region, the halide ions retain substantial spatial correlations. We find that the anisotropic crystal structure translates to faster nucleation of the perovskite phase at nanowire ends and faster growth along the long nanowire axis. These results represent a significant step towards manipulating structural phases at the nanoscale for designer materials properties.

preprint2018arXiv

Observation of acoustic spin

Unlike optical waves, acoustic waves in fluids are described by scalar pressure fields, and therefore are considered spinless. Here, we demonstrate experimentally the existence of spin in acoustics. In the interference of two acoustic waves propagating perpendicularly to each other, we observed the spin angular momentum in free space as a result of the rotation of local particle velocity. We successfully measured the acoustic spin, and spin induced torque acting on a lossy acoustic meta-atom that results from absorption of the spin angular momentum. The acoustic spin is also observed in the evanescent field of a guided mode traveling along a metamaterial waveguide. We found spin-momentum locking in acoustic waves whose propagation direction is determined by the sign of spin. The observed acoustic spin could open a new door in acoustics and their applications for the control of wave propagation and particle rotation.

preprint2016arXiv

A Decade of Developing Radio-Astronomy Instrumentation using CASPER Open-Source Technology

The Collaboration for Astronomy Signal Processing and Electronics Research (CASPER) has been working for a decade to reduce the time and cost of designing, building and deploying new digital radio-astronomy instruments. Today, CASPER open-source technology powers over 45 scientific instruments worldwide, and is used by scientists and engineers at dozens of academic institutions. In this paper we catalog the current offerings of the CASPER collaboration, and instruments past and present built by CASPER users and developers. We describe the ongoing state of software development, as CASPER looks to support a broader range of programming environments and hardware and ensure compatibility with the latest vendor tools.

preprint2014arXiv

A GPU-Based Wide-Band Radio Spectrometer

The Graphics Processing Unit (GPU) has become an integral part of astronomical instrumentation, enabling high-performance online data reduction and accelerated online signal processing. In this paper, we describe a wide-band reconfigurable spectrometer built using an off-the-shelf GPU card. This spectrometer, when configured as a polyphase filter bank (PFB), supports a dual-polarization bandwidth of up to 1.1 GHz (or a single-polarization bandwidth of up to 2.2 GHz) on the latest generation of GPUs. On the other hand, when configured as a direct FFT, the spectrometer supports a dual-polarization bandwidth of up to 1.4 GHz (or a single-polarization bandwidth of up to 2.8 GHz).

preprint2013arXiv

Photonic band gap of a graphene-embedded quarter-wave stack

Here, we present a mechanism for tailoring the photonic band structure of a quarter-wave stack without changing its physical periods by embedding conductive sheets. Graphene is utilized and studied as a realistic, two-dimensional conductive sheet. In a graphene-embedded quarter-wave stack, the synergic actions of Bragg scattering and graphene conductance contributions open photonic gaps at the center of the reduced Brillouin zone, that nonexistent in conventional quarter-wave stacks. Such photonic gaps show giant, loss-independent density of optical states at the fixed lower-gap-edges, of even-multiple characteristic frequency of the quarter-wave stack. The novel conductive sheets induced photonic gaps provide a new platform for the enhancement of light-matter interactions.

preprint2012arXiv

Advanced Multi-beam Spectrometer for the Green Bank Telescope

A new spectrometer for the Green Bank Telescope (GBT) is being built jointly by the NRAO and the CASPER, University of California, Berkeley. The spectrometer uses 8 bit ADCs and will be capable of processing up to 1.25 GHz bandwidth from 8 dual polarized beams. This mode will be used to process data from focal plane arrays. The spectrometer supports observing mode with 8 tunable digital sub-bands within the 1.25 GHz bandwidth. The spectrometer can also be configured to process a bandwidth of up to 10 GHz with 64 tunable sub-bands from a dual polarized beam. The vastly enhanced backend capabilities will support several new science projects with the GBT.

preprint2012arXiv

Charmonium states in QCD-inspired quark potential model using Gaussian expansion method

We investigate the mass spectrum and electromagnetic processes of charmonium system with the nonperturbative treatment for the spin-dependent potentials, comparing the pure scalar and scalar-vector mixing linear confining potentials. It is revealed that the scalar-vector mixing confinement would be important for reproducing the mass spectrum and decay widths, and therein the vector component is predicted to be around 22%. With the state wave functions obtained via the full-potential Hamiltonian, the long-standing discrepancy in M1 radiative transitions of $J/ψ$ and $ψ^{\prime}$ are alleviated spontaneously. This work also intends to provide an inspection and suggestion for the possible $c\bar{c}$ among the copious higher charmonium-like states. Particularly, the newly observed X(4160) and X(4350) are found in the charmonium family mass spectrum as $M(2^1D_2)= 4164.9$ MeV and $M(3^3P_2)= 4352.4$ MeV, which strongly favor the $J^{PC}=2^{-+}, 2^{++}$ assignments respectively. The corresponding radiative transitions, leptonic and two-photon decay widths have been also predicted theoretically for the further experimental search.

preprint2012arXiv

Photonic simulation of topological excitations in metamaterials

Condensed matter systems with topological order and metamaterials with left-handed chirality have attracted recently extensive interests in the fields of physics and optics. So far the two fields are independent, and there is no work to address their connection. Here we propose to establish the relation between the topological order in condensed matter systems and the chirality in metamaterials, by mapping explicitly Maxwell's equations to the Dirac equation in one dimension. We report an experimental implement of the band inversion in the Dirac equation, which accompanies change of chirality of electromagnetic wave in metamaterials, and the first microwave measurement of topological excitations and topological phases in one dimension. Our finding provides a proof-of-principle example that electromagnetic wave in the metamaterials can be used to simulate the topological order in condensed matter systems and quantum phenomena in relativistic quantum mechanics in a controlled laboratory environment.

preprint2012arXiv

Planar Metamaterial with Toroidal Moment

We observe toroidal response from a planar metamaterial comprised of asymmetric split ring resonators (ASRRs). It is showed that a toroidal-molecule can be constructed through rational arrangement of planar ASRRs as meta-atoms via manipulating structural symmetry of the meta-atoms. Field maps clearly indicate that the toroidal resonance paves new electromagnetic confinement style in a subwavelength scale. Planar scheme of manipulating the coupling among the ASRRs may stimulate research in optical region involving toroidal multipoles. Toroidal geometry together with the Fano resonance made high-Q response will have enormous potential applications in low-threshold lasing, cavity quantum electrodynamics and nonlinear processing.

preprint2011arXiv

Determining the strange and antistrange quark distributions of the nucleon

The difference between the strange and antistrange quark distributions, $δs(x)=s(x)-\sbar(x)$, and the combination of light quark sea and strange quark sea, $Δ(x)=\dbar(x)+\ubar(x)-s(x)-\sbar(x)$, are originated from non-perturbative processes, and can be calculated using non-perturbative models of the nucleon. We report calculations of $δs(x)$ and $Δ(x)$ using the meson cloud model. Combining our calculations of $Δ(x)$ with relatively well known light antiquark distributions obtained from global analysis of available experimental data, we estimate the total strange sea distributions of the nucleon.

preprint2011arXiv

Low-threshold directional plasmon lasing assisted by spatially coherent surface plasmon polaritons

We theoretically propose directional, low-threshold plasmon lasing in both the near-infrared and visible wavelengths by utilizing spatially coherent surface plasmon polaritons on a meta-surface. The gain strength required for threshold lasing can be tuned down to a large extent through compatible structural parameters. Our calculations show that no more than 65 cm$^{-1}$ at 193.5 THz (1.55 $μ$m) or 267 cm$^{-1}$at 474THz (0.633 $μ$m) of gain coefficient is sufficient to compensate for the dissipation of metal films for threshold lasing; these values are smaller than any reported studies at the same frequencies. These findings present a planar solid-state route for plasmon lasing that is highly efficient and spatially coherent.

preprint2011arXiv

Metallic helix array as a broadband wave plate

This study proposes that a metallic helix array can operate as a highly-transparent broadband wave plate in propagation directions perpendicular to the axis of helices. The functionality arises from a special property of the helix array, namely that the eigenstates of elliptically right-handed and left-handed polarization are dominated by Bragg scattering and local resonance respectively, and can be modulated separately with nearly fixed difference between their wavevectors in a wide frequency range. The wave plate functionality is theoretically and experimentally demonstrated by the transformation of polarized states in a wide frequency range.

preprint2011arXiv

Radiative vertex $VPγ$ and $η-η'$ mixing in light-cone sum rules

In this work, we calculate radiative vertex $VPγ$ ($V=ϕ,ω,ρ$ and $P=η,η'$) by utilizing $ω-ϕ$ mixing scheme and taking into account the contributions of the three-particle twist-4 distribution amplitudes of the photon in QCD sum rules on light cone. According to experimental data of $V\rightarrow Pγ$ and $P\rightarrow Vγ$ from PDG, a value of the $η-η^\prime$ mixing angle, $ϕ=(40.9\pm0.5)^\circ$, is extracted in the framework of the quark-flavor basis to describe the $η-η^\prime$ system.

preprint2011arXiv

The possible candidates of tetraquark : $Z_b(10610)$ and $Z_b(10650)$

Using the chromomagnetic interaction Hamiltonian with proper account for the SU(3) flavor symmetry breaking, we have performed a schematic study on the masses of $S-$wave heavy tetraquarks as $bq\bar{b}\bar{q}$ ($q$ denotes $u$, $d$, $s$ quark). It is found that the numeral results for $bu\bar{b}\bar{d}$ or $bd\bar{b}\bar{u}$ with $1^{+}$ quantum number are 10612 MeV and 10683 MeV respectively, which are well compatible with the recent detected charged bottomonium-like $Z_b(10610)$ and $Z_b(10650)$. Theoretically, we also investigate the possible tetraquark states of $1^{++}$ and $2^{+}$ due to the charge conjugation as the potential candidates for the updating experiments.

preprint2010arXiv

Ferrimagnetic-like surface resonance from hybrid metamaterial slab

We show that a hybrid metamaterial slab comprising of an one-dimensional array of two different types of cavities exhibits ferrimagnetic-like surface resonances which can be used to realize interesting phenomena such as directive emission as a consequence of strong angle-dependent reflection phase and the selective coupling of a Gaussian incident beam into a higher order diffractive channel, giving rise to the phenomenon of negative reflection and retro-directive backtracking. The findings are verified by experiments in the microwave regime.

preprint2010arXiv

Spatially coherent surface resonance states derived from magnetic resonances

A thin metamaterial slab comprising a dielectric spacer sandwiched between a metallic grating and a ground plane is shown to possess spatially coherent surface resonance states that span a large frequency range and can be tuned by structural and material parameters. They give rise to nearly perfect angle-selective absorption and thus exhibit directional thermal emissivity. Direct numerical simulations show that the metamaterial slab supports spatially coherent thermal emission in a wide frequency range that is robust against structural disorder.

preprint1992arXiv

Lattice QCD with 8 Light Quark Flavors

QCD with eight flavors is studied on $16^3\times N_t$ lattices with $N_t=4$, 6, 8, 16 and 32, a dynamical quark mass $ma=0.015$ and lattice coupling $β=6/g^2$ between 4.5 and 5.0. For $N_t=16$ and 32, hadron masses and screening lengths are computed for a variety of valence quark masses. The previously observed, strong, first-order transition for $N_t=4$, 6 and 8 is seen, for $N_t=16$, to become a $β$-independent, zero-temperature transition characterized by a factor of $\approx 3$ change in lattice scale. This strong, first-order transition restores chiral symmetry, at least for $N_t=4$, 6 and 8, producing a chirally symmetric, weak-coupling phase. However, as $N_t$ increases to 16, the chiral symmetry properties of the weak-coupling side of the zero-temperature transition are unclear and offer a hint of a normal, finite-temperature, chiral symmetry breaking transition in the weak-coupling phase.

Hong Chen

What is connected

Connect this record

See the researcher in context

Building this map preview

67 published item(s)

A Fast Semidefinite Convex Relaxation for Optimal Control Problems With Spatio-Temporal Constraints

Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models

DP-SelFT: Differentially Private Selective Fine-Tuning for Large Language Models

Federated Nested Learning: Collaborative Training of Self-Referential Memories for Test-Time Adaptation

LASSA Architecture-Based Autonomous Fault-Tolerant Control of Unmanned Underwater Vehicles

TableCache: Primary Foreign Key Guided KV Cache Precomputation for Low Latency Text-to-SQL

Global Weighted Tensor Nuclear Norm for Tensor Robust Principal Component Analysis

A Multi-Strategy based Pre-Training Method for Cold-Start Recommendation

An Accelerator for Rule Induction in Fuzzy Rough Theory

CODE: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking

Elastic Valley Spin Controlled Chiral Coupling in Topological Valley Phononic Crystals

Equilibrium Defaultable Corporate Debt and Investment

Error-based Knockoffs Inference for Controlled Feature Selection

FLDP: Flexible strategy for local differential privacy

Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts

How zealots affect the energy cost for controlling complex social networks

Injecting Numerical Reasoning Skills into Knowledge Base Question Answering Models

Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

Local well-posedness to the 2D Cauchy problem of full compressible magnetohydrodynamic equations with vacuum at infinity

Markov subsampling based Huber Criterion

NeurIPS'22 Cross-Domain MetaDL competition: Design and baseline results

No One Left Behind: Inclusive Federated Learning over Heterogeneous Devices

NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge

OMG: Observe Multiple Granularities for Natural Language-Based Vehicle Retrieval

Photonic Dirac Nodal Line Semimetals Realized by Hyper-crystal

Regularized Modal Regression on Markov-dependent Observations: A Theoretical Assessment

Robust Learning-based Predictive Control for Discrete-time Nonlinear Systems with Unknown Dynamics and State Constraints

Self-supervised Graph Learning for Occasional Group Recommendation

Spin and polarization analysis of $Z_{cs}$ state

Subgraph Retrieval Enhanced Model for Multi-hop Knowledge Base Question Answering

The existence of multi-peak positive solutions for nonlinear Kirchhoff equations

Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling

GraphPlan: Story Generation by Planning with Event Graph

Omnidirectional nonreciprocal absorber realized by the magneto-optical hypercrystal

On a Conjecture of Bahri-Xu

Optimizing target nodes selection for the control energy of directed complex networks

Power-Efficient Silicon Nitride Thermo-Optic Phase Shifters for Visible Light

Abnormal wave propagation of high-k modes in tilted linear-crossing metamaterials

CONNA: Addressing Name Disambiguation on The Fly

Experimental demonstration of an anomalous Floquet topological insulator based on negative-index media

FedSel: Federated SGD under Local Differential Privacy with Top-k Dimension Selection

JarKA: Modeling Attribute Interactions for Cross-lingual Knowledge Alignment

Nonlinear nanophotonic devices in the Ultraviolet to Visible wavelength range

One-dimensional topological quasiperiodic chain for directional wireless power transfer

Sensitivity of topological edge states in a non-Hermitian dimer chain

Towards the understanding of fully-heavy tetraquark states from various models

Unsupervised Manifold Clustering of Topological Phononics

Wireless power transfer via topological modes in dimer chains

Can the state $Y(4626)$ be a $P$-wave tetraquark state $[cs][\bar{c}\bar{s}]$?

Hyperbolic Metamaterials: From Dispersion Manipulation to Application

Liquid-Like Interfaces Mediate Structural Phase Transitions in Lead Halide Perovskites

Observation of acoustic spin

A Decade of Developing Radio-Astronomy Instrumentation using CASPER Open-Source Technology

A GPU-Based Wide-Band Radio Spectrometer

Photonic band gap of a graphene-embedded quarter-wave stack

Advanced Multi-beam Spectrometer for the Green Bank Telescope

Charmonium states in QCD-inspired quark potential model using Gaussian expansion method

Photonic simulation of topological excitations in metamaterials

Planar Metamaterial with Toroidal Moment

Determining the strange and antistrange quark distributions of the nucleon

Low-threshold directional plasmon lasing assisted by spatially coherent surface plasmon polaritons

Metallic helix array as a broadband wave plate

Radiative vertex $VPγ$ and $η-η'$ mixing in light-cone sum rules

The possible candidates of tetraquark : $Z_b(10610)$ and $Z_b(10650)$

Ferrimagnetic-like surface resonance from hybrid metamaterial slab

Spatially coherent surface resonance states derived from magnetic resonances

Lattice QCD with 8 Light Quark Flavors