Source author record

Jing Jiang

Jing Jiang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

56 works

24 topics

4 close collaborators

Preprint, 2026, arXiv

Benchmarking Continuous Dynamic Multi-Objective Optimization: Survey and Generalized Test Suite

Dynamic multi-objective optimization (DMOO) has recently attracted increasing interest from both academic researchers and engineering practitioners, as numerous real-world applications that evolve over time can be naturally formulated as dynamic multi-objective optimization problems (DMOPs). This growing trend necessitates advanced benchmarks for the rigorous evaluation of optimization algorithms under realistic conditions. This paper introduces a comprehensive and principled framework for constructing highly realistic and challenging DMOO benchmarks. The proposed framework features several novel components: a generalized formulation that allows the Pareto-optimal Set (PS) to change on hypersurfaces, a mechanism for creating controlled variable contribution imbalances to generate heterogeneous landscapes, and dynamic rotation matrices for inducing time-varying variable interactions and non-separability. Furthermore, we incorporate a temporal perturbation mechanism to simulate irregular environmental changes and propose a generalized time-linkage mechanism that systematically embeds historical solution quality into future problems, thereby capturing critical real-world phenomena such as error accumulation and time-deception. Extensive experimental results validate the effectiveness of the proposed framework, demonstrating its superiority over conventional benchmarks in terms of realism, complexity, and its capability for discriminating state-of-the-art algorithmic performance. This work establishes a new standard for dynamic multi-objective optimization benchmarking, providing a powerful tool for the development and evaluation of next-generation algorithms capable of addressing the complexities of real-world dynamic systems.

Preprint, 2026, arXiv

SfMamba: Efficient Source-Free Domain Adaptation via Selective Scan Modeling

Source-free domain adaptation (SFDA) tackles the critical challenge of adapting source-pretrained models to unlabeled target domains without access to source data, overcoming data privacy and storage limitations in real-world applications. However, existing SFDA approaches struggle with the trade-off between perception field and computational efficiency in domain-invariant feature learning. Recently, Mamba has offered a promising solution through its selective scan mechanism, which enables long-range dependency modeling with linear complexity. However, Visual Mamba (i.e., VMamba) remains limited in capturing the channel-wise frequency characteristics critical for domain alignment and in maintaining spatial robustness under significant domain shifts. To address these limitations, we propose a framework called SfMamba to fully explore the stable dependency in source-free model transfer. SfMamba introduces a Channel-wise Visual State-Space block that enables channel-sequence scanning for domain-invariant feature extraction. In addition, SfMamba involves a Semantic-Consistent Shuffle strategy that disrupts background patch sequences in 2D selective scan while preserving prediction consistency to mitigate error accumulation. Comprehensive evaluations across multiple benchmarks show that SfMamba achieves consistently stronger performance than existing methods while maintaining favorable parameter efficiency, offering a practical solution for SFDA. Our code is available at https://github.com/chenxi52/SfMamba.

Preprint, 2025, arXiv

OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching

Hallucinations are often inevitable in downstream tasks using large language models (LLMs). To tackle the substantial challenge of addressing hallucinations for LLM-based ontology matching (OM) systems, we introduce a new benchmark dataset OAEI-LLM-T. The dataset evolves from seven TBox datasets in the Ontology Alignment Evaluation Initiative (OAEI), capturing hallucinations of ten different LLMs performing OM tasks. These OM-specific hallucinations are organised into two primary categories and six sub-categories. We showcase the usefulness of the dataset in constructing an LLM leaderboard for OM tasks and for fine-tuning LLMs used in OM tasks.

Preprint, 2022, arXiv

Attention-Guided Black-box Adversarial Attacks with Large-Scale Multiobjective Evolutionary Optimization

Fooling deep neural networks (DNNs) with the black-box optimization has become a popular adversarial attack fashion, as the structural prior knowledge of DNNs is always unknown. Nevertheless, recent black-box adversarial attacks may struggle to balance their attack ability and visual quality of the generated adversarial examples (AEs) in tackling high-resolution images. In this paper, we propose an attention-guided black-box adversarial attack based on the large-scale multiobjective evolutionary optimization, termed as LMOA. By considering the spatial semantic information of images, we firstly take advantage of the attention map to determine the perturbed pixels. Instead of attacking the entire image, reducing the perturbed pixels with the attention mechanism can help to avoid the notorious curse of dimensionality and thereby improves the performance of attacking. Secondly, a large-scale multiobjective evolutionary algorithm is employed to traverse the reduced pixels in the salient region. Benefiting from its characteristics, the generated AEs have the potential to fool target DNNs while being imperceptible by the human vision. Extensive experimental results have verified the effectiveness of the proposed LMOA on the ImageNet dataset. More importantly, it is more competitive to generate high-resolution AEs with better visual quality compared with the existing black-box adversarial attacks.

Preprint, 2022, arXiv

Beyond Low-pass Filtering: Graph Convolutional Networks with Automatic Filtering

Graph convolutional networks are becoming indispensable for deep learning from graph-structured data. Most of the existing graph convolutional networks share two big shortcomings. First, they are essentially low-pass filters, thus the potentially useful middle and high frequency band of graph signals are ignored. Second, the bandwidth of existing graph convolutional filters is fixed. Parameters of a graph convolutional filter only transform the graph inputs without changing the curvature of a graph convolutional filter function. In reality, we are uncertain about whether we should retain or cut off the frequency at a certain point unless we have expert domain knowledge. In this paper, we propose Automatic Graph Convolutional Networks (AutoGCN) to capture the full spectrum of graph signals and automatically update the bandwidth of graph convolutional filters. While it is based on graph spectral theory, our AutoGCN is also localized in space and has a spatial form. Experimental results show that AutoGCN achieves significant improvement over baseline methods which only work as low-pass filters.

Preprint, 2022, arXiv

Boosting Facial Expression Recognition by A Semi-Supervised Progressive Teacher

In this paper, we aim to improve the performance of in-the-wild Facial Expression Recognition (FER) by exploiting semi-supervised learning. Large-scale labeled data and deep learning methods have greatly improved the performance of image recognition. However, the performance of FER is still not ideal due to the lack of training data and incorrect annotations (e.g., label noises). Among existing in-the-wild FER datasets, reliable ones contain insufficient data to train robust deep models while large-scale ones are annotated in lower quality. To address this problem, we propose a semi-supervised learning algorithm named Progressive Teacher (PT) to utilize reliable FER datasets as well as large-scale unlabeled expression images for effective training. On the one hand, PT introduces semi-supervised learning method to relieve the shortage of data in FER. On the other hand, it selects useful labeled training samples automatically and progressively to alleviate label noise. PT uses selected clean labeled data for computing the supervised classification loss and unlabeled data for unsupervised consistency loss. Experiments on widely-used databases RAF-DB and FERPlus validate the effectiveness of our method, which achieves state-of-the-art performance with accuracy of 89.57% on RAF-DB. Additionally, when the synthetic noise rate reaches even 30%, the performance of our PT algorithm only degrades by 4.37%.

Preprint, 2022, arXiv

Disentangling Identity and Pose for Facial Expression Recognition

Facial expression recognition (FER) is a challenging problem because the expression component is always entangled with other irrelevant factors, such as identity and head pose. In this work, we propose an identity and pose disentangled facial expression recognition (IPD-FER) model to learn more discriminative feature representation. We regard the holistic facial representation as the combination of identity, pose and expression. These three components are encoded with different encoders. For identity encoder, a well pre-trained face recognition model is utilized and fixed during training, which alleviates the restriction on specific expression training data in previous works and makes the disentanglement practicable on in-the-wild datasets. At the same time, the pose and expression encoder are optimized with corresponding labels. Combining identity and pose feature, a neutral face of input individual should be generated by the decoder. When expression feature is added, the input image should be reconstructed. By comparing the difference between synthesized neutral and expressional images of the same individual, the expression component is further disentangled from identity and pose. Experimental results verify the effectiveness of our method on both lab-controlled and in-the-wild databases and we achieve state-of-the-art recognition performance.

Preprint, 2022, arXiv

Exploring and Adapting Chinese GPT to Pinyin Input Method

While GPT has become the de-facto method for text generation tasks, its application to pinyin input method remains unexplored. In this work, we make the first exploration to leverage Chinese GPT for pinyin input method. We find that a frozen GPT achieves state-of-the-art performance on perfect pinyin. However, the performance drops dramatically when the input includes abbreviated pinyin. A reason is that an abbreviated pinyin can be mapped to many perfect pinyin, which links to even larger number of Chinese characters. We mitigate this issue with two strategies, including enriching the context with pinyin and optimizing the training process to help distinguish homophones. To further facilitate the evaluation of pinyin input method, we create a dataset consisting of 270K instances from 15 domains. Results show that our approach improves performance on abbreviated pinyin across all domains. Model analysis demonstrates that both strategies contribute to the performance boost.

Preprint, 2022, arXiv

FedNoiL: A Simple Two-Level Sampling Method for Federated Learning with Noisy Labels

Federated learning (FL) aims at training a global model on the server side while the training data are collected and located at the local devices. Hence, the labels in practice are usually annotated by clients of varying expertise or criteria and thus contain different amounts of noises. Local training on noisy labels can easily result in overfitting to noisy labels, which is devastating to the global model through aggregation. Although recent robust FL methods take malicious clients into account, they have not addressed local noisy labels on each device and the impact to the global model. In this paper, we develop a simple two-level sampling method "FedNoiL" that (1) selects clients for more robust global aggregation on the server; and (2) selects clean labels and correct pseudo-labels at the client end for more robust local training. The sampling probabilities are built upon clean label detection by the global model. Moreover, we investigate different schedules changing the local epochs between aggregations over the course of FL, which notably improves the communication and computation efficiency in noisy label setting. In experiments with homogeneous/heterogeneous data distributions and noise ratios, we observed that direct combinations of SOTA FL methods with SOTA noisy-label learning methods can easily fail but our method consistently achieves better and robust performance.

Preprint, 2022, arXiv

FedProto: Federated Prototype Learning across Heterogeneous Clients

Heterogeneity across clients in federated learning (FL) usually hinders the optimization convergence and generalization performance when the aggregation of clients' knowledge occurs in the gradient space. For example, clients may differ in terms of data distribution, network latency, input/output space, and/or model architecture, which can easily lead to the misalignment of their local gradients. To improve the tolerance to heterogeneity, we propose a novel federated prototype learning (FedProto) framework in which the clients and server communicate the abstract class prototypes instead of the gradients. FedProto aggregates the local prototypes collected from different clients, and then sends the global prototypes back to all clients to regularize the training of local models. The training on each client aims to minimize the classification error on the local data while keeping the resulting local prototypes sufficiently close to the corresponding global ones. Moreover, we provide a theoretical analysis to the convergence rate of FedProto under non-convex objectives. In experiments, we propose a benchmark setting tailored for heterogeneous FL, with FedProto outperforming several recent FL approaches on multiple datasets.
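The prototype exchange described in this abstract can be sketched in a few lines. This is a minimal illustration and not the authors' implementation: it assumes a class prototype is the mean embedding of that class on a client, that the server takes an unweighted average over the clients holding the class, and that the local regularizer is a squared Euclidean distance. The function names are hypothetical.

```python
from collections import defaultdict


def local_prototypes(embeddings, labels):
    """Return the mean embedding per class for one client (assumed prototype definition)."""
    sums = {}
    counts = defaultdict(int)
    for vec, label in zip(embeddings, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def aggregate_prototypes(client_prototypes):
    """Average each class prototype over the clients that hold that class.

    The unweighted average is an assumption made for this sketch.
    """
    collected = defaultdict(list)
    for protos in client_prototypes:
        for c, p in protos.items():
            collected[c].append(p)
    return {c: [sum(col) / len(ps) for col in zip(*ps)]
            for c, ps in collected.items()}


def prototype_penalty(local, global_protos):
    """Squared Euclidean distance between local and global prototypes of shared classes."""
    return sum(
        sum((a - b) ** 2 for a, b in zip(local[c], global_protos[c]))
        for c in local if c in global_protos
    )


if __name__ == "__main__":
    client_a = local_prototypes([[0.0, 0.0], [2.0, 2.0], [4.0, 0.0]], [0, 0, 1])
    client_b = local_prototypes([[3.0, 3.0]], [0])
    global_protos = aggregate_prototypes([client_a, client_b])
    print(global_protos)
    print(prototype_penalty(client_a, global_protos))
```

Because only per-class vectors are exchanged, clients with different model architectures can take part as long as their embeddings share a dimension, which is the tolerance to heterogeneity that the abstract refers to.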

Preprint, 2022, arXiv

Gelfand-Kirillov dimensions and Reducibility of scalar type generalized Verma modules for classical Lie algebras

Let $\mathfrak{g}$ be a classical Lie algebra and $\mathfrak{p}$ be a maximal parabolic subalgebra. Let $M$ be a generalized Verma module induced from a one-dimensional representation of $\mathfrak{p}$. Such an $M$ is called a scalar type generalized Verma module. Its simple quotient $L$ is a highest weight module. In this paper, we determine the reducibility of such scalar type generalized Verma modules by computing the Gelfand-Kirillov dimension of $L$.
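For orientation, the objects named in this abstract can be written out as follows. This is a sketch in standard notation, not taken from the paper: the Levi decomposition $\mathfrak{p} = \mathfrak{l} \oplus \mathfrak{u}$, the character $\lambda$, and the stated form of the reducibility criterion are assumptions about how the Gelfand-Kirillov dimension is used.

```latex
% Scalar type generalized Verma module induced from a character \lambda of \mathfrak{p}
M(\lambda) \;=\; U(\mathfrak{g}) \otimes_{U(\mathfrak{p})} \mathbb{C}_{\lambda},
\qquad \mathfrak{p} = \mathfrak{l} \oplus \mathfrak{u}.

% As a module over the opposite nilradical, M(\lambda) is free of rank one, so
\operatorname{GKdim} M(\lambda) \;=\; \dim \mathfrak{u}.

% L(\lambda) is a quotient of M(\lambda), hence
\operatorname{GKdim} L(\lambda) \;\le\; \dim \mathfrak{u},

% and the criterion assumed here is
M(\lambda) \text{ is reducible} \iff \operatorname{GKdim} L(\lambda) < \dim \mathfrak{u}.
```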

Preprint, 2022, arXiv

Joint Trajectory Design and User Scheduling of Aerial Cognitive Radio Networks

Unmanned aerial vehicles (UAVs) have been widely employed to enhance the end-to-end performance of wireless communications since the links between UAVs and terrestrial nodes are line-of-sight (LoS) with high probability. However, the broadcast characteristics of signal propagation in LoS links make it vulnerable to being wiretapped by malicious eavesdroppers, which poses a considerable challenge to the security of wireless communications. This paper investigates the security of aerial cognitive radio networks (CRNs). An airborne base station transmits confidential messages to secondary users utilizing the same spectrum as the primary network. An aerial base station transmits jamming signals to suppress the eavesdropper to enhance secrecy performance. The uncertainty of eavesdropping node locations is considered, and the average secrecy rate of the cognitive user is maximized by optimizing multiple users' scheduling, the UAVs' trajectory, and transmit power. To solve the non-convex optimization problem with mixed multiple integers variable problem, we propose an iterative algorithm based on block coordinate descent and successive convex approximation. Numerical results verify the effectiveness of our proposed algorithm and demonstrate that our scheme is beneficial to improving the secrecy performance of aerial CRNs.

Preprint, 2022, arXiv

Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification

The Receptive Field (RF) size has been one of the most important factors for One Dimensional Convolutional Neural Networks (1D-CNNs) on time series classification tasks. Considerable effort has gone into choosing the appropriate size because it has a huge influence on performance and differs significantly between datasets. In this paper, we propose an Omni-Scale block (OS-block) for 1D-CNNs, where the kernel sizes are decided by a simple and universal rule. In particular, it is a set of kernel sizes, made up of multiple prime numbers chosen according to the length of the time series, that can efficiently cover the best RF size across different datasets. The experimental results show that models with the OS-block achieve performance similar to that of models with the searched optimal RF size. Owing to this strong ability to capture the optimal RF size, simple 1D-CNN models with the OS-block achieve state-of-the-art performance on four time series benchmarks, including both univariate and multivariate data from multiple domains. Comprehensive analysis and discussions shed light on why the OS-block can capture optimal RF sizes across different datasets. Code is available at https://github.com/Wensi-Tang/OS-CNN.
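The kernel-size rule mentioned in this abstract can be illustrated with a short sketch. It is not the authors' code: the upper bound as a fraction of the series length (`max_fraction`) and the inclusion of kernel size 1 are assumptions made here, and the function names are hypothetical. The receptive-field formula is the standard one for stacked stride-1, non-dilated 1D convolutions.

```python
def primes_up_to(n):
    """Return all primes <= n using a sieve of Eratosthenes."""
    if n < 2:
        return []
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, int(n ** 0.5) + 1):
        if is_prime[i]:
            for j in range(i * i, n + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def omni_scale_kernel_sizes(series_length, max_fraction=0.25):
    """Candidate kernel sizes: 1 plus the primes up to a length-dependent bound.

    max_fraction is a hypothetical parameter, not a value taken from the paper.
    """
    bound = max(2, int(series_length * max_fraction))
    return [1] + primes_up_to(bound)


def receptive_field(kernel_sizes):
    """Receptive field of stacked stride-1, non-dilated 1D convolutions."""
    return sum(kernel_sizes) - (len(kernel_sizes) - 1)


if __name__ == "__main__":
    sizes = omni_scale_kernel_sizes(40)
    print(sizes)
    print(receptive_field([3, 5]))
```

The point of the sketch is that a small set of prime kernel sizes, combined across a few layers, yields many distinct receptive-field sizes without a per-dataset search.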

Preprint, 2022, arXiv

On Secure NOMA-CDRT Systems with Physical Layer Network Coding

This paper proposes a new scheme to enhance the secrecy performance of a NOMA-based coordinated direct relay transmission system (NOMA-CDRT) with an untrusted relay. The physical-layer network coding and the non-orthogonal multiple access scheme are combined to improve the spectrum efficiency. Furthermore, inter-user interference and friendly jamming signals are utilized to suppress the eavesdropping ability of the untrusted relay without affecting the acceptance quality of legitimate users. Specifically, the far user in the first slot and the near user in the second slot act as jammers to generate jamming signals to ensure secure transmissions of the confidential signals. We investigate the secrecy performance of the proposed scheme in NOMA-CDRT systems and derive the closed-form expression for the ergodic secrecy sum rate. The asymptotic analysis at high signal-to-noise ratio is performed to obtain more insights. Finally, simulation results are presented to demonstrate the effectiveness of the proposed scheme and the correctness of the theoretical analysis.
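To make the notion of an ergodic secrecy rate concrete, the sketch below estimates it by Monte Carlo for a generic single-link wiretap model. It is not the paper's NOMA-CDRT scheme or its closed-form expression: the Rayleigh fading assumption (exponentially distributed channel power gains with unit mean), the averaging of the instantaneous secrecy rate, and all function and parameter names are assumptions for illustration.

```python
import math
import random


def secrecy_rate(snr_legit, snr_eve):
    """Instantaneous secrecy rate in bit/s/Hz, clipped at zero."""
    return max(0.0, math.log2(1.0 + snr_legit) - math.log2(1.0 + snr_eve))


def ergodic_secrecy_rate(avg_snr_legit, avg_snr_eve, trials=10000, seed=0):
    """Monte Carlo average of the secrecy rate under assumed Rayleigh fading."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        gain_legit = rng.expovariate(1.0)  # channel power gain, unit mean
        gain_eve = rng.expovariate(1.0)
        total += secrecy_rate(avg_snr_legit * gain_legit, avg_snr_eve * gain_eve)
    return total / trials


if __name__ == "__main__":
    # Average SNRs are linear values, not dB.
    print(ergodic_secrecy_rate(10.0, 1.0))
```

In this simplified model, jamming that lowers the eavesdropper's effective SNR raises the secrecy rate, which is the effect the abstract attributes to the friendly jamming signals.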

Preprint, 2022, arXiv

On the Convergence of Clustered Federated Learning

Knowledge sharing and model personalization are essential components to tackle the non-IID challenge in federated learning (FL). Most existing FL methods focus on two extremes: 1) to learn a shared model to serve all clients with non-IID data, and 2) to learn personalized models for each client, namely personalized FL. There is a trade-off solution, namely clustered FL or cluster-wise personalized FL, which aims to cluster similar clients into one cluster, and then learn a shared model for all clients within a cluster. This paper is to revisit the research of clustered FL by formulating them into a bi-level optimization framework that could unify existing methods. We propose a new theoretical analysis framework to prove the convergence by considering the clusterability among clients. In addition, we embody this framework in an algorithm, named Weighted Clustered Federated Learning (WeCFL). Empirical analysis verifies the theoretical results and demonstrates the effectiveness of the proposed WeCFL under the proposed cluster-wise non-IID settings.

Preprint, 2022, arXiv

Online non-convex learning for river pollution source identification

In this paper, novel gradient-based online learning algorithms are developed to investigate an important environmental application: real-time river pollution source identification, which aims at estimating the released mass, location, and time of a river pollution source based on downstream sensor data monitoring the pollution concentration. The pollution is assumed to be instantaneously released once. The problem can be formulated as a non-convex loss minimization problem in statistical learning, and our online algorithms have vectorized and adaptive step sizes to ensure high estimation accuracy in three dimensions which have different magnitudes. To keep the algorithm from getting stuck at saddle points of the non-convex loss, an escaping-from-saddle-points module and a multi-start setting are derived to further improve the estimation accuracy by searching for the global minimizer of the loss functions. This can be shown theoretically and experimentally as the $O(N)$ local regret of the algorithms and the high probability cumulative regret bound $O(N)$ under a particular error bound condition in loss functions. A real-life river pollution source identification example shows the superior performance of our algorithms compared to existing methods in terms of estimation accuracy. Managerial insights for the decision maker to use the algorithms are also provided.

Preprint, 2022, arXiv

Personalized Federated Learning With Graph

Knowledge sharing and model personalization are two key components in the conceptual framework of personalized federated learning (PFL). Existing PFL methods focus on proposing new model personalization mechanisms while simply implementing knowledge sharing by aggregating models from all clients, regardless of their relation graph. This paper aims to enhance the knowledge-sharing process in PFL by leveraging the graph-based structural information among clients. We propose a novel structured federated learning (SFL) framework to learn both the global and personalized models simultaneously using client-wise relation graphs and clients' private data. We cast SFL with graph into a novel optimization problem that can model the client-wise complex relations and graph-based structural topology in a unified framework. Moreover, in addition to using an existing relation graph, SFL could be expanded to learn the hidden relations among clients. Experiments on traffic and image benchmark datasets demonstrate the effectiveness of the proposed method. All implementation code is available on GitHub.

Preprint, 2022, arXiv

Uplink Performance of High-Mobility Cell-Free Massive MIMO-OFDM Systems

High-speed train (HST) communications with orthogonal frequency division multiplexing (OFDM) techniques have received significant attention in recent years. Besides, cell-free (CF) massive multiple-input multiple-output (MIMO) is considered a promising technology to achieve the ultimate performance limit. In this paper, we focus on the performance of CF massive MIMO-OFDM systems with both matched filter and large-scale fading decoding (LSFD) receivers in HST communications. HST communications with small cell and cellular massive MIMO-OFDM systems are also analyzed for comparison. Considering the bad effect of Doppler frequency offset (DFO) on system performance, exact closed-form expressions for uplink spectral efficiency (SE) of all systems are derived. According to the simulation results, we find that the CF massive MIMO-OFDM system with LSFD achieves both larger SE and lower SE drop percentages than other systems. In addition, increasing the number of access points (APs) and antennas per AP can effectively compensate for the performance loss from the DFO. Moreover, there is an optimal vertical distance between APs and HST to achieve the maximum SE.

Preprint, 2021, arXiv

A Low-Complexity ADMM-based Massive MIMO Detectors via Deep Neural Networks

Alternating direction method of multipliers (ADMM)-based detectors can achieve good performance in both small and large-scale multiple-input multiple-output (MIMO) systems. However, due to the difficulty of choosing the optimal penalty parameters, their performance is limited. This paper presents a deep neural network (DNN)-based massive MIMO detection method which can overcome the above limitation. It exploits the unfolding technique and learns to estimate the penalty parameters. Additionally, a computationally cheaper detector is also proposed. The proposed methods can handle higher-order modulation signals. Numerical results are presented to demonstrate the performance of the proposed methods compared with existing works.

Preprint, 2021, arXiv

Deep learning piston aberration control of fiber laser phased array by spiral phase modulation

The stochastic parallel gradient descent (SPGD) algorithm is usually employed as the control strategy for phase-locking in fiber laser phased array systems. However, the convergence speed of the SPGD algorithm will slow down as the number of array elements increases. To improve the control bandwidth, the convolutional neural network is introduced to quickly calculate the initial piston aberration in a single step. In addition, the irrationality of the commonly used Mean Square Error (MSE) evaluation function in existing convolutional neural networks is analyzed. A new evaluation function NPCD (Normalized Phase Cosine Distance) is proposed to improve the accuracy of the neural networks. The results show that the piston aberration residual is 0.005 and the power in the bucket (PIB) is 0.993 after accurate preliminary compensation, which means that the system directly enters the co-phase state. We also demonstrate the robustness and scalability by adding additional disturbance and expanding the scale of the array.

Preprint, 2021, arXiv

Episodic memory governs choices: An RNN-based reinforcement learning model for decision-making task

A typical way to study cognitive function is to record the electrical activity of animal neurons while the animals are trained to perform behavioral tasks. A key problem is that such recordings fail to capture all the relevant neurons in the animal brain. To alleviate this problem, we develop an RNN-based Actor-Critic framework, which is trained through reinforcement learning (RL) to solve two tasks analogous to the monkeys' decision-making tasks. The trained model is capable of reproducing some features of neural activity recorded from the animal brain, or some behavioral properties exhibited in animal experiments, suggesting that it can serve as a computational platform to explore other cognitive functions. Furthermore, we conduct behavioral experiments on our framework to explore an open question in neuroscience: which episodic memory in the hippocampus should be selected to ultimately govern future decisions. We find that retrieving salient events sampled from episodic memories shortens deliberation time in the decision-making process more effectively than retrieving common events. The results indicate that salient events stored in the hippocampus could be prioritized to propagate reward information, and thus allow decision-makers to learn a strategy faster.

Preprint, 2021, arXiv

Isometric Propagation Network for Generalized Zero-shot Learning

Zero-shot learning (ZSL) aims to classify images of an unseen class only based on a few attributes describing that class but no access to any training sample. A popular strategy is to learn a mapping between the semantic space of class attributes and the visual space of images based on the seen classes and their data. Thus, an unseen class image can be ideally mapped to its corresponding class attributes. The key challenge is how to align the representations in the two spaces. For most ZSL settings, the attributes for each seen/unseen class are only represented by a vector while the seen-class data provide much more information. Thus, the imbalanced supervision from the semantic and the visual space can make the learned mapping easily overfitting to the seen classes. To resolve this problem, we propose Isometric Propagation Network (IPN), which learns to strengthen the relation between classes within each space and align the class dependency in the two spaces. Specifically, IPN learns to propagate the class representations on an auto-generated graph within each space. In contrast to only aligning the resulted static representation, we regularize the two dynamic propagation procedures to be isometric in terms of the two graphs' edge weights per step by minimizing a consistency loss between them. IPN achieves state-of-the-art performance on three popular ZSL benchmarks. To evaluate the generalization capability of IPN, we further build two larger benchmarks with more diverse unseen classes and demonstrate the advantages of IPN on them.

Preprint, 2021, arXiv

PICA: A Pixel Correlation-based Attentional Black-box Adversarial Attack

The studies on black-box adversarial attacks have become increasingly prevalent due to the intractable acquisition of the structural knowledge of deep neural networks (DNNs). However, the performance of emerging attacks is negatively impacted when fooling DNNs tailored for high-resolution images. One of the explanations is that these methods usually focus on attacking the entire image, regardless of its spatial semantic information, and thereby encounter the notorious curse of dimensionality. To this end, we propose a pixel correlation-based attentional black-box adversarial attack, termed as PICA. Firstly, we take only one of every two neighboring pixels in the salient region as the target by leveraging the attentional mechanism and pixel correlation of images, such that the dimension of the black-box attack reduces. After that, a general multiobjective evolutionary algorithm is employed to traverse the reduced pixels and generate perturbations that are imperceptible by the human vision. Extensive experimental results have verified the effectiveness of the proposed PICA on the ImageNet dataset. More importantly, PICA is computationally more efficient to generate high-resolution adversarial examples compared with the existing black-box attacks.
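The dimension-reduction step in this abstract, keeping only one of every two neighboring pixels inside the salient region, can be sketched as below. This is an illustration under stated assumptions and not the authors' procedure: the saliency map is taken as a given binary mask, and a checkerboard pattern is used as one hypothetical way of choosing between neighboring pixels. The function name is hypothetical.

```python
def select_target_pixels(salient_mask):
    """Return (row, col) coordinates of salient pixels on a checkerboard pattern.

    salient_mask is a 2D list of booleans, assumed to come from an attention
    or saliency map. The checkerboard rule keeps every other pixel, so
    horizontally and vertically adjacent pixels are never both selected.
    """
    targets = []
    for r, row in enumerate(salient_mask):
        for c, is_salient in enumerate(row):
            if is_salient and (r + c) % 2 == 0:
                targets.append((r, c))
    return targets


if __name__ == "__main__":
    mask = [
        [False, False, False],
        [True, True, True],
        [True, True, True],
    ]
    print(select_target_pixels(mask))
```

Restricting the search to these coordinates shrinks the number of decision variables that the evolutionary algorithm has to handle, which is how the abstract motivates the reduction.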

Preprint, 2021, arXiv

Toggling Near-field Directionality via Polarization Control of Surface Waves

Directional excitation of guidance modes is central to many applications ranging from light harvesting, optical information processing to quantum optical technology. Of paramount interest, especially, the active control of near-field directionality provides a new paradigm for the real-time on-chip manipulation of light. Here we find that for a given dipolar source, its near-field directionality can be toggled efficiently via tailoring the polarization of surface waves that are excited, for example, via tuning the chemical potential of graphene in a graphene-metasurface waveguide. This finding enables a feasible scheme for the active near-field directionality. Counterintuitively, we reveal that this scheme can transform a circular electric/magnetic dipole into a Huygens dipole in the near-field coupling. Moreover, for Janus dipoles, this scheme enables us to actively flip their near-field coupling and non-coupling faces.

Preprint, 2020, arXiv

A Universal Representation Transformer Layer for Few-Shot Image Classification

Few-shot classification aims to recognize unseen classes when presented with only a small number of samples. We consider the problem of multi-domain few-shot image classification, where unseen classes and examples come from diverse data sources. This problem has seen growing interest and has inspired the development of benchmarks such as Meta-Dataset. A key challenge in this multi-domain setting is to effectively integrate the feature representations from the diverse set of training domains. Here, we propose a Universal Representation Transformer (URT) layer, that meta-learns to leverage universal features for few-shot classification by dynamically re-weighting and composing the most appropriate domain-specific representations. In experiments, we show that URT sets a new state-of-the-art result on Meta-Dataset. Specifically, it achieves top-performance on the highest number of data sources compared to competing methods. We analyze variants of URT and present a visualization of the attention score heatmaps that sheds light on how the model performs cross-domain generalization. Our code is available at https://github.com/liulu112601/URT.

preprint2020arXiv

Aspect and Opinion Aware Abstractive Review Summarization with Reinforced Hard Typed Decoder

In this paper, we study abstractive review summarization. Observing that review summaries often consist of aspect words, opinion words and context words, we propose a two-stage reinforcement learning approach, which first predicts the output word type from the three types, and then leverages the predicted word type to generate the final word distribution. Experimental results on two Amazon product review datasets demonstrate that our method can consistently outperform several strong baseline approaches based on ROUGE scores.

preprint2020arXiv

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Modeling multivariate time series has long been a subject that has attracted researchers from a diverse range of fields including economics, finance, and traffic. A basic assumption behind multivariate time series forecasting is that its variables depend on one another but, upon looking closely, it is fair to say that existing methods fail to fully exploit latent spatial dependencies between pairs of variables. In recent years, meanwhile, graph neural networks (GNNs) have shown high capability in handling relational dependencies. GNNs require well-defined graph structures for information propagation which means they cannot be applied directly for multivariate time series where the dependencies are not known in advance. In this paper, we propose a general graph neural network framework designed specifically for multivariate time series data. Our approach automatically extracts the uni-directed relations among variables through a graph learning module, into which external knowledge like variable attributes can be easily integrated. A novel mix-hop propagation layer and a dilated inception layer are further proposed to capture the spatial and temporal dependencies within the time series. The graph learning, graph convolution, and temporal convolution modules are jointly learned in an end-to-end framework. Experimental results show that our proposed model outperforms the state-of-the-art baseline methods on 3 of 4 benchmark datasets and achieves on-par performance with other approaches on two traffic datasets which provide extra structural information.

preprint2020arXiv

Interpretable Rumor Detection in Microblogs by Attending to User Interactions

We address rumor detection by learning to differentiate between the community's response to real and fake claims in microblogs. Existing state-of-the-art models are based on tree models that model conversational trees. However, in social media, a user posting a reply might be replying to the entire thread rather than to a specific user. We propose a post-level attention model (PLAN) to model long distance interactions between tweets with the multi-head attention mechanism in a transformer network. We investigated variants of this model: (1) a structure aware self-attention model (StA-PLAN) that incorporates tree structure information in the transformer network, and (2) a hierarchical token and post-level attention model (StA-HiTPLAN) that learns a sentence representation with token-level self-attention. To the best of our knowledge, we are the first to evaluate our models on two rumor detection data sets: the PHEME data set as well as the Twitter15 and Twitter16 data sets. We show that our best models outperform current state-of-the-art models for both data sets. Moreover, the attention mechanism allows us to explain rumor detection predictions at both token-level and post-level.

preprint2020arXiv

Multi-level Head-wise Match and Aggregation in Transformer for Textual Sequence Matching

Transformer has been successfully applied to many natural language processing tasks. However, for textual sequence matching, simple matching between the representation of a pair of sequences might bring in unnecessary noise. In this paper, we propose a new approach to sequence pair matching with Transformer, by learning head-wise matching representations on multiple levels. Experiments show that our proposed approach can achieve new state-of-the-art performance on multiple tasks that rely only on pre-computed sequence-vector-representation, such as SNLI, MNLI-match, MNLI-mismatch, QQP, and SQuAD-binary.

preprint2020arXiv

Self-Attention Enhanced Patient Journey Understanding in Healthcare System

Understanding patients' journeys in the healthcare system is a fundamental prerequisite task for a broad range of AI-based healthcare applications. This task aims to learn an informative representation that can comprehensively encode hidden dependencies among medical events and their inner entities, so that the encoding outputs can greatly benefit downstream application-driven tasks. A patient journey is a sequence of electronic health records (EHRs) over time that is organized at multiple levels: patient, visits and medical codes. The key challenge of patient journey understanding is to design an effective encoding mechanism that can properly handle this multi-level structured patient journey data, with its temporally sequential visits and sets of medical codes. This paper proposes a novel self-attention mechanism that can simultaneously capture the contextual and temporal relationships hidden in patient journeys. A multi-level self-attention network (MusaNet) is specifically designed to learn representations of patient journeys, which are typically long sequences of activities. MusaNet is trained in an end-to-end manner using training data derived from EHRs. We evaluated the efficacy of our method on two medical application tasks with real-world benchmark datasets. The results demonstrate that the proposed MusaNet produces higher-quality representations than state-of-the-art baseline methods. The source code is available at https://github.com/xueping/MusaNet.

preprint2019arXiv

Chaos Phase Induced Mass-producible Monolayer Two-dimensional Material

The crystal phase is well studied and presents a periodic atomic arrangement in a three-dimensional lattice, but the "amorphous phase" is poorly understood. Here, by starting from a cage-like bicyclocalix[2]arene[2]triazines building block, a brand-new 2D MOF is constructed with extremely weak interlaminar interaction between two adjacent 2D-crystal layers. Inter-layer slip happens under external disturbance and leads to the loss of periodicity in one dimension of the crystal lattice, resulting in an interim phase between the crystal and amorphous phases: the chaos phase, which is non-periodic at the microscopic scale but ordered at the mesoscopic scale. This chaos phase 2D MOF is a disordered self-assembly of a black-phosphorus-like 3D layer, which has excellent mechanical strength and a thickness of 1.15 nm. The bulk 2D-MOF material can be readily exfoliated into monolayer nanosheets at gram scale with unprecedented evenness and homogeneity, as well as previously unattained lateral size (>10 um), which represent the first mass-producible monolayer 2D material and can form wafer-scale films on a substrate.

preprint2019arXiv

Learning Graph Embedding with Adversarial Training Methods

Graph embedding aims to transfer a graph into vectors to facilitate subsequent graph analytics tasks like link prediction and graph clustering. Most approaches on graph embedding focus on preserving the graph structure or minimizing the reconstruction errors for graph data. They have mostly overlooked the embedding distribution of the latent codes, which unfortunately may lead to inferior representation in many cases. In this paper, we present a novel adversarially regularized framework for graph embedding. By employing the graph convolutional network as an encoder, our framework embeds the topological information and node content into a vector representation, from which a graph decoder is further built to reconstruct the input graph. The adversarial training principle is applied to enforce our latent codes to match a prior Gaussian or Uniform distribution. Based on this framework, we derive two variants of adversarial models, the adversarially regularized graph autoencoder (ARGA) and its variational version, adversarially regularized variational graph autoencoder (ARVGA), to learn the graph embedding effectively. We also exploit other potential variations of ARGA and ARVGA to get a deeper understanding on our designs. Experimental results compared among twelve algorithms for link prediction and twenty algorithms for graph clustering validate our solutions.

preprint2019arXiv

Learning Private Neural Language Modeling with Attentive Aggregation

Mobile keyboard suggestion is typically regarded as a word-level language modeling problem. Centralized machine learning techniques require massive amounts of collected user data for training, which may raise privacy concerns for users' sensitive personal typing data. Federated learning (FL) provides a promising approach to learning private language modeling for intelligent personalized keyboard suggestion by training models in distributed clients rather than training in a central server. To obtain a global model for prediction, existing FL algorithms simply average the client models and ignore the importance of each client during model aggregation. Furthermore, there is no optimization for learning a well-generalized global model on the central server. To solve these problems, we propose a novel model aggregation with the attention mechanism considering the contribution of client models to the global model, together with an optimization technique during server aggregation. Our proposed attentive aggregation method minimizes the weighted distance between the server model and client models through iterative parameter updating while attending to the distance between the server model and client models. Through experiments on two popular language modeling datasets and a social media dataset, our proposed method outperforms its counterparts in terms of perplexity and communication cost in most settings of comparison.

preprint2019arXiv

Temporal Self-Attention Network for Medical Concept Embedding

In longitudinal electronic health records (EHRs), the event records of a patient are distributed over a long period of time and the temporal relations between the events reflect sufficient domain knowledge to benefit prediction tasks such as the rate of inpatient mortality. Medical concept embedding is a feature extraction method that transforms a set of medical concepts with a specific time stamp into a vector, which is then fed into a supervised learning algorithm. The quality of the embedding significantly determines the learning performance over the medical data. In this paper, we propose a medical concept embedding method based on applying a self-attention mechanism to represent each medical concept. We propose a novel attention mechanism which captures the contextual information and temporal relationships between medical concepts. A light-weight neural net, "Temporal Self-Attention Network (TeSAN)", is then proposed to learn medical concept embedding based solely on the proposed attention mechanism. To test the effectiveness of our proposed methods, we have conducted clustering and prediction tasks on two public EHR datasets, comparing TeSAN against five state-of-the-art embedding methods. The experimental results demonstrate that the proposed TeSAN model is superior to all the compared methods. To the best of our knowledge, this work is the first to exploit temporal self-attentive relations between medical events.

preprint2016arXiv

A Compare-Aggregate Model for Matching Text Sequences

Many NLP tasks including machine comprehension, answer selection and text entailment require the comparison between sequences. Matching the important units between sequences is a key to solve these problems. In this paper, we present a general "compare-aggregate" framework that performs word-level matching followed by aggregation using Convolutional Neural Networks. We particularly focus on the different comparison functions we can use to match two vectors. We use four different datasets to evaluate the model. We find that some simple comparison functions based on element-wise operations can work better than standard neural network and neural tensor network.
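As a rough illustration of what "simple comparison functions based on element-wise operations" can look like, the sketch below compares two word vectors by squared element-wise difference and by element-wise product. The function names and the exact forms are assumptions made for illustration and are not taken from the abstract; the aggregation step with a convolutional network is omitted.

```python
def sub_compare(a, h):
    """Element-wise squared difference between two equal-length vectors."""
    return [(x - y) ** 2 for x, y in zip(a, h)]


def mult_compare(a, h):
    """Element-wise product of two equal-length vectors."""
    return [x * y for x, y in zip(a, h)]


# Hypothetical 3-dimensional word vectors for one matched pair of words.
a = [1.0, 2.0, 0.5]
h = [1.0, 0.0, 0.5]

sub_result = sub_compare(a, h)    # small values where the vectors agree
mult_result = mult_compare(a, h)  # large values where both vectors are large
```

Both functions return a vector of the same dimension as the inputs, so the per-word comparison results can be stacked into a sequence and passed to whatever aggregation layer follows.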

preprint2016arXiv

Large-area, uniform and low-cost dual-mode plasmonic naked-eye colorimetry and SERS sensor with handheld Raman spectrometer

We demonstrated a highly sensitive, wafer-scale, highly uniform plasmonic nano-mushroom substrate based on plastic for naked-eye plasmonic colorimetry and surface enhanced Raman spectroscopy (SERS). We gave it the name FlexBrite. The dual-mode functionality of FlexBrite allows for label-free qualitative analysis by SERS with an enhancement factor (EF) of 10^8 and label-free quantitative analysis by naked-eye colorimetry with a sensitivity of 611 nm RIU^-1. The SERS EF of FlexBrite in the wet state was found to be 4.81 x 10^8, 7 times stronger than in the dry state, making FlexBrite suitable for aqueous environments such as microfluidic systems. The label-free detection of the biotin-streptavidin interaction by both SERS and colorimetry was demonstrated with FlexBrite. The detection of trace amounts of the narcotic drug methamphetamine in drinking water by SERS was implemented with a handheld Raman spectrometer and FlexBrite. This plastic-based dual-mode nano-mushroom substrate has the potential to be used as a sensing platform for easy and fast analysis in chemical and biological assays.

preprint2016arXiv

Learning Natural Language Inference with LSTM

Natural language inference (NLI) is a fundamentally important task in natural language processing that has many applications. The recently released Stanford Natural Language Inference (SNLI) corpus has made it possible to develop and evaluate learning-centered methods such as deep neural networks for natural language inference (NLI). In this paper, we propose a special long short-term memory (LSTM) architecture for NLI. Our model builds on top of a recently proposed neural attention model for NLI but is based on a significantly different idea. Instead of deriving sentence embeddings for the premise and the hypothesis to be used for classification, our solution uses a match-LSTM to perform word-by-word matching of the hypothesis with the premise. This LSTM is able to place more emphasis on important word-level matching results. In particular, we observe that this LSTM remembers important mismatches that are critical for predicting the contradiction or the neutral relationship label. On the SNLI corpus, our model achieves an accuracy of 86.1%, outperforming the state of the art.

preprint2016arXiv

Lithography-Free, Low-Cost Method for Improving Photodiode Performance by Etching Silicon Nanocones as Antireflection Layer

A three-step process has been demonstrated to improve the performance of a photodiode by creating a nanocone forest on its surface as an antireflection layer. This high-throughput, low-cost process has been shown to decrease the reflectivity by 66.1%, enhance the quantum efficiency by 27%, and increase the responsivity by 25.7%. This low-cost manufacturing process can be applied to increase the responsivity of silicon-based photonic devices.

preprint2016arXiv

Machine Comprehension Using Match-LSTM and Answer Pointer

Machine comprehension of text is an important problem in natural language processing. A recently released dataset, the Stanford Question Answering Dataset (SQuAD), offers a large number of real questions and their answers created by humans through crowdsourcing. SQuAD provides a challenging testbed for evaluating machine comprehension algorithms, partly because compared with previous datasets, in SQuAD the answers do not come from a small set of candidate answers and they have variable lengths. We propose an end-to-end neural architecture for the task. The architecture is based on match-LSTM, a model we proposed previously for textual entailment, and Pointer Net, a sequence-to-sequence model proposed by Vinyals et al. (2015) to constrain the output tokens to be from the input sequences. We propose two ways of using Pointer Net for our task. Our experiments show that both of our two models substantially outperform the best results obtained by Rajpurkar et al. (2016) using logistic regression and manually crafted features.

preprint2016arXiv

Unified Framework for the Effective Rate Analysis of Wireless Communication Systems over MISO Fading Channels

This paper proposes a unified framework for the effective rate analysis over arbitrary correlated and not necessarily identical multiple-input single-output (MISO) fading channels, which uses a moment generating function (MGF) based approach and the H transform representation. The proposed framework has the potential to simplify the cumbersome analysis procedure compared to the probability density function (PDF) based approach. Moreover, the effective rates over two specific fading scenarios are investigated, namely independent but not necessarily identically distributed (i.n.i.d.) MISO hyper Fox's H fading channels and arbitrary correlated generalized K fading channels. The exact analytical representations for these two scenarios are also presented. By substituting corresponding parameters, the effective rates in various practical fading scenarios, such as Rayleigh, Nakagami-m, Weibull/Gamma and generalized K fading channels, are readily available. In addition, asymptotic approximations are provided for the proposed H transform and MGF based approach as well as for the effective rate over i.n.i.d. MISO hyper Fox's H fading channels. Simulations under various fading scenarios are also presented, which support the validity of the proposed method.

preprint2015arXiv

Achieving Autonomous Compressive Spectrum Sensing for Cognitive Radios

Compressive sensing (CS) technologies present many advantages over other existing approaches for implementing wideband spectrum sensing in cognitive radios (CRs), such as reduced sampling rate and computational complexity. However, there are two significant challenges: 1) choosing an appropriate number of sub-Nyquist measurements, and 2) deciding when to terminate the greedy recovery algorithm that reconstructs wideband spectrum. In this paper, an autonomous compressive spectrum sensing (ACSS) framework is presented that enables a CR to automatically choose the number of measurements while guaranteeing the wideband spectrum recovery with a small predictable recovery error. This is realized by the proposed measurement infrastructure and the validation technique. The proposed ACSS can find a good spectral estimate with high confidence by using only a small testing subset in both noiseless and noisy environments. Furthermore, a sparsity-aware spectral recovery algorithm is proposed to recover the wideband spectrum without requiring knowledge of the instantaneous spectral sparsity level. Such an algorithm bridges the gap between CS theory and practical spectrum sensing. Simulation results show that ACSS can not only recover the spectrum using an appropriate number of measurements, but can also considerably improve the spectral recovery performance compared with existing CS approaches. The proposed recovery algorithm can autonomously adopt a proper number of iterations, therefore solving the problems of under-fitting or over-fitting which commonly exist in most greedy recovery algorithms.

preprint2015arXiv

Bounds and Constructions for $\overline{3}$-Separable Codes with Length $3$

Separable codes were introduced to provide protection against illegal redistribution of copyrighted multimedia material. Let $\mathcal{C}$ be a code of length $n$ over an alphabet of $q$ letters. The descendant code ${\sf desc}(\mathcal{C}_0)$ of $\mathcal{C}_0 = \{{\bf c}_1, {\bf c}_2, \ldots, {\bf c}_t\} \subseteq {\mathcal{C}}$ is defined to be the set of words ${\bf x} = (x_1, x_2, \ldots,x_n)^T$ such that $x_i \in \{c_{1,i}, c_{2,i}, \ldots, c_{t,i}\}$ for all $i=1, \ldots, n$, where ${\bf c}_j=(c_{j,1},c_{j,2},\ldots,c_{j,n})^T$. $\mathcal{C}$ is a $\overline{t}$-separable code if for any two distinct $\mathcal{C}_1, \mathcal{C}_2 \subseteq \mathcal{C}$ with $|\mathcal{C}_1| \le t$, $|\mathcal{C}_2| \le t$, we always have ${\sf desc}(\mathcal{C}_1) \neq {\sf desc}(\mathcal{C}_2)$. Let $M(\overline{t},n,q)$ denote the maximal possible size of such a separable code. In this paper, an upper bound on $M(\overline{3},3,q)$ is derived by considering an optimization problem related to a partial Latin square, and then two constructions for $\overline{3}$-SC$(3,M,q)$s are provided by means of perfect hash families and Steiner triple systems.
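The definitions above can be checked by brute force on very small codes. The following is a minimal sketch, not the paper's method: the helper names `desc` and `is_separable` and both example codes are hypothetical, codewords are plain tuples, and the search enumerates every non-empty sub-code of size at most $t$, so it only scales to toy inputs.

```python
from itertools import combinations, product


def desc(subcode):
    """Descendant code: all words whose i-th symbol occurs in
    position i of at least one codeword of the sub-code."""
    columns = [set(col) for col in zip(*subcode)]
    return frozenset(product(*columns))


def is_separable(code, t):
    """Return True if distinct non-empty sub-codes of size <= t
    always have distinct descendant codes (brute-force check)."""
    seen = {}
    for size in range(1, t + 1):
        for sub in combinations(code, size):
            d = desc(sub)
            key = frozenset(sub)
            if d in seen and seen[d] != key:
                return False  # two different sub-codes share a descendant code
            seen[d] = key
    return True


# Hypothetical toy codes (codewords must be distinct tuples).
code_ok = [(0, 0, 0), (1, 1, 1), (2, 2, 2)]   # length 3, q = 3
code_bad = [(0, 0), (0, 1), (1, 0), (1, 1)]   # length 2, q = 2

ok_result = is_separable(code_ok, 3)
bad_result = is_separable(code_bad, 2)
```

In `code_bad`, the sub-codes {(0,0), (1,1)} and {(0,1), (1,0)} both generate the full set {0,1} x {0,1}, which is exactly the kind of collision the separability condition rules out.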

preprint2014arXiv

Black Silicon Solar Thin-film Microcells Integrating Top Nanocone Structures for Broadband and Omnidirectional Light-Trapping

Recently developed classes of monocrystalline silicon solar microcells (u-cell) can be assembled into modules with characteristics (i.e., mechanically flexible forms, compact concentrator designs, and high-voltage outputs) that would be impossible to achieve using conventional, wafer-based approaches. In this paper, we describe a highly dense, uniform and non-periodic nanocone forest structure of black silicon (bSi) created on optically-thin (30 um) u-cells for broadband and omnidirectional light-trapping with a lithography-free and high-throughput plasma texturizing process. With optimized plasma etching conditions and a silicon nitride passivation layer, black silicon u-cells, when embedded in a polymer waveguiding layer, display dramatic increases of as much as 65.7% in short circuit current, as compared to a bare silicon device. The conversion efficiency increases from 8% to 11.5% with a small drop in open circuit voltage and fill factor.

preprint2014arXiv

Codes with the Identifiable Parent Property for Multimedia Fingerprinting

Let ${\cal C}$ be a $q$-ary code of length $n$ and size $M$, and ${\cal C}(i) = \{{\bf c}(i) \ | \ {\bf c}=({\bf c}(1), {\bf c}(2), \ldots, {\bf c}(n))^{T} \in {\cal C}\}$ be the set of $i$th coordinates of ${\cal C}$. The descendant code of a sub-code ${\cal C}^{'} \subseteq {\cal C}$ is defined to be ${\cal C}^{'}(1) \times {\cal C}^{'}(2) \times \cdots \times {\cal C}^{'}(n)$. In this paper, we introduce a multimedia analogue of codes with the identifiable parent property (IPP), called multimedia IPP codes or $t$-MIPPC$(n, M, q)$, so that given the descendant code of any sub-code ${\cal C}^{'}$ of a multimedia $t$-IPP code ${\cal C}$, one can always identify, as IPP codes do in the generic digital scenario, at least one codeword in ${\cal C}^{'}$. We first derive a general upper bound on the size $M$ of a multimedia $t$-IPP code, and then investigate multimedia $3$-IPP codes in more detail. We characterize a multimedia $3$-IPP code of length $2$ in terms of a bipartite graph and a generalized packing, respectively. By means of these combinatorial characterizations, we further derive a tight upper bound on the size of a multimedia $3$-IPP code of length $2$, and construct several infinite families of (asymptotically) optimal multimedia $3$-IPP codes of length $2$.

preprint2014arXiv

Multimedia IPP Codes with Efficient Tracing

Binary multimedia identifiable parent property codes (binary $t$-MIPPCs) are used in multimedia fingerprinting schemes where the identification of users taking part in the averaging collusion attack to illegally redistribute content is required. In this paper, we first introduce a binary strong multimedia identifiable parent property code (binary $t$-SMIPPC) whose tracing algorithm is more efficient than that of a binary $t$-MIPPC. Then a composition construction for binary $t$-SMIPPCs from $q$-ary $t$-SMIPPCs is provided. Several infinite series of optimal $q$-ary $t$-SMIPPCs of length $2$ with $t = 2, 3$ are derived from the relationships among $t$-SMIPPCs and other fingerprinting codes, such as $\overline{t}$-separable codes and $t$-MIPPCs. Finally, combinatorial properties of $q$-ary $2$-SMIPPCs of length $3$ are investigated, and optimal $q$-ary $2$-SMIPPCs of length $3$ with $q \equiv 0, 1, 2, 5 \pmod 6$ are constructed.

preprint2014arXiv

Strongly Separable Codes

Binary $t$-frameproof codes ($t$-FPCs) are used in multimedia fingerprinting schemes where the identification of authorized users taking part in the averaging collusion attack is required. In this paper, a binary strongly $\bar{t}$-separable code ($\bar{t}$-SSC) is introduced to improve such a scheme based on a binary $t$-FPC. A binary $\bar{t}$-SSC has the same traceability as a binary $t$-FPC but has more codewords than a binary $t$-FPC. A composition construction for binary $\bar{t}$-SSCs from $q$-ary $\bar{t}$-SSCs is described, which stimulates the research on $q$-ary $\bar{t}$-SSCs with short length. Several infinite series of optimal $q$-ary $\bar{2}$-SSCs of length $2$ are derived from the fact that a $q$-ary $\bar{2}$-SSC of length $2$ is equivalent to a $q$-ary $\bar{2}$-separable code of length $2$. Combinatorial properties of $q$-ary $\bar{2}$-SSCs of length $3$ are investigated, and a construction for $q$-ary $\bar{2}$-SSCs of length $3$ is provided. These $\bar{2}$-SSCs of length $3$ have more than $12.5\%$ more codewords than $2$-FPCs of length $3$ could have.

preprint2013arXiv

Cellphone based Portable Bacteria Pre-Concentrating microfluidic Sensor and Impedance Sensing System

Portable low-cost sensors and sensing systems for the identification and quantitative measurement of bacteria in field water are critical in preventing drinking water from being contaminated by bacteria. In this article, we reported the design, fabrication and testing of a low-cost, miniaturized and sensitive bacteria sensor based on electrical impedance spectroscopy method using a smartphone as the platform. Our design of microfluidics enabled the pre-concentration of the bacteria which lowered the detection limit to 10 bacterial cells per milliliter. We envision that our demonstrated smartphone-based sensing system will realize highly-sensitive and rapid in-field quantification of multiple species of bacteria and pathogens.

preprint2013arXiv

Multi-rate Sub-Nyquist Spectrum Sensing in Cognitive Radios

Wideband spectrum sensing is becoming increasingly important to cognitive radio (CR) systems for exploiting spectral opportunities. This paper introduces a novel multi-rate sub-Nyquist spectrum sensing (MS3) system that implements cooperative wideband spectrum sensing in a CR network. MS3 can detect the wideband spectrum using partial measurements without reconstructing the full frequency spectrum. Sub-Nyquist sampling rates are adopted in sampling channels for wrapping the frequency spectrum onto itself. This significantly reduces sensing requirements of CR. The effects of sub-Nyquist sampling are considered, and the performance of multi-channel sub-Nyquist samplings is analyzed. To improve its detection performance, sub-Nyquist sampling rates are chosen to be different such that the numbers of samples are consecutive prime numbers. Furthermore, when the received signals at CRs are faded or shadowed, the performance of MS3 is analytically evaluated. Numerical results show that the proposed system can significantly enhance the wideband spectrum sensing performance while requiring low computational and implementation complexities.
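One way to see why distinct prime sample counts help is a simplified folding model, sketched below. It assumes (as an idealization not stated in the abstract) that a channel taking `m` samples maps spectrum bin `k` to bin `k % m`, and it ignores noise, fading and the actual MS3 detection rule. All names and numbers are hypothetical.

```python
def folded_bins(occupied, m):
    """Bins seen by a channel with m samples: each occupied bin k folds to k % m."""
    return sorted({k % m for k in occupied})


def candidates(folded, m, n):
    """All full-resolution bins in range(n) that could explain the folded bins."""
    return {k for k in range(n) if k % m in folded}


n = 100                 # hypothetical number of full-resolution bins
occupied = {7, 42}      # hypothetical occupied bins
primes = [11, 13, 17]   # consecutive primes used as per-channel sample counts

# A single channel is ambiguous: many bins fold onto the same positions.
single_channel = candidates(folded_bins(occupied, primes[0]), primes[0], n)

# Intersecting the candidate sets of all channels removes the ambiguity here.
consistent = set(range(n))
for m in primes:
    consistent &= candidates(folded_bins(occupied, m), m, n)
```

Because the sample counts are pairwise coprime, two different bins below their product cannot fold onto the same position in every channel, which is why the intersection narrows to the occupied bins in this toy case.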

preprint2013arXiv

Practical Analysis of Codebook Design and Frequency Offset Estimation for Virtual-MIMO Systems

A virtual multiple-input multiple-output (MIMO) wireless system using the receiver-side cooperation with the compress-and-forward (CF) protocol, is an alternative to a point-to-point MIMO system, when a single receiver is not equipped with multiple antennas. It is evident that the practicality of CF cooperation will be greatly enhanced if an efficient source coding technique can be used at the relay. It is even more desirable that CF cooperation should not be unduly sensitive to carrier frequency offsets (CFOs). This paper presents a practical study of these two issues. Firstly, codebook designs of the Voronoi vector quantization (VQ) and the tree-structure vector quantization (TSVQ) to enable CF cooperation at the relay are described. A comparison in terms of the codebook design and encoding complexity is analyzed. It is shown that the TSVQ is much simpler to design and operate, and can achieve a favorable performance-complexity tradeoff. Furthermore, this paper demonstrates that CFO can lead to significant performance degradation for the virtual MIMO system. To overcome this, it is proposed to maintain clock synchronization and jointly estimate the CFO between the relay and the destination. This approach is shown to provide a significant performance improvement.

preprint2013arXiv

Relaying Technologies for Smart Grid Communications

Wireless technologies can support a broad range of smart grid applications including advanced metering infrastructure (AMI) and demand response (DR). However, there are many formidable challenges when wireless technologies are applied to the smart grid, e.g., the tradeoffs between wireless coverage and capacity, the high reliability requirement for communication, and limited spectral resources. Relaying has emerged as one of the most promising candidate solutions for addressing these issues. In this article, an introduction to various relaying strategies is presented, together with a discussion of how to improve spectral efficiency and coverage in relay-based information and communications technology (ICT) infrastructure for smart grid applications. Special attention is paid to the use of unidirectional relaying, collaborative beamforming, and bidirectional relaying strategies.

preprint2013arXiv

Wideband Spectrum Sensing with Sub-Nyquist Sampling in Cognitive Radios

Multi-rate asynchronous sub-Nyquist sampling (MASS) is proposed for wideband spectrum sensing. Corresponding spectral recovery conditions are derived and the probability of successful recovery is given. Compared to previous approaches, MASS offers lower sampling rate, and is an attractive approach for cognitive radio networks.

preprint2010arXiv

Flipped SU(5) X U(1)_X Models from F-Theory

We systematically construct flipped SU(5) X U(1)_X models without and with bulk vector-like particles from F-theory. To realize the decoupling scenario, we introduce sets of vector-like particles in complete SU(5) X U(1) multiplets at the TeV scale, or at the intermediate scale, or at the TeV scale and high scale. To avoid the Landau pole problem for the gauge couplings, we can only introduce five sets of vector-like particles around the TeV scale. These vector-like particles can couple to the Standard Model singlet fields, and obtain suitable masses by the Higgs mechanism. We study gauge coupling unification in detail. We show that the U(1)_X flux contributions to the gauge couplings preserve the SU(5) X U(1)_X gauge coupling unification. We calculate the SU(3)_C X SU(2)_L unification scales, and the SU(5) X U(1)_X unification scales and unified couplings. In most of our models, the high-scale or bulk vector-like particles can be considered as string-scale threshold corrections since their masses are close to the string scale. Furthermore, we discuss the phenomenological consequences of our models. In particular, in the models with TeV-scale vector-like particles, the vector-like particles can be observed at the Large Hadron Collider, the proton decay is within the reach of the future Hyper-Kamiokande experiment, the lightest CP-even Higgs boson mass can be increased, the hybrid inflation can be naturally realized, and the correct cosmic primordial density fluctuations can be generated.

preprint2009arXiv

$b \to s$ Transitions in Family-dependent $U(1)^\prime$ Models

We analyze flavor-changing-neutral-current (FCNC) effects in the $b\to s$ transitions that are induced by family non-universal $U(1)'$ gauge symmetries. After systematically developing the necessary formalism, we present a correlated analysis for the $ΔB =1, 2$ processes. We adopt a model-independent approach in which we only require family-universal charges for the first and second generations and small fermion mixing angles. We analyze the constraints on the resulting parameter space from $B_s - \bar B_s$ mixing and the time-dependent CP asymmetries of the penguin-dominated $B_d \to (π, ϕ, η', ρ, ω, f_0)K_S$ decays. Our results indicate that the currently observed discrepancies in some of these modes with respect to the Standard Model predictions can be consistently accommodated within this general class of models.

preprint2009arXiv

F-SU(5)

We construct three flipped SU(5) X U(1)_X models from F-theory, and consider two such models from free fermionic string model building. To achieve the decoupling scenario in F-theory models and the string-scale gauge coupling unification in free fermionic models, we introduce vector-like particles at the TeV scale that can be observed at the Large Hadron Collider. We study gauge coupling unification, and find that proton decay is within the reach of the future Hyper-Kamiokande experiment. In these models, the doublet-triplet splitting problem and monopole problem can be solved, the neutrino masses and mixings can be explained via the double seesaw or seesaw mechanism, the observed baryon asymmetry can be obtained through leptogenesis, the hybrid inflation can be realized, and the correct cosmic primordial density fluctuations can be generated.

preprint2009arXiv

Phenomenological Implications of Supersymmetric Family Non-universal U(1)-prime Models

We construct a class of anomaly-free supersymmetric U(1)' models that are characterized by family non-universal U(1)' charges motivated by E_6 embeddings. The family non-universality arises from an interchange of the standard roles of the two SU(5) 5* representations within the 27 of E_6 for the third generation. We analyze U(1)' and electroweak symmetry breaking and present the particle mass spectrum. The models, which include additional Higgs multiplets and exotic quarks at the TeV scale, result in specific patterns of flavor-changing neutral currents in the b to s transitions that can accommodate the presently observed deviations in this sector from the SM predictions.

preprint2002arXiv

N=2 6-dimensional Supersymmetric $E_6$ Breaking

We study the $N=2$ supersymmetric $E_6$ models on the 6-dimensional space-time where the supersymmetry and gauge symmetry can be broken by the discrete symmetry. On the space-time $M^4\times S^1/(Z_2\times Z_2') \times S^1/(Z_2\times Z_2')$, for the zero modes, we obtain the 4-dimensional $N=1$ supersymmetric models with gauge groups $SU(3)\times SU(2) \times SU(2) \times U(1)^2$, $SU(4)\times SU(2) \times SU(2) \times U(1)$, and $SU(3)\times SU(2) \times U(1)^3$ with one extra pair of Higgs doublets from the vector multiplet. In addition, considering that the extra space manifold is the annulus $A^2$ and disc $D^2$, we list all the constraints on constructing the 4-dimensional $N=1$ supersymmetric $SU(3)\times SU(2) \times U(1)^3$ models for the zero modes, and give the simplest model with $Z_9$ symmetry. We also comment on the extra gauge symmetry breaking and its generalization.

56 published items

Benchmarking Continuous Dynamic Multi-Objective Optimization: Survey and Generalized Test Suite

SfMamba: Efficient Source-Free Domain Adaptation via Selective Scan Modeling

OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching

Attention-Guided Black-box Adversarial Attacks with Large-Scale Multiobjective Evolutionary Optimization

Beyond Low-pass Filtering: Graph Convolutional Networks with Automatic Filtering

Boosting Facial Expression Recognition by A Semi-Supervised Progressive Teacher

Disentangling Identity and Pose for Facial Expression Recognition

Exploring and Adapting Chinese GPT to Pinyin Input Method

FedNoiL: A Simple Two-Level Sampling Method for Federated Learning with Noisy Labels

FedProto: Federated Prototype Learning across Heterogeneous Clients

Gelfand-Kirillov dimensions and Reducibility of scalar type generalized Verma modules for classical Lie algebras

Joint Trajectory Design and User Scheduling of Aerial Cognitive Radio Networks

Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification

On Secure NOMA-CDRT Systems with Physical Layer Network Coding

On the Convergence of Clustered Federated Learning

Online non-convex learning for river pollution source identification

Personalized Federated Learning With Graph

Uplink Performance of High-Mobility Cell-Free Massive MIMO-OFDM Systems

A Low-Complexity ADMM-based Massive MIMO Detectors via Deep Neural Networks

Deep learning piston aberration control of fiber laser phased array by spiral phase modulation

Episodic memory governs choices: An RNN-based reinforcement learning model for decision-making task

Isometric Propagation Network for Generalized Zero-shot Learning

PICA: A Pixel Correlation-based Attentional Black-box Adversarial Attack

Toggling Near-field Directionality via Polarization Control of Surface Waves

A Universal Representation Transformer Layer for Few-Shot Image Classification

Aspect and Opinion Aware Abstractive Review Summarization with Reinforced Hard Typed Decoder

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Interpretable Rumor Detection in Microblogs by Attending to User Interactions

Multi-level Head-wise Match and Aggregation in Transformer for Textual Sequence Matching

Self-Attention Enhanced Patient Journey Understanding in Healthcare System

Chaos Phase Induced Mass-producible Monolayer Two-dimensional Material

Learning Graph Embedding with Adversarial Training Methods

Learning Private Neural Language Modeling with Attentive Aggregation

Temporal Self-Attention Network for Medical Concept Embedding

A Compare-Aggregate Model for Matching Text Sequences

Large-area, uniform and low-cost dual-mode plasmonic naked-eye colorimetry and SERS sensor with handheld Raman spectrometer

Learning Natural Language Inference with LSTM

Lithography-Free, Low-Cost Method for Improving Photodiode Performance by Etching Silicon Nanocones as Antireflection Layer

Machine Comprehension Using Match-LSTM and Answer Pointer

Unified Framework for the Effective Rate Analysis of Wireless Communication Systems over MISO Fading Channels

Achieving Autonomous Compressive Spectrum Sensing for Cognitive Radios

Bounds and Constructions for $\overline{3}$-Separable Codes with Length $3$

Black Silicon Solar Thin-film Microcells Integrating Top Nanocone Structures for Broadband and Omnidirectional Light-Trapping

Codes with the Identifiable Parent Property for Multimedia Fingerprinting

Multimedia IPP Codes with Efficient Tracing

Strongly Separable Codes

Cellphone based Portable Bacteria Pre-Concentrating microfluidic Sensor and Impedance Sensing System

Multi-rate Sub-Nyquist Spectrum Sensing in Cognitive Radios

Practical Analysis of Codebook Design and Frequency Offset Estimation for Virtual-MIMO Systems

Relaying Technologies for Smart Grid Communications

Wideband Spectrum Sensing with Sub-Nyquist Sampling in Cognitive Radios

Flipped SU(5) X U(1)_X Models from F-Theory

$b \to s$ Transitions in Family-dependent $U(1)^\prime$ Models

F-SU(5)

Phenomenological Implications of Supersymmetric Family Non-universal U(1)-prime Models

N=2 6-dimensional Supersymmetric $E_6$ Breaking