Source author record

Jing Huang

Jing Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

48works

28topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

An AI-guided mechanotyping instrument for fully automated oocyte quality assessment

The mechanical properties of oocytes are regarded as important indicators of their developmental potential. During fertilization, deviations from the normal mechanical range can hinder sperm penetration, ultimately reducing fertilization efficiency and compromising embryo quality. However, current methods for measuring oocyte mechanics often suffer from serious cellular damage, low automation levels, and large measurement errors. To address these limitations, we developed an AI-guided micronewton-scale mechanical measurement system for safe and automated oocyte quality assessment. The system integrates voice interaction with automated experimental workflows to control a magnetically actuated microgripper, which applies defined loading forces to induce micron-scale compressive deformation of the oocyte. Combined with AI-assisted object detection and image segmentation algorithms, the system captures cellular deformation in real time, enabling precise calculation of the oocyte's compressive modulus. This measurement system enables automated, quantitative, and non-destructive evaluation of oocyte mechanical properties, providing an effective approach for oocyte quality screening in in vitro fertilization (IVF) and other assisted reproductive technologies (ART).

preprint2026arXiv

Caracal: Causal Architecture via Spectral Mixing

The scalability of Large Language Models to long sequences is hindered by the quadratic cost of attention and the limitations of positional encodings. To address these, we introduce Caracal, a novel architecture that replaces attention with a parameter-efficient, O(L log(L)) Multi-Head Fourier (MHF) module. Our contributions are threefold: (1) We leverage the Fast Fourier Transform (FFT) for sequence mixing, inherently addressing both bottlenecks mentioned above. (2) We apply a frequency-domain causal masking technique that enforces autoregressive capabilities via asymmetric padding and truncation, overcoming a critical barrier for Fourier-based generative models. (3) Unlike efficient models relying on hardware-specific implementations (e.g., Mamba), we uses standard library operators. This ensures robust portability, eliminating common deployment barriers. Evaluations demonstrate that Caracal performs competitively with Transformer and SSM baselines, offering a scalable and simple pathway for efficient long-sequence modeling. Code is available in Appendix.

preprint2026arXiv

Chordal signed graphs and signed bigraphs

Chordal graphs and chordal bigraphs enjoy beautiful characterizations, in terms of forbidden subgraphs, vertex/edge orderings, vertex/edge separating sets, and tree-like representations. In this paper, we introduce chordal signed graphs and chordal signed bigraphs. Interestingly, chordal signed graphs are equivalent to strict chordal digraphs studied by Hell and Hernández-Cruz. A forbidden subdigraph characterization of strict chordal digraphs can be translated to a forbidden subgraph characterization of chordal signed graphs. We give a forbidden subgraph characterization of chordal signed bigraphs. The forbidden subgraphs for chordal signed bigraphs are analogous to those for chordal signed graphs but the proofs are much more complicated and intriguing.

preprint2026arXiv

Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models

Digital twins, as precise digital representations of physical systems, have evolved from passive simulation tools into intelligent and autonomous entities through the integration of artificial intelligence technologies. This paper presents a unified four-stage framework that systematically characterizes AI integration across the digital twin lifecycle, spanning modeling, mirroring, intervention, and autonomous management. By synthesizing existing technologies and practices, we distill a unified four-stage framework that systematically characterizes how AI methodologies are embedded across the digital twin lifecycle: (1) modeling the physical twin through physics-based and physics-informed AI approaches, (2) mirroring the physical system into a digital twin with real-time synchronization, (3) intervening in the physical twin through predictive modeling, anomaly detection, and optimization strategies, and (4) achieving autonomous management through large language models, foundation models, and intelligent agents. We analyze the synergy between physics-based modeling and data-driven learning, highlighting the shift from traditional numerical solvers to physics-informed and foundation models for physical systems. Furthermore, we examine how generative AI technologies, including large language models and generative world models, transform digital twins into proactive and self-improving cognitive systems capable of reasoning, communication, and creative scenario generation. Through a cross-domain review spanning eleven application domains, including healthcare, aerospace, smart manufacturing, robotics, and smart cities, we identify common challenges related to scalability, explainability, and trustworthiness, and outline directions for responsible AI-driven digital twin systems.

preprint2026arXiv

Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs

Training tool-calling agents requires large-scale trajectory data with verifiable labels, yet existing approaches either synthesize environments that diverge from real API behavior or generate tasks without ground-truth outcomes for verification. We present FireFly, a pipeline for generating verified tool-call data from real-world MCP servers. Our key insight is to invert the standard synthesis pipeline: rather than generating tasks and hoping they are solvable, we first let a strong LLM explore real APIs along graph-guided DAG structures, then synthesize tasks backward from observed outcomes, guaranteeing label correctness by construction. To handle the scale of real-world tool spaces (${\sim}$1,000 tools), we build a pairwise tool graph and sample sub-DAGs to focus exploration on semantically coherent workflows. To address environment drift in live APIs, we construct a retrieval-augmented simulator that caches all exploration results and replays them during training and evaluation, enabling fully offline and reproducible RL. Applying this pipeline yields 5,144 verified tasks spanning 240 servers and 993 tools. A 4B-parameter model trained with GRPO on FireFly matches Claude Sonnet 4.6 on our held-out test set and shows improvements on multiple tool-calling benchmarks including Tau2-Bench, MCPMark, and MCP-Atlas.

preprint2026arXiv

MobileDreamer: Generative Sketch World Model for GUI Agent

Mobile GUI agents have shown strong potential in real-world automation and practical applications. However, most existing agents remain reactive, making decisions mainly from current screen, which limits their performance on long-horizon tasks. Building a world model from repeated interactions enables forecasting action outcomes and supports better decision making for mobile GUI agents. This is challenging because the model must predict post-action states with spatial awareness while remaining efficient enough for practical deployment. In this paper, we propose MobileDreamer, an efficient world-model-based lookahead framework to equip the GUI agents based on the future imagination provided by the world model. It consists of textual sketch world model and rollout imagination for GUI agent. Textual sketch world model forecasts post-action states through a learning process to transform digital images into key task-related sketches, and designs a novel order-invariant learning strategy to preserve the spatial information of GUI elements. The rollout imagination strategy for GUI agent optimizes the action-selection process by leveraging the prediction capability of world model. Experiments on Android World show that MobileDreamer achieves state-of-the-art performance and improves task success by 5.25%. World model evaluations further verify that our textual sketch modeling accurately forecasts key GUI elements.

preprint2025arXiv

BATISNet: Instance Segmentation of Tooth Point Clouds with Boundary Awareness

Accurate segmentation of the tooth point cloud is of great significance for diagnosis clinical assisting and treatment planning. Existing methods mostly employ semantic segmentation, focusing on the semantic feature between different types of teeth. However, due to the tightly packed structure of teeth, unclear boundaries, and the diversity of complex cases such as missing teeth, malposed teeth, semantic segmentation often struggles to achieve satisfactory results when dealing with complex dental cases. To address these issues, this paper propose BATISNet, a boundary-aware instance network for tooth point cloud segmentation. This network model consists of a feature extraction backbone and an instance segmentation module. It not only focuses on extracting the semantic features of different types of teeth but also learns the instance features of individual teeth. It helps achieve more robust and accurate tooth instance segmentation in complex clinical scenarios such as missing teeth and malposed teeth. Additionally, to further enhance the completeness and accuracy of tooth boundary segmentation, a boundary-aware loss function is designed to specifically supervise the boundary segmentation between instances. It mitigates effectively tooth adhesion and boundary ambiguity issues. Extensive experimental results show that BATISNet outperforms existing methods in tooth integrity segmentation, providing more reliable and detailed data support for practical clinical applications.

preprint2025arXiv

Hierarchical Deformation Planning and Neural Tracking for DLOs in Constrained Environments

Deformable linear objects (DLOs) manipulation presents significant challenges due to DLOs' inherent high-dimensional state space and complex deformation dynamics. The wide-populated obstacles in realistic workspaces further complicate DLO manipulation, necessitating efficient deformation planning and robust deformation tracking. In this work, we propose a novel framework for DLO manipulation in constrained environments. This framework combines hierarchical deformation planning with neural tracking, ensuring reliable performance in both global deformation synthesis and local deformation tracking. Specifically, the deformation planner begins by generating a spatial path set that inherently satisfies the homotopic constraints associated with DLO keypoint paths. Next, a path-set-guided optimization method is applied to synthesize an optimal temporal deformation sequence for the DLO. In manipulation execution, a neural model predictive control approach, leveraging a data-driven deformation model, is designed to accurately track the planned DLO deformation sequence. The effectiveness of the proposed framework is validated in extensive constrained DLO manipulation tasks.

preprint2025arXiv

Passage-traversing optimal path planning with sampling-based algorithms

This paper introduces a new paradigm of optimal path planning, i.e., passage-traversing optimal path planning (PTOPP), that optimizes paths' traversed passages for specified optimization objectives. In particular, PTOPP is utilized to find the path with optimal accessible free space along its entire length, which represents a basic requirement for paths in robotics. As passages are places where free space shrinks and becomes constrained, the core idea is to leverage the path's passage traversal status to characterize its accessible free space comprehensively. To this end, a novel passage detection and free space decomposition method using proximity graphs is proposed, enabling fast detection of sparse but informative passages and environment decompositions. Based on this preprocessing, optimal path planning with accessible free space objectives or constraints is formulated as PTOPP problems compatible with sampling-based optimal planners. Then, sampling-based algorithms for PTOPP, including their dependent primitive procedures, are developed leveraging partitioned environments for fast passage traversal check. All these methods are implemented and thoroughly tested for effectiveness and efficiency validation. Compared to existing approaches, such as clearance-based methods, PTOPP demonstrates significant advantages in configurability, solution optimality, and efficiency, addressing prior limitations and incapabilities. It is believed to provide an efficient and versatile solution to accessible free space optimization over conventional avenues and more generally, to a broad class of path planning problems that can be formulated as PTOPP.

preprint2025arXiv

Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR

Reading text from images or scanned documents via OCR models has been a longstanding focus of researchers. Intuitively, text reading is perceived as a straightforward perceptual task, and existing work primarily focuses on constructing enriched data engineering to enhance SFT capabilities. In this work, we observe that even advanced OCR models exhibit significantly higher entropy in formatted text (\emph{e.g.}, formula, table, etc.) compared to plain text, often by an order of magnitude. These statistical patterns reveal that advanced OCR models struggle with high output uncertainty when dealing with format sensitive document, suggesting that reasoning over diverse reading pathways may improve OCR performance. To address this, we propose format decoupled reinforcement learning (FD-RL), which leverages high-entropy patterns for targeted optimization. Our approach employs entropy-based data filtration strategy to identify format-intensive instances, and adopt format decoupled rewards tailored to different format types, enabling format-level validation rather than token-level memorization. FD-RL achieves an average score of 90.41 on OmniDocBench, setting a new record for end-to-end models on this highly popular benchmark. More importantly, we conduct comprehensive ablation studies over data, training, filtering, and rewarding strategies, thoroughly validating their effectiveness.

preprint2022arXiv

Comparability digraphs: An analogue of comparability graphs

Comparability graphs are a popular class of graphs. We introduce as the digraph analogue of comparability graphs the class of comparability digraphs. We show that many concepts such as implication classes and the knotting graph for a comparability graph can be naturally extended to a comparability digraph. We give a characterization of comparability digraphs in terms of their knotting graphs. Semicomplete comparability digraphs are a prototype of comparability digraphs. One instrumental technique for analyzing the structure of comparability graphs is the Triangle Lemma for graphs. We generalize the Triangle Lemma to semicomplete digraphs. Using the Triangle Lemma for semicomplete digraphs we prove that if an implication class of a semicomplete digraph contains no circuit of length 2 then it contains no circuit at all. We also use it to device an $\mathcal{O}(n^3)$ time recognition algorithm for semicomplete comparability digraphs where $n$ is the number of vertices of the input digraph. The correctness of the algorithm implies a characterization for semicomplete comparability digraphs, akin to that for comparability graphs.

preprint2022arXiv

Conversational AI Systems for Social Good: Opportunities and Challenges

Conversational artificial intelligence (ConvAI) systems have attracted much academic and commercial attention recently, making significant progress on both fronts. However, little existing work discusses how these systems can be developed and deployed for social good in real-world applications, with comprehensive case studies and analyses of pros and cons. In this paper, we briefly review the progress the community has made towards better ConvAI systems and reflect on how existing technologies can help advance social good initiatives from various angles that are unique for ConvAI, or not yet become common knowledge in the community. We further discuss about the challenges ahead for ConvAI systems to better help us achieve these goals and highlight the risks involved in their development and deployment in the real world.

preprint2022arXiv

Cross-modal Contrastive Distillation for Instructional Activity Anticipation

In this study, we aim to predict the plausible future action steps given an observation of the past and study the task of instructional activity anticipation. Unlike previous anticipation tasks that aim at action label prediction, our work targets at generating natural language outputs that provide interpretable and accurate descriptions of future action steps. It is a challenging task due to the lack of semantic information extracted from the instructional videos. To overcome this challenge, we propose a novel knowledge distillation framework to exploit the related external textual knowledge to assist the visual anticipation task. However, previous knowledge distillation techniques generally transfer information within the same modality. To bridge the gap between the visual and text modalities during the distillation process, we devise a novel cross-modal contrastive distillation (CCD) scheme, which facilitates knowledge distillation between teacher and student in heterogeneous modalities with the proposed cross-modal distillation loss. We evaluate our method on the Tasty Videos dataset. CCD improves the anticipation performance of the visual-alone student model by a large margin of 40.2% relatively in BLEU4. Our approach also outperforms the state-of-the-art approaches by a large margin.

preprint2022arXiv

Improving Time Sensitivity for Question Answering over Temporal Knowledge Graphs

Question answering over temporal knowledge graphs (KGs) efficiently uses facts contained in a temporal KG, which records entity relations and when they occur in time, to answer natural language questions (e.g., "Who was the president of the US before Obama?"). These questions often involve three time-related challenges that previous work fail to adequately address: 1) questions often do not specify exact timestamps of interest (e.g., "Obama" instead of 2000); 2) subtle lexical differences in time relations (e.g., "before" vs "after"); 3) off-the-shelf temporal KG embeddings that previous work builds on ignore the temporal order of timestamps, which is crucial for answering temporal-order related questions. In this paper, we propose a time-sensitive question answering (TSQA) framework to tackle these problems. TSQA features a timestamp estimation module to infer the unwritten timestamp from the question. We also employ a time-sensitive KG encoder to inject ordering information into the temporal KG embeddings that TSQA is based on. With the help of techniques to reduce the search space for potential answers, TSQA significantly outperforms the previous state of the art on a new benchmark for question answering over temporal KGs, especially achieving a 32% (absolute) error reduction on complex questions that require multiple steps of reasoning over facts in the temporal KG.

preprint2022arXiv

Observational Signatures of Tearing Instability in the Current Sheet of a Solar Flare

Magnetic reconnection is a fundamental physical process converting magnetic energy into not only plasma energy but also particle energy in various astrophysical phenomena. In this letter, we show a unique dataset of a solar flare where various plasmoids were formed by a continually stretched current sheet. EUV images captured reconnection inflows, outflows, and particularly the recurring plasma blobs (plasmoids). X-ray images reveal nonthermal emission sources at the lower end of the current sheet, presumably as large plasmoids with a sufficiently amount of energetic electrons trapped in. In the radio domain, an upward slowly drifting pulsation structure, followed by a rare pair of oppositely drifting structures, was observed. These structures are supposed to map the evolution of the primary and the secondary plasmoids formed in the current sheet. Our results on plasmoids at different locations and scales shed important light on the dynamics, plasma heating, particle acceleration, and transport processes in the turbulent current sheet and provide observational evidence for the cascading magnetic reconnection process.

preprint2022arXiv

On the Opportunities and Risks of Foundation Models

AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles(e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications (e.g., law, healthcare, education) and societal impact (e.g., inequity, misuse, economic and environmental impact, legal and ethical considerations). Though foundation models are based on standard deep learning and transfer learning, their scale results in new emergent capabilities,and their effectiveness across so many tasks incentivizes homogenization. Homogenization provides powerful leverage but demands caution, as the defects of the foundation model are inherited by all the adapted models downstream. Despite the impending widespread deployment of foundation models, we currently lack a clear understanding of how they work, when they fail, and what they are even capable of due to their emergent properties. To tackle these questions, we believe much of the critical research on foundation models will require deep interdisciplinary collaboration commensurate with their fundamentally sociotechnical nature.

preprint2022arXiv

Paint shop vehicle sequencing based on quantum computing considering color changeover and painting quality

As customer demands become increasingly diverse, the colors and styles of vehicles offered by automotive companies have also grown substantially. It poses great challenges to design and management of automotive manufacturing system, among which is the proper sequencing of vehicles in everyday operation of the paint shop. With typically hundreds of vehicles in one shift, the paint shop sequencing problem is intractable in classical computing. In this paper, we propose to solve a general paint shop sequencing problem using state-of-the-art quantum computing algorithms. Most existing works are solely focused on reducing color changeover costs, i.e., costs incurred by different colors between consecutive vehicles. This work reveals that different sequencing of vehicles also significantly affects the quality performance of the painting process. We use a machine learning model pretrained on historical data to predict the probability of painting defect. The problem is formulated as a combinational optimization problem with two cost components, i.e., color changeover cost and repair cost. The problem is further converted to a quantum optimization problem and solved with Quantum Approximation Optimization Algorithm (QAOA). As a matter of fact, current quantum computers are still limited in accuracy and scalability. However, with a simplified case study, we demonstrate how the classic sequencing problem in paint shop can be formulated and solved using quantum computing and demonstrate the potential of quantum computing in solving real problems in manufacturing systems.

preprint2022arXiv

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

Existing self-supervised 3D human pose estimation schemes have largely relied on weak supervisions like consistency loss to guide the learning, which, inevitably, leads to inferior results in real-world scenarios with unseen poses. In this paper, we propose a novel self-supervised approach that allows us to explicitly generate 2D-3D pose pairs for augmenting supervision, through a self-enhancing dual-loop learning framework. This is made possible via introducing a reinforcement-learning-based imitator, which is learned jointly with a pose estimator alongside a pose hallucinator; the three components form two loops during the training process, complementing and strengthening one another. Specifically, the pose estimator transforms an input 2D pose sequence to a low-fidelity 3D output, which is then enhanced by the imitator that enforces physical constraints. The refined 3D poses are subsequently fed to the hallucinator for producing even more diverse data, which are, in turn, strengthened by the imitator and further utilized to train the pose estimator. Such a co-evolution scheme, in practice, enables training a pose estimator on self-generated motion data without relying on any given 3D data. Extensive experiments across various benchmarks demonstrate that our approach yields encouraging results significantly outperforming the state of the art and, in some cases, even on par with results of fully-supervised methods. Notably, it achieves 89.1% 3D PCK on MPI-INF-3DHP under self-supervised cross-dataset evaluation setup, improving upon the previous best self-supervised methods by 8.6%. Code can be found at: https://github.com/Garfield-kh/PoseTriplet

preprint2022arXiv

SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences

Distilling supervision signal from a long sequence to make predictions is a challenging task in machine learning, especially when not all elements in the input sequence contribute equally to the desired output. In this paper, we propose SpanDrop, a simple and effective data augmentation technique that helps models identify the true supervision signal in a long sequence with very few examples. By directly manipulating the input sequence, SpanDrop randomly ablates parts of the sequence at a time and ask the model to perform the same task to emulate counterfactual learning and achieve input attribution. Based on theoretical analysis of its properties, we also propose a variant of SpanDrop based on the beta-Bernoulli distribution, which yields diverse augmented sequences while providing a learning objective that is more consistent with the original dataset. We demonstrate the effectiveness of SpanDrop on a set of carefully designed toy tasks, as well as various natural language processing tasks that require reasoning over long sequences to arrive at the correct answer, and show that it helps models improve performance both when data is scarce and abundant.

preprint2022arXiv

Video2StyleGAN: Encoding Video in Latent Space for Manipulation

Many recent works have been proposed for face image editing by leveraging the latent space of pretrained GANs. However, few attempts have been made to directly apply them to videos, because 1) they do not guarantee temporal consistency, 2) their application is limited by their processing speed on videos, and 3) they cannot accurately encode details of face motion and expression. To this end, we propose a novel network to encode face videos into the latent space of StyleGAN for semantic face video manipulation. Based on the vision transformer, our network reuses the high-resolution portion of the latent vector to enforce temporal consistency. To capture subtle face motions and expressions, we design novel losses that involve sparse facial landmarks and dense 3D face mesh. We have thoroughly evaluated our approach and successfully demonstrated its application to various face video manipulations. Particularly, we propose a novel network for pose/expression control in a 3D coordinate system. Both qualitative and quantitative results have shown that our approach can significantly outperform existing single image methods, while achieving real-time (66 fps) speed.

preprint2021arXiv

Chordality of locally semicomplete and weakly quasi-transitive digraphs

Chordal graphs are important in the structural and algorithmic graph theory. A digraph analogue of chordal graphs was introduced by Haskin and Rose in 1973 but has not been a subject of active studies until recently when a characterization of semicomplete chordal digraphs in terms of forbidden subdigraphs was found by Meister and Telle. Locally semicomplete digraphs, quasi-transitive digraphs, and extended semi-complete digraphs are amongst the most popular generalizations of semicomplete digraphs. We extend the forbidden subdigraph characterization of semicomplete chordal digraphs to locally semicomplete chordal digraphs. We introduce a new class of digraphs, called weakly quasi-transitive digraphs, which contains quasi-transitive digraphs, symmetric digraphs, and extended semicomplete digraphs, but is incomparable to the class of locally semicomplete digraphs. We show that weakly quasi-transitive digraphs can be recursively constructed by simple substitutions from transitive oriented graphs, semicomplete digraphs, and symmetric digraphs. This recursive construction of weakly quasi-transitive digraphs, similar to the one for quasi-transitive digraphs, demonstrates the naturalness of the new digraph class. As a by-product, we prove that the forbidden subdigraphs for semicomplete chordal digraphs are the same for weakly quasi-transitive chordal digraphs.

preprint2021arXiv

Energy and spectral analysis of confined solar flares from radio and X-ray observations

The energy and spectral shape of radio bursts may help us understand the generation mechanism of solar eruptions, including solar flares, CMEs, eruptive filaments, and various scales of jets. The different kinds of flares may have different characteristics of energy and spectral distribution. In this work, we selected 10 mostly confined flare events during October 2014 to investigate their overall spectral behavior and the energy emitted in microwaves by using radio observations from microwaves to interplanetary radio waves, and X-ray observations of GOES, RHESSI, and Fermi/GBM. We found that: All the confined flare events were associated with a microwave continuum burst extending to frequencies of 9.4 - 15.4 GHz, and the peak frequencies of all confined flare events are higher than 4.995 GHz and lower than or equal to 17 GHz. The median value is around 9 GHz. The microwave burst energy (or fluence) as well as the peak frequency are found to provide useful criteria to estimate the power of solar flares. The observations imply that the magnetic field in confined flares tends to be stronger than that in 412 flares studied by Nita et al. 2004. All 10 events studied did not produce detectable hard X-rays with energies above 300 keV indicating the lack of efficient acceleration of electrons to high energies in the confined flares.

preprint2021arXiv

Inductive Learning on Commonsense Knowledge Graph Completion

Commonsense knowledge graph (CKG) is a special type of knowledge graph (KG), where entities are composed of free-form text. However, most existing CKG completion methods focus on the setting where all the entities are presented at training time. Although this setting is standard for conventional KG completion, it has limitations for CKG completion. At test time, entities in CKGs can be unseen because they may have unseen text/names and entities may be disconnected from the training graph, since CKGs are generally very sparse. Here, we propose to study the inductive learning setting for CKG completion where unseen entities may present at test time. We develop a novel learning framework named InductivE. Different from previous approaches, InductiveE ensures the inductive learning capability by directly computing entity embeddings from raw entity attributes/text. InductiveE consists of a free-text encoder, a graph encoder, and a KG completion decoder. Specifically, the free-text encoder first extracts the textual representation of each entity based on the pre-trained language model and word embedding. The graph encoder is a gated relational graph convolutional neural network that learns from a densified graph for more informative entity representation learning. We develop a method that densifies CKGs by adding edges among semantic-related entities and provide more supportive information for unseen entities, leading to better generalization ability of entity embedding for unseen entities. Finally, inductiveE employs Conv-TransE as the CKG completion decoder. Experimental results show that InductiveE significantly outperforms state-of-the-art baselines in both standard and inductive settings on ATOMIC and ConceptNet benchmarks. InductivE performs especially well on inductive scenarios where it achieves above 48% improvement over present methods.

preprint2021arXiv

Selective Attention Based Graph Convolutional Networks for Aspect-Level Sentiment Classification

Aspect-level sentiment classification aims to identify the sentiment polarity towards a specific aspect term in a sentence. Most current approaches mainly consider the semantic information by utilizing attention mechanisms to capture the interactions between the context and the aspect term. In this paper, we propose to employ graph convolutional networks (GCNs) on the dependency tree to learn syntax-aware representations of aspect terms. GCNs often show the best performance with two layers, and deeper GCNs do not bring additional gain due to over-smoothing problem. However, in some cases, important context words cannot be reached within two hops on the dependency tree. Therefore we design a selective attention based GCN block (SA-GCN) to find the most important context words, and directly aggregate these information into the aspect-term representation. We conduct experiments on the SemEval 2014 Task 4 datasets. Our experimental results show that our model outperforms the current state-of-the-art.

preprint2020arXiv

(k; l)-Colourings and Ferrers Diagram Representations of Cographs

For a pair of natural numbers $k, l$, a $(k,l)$-colouring of a graph $G$ is a partition of the vertex set of $G$ into (possibly empty) sets $S_1, S_2, \dots, S_k$, $C_1, C_2, \dots, C_l$ such that each set $S_i$ is an independent set and each set $C_j$ induces a clique in $G$. The $(k,l)$-colouring problem, which is NP-complete in general, has been studied for special graph classes such as chordal graphs, cographs and line graphs. Let $\hatκ(G) = (κ_0(G),κ_1(G),\dots,κ_{θ(G)-1}(G))$ and $\hatλ(G) = (λ_0(G),λ_1(G),\dots,λ_{χ(G)-1}(G))$ where $κ_l(G)$ (respectively, $λ_k(G)$) is the minimum $k$ (respectively, $l$) such that $G$ has a $(k,l)$-colouring. We prove that $\hatκ(G)$ and $\hatλ(G)$ are a pair of conjugate sequences for every graph $G$ and when $G$ is a cograph, the number of vertices in $G$ is equal to the sum of the entries in $\hatκ(G)$ or in $\hatλ(G)$. Using the decomposition property of cographs we show that every cograph can be represented by Ferrers diagram. We devise algorithms which compute $\hatκ(G)$ for cographs $G$ and find an induced subgraph in $G$ that can be used to certify the non-$(k,l)$-colourability of $G$.

preprint2020arXiv

Entity and Evidence Guided Relation Extraction for DocRED

Document-level relation extraction is a challenging task which requires reasoning over multiple sentences in order to predict relations in a document. In this paper, we pro-pose a joint training frameworkE2GRE(Entity and Evidence Guided Relation Extraction)for this task. First, we introduce entity-guided sequences as inputs to a pre-trained language model (e.g. BERT, RoBERTa). These entity-guided sequences help a pre-trained language model (LM) to focus on areas of the document related to the entity. Secondly, we guide the fine-tuning of the pre-trained language model by using its internal attention probabilities as additional features for evidence prediction.Our new approach encourages the pre-trained language model to focus on the entities and supporting/evidence sentences. We evaluate our E2GRE approach on DocRED, a recently released large-scale dataset for relation extraction. Our approach is able to achieve state-of-the-art results on the public leaderboard across all metrics, showing that our E2GRE is both effective and synergistic on relation extraction and evidence prediction.

preprint2020arXiv

Graph Sequential Network for Reasoning over Sequences

Recently Graph Neural Network (GNN) has been applied successfully to various NLP tasks that require reasoning, such as multi-hop machine reading comprehension. In this paper, we consider a novel case where reasoning is needed over graphs built from sequences, i.e. graph nodes with sequence data. Existing GNN models fulfill this goal by first summarizing the node sequences into fixed-dimensional vectors, then applying GNN on these vectors. To avoid information loss inherent in the early summarization and make sequential labeling tasks on GNN output feasible, we propose a new type of GNN called Graph Sequential Network (GSN), which features a new message passing algorithm based on co-attention between a node and each of its neighbors. We validate the proposed GSN on two NLP tasks: interpretable multi-hop reading comprehension on HotpotQA and graph based fact verification on FEVER. Both tasks require reasoning over multiple documents or sentences. Our experimental results show that the proposed GSN attains better performance than the standard GNN based methods.

preprint2020arXiv

Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting

Recent end-to-end trainable methods for scene text spotting, integrating detection and recognition, showed much progress. However, most of the current arbitrary-shape scene text spotters use region proposal networks (RPN) to produce proposals. RPN relies heavily on manually designed anchors and its proposals are represented with axis-aligned rectangles. The former presents difficulties in handling text instances of extreme aspect ratios or irregular shapes, and the latter often includes multiple neighboring instances into a single proposal, in cases of densely oriented text. To tackle these problems, we propose Mask TextSpotter v3, an end-to-end trainable scene text spotter that adopts a Segmentation Proposal Network (SPN) instead of an RPN. Our SPN is anchor-free and gives accurate representations of arbitrary-shape proposals. It is therefore superior to RPN in detecting text instances of extreme aspect ratios or irregular shapes. Furthermore, the accurate proposals produced by SPN allow masked RoI features to be used for decoupling neighboring text instances. As a result, our Mask TextSpotter v3 can handle text instances of extreme aspect ratios or irregular shapes, and its recognition accuracy won't be affected by nearby text or background noise. Specifically, we outperform state-of-the-art methods by 21.9 percent on the Rotated ICDAR 2013 dataset (rotation robustness), 5.9 percent on the Total-Text dataset (shape robustness), and achieve state-of-the-art performance on the MSRA-TD500 dataset (aspect ratio robustness). Code is available at: https://github.com/MhLiao/MaskTextSpotterV3

preprint2020arXiv

Mastering the working sequence in human-robot collaborative assembly based on reinforcement learning

A long-standing goal of the Human-Robot Collaboration (HRC) in manufacturing systems is to increase the collaborative working efficiency. In line with the trend of Industry 4.0 to build up the smart manufacturing system, the Co-robot in the HRC system deserves better designing to be more self-organized and to find the superhuman proficiency by self-learning. Inspired by the impressive machine learning algorithms developed by Google Deep Mind like Alphago Zero, in this paper, the human-robot collaborative assembly working process is formatted into a chessboard and the selection of moves in the chessboard is used to analogize the decision making by both human and robot in the HRC assembly working process. To obtain the optimal policy of working sequence to maximize the working efficiency, the robot is trained with a self-play algorithm based on reinforcement learning, without guidance or domain knowledge beyond game rules. A neural network is also trained to predict the distribution of the priority of move selections and whether a working sequence is the one resulting in the maximum of the HRC efficiency. An adjustable desk assembly is used to demonstrate the proposed HRC assembly algorithm and its efficiency.

preprint2020arXiv

Modeling the broadest spectral band of the Crab nebula and constraining the ions acceleration efficiency

Although it is widely accepted that the electromagnetic spectrum from radio to very-high-energy $γ$-rays of pulsar wind nebulae (PWNe) originates from leptons, there is still an open question that protons (or more generally, ions) may exist in pulsar wind and are further accelerated in PWN. The broadband spectrum of the prototype PWN Crab, extended recently by the detection of the Tibet AS$γ$ and HAWC experiments above 100 TeV, may be helpful in constraining the acceleration efficiency of ions. Here, we model the broadest energy spectrum of Crab and find that the broadband spectrum can be explained by the one-zone leptonic model in which the electrons/positrons produce the emission from radio to soft $γ$-rays via the synchrotron process, and simultaneously generate the GeV-TeV $γ$-rays through inverse Compton scattering including the synchrotron self-Compton process. In the framework of this leptonic model, the fraction of energy converted into the energetic protons is constrained to be below $0.5\ (n_{\rm t}/10\ {\rm cm}^{-3})^{-1}$ per cent, where $n_{\rm t}$ is the target gas density in the Crab. However, this fraction can be up to $7\ (n_{\rm t}/10\ {\rm cm}^{-3})^{-1}$ per cent if only the $γ$-rays are used.

preprint2020arXiv

Obstructions for acyclic local tournament orientation completions

The orientation completion problem for a fixed class of oriented graphs asks whether a given partially oriented graph can be completed to an oriented graph in the class. Orientation completion problems have been studied recently for several classes of oriented graphs, yielding both polynomial time solutions and NP-completeness results. Local tournaments are a well-structured class of oriented graphs that generalize tournaments and their underlying graphs are intimately related to proper circular-arc graphs. Proper interval graphs are precisely those which can be oriented as acyclic local tournaments. It has been proved that the orientation completion problems for local tournaments and acyclic local tournaments are both polynomial time solvable. In this paper we identify the obstructions for acyclic local tournament orientation completions. These are in a sense minimal partially oriented graphs that cannot be completed to acyclic local tournaments. Our description of the obstructions imply that they can be recognized in polynomial time. In a companion paper we will determine all obstructions for local tournament orientation completions.

preprint2020arXiv

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding

Translational distance-based knowledge graph embedding has shown progressive improvements on the link prediction task, from TransE to the latest state-of-the-art RotatE. However, N-1, 1-N and N-N predictions still remain challenging. In this work, we propose a novel translational distance-based approach for knowledge graph link prediction. The proposed method includes two-folds, first we extend the RotatE from 2D complex domain to high dimension space with orthogonal transforms to model relations for better modeling capacity. Second, the graph context is explicitly modeled via two directed context representations. These context representations are used as part of the distance scoring function to measure the plausibility of the triples during training and inference. The proposed approach effectively improves prediction accuracy on the difficult N-1, 1-N and N-N cases for knowledge graph link prediction task. The experimental results show that it achieves better performance on two benchmark data sets compared to the baseline RotatE, especially on data set (FB15k-237) with many high in-degree connection nodes.

preprint2020arXiv

Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents

Interpretable multi-hop reading comprehension (RC) over multiple documents is a challenging problem because it demands reasoning over multiple information sources and explaining the answer prediction by providing supporting evidences. In this paper, we propose an effective and interpretable Select, Answer and Explain (SAE) system to solve the multi-document RC problem. Our system first filters out answer-unrelated documents and thus reduce the amount of distraction information. This is achieved by a document classifier trained with a novel pairwise learning-to-rank loss. The selected answer-related documents are then input to a model to jointly predict the answer and supporting sentences. The model is optimized with a multi-task learning objective on both token level for answer prediction and sentence level for supporting sentences prediction, together with an attention-based interaction between these two tasks. Evaluated on HotpotQA, a challenging multi-hop RC data set, the proposed SAE system achieves top competitive performance in distractor setting compared to other existing systems on the leaderboard.

preprint2020arXiv

Speaker Diarization with Lexical Information

This work presents a novel approach for speaker diarization to leverage lexical information provided by automatic speech recognition. We propose a speaker diarization system that can incorporate word-level speaker turn probabilities with speaker embeddings into a speaker clustering process to improve the overall diarization accuracy. To integrate lexical and acoustic information in a comprehensive way during clustering, we introduce an adjacency matrix integration for spectral clustering. Since words and word boundary information for word-level speaker turn probability estimation are provided by a speech recognition system, our proposed method works without any human intervention for manual transcriptions. We show that the proposed method improves diarization performance on various evaluation datasets compared to the baseline diarization system using acoustic information only in speaker embeddings.

preprint2019arXiv

Cross-lingual Text-independent Speaker Verification using Unsupervised Adversarial Discriminative Domain Adaptation

Speaker verification systems often degrade significantly when there is a language mismatch between training and testing data. Being able to improve cross-lingual speaker verification system using unlabeled data can greatly increase the robustness of the system and reduce human labeling costs. In this study, we introduce an unsupervised Adversarial Discriminative Domain Adaptation (ADDA) method to effectively learn an asymmetric mapping that adapts the target domain encoder to the source domain, where the target domain and source domain are speech data from different languages. ADDA, together with a popular Domain Adversarial Training (DAT) approach, are evaluated on a cross-lingual speaker verification task: the training data is in English from NIST SRE04-08, Mixer 6 and Switchboard, and the test data is in Chinese from AISHELL-I. We show that with the ADDA adaptation, Equal Error Rate (EER) of the x-vector system decreases from 9.331\% to 7.645\%, relatively 18.07\% reduction of EER, and 6.32\% reduction from DAT as well. Further data analysis of ADDA adapted speaker embedding shows that the learned speaker embeddings can perform well on speaker classification for the target domain data, and are less dependent with respect to the shift in language.

preprint2016arXiv

Diagnosing the Source Region of a Solar Burst on 26 September 2011 by Microwave Type III Pairs

This work reports a peculiar and interesting train of microwave type III pair bursts in the impulsive rising phase of a solar flare on 2011 September 26. The observations include radio spectrometers at frequency of 0.80 - 2.00 GHz, hard X-ray (RHESSI and FERMI), EUV images of SWAP/PROBA-2 and magnetogram of HMI/SDO. By using a recently developed method (Tan et al. 2016a), we diagnosed the plasma density, temperature, plasma beta, magnetic field near the source region, the energy of energetic electrons and the distance between the acceleration region and the emission start sites of type III bursts. From the diagnostics, we find that: (1) The plasma density, temperature, magnetic field, and the distance between the acceleration region and the emission start sites almost have no obvious variations during the period of type III pair trains, while the energy of electrons has an obvious peak value which is consistent to the hard X-ray emission. (2) The plasma beta is much higher than an unity showing a highly dynamic process near the emission start site of type III bursts. (3) Although the reversed-slope type III branches drift slower at one order of magnitude than that of the normal type III branches, the related downgoing and upgoing electrons still could have same order of magnitude of energy. These facts indicate that both of the upgoing and downgoing electrons are possibly accelerated by similar mechanism and in a small source region. This diagnostics can help us to understand the microphysics in the source region of solar bursts.

preprint2016arXiv

Performance of new 8-inch photomultiplier tube used for the Tibet muon-detector array

A new hybrid experiment has been constructed to measure the chemical composition of cosmic rays around the "knee" in the wide energy range by the Tibet AS$γ$ collaboration at Tibet, China, since 2014. They consist of a high-energy air-shower-core array (YAC-II), a high-density air-shower array (Tibet-III) and a large underground water-Cherenkov muon-detector array (MD). In order to obtain the primary proton, helium and iron spectra and their "knee" positions in the energy range lower than $10^{16}$ eV, each of PMTs equipped to the MD cell is required to measure the number of photons capable of covering a wide dynamic range of 100 - $10^{6}$ photoelectrons (PEs) according to Monte Carlo simulations. In this paper, we firstly compare the characteristic features between R5912-PMT made by Japan Hamamatsu and CR365-PMT made by Beijing Hamamatsu. This is the first comparison between R5912-PMT and CR365-PMT. If there exists no serious difference, we will then add two 8-inch-in-diameter PMTs to meet our requirements in each MD cell, which are responsible for the range of 100 - 10000 PEs and 2000 - 1000000 PEs, respectively. That is, MD cell is expected to be able to measure the number of muons over 6 orders of magnitude.

preprint2016arXiv

Very Long-period Pulsations before the Onset of Solar Flares

Solar flares are the most powerful explosions occurring in the solar system, which may lead to disastrous space weather events and impact various aspects of our Earth. So far, it is still a big challenge in modern astrophysics to understand the origin of solar flares and predict their onset. Based on the analysis of soft X-ray emission observed by the Geostationary Operational Environmental Satellite (GOES), this work reported a new discovery of very long-periodic pulsations occurred in the preflare phase before the onset of solar flares (preflare-VLPs). These pulsations are typically with period of 8 - 30 min and last for about 1 - 2 hours. They are possibly generated from LRC oscillations of plasma loops where electric current dominates the physical process during magnetic energy accumulation in the source region. The preflare-VLP provides an essential information for understanding the triggering mechanism and origin of solar flares, and may help us to response to solar explosions and the corresponding disastrous space weather events as a convenient precursory indicator.

preprint2015arXiv

Development of Yangbajing Air shower Core detector array for a new EAS hybrid Experiment

Aiming at the observation of cosmic-ray chemical composition at the "knee" energy region, we have been developinga new type air-shower core detector (YAC, Yangbajing Air shower Core detector array) to be set up at Yangbajing (90.522$^\circ$ E, 30.102$^\circ$ N, 4300 m above sea level, atmospheric depth: 606 g/m$^2$) in Tibet, China. YAC works together with the Tibet air-shower array (Tibet-III) and an underground water cherenkov muon detector array (MD) as a hybrid experiment. Each YAC detector unit consists of lead plates of 3.5 cm thick and a scintillation counter which detects the burst size induced by high energy particles in the air-shower cores. The burst size can be measured from 1 MIP (Minimum Ionization Particle) to $10^{6}$ MIPs. The first phase of this experiment, named "YAC-I", consists of 16 YAC detectors each having the size 40 cm $\times$ 50 cm and distributing in a grid with an effective area of 10 m$^{2}$. YAC-I is used to check hadronic interaction models. The second phase of the experiment, called "YAC-II", consists of 124 YAC detectors with coverage about 500 m$^2$. The inner 100 detectors of 80 cm $\times $ 50 cm each are deployed in a 10 $\times$ 10 matrix from with a 1.9 m separation and the outer 24 detectors of 100 cm $\times$ 50 cm each are distributed around them to reject non-core events whose shower cores are far from the YAC-II array. YAC-II is used to study the primary cosmic-ray composition, in particular, to obtain the energy spectra of proton, helium and iron nuclei between 5$\times$$10^{13}$ eV and $10^{16}$ eV covering the "knee" and also being connected with direct observations at energies around 100 TeV. We present the design and performance of YAC-II in this paper.

preprint2015arXiv

End-vertices of LBFS of (AT-free) bigraphs

Lexicographic Breadth First Search (LBFS) is one of fundamental graph search algorithms that has numerous applications, including recognition of graph classes, computation of graph parameters, and detection of certain graph structures. The well-known result of Rose, Tarjan and Lueker on the end-vertices of LBFS of chordal graphs has tempted researchers to study the end-vertices of LBFS of various classes of graphs, including chordal graphs, split graphs, interval graphs, and asteroidal triple-free (AT-free) graphs. In this paper we study the end-vertices of LBFS of bipartite graphs. We show that deciding whether a vertex of a bipartite graph is the end-vertex of an LBFS is an NP-complete problem. In contrast we characterize the end-vertices of LBFS of AT-free bipartite graphs. Our characterization implies that the problem of deciding whether a vertex of an AT-free bipartite graph is the end-vertex of an LBFS is solvable in polynomial time. Key words: Lexicographic breadth first search, end-vertex, bipartite graphs, AT-free, proper interval bigraph, characterization, algorithm.

preprint2015arXiv

Spectra of cosmic ray electrons and diffuse gamma rays with the constraints of AMS-02 and HESS data

Recently, AMS-02 reported their observed results of cosmic rays(CRs). In addition to the AMS-02 data, we add HESS data to estimate the spectra of CR electrons and the diffuse gamma rays above TeV. In the conventional diffusion model, a global analysis is performed on the spectral features of CR electrons and the diffuse gamma rays by GALRPOP package. The results show that the spectrum structure of the primary component of CR electrons can not be fully reproduced by a simple power law and the relevant break is around hundred GeV. At 99\% C.L., the injection indices above the break decrease from 2.54 to 2.35, but the ones below the break are only in the range 2.746 - 2.751. The spectrum of CR electrons does not need to add TeV cutoff to match the features of HESS data too. Based on the difference between the fluxes of CR electrons and the primary component of them, the predicted excess of CR positrons is consistent with the interpretations as pulsar or dark matter. In the analysis of the Galactic diffuse gamma rays with the indirect constraint of AMS-02 and HESS data, it is found that the fluxes of Galactic diffuse gamma rays are consistent with GeV data of Fermi-LAT in the high latitude regions. The results indicate that the inverse Compton scattering(IC) is the dominant component in the range of the hundred GeV to tens of TeV respectively from the high latitude regions to the low ones, and in the all regions of Galaxy the flux of diffuse gamma rays is less than CR electrons at the energy scale of 20 TeV.

preprint2014arXiv

A Very Small and Super Strong Zebra Pattern Burst at the Beginning of a Solar Flare

Microwave emission with spectral zebra pattern structures (ZPs) is observed frequently in solar flares and the Crab pulsar. The previous observations show that ZP is only a structure overlapped on the underlying broadband continuum with slight increments and decrements. This work reports an extremely unusual strong ZP burst occurring just at the beginning of a solar flare observed simultaneously by two radio telescopes located in China and Czech Republic and by the extreme ultraviolet (EUV) telescope on board NASA's satellite Solar Dynamics Observatory on 2013 April 11. It is a very short and super strong explosion whose intensity exceeds several times that of the underlying flaring broadband continuum emission, lasting for just 18 s. EUV images show that the flare starts from several small flare bursting points (FBPs). There is a sudden EUV flash with extra enhancement in one of these FBPs during the ZP burst. Analysis indicates that the ZP burst accompanying EUV flash is an unusual explosion revealing a strong coherent process with rapid particle acceleration, violent energy release, and fast plasma heating simultaneously in a small region with short duration just at the beginning of the flare.

preprint2014arXiv

Principles of Physical Layer Security in Multiuser Wireless Networks: A Survey

This paper provides a comprehensive review of the domain of physical layer security in multiuser wireless networks. The essential premise of physical-layer security is to enable the exchange of confidential messages over a wireless medium in the presence of unauthorized eavesdroppers without relying on higher-layer encryption. This can be achieved primarily in two ways: without the need for a secret key by intelligently designing transmit coding strategies, or by exploiting the wireless communication medium to develop secret keys over public channels. The survey begins with an overview of the foundations dating back to the pioneering work of Shannon and Wyner on information-theoretic security. We then describe the evolution of secure transmission strategies from point-to-point channels to multiple-antenna systems, followed by generalizations to multiuser broadcast, multiple-access, interference, and relay networks. Secret-key generation and establishment protocols based on physical layer mechanisms are subsequently covered. Approaches for secrecy based on channel coding design are then examined, along with a description of inter-disciplinary approaches based on game theory and stochastic geometry. The associated problem of physical-layer message authentication is also introduced briefly. The survey concludes with observations on potential research directions in this area.

preprint2014arXiv

Solar Radio Bursts with Spectral Fine Structures in Preflares

A good observation of preflare activities is important for us to understand the origin and triggering mechanism of solar flares, and to predict the occurrence of solar flares. This work presents the characteristics of microwave spectral fine structures as preflare activities of four solar flares observed by Ondřejov radio spectrograph in the frequency range of 0.8--2.0 GHz. We found that these microwave bursts which occurred 1--4 minutes before the onset of flares have spectral fine structures with relatively weak intensities and very short timescales. They include microwave quasi-periodic pulsations (QPP) with very short period of 0.1-0.3 s and dot bursts with millisecond timescales and narrow frequency bandwidths. Accompanying these microwave bursts, there are filament motions, plasma ejection or loop brightening on the EUV imaging observations and non-thermal hard X-ray emission enhancements observed by RHESSI. These facts may reveal certain independent non-thermal energy releasing processes and particle acceleration before the onset of solar flares. They may be conducive to understand the nature of solar flares and predict their occurrence.

preprint2013arXiv

Secure Communication Via an Untrusted Non-Regenerative Relay in Fading Channels

We investigate a relay network where the source can potentially utilize an untrusted non-regenerative relay to augment its direct transmission of a confidential message to the destination. Since the relay is untrusted, it is desirable to protect the confidential data from it while simultaneously making use of it to increase the reliability of the transmission. We first examine the secrecy outage probability (SOP) of the network assuming a single antenna relay, and calculate the exact SOP for three different schemes: direct transmission without using the relay, conventional non-regenerative relaying, and cooperative jamming by the destination. Subsequently, we conduct an asymptotic analysis of the SOPs to determine the optimal policies in different operating regimes. We then generalize to the multi-antenna relay case and investigate the impact of the number of relay antennas on the secrecy performance. Finally, we study a scenario where the relay has only a single RF chain which necessitates an antenna selection scheme, and we show that unlike the case where all antennas are used, under certain conditions the cooperative jamming scheme with antenna selection provides a diversity advantage for the receiver. Numerical results are presented to verify the theoretical predictions of the preferred transmission policies.

preprint2012arXiv

Spin-dependent exciton quenching and intrinsic spin coherence in CdSe/CdS nanocrystals

Large surface to volume ratios of semiconductor nanocrystals cause susceptibility to charge trapping, which can modify luminescence yields and induce single-particle blinking. Optical spectroscopies cannot differentiate between bulk and surface traps in contrast to spin-resonance techniques, which in principle avail chemical information on such trap sites. Magnetic resonance detection via spin-controlled photoluminescence enables the direct observation of interactions between emissive excitons and trapped charges. This approach allows the discrimination of two functionally different trap states in CdSe/CdS nanocrystals underlying the fluorescence quenching and thus blinking mechanisms: a spin-dependent Auger process in charged particles; and a charge-separated state pair process, which leaves the particle neutral. The paramagnetic trap centers offer control of energy transfer from the wide-gap CdS to the narrow-gap CdSe, i.e. light harvesting within the heterostructure. Coherent spin motion within the trap states of the CdS arms of nanocrystal tetrapods is reflected by spatially remote luminescence from CdSe cores with surprisingly long coherence times of >300 ns at 3.5 K.

preprint2011arXiv

Cooperative Jamming for Secure Communications in MIMO Relay Networks

Secure communications can be impeded by eavesdroppers in conventional relay systems. This paper proposes cooperative jamming strategies for two-hop relay networks where the eavesdropper can wiretap the relay channels in both hops. In these approaches, the normally inactive nodes in the relay network can be used as cooperative jamming sources to confuse the eavesdropper. Linear precoding schemes are investigated for two scenarios where single or multiple data streams are transmitted via a decode-and-forward (DF) relay, under the assumption that global channel state information (CSI) is available. For the case of single data stream transmission, we derive closed-form jamming beamformers and the corresponding optimal power allocation. Generalized singular value decomposition (GSVD)-based secure relaying schemes are proposed for the transmission of multiple data streams. The optimal power allocation is found for the GSVD relaying scheme via geometric programming. Based on this result, a GSVD-based cooperative jamming scheme is proposed that shows significant improvement in terms of secrecy rate compared to the approach without jamming. Furthermore, the case involving an eavesdropper with unknown CSI is also investigated in this paper. Simulation results show that the secrecy rate is dramatically increased when inactive nodes in the relay network participate in cooperative jamming.

preprint2011arXiv

Robust Secure Transmission in MISO Channels Based on Worst-Case Optimization

This paper studies robust transmission schemes for multiple-input single-output (MISO) wiretap channels. Both the cases of direct transmission and cooperative jamming with a helper are investigated with imperfect channel state information (CSI) for the eavesdropper links. Robust transmit covariance matrices are obtained based on worst-case secrecy rate maximization, under both individual and global power constraints. For the case of an individual power constraint, we show that the non-convex maximin optimization problem can be transformed into a quasiconvex problem that can be efficiently solved with existing methods. For a global power constraint, the joint optimization of the transmit covariance matrices and power allocation between the source and the helper is studied via geometric programming. We also study the robust wiretap transmission problem for the case with a quality-of-service constraint at the legitimate receiver. Numerical results show the advantage of the proposed robust design. In particular, for the global power constraint scenario, although cooperative jamming is not necessary for optimal transmission with perfect eavesdropper's CSI, we show that robust jamming support can increase the worst-case secrecy rate and lower the signal to interference-plus-noise ratio at Eve in the presence of channel mismatches between the transmitters and the eavesdropper.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Source provenance

Where this author record came from

arxivconfidence 95%

external id: arxiv:2512.24974:author:3:jing-huang

Imported May 21, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2512.24201:author:5:jing-huang

Imported May 21, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2506.23614:author:1:jing-huang

Imported May 21, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.00292:author:4:jing-huang

Imported May 20, 2026Synced May 20, 2026

arxivconfidence 95%

external id: arxiv:2605.17558:author:14:jing-huang

Imported May 20, 2026Synced May 20, 2026

7 works

Guangtao Wang

Researcher

Guangtao Wang contributes to research discovery and scholarly infrastructure.

Open to collaborate

7 works

Xiaodong He

Researcher

Xiaodong He contributes to research discovery and scholarly infrastructure.

Open to collaborate

6 works

Bowen Zhou

Researcher

Bowen Zhou contributes to research discovery and scholarly infrastructure.

Open to collaborate

5 works

Baolin Tan

Researcher

Baolin Tan contributes to research discovery and scholarly infrastructure.

Open to collaborate

Jing Huang

What is connected

Connect this record

See the researcher in context

Building this map preview

48 published item(s)

An AI-guided mechanotyping instrument for fully automated oocyte quality assessment

Caracal: Causal Architecture via Spectral Mixing

Chordal signed graphs and signed bigraphs

Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models

Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs

MobileDreamer: Generative Sketch World Model for GUI Agent

BATISNet: Instance Segmentation of Tooth Point Clouds with Boundary Awareness

Hierarchical Deformation Planning and Neural Tracking for DLOs in Constrained Environments

Passage-traversing optimal path planning with sampling-based algorithms

Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR

Comparability digraphs: An analogue of comparability graphs

Conversational AI Systems for Social Good: Opportunities and Challenges

Cross-modal Contrastive Distillation for Instructional Activity Anticipation

Improving Time Sensitivity for Question Answering over Temporal Knowledge Graphs

Observational Signatures of Tearing Instability in the Current Sheet of a Solar Flare

On the Opportunities and Risks of Foundation Models

Paint shop vehicle sequencing based on quantum computing considering color changeover and painting quality

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences

Video2StyleGAN: Encoding Video in Latent Space for Manipulation

Chordality of locally semicomplete and weakly quasi-transitive digraphs

Energy and spectral analysis of confined solar flares from radio and X-ray observations

Inductive Learning on Commonsense Knowledge Graph Completion

Selective Attention Based Graph Convolutional Networks for Aspect-Level Sentiment Classification

(k; l)-Colourings and Ferrers Diagram Representations of Cographs

Entity and Evidence Guided Relation Extraction for DocRED

Graph Sequential Network for Reasoning over Sequences

Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting

Mastering the working sequence in human-robot collaborative assembly based on reinforcement learning

Modeling the broadest spectral band of the Crab nebula and constraining the ions acceleration efficiency

Obstructions for acyclic local tournament orientation completions

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding

Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents

Speaker Diarization with Lexical Information

Cross-lingual Text-independent Speaker Verification using Unsupervised Adversarial Discriminative Domain Adaptation

Diagnosing the Source Region of a Solar Burst on 26 September 2011 by Microwave Type III Pairs

Performance of new 8-inch photomultiplier tube used for the Tibet muon-detector array

Very Long-period Pulsations before the Onset of Solar Flares

Development of Yangbajing Air shower Core detector array for a new EAS hybrid Experiment

End-vertices of LBFS of (AT-free) bigraphs

Spectra of cosmic ray electrons and diffuse gamma rays with the constraints of AMS-02 and HESS data

A Very Small and Super Strong Zebra Pattern Burst at the Beginning of a Solar Flare

Principles of Physical Layer Security in Multiuser Wireless Networks: A Survey

Solar Radio Bursts with Spectral Fine Structures in Preflares

Secure Communication Via an Untrusted Non-Regenerative Relay in Fading Channels

Spin-dependent exciton quenching and intrinsic spin coherence in CdSe/CdS nanocrystals

Cooperative Jamming for Secure Communications in MIMO Relay Networks

Robust Secure Transmission in MISO Channels Based on Worst-Case Optimization