Source author record

Jin Xu

Jin Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Catalog footprint

What is connected

41 works

42 topics

4 close collaborators

preprint 2026 arXiv

XekRung Technical Report

We present XekRung, a frontier large language model for cybersecurity, designed to provide comprehensive security capabilities. To achieve this, we develop diverse data synthesis pipelines tailored to the cybersecurity domain, enabling the scalable construction of high-quality training data and providing a strong foundation for cybersecurity knowledge and understanding. Building on this foundation, we establish a complete training pipeline spanning continued pre-training (CPT), supervised fine-tuning (SFT), and reinforcement learning (RL) to further extend the model's capabilities. We further introduce a multi-dimensional evaluation system to guide the iterative improvement of both domain-specific and general-purpose abilities. Extensive experiments demonstrate that XekRung achieves state-of-the-art performance on cybersecurity-specific benchmarks among models of the same scale, while maintaining strong performance on general benchmarks.

preprint 2025 arXiv

Duality of Ryu-Takayanagi surfaces inside and outside the horizon

We study the Ryu-Takayanagi (RT) surfaces associated with timelike subregions in static spacetimes with a horizon. It is possible to find an analytic continuation of the RT surfaces that extends into the horizon, allowing us to probe the interior of the black hole. The horizon typically divides the RT surface into two distinct parts. We demonstrate that the area of the surface inside the horizon can be reconstructed from the contributions of the surfaces outside the horizon, along with additional RT surfaces for spacelike subregions that are causally related to the timelike subregions. This result provides a concrete realization of black hole complementarity at the level of the classical metric, where the spacetime in the black hole interior can be reconstructed from the degrees of freedom outside the horizon.

preprint 2022 arXiv

A Novel Markov Model for Near-Term Railway Delay Prediction

Accurately predicting near-future train delays is important for railway operations and passengers' traveling experience. This work aims to design prediction models for train delays based on Netherlands Railway data. We first develop a chi-square test to show that the delay evolution over stations follows a first-order Markov chain. We then propose a delay prediction model based on non-homogeneous Markov chains. To deal with the sparsity of the transition matrices of the Markov chains, we propose a novel matrix recovery approach that relies on Gaussian kernel density estimation. Our numerical tests show that this recovery approach outperforms other heuristic approaches in prediction accuracy. The Markov chain model we propose is also shown to be better than other widely used time series models with respect to both interpretability and prediction accuracy. Moreover, our proposed model does not require a complicated training process and is capable of handling large-scale forecasting problems.
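
The idea of a first-order Markov chain over delay states, with kernel smoothing to fill in sparse transition counts, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: delays are binned into integer states, a single (homogeneous) matrix is estimated, and the smoothing is a simple product Gaussian kernel over neighboring cells. The function names are hypothetical and this is not the authors' recovery procedure, whose details the abstract does not give.

```python
import math

def transition_counts(sequences, n_states):
    """Count observed transitions i -> j between consecutive stations."""
    counts = [[0.0] * n_states for _ in range(n_states)]
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            counts[a][b] += 1.0
    return counts

def smooth_transition_matrix(counts, bandwidth=1.0):
    """Spread each count over nearby cells with a Gaussian kernel, then normalize rows."""
    n = len(counts)
    kernel = [[math.exp(-0.5 * ((a - b) / bandwidth) ** 2) for b in range(n)]
              for a in range(n)]
    matrix = []
    for i in range(n):
        row = [sum(counts[a][b] * kernel[a][i] * kernel[b][j]
                   for a in range(n) for b in range(n))
               for j in range(n)]
        total = sum(row)
        if total > 0.0:
            matrix.append([v / total for v in row])
        else:
            # No data at all: fall back to a uniform row.
            matrix.append([1.0 / n] * n)
    return matrix

# Toy data: each list is one train's delay state at successive stations.
sequences = [[0, 0, 1, 2], [0, 1, 1, 3], [1, 2, 3, 3]]
counts = transition_counts(sequences, n_states=5)
P = smooth_transition_matrix(counts, bandwidth=1.0)
```

State 4 never appears in the toy data, so its raw count row is all zeros, yet the smoothed row `P[4]` is still a valid probability distribution borrowed from neighboring states. A non-homogeneous model would estimate one such matrix per station pair.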

preprint 2022 arXiv

AGIC: Approximate Gradient Inversion Attack on Federated Learning

Federated learning is a private-by-design distributed learning paradigm where clients train local models on their own data before a central server aggregates their local updates to compute a global model. Depending on the aggregation method used, the local updates are either the gradients or the weights of local learning models. Recent reconstruction attacks apply a gradient inversion optimization on the gradient update of a single minibatch to reconstruct the private data used by clients during training. As the state-of-the-art reconstruction attacks focus solely on a single update, realistic adversarial scenarios are overlooked, such as observation across multiple updates and updates trained from multiple mini-batches. A few studies consider a more challenging adversarial scenario where only model updates based on multiple mini-batches are observable, and resort to computationally expensive simulation to untangle the underlying samples for each local step. In this paper, we propose AGIC, a novel Approximate Gradient Inversion Attack that efficiently and effectively reconstructs images from either model or gradient updates, and across multiple epochs. In a nutshell, AGIC (i) approximates gradient updates of used training samples from model updates to avoid costly simulation procedures, (ii) leverages gradient/model updates collected from multiple epochs, and (iii) assigns increasing weights to layers with respect to the neural network structure for reconstruction quality. We extensively evaluate AGIC on three datasets, CIFAR-10, CIFAR-100 and ImageNet. Our results show that AGIC increases the peak signal-to-noise ratio (PSNR) by up to 50% compared to two representative state-of-the-art gradient inversion attacks. Furthermore, AGIC is faster than the state-of-the-art simulation-based attack, e.g., it is 5x faster when attacking FedAvg with 8 local steps in between model updates.

preprint 2022 arXiv

Analyzing and Mitigating Interference in Neural Architecture Search

Weight sharing is a popular approach to reduce the cost of neural architecture search (NAS) by reusing the weights of shared operators from previously trained child models. However, the rank correlation between the estimated accuracy and ground truth accuracy of those child models is low due to the interference among different child models caused by weight sharing. In this paper, we investigate the interference issue by sampling different child models and calculating the gradient similarity of shared operators, and observe: 1) the interference on a shared operator between two child models is positively correlated with the number of different operators; 2) the interference is smaller when the inputs and outputs of the shared operator are more similar. Inspired by these two observations, we propose two approaches to mitigate the interference: 1) MAGIC-T: rather than randomly sampling child models for optimization, we propose a gradual modification scheme by modifying one operator between adjacent optimization steps to minimize the interference on the shared operators; 2) MAGIC-A: forcing the inputs and outputs of the operator across all child models to be similar to reduce the interference. Experiments on a BERT search space verify that mitigating interference via each of our proposed methods improves the rank correlation of the super-net, and that combining both methods achieves better results. Our discovered architecture outperforms RoBERTa$_{\rm base}$ by 1.1 and 0.6 points and ELECTRA$_{\rm base}$ by 1.6 and 1.1 points on the dev and test sets of the GLUE benchmark. Extensive results on the BERT compression, reading comprehension and ImageNet tasks demonstrate the effectiveness and generality of our proposed methods.

preprint 2022 arXiv

CLS: Cross Labeling Supervision for Semi-Supervised Learning

It is well known that the success of deep neural networks is greatly attributed to large-scale labeled datasets. However, it can be extremely time-consuming and laborious to collect sufficient high-quality labeled data in most practical applications. Semi-supervised learning (SSL) provides an effective solution to reduce the cost of labeling by simultaneously leveraging both labeled and unlabeled data. In this work, we present Cross Labeling Supervision (CLS), a framework that generalizes the typical pseudo-labeling process. Based on FixMatch, where a pseudo label is generated from a weakly-augmented sample to teach the prediction on a strong augmentation of the same input sample, CLS allows the creation of both pseudo and complementary labels to support both positive and negative learning. To mitigate the confirmation bias of self-labeling and boost the tolerance to false labels, two differently initialized networks with the same structure are trained simultaneously. Each network utilizes high-confidence labels from the other network as additional supervision signals. During the label generation phase, adaptive sample weights are assigned to artificial labels according to their prediction confidence. The sample weight plays two roles: it quantifies the generated labels' quality and reduces the disruption of inaccurate labels on network training. Experimental results on the semi-supervised classification task show that our framework outperforms existing approaches by large margins on the CIFAR-10 and CIFAR-100 datasets.

preprint 2022 arXiv

ECO v1: Towards Event-Centric Opinion Mining

Events are considered the fundamental building blocks of the world. Mining event-centric opinions can benefit decision making, interpersonal communication, and social good. Unfortunately, there is little literature addressing event-centric opinion mining, even though it diverges significantly from the well-studied entity-centric opinion mining in connotation, structure, and expression. In this paper, we propose and formulate the task of event-centric opinion mining based on event-argument structure and expression categorizing theory. We also benchmark this task by constructing a pioneering corpus and designing a two-step benchmark framework. Experiment results show that event-centric opinion mining is feasible and challenging, and the proposed task, dataset, and baselines are beneficial for future studies.

preprint 2022 arXiv

Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval

With the recent boom of video-based social platforms (e.g., YouTube and TikTok), video retrieval using sentence queries has become an important demand and attracts increasing research attention. Despite the decent performance, existing text-video retrieval models in vision and language communities are impractical for large-scale Web search because they adopt brute-force search based on high-dimensional embeddings. To improve efficiency, Web search engines widely apply vector compression libraries (e.g., FAISS) to post-process the learned embeddings. Unfortunately, separate compression from feature encoding degrades the robustness of representations and incurs performance decay. To pursue a better balance between performance and efficiency, we propose the first quantized representation learning method for cross-view video retrieval, namely Hybrid Contrastive Quantization (HCQ). Specifically, HCQ learns both coarse-grained and fine-grained quantizations with transformers, which provide complementary understandings for texts and videos and preserve comprehensive semantic information. By performing Asymmetric-Quantized Contrastive Learning (AQ-CL) across views, HCQ aligns texts and videos at coarse-grained and multiple fine-grained levels. This hybrid-grained learning strategy serves as strong supervision on the cross-view video quantization model, where contrastive learning at different levels can be mutually promoted. Extensive experiments on three Web video benchmark datasets demonstrate that HCQ achieves competitive performance with state-of-the-art non-compressed retrieval methods while showing high efficiency in storage and computation. Code and configurations are available at https://github.com/gimpong/WWW22-HCQ.

preprint 2022 arXiv

libRoadRunner 2.0: A High-Performance SBML Simulation and Analysis Library

Motivation: This paper presents libRoadRunner 2.0, an extensible, high-performance, cross-platform, open-source software library for the simulation and analysis of models expressed using Systems Biology Markup Language (SBML). Results: libRoadRunner is a self-contained library, able to run both as a component inside other tools via its C++ and C bindings, and interactively through its Python or Julia interface. libRoadRunner uses a custom Just-In-Time (JIT) compiler built on the widely used LLVM JIT compiler framework. It compiles SBML-specified models directly into native machine code for a large variety of processors, making it appropriate for solving extremely large models or repeated runs. libRoadRunner is flexible, supporting the bulk of the SBML specification (except for delay and nonlinear algebraic equations) and including several SBML extensions such as composition and distributions. It offers multiple deterministic and stochastic integrators, as well as tools for steady-state, sensitivity, stability analysis, and structural analysis of the stoichiometric matrix. Availability: libRoadRunner binary distributions are available for Mac OS X, Linux, and Windows. The library is licensed under the Apache License Version 2.0. libRoadRunner is also available for ARM-based computers such as the Raspberry Pi and can in principle be compiled on any system supported by LLVM-13. http://sys-bio.github.io/roadrunner/index.html provides online documentation, full build instructions, binaries, and a git source repository.

preprint 2022 arXiv

Matrix Multiplication with Less Arithmetic Complexity and IO Complexity

After Strassen presented the first sub-cubic matrix multiplication algorithm, many Strassen-like algorithms have been presented. Most of those with low asymptotic cost have large hidden leading coefficients and are thus impractical. To reduce the leading coefficient, Cenk and Hasan give a general approach that reduces the leading coefficient of the $\langle 2,2,2;7\rangle$-algorithm to $5$ but increases IO complexity. In 2017, Karstadt and Schwartz also reduce the leading coefficient of the $\langle 2,2,2;7\rangle$-algorithm to $5$ by the Alternative Basis Matrix Multiplication method. Meanwhile, their method reduces the IO complexity and low-order monomials in arithmetic complexity. In 2019, Beniamini and Schwartz generalize the Alternative Basis Matrix Multiplication method, reducing the leading coefficient in arithmetic complexity but increasing IO complexity. In this paper, we propose a new matrix multiplication algorithm which reduces the leading coefficient in both arithmetic complexity and IO complexity. We apply our method to Strassen-like algorithms, improving arithmetic complexity and IO complexity (the comparison with previous results is shown in Tables 1 and 2). Surprisingly, the IO complexity of our $\langle 3,3,3;23\rangle$-algorithm is $14n^{\log_3 23}M^{-\frac{1}{2}} + o(n^{\log_3 23})$, which breaks Ballard's IO complexity lower bound ($\Omega(n^{\log_3 23}M^{1-\frac{\log_3 23}{2}})$) for recursive Strassen-like algorithms.
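
For readers unfamiliar with the 2x2-block, seven-multiplication scheme that this abstract starts from, the following is a minimal sketch of the classical Strassen recursion. It assumes square matrices whose size is a power of two, stored as nested Python lists. It is the textbook form with 18 block additions per recursion step, not the reduced-coefficient variants or the new algorithm described in the paper.

```python
def add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def sub(A, B):
    return [[a - b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def split(A):
    """Split a square matrix into four equally sized quadrants."""
    h = len(A) // 2
    return ([r[:h] for r in A[:h]], [r[h:] for r in A[:h]],
            [r[:h] for r in A[h:]], [r[h:] for r in A[h:]])

def strassen(A, B):
    """Multiply two n x n matrices (n a power of two) with 7 recursive products."""
    if len(A) == 1:
        return [[A[0][0] * B[0][0]]]
    a11, a12, a21, a22 = split(A)
    b11, b12, b21, b22 = split(B)
    m1 = strassen(add(a11, a22), add(b11, b22))
    m2 = strassen(add(a21, a22), b11)
    m3 = strassen(a11, sub(b12, b22))
    m4 = strassen(a22, sub(b21, b11))
    m5 = strassen(add(a11, a12), b22)
    m6 = strassen(sub(a21, a11), add(b11, b12))
    m7 = strassen(sub(a12, a22), add(b21, b22))
    c11 = add(sub(add(m1, m4), m5), m7)
    c12 = add(m3, m5)
    c21 = add(m2, m4)
    c22 = add(add(sub(m1, m2), m3), m6)
    # Reassemble the four quadrants into one matrix.
    top = [left + right for left, right in zip(c11, c12)]
    bottom = [left + right for left, right in zip(c21, c22)]
    return top + bottom

def naive(A, B):
    """Reference cubic-time product used only to check the result."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

A = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
B = [[2, 0, 1, 3], [1, 1, 0, 2], [0, 4, 2, 1], [3, 2, 1, 0]]
C = strassen(A, B)
```

Each level replaces eight block products with seven, which gives the sub-cubic exponent log2(7); the block additions are what determine the leading coefficient that the works cited in the abstract try to reduce.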

preprint 2022 arXiv

Procedural Text Understanding via Scene-Wise Evolution

Procedural text understanding requires machines to reason about entity states within the dynamical narratives. Current procedural text understanding approaches are commonly entity-wise, which separately track each entity and independently predict different states of each entity. Such an entity-wise paradigm does not consider the interaction between entities and their states. In this paper, we propose a new scene-wise paradigm for procedural text understanding, which jointly tracks states of all entities in a scene-by-scene manner. Based on this paradigm, we propose Scene Graph Reasoner (SGR), which introduces a series of dynamically evolving scene graphs to jointly formulate the evolution of entities, states and their associations throughout the narrative. In this way, the deep interactions between all entities and states can be jointly captured and simultaneously derived from scene graphs. Experiments show that SGR not only achieves the new state-of-the-art performance but also significantly accelerates the speed of reasoning.

preprint 2022 arXiv

Simultaneous control of spectral and directional emissivity with gradient epsilon-near-zero InAs photonic structures

Controlling both the spectral bandwidth and directional range of emitted thermal radiation is a fundamental challenge in modern photonics and materials research. Recent work has shown that materials with a spatial gradient in their epsilon-near-zero (ENZ) response can support broad spectrum directionality in their emissivity, enabling high radiance to specific angles of incidence. However, this capability has been limited spectrally and directionally by the availability of materials supporting phonon-polariton resonances over long-wave infrared wavelengths. Here, we design and experimentally demonstrate an approach using doped III-V semiconductors that can simultaneously tailor spectral peak, bandwidth and directionality of infrared emissivity. We epitaxially grow and characterize InAs-based gradient ENZ photonic structures that exhibit broadband directional emission with varying spectral bandwidths and peak directions as a function of their doping concentration profile and thickness. Due to its easy-to-fabricate geometry, we believe this approach provides a versatile photonic platform to dynamically control broadband spectral and directional emissivity for a range of emerging applications.

preprint 2021 arXiv

MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition

In this paper, we propose MixSpeech, a simple yet effective data augmentation method based on mixup for automatic speech recognition (ASR). MixSpeech trains an ASR model by taking a weighted combination of two different speech features (e.g., mel-spectrograms or MFCC) as the input, and recognizing both text sequences, where the two recognition losses use the same combination weight. We apply MixSpeech on two popular end-to-end speech recognition models including LAS (Listen, Attend and Spell) and Transformer, and conduct experiments on several low-resource datasets including TIMIT, WSJ, and HKUST. Experimental results show that MixSpeech achieves better accuracy than the baseline models without data augmentation, and outperforms a strong data augmentation method SpecAugment on these recognition tasks. Specifically, MixSpeech outperforms SpecAugment with a relative PER improvement of 10.6% on the TIMIT dataset, and achieves a strong WER of 4.7% on the WSJ dataset.
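
The mixing rule described above (one weight applied to both the input features and the two recognition losses) can be written down in a few lines. This is a minimal sketch under stated assumptions: features are equally shaped nested lists (any padding is assumed to happen elsewhere), `model` and `loss_fn` are hypothetical callables supplied by the caller, and how the weight `lam` is sampled is left open because the abstract does not say.

```python
def mix_features(x1, x2, lam):
    """Element-wise weighted combination of two equally shaped feature matrices."""
    return [[lam * a + (1.0 - lam) * b for a, b in zip(r1, r2)]
            for r1, r2 in zip(x1, x2)]

def mixed_loss(model, loss_fn, x1, y1, x2, y2, lam):
    """Run the model once on the mixed input and weight the two losses by lam."""
    output = model(mix_features(x1, x2, lam))
    return lam * loss_fn(output, y1) + (1.0 - lam) * loss_fn(output, y2)

# Toy stand-ins (assumptions for illustration only): the "model" sums all
# features and the "loss" is a squared error against a scalar target.
def toy_model(x):
    return sum(sum(row) for row in x)

def toy_loss(output, target):
    return (output - target) ** 2

x1, y1 = [[1.0, 2.0]], 5.0
x2, y2 = [[3.0, 4.0]], 7.0
loss = mixed_loss(toy_model, toy_loss, x1, y1, x2, y2, lam=0.5)
```

With `lam=0.5` the mixed input is `[[2.0, 3.0]]`, the toy model outputs 5.0, and the combined loss is 0.5 * 0 + 0.5 * 4 = 2.0. In a real ASR system the two losses would be sequence losses over the two transcripts rather than scalar errors.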

preprint 2021 arXiv

Studentized Permutation Method for Comparing Restricted Mean Survival Times with Small Sample from Randomized Trials

Recent observations, especially in cancer immunotherapy clinical trials with time-to-event outcomes, show that the commonly used proportional hazards assumption is often not justifiable, hampering an appropriate analysis of the data by hazard ratios. An attractive alternative advocated is given by the restricted mean survival time (RMST), which does not rely on any model assumption and can always be interpreted intuitively. As pointed out recently by Horiguchi and Uno (2020), methods for the RMST based on asymptotic theory suffer from inflated type-I error under small sample sizes. To overcome this problem, they suggested a permutation strategy leading to more convincing results in simulations. However, their proposal requires an exchangeable data set-up between comparison groups which may be limiting in practice. In addition, it is not possible to invert their testing procedure to obtain valid confidence intervals, which can provide more in-depth information. In this paper, we address these limitations by proposing a studentized permutation test as well as the corresponding permutation-based confidence intervals. In our extensive simulation study, we demonstrate the advantage of our new method, especially in situations with relatively small sample sizes and unbalanced groups. Finally, we illustrate the application of the proposed method by re-analysing data from a recent lung cancer clinical trial.
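
The RMST itself is the area under the survival curve up to a truncation time. The sketch below computes it from a Kaplan-Meier estimate in plain Python. The function names and the toy data are assumptions made for illustration; the studentized permutation test and the confidence intervals proposed in the paper are not implemented here.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct event time.

    events[i] is 1 if the event was observed at times[i], 0 if censored.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        observed = 0
        leaving = 0
        # Group all subjects sharing the same time point.
        while i < len(data) and data[i][0] == t:
            observed += data[i][1]
            leaving += 1
            i += 1
        if observed > 0:
            survival *= 1.0 - observed / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve

def rmst(times, events, tau):
    """Area under the Kaplan-Meier step function on [0, tau]."""
    area = 0.0
    prev_t, prev_s = 0.0, 1.0
    for t, s in kaplan_meier(times, events):
        if t >= tau:
            break
        area += prev_s * (t - prev_t)
        prev_t, prev_s = t, s
    return area + prev_s * (tau - prev_t)

# Toy groups: without censoring, RMST equals the mean of min(T, tau).
uncensored = rmst([1, 2, 3, 4], [1, 1, 1, 1], tau=4)
censored = rmst([1, 2, 3], [1, 0, 1], tau=3)
```

A two-group comparison would use the difference `rmst(group_a) - rmst(group_b)` as the effect estimate; a studentized version additionally divides by an estimate of its standard error, which this sketch leaves out.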

preprint 2020 arXiv

A Bayesian Framework for Nash Equilibrium Inference in Human-Robot Parallel Play

We consider shared workspace scenarios with humans and robots acting to achieve independent goals, termed parallel play. We model these as general-sum games and construct a framework that utilizes the Nash equilibrium solution concept to consider the interactive effect of both agents while planning. We find multiple Pareto-optimal equilibria in these tasks. We hypothesize that people act by choosing an equilibrium based on social norms and their personalities. To enable coordination, we infer the equilibrium online using a probabilistic model that includes these two factors and use it to select the robot's action. We apply our approach to a close-proximity pick-and-place task involving a robot and a simulated human with three potential behaviors: defensive, selfish, and norm-following. We showed that using a Bayesian approach to infer the equilibrium enables the robot to complete the task with less than half the number of collisions while also reducing the task execution time as compared to the best baseline. We also performed a study with human participants interacting either with other humans or with different robot agents and observed that our proposed approach performs similarly to human-human parallel play interactions. The code is available at https://github.com/shray/bayes-nash

preprint 2020 arXiv

Cognitive Representation Learning of Self-Media Online Article Quality

The automatic quality assessment of self-media online articles is an urgent and new issue, which is of great value to the online recommendation and search. Different from traditional and well-formed articles, self-media online articles are mainly created by users, which have the appearance characteristics of different text levels and multi-modal hybrid editing, along with the potential characteristics of diverse content, different styles, large semantic spans and good interactive experience requirements. To solve these challenges, we establish a joint model CoQAN in combination with the layout organization, writing characteristics and text semantics, designing different representation learning subnetworks, especially for the feature learning process and interactive reading habits on mobile terminals. It is more consistent with the cognitive style of expressing an expert's evaluation of articles. We have also constructed a large scale real-world assessment dataset. Extensive experimental results show that the proposed framework significantly outperforms state-of-the-art methods, and effectively learns and integrates different factors of the online article quality assessment.

preprint 2020 arXiv

LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition

Speech synthesis (text to speech, TTS) and recognition (automatic speech recognition, ASR) are important speech tasks, and require a large amount of text and speech pairs for model training. However, there are more than 6,000 languages in the world and most languages lack speech training data, which poses significant challenges when building TTS and ASR systems for extremely low-resource languages. In this paper, we develop LRSpeech, a TTS and ASR system under the extremely low-resource setting, which can support rare languages with low data cost. LRSpeech consists of three key techniques: 1) pre-training on rich-resource languages and fine-tuning on low-resource languages; 2) dual transformation between TTS and ASR to iteratively boost the accuracy of each other; 3) knowledge distillation to customize the TTS model on a high-quality target-speaker voice and improve the ASR model on multiple voices. We conduct experiments on an experimental language (English) and a truly low-resource language (Lithuanian) to verify the effectiveness of LRSpeech. Experimental results show that LRSpeech 1) achieves high quality for TTS in terms of both intelligibility (more than 98% intelligibility rate) and naturalness (above 3.5 mean opinion score (MOS)) of the synthesized speech, which satisfies the requirements for industrial deployment, 2) achieves promising recognition accuracy for ASR, and 3) last but not least, uses extremely low-resource training data. We also conduct comprehensive analyses on LRSpeech with different amounts of data resources, and provide valuable insights and guidance for industrial deployment. We are currently deploying LRSpeech into a commercialized cloud speech service to support TTS on more rare languages.

preprint 2020 arXiv

MetaFun: Meta-Learning with Iterative Functional Updates

We develop a functional encoder-decoder approach to supervised meta-learning, where labeled data is encoded into an infinite-dimensional functional representation rather than a finite-dimensional one. Furthermore, rather than directly producing the representation, we learn a neural update rule resembling functional gradient descent which iteratively improves the representation. The final representation is used to condition the decoder to make predictions on unlabeled data. Our approach is the first to demonstrate the success of encoder-decoder style meta-learning methods like conditional neural processes on large-scale few-shot classification benchmarks such as miniImageNet and tieredImageNet, where it achieves state-of-the-art performance.

preprint 2020 arXiv

MultiSpeech: Multi-Speaker Text to Speech with Transformer

Transformer-based text to speech (TTS) models (e.g., Transformer TTS, FastSpeech) have shown advantages in training and inference efficiency over RNN-based models (e.g., Tacotron) due to their parallel computation in training and/or inference. However, the parallel computation increases the difficulty of learning the alignment between text and speech in Transformer, which is further magnified in the multi-speaker scenario with noisy data and diverse speakers, and hinders the applicability of Transformer for multi-speaker TTS. In this paper, we develop a robust and high-quality multi-speaker Transformer TTS system called MultiSpeech, with several specially designed components/techniques to improve text-to-speech alignment: 1) a diagonal constraint on the weight matrix of encoder-decoder attention in both training and inference; 2) layer normalization on phoneme embedding in encoder to better preserve position information; 3) a bottleneck in decoder pre-net to prevent copy between consecutive speech frames. Experiments on VCTK and LibriTTS multi-speaker datasets demonstrate the effectiveness of MultiSpeech: 1) it synthesizes more robust and better quality multi-speaker voice than naive Transformer based TTS; 2) with a MultiSpeech model as the teacher, we obtain a strong multi-speaker FastSpeech model with almost zero quality degradation while enjoying extremely fast inference speed.

preprint 2020 arXiv

Multivariate Relations Aggregation Learning in Social Networks

Multivariate relations are general in various types of networks, such as biological networks, social networks, transportation networks, and academic networks. Due to the principle of ternary closures and the trend of group formation, the multivariate relationships in social networks are complex and rich. Therefore, in graph learning tasks of social networks, the identification and utilization of multivariate relationship information are more important. Existing graph learning methods are based on the neighborhood information diffusion mechanism, which often leads to partial omission or even lack of multivariate relationship information, and ultimately affects the accuracy and execution efficiency of the task. To address these challenges, this paper proposes the multivariate relationship aggregation learning (MORE) method, which can effectively capture the multivariate relationship information in the network environment. By aggregating node attribute features and structural features, MORE achieves higher accuracy and faster convergence speed. We conducted experiments on one citation network and five social networks. The experimental results show that the MORE model has higher accuracy than the GCN (Graph Convolutional Network) model in node classification tasks, and can significantly reduce time cost.

preprint 2020 arXiv

OFFER: A Motif Dimensional Framework for Network Representation Learning

Aiming at better representing multivariate relationships, this paper investigates a motif dimensional framework for higher-order graph learning. The graph learning effectiveness can be improved through OFFER. The proposed framework mainly aims at accelerating and improving higher-order graph learning results. We apply the acceleration procedure from the dimension of network motifs. Specifically, the refinement uses the motif degrees of nodes and edges in two stages: (1) employ motif degree of nodes to refine the adjacency matrix of the network; and (2) employ motif degree of edges to refine the transition probability matrix in the learning process. In order to assess the efficiency of the proposed framework, four popular network representation algorithms are modified and examined. By evaluating the performance of OFFER, both link prediction results and clustering results demonstrate that the graph representation learning algorithms enhanced with OFFER consistently outperform the original algorithms with higher efficiency.

preprint 2020 arXiv

On Competitive Analysis for Polling Systems

Polling systems have been widely studied; however, most of these studies focus on polling systems with renewal processes for arrivals and random variables for service times. There is a need driven by practical applications to study polling systems with arbitrary arrivals (not restricted to time-varying or in batches) and revealed service time upon a job's arrival. To address that need, our work considers a polling system with a generic setting and for the first time provides the worst-case analysis for online scheduling policies in this system. We provide conditions for the existence of constant competitive ratios, and competitive lower bounds for general scheduling policies in polling systems. Our work also bridges the queueing and scheduling communities by proving the competitive ratios for several well-studied policies in the queueing literature, such as cyclic policies with exhaustive, gated or l-limited service disciplines for polling systems.

preprint 2020 arXiv

Peak Age of Information in Priority Queueing Systems

We consider a priority queueing system where a single processor serves k classes of packets that are generated randomly following Poisson processes. Our objective is to compute the expected Peak Age of Information (PAoI) under various scenarios. In particular, we consider two situations where the buffer size at each queue is one and infinite, and in the infinite buffer size case we consider First Come First Serve (FCFS) and Last Come First Serve (LCFS) as service disciplines. For the system with buffer size one at each queue, we derive PAoI exactly for the case of exponential service time and bounds (which are excellent approximations) for the case of general service time, with small k. For the system with infinite buffer size, we provide closed-form expressions of PAoI for both FCFS and LCFS where service time is general and k could be large. Using those results we investigated the effect of ordering of priorities and service disciplines for the various scenarios. We perform extensive numerical studies to validate our results and develop insights.

preprint 2020 arXiv

Weak Supervision for Fake News Detection via Reinforcement Learning

Today social media has become the primary source for news. Via social media platforms, fake news travels at unprecedented speeds, reaches global audiences and puts users and communities at great risk. Therefore, it is extremely important to detect fake news as early as possible. Recently, deep learning based approaches have shown improved performance in fake news detection. However, the training of such models requires a large amount of labeled data, and manual annotation is time-consuming and expensive. Moreover, due to the dynamic nature of news, annotated samples may become outdated quickly and cannot represent the news articles on newly emerged events. Therefore, how to obtain fresh and high-quality labeled samples is the major challenge in employing deep learning models for fake news detection. In order to tackle this challenge, we propose a reinforced weakly-supervised fake news detection framework, i.e., WeFEND, which can leverage users' reports as weak supervision to enlarge the amount of training data for fake news detection. The proposed framework consists of three main components: the annotator, the reinforced selector and the fake news detector. The annotator can automatically assign weak labels for unlabeled news based on users' reports. The reinforced selector, using reinforcement learning techniques, chooses high-quality samples from the weakly labeled data and filters out those low-quality ones that may degrade the detector's prediction performance. The fake news detector aims to identify fake news based on the news content. We tested the proposed framework on a large collection of news articles published via WeChat official accounts and associated user reports. Extensive experiments on this dataset show that the proposed WeFEND model achieves the best performance compared with the state-of-the-art methods.

preprint2020arXiv

Whole-Body Human Pose Estimation in the Wild

This paper investigates the task of 2D human whole-body pose estimation, which aims to localize dense landmarks on the entire human body including face, hands, body, and feet. As existing datasets do not have whole-body annotations, previous methods have to assemble different deep models trained independently on different datasets of the human face, hand, and body, struggling with dataset biases and large model complexity. To fill this gap, we introduce COCO-WholeBody, which extends the COCO dataset with whole-body annotations. To the best of our knowledge, it is the first benchmark that has manual annotations on the entire human body, including 133 dense landmarks with 68 on the face, 42 on hands and 23 on the body and feet. A single-network model, named ZoomNet, is devised to take into account the hierarchical structure of the full human body to solve the scale variation of different body parts of the same person. ZoomNet is able to significantly outperform existing methods on the proposed COCO-WholeBody dataset. Extensive experiments show that COCO-WholeBody not only can be used to train deep models from scratch for whole-body pose estimation but also can serve as a powerful pre-training dataset for many different tasks such as facial landmark detection and hand keypoint estimation. The dataset is publicly available at https://github.com/jin-s13/COCO-WholeBody.

preprint2016arXiv

Maximum Leaf Spanning Trees of Growing Sierpinski Networks Models

The dynamical phenomena of complex networks are very difficult to predict from local information due to the rich microstructures and corresponding complex dynamics. On the other hand, it is very laborious to compute some stochastic parameters of a large network having thousands upon thousands of nodes. We design several recursive algorithms for finding spanning trees having maximal leaves (MLS-trees) in the investigation of topological structures of Sierpinski growing network models, and use MLS-trees to determine the kernels, dominating and balanced sets of the models. We propose a new stochastic method for the models, called the edge-cumulative distribution, and show that it obeys a power law distribution.

preprint2016arXiv

Probe Machine

A novel computing model, called \emph{Probe Machine}, is proposed in this paper. Different from Turing Machine, Probe Machine is a fully-parallel computing model in the sense that it can simultaneously process multiple pairs of data, rather than sequentially process every pair of linearly-adjacent data. In this paper, we establish the mathematical model of Probe Machine as a 9-tuple consisting of data library, probe library, data controller, probe controller, probe operation, computing platform, detector, true solution storage, and residue collector. We analyze the computation capability of the Probe Machine model, and in particular we show that Turing Machine is a special case of Probe Machine. We revisit two NP-complete problems---i.e., the Graph Coloring and Hamilton Cycle problems, and devise two algorithms on the basis of the established Probe Machine model, which can enumerate all solutions to each of these problems by only one probe operation. Furthermore, we show that Probe Machine can be implemented by leveraging the nano-DNA probe technologies. The computational power of an electronic computer based on Turing Machine is known to far exceed that of the human brain. A question naturally arises: will a future computer based on Probe Machine outperform the human brain in more ways beyond the computational power?

preprint2016arXiv

Theory on the Structure and Coloring of Maximal Planar Graphs (1)Recursion Formulae of Chromatic Polynomial and Four-Color Conjecture

In this paper, two recursion formulae for the chromatic polynomial of a maximal planar graph G are obtained. Moreover, the application of these formulae to the proof of the Four-Color Conjecture is investigated. By using these formulae, the proof of the Four-Color Conjecture boils down to the study of a special class of graphs, viz., 4-chromatic-funnel pseudo uniquely-4-colorable maximal planar graphs.

preprint2015arXiv

Growing Network Models Having Part Edges Removed/added Randomly

Since network motifs are an important property of networks, and some networks rewire, remove or add edges between old vertices before new vertices enter, we construct a non-randomized model N(t) and a randomized model N'(t) that have the predicated fixed subgraphs like motifs and satisfy both growth and preferential attachment, by means of the recursive algorithm from the lower levels of the so-called bound growing network models. To show the scale-free property of the randomized model N'(t), we design a new method, called edge-cumulative distribution, and demonstrate that the two edge-cumulative distributions of N(t) and N'(t) are equivalent to each other.

preprint2015arXiv

Improper Graceful and Odd-graceful Labellings of Graph Theory

In this paper we define some new labellings for trees, called the in-improper and out-improper odd-graceful labellings, such that some trees labelled with the new labellings can induce graceful graphs having at least one cycle. We next apply the new labellings to construct large-scale graphs having improper graceful/odd-graceful labellings or having graceful/odd-graceful labellings.

preprint2015arXiv

On uniquely 3-colorable plane graphs without prescribed adjacent faces

A graph $G$ is \emph{uniquely k-colorable} if the chromatic number of $G$ is $k$ and $G$ has only one $k$-coloring up to permutation of the colors. For a plane graph $G$, two faces $f_1$ and $f_2$ of $G$ are \emph{adjacent $(i,j)$-faces} if $d(f_1)=i$, $d(f_2)=j$ and $f_1$ and $f_2$ have a common edge, where $d(f)$ is the degree of a face $f$. In this paper, we prove that every uniquely 3-colorable plane graph has adjacent $(3,k)$-faces, where $k\leq 5$. The bound 5 for $k$ is best possible. Furthermore, we prove that there exists a class of uniquely 3-colorable plane graphs having neither adjacent $(3,i)$-faces nor adjacent $(3,j)$-faces, where $i,j\in \{3,4,5\}$ and $i \neq j$. One of our constructions implies that there exists an infinite family of edge-critical uniquely 3-colorable plane graphs with $n$ vertices and $\frac{7}{3}n-\frac{14}{3}$ edges, where $n(\geq 11)$ is odd and $n\equiv 2\pmod{3}$.

preprint2015arXiv

Time-Resolved Mass Sensing of a Molecular Adsorbate Nonuniformly Distributed Along a Nanomechnical String

We show that the particular distribution of mass deposited on the surface of a nanomechanical resonator can be estimated by tracking the evolution of the device's resonance frequencies during the process of desorption. The technique, which relies on analytical models we have developed for the multimodal response of the system, enables mass sensing at much higher levels of accuracy than is typically achieved with a single frequency-shift measurement and no rigorous knowledge of the mass profile. We report on a series of demonstration experiments, in which the explosive molecule 1,3,5-trinitroperhydro-1,3,5-triazine (RDX) is vapor deposited along the length of a silicon nitride nanostring to create a dense, random covering of RDX crystallites on the surface. In some cases, the deposition is biased to produce distributions with a slight excess or deficit of mass at the string midpoint. The added mass is then allowed to sublimate away under vacuum conditions, with the device returning to its original state over about 4 h (and the resonance frequencies, measured via optical interferometry, relaxing back to their pre-mass-deposition values). Our claim is that the detailed time trace of observed frequency shifts is rich in information---not only about the quantity of RDX initially deposited but also about its spatial arrangement along the nanostring. The data also reveal that sublimation in this case follows a nontrivial rate law, consistent with mass loss occurring at the exposed surface area of the RDX crystallites.

preprint2014arXiv

Active Dictionary Learning in Sparse Representation Based Classification

Sparse representation, which uses dictionary atoms to reconstruct input vectors, has been studied intensively in recent years. A proper dictionary is key to the success of sparse representation. In this paper, an active dictionary learning (ADL) method is introduced, in which classification error and reconstruction error are considered as the active learning criteria in the selection of atoms for dictionary construction. The learned dictionaries are calculated in sparse representation based classification (SRC). The classification accuracy and reconstruction error are used to evaluate the proposed dictionary learning method. The performance of the proposed dictionary learning method is compared with other methods, including unsupervised dictionary learning and whole-training-data dictionary. The experimental results based on the UCI data sets and a face data set demonstrate the efficiency of the proposed method.

preprint2014arXiv

Tree-colorable maximal planar graphs

A tree-coloring of a maximal planar graph is a proper vertex $4$-coloring such that every bichromatic subgraph, induced by this coloring, is a tree. A maximal planar graph $G$ is tree-colorable if $G$ has a tree-coloring. In this article, we prove that a tree-colorable maximal planar graph $G$ with $\delta(G)\geq 4$ contains at least four odd-vertices. Moreover, for a tree-colorable maximal planar graph of minimum degree 4 that contains exactly four odd-vertices, we show that the subgraph induced by its four odd-vertices is not a claw and contains no triangles.

preprint2013arXiv

Frustrating antiferromagnetic exchange interactions enhance specific valence-bond-pair motifs

We present variational results for the ground state of the antiferromagnetic quantum Heisenberg model with frustrating next-nearest-neighbour interactions. The trial wave functions employed are of resonating-valence-bond type, elaborated to account for various geometric motifs of adjacent bond pairs. The calculation is specialized to a square-lattice cluster consisting of just sixteen sites, large enough that the system can accommodate nontrivial singlet dimer correlations but small enough that exhaustive enumeration of states in the total spin zero sector is still feasible. A symbolic computation approach allows us to generate an algebraic expression for the expectation value of any observable and hence to carry out the energy optimization exactly. While we have no measurements that could unambiguously identify a spin liquid state in the controversial region at intermediate frustration, we can say that the bond-bond correlation factors that emerge do not appear to be consistent with the existence of a columnar valence bond crystal. Furthermore, our results suggest that the magnetically disordered region may accommodate two distinct phases.

preprint2013arXiv

Size of edge-critical uniquely 3-colorable planar graphs

A graph $G$ is \emph{uniquely k-colorable} if the chromatic number of $G$ is $k$ and $G$ has only one $k$-coloring up to permutation of the colors. A uniquely $k$-colorable graph $G$ is edge-critical if $G-e$ is not a uniquely $k$-colorable graph for any edge $e\in E(G)$. Mel'nikov and Steinberg [L. S. Mel'nikov, R. Steinberg, One counterexample for two conjectures on three coloring, Discrete Math. 20 (1977) 203-206] asked to find an exact upper bound for the number of edges in an edge-critical 3-colorable planar graph with $n$ vertices. In this paper, we give some properties of edge-critical uniquely 3-colorable planar graphs and prove that if $G$ is such a graph with $n(\geq6)$ vertices, then $|E(G)|\leq \frac{5}{2}n-6$, which improves the upper bound $\frac{8}{3}n-\frac{17}{3}$ given by Matsumoto [N. Matsumoto, The size of edge-critical uniquely 3-colorable planar graphs, Electron. J. Combin. 20 (3) (2013) $\#$P49]. Furthermore, we find some edge-critical 3-colorable planar graphs which have $n(=10,12, 14)$ vertices and $\frac{5}{2}n-7$ edges.

preprint2013arXiv

Two distinct spin liquid states in a layered cubic lattice

We construct a family of short-range resonating-valence-bond wave functions on a layered cubic lattice, allowing for a tunable anisotropy in the amplitudes assigned to nearest-neighbour valence bonds along one axis. Monte Carlo simulations reveal that four phases are stabilized over the full range of the anisotropy parameter. They are separated from one another by a sequence of continuous quantum phase transitions. An antiferromagnetic phase, centred on the perfect isotropy point, intervenes between two distinct quantum spin liquid states. One of them is continuously deformable to the two-dimensional U(1) spin liquid, which is known to exhibit critical bond correlations. The other has both spin and bond correlations that decay exponentially. The existence of this second phase is proof that, contrary to expectations, neither a bipartite lattice structure nor a conventional Marshall sign rule is an impediment to realizing a fully gapped quantum spin liquid.

preprint2012arXiv

Mathematical Proofs of Two Conjectures: The Four Color Problem and The Uniquely 4-colorable Planar Graph

The famous four color theorem states that for all planar graphs, every vertex can be assigned one of 4 colors such that no two adjacent vertices receive the same color. Francis Guthrie first conjectured it in 1852, but it was not until 1976 that Appel and Haken, with the aid of an electronic computer, first gave a proof by finding and verifying 1936 reducible unavoidable sets. A simplified proof by Robertson, Sanders, Seymour and Thomas in 1997 involved only 633 reducible unavoidable sets. Neither proof could be realized effectively by hand. Until now, finding the reducible unavoidable sets remains the only successful method to use, which came from Kempe's first "proof" of the four color problem in 1879. An alternative method involving only 4 reducible unavoidable sets for proving the four color theorem is used in this paper, which takes the form of a mathematical proof rather than a computer-assisted proof and proves both the four color conjecture and the uniquely 4-colorable planar graph conjecture by mathematical method.

preprint2012arXiv

Theory on Structure and Coloring of Maximal Planar Graphs (I): Relationship between Structure and Coloring

A maximal planar graph is a planar graph with the most edges, which means no more edges can be added so that the resulting graph is still planar. The Four-Color Conjecture says that every planar graph without loops is 4-colorable. Indeed, in order to prove the Four-Color Conjecture, it clearly suffices to show that all maximal planar graphs are 4-colorable. Since this conjecture was proposed in 1852, no mathematical proofs have been invented up until now. Maybe the main reasons lie in the following three aspects in terms of maximal planar graphs: not clearing up the structures, not figuring out the coloring types, and not straightening out the relation between structure and coloring. For this, we will write a series of articles to study the structure and coloring theory of maximal planar graphs systematically. This is our first article, which focuses mainly on the structure and coloring relations. First, we introduce a new way to construct maximal planar graphs. The advantage of this method is that it establishes an immediate relation with 4-colorings, and reveals how a given maximal planar graph is generated. Second, a special class of maximal planar graphs, namely recursive maximal planar graphs, is researched in depth, which lays a foundation for solving the uniquely 4-colorable planar graphs conjecture (see subsequent articles). Third, we discover an important mode for classifying 4-colorings: tree-coloring and cycle-coloring, which runs through the whole series of articles. Furthermore, this mode is applied to the research on an arbitrary 4-coloring and its corresponding structure of unions of two and three bicolored subgraphs. Finally, we introduce the concepts of black-white coloring and stamen phenomenon, and find a necessary and sufficient condition for an even cycle to be a 2-colorable cycle.

preprint2010arXiv

On Computing Upper Limits to Source Intensities

A common problem in astrophysics is determining how bright a source could be and still not be detected. Despite the simplicity with which the problem can be stated, the solution involves complex statistical issues that require careful analysis. In contrast to the confidence bound, this concept has never been formally analyzed, leading to a great variety of often ad hoc solutions. Here we formulate and describe the problem in a self-consistent manner. Detection significance is usually defined by the acceptable proportion of false positives (the Type I error), and we invoke the complementary concept of false negatives (the Type II error), based on the statistical power of a test, to compute an upper limit to the detectable source intensity. To determine the minimum intensity that a source must have for it to be detected, we first define a detection threshold, and then compute the probabilities of detecting sources of various intensities at the given threshold. The intensity that corresponds to the specified Type II error probability defines that minimum intensity, and is identified as the upper limit. Thus, an upper limit is a characteristic of the detection procedure rather than the strength of any particular source and should not be confused with confidence intervals or other estimates of source intensity. This is particularly important given the large number of catalogs that are being generated from increasingly sensitive surveys. We discuss the differences between these upper limits and confidence bounds. Both measures are useful quantities that should be reported in order to extract the most science from catalogs, though they answer different statistical questions: an upper bound describes an inference range on the source intensity, while an upper limit calibrates the detection process. We provide a recipe for computing upper limits that applies to all detection algorithms.

preprint2008arXiv

Capacity Bounds for Broadcast Channels with Confidential Messages

In this paper, we study capacity bounds for discrete memoryless broadcast channels with confidential messages. Two private messages as well as a common message are transmitted; the common message is to be decoded by both receivers, while each private message is only for its intended receiver. In addition, each private message is to be kept secret from the unintended receiver, where secrecy is measured by equivocation. We propose both inner and outer bounds to the rate equivocation region for broadcast channels with confidential messages. The proposed inner bound generalizes Csiszár and Körner's rate equivocation region for broadcast channels with a single confidential message, Liu et al.'s achievable rate region for broadcast channels with perfect secrecy, and Marton's and Gel'fand and Pinsker's achievable rate region for general broadcast channels. Our proposed outer bounds, together with the inner bound, help establish the rate equivocation region of several classes of discrete memoryless broadcast channels with confidential messages, including less noisy, deterministic, and semi-deterministic channels. Furthermore, specializing to the general broadcast channel by removing the confidentiality constraint, our proposed outer bounds reduce to new capacity outer bounds for the discrete memoryless broadcast channel.

41 published item(s)

XekRung Technical Report

Duality of Ryu-Takayanagi surfaces inside and outside the horizon

A Novel Markov Model for Near-Term Railway Delay Prediction

AGIC: Approximate Gradient Inversion Attack on Federated Learning

Analyzing and Mitigating Interference in Neural Architecture Search

CLS: Cross Labeling Supervision for Semi-Supervised Learning

ECO v1: Towards Event-Centric Opinion Mining

Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval

libRoadRunner 2.0: A High-Performance SBML Simulation and Analysis Library

Matrix Multiplication with Less Arithmetic Complexity and IO Complexity

Procedural Text Understanding via Scene-Wise Evolution

Simultaneous control of spectral and directional emissivity with gradient epsilon-near-zero InAs photonic structures

MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition

Studentized Permutation Method for Comparing Restricted Mean Survival Times with Small Sample from Randomized Trials

A Bayesian Framework for Nash Equilibrium Inference in Human-Robot Parallel Play

Cognitive Representation Learning of Self-Media Online Article Quality

LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition

MetaFun: Meta-Learning with Iterative Functional Updates

MultiSpeech: Multi-Speaker Text to Speech with Transformer

Multivariate Relations Aggregation Learning in Social Networks

OFFER: A Motif Dimensional Framework for Network Representation Learning

On Competitive Analysis for Polling Systems

Peak Age of Information in Priority Queueing Systems

Weak Supervision for Fake News Detection via Reinforcement Learning

Whole-Body Human Pose Estimation in the Wild

Maximum Leaf Spanning Trees of Growing Sierpinski Networks Models

Probe Machine

Theory on the Structure and Coloring of Maximal Planar Graphs (1)Recursion Formulae of Chromatic Polynomial and Four-Color Conjecture

Growing Network Models Having Part Edges Removed/added Randomly

Improper Graceful and Odd-graceful Labellings of Graph Theory

On uniquely 3-colorable plane graphs without prescribed adjacent faces

Time-Resolved Mass Sensing of a Molecular Adsorbate Nonuniformly Distributed Along a Nanomechnical String

Active Dictionary Learning in Sparse Representation Based Classification

Tree-colorable maximal planar graphs

Frustrating antiferromagnetic exchange interactions enhance specific valence-bond-pair motifs

Size of edge-critical uniquely 3-colorable planar graphs

Two distinct spin liquid states in a layered cubic lattice

Mathematical Proofs of Two Conjectures: The Four Color Problem and The Uniquely 4-colorable Planar Graph

Theory on Structure and Coloring of Maximal Planar Graphs (I): Relationship between Structure and Coloring

On Computing Upper Limits to Source Intensities

Capacity Bounds for Broadcast Channels with Confidential Messages