Source author record

Ke Xu

Ke Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

66works

28topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

WaferSAGE: Large Language Model-Powered Wafer Defect Analysis via Synthetic Data Generation and Rubric-Guided Reinforcement Learning

We present WaferSAGE, a framework for wafer defect visual question answering using small vision-language models. To address data scarcity in semiconductor manufacturing, we propose a three-stage synthesis pipeline incorporating structured rubric generation for precise evaluation. Starting from limited labeled wafer maps, we employ clustering-based cleaning to filter label noise, then generate comprehensive defect descriptions using vision-language models, which are converted into structured evaluation rubrics criteria. These rubrics guide the synthesis of VQA pairs, ensuring coverage across defect type identification, spatial distribution, morphology, and root cause analysis. Our dual assessment framework aligns rule-based metrics with LLM-Judge scores via Bayesian optimization, enabling reliable automated evaluation. Through curriculum-based reinforcement learning with Group Sequence Policy Optimization (GSPO) and rubric-aligned rewards, our 4B-parameter Qwen3-VL model achieves a 6.493 LLM-Judge score, closely approaching Gemini-3-Flash (7.149) while enabling complete on-premise deployment. We demonstrate that small models with domain-specific training can surpass proprietary large models in specialized industrial visual understanding, offering a viable path for privacy-preserving, cost-effective deployment in semiconductor manufacturing.

preprint2022arXiv

A Simple yet Effective Method for Graph Classification

In deep neural networks, better results can often be obtained by increasing the complexity of previously developed basic models. However, it is unclear whether there is a way to boost performance by decreasing the complexity of such models. Intuitively, given a problem, a simpler data structure comes with a simpler algorithm. Here, we investigate the feasibility of improving graph classification performance while simplifying the learning process. Inspired by structural entropy on graphs, we transform the data sample from graphs to coding trees, which is a simpler but essential structure for graph data. Furthermore, we propose a novel message passing scheme, termed hierarchical reporting, in which features are transferred from leaf nodes to root nodes by following the hierarchical structure of coding trees. We then present a tree kernel and a convolutional network to implement our scheme for graph classification. With the designed message passing scheme, the tree kernel and convolutional network have a lower runtime complexity of $O(n)$ than Weisfeiler-Lehman subtree kernel and other graph neural networks of at least $O(hm)$. We empirically validate our methods with several graph classification benchmarks and demonstrate that they achieve better performance and lower computational consumption than competing approaches.

preprint2022arXiv

Abnormally High Thermal Conductivity in Fivefold Twinned Diamond Nanowires

Fivefold twins (5FTs), discovered nearly 200 years ago, are a common multiply twinned structure that usually dramatically deteriorate the thermal transport properties of nanomaterials. Here, we report the anomalous thermal conductivity ($κ$) in a novel fivefold twinned diamond nanowires (5FT-DNWs). The $κ$ of 5FT-DNWs is effectively enhanced by the defects of 5FT boundaries, and non-monotonically changes with the cross-sectional area ($\textit{S}$). Above the critical $\textit{S}$ = 7.1 nm$^{2}$, 5FT-DNWs show a constant value of $κ$, whereas below it, there appears a sharp increase in $κ$ with decreasing $\textit{S}$. More importantly, 5FT-DNWs with minimal $\textit{S}$ show a superior $κ$ over the bulk diamond. By confirming the Normal-process-dominated scattering event, it is demonstrated that the phonon hydrodynamic behavior plays a determinative role in abnormally high $κ$ of 5FT-DNWs with small $\textit{S}$. The super-transported phonon hydrodynamic phenomenon unveiled in the twinned diamond nanowires may provide a new route for pursuing highly thermally conductive nanomaterials.

preprint2022arXiv

Adaptive Channel Encoding Transformer for Point Cloud Analysis

Transformer plays an increasingly important role in various computer vision areas and remarkable achievements have also been made in point cloud analysis. Since they mainly focus on point-wise transformer, an adaptive channel encoding transformer is proposed in this paper. Specifically, a channel convolution called Transformer-Conv is designed to encode the channel. It can encode feature channels by capturing the potential relationship between coordinates and features. Compared with simply assigning attention weight to each channel, our method aims to encode the channel adaptively. In addition, our network adopts the neighborhood search method of low-level and high-level dual semantic receptive fields to improve the performance. Extensive experiments show that our method is superior to state-of-the-art point cloud classification and segmentation methods on three benchmark datasets.

preprint2022arXiv

Asymmetric star formation triggered by gas inflow in a barred lenticular galaxy PGC 34107

Comparing to the inactive and gas-poor normal lenticular galaxies (S0s) in the local universe, we study a barred star-forming S0 galaxy, PGC 34107, which has been observed by the Centro Astronómico Hispano Alemán (CAHA) 3.5-m telescope and the Northern Extended Millimeter Array (NOEMA). The spatially resolved ionized gas and molecular gas traced by $^{12}$CO(1-0), hereafter CO(1-0), show the similar distribution and kinematics to the stellar component with an off-center star-forming region, $\sim$380 pc away from the center. The resolved kinematics of molecular CO(1-0) emission reveals that there is a blueshifted (redshifted) velocity component on the receding (approaching) side of the galaxy along the stellar bar. This might provide a plausible evidence of non-circular motion, such as the bar-induced molecular gas inflow. The velocity of molecular gas inflow decreases with approaching towards the peak of the off-center star formation in the north, which might be associated with the inner Lindblad resonance (ILR). In addition to CO(1-0), we also detect the isotopic line of $^{13}$CO(1-0). Most $\rm Hα$, CO(1-0) and $^{13}$CO(1-0) emissions are concentrated on this northern star-forming region. We find that PGC 34107 follows the local stellar mass-metallicity relation, star-forming main sequence, and the Kennicutt-Schmidt law. The resolved and integrated molecular gas main sequence suggest that there is a higher gas fraction in the galaxy central region, which supports a scenario that the bar-induced gas reservoir provides the raw material, and subsequently triggers the central star formation.

preprint2022arXiv

Bi-directional Object-context Prioritization Learning for Saliency Ranking

The saliency ranking task is recently proposed to study the visual behavior that humans would typically shift their attention over different objects of a scene based on their degrees of saliency. Existing approaches focus on learning either object-object or object-scene relations. Such a strategy follows the idea of object-based attention in Psychology, but it tends to favor those objects with strong semantics (e.g., humans), resulting in unrealistic saliency ranking. We observe that spatial attention works concurrently with object-based attention in the human visual recognition system. During the recognition process, the human spatial attention mechanism would move, engage, and disengage from region to region (i.e., context to context). This inspires us to model the region-level interactions, in addition to the object-level reasoning, for saliency ranking. To this end, we propose a novel bi-directional method to unify spatial attention and object-based attention for saliency ranking. Our model includes two novel modules: (1) a selective object saliency (SOS) module that models objectbased attention via inferring the semantic representation of the salient object, and (2) an object-context-object relation (OCOR) module that allocates saliency ranks to objects by jointly modeling the object-context and context-object interactions of the salient objects. Extensive experiments show that our approach outperforms existing state-of-theart methods. Our code and pretrained model are available at https://github.com/GrassBro/OCOR.

preprint2022arXiv

Dual-Neighborhood Deep Fusion Network for Point Cloud Analysis

Recently, deep neural networks have made remarkable achievements in 3D point cloud classification. However, existing classification methods are mainly implemented on idealized point clouds and suffer heavy degradation of per-formance on non-idealized scenarios. To handle this prob-lem, a feature representation learning method, named Dual-Neighborhood Deep Fusion Network (DNDFN), is proposed to serve as an improved point cloud encoder for the task of non-idealized point cloud classification. DNDFN utilizes a trainable neighborhood learning method called TN-Learning to capture the global key neighborhood. Then, the global neighborhood is fused with the local neighbor-hood to help the network achieve more powerful reasoning ability. Besides, an Information Transfer Convolution (IT-Conv) is proposed for DNDFN to learn the edge infor-mation between point-pairs and benefits the feature transfer procedure. The transmission of information in IT-Conv is similar to the propagation of information in the graph which makes DNDFN closer to the human reasoning mode. Extensive experiments on existing benchmarks especially non-idealized datasets verify the effectiveness of DNDFN and DNDFN achieves the state of the arts.

preprint2022arXiv

Exploring Memorization in Adversarial Training

Deep learning models have a propensity for fitting the entire training set even with random labels, which requires memorization of every training sample. In this paper, we explore the memorization effect in adversarial training (AT) for promoting a deeper understanding of model capacity, convergence, generalization, and especially robust overfitting of the adversarially trained models. We first demonstrate that deep networks have sufficient capacity to memorize adversarial examples of training data with completely random labels, but not all AT algorithms can converge under the extreme circumstance. Our study of AT with random labels motivates further analyses on the convergence and generalization of AT. We find that some AT approaches suffer from a gradient instability issue and most recently suggested complexity measures cannot explain robust generalization by considering models trained on random labels. Furthermore, we identify a significant drawback of memorization in AT that it could result in robust overfitting. We then propose a new mitigation algorithm motivated by detailed memorization analyses. Extensive experiments on various datasets validate the effectiveness of the proposed method.

preprint2022arXiv

Harmonizer: Learning to Perform White-Box Image and Video Harmonization

Recent works on image harmonization solve the problem as a pixel-wise image translation task via large autoencoders. They have unsatisfactory performances and slow inference speeds when dealing with high-resolution images. In this work, we observe that adjusting the input arguments of basic image filters, e.g., brightness and contrast, is sufficient for humans to produce realistic images from the composite ones. Hence, we frame image harmonization as an image-level regression problem to learn the arguments of the filters that humans use for the task. We present a Harmonizer framework for image harmonization. Unlike prior methods that are based on black-box autoencoders, Harmonizer contains a neural network for filter argument prediction and several white-box filters (based on the predicted arguments) for image harmonization. We also introduce a cascade regressor and a dynamic loss strategy for Harmonizer to learn filter arguments more stably and precisely. Since our network only outputs image-level arguments and the filters we used are efficient, Harmonizer is much lighter and faster than existing methods. Comprehensive experiments demonstrate that Harmonizer surpasses existing methods notably, especially with high-resolution inputs. Finally, we apply Harmonizer to video harmonization, which achieves consistent results across frames and 56 fps at 1080P resolution. Code and models are available at: https://github.com/ZHKKKe/Harmonizer.

preprint2022arXiv

Hierarchical information matters: Text classification via tree based graph neural network

Text classification is a primary task in natural language processing (NLP). Recently, graph neural networks (GNNs) have developed rapidly and been applied to text classification tasks. As a special kind of graph data, the tree has a simpler data structure and can provide rich hierarchical information for text classification. Inspired by the structural entropy, we construct the coding tree of the graph by minimizing the structural entropy and propose HINT, which aims to make full use of the hierarchical information contained in the text for the task of text classification. Specifically, we first establish a dependency parsing graph for each text. Then we designed a structural entropy minimization algorithm to decode the key information in the graph and convert each graph to its corresponding coding tree. Based on the hierarchical structure of the coding tree, the representation of the entire graph is obtained by updating the representation of non-leaf nodes in the coding tree layer by layer. Finally, we present the effectiveness of hierarchical information in text classification. Experimental results show that HINT outperforms the state-of-the-art methods on popular benchmarks while having a simple structure and few parameters.

preprint2022arXiv

Night-time Scene Parsing with a Large Real Dataset

Although huge progress has been made on scene analysis in recent years, most existing works assume the input images to be in day-time with good lighting conditions. In this work, we aim to address the night-time scene parsing (NTSP) problem, which has two main challenges: 1) labeled night-time data are scarce, and 2) over- and under-exposures may co-occur in the input night-time images and are not explicitly modeled in existing pipelines. To tackle the scarcity of night-time data, we collect a novel labeled dataset, named {\it NightCity}, of 4,297 real night-time images with ground truth pixel-level semantic annotations. To our knowledge, NightCity is the largest dataset for NTSP. In addition, we also propose an exposure-aware framework to address the NTSP problem through augmenting the segmentation process with explicitly learned exposure features. Extensive experiments show that training on NightCity can significantly improve NTSP performances and that our exposure-aware model outperforms the state-of-the-art methods, yielding top performances on our dataset as well as existing datasets.

preprint2022arXiv

Quantized Adaptive Subgradient Algorithms and Their Applications

Data explosion and an increase in model size drive the remarkable advances in large-scale machine learning, but also make model training time-consuming and model storage difficult. To address the above issues in the distributed model training setting which has high computation efficiency and less device limitation, there are still two main difficulties. On one hand, the communication costs for exchanging information, e.g., stochastic gradients among different workers, is a key bottleneck for distributed training efficiency. On the other hand, less parameter model is easy for storage and communication, but the risk of damaging the model performance. To balance the communication costs, model capacity and model performance simultaneously, we propose quantized composite mirror descent adaptive subgradient (QCMD adagrad) and quantized regularized dual average adaptive subgradient (QRDA adagrad) for distributed training. To be specific, we explore the combination of gradient quantization and sparse model to reduce the communication cost per iteration in distributed training. A quantized gradient-based adaptive learning rate matrix is constructed to achieve a balance between communication costs, accuracy, and model sparsity. Moreover, we theoretically find that a large quantization error brings in extra noise, which influences the convergence and sparsity of the model. Therefore, a threshold quantization strategy with a relatively small error is adopted in QCMD adagrad and QRDA adagrad to improve the signal-to-noise ratio and preserve the sparsity of the model. Both theoretical analyses and empirical results demonstrate the efficacy and efficiency of the proposed algorithms.

preprint2022arXiv

Structural Entropy Guided Graph Hierarchical Pooling

Following the success of convolution on non-Euclidean space, the corresponding pooling approaches have also been validated on various tasks regarding graphs. However, because of the fixed compression quota and stepwise pooling design, these hierarchical pooling methods still suffer from local structure damage and suboptimal problem. In this work, inspired by structural entropy, we propose a hierarchical pooling approach, SEP, to tackle the two issues. Specifically, without assigning the layer-specific compression quota, a global optimization algorithm is designed to generate the cluster assignment matrices for pooling at once. Then, we present an illustration of the local structure damage from previous methods in the reconstruction of ring and grid synthetic graphs. In addition to SEP, we further design two classification models, SEP-G and SEP-N for graph classification and node classification, respectively. The results show that SEP outperforms state-of-the-art graph pooling methods on graph classification benchmarks and obtains superior performance on node classifications.

preprint2022arXiv

The molecular gas resolved by ALMA in the low-metallicity dwarf merging galaxy Haro 11

The physical mechanisms for starburst or quenching in less massive ($M_* < 10^{10} M_{\odot}$) galaxies are unclear. The merger is one of the inescapable processes referred to as both starburst and quenching in massive galaxies. However, the effects of the merger on star formation in dwarf galaxies and their evolution results are still uncertain. We aim to explore how to trigger and quench star formation in dwarf galaxies by studying the metal-poor gas-rich dwarf mergers based on the multi-band observations at a spatial resolution of $\sim$ 460 pc. We use the archival data of ALMA (band 3, 8) and VLT/MUSE to map CO($J=$1-0), [CI]($^3$P$_1 - ^3$P$_0$), and H$α$ emission in one of the most extreme starburst merging dwarf galaxies, Haro 11. We find the molecular gas is assembled around the central two star-forming regions. The molecular/ionized gas and stellar components show complex kinematics, indicating that the gas is probably at a combined stage of collision of clouds and feedback from star formation. The peak location and distribution of [CI](1-0) strongly resemble the CO(1-0) emission, meaning that it might trace the same molecular gas as CO in such a dwarf merger starburst galaxy. The enhancement of line ratios ($\sim 0.5$) of [CI]/CO around knot C is probably generated by the dissociation of CO molecules by cosmic rays and far-ultraviolet photons. Globally, Haro 11 and its star-forming regions share similar SFEs as the high-$z$ starburst galaxies or the clumps in nearby (U)LIRGs. Given the high SFE, sSFR, small stellar mass, low metallicity, and deficient HI gas, Haro 11 could be an analog of high-$z$ dwarf starburst and the potential progenitor of the nearby less massive elliptical galaxies. The significantly smaller turbulent pressure and viral parameter will probably trigger the intense starbursts. We also predict that it will quench at $M_* < 8.5 \times 10^9 M_{\odot}$.

preprint2022arXiv

Tracing the Origin of Adversarial Attack for Forensic Investigation and Deterrence

Deep neural networks are vulnerable to adversarial attacks. In this paper, we take the role of investigators who want to trace the attack and identify the source, that is, the particular model which the adversarial examples are generated from. Techniques derived would aid forensic investigation of attack incidents and serve as deterrence to potential attacks. We consider the buyers-seller setting where a machine learning model is to be distributed to various buyers and each buyer receives a slightly different copy with same functionality. A malicious buyer generates adversarial examples from a particular copy $\mathcal{M}_i$ and uses them to attack other copies. From these adversarial examples, the investigator wants to identify the source $\mathcal{M}_i$. To address this problem, we propose a two-stage separate-and-trace framework. The model separation stage generates multiple copies of a model for a same classification task. This process injects unique characteristics into each copy so that adversarial examples generated have distinct and traceable features. We give a parallel structure which embeds a ``tracer'' in each copy, and a noise-sensitive training loss to achieve this goal. The tracing stage takes in adversarial examples and a few candidate models, and identifies the likely source. Based on the unique features induced by the noise-sensitive loss function, we could effectively trace the potential adversarial copy by considering the output logits from each tracer. Empirical results show that it is possible to trace the origin of the adversarial example and the mechanism can be applied to a wide range of architectures and datasets.

preprint2021arXiv

Personalized Graph Neural Networks with Attention Mechanism for Session-Aware Recommendation

The problem of session-aware recommendation aims to predict users' next click based on their current session and historical sessions. Existing session-aware recommendation methods have defects in capturing complex item transition relationships. Other than that, most of them fail to explicitly distinguish the effects of different historical sessions on the current session. To this end, we propose a novel method, named Personalized Graph Neural Networks with Attention Mechanism (A-PGNN) for brevity. A-PGNN mainly consists of two components: one is Personalized Graph Neural Network (PGNN), which is used to extract the personalized structural information in each user behavior graph, compared with the traditional Graph Neural Network (GNN) model, which considers the role of the user when the node embeddding is updated. The other is Dot-Product Attention mechanism, which draws on the Transformer net to explicitly model the effect of historical sessions on the current session. Extensive experiments conducted on two real-world data sets show that A-PGNN evidently outperforms the state-of-the-art personalized session-aware recommendation methods.

preprint2021arXiv

Self-Attention Attribution: Interpreting Information Interactions Inside Transformer

The great success of Transformer-based models benefits from the powerful multi-head self-attention mechanism, which learns token dependencies and encodes contextual information from the input. Prior work strives to attribute model decisions to individual input features with different saliency measures, but they fail to explain how these input features interact with each other to reach predictions. In this paper, we propose a self-attention attribution method to interpret the information interactions inside Transformer. We take BERT as an example to conduct extensive studies. Firstly, we apply self-attention attribution to identify the important attention heads, while others can be pruned with marginal performance degradation. Furthermore, we extract the most salient dependencies in each layer to construct an attribution tree, which reveals the hierarchical interactions inside Transformer. Finally, we show that the attribution results can be used as adversarial patterns to implement non-targeted attacks towards BERT.

preprint2020arXiv

Behavior variations and their implications for popularity promotions: From elites to mass in Weibo

The boom in social media with regard to producing and consuming information simultaneously implies the crucial role of online user influence in determining content popularity. In particular, understanding behavior variations between the influential elites and the mass grassroots is an important issue in communication. However, how their behavior varies across user categories and content domains, and how these differences influence content popularity are rarely addressed. From a novel view of seven content-domains, a detailed picture of behavior variations among five user groups, from both views of elites and mass, is drawn in Weibo, one of the most popular Twitter-like services in China. Interestingly, elites post more diverse contents with video links while the mass possess retweeters of higher loyalty. According to these variations, user-oriented actions of enhancing content popularity are discussed and testified. The most surprising finding is that the diversity of contents do not always bring more retweets, and the mass and elites should promote content popularity by increasing their retweeter counts and loyalty, respectively. Our results for the first time demonstrate the possibility of highly individualized strategies of popularity promotions in social media, instead of a universal principle.

preprint2020arXiv

Electronics-Free Pneumatic Logic Circuits for Localized Feedback Control of Multi-Actuator Soft Robots

The vision of creating entirely-soft robots capable of performing complex tasks will be accomplished only when the controllers required for autonomous operation can be fully implemented on soft components. Despite recent advances in compliant fluidic circuitry for mechanical signal processing, the applicability of this technology for soft robot control has been limited by complicated fabrication and tuning processes, and also the need for external signals such as clocks and digital references. We propose a method to develop pneumatic soft robots in which coordinated interactions between multiple actuators are performed using controllers implemented on components distributedly embedded in the soft structures of the system. In this approach, the notions of binary and multi-valued actuator logic states are introduced. In this way, the physical local dynamical couplings between the analog states of the actuators, established using soft valves of a new type, can be thought of as logic-gate-based mappings acting on discretized representations of the actuator states. Consequently, techniques for digital logic design can be applied to derive the architectures of the localized mechanical couplings that intelligently coordinate the oscillation patterns of the actuator responses. For the purposes of controller tuning, the soft valves are conceived so that their main physical parameters can be adjusted from the exterior of the robot through simple geometrical changes of the corresponding structural elements. To demonstrate the proposed approach, we present the development of a six-state locomoting soft robot.

preprint2020arXiv

Harvesting and Refining Question-Answer Pairs for Unsupervised QA

Question Answering (QA) has shown great success thanks to the availability of large-scale datasets and the effectiveness of neural models. Recent research works have attempted to extend these successes to the settings with few or no labeled data available. In this work, we introduce two approaches to improve unsupervised QA. First, we harvest lexically and syntactically divergent questions from Wikipedia to automatically construct a corpus of question-answer pairs (named as RefQA). Second, we take advantage of the QA model to extract more appropriate answers, which iteratively refines data over RefQA. We conduct experiments on SQuAD 1.1, and NewsQA by fine-tuning BERT without access to manually annotated data. Our approach outperforms previous unsupervised approaches by a large margin and is competitive with early supervised models. We also show the effectiveness of our approach in the few-shot learning setting.

preprint2020arXiv

Learning to Compare for Better Training and Evaluation of Open Domain Natural Language Generation Models

Automated evaluation of open domain natural language generation (NLG) models remains a challenge and widely used metrics such as BLEU and Perplexity can be misleading in some cases. In our paper, we propose to evaluate natural language generation models by learning to compare a pair of generated sentences by fine-tuning BERT, which has been shown to have good natural language understanding ability. We also propose to evaluate the model-level quality of NLG models with sample-level comparison results with skill rating system. While able to be trained in a fully self-supervised fashion, our model can be further fine-tuned with a little amount of human preference annotation to better imitate human judgment. In addition to evaluating trained models, we propose to apply our model as a performance indicator during training for better hyperparameter tuning and early-stopping. We evaluate our approach on both story generation and chit-chat dialogue response generation. Experimental results show that our model correlates better with human preference compared with previous automated evaluation approaches. Training with the proposed metric yields better performance in human evaluation, which further demonstrates the effectiveness of the proposed model.

preprint2020arXiv

Mechanical Metastructures of Triple Periodic Carbon Clathrates

Clathrates are lightweight, cage-like, fully-sp3 three dimensional (3D) structures that are experimentally-available for several host elements of the IV group. However, carbon clathrates are as yet hypothetical structures. Herein, the mechanical properties of Type-I-C46 Type-II-C34 and Type-H-C34 carbon clathrates are explored by first-principles calculations. It is revealed that those carbon clathrates show distinct anisotropic patterns in ideal tensile/shear strengths and critical tensile/shear strains, with maximum ideal tensile strength of Type-I carbon clathrate that is superior over that of diamond in <111> direction. However, it is identified isotropy in shear Youngs modulus, and in terms of tensile/shear Youngs moduli, they are sorted as Type-I > Type-II > Type-H carbon clathrates. There are distinct critical load-bearing bond configurations that explain their distinct mechanical behaviors. Moreover, those carbon clathrates are intrinsically indirect semiconductors, and their electronic properties can be greatly dictated by mechanical strain. Carbon clathrates can be potentially utilized as lightweight technically robust engineering metastructures and in electromechanical devices.

preprint2020arXiv

Off-Path TCP Exploits of the Mixed IPID Assignment

In this paper, we uncover a new off-path TCP hijacking attack that can be used to terminate victim TCP connections or inject forged data into victim TCP connections by manipulating the new mixed IPID assignment method, which is widely used in Linux kernel version 4.18 and beyond to help defend against TCP hijacking attacks. The attack has three steps. First, an off-path attacker can downgrade the IPID assignment for TCP packets from the more secure per-socket-based policy to the less secure hash-based policy, building a shared IPID counter that forms a side channel on the victim. Second, the attacker detects the presence of TCP connections by observing the shared IPID counter on the victim. Third, the attacker infers the sequence number and the acknowledgment number of the detected connection by observing the side channel of the shared IPID counter. Consequently, the attacker can completely hijack the connection, i.e., resetting the connection or poisoning the data stream. We evaluate the impacts of this off-path TCP attack in the real world. Our case studies of SSH DoS, manipulating web traffic, and poisoning BGP routing tables show its threat on a wide range of applications. Our experimental results show that our off-path TCP attack can be constructed within 215 seconds and the success rate is over 88%. Finally, we analyze the root cause of the exploit and develop a new IPID assignment method to defeat this attack. We prototype our defense in Linux 4.18 and confirm its effectiveness through extensive evaluation over real applications on the Internet.

preprint2020arXiv

Online monitoring for safe pedestrian-vehicle interactions

As autonomous systems begin to operate amongst humans, methods for safe interaction must be investigated. We consider an example of a small autonomous vehicle in a pedestrian zone that must safely maneuver around people in a free-form fashion. We investigate two key questions: How can we effectively integrate pedestrian intent estimation into our autonomous stack. Can we develop an online monitoring framework to give formal guarantees on the safety of such human-robot interactions. We present a pedestrian intent estimation framework that can accurately predict future pedestrian trajectories given multiple possible goal locations. We integrate this into a reachability-based online monitoring scheme that formally assesses the safety of these interactions with nearly real-time performance (approximately 0.3 seconds). These techniques are integrated on a test vehicle with a complete in-house autonomous stack, demonstrating effective and safe interaction in real-world experiments.

preprint2020arXiv

Oxygen Functionalization-induced Crossover in the Tensile Properties of thinnest 2D Ti2C MXene

Transition metal carbides/nitrides (MXenes) are a newly developing class of two-dimensional (2D) materials with technically robust properties that can be finely tuned by planar surface functionalization. Herein, the critical role of oxygen (O-) functionalization on the tensile mechanical characteristics of thinnest 2D Ti2C MXene is explored by molecular dynamic (MD) simulation with first-principle based ReaxFF forcefield. It is demonstrated that Ti2C sheet shows unique tensile mechanical behaviors that pronouncedly vary with the content of O-functionalization and stretching direction. Upon both loading directions, there is an apparent crossover in the Young's modulus, failure strength and failure strain. Intriguingly, under armchair directional load, a structural transition of 1T to 1T' phase occurs in the Ti2C region, which has been observed in many transition metal dichalcogenides. Upon zigzag directional straining, however, two distinct structural transformations take place in pristine and fully O-functionalized Ti2C sheets, respectively. As the load is removed, those three structural transformations are reversible, and they are critically understood by analysis of the bond configurations. The study provides important insights into mechanical behaviors and structural transformations of functionalized MXenes.

preprint2020arXiv

Self-Adversarial Learning with Comparative Discrimination for Text Generation

Conventional Generative Adversarial Networks (GANs) for text generation tend to have issues of reward sparsity and mode collapse that affect the quality and diversity of generated samples. To address the issues, we propose a novel self-adversarial learning (SAL) paradigm for improving GANs' performance in text generation. In contrast to standard GANs that use a binary classifier as its discriminator to predict whether a sample is real or generated, SAL employs a comparative discriminator which is a pairwise classifier for comparing the text quality between a pair of samples. During training, SAL rewards the generator when its currently generated sentence is found to be better than its previously generated samples. This self-improvement reward mechanism allows the model to receive credits more easily and avoid collapsing towards the limited number of real samples, which not only helps alleviate the reward sparsity issue but also reduces the risk of mode collapse. Experiments on text generation benchmark datasets show that our proposed approach substantially improves both the quality and the diversity, and yields more stable performance compared to the previous GANs for text generation.

preprint2020arXiv

Towards Ground Truth Explainability on Tabular Data

In data science, there is a long history of using synthetic data for method development, feature selection and feature engineering. Our current interest in synthetic data comes from recent work in explainability. Today's datasets are typically larger and more complex - requiring less interpretable models. In the setting of \textit{post hoc} explainability, there is no ground truth for explanations. Inspired by recent work in explaining image classifiers that does provide ground truth, we propose a similar solution for tabular data. Using copulas, a concise specification of the desired statistical properties of a dataset, users can build intuition around explainability using controlled data sets and experimentation. The current capabilities are demonstrated on three use cases: one dimensional logistic regression, impact of correlation from informative features, impact of correlation from redundant variables.

preprint2020arXiv

Weak ties strengthen anger contagion in social media

Increasing evidence suggests that, similar to face-to-face communications, human emotions also spread in online social media. However, the mechanisms underlying this emotion contagion, for example, whether different feelings spread in unlikely ways or how the spread of emotions relates to the social network, is rarely investigated. Indeed, because of high costs and spatio-temporal limitations, explorations of this topic are challenging using conventional questionnaires or controlled experiments. Because they are collection points for natural affective responses of massive individuals, online social media sites offer an ideal proxy for tackling this issue from the perspective of computational social science. In this paper, based on the analysis of millions of tweets in Weibo, surprisingly, we find that anger travels easily along weaker ties than joy, meaning that it can infiltrate different communities and break free of local traps because strangers share such content more often. Through a simple diffusion model, we reveal that weaker ties speed up anger by applying both propagation velocity and coverage metrics. To the best of our knowledge, this is the first time that quantitative long-term evidence has been presented that reveals a difference in the mechanism by which joy and anger are disseminated. With the extensive proliferation of weak ties in booming social media, our results imply that the contagion of anger could be profoundly strengthened to globalize its negative impact.

preprint2016arXiv

Blind Quantum Computing with Decoy States

In this paper, we study the Universal Blind Quantum Computing (UBQC) protocol, which allows a client to perform quantum computation on a remote quantum server and the Remote Blind qubit state Preparation (RBSP) protocol which allows the client to prepare single qubits on the server's location with week coherent pulses. This RBSP protocol is inefficient for small $ T $, because the required number of pulses for generating one single qubit scales as $ \mbox{O}(1/T^4) $, where $ T $ is the transmittance of the quantum channe between the client and the server. Our contribution in this paper is a modified RBSP protocol with decoy states. This new protocol allows us to generate $ S $ single qubits simultaneously in a single instance. With our new rotocol, we are able to reduce the number of weak coherent pulses required for generating one single qubit from $ \mbox{O}(1/T^4) $ to nearly $ \mbox{O}(1/T) $, significantly increasing the efficiency. For a 100 km standard telecom fiber of loss 0.2 dB/km, our method gives a six orders of magnitude improvement in the speed, which is very significant.

preprint2016arXiv

Can Online Emotions Predict the Stock Market in China?

Whether the online social media, like Twitter or its variant Weibo, can be a convincing proxy to predict the stock market has been debated for years, especially for China. However, as the traditional theory in behavioral finance states, the individual emotions can influence decision-making of investors, so it is reasonable to further explore this controversial topic from the perspective of online emotions, which is richly carried by massive tweets in social media. Surprisingly, through thorough study on over 10 million stock-relevant tweets from Weibo, both correlation analysis and causality test show that five attributes of the stock market in China can be competently predicted by various online emotions, like disgust, joy, sadness and fear. Specifically, the presented model significantly outperforms the baseline solutions on predicting five attributes of the stock market under the $K$-means discretization. We also employ this model in the scenario of realistic online application and its performance is further testified.

preprint2016arXiv

Designing New Improper Ferroelectrics with a General Strategy

The presence of a switchable spontaneous electric polarization makes ferroelectrics ideal candidates for the use in many applications such as memory and sensors devices. Since known ferroelectrics are rather limited, finding new ferroelectric (FE) materials has become a flourishing field. One promising route is to design the so-called hybrid improper ferroelectricity. However, the previous approach based on the Landau theory is not easily adopted to the systems that are unrelated to the Pbnm perovskite structure. To this end, we develop a general design rule that is applicable to any systems. By combining this rule with density functional theory calculations, we identify previously unrecognized classes of FE materials. It shows that the R-3c perovskite structure can become FE by substituting half of the B-site cations. ZnSrO2 with a non-perovskite layered structure can also be FE through the anion substitution. Moreover, our approach can be used to design new multiferroics as illustrated in the case of fluorine substituted LaMnO3.

preprint2016arXiv

Higher contagion and weaker ties mean anger spreads faster than joy in social media

Increasing evidence suggests that, similar to face-to-face communications, human emotions also spread in online social media. However, the mechanisms underlying this emotional contagion, for example, whether different feelings spread in unlikely ways or how the spread of emotions relates to the social network, is rarely investigated. Indeed, because of high costs and spatio-temporal limitations, explorations of this topic are challenging using conventional questionnaires or controlled experiments. Because they are collection points for natural affective responses of massive individuals, online social media sites offer an ideal proxy for tackling this issue from the perspective of computational social science. In this paper, based on the analysis of millions of tweets in Weibo, surprisingly, we find that anger is more contagious than joy, indicating that it can spark more angry follow-up tweets. Moreover, regarding dissemination in social networks, anger travels easily along weaker ties than joy, meaning that it can infiltrate different communities and break free of local traps because strangers share such content more often. Through a simple diffusion model, we reveal that greater contagion and weaker ties function cooperatively to speed up anger's spread. The diffusion of real-world events with different dominant emotions provides further testimony to the findings. To the best of our knowledge, this is the first time that quantitative long-term evidence has been presented that reveals a difference in the mechanism by which joy and anger are disseminated. Our findings shed light on both personal anger management in human communications and on controlling collective outrage in cyberspace.

preprint2016arXiv

Learning Influence Functions from Incomplete Observations

We study the problem of learning influence functions under incomplete observations of node activations. Incomplete observations are a major concern as most (online and real-world) social networks are not fully observable. We establish both proper and improper PAC learnability of influence functions under randomly missing observations. Proper PAC learnability under the Discrete-Time Linear Threshold (DLT) and Discrete-Time Independent Cascade (DIC) models is established by reducing incomplete observations to complete observations in a modified graph. Our improper PAC learnability result applies for the DLT and DIC models as well as the Continuous-Time Independent Cascade (CIC) model. It is based on a parametrization in terms of reachability features, and also gives rise to an efficient and practical heuristic. Experiments on synthetic and real-world datasets demonstrate the ability of our method to compensate even for a fairly large fraction of missing observations.

preprint2016arXiv

PipeCNN: An OpenCL-Based FPGA Accelerator for Large-Scale Convolution Neuron Networks

Convolutional neural networks (CNNs) have been widely employed in many applications such as image classification, video analysis and speech recognition. Being compute-intensive, CNN computations are mainly accelerated by GPUs with high power dissipations. Recently, studies were carried out exploiting FPGA as CNN accelerator because of its reconfigurability and energy efficiency advantage over GPU, especially when OpenCL-based high-level synthesis tools are now available providing fast verification and implementation flows. Previous OpenCL-based design only focused on creating a generic framework to identify performance-related hardware parameters, without utilizing FPGA's special capability of pipelining kernel functions to minimize memory bandwidth requirement. In this work, we propose an FPGA accelerator with a new architecture of deeply pipelined OpenCL kernels. Data reuse and task mapping techniques are also presented to improve design efficiency. The proposed schemes are verified by implementing two representative large-scale CNNs, AlexNet and VGG on Altera Stratix-V A7 FPGA. We have achieved a similar peak performance of 33.9 GOPS with a 34% resource reduction on DSP blocks compared to previous work. Our design is openly accessible and thus can be reused to explore new architectures for neural network accelerators.

preprint2016arXiv

Sleeping Beauties in Meme Diffusion

A sleeping beauty in diffusion indicates that the information, can be ideas or innovations, will experience a hibernation before a sudden spike of popularity and it is widely found in citation history of scientific publications. However, in this study, we demonstrate that the sleeping beauty is an interesting and unexceptional phenomenon in information diffusion and even more inspiring, there exist two consecutive sleeping beauties in the entire lifetime of propagation, suggesting that the information, including scientific topics, search queries or Wikipedia entries, which we call memes, will go unnoticed for a period and suddenly attracts some attention, and then it falls asleep again and later wakes up with another unexpected popularity peak. Further explorations on this phenomenon show that intervals between two wake ups follow an exponential distribution and the second wake up generally reaches its peak at a higher velocity. In addition, higher volume of the first wake up will lead to even much higher popularity of the second wake up with great odds. Taking these findings into consideration, an upgraded Bass model is presented to well describe the diffusion dynamics of memes on different media. Our results can help understand the common mechanism behind propagation of different memes and are instructive to locate the tipping point in marketing or find innovative publications in science.

preprint2015arXiv

A comparative study of ab initio nonradiative recombination rate calculations under different formalisms

Nonradiative carrier recombination is of both great applied and fundamental importance.But the correct ab initio approaches to calculate it remains to be inconclusive. Here we used 5 different formalisms to calculate the nonradiative carrier recombinations of two complex defect structures GaP:Zn_Ga-O_P and GaN:Zn_Ga-V_N, and compared the results with experiments.In order to apply different multiphonon assisted electron transition formalisms, we have calculated the electron-phonon coupling constants by ab initio density functional theory for all phonon modes. Compared with different methods, the capture coefficients calculated by the static coupling theory are 4.30*10^-8 and 1.46*10^-7 cm^3/s for GaP:Zn_Ga-O_P and GaN:Zn_Ga-V_N, which are in good agreement with the experiment results, 4*10^-8 and 3.0*10^-7 cm^3/s respectively. We also provided arguments for why the static coupling theory should be used to calculate the nonradiative decays of semiconductors.

preprint2015arXiv

A Statistical Parsing Framework for Sentiment Classification

We present a statistical parsing framework for sentence-level sentiment classification in this article. Unlike previous works that employ syntactic parsing results for sentiment analysis, we develop a statistical parser to directly analyze the sentiment structure of a sentence. We show that complicated phenomena in sentiment analysis (e.g., negation, intensification, and contrast) can be handled the same as simple and straightforward sentiment expressions in a unified and probabilistic way. We formulate the sentiment grammar upon Context-Free Grammars (CFGs), and provide a formal description of the sentiment parsing framework. We develop the parsing model to obtain possible sentiment parse trees for a sentence, from which the polarity model is proposed to derive the sentiment strength and polarity, and the ranking model is dedicated to selecting the best sentiment tree. We train the parser directly from examples of sentences annotated only with sentiment polarity labels but without any syntactic annotations or polarity annotations of constituents within sentences. Therefore we can obtain training data easily. In particular, we train a sentiment parser, s.parser, from a large amount of review sentences with users' ratings as rough sentiment polarity labels. Extensive experiments on existing benchmark datasets show significant improvements over baseline sentiment classification approaches.

preprint2015arXiv

Characterizing Information Spreading in Online Social Networks

Online social networks (OSNs) are changing the way in which the information spreads throughout the Internet. A deep understanding of the information spreading in OSNs leads to both social and commercial benefits. In this paper, we characterize the dynamic of information spreading (e.g., how fast and widely the information spreads against time) in OSNs by developing a general and accurate model based on the Interactive Markov Chains (IMCs) and mean-field theory. This model explicitly reveals the impacts of the network topology on information spreading in OSNs. Further, we extend our model to feature the time-varying user behaviors and the ever-changing information popularity. The complicated dynamic patterns of information spreading are captured by our model using six key parameters. Extensive tests based on Renren's dataset validate the accuracy of our model, which demonstrate that it can characterize the dynamic patterns of video sharing in Renren precisely and predict future spreading tendency successfully.

preprint2015arXiv

Unusual Ferroelectricity Induced by the Jahn-Teller Effect: A Case Study on Lacunar Spinel Compounds

The Jahn-Teller effect refers to the symmetry-lowering geometrical distortion in a crystal (or non-linear molecule) due to the presence of a degenerate electronic state. Usually, the Jahn-Teller distortion is not polar. Recently, GaV4S8 with the lacunar spinel structure was found to undergoes a Jahn-Teller distortion from cubic to ferroelectric rhombohedral structure at TJT = 38K. In this work, we carry out a general group theory analysis to show how and when the Jahn-Teller effect gives rise to ferroelectricity. On the basis of this theory, we find that the ferroelectric Jahn-Teller distort in GaV4S8 is due to the non-centrosymmetric nature of the parent phase and strong electron-phonon interaction related to two low-energy T2 phonon modes. Interestingly, GaV4S8 is not only ferroelectric, but also ferromagnetic with the magnetic easy axis along the ferroelectric direction. This suggests that GaV4S8 is a multiferroic in which an external electric field may control its magnetization direction. Our study not only explains the Jahn-Teller physics in GaV4S8, but also paves a new way for searching and designing new ferroelectrics and multiferroics.

preprint2014arXiv

A universal law in human mobility

The intrinsic factor that drives the human movement remains unclear for decades. While our observations from intra-urban and inter-urban trips both demonstrate a universal law in human mobility. Be specific, the probability from one location to another is inversely proportional to the number of population living in locations which are closer than the destination. A simple rank-based model is then presented, which is parameterless but predicts human flows with a convincing fidelity. Besides, comparison with other models shows that our model is more stable and fundamental at different spatial scales by implying the strong correlation between human mobility and social relationship.

preprint2014arXiv

Competition Between Homophily and Information Entropy Maximization in Social Networks

In social networks, it is conventionally thought that two individuals with more overlapped friends tend to establish a new friendship, which could be stated as homophily breeding new connections. While the recent hypothesis of maximum information entropy is presented as the possible origin of effective navigation in small-world networks. We find there exists a competition between information entropy maximization and homophily in local structure through both theoretical and experimental analysis. This competition means that a newly built relationship between two individuals with more common friends would lead to less information entropy gain for them. We conjecture that in the evolution of the social network, both of the two assumptions coexist. The rule of maximum information entropy produces weak ties in the network, while the law of homophily makes the network highly clustered locally and the individuals would obtain strong and trust ties. Our findings shed light on the social network modeling from a new perspective.

preprint2014arXiv

Weighted Shortest Path Models: A Revisit to the Simulation of Internet Routing

Understanding how packets are routed in Internet is significantly important to Internet measurement and modeling. The conventional solution for route simulation is based on the assumption of unweighted shortest path. However, it has been found and widely accepted that a packet in Internet is usually not transmitted along the unweighted shortest path between its source and destination. To better simulate the routing behavior of a packet, we thoroughly explore the real-world Internet routes and present a novel local information based simulation model, with a tuning parameter, which assigns weights to links based on local information and then simulates the Internet route with weighted shortest path. Comparisons with baseline approaches show its capability in well replicating the route length distribution and other structural properties of the Internet topology. Meanwhile, the optimal parameter of this model locates in the range of (0, 2), which implies that a packet inside the Internet inclines to move to nodes with small degrees. This behavior actually reflects the design philosophy of Internet routing, balancing between network efficiency and traffic congestion.

preprint2014arXiv

Who creates trends in online social media: The crowd or opinion leaders?

Trends in online social media always reflect the collective attention of a vast number of individuals across the network. For example, Internet slang words can be ubiquitous because of social memes and online contagions in an extremely short period. From Weibo, a Twitter-like service in China, we find that the adoption of popular Internet slang words experiences two peaks in its temporal evolution, in which the former is relatively much lower than the latter. This interesting phenomenon in fact provides a decent window to disclose essential factors that drive the massive diffusion underlying trends in online social media. Specifically, the in-depth comparison between diffusions represented by different peaks suggests that more attention from the crowd at early stage of the propagation produces large-scale coverage, while the dominant participation of opinion leaders at the early stage just leads to popularity of small scope. Our results quantificationally challenge the conventional hypothesis of influentials. And the implications of these novel findings for marketing practice and influence maximization in social networks are also discussed.

preprint2014arXiv

Word Network Topic Model: A Simple but General Solution for Short and Imbalanced Texts

The short text has been the prevalent format for information of Internet in recent decades, especially with the development of online social media, whose millions of users generate a vast number of short messages everyday. Although sophisticated signals delivered by the short text make it a promising source for topic modeling, its extreme sparsity and imbalance brings unprecedented challenges to conventional topic models like LDA and its variants. Aiming at presenting a simple but general solution for topic modeling in short texts, we present a word co-occurrence network based model named WNTM to tackle the sparsity and imbalance simultaneously. Different from previous approaches, WNTM models the distribution over topics for each word instead of learning topics for each document, which successfully enhance the semantic density of data space without importing too much time or space complexity. Meanwhile, the rich contextual information preserved in the word-word space also guarantees its sensitivity in identifying rare topics with convincing quality. Furthermore, employing the same Gibbs sampling with LDA makes WNTM easily to be extended to various application scenarios. Extensive validations on both short and normal texts testify the outperformance of WNTM as compared to baseline methods. And finally we also demonstrate its potential in precisely discovering newly emerging topics or unexpected events in Weibo at pretty early stages.

preprint2013arXiv

A novel weighting scheme for random $k$-SAT

Consider a random $k$-CNF formula $F_{k}(n, rn)$ with $n$ variables and $rn$ clauses. For every truth assignment $σ\in \{0, 1\}^{n}$ and every clause $c=\ell_{1}\vee\cdots\vee\ell_{k}$, let $d=d(σ, c)$ be the number of satisfied literal occurrences in $c$ under $σ$. For fixed $β>-1$ and $λ>0$, we take $ω(σ, c)=0$, if $d=0$; $ω(σ, c)=λ(1+β)$, if $d=1$ and $ω(σ, c)=λ^{d}$, if $d>1$. Applying the above weighting scheme, we get that if $F_{k}(n, rn)$ is unsatisfiable with probability tending to one as $n\rightarrow\infty$, then $r\geq2.83, 8.09, 18.91, 40.81, 84.87$ for $k=3, 4, 5, 6$ and $7,$ respectively.

preprint2013arXiv

Anger is More Influential Than Joy: Sentiment Correlation in Weibo

Recent years have witnessed the tremendous growth of the online social media. In China, Weibo, a Twitter-like service, has attracted more than 500 million users in less than four years. Connected by online social ties, different users influence each other emotionally. We find the correlation of anger among users is significantly higher than that of joy, which indicates that angry emotion could spread more quickly and broadly in the network. While the correlation of sadness is surprisingly low and highly fluctuated. Moreover, there is a stronger sentiment correlation between a pair of users if they share more interactions. And users with larger number of friends posses more significant sentiment influence to their neighborhoods. Our findings could provide insights for modeling sentiment influence and propagation in online social networks.

preprint2013arXiv

Continuous Double Auction Mechanism and Bidding Strategies in Cloud Computing Markets

Cloud computing has been an emerging model which aims at allowing customers to utilize computing resources hosted by Cloud Service Providers (CSPs). More and more consumers rely on CSPs to supply computing and storage service on the one hand, and CSPs try to attract consumers on favorable terms on the other. In such competitive cloud computing markets, pricing policies are critical to market efficiency. While CSPs often publish their prices and charge users according to the amount of resources they consume, auction mechanism is rarely applied. In fact a feasible auction mechanism is the most effective method for allocation of resources, especially double auction is more efficient and flexible for it enables buyers and sellers to enter bids and offers simultaneously. In this paper we bring up an electronic auction platform for cloud, and a cloud Continuous Double Auction (CDA) mechanism is formulated to match orders and facilitate trading based on the platform. Some evaluating criteria are defined to analyze the efficiency of markets and strategies. Furthermore, the selection of bidding strategies for the auction plays a very important role for each player to maximize its own profit, so we developed a novel bidding strategy for cloud CDA, BH-strategy, which is a two-stage game bidding strategy. At last we designed three simulation scenarios to compare the performance of our strategy with other dominating bidding strategies and proved that BH-strategy has better performance on surpluses, successful transactions and market efficiency. In addition, we discussed that our cloud CDA mechanism is feasible for cloud computing resource allocation.

preprint2012arXiv

Can P2P Technology Benefit Eyeball ISPs? A Cooperative Profit Distribution Answer

Peer-to-Peer (P2P) technology has been regarded as a promising way to help Content Providers (CPs) cost-effectively distribute content. However, under the traditional Internet pricing mechanism, the fact that most P2P traffic flows among peers can dramatically decrease the profit of ISPs, who may take actions against P2P and impede the progress of P2P technology. In this paper, we develop a mathematical framework to analyze such economic issues. Inspired by the idea from cooperative game theory, we propose a cooperative profit-distribution model based on Nash Bargaining Solution (NBS), in which eyeball ISPs and Peer-assisted CPs (PCPs) form two coalitions respectively and then compute a fair Pareto point to determine profit distribution. Moreover, we design a fair and feasible mechanism for profit distribution within each coalition. We show that such a cooperative method not only guarantees the fair profit distribution among network participators, but also helps to improve the economic efficiency of the overall network system. To our knowledge, this is the first work that systematically studies solutions for P2P caused unbalanced profit distribution and gives a feasible cooperative method to increase and fairly share profit.

preprint2012arXiv

Modeling collective human mobility: Understanding exponential law of intra-urban movement

It is very important to understand urban mobility patterns because most trips are concentrated in urban areas. In the paper, a new model is proposed to model collective human mobility in urban areas. The model can be applied to predict individual flows not only in intra-city but also in countries or a larger range. Based on the model, it can be concluded that the exponential law of distance distribution is attributed to decreasing exponentially of average density of human travel demands. Since the distribution of human travel demands only depends on urban planning, population distribution, regional functions and so on, it illustrates that these inherent properties of cities are impetus to drive collective human movements.

preprint2012arXiv

Polarization Dependent Loss and All-Optical Modulation in Graphene on Suspended Membrane Waveguides

We observe a strong polarization dependent optical loss of in-plane light propagation in silicon waveguide due to the presence of graphene. Both transverse-electric (TE) and transverse-magnetic (TM) modes are efficiently (~3 dB) coupled to the graphene on suspended membrane waveguides using an apodized focusing subwavelength grating. The TE mode has 7.7 dB less excess optical loss than the TM mode at 1.5 μm for a 150 μm long waveguide in good agreement with a theoretical model. All-optical modulation of light is demonstrated. There is also a large thermally induced change in waveguide effective index because of optical absorption in graphene.

preprint2012arXiv

Structural Analysis of Network Traffic Matrix via Relaxed Principal Component Pursuit

The network traffic matrix is widely used in network operation and management. It is therefore of crucial importance to analyze the components and the structure of the network traffic matrix, for which several mathematical approaches such as Principal Component Analysis (PCA) were proposed. In this paper, we first argue that PCA performs poorly for analyzing traffic matrix that is polluted by large volume anomalies, and then propose a new decomposition model for the network traffic matrix. According to this model, we carry out the structural analysis by decomposing the network traffic matrix into three sub-matrices, namely, the deterministic traffic, the anomaly traffic and the noise traffic matrix, which is similar to the Robust Principal Component Analysis (RPCA) problem previously studied in [13]. Based on the Relaxed Principal Component Pursuit (Relaxed PCP) method and the Accelerated Proximal Gradient (APG) algorithm, we present an iterative approach for decomposing a traffic matrix, and demonstrate its efficiency and flexibility by experimental results. Finally, we further discuss several features of the deterministic and noise traffic. Our study develops a novel method for the problem of structural analysis of the traffic matrix, which is robust against pollution of large volume anomalies.

preprint2011arXiv

A Large Family of Multi-path Dual Congestion Control Algorithms

The goal of traffic management is efficiently utilizing network resources via adapting of source sending rates and routes selection. Traditionally, this problem is formulated into a utilization maximization problem. The single-path routing scheme fails to react to instantaneous network congestion. Multi-path routing schemes thus have been proposed aiming at improving network efficiency. Unfortunately, the natural optimization problem to consider is concave but not strictly concave. It thus brings a huge challenge to design stable multi-path congestion control algorithms. In this paper, we propose a generalized multi-path utility maximization model to consider the problem of routes selection and flow control, and derive a family of multi-path dual congestion control algorithms. We show that the proposed algorithms are stable in the absence of delays. We also derive decentralized and scalable sufficient conditions for a particular scheme when propagation delays exist in networks. Simulations are implemented using both Matlab and NS2, on which evaluation of the proposed multi-path dual algorithms is exerted. The comparison results, between the proposed algorithms and the other two existing algorithms, show that the proposed multi-path dual algorithms with appropriate parameter settings can achieve a stable aggregated throughput while maintaining fairness among the involved users.

preprint2011arXiv

Economical standard quantum process tomography

Recently, Bendersky \emph{et al.} developed a method to complete the task of characterizing an arbitrary $χ$ matrix element in a scalable way, Phys. Rev. Lett. Vol. \textbf{100}, 190403(2008), where an auxiliary system was needed. In present work, we shall show that the same task can also be completed within the scheme of standard quantum process tomography (SQPT) where there is no requirement for ancilla. Our method depends on two observations: With the elaborately chosen operators basis, the SQPT may have an economical form where a single run of experiment, in which we measure the expectation value of a chosen operator in the outport of the quantum channel with a known input, is sufficient to characterize a selected $χ$ matrix element; With the progress recently achieved in quantum entanglement detection, we also find that the number of the experimental settings to realize the experiment for the selected $χ$ matrix element does not exceed 2N for the N-qubits system. For practice, our scheme can be applied for the cases where the controlled two-body interaction is neither available nor desirable.

preprint2011arXiv

Internet Resource Pricing Models, Mechanisms, and Methods

With the fast development of video and voice network applications, CDN (Content Distribution Networks) and P2P (Peer-to-Peer) content distribution technologies have gradually matured. How to effectively use Internet resources thus has attracted more and more attentions. For the study of resource pricing, a whole pricing strategy containing pricing models, mechanisms and methods covers all the related topics. We first introduce three basic Internet resource pricing models through an Internet cost analysis. Then, with the evolution of service types, we introduce several corresponding mechanisms which can ensure pricing implementation and resource allocation. On network resource pricing methods, we discuss the utility optimization in economics, and emphasize two classes of pricing methods (including system optimization and entities' strategic optimizations). Finally, we conclude the paper and forecast the research direction on pricing strategy which is applicable to novel service situation in the near future.

preprint2011arXiv

Link Prediction in Complex Networks: A Clustering Perspective

Link prediction is an open problem in the complex network, which attracts much research interest currently. However, little attention has been paid to the relation between network structure and the performance of prediction methods. In order to fill this vital gap, we try to understand how the network structure affects the performance of link prediction methods in the view of clustering. Our experiments on both synthetic and real-world networks show that as the clustering grows, the precision of these methods could be improved remarkably, while for the sparse and weakly clustered network, they perform poorly. We explain this through the distinguishment caused by increased clustering between the score distribution of positive and negative instances. Our finding also sheds light on the problem of how to select appropriate approaches for different networks with various densities and clusterings.

preprint2011arXiv

Partial standard quantum process tomography

With the general theorem of SQPT, we shall develop a scheme to detemine an arbitrary matrix element of $chi$, which is expanded with the Choi operators, in a scalbe way.

preprint2011arXiv

Performance of Local Information Based Link Prediction: A Sampling Perspective

Link prediction is pervasively employed to uncover the missing links in the snapshots of real-world networks, which are usually obtained from kinds of sampling methods. Contrarily, in the previous literature, in order to evaluate the performance of the prediction, the known edges in the sampled snapshot are divided into the training set and the probe set randomly, without considering the diverse sampling approaches beyond. However, different sampling methods might lead to different missing links, especially for the biased ones. For this reason, random partition based evaluation of performance is no longer convincing if we take the sampling method into account. Hence, in this paper, aim at filling this void, we try to reevaluate the performance of local information based link predictions through sampling methods governed division of the training set and the probe set. It is interesting that we find for different sampling methods, each prediction approach performs unevenly. Moreover, most of these predictions perform weakly when the sampling method is biased, which indicates that the performance of these methods is overestimated in the prior works.

preprint2011arXiv

Phase Transitions in Knowledge Compilation: an Experimental Study

Phase transitions in many complex combinational problems have been widely studied in the past decade. In this paper, we investigate phase transitions in the knowledge compilation empirically, where DFA, OBDD and d-DNNF are chosen as the target languages to compile random k-SAT instances. We perform intensive experiments to analyze the sizes of compilation results and draw the following conclusions: there exists an easy-hard-easy pattern in compilations; the peak point of sizes in the pattern is only related to the ratio of the number of clauses to that of variables when k is fixed, regardless of target languages; most sizes of compilation results increase exponentially with the number of variables growing, but there also exists a phase transition that separates a polynomial-increment region from the exponential-increment region; Moreover, we explain why the phase transition in compilations occurs by analyzing microstructures of DFAs, and conclude that a kind of solution interchangeability with more than 2 variables has a sharp transition near the peak point of the easy-hard-easy pattern, and thus it has a great impact on sizes of DFAs.

preprint2011arXiv

SafeZone: A Hierarchical Inter-Domain Authenticated Source Address Validation Solution

Next generation Internet is highly concerned about the issue of reliability. Principally, the foundation of reliability is authentication of the source IP address. With the signature-and-verification based defense mechanisms available today, unfortunately, there is a lack of hierarchical architecture, which makes the structure of the trust alliance excessively flat and single. Moreover, with the increasing scale of the trust alliance, costs of validation grow so quickly that they do not adapt to incremental deployment. Via comparison with traditional solutions, this article proposes a hierarchical, inter-domain authenticated source address validation solution named SafeZone. SafeZone employs two intelligent designs, lightweight tag replacement and a hierarchical partitioning scheme, each of which helps to ensure that SafeZone can construct trustworthy and hierarchical trust alliances without the negative influences and complex operations on de facto networks. Extensive experiments also indicate that SafeZone can effectively obtain the design goals of a hierarchical architecture, along with lightweight, loose coupling and "multi-fence support" and as well as an incremental deployment scheme.

preprint2011arXiv

The scaling of human mobility by taxis is exponential

As a significant factor in urban planning, traffic forecasting and prediction of epidemics, modeling patterns of human mobility draws intensive attention from researchers for decades. Power-law distribution and its variations are observed from quite a few real-world human mobility datasets such as the movements of banking notes, trackings of cell phone users' locations and trajectories of vehicles. In this paper, we build models for 20 million trajectories with fine granularity collected from more than 10 thousand taxis in Beijing. In contrast to most models observed in human mobility data, the taxis' traveling displacements in urban areas tend to follow an exponential distribution instead of a power-law. Similarly, the elapsed time can also be well approximated by an exponential distribution. Worth mentioning, analysis of the interevent time indicates the bursty nature of human mobility, similar to many other human activities.

preprint2010arXiv

Being Rational or Aggressive? A Revisit to Dunbar's Number in Online Social Networks

Recent years have witnessed the explosion of online social networks (OSNs). They provide powerful IT-innovations for online social activities such as organizing contacts, publishing contents, and sharing interests between friends who may never meet before. As more and more people become the active users of online social networks, one may ponder questions such as: (1) Do OSNs indeed improve our sociability? (2) To what extent can we expand our offline social spectrum in OSNs? (3) Can we identify some interesting user behaviors in OSNs? Our work in this paper just aims to answer these interesting questions. To this end, we pay a revisit to the well-known Dunbar's number in online social networks. Our main research contributions are as follows. First, to our best knowledge, our work is the first one that systematically validates the existence of the online Dunbar's number in the range of [200,300]. To reach this, we combine using local-structure analysis and user-interaction analysis for extensive real-world OSNs. Second, we divide OSNs users into two categories: rational and aggressive, and find that rational users intend to develop close and reciprocated relationships, whereas aggressive users have no consistent behaviors. Third, we build a simple model to capture the constraints of time and cognition that affect the evolution of online social networks. Finally, we show the potential use of our findings in viral marketing and privacy management in online social networks.

preprint2010arXiv

One More Weight is Enough: Toward the Optimal Traffic Engineering with OSPF

Traffic Engineering (TE) leverages information of network traffic to generate a routing scheme optimizing the traffic distribution so as to advance network performance. However, optimize the link weights for OSPF to the offered traffic is an known NP-hard problem. In this paper, motivated by the fairness concept of congestion control, we firstly propose a new generic objective function, where various interests of providers can be extracted with different parameter settings. And then, we model the optimal TE as the utility maximization of multi-commodity flows with the generic objective function and theoretically show that any given set of optimal routes corresponding to a particular objective function can be converted to shortest paths with respect to a set of positive link weights. This can be directly configured on OSPF-based protocols. On these bases, we employ the Network Entropy Maximization(NEM) framework and develop a new OSPF-based routing protocol, SPEF, to realize a flexible way to split traffic over shortest paths in a distributed fashion. Actually, comparing to OSPF, SPEF only needs one more weight for each link and provably achieves optimal TE. Numerical experiments have been done to compare SPEF with the current version of OSPF, showing the effectiveness of SPEF in terms of link utilization and network load distribution.

preprint2010arXiv

Weak ties: Subtle role of information diffusion in online social networks

As a social media, online social networks play a vital role in the social information diffusion. However, due to its unique complexity, the mechanism of the diffusion in online social networks is different from the ones in other types of networks and remains unclear to us. Meanwhile, few works have been done to reveal the coupled dynamics of both the structure and the diffusion of online social networks. To this end, in this paper, we propose a model to investigate how the structure is coupled with the diffusion in online social networks from the view of weak ties. Through numerical experiments on large-scale online social networks, we find that in contrast to some previous research results, selecting weak ties preferentially to republish cannot make the information diffuse quickly, while random selection can achieve this goal. However, when we remove the weak ties gradually, the coverage of the information will drop sharply even in the case of random selection. We also give a reasonable explanation for this by extra analysis and experiments. Finally, we conclude that weak ties play a subtle role in the information diffusion in online social networks. On one hand, they act as bridges to connect isolated local communities together and break through the local trapping of the information. On the other hand, selecting them as preferential paths to republish cannot help the information spread further in the network. As a result, weak ties might be of use in the control of the virus spread and the private information diffusion in real-world applications.

preprint2009arXiv

Enhancing the robustness of scale-free networks

Error tolerance and attack vulnerability are two common and important properties of complex networks, which are usually used to evaluate the robustness of a network. Recently, much work has been devoted to determining the network design with optimal robustness. However, little attention has been paid to the problem of how to improve the robustness of existing networks. In this paper, we present a new parameter alpha, called enforcing parameter, to guide the process of enhancing the robustness of scale-free networks by gradually adding new links. Intuitively, alpha < 0 means the nodes with lower degrees are selected preferentially while the nodes with higher degrees will be more probably selected when alpha > 0. It is shown both theoretically and experimentally that when alpha < 0 the attack survivability of the network can be enforced apparently. Then we propose new strategies to enhance the network robustness. Through extensive experiments and comparisons, we conclude that establishing new links between nodes with low degrees can drastically enforce the attack survivability of scale-free networks while having little impact on the error tolerance.

preprint2009arXiv

Modeling the IPv6 Internet AS-level Topology

To measure the IPv6 internet AS-level topology, a network topology discovery system, called Dolphin, was developed. By comparing the measurement result of Dolphin with that of CAIDA's Scamper, it was found that the IPv6 Internet at AS level, similar to other complex networks, is also scale-free but the exponent of its degree distribution is 1.2, which is much smaller than that of the IPv4 Internet and most other scale-free networks. In order to explain this feature of IPv6 Internet we argue that the degree exponent is a measure of uniformity of the degree distribution. Then, for the purpose on modeling the networks, we propose a new model based on the two major factors affecting the exponent of the EBA model. It breaks the lower bound of degree exponent which is 2 for most models. To verify the validity of this model, both theoretical and experimental analyses have been carried out. Finally, we demonstrate how this model can be successfully used to reproduce the topology of the IPv6 Internet.

preprint2005arXiv

A Simple Model to Generate Hard Satisfiable Instances

In this paper, we try to further demonstrate that the models of random CSP instances proposed by [Xu and Li, 2000; 2003] are of theoretical and practical interest. Indeed, these models, called RB and RD, present several nice features. First, it is quite easy to generate random instances of any arity since no particular structure has to be integrated, or property enforced, in such instances. Then, the existence of an asymptotic phase transition can be guaranteed while applying a limited restriction on domain size and on constraint tightness. In that case, a threshold point can be precisely located and all instances have the guarantee to be hard at the threshold, i.e., to have an exponential tree-resolution complexity. Next, a formal analysis shows that it is possible to generate forced satisfiable instances whose hardness is similar to unforced satisfiable ones. This analysis is supported by some representative results taken from an intensive experimentation that we have carried out, using complete and incomplete search methods.

Ke Xu

What is connected

Connect this record

See the researcher in context

Building this map preview

66 published item(s)

WaferSAGE: Large Language Model-Powered Wafer Defect Analysis via Synthetic Data Generation and Rubric-Guided Reinforcement Learning

A Simple yet Effective Method for Graph Classification

Abnormally High Thermal Conductivity in Fivefold Twinned Diamond Nanowires

Adaptive Channel Encoding Transformer for Point Cloud Analysis

Asymmetric star formation triggered by gas inflow in a barred lenticular galaxy PGC 34107

Bi-directional Object-context Prioritization Learning for Saliency Ranking

Dual-Neighborhood Deep Fusion Network for Point Cloud Analysis

Exploring Memorization in Adversarial Training

Harmonizer: Learning to Perform White-Box Image and Video Harmonization

Hierarchical information matters: Text classification via tree based graph neural network

Night-time Scene Parsing with a Large Real Dataset

Quantized Adaptive Subgradient Algorithms and Their Applications

Structural Entropy Guided Graph Hierarchical Pooling

The molecular gas resolved by ALMA in the low-metallicity dwarf merging galaxy Haro 11

Tracing the Origin of Adversarial Attack for Forensic Investigation and Deterrence

Personalized Graph Neural Networks with Attention Mechanism for Session-Aware Recommendation

Self-Attention Attribution: Interpreting Information Interactions Inside Transformer

Behavior variations and their implications for popularity promotions: From elites to mass in Weibo

Electronics-Free Pneumatic Logic Circuits for Localized Feedback Control of Multi-Actuator Soft Robots

Harvesting and Refining Question-Answer Pairs for Unsupervised QA

Learning to Compare for Better Training and Evaluation of Open Domain Natural Language Generation Models

Mechanical Metastructures of Triple Periodic Carbon Clathrates

Off-Path TCP Exploits of the Mixed IPID Assignment

Online monitoring for safe pedestrian-vehicle interactions

Oxygen Functionalization-induced Crossover in the Tensile Properties of thinnest 2D Ti2C MXene

Self-Adversarial Learning with Comparative Discrimination for Text Generation

Towards Ground Truth Explainability on Tabular Data

Weak ties strengthen anger contagion in social media

Blind Quantum Computing with Decoy States

Can Online Emotions Predict the Stock Market in China?

Designing New Improper Ferroelectrics with a General Strategy

Higher contagion and weaker ties mean anger spreads faster than joy in social media

Learning Influence Functions from Incomplete Observations

PipeCNN: An OpenCL-Based FPGA Accelerator for Large-Scale Convolution Neuron Networks

Sleeping Beauties in Meme Diffusion

A comparative study of ab initio nonradiative recombination rate calculations under different formalisms

A Statistical Parsing Framework for Sentiment Classification

Characterizing Information Spreading in Online Social Networks

Unusual Ferroelectricity Induced by the Jahn-Teller Effect: A Case Study on Lacunar Spinel Compounds

A universal law in human mobility

Competition Between Homophily and Information Entropy Maximization in Social Networks

Weighted Shortest Path Models: A Revisit to the Simulation of Internet Routing

Who creates trends in online social media: The crowd or opinion leaders?

Word Network Topic Model: A Simple but General Solution for Short and Imbalanced Texts

A novel weighting scheme for random $k$-SAT

Anger is More Influential Than Joy: Sentiment Correlation in Weibo

Continuous Double Auction Mechanism and Bidding Strategies in Cloud Computing Markets

Can P2P Technology Benefit Eyeball ISPs? A Cooperative Profit Distribution Answer

Modeling collective human mobility: Understanding exponential law of intra-urban movement

Polarization Dependent Loss and All-Optical Modulation in Graphene on Suspended Membrane Waveguides

Structural Analysis of Network Traffic Matrix via Relaxed Principal Component Pursuit

A Large Family of Multi-path Dual Congestion Control Algorithms

Economical standard quantum process tomography

Internet Resource Pricing Models, Mechanisms, and Methods

Link Prediction in Complex Networks: A Clustering Perspective

Partial standard quantum process tomography

Performance of Local Information Based Link Prediction: A Sampling Perspective

Phase Transitions in Knowledge Compilation: an Experimental Study

SafeZone: A Hierarchical Inter-Domain Authenticated Source Address Validation Solution

The scaling of human mobility by taxis is exponential

Being Rational or Aggressive? A Revisit to Dunbar's Number in Online Social Networks

One More Weight is Enough: Toward the Optimal Traffic Engineering with OSPF

Weak ties: Subtle role of information diffusion in online social networks

Enhancing the robustness of scale-free networks

Modeling the IPv6 Internet AS-level Topology

A Simple Model to Generate Hard Satisfiable Instances