Source author record

Bo Wu

Bo Wu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

37works

27topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Quasi-invariant theorem on the Gaussian path space

In this article, we will first introduce a class of Gaussian processes, and prove the quasi-invariant theorem with respect to the Gaussian Wiener measure, which is the law of the associated Gaussian process. In particular, it includes the case of the fractional Brownian motion. As applications, we will establish the integration by parts formula and Bismut-Elworthy-Li formula on the Gaussian path space, and by which some logarithmic Sobolev inequalities will be presented. Moreover, we will also provides some applications in the field of financial mathematics.

preprint2023arXiv

Constrained Active Classification Using Partially Observable Markov Decision Processes

In this work, we study the problem of actively classifying the attributes of dynamical systems characterized as a finite set of Markov decision process (MDP) models. We are interested in finding strategies that actively interact with the dynamical system and observe its reactions so that the attribute of interest is classified efficiently with high confidence. We present a decision-theoretic framework based on partially observable Markov decision processes (POMDPs). The proposed framework relies on assigning a classification belief (a probability distribution) to the attributes of interest. Given an initial belief, a confidence level over which a classification decision can be made, a cost bound, safe belief sets, and a finite time horizon, we compute POMDP strategies leading to classification decisions. We present three different algorithms to compute such strategies. The first algorithm computes the optimal strategy exactly by value iteration. To overcome the computational complexity of computing the exact solutions, we propose a second algorithm based on adaptive sampling and a third based on a Monte Carlo tree search to approximate the optimal probability of reaching a classification decision. We illustrate the proposed methodology using examples from medical diagnosis, security surveillance, and wildlife classification.

preprint2023arXiv

Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks

In continuing tasks, average-reward reinforcement learning may be a more appropriate problem formulation than the more common discounted reward formulation. As usual, learning an optimal policy in this setting typically requires a large amount of training experiences. Reward shaping is a common approach for incorporating domain knowledge into reinforcement learning in order to speed up convergence to an optimal policy. However, to the best of our knowledge, the theoretical properties of reward shaping have thus far only been established in the discounted setting. This paper presents the first reward shaping framework for average-reward learning and proves that, under standard assumptions, the optimal policy under the original reward function can be recovered. In order to avoid the need for manual construction of the shaping function, we introduce a method for utilizing domain knowledge expressed as a temporal logic formula. The formula is automatically translated to a shaping function that provides additional reward throughout the learning process. We evaluate the proposed method on three continuing tasks. In all cases, shaping speeds up the average-reward learning rate without any reduction in the performance of the learned policy compared to relevant baselines.

preprint2022arXiv

A Lightweight NMS-free Framework for Real-time Visual Fault Detection System of Freight Trains

Real-time vision-based system of fault detection (RVBS-FD) for freight trains is an essential part of ensuring railway transportation safety. Most existing vision-based methods still have high computational costs based on convolutional neural networks. The computational cost is mainly reflected in the backbone, neck, and post-processing, i.e., non-maximum suppression (NMS). In this paper, we propose a lightweight NMS-free framework to achieve real-time detection and high accuracy simultaneously. First, we use a lightweight backbone for feature extraction and design a fault detection pyramid to process features. This fault detection pyramid includes three novel individual modules using attention mechanism, bottleneck, and dilated convolution for feature enhancement and computation reduction. Instead of using NMS, we calculate different loss functions, including classification and location costs in the detection head, to further reduce computation. Experimental results show that our framework achieves over 83 frames per second speed with a smaller model size and higher accuracy than the state-of-the-art detectors. Meanwhile, the hardware resource requirements of our method are low during the training and testing process.

preprint2022arXiv

Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding

In recent years, large pre-trained Transformer networks have demonstrated dramatic improvements in many natural language understanding tasks. However, the huge size of these models brings significant challenges to their fine-tuning and online deployment due to latency and cost constraints. New hardware supporting both N:M semi-structured sparsity and low-precision integer computation is a promising solution to boost DNN model serving efficiency. However, there have been very few studies that systematically investigate to what extent pre-trained Transformer networks benefit from the combination of these techniques, as well as how to best compress each component of the Transformer. We propose a flexible compression framework NxMiFormer that performs simultaneous sparsification and quantization using ADMM and STE-based QAT. Furthermore, we present and inexpensive, heuristic-driven search algorithm that identifies promising heterogeneous compression configurations that meet a compression ratio constraint. When evaluated across the GLUE suite of NLU benchmarks, our approach can achieve up to 93% compression of the encoders of a BERT model while retaining 98.2% of the original model accuracy and taking full advantage of the hardware's capabilities. Heterogeneous configurations found the by the search heuristic maintain 99.5% of the baseline accuracy while still compressing the model by 87.5%.

preprint2022arXiv

DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs

As Deep Learning (DL) systems are widely deployed for mission-critical applications, debugging such systems becomes essential. Most existing works identify and repair suspicious neurons on the trained Deep Neural Network (DNN), which, unfortunately, might be a detour. Specifically, several existing studies have reported that many unsatisfactory behaviors are actually originated from the faults residing in DL programs. Besides, locating faulty neurons is not actionable for developers, while locating the faulty statements in DL programs can provide developers with more useful information for debugging. Though a few recent studies were proposed to pinpoint the faulty statements in DL programs or the training settings (e.g. too large learning rate), they were mainly designed based on predefined rules, leading to many false alarms or false negatives, especially when the faults are beyond their capabilities. In view of these limitations, in this paper, we proposed DeepFD, a learning-based fault diagnosis and localization framework which maps the fault localization task to a learning problem. In particular, it infers the suspicious fault types via monitoring the runtime features extracted during DNN model training and then locates the diagnosed faults in DL programs. It overcomes the limitations by identifying the root causes of faults in DL programs instead of neurons and diagnosing the faults by a learning approach instead of a set of hard-coded rules. The evaluation exhibits the potential of DeepFD. It correctly diagnoses 52% faulty DL programs, compared with around half (27%) achieved by the best state-of-the-art works. Besides, for fault localization, DeepFD also outperforms the existing works, correctly locating 42% faulty programs, which almost doubles the best result (23%) achieved by the existing works.

preprint2022arXiv

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection

Lidars and cameras are critical sensors that provide complementary information for 3D detection in autonomous driving. While prevalent multi-modal methods simply decorate raw lidar point clouds with camera features and feed them directly to existing 3D detection models, our study shows that fusing camera features with deep lidar features instead of raw points, can lead to better performance. However, as those features are often augmented and aggregated, a key challenge in fusion is how to effectively align the transformed features from two modalities. In this paper, we propose two novel techniques: InverseAug that inverses geometric-related augmentations, e.g., rotation, to enable accurate geometric alignment between lidar points and image pixels, and LearnableAlign that leverages cross-attention to dynamically capture the correlations between image and lidar features during fusion. Based on InverseAug and LearnableAlign, we develop a family of generic multi-modal 3D detection models named DeepFusion, which is more accurate than previous methods. For example, DeepFusion improves PointPillars, CenterPoint, and 3D-MAN baselines on Pedestrian detection for 6.7, 8.9, and 6.2 LEVEL_2 APH, respectively. Notably, our models achieve state-of-the-art performance on Waymo Open Dataset, and show strong model robustness against input corruptions and out-of-distribution data. Code will be publicly available at https://github.com/tensorflow/lingvo/tree/master/lingvo/.

preprint2022arXiv

IGNNITION: Bridging the Gap Between Graph Neural Networks and Networking Systems

Recent years have seen the vast potential of Graph Neural Networks (GNN) in many fields where data is structured as graphs (e.g., chemistry, recommender systems). In particular, GNNs are becoming increasingly popular in the field of networking, as graphs are intrinsically present at many levels (e.g., topology, routing). The main novelty of GNNs is their ability to generalize to other networks unseen during training, which is an essential feature for developing practical Machine Learning (ML) solutions for networking. However, implementing a functional GNN prototype is currently a cumbersome task that requires strong skills in neural network programming. This poses an important barrier to network engineers that often do not have the necessary ML expertise. In this article, we present IGNNITION, a novel open-source framework that enables fast prototyping of GNNs for networking systems. IGNNITION is based on an intuitive high-level abstraction that hides the complexity behind GNNs, while still offering great flexibility to build custom GNN architectures. To showcase the versatility and performance of this framework, we implement two state-of-the-art GNN models applied to different networking use cases. Our results show that the GNN models produced by IGNNITION are equivalent in terms of accuracy and performance to their native implementations in TensorFlow.

preprint2022arXiv

Joint Inference of Reward Machines and Policies for Reinforcement Learning

Incorporating high-level knowledge is an effective way to expedite reinforcement learning (RL), especially for complex tasks with sparse rewards. We investigate an RL problem where the high-level knowledge is in the form of reward machines, i.e., a type of Mealy machine that encodes the reward functions. We focus on a setting in which this knowledge is a priori not available to the learning agent. We develop an iterative algorithm that performs joint inference of reward machines and policies for RL (more specifically, q-learning). In each iteration, the algorithm maintains a hypothesis reward machine and a sample of RL episodes. It derives q-functions from the current hypothesis reward machine, and performs RL to update the q-functions. While performing RL, the algorithm updates the sample by adding RL episodes along which the obtained rewards are inconsistent with the rewards based on the current hypothesis reward machine. In the next iteration, the algorithm infers a new hypothesis reward machine from the updated sample. Based on an equivalence relationship we defined between states of reward machines, we transfer the q-functions between the hypothesis reward machines in consecutive iterations. We prove that the proposed algorithm converges almost surely to an optimal policy in the limit if a minimal reward machine can be inferred and the maximal length of each RL episode is sufficiently long. The experiments show that learning high-level knowledge in the form of reward machines can lead to fast convergence to optimal policies in RL, while standard RL methods such as q-learning and hierarchical RL methods fail to converge to optimal policies after a substantial number of training steps in many tasks.

preprint2022arXiv

Spectral Analysis and its applications for a class of scale-free network based on the weighted m-clique annex operation

The spectrum of network is an important tool to study the function and dynamic properties of network, and graph operation and product is an effective mechanism to construct a specific local and global topological structure. In this study, a class of weighted $m-$clique annex operation $τ_m^r(\cdot)$ controlled by scale factor $m$ and weight factor $r$ is defined, through which an iterative weighted network model $G_t$ with small-world and scale-free properties is constructed. In particular, when the number of iterations $t$ tends to infinity, the network has transfinite fractal property. Then, through the iterative features of the network structure, the iterative relationship of the eigenvalues of the normalized Laplacian matrix corresponding to the network is studied. Accordingly, some applications of the spectrum of the network, including the Kenemy constant, Multiplicative Degree-Kirchhoff index and the number of weighted spanning trees, are further given. In addition, we also study the effect of the two factors controlling network operation on the structure and function of the iterative weighted network $G_t$, so that the network operation can better simulate the real network and have more application potential in the field of artificial network.

preprint2021arXiv

Average trapping time on a type of horizontally segmented 3 dimensional Sierpinski gasket network with two types of locally self-similar structures

As a classic self-similar network model, Sierpinski gasket network has been used many times to study the characteristics of self-similar structure and its influence on the dynamic properties of the network. However, the network models studied in these problems only contain a single self-similar structure, which is inconsistent with the structural characteristics of the actual network models. In this paper, a type of horizontally segmented 3 dimensional Sierpinski gasket network is constructed, whose main feature is that it contains the locally self-similar structures of the 2 dimensional Sierpinski gasket network and the 3 dimensional Sierpinski gasket network at the same time, and the scale transformation between the two kinds of self-similar structures can be controlled by adjusting the crosscutting coefficient. The analytical expression of the average trapping time on the network model is solved, which used to analyze the effect of two types of self-similar structures on the properties of random walks. Finally, we conclude that the dominant self-similar structure will exert a greater influence on the random walk process on the network.

preprint2021arXiv

Efficient Mining of Frequent Subgraphs with Two-Vertex Exploration

Frequent Subgraph Mining (FSM) is the key task in many graph mining and machine learning applications. Numerous systems have been proposed for FSM in the past decade. Although these systems show good performance for small patterns (with no more than four vertices), we found that they have difficulty in mining larger patterns. In this work, we propose a novel two-vertex exploration strategy to accelerate the mining process. Compared with the single-vertex exploration adopted by previous systems, our two-vertex exploration avoids the large memory consumption issue and significantly reduces the memory access overhead. We further enhance the performance through an index-based quick pattern technique that reduces the overhead of isomorphism checks, and a subgraph sampling technique that mitigates the issue of subgraph explosion. The experimental results show that our system achieves significant speedups against the state-of-the-art graph pattern mining systems and supports larger pattern mining tasks that none of the existing systems can handle.

preprint2020arXiv

Analogical Reasoning for Visually Grounded Language Acquisition

Children acquire language subconsciously by observing the surrounding world and listening to descriptions. They can discover the meaning of words even without explicit language knowledge, and generalize to novel compositions effortlessly. In this paper, we bring this ability to AI, by studying the task of Visually grounded Language Acquisition (VLA). We propose a multimodal transformer model augmented with a novel mechanism for analogical reasoning, which approximates novel compositions by learning semantic mapping and reasoning operations from previously seen compositions. Our proposed method, Analogical Reasoning Transformer Networks (ARTNet), is trained on raw multimedia data (video frames and transcripts), and after observing a set of compositions such as "washing apple" or "cutting carrot", it can generalize and recognize new compositions in new video frames, such as "washing carrot" or "cutting apple". To this end, ARTNet refers to relevant instances in the training data and uses their visual features and captions to establish analogies with the query image. Then it chooses the suitable verb and noun to create a new composition that describes the new image best. Extensive experiments on an instructional video dataset demonstrate that the proposed method achieves significantly better generalization capability and recognition accuracy compared to state-of-the-art transformer models.

preprint2020arXiv

Audio-visual Recognition of Overlapped speech for the LRS2 dataset

Automatic recognition of overlapped speech remains a highly challenging task to date. Motivated by the bimodal nature of human speech perception, this paper investigates the use of audio-visual technologies for overlapped speech recognition. Three issues associated with the construction of audio-visual speech recognition (AVSR) systems are addressed. First, the basic architecture designs i.e. end-to-end and hybrid of AVSR systems are investigated. Second, purposefully designed modality fusion gates are used to robustly integrate the audio and visual features. Third, in contrast to a traditional pipelined architecture containing explicit speech separation and recognition components, a streamlined and integrated AVSR system optimized consistently using the lattice-free MMI (LF-MMI) discriminative criterion is also proposed. The proposed LF-MMI time-delay neural network (TDNN) system establishes the state-of-the-art for the LRS2 dataset. Experiments on overlapped speech simulated from the LRS2 dataset suggest the proposed AVSR system outperformed the audio only baseline LF-MMI DNN system by up to 29.98\% absolute in word error rate (WER) reduction, and produced recognition performance comparable to a more complex pipelined system. Consistent performance improvements of 4.89\% absolute in WER reduction over the baseline AVSR system using feature fusion are also obtained.

preprint2020arXiv

Distortionless Multi-Channel Target Speech Enhancement for Overlapped Speech Recognition

Speech enhancement techniques based on deep learning have brought significant improvement on speech quality and intelligibility. Nevertheless, a large gain in speech quality measured by objective metrics, such as perceptual evaluation of speech quality (PESQ), does not necessarily lead to improved speech recognition performance due to speech distortion in the enhancement stage. In this paper, a multi-channel dilated convolutional network based frequency domain modeling is presented to enhance target speaker in the far-field, noisy and multi-talker conditions. We study three approaches towards distortionless waveforms for overlapped speech recognition: estimating complex ideal ratio mask with an infinite range, incorporating the fbank loss in a multi-objective learning and finetuning the enhancement model by an acoustic model. Experimental results proved the effectiveness of all three approaches on reducing speech distortions and improving recognition accuracy. Particularly, the jointly tuned enhancement model works very well with other standalone acoustic model on real test data.

preprint2020arXiv

Dynamic Spatiotemporal Graph Neural Network with Tensor Network

Dynamic spatial graph construction is a challenge in graph neural network (GNN) for time series data problems. Although some adaptive graphs are conceivable, only a 2D graph is embedded in the network to reflect the current spatial relation, regardless of all the previous situations. In this work, we generate a spatial tensor graph (STG) to collect all the dynamic spatial relations, as well as a temporal tensor graph (TTG) to find the latent pattern along time at each node. These two tensor graphs share the same nodes and edges, which leading us to explore their entangled correlations by Projected Entangled Pair States (PEPS) to optimize the two graphs. We experimentally compare the accuracy and time costing with the state-of-the-art GNN based methods on the public traffic datasets.

preprint2020arXiv

End-to-End Multi-Look Keyword Spotting

The performance of keyword spotting (KWS), measured in false alarms and false rejects, degrades significantly under the far field and noisy conditions. In this paper, we propose a multi-look neural network modeling for speech enhancement which simultaneously steers to listen to multiple sampled look directions. The multi-look enhancement is then jointly trained with KWS to form an end-to-end KWS model which integrates the enhanced signals from multiple look directions and leverages an attention mechanism to dynamically tune the model's attention to the reliable sources. We demonstrate, on our large noisy and far-field evaluation sets, that the proposed approach significantly improves the KWS performance against the baseline KWS system and a recent beamformer based multi-beam KWS system.

preprint2020arXiv

Learning the Compositional Visual Coherence for Complementary Recommendations

Complementary recommendations, which aim at providing users product suggestions that are supplementary and compatible with their obtained items, have become a hot topic in both academia and industry in recent years. %However, it is challenging due to its complexity and subjectivity. Existing work mainly focused on modeling the co-purchased relations between two items, but the compositional associations of item collections are largely unexplored. Actually, when a user chooses the complementary items for the purchased products, it is intuitive that she will consider the visual semantic coherence (such as color collocations, texture compatibilities) in addition to global impressions. Towards this end, in this paper, we propose a novel Content Attentive Neural Network (CANN) to model the comprehensive compositional coherence on both global contents and semantic contents. Specifically, we first propose a \textit{Global Coherence Learning} (GCL) module based on multi-heads attention to model the global compositional coherence. Then, we generate the semantic-focal representations from different semantic regions and design a \textit{Focal Coherence Learning} (FCL) module to learn the focal compositional coherence from different semantic-focal representations. Finally, we optimize the CANN in a novel compositional optimization strategy. Extensive experiments on the large-scale real-world data clearly demonstrate the effectiveness of CANN compared with several state-of-the-art methods.

preprint2020arXiv

MemNet: Memory-Efficiency Guided Neural Architecture Search with Augment-Trim learning

Recent studies on automatic neural architectures search have demonstrated significant performance, competitive to or even better than hand-crafted neural architectures. However, most of the existing network architecture tend to use residual, parallel structures and concatenation block between shallow and deep features to construct a large network. This requires large amounts of memory for storing both weights and feature maps. This is challenging for mobile and embedded devices since they may not have enough memory to perform inference with the designed large network model. To close this gap, we propose MemNet, an augment-trim learning-based neural network search framework that optimizes not only performance but also memory requirement. Specifically, it employs memory consumption based ranking score which forces an upper bound on memory consumption for navigating the search process. Experiment results show that, as compared to the state-of-the-art efficient designing methods, MemNet can find an architecture which can achieve competitive accuracy and save an average of 24.17% on the total memory needed.

preprint2020arXiv

Policy Synthesis for Switched Linear Systems with Markov Decision Process Switching

We study the synthesis of mode switching protocols for a class of discrete-time switched linear systems in which the mode jumps are governed by Markov decision processes (MDPs). We call such systems MDP-JLS for brevity. Each state of the MDP corresponds to a mode in the switched system. The probabilistic state transitions in the MDP represent the mode transitions. We focus on finding a policy that selects the switching actions at each mode such that the switched system that follows these actions is guaranteed to be stable. Given a policy in the MDP, the considered MDP-JLS reduces to a Markov jump linear system (MJLS). {We consider both mean-square stability and stability with probability one. For mean-square stability, we leverage existing stability conditions for MJLSs and propose efficient semidefinite programming formulations to find a stabilizing policy in the MDP. For stability with probability one, we derive new sufficient conditions and compute a stabilizing policy using linear programming. We also extend the policy synthesis results to MDP-JLS with uncertain mode transition probabilities.

preprint2020arXiv

Population pharmacokinetics and dosing regimen optimization of tacrolimus in Chinese lung transplant recipients

We aimed to develop a population pharmacokinetic model of tacrolimus in Chinese lung transplant recipients, and propose model based dosing regimens for individualized treatment. We obtained 807 tacrolimus whole blood concentrations from 52 lung transplant patients and genotyped CYP3A5*3. Population pharmacokinetic analysis was performed using nonlinear mixed effects modeling. Monte Carlo simulations were employed to design initial dosing regimens. Tacrolimus pharmacokinetics was described by a one compartment model with first order absorption and elimination process. The mean estimated apparent clearance was 13.1 l/h with 20.1% inter subject variability in CYP3A5*3/*3 70kg patients with 30% hematocrit and voriconazole free therapy, which is lower than that in Caucasian(17.5 to 36.5 l/h). Hematocrit, postoperative days, tacrolimus daily dose, voriconazole cotherapy, and CYP3A5*3 genotype were identified as significant covariates for tacrolimus clearance. To achieve the target trough concentration (10 to 15 ng/ml) on the 8th day after transplantation, CYP3A5*1/*3 patients with voriconazole free cotherapy, a higher initial dosage than the current regimen of 0.04 mg/kg q12h should be recommened. Given the nonlinear kinetics of tacrolimus and large variability, population pharmacokinetic model should be combined with therapeutic drug monitoring to optimize individualized therapy.

preprint2020arXiv

SMP Challenge: An Overview of Social Media Prediction Challenge 2019

"SMP Challenge" aims to discover novel prediction tasks for numerous data on social multimedia and seek excellent research teams. Making predictions via social multimedia data (e.g. photos, videos or news) is not only helps us to make better strategic decisions for the future, but also explores advanced predictive learning and analytic methods on various problems and scenarios, such as multimedia recommendation, advertising system, fashion analysis etc. In the SMP Challenge at ACM Multimedia 2019, we introduce a novel prediction task Temporal Popularity Prediction, which focuses on predicting future interaction or attractiveness (in terms of clicks, views or likes etc.) of new online posts in social media feeds before uploading. We also collected and released a large-scale SMPD benchmark with over 480K posts from 69K users. In this paper, we define the challenge problem, give an overview of the dataset, present statistics of rich information for data and annotation and design the accuracy and correlation evaluation metrics for temporal popularity prediction to the challenge.

preprint2016arXiv

Bearing fault diagnosis based on spectrum images of vibration signals

Bearing fault diagnosis has been a challenge in the monitoring activities of rotating machinery, and it's receiving more and more attention. The conventional fault diagnosis methods usually extract features from the waveforms or spectrums of vibration signals in order to realize fault classification. In this paper, a novel feature in the form of images is presented, namely the spectrum images of vibration signals. The spectrum images are simply obtained by doing fast Fourier transformation. Such images are processed with two-dimensional principal component analysis (2DPCA) to reduce the dimensions, and then a minimum distance method is applied to classify the faults of bearings. The effectiveness of the proposed method is verified with experimental data.

preprint2016arXiv

Bibliometric Index for Academic Leadership

Academic leadership is essential for research innovation and impact. Until now, there has been no dedicated measure of leadership by bibliometrics. Popular bibliometric indices are mainly based on academic output, such as the journal impact factor and the number of citations. Here we develop an academic leadership index based on readily available bibliometric data that is sensitive to not only academic output but also research efficiency. Our leadership index was tested in two studies on peer-reviewed journal papers by extramurally-funded principal investigators in the field of life sciences from China and the USA, respectively. The leadership performance of these principal investigators was quantified and compared relative to university rank and other factors. As a validation measure, we show that the highest average leadership index was achieved by principal investigators at top national universities in both countries. More interestingly, our results also indicate that on an individual basis, strong leadership and high efficiency are not necessarily associated with those at top-tier universities nor with the most funding. This leadership index may become the basis of a comprehensive merit system, facilitating academic evaluation and resource management.

preprint2016arXiv

Characterizing the path-independent property of the Girsanov density for degenerated stochastic differential equations

In this paper, we derive a characterization theorem for the path-independent property of the density of the Girsanov transformation for {\it degenerated} stochastic differential equations (SDEs), extending the characterization theorem of \cite{twwy} for the non-degenerated SDEs. We further extends our consideration to non-Lipschitz SDEs with jumps and with degenerated diffusion coefficients, which generalizes the corresponding characterization theorem established in \cite{hqwu}.

preprint2016arXiv

Combined Top-Down and Bottom-Up Approaches to Performance-guaranteed Integrated Task and Motion Planning of Cooperative Multi-agent Systems

We propose a hierarchical design framework to automatically synthesize coordination schemes and control policies for cooperative multi-agent systems to fulfill formal performance requirements, by associating a bottom-up reactive motion controller with a top-down mission plan. On one hand, starting from a global mission that is specified as a regular language over all the agents' mission capabilities, a mission planning layer sits on the top of the proposed framework, decomposing the global mission into local tasks that are in consistency with each agent's individual capabilities, and compositionally justifying whether the achievement of local tasks implies the satisfaction of the global mission via an assume-guarantee paradigm. On the other hand, bottom-up motion plans associated with each agent are synthesized corresponding to the obtained local missions by composing basic motion primitives, which are verified safe by differential dynamic logic (d$\mathcal{L}$), through a Satisfiability Modulo Theories (SMT) solver that searches feasible solutions in face of constraints imposed by local task requirements and the environment description. It is shown that the proposed framework can handle dynamical environments as the motion primitives possess reactive features, making the motion plans adaptive to local environmental changes. Furthermore, on-line mission reconfiguration can be triggered by the motion planning layer once no feasible solutions can be found through the SMT solver. The effectiveness of the overall design framework is validated by an automated warehouse case study.

preprint2016arXiv

Formal Design of Robot Integrated Task and Motion Planning

Integrated Task and Motion Planning (ITMP) for mobile robots in a dynamic environment with moving obstacles is a challenging research question and attracts more and more attentions recently. Most existing methods either restrict to static environments or lack performance guarantees. This motivates us to investigate the ITMP problem using formal methods and propose a bottom-up compositional design approach called CoSMoP (Composition of Safe Motion Primitives). Our basic idea is to synthesize a global motion plan through composing simple local moves and actions, and to achieve its performance guarantee through modular and incremental verifications. The design consists of two steps. First, basic motion primitives are designed and verified locally. Then, a global motion path is built upon these certified motion primitives by concatenating them together. In particular, we model the motion primitives as hybrid automata and verify their safety through formulating as Differential Dynamic Logic (d$\mathcal{L}$). Furthermore, these proven safe motion primitives are composed based on an encoding to Satisfiability Modulo Theories (SMT) that takes into account the geometric constraints. Since d$\mathcal{L}$ allows compositional verification, the sequential composition of the safe motion primitives also preserves safety properties. Therefore, the CoSMoP generates correct plans for given task specifications that are formally proven safe even for moving obstacles. Illustrative examples are presented to show the effectiveness of the methods.

preprint2016arXiv

Injection method of barrier bucket supported by off-aligned electron cooling for CRing of HIAF

A new accelerator complex, HIAF (the High Intensity Heavy Ion Accelerator Facility), has been approved in China. It is designed to provide intense primary and radioactive ion beams for research in high energy density physics, nuclear physics, atomic physics as well as other applications. In order to achieve a high intensity of up to 5e11 ppp 238U34+, the Compression Ring (CRing) needs to stack more than 5 bunches transferred from the Booster Ring (BRing). However, the normal bucket to bucket injection scheme can only achieve an intensity gain of 2, so an injection method, fixed barrier bucket (BB) supported by electron cooling, is proposed. To suppress the severe space charge effect during the stacking process, off-alignment is adopted in the cooler to control the transverse emittance. In this paper, simulation and optimization with the BETACOOL program are presented.

preprint2015arXiv

Calculation of the coupled betatron motion introduced by cooler's solenoids of CSRm

Several solenoids are usually installed in electron cooler device to guide the motion of the electron beam in the cooler. However, the solenoids also have influence to the ion beam in the cooler storage ring. The transverse motion of the ion beam in storage ring will become coupled, if the solenoids installed in the electron cooler are not compensated perfectly. In this paper, the coupled transverse motion due to the uncompensated cooler's solenoids of CSRm (The main storage ring in the IMP, Lan Zhou, China) is studied, and the coupled beam envelopes are calculated by a new method.

preprint2015arXiv

Characterization of non-constant lower bound of Ricci curvature via entropy inequality on Wasserstein space

When the Ricci curvature of a Riemannian manifold is not lower bounded by a constant, but lower bounded by a continuous function, we give a new characterization of this lower bound through the convexity of relative entropy on the probability space over the Riemannian manifold. Hence, we generalize K.T. Sturm and von Renesse's result (Comm. Pure Appl. Math. 2005) to the case with non-constant lower bound of Ricci curvature.

preprint2015arXiv

Closed orbit correction of HIMM synchrotron

The correction of closed orbit has great influence on the operation of synchrotron. The design of correction system is one significant component of lattice design. It is suggested to set BPMs at the peaks of betatron oscillation. The correctors need to be located at the positions where \b{eta} function is large or the sources of large errors. In the simulation of the closed orbit correction of HIMM synchrotron, one important reason affecting the result of horizontal correction is the longitudinal alignment error of dipole magnet. It is advisable to decrease this kind of alignment error while the deflection angle of dipole magnet is large.

preprint2015arXiv

Remarks on spectral gaps on the Riemannian path space

In this paper, we will give some remarks on links between the spectral gap of the Ornstein-Uhlenbeck operator on the Riemannian path space with lower and upper bounds of the Ricci curvature on the base manifold; this work was motivated by a recent work of A. Naber on the characterization of the bound of the Ricci curvature by analysis of path spaces.

preprint2015arXiv

The electronic origin of shear-induced direct to indirect gap transition and anisotropy diminution in phosphorene

Artificial monolayer black phosphorus, the so-called phosphorene has attracted global interest with its distinguished anisotropic optoelectronic and electronic properties. Here, we unraveled the shear-induced direct to indirect gap transition and anisotropy diminution in phosphorene based on first-principles calculations. Lattice dynamic analysis demonstrated that phosphorene can sustain up to 10% applied shear strain. The band gap of phosphorene experiences a direct to indirect transition when 5% shear strain is applied. The electronic origin of direct to indirect gap transition from 1.54 eV at ambient condition to 1.22 eV at 10% shear strains for phosphorene was explored and the anisotropy diminution in phosphorene is discussed by calculating the maximum sound velocities, effective mass and decomposed charge density, which signals the undesired shear-induced direct to indirect gap transition in the applications of phosphorene for electronics and optoelectronics. On the other hand, the shear-induced electronic anisotropy properties suggest that phosphorene can be applied as the switcher in the nano electronic applications.

preprint2015arXiv

The Side-Effects of the Space Charge Field Introduced by Hollow Electron Beam in the Electron Cooler of CSRm

Electron cooler is used to improve the quality of the beam in synchrotron, however it also introduces nonlinear electromagnetic field, which cause tuneshift, tunespread and may drive resonances leading to beam loss. In this paper the tuneshift and the tunespread caused by nonlinear electromagnetic field of the hollow electron beam was investigated, and the resonance driving terms of the nonlinear electromagnetic field was analysed. The differences were presented comparing with the solid electron beam. The calculations were performed for $^{238}U^{32+}$ ions of energy 1.272MeV stored in CSRm, using the parameters given in table1. The conclusion is that in this situation nonlinear field caused by the hollow electron beam do not lead to serious resonances.

preprint2013arXiv

Functional inequalities on path space over a non-compact Riemannian manifold

We prove the existence of the O-U Dirichlet form and the damped O-U Dirichlet form on path space over a general non-compact Riemannian manifold which is complete and stochastically complete. We show a weighted log-Sobolev inequality for the O-U Dirichlet form and the (standard) log-Sobolev inequality for the damped O-U Dirichlet form. In particular, the Poincaré inequality (and the super Poincaré inequality) can be established for the O-U Dirichlet form on path space over a class of Riemannian manifolds with unbounded Ricci curvatures. Moreover, we construct a large class of quasi-regular local Dirichlet forms with unbounded random diffusion coefficients on the path space over a general non-compact manifold.

preprint2009arXiv

A Concrete Estimate For The Weak Poincare Inequality On Loop Space

The aim of the paper is to study the pinned Wiener measure on the loop space over a simply connected compact Riemannian manifold together with the Hilbert space structure induced by Mallianvin calculus and the induced Ornstein- Uhlenbeck operator. We give a concrete estimate for the weak Poincare inequality for the O-U Dirichlet form on loop space over simply connected compact Riemannian manifold with strict positive Ricci curvature.

preprint2009arXiv

A Poincaré Inequality on Loop Spaces

We investigate properties of measures in infinite dimensional spaces in terms of Poincaré inequalities. A Poincaré inequality states that the $L^2$ variance of an admissible function is controlled by the homogeneous $H^1$ norm. In the case of Loop spaces, it was observed by L. Gross that the homogeneous $H^1$ norm alone may not control the $L^2$ norm and a potential term involving the end value of the Brownian bridge is introduced. Aida, on the other hand, introduced a weight on the Dirichlet form. We show that Aida's modified Logarithmic Sobolev inequality implies weak Logarithmic Sobolev Inequalities and weak Poincaré inequalities with precise estimates on the order of convergence. The order of convergence in the weak Sobolev inequalities are related to weak $L^1$ estimates on the weight function. This and a relation between Logarithmic Sobolev inequalities and weak Poincaré inequalities lead to a Poincaré inequality on the loop space over certain manifolds.

Bo Wu

What is connected

Connect this record

See the researcher in context

Building this map preview

37 published item(s)

Quasi-invariant theorem on the Gaussian path space

Constrained Active Classification Using Partially Observable Markov Decision Processes

Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks

A Lightweight NMS-free Framework for Real-time Visual Fault Detection System of Freight Trains

Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding

DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection

IGNNITION: Bridging the Gap Between Graph Neural Networks and Networking Systems

Joint Inference of Reward Machines and Policies for Reinforcement Learning

Spectral Analysis and its applications for a class of scale-free network based on the weighted m-clique annex operation

Average trapping time on a type of horizontally segmented 3 dimensional Sierpinski gasket network with two types of locally self-similar structures

Efficient Mining of Frequent Subgraphs with Two-Vertex Exploration

Analogical Reasoning for Visually Grounded Language Acquisition

Audio-visual Recognition of Overlapped speech for the LRS2 dataset

Distortionless Multi-Channel Target Speech Enhancement for Overlapped Speech Recognition

Dynamic Spatiotemporal Graph Neural Network with Tensor Network

End-to-End Multi-Look Keyword Spotting

Learning the Compositional Visual Coherence for Complementary Recommendations

MemNet: Memory-Efficiency Guided Neural Architecture Search with Augment-Trim learning

Policy Synthesis for Switched Linear Systems with Markov Decision Process Switching

Population pharmacokinetics and dosing regimen optimization of tacrolimus in Chinese lung transplant recipients

SMP Challenge: An Overview of Social Media Prediction Challenge 2019

Bearing fault diagnosis based on spectrum images of vibration signals

Bibliometric Index for Academic Leadership

Characterizing the path-independent property of the Girsanov density for degenerated stochastic differential equations

Combined Top-Down and Bottom-Up Approaches to Performance-guaranteed Integrated Task and Motion Planning of Cooperative Multi-agent Systems

Formal Design of Robot Integrated Task and Motion Planning

Injection method of barrier bucket supported by off-aligned electron cooling for CRing of HIAF

Calculation of the coupled betatron motion introduced by cooler's solenoids of CSRm

Characterization of non-constant lower bound of Ricci curvature via entropy inequality on Wasserstein space

Closed orbit correction of HIMM synchrotron

Remarks on spectral gaps on the Riemannian path space

The electronic origin of shear-induced direct to indirect gap transition and anisotropy diminution in phosphorene

The Side-Effects of the Space Charge Field Introduced by Hollow Electron Beam in the Electron Cooler of CSRm

Functional inequalities on path space over a non-compact Riemannian manifold

A Concrete Estimate For The Weak Poincare Inequality On Loop Space

A Poincaré Inequality on Loop Spaces