Source author record

Ye Yuan

Ye Yuan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

75works

31topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MINER: Mining Multimodal Internal Representation for Efficient Retrieval

Visual document retrieval has become essential for accessing information in visually rich documents. Existing approaches fall into two camps. Late-interaction retrievers achieve strong quality through fine-grained token-level matching but store hundreds of vectors per page, incurring large index footprints and high serving costs. By contrast, dense single-vector retrievers retain storage and latency advantages but consistently lag in quality because they compress all information into a single final-layer embedding. In this work, we first conduct a layerwise diagnostic on single-vector retrievers, revealing that retrieval-relevant signal resides in internal representations. Motivated by these findings, we propose MINER (Mining Multimodal Internal RepreseNtation for Efficient Retrieval), a lightweight plug-in module that probes and fuses internal signals across transformer layers into a single compact embedding without modifying the backbone or sacrificing single-vector efficiency. The first Retrieval-Aligned Layer Probing stage attaches a lightweight probe at each layer, surfacing which dimensions carry retrieval-relevant information. The subsequent Adaptive Sparse Multi-Layer Fusion stage applies performance-adaptive neuron-level masking to the selected layers and fuses the surviving signals into the final dense vector. Across ViDoRe V1/V2/V3, MINER outperforms existing dense single-vector retrievers on the majority of benchmarks, with up to 4.5% nDCG@5 improvement over its corresponding backbone. Compared to strong late-interaction baselines, in some settings MINER substantially narrows the nDCG@$5$ gap to $0.2$ while preserving the storage and serving advantages of dense retrieval.

preprint2024arXiv

AGG: Amortized Generative 3D Gaussians for Single Image to 3D

Given the growing need for automatic 3D content creation pipelines, various 3D representations have been studied to generate 3D objects from a single image. Due to its superior rendering efficiency, 3D Gaussian splatting-based models have recently excelled in both 3D reconstruction and generation. 3D Gaussian splatting approaches for image to 3D generation are often optimization-based, requiring many computationally expensive score-distillation steps. To overcome these challenges, we introduce an Amortized Generative 3D Gaussian framework (AGG) that instantly produces 3D Gaussians from a single image, eliminating the need for per-instance optimization. Utilizing an intermediate hybrid representation, AGG decomposes the generation of 3D Gaussian locations and other appearance attributes for joint optimization. Moreover, we propose a cascaded pipeline that first generates a coarse representation of the 3D data and later upsamples it with a 3D Gaussian super-resolution module. Our method is evaluated against existing optimization-based 3D Gaussian frameworks and sampling-based pipelines utilizing other 3D representations, where AGG showcases competitive generation abilities both qualitatively and quantitatively while being several orders of magnitude faster. Project page: https://ir1d.github.io/AGG/

preprint2023arXiv

From creep to flow: Granular materials under cyclic shear

Granular materials such as sand, powders, and grains are omnipresent in daily life, industrial applications, and earth-science [1]. When unperturbed, they form stable structures that resemble the ones of other amorphous solids like metallic and colloidal glasses [2]. It is commonly conjectured that all these amorphous materials show a universal mechanical response when sheared slowly, i.e., to have an elastic regime, followed by yielding [3]. Here we use X-ray tomography to determine the microscopic dynamics of a cyclically sheared granular system in three dimensions. Independent of the shear amplitude $Γ$, the sample shows a cross-over from creep to diffusive dynamics, indicating that granular materials have no elastic response and always yield, in stark contrast to other glasses. The overlap function [4] reveals that at large $Γ$ yielding is a simple cross-over phenomenon, while for small $Γ$ it shows features of a first order transition with a critical point at $Γ\approx 0.1$ at which one finds a pronounced slowing down and dynamical heterogeneity. Our findings are directly related to the surface roughness of granular particles which induces a micro-corrugation to the potential energy landscape, thus creating relaxation channels that are absent in simple glasses. These processes must be understood for reaching an understanding of the complex relaxation dynamics of granular systems.

preprint2022arXiv

A deep learning-based remaining useful life prediction approach for bearings

In industrial applications, nearly half the failures of motors are caused by the degradation of rolling element bearings (REBs). Therefore, accurately estimating the remaining useful life (RUL) for REBs are of crucial importance to ensure the reliability and safety of mechanical systems. To tackle this challenge, model-based approaches are often limited by the complexity of mathematical modeling. Conventional data-driven approaches, on the other hand, require massive efforts to extract the degradation features and construct health index. In this paper, a novel online data-driven framework is proposed to exploit the adoption of deep convolutional neural networks (CNN) in predicting the RUL of bearings. More concretely, the raw vibrations of training bearings are first processed using the Hilbert-Huang transform (HHT) and a novel nonlinear degradation indicator is constructed as the label for learning. The CNN is then employed to identify the hidden pattern between the extracted degradation indicator and the vibration of training bearings, which makes it possible to estimate the degradation of the test bearings automatically. Finally, testing bearings' RULs are predicted by using a $ε$-support vector regression model. The superior performance of the proposed RUL estimation framework, compared with the state-of-the-art approaches, is demonstrated through the experimental results. The generality of the proposed CNN model is also validated by transferring to bearings undergoing different operating conditions.

preprint2022arXiv

A General End-to-end Diagnosis Framework for Manufacturing Systems

The manufacturing sector is envisioned to be heavily influenced by artificial intelligence-based technologies with the extraordinary increases in computational power and data volumes. A central challenge in manufacturing sector lies in the requirement of a general framework to ensure satisfied diagnosis and monitoring performances in different manufacturing applications. Here we propose a general data-driven, end-to-end framework for the monitoring of manufacturing systems. This framework, derived from deep learning techniques, evaluates fused sensory measurements to detect and even predict faults and wearing conditions. This work exploits the predictive power of deep learning to automatically extract hidden degradation features from noisy, time-course data. We have experimented the proposed framework on ten representative datasets drawn from a wide variety of manufacturing applications. Results reveal that the framework performs well in examined benchmark applications and can be applied in diverse contexts, indicating its potential use as a critical corner stone in smart manufacturing.

preprint2022arXiv

A Nonlinear PID-Enhanced Adaptive Latent Factor Analysis Model

High-dimensional and incomplete (HDI) data holds tremendous interactive information in various industrial applications. A latent factor (LF) model is remarkably effective in extracting valuable information from HDI data with stochastic gradient decent (SGD) algorithm. However, an SGD-based LFA model suffers from slow convergence since it only considers the current learning error. To address this critical issue, this paper proposes a Nonlinear PID-enhanced Adaptive Latent Factor (NPALF) model with two-fold ideas: 1) rebuilding the learning error via considering the past learning errors following the principle of a nonlinear PID controller; b) implementing all parameters adaptation effectively following the principle of a particle swarm optimization (PSO) algorithm. Experience results on four representative HDI datasets indicate that compared with five state-of-the-art LFA models, the NPALF model achieves better convergence rate and prediction accuracy for missing data of an HDI data.

preprint2022arXiv

A Piecewise Learning Framework for Control of Unknown Nonlinear Systems with Stability Guarantees

We propose a piecewise learning framework for controlling nonlinear systems with unknown dynamics. While model-based reinforcement learning techniques in terms of some basis functions are well known in the literature, when it comes to more complex dynamics, only a local approximation of the model can be obtained using a limited number of bases. The complexity of the identifier and the controller can be considerably high if obtaining an approximation over a larger domain is desired. To overcome this limitation, we propose a general piecewise nonlinear framework where each piece is responsible for locally learning and controlling over some region of the domain. We obtain rigorous uncertainty bounds for the learned piecewise models. The piecewise affine (PWA) model is then studied as a special case, for which we propose an optimization-based verification technique for stability analysis of the closed-loop system. Accordingly, given a time-discretization of the learned {PWA} system, we iteratively search for a common piecewise Lyapunov function in a set of positive definite functions, where a non-monotonic convergence is allowed. This Lyapunov candidate is verified on the uncertain system to either provide a certificate for stability or find a counter-example when it fails. This counter-example is added to a set of samples to facilitate the further learning of a Lyapunov function. We demonstrate the results on two examples and show that the proposed approach yields a less conservative region of attraction (ROA) compared with alternative state-of-the-art approaches. Moreover, we provide the runtime results to demonstrate potentials of the proposed framework in real-world implementations.

preprint2022arXiv

A Sampling Theorem for Exact Identification of Continuous-time Nonlinear Dynamical Systems

Low sampling frequency challenges the exact identification of the continuous-time (CT) dynamical system from sampled data, even when its model is identifiable. The necessary and sufficient condition is proposed -- which is built from Koopman operator -- to the exact identification of the CT system from sampled data. The condition gives a Nyquist-Shannon-like critical frequency for exact identification of CT nonlinear dynamical systems with Koopman invariant subspaces: 1) it establishes a sufficient condition for a sampling frequency that permits a discretized sequence of samples to discover the underlying system and 2) it also establishes a necessary condition for a sampling frequency that leads to system aliasing that the underlying system is indistinguishable; and 3) the original CT signal does not have to be band-limited as required in the Nyquist-Shannon Theorem. The theoretical criterion has been demonstrated on a number of simulated examples, including linear systems, nonlinear systems with equilibria, and limit cycles.

preprint2022arXiv

Adaptive Latent Factor Analysis via Generalized Momentum-Incorporated Particle Swarm Optimization

Stochastic gradient descent (SGD) algorithm is an effective learning strategy to build a latent factor analysis (LFA) model on a high-dimensional and incomplete (HDI) matrix. A particle swarm optimization (PSO) algorithm is commonly adopted to make an SGD-based LFA model's hyper-parameters, i.e, learning rate and regularization coefficient, self-adaptation. However, a standard PSO algorithm may suffer from accuracy loss caused by premature convergence. To address this issue, this paper incorporates more historical information into each particle's evolutionary process for avoiding premature convergence following the principle of a generalized-momentum (GM) method, thereby innovatively achieving a novel GM-incorporated PSO (GM-PSO). With it, a GM-PSO-based LFA (GMPL) model is further achieved to implement efficient self-adaptation of hyper-parameters. The experimental results on three HDI matrices demonstrate that the GMPL model achieves a higher prediction accuracy for missing data estimation in industrial applications.

preprint2022arXiv

Causal Effect Estimation using Variational Information Bottleneck

Causal inference is to estimate the causal effect in a causal relationship when intervention is applied. Precisely, in a causal model with binary interventions, i.e., control and treatment, the causal effect is simply the difference between the factual and counterfactual. The difficulty is that the counterfactual may never been obtained which has to be estimated and so the causal effect could only be an estimate. The key challenge for estimating the counterfactual is to identify confounders which effect both outcomes and treatments. A typical approach is to formulate causal inference as a supervised learning problem and so counterfactual could be predicted. Including linear regression and deep learning models, recent machine learning methods have been adapted to causal inference. In this paper, we propose a method to estimate Causal Effect by using Variational Information Bottleneck (CEVIB). The promising point is that VIB is able to naturally distill confounding variables from the data, which enables estimating causal effect by using observational data. We have compared CEVIB to other methods by applying them to three data sets showing that our approach achieved the best performance. We also experimentally showed the robustness of our method.

preprint2022arXiv

Edge-based Local Push for Personalized PageRank

Personalized PageRank (PPR) is a popular node proximity metric in graph mining and network research. Given a graph G=(V,E) and a source node $s \in V$, a single-source PPR (SSPPR) query asks for the PPR value $\vpi(u)$ with respect to s, which represents the relative importance of node u in the context of the source node s. Among existing algorithms for SSPPR queries, LocalPush is a fundamental method which serves as a cornerstone for subsequent algorithms. In LocalPush, a push operation is a crucial primitive operation, which distributes the probability at a node u to ALL u's neighbors via the corresponding edges. Although this push operation works well on unweighted graphs, unfortunately, it can be rather inefficient on weighted graphs. In particular, on unbalanced weighted graphs where only a few of these edges take the majority of the total weight among them, the push operation would have to distribute insignificant probabilities along those edges which just take the minor weights, resulting in expensive overhead. To resolve this issue, we propose the EdgePush algorithm, a novel method for computing SSPPR queries on weighted graphs. EdgePush decomposes the aforementioned push operations in edge-based push, allowing the algorithm to operate at the edge level granularity. Hence, it can flexibly distribute the probabilities according to edge weights. Furthermore, our EdgePush allows a fine-grained termination threshold for each individual edge, leading to a superior complexity over LocalPush. Notably, we prove that EdgePush improves the theoretical query cost of LocalPush by an order of up to O(n) when the graph's weights are unbalanced, both in terms of $\ell_1$-error and normalized additive error. Our experimental results demonstrate that EdgePush significantly outperforms state-of-the-art baselines in terms of query efficiency on large motif-based and real-world weighted graphs.

preprint2022arXiv

From Universal Humanoid Control to Automatic Physically Valid Character Creation

Automatically designing virtual humans and humanoids holds great potential in aiding the character creation process in games, movies, and robots. In some cases, a character creator may wish to design a humanoid body customized for certain motions such as karate kicks and parkour jumps. In this work, we propose a humanoid design framework to automatically generate physically valid humanoid bodies conditioned on sequence(s) of pre-specified human motions. First, we learn a generalized humanoid controller trained on a large-scale human motion dataset that features diverse human motion and body shapes. Second, we use a design-and-control framework to optimize a humanoid's physical attributes to find body designs that can better imitate the pre-specified human motion sequence(s). Leveraging the pre-trained humanoid controller and physics simulation as guidance, our method is able to discover new humanoid designs that are customized to perform pre-specified human motions.

preprint2022arXiv

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras

We present an approach for 3D global human mesh recovery from monocular videos recorded with dynamic cameras. Our approach is robust to severe and long-term occlusions and tracks human bodies even when they go outside the camera's field of view. To achieve this, we first propose a deep generative motion infiller, which autoregressively infills the body motions of occluded humans based on visible motions. Additionally, in contrast to prior work, our approach reconstructs human meshes in consistent global coordinates even with dynamic cameras. Since the joint reconstruction of human motions and camera poses is underconstrained, we propose a global trajectory predictor that generates global human trajectories based on local body movements. Using the predicted trajectories as anchors, we present a global optimization framework that refines the predicted trajectories and optimizes the camera poses to match the video evidence such as 2D keypoints. Experiments on challenging indoor and in-the-wild datasets with dynamic cameras demonstrate that the proposed approach outperforms prior methods significantly in terms of motion infilling and global mesh recovery.

preprint2022arXiv

Learning Deep Representation with Energy-Based Self-Expressiveness for Subspace Clustering

Deep subspace clustering has attracted increasing attention in recent years. Almost all the existing works are required to load the whole training data into one batch for learning the self-expressive coefficients in the framework of deep learning. Although these methods achieve promising results, such a learning fashion severely prevents from the usage of deeper neural network architectures (e.g., ResNet), leading to the limited representation abilities of the models. In this paper, we propose a new deep subspace clustering framework, motivated by the energy-based models. In contrast to previous approaches taking the weights of a fully connected layer as the self-expressive coefficients, we propose to learn an energy-based network to obtain the self-expressive coefficients by mini-batch training. By this means, it is no longer necessary to load all data into one batch for learning, and it thus becomes a reality that we can utilize deeper neural network models for subspace clustering. Considering the powerful representation ability of the recently popular self-supervised learning, we attempt to leverage self-supervised representation learning to learn the dictionary. Finally, we propose a joint framework to learn both the self-expressive coefficients and dictionary simultaneously, and train the model in an end-to-end manner. The experiments are performed on three publicly available datasets, and extensive experimental results demonstrate our method can significantly outperform the other related approaches. For instance, on the three datasets, our method can averagely achieve $13.8\%$, $15.4\%$, $20.8\%$ improvements in terms of Accuracy, NMI, and ARI over SENet which is proposed very recently and obtains the second best results in the experiments.

preprint2022arXiv

Novel total hip surgery robotic system based on self-localization and optical measurement

This paper presents the development and experimental evaluation of a surgical robotic system for total hip arthroplasty (THA). Although existing robotic systems used in joint replacement surgery have achieved some progresses, the robot arm must be situated accurately at the target position during operation, which depends significantly on the experience of the surgeon. In addition, handheld acetabulum reamers typically exhibit uneven strength and grinding file. Moreover, the lack of techniques to real-time measure femoral neck length may lead to poor outcomes. To tackle these challenges, we propose a real-time traceable optical positioning strategy to reduce unnecessary manual adjustments to the robotic arm during surgery, an end-effector system to stabilise grinding, and an optical probe to provide real-time measurement of the femoral neck length and other parameters used to choose the proper prosthesis. The lengths of the lower limbs are measured as the prosthesis is installed. The experimental evaluation results show that, based on its accuracy, execution ability, and robustness, the proposed surgical robotic system is feasible for THA.

preprint2022arXiv

On Almost Sure Convergence Rates of Stochastic Gradient Methods

The vast majority of convergence rates analysis for stochastic gradient methods in the literature focus on convergence in expectation, whereas trajectory-wise almost sure convergence is clearly important to ensure that any instantiation of the stochastic algorithms would converge with probability one. Here we provide a unified almost sure convergence rates analysis for stochastic gradient descent (SGD), stochastic heavy-ball (SHB), and stochastic Nesterov's accelerated gradient (SNAG) methods. We show, for the first time, that the almost sure convergence rates obtained for these stochastic gradient methods on strongly convex functions, are arbitrarily close to their optimal convergence rates possible. For non-convex objective functions, we not only show that a weighted average of the squared gradient norms converges to zero almost surely, but also the last iterates of the algorithms. We further provide last-iterate almost sure convergence rates analysis for stochastic gradient methods on weakly convex smooth functions, in contrast with most existing results in the literature that only provide convergence in expectation for a weighted average of the iterates.

preprint2022arXiv

Online No-regret Model-Based Meta RL for Personalized Navigation

The interaction between a vehicle navigation system and the driver of the vehicle can be formulated as a model-based reinforcement learning problem, where the navigation systems (agent) must quickly adapt to the characteristics of the driver (environmental dynamics) to provide the best sequence of turn-by-turn driving instructions. Most modern day navigation systems (e.g, Google maps, Waze, Garmin) are not designed to personalize their low-level interactions for individual users across a wide range of driving styles (e.g., vehicle type, reaction time, level of expertise). Towards the development of personalized navigation systems that adapt to a variety of driving styles, we propose an online no-regret model-based RL method that quickly conforms to the dynamics of the current user. As the user interacts with it, the navigation system quickly builds a user-specific model, from which navigation commands are optimized using model predictive control. By personalizing the policy in this way, our method is able to give well-timed driving instructions that match the user's dynamics. Our theoretical analysis shows that our method is a no-regret algorithm and we provide the convergence rate in the agnostic setting. Our empirical analysis with 60+ hours of real-world user data using a driving simulator shows that our method can reduce the number of collisions by more than 60%.

preprint2022arXiv

SearchMorph:Multi-scale Correlation Iterative Network for Deformable Registration

Deformable image registration can obtain dynamic information about images, which is of great significance in medical image analysis. The unsupervised deep learning registration method can quickly achieve high registration accuracy without labels. However, these methods generally suffer from uncorrelated features, poor ability to register large deformations and details, and unnatural deformation fields. To address the issues above, we propose an unsupervised multi-scale correlation iterative registration network (SearchMorph). In the proposed network, we introduce a correlation layer to strengthen the relevance between features and construct a correlation pyramid to provide multi-scale relevance information for the network. We also design a deformation field iterator, which improves the ability of the model to register details and large deformations through the search module and GRU while ensuring that the deformation field is realistic. We use single-temporal brain MR images and multi-temporal echocardiographic sequences to evaluate the model's ability to register large deformations and details. The experimental results demonstrate that the method in this paper achieves the highest registration accuracy and the lowest folding point ratio using a short elapsed time to state-of-the-art.

preprint2022arXiv

Symbolic Expression Transformer: A Computer Vision Approach for Symbolic Regression

Symbolic Regression (SR) is a type of regression analysis to automatically find the mathematical expression that best fits the data. Currently, SR still basically relies on various searching strategies so that a sample-specific model is required to be optimized for every expression, which significantly limits the model's generalization and efficiency. Inspired by the fact that human beings can infer a mathematical expression based on the curve of it, we propose Symbolic Expression Transformer (SET), a sample-agnostic model from the perspective of computer vision for SR. Specifically, the collected data is represented as images and an image caption model is employed for translating images to symbolic expressions. A large-scale dataset without overlap between training and testing sets in the image domain is released. Our results demonstrate the effectiveness of SET and suggest the promising direction of image-based model for solving the challenging SR problem.

preprint2022arXiv

Syntax-Aware Network for Handwritten Mathematical Expression Recognition

Handwritten mathematical expression recognition (HMER) is a challenging task that has many potential applications. Recent methods for HMER have achieved outstanding performance with an encoder-decoder architecture. However, these methods adhere to the paradigm that the prediction is made "from one character to another", which inevitably yields prediction errors due to the complicated structures of mathematical expressions or crabbed handwritings. In this paper, we propose a simple and efficient method for HMER, which is the first to incorporate syntax information into an encoder-decoder network. Specifically, we present a set of grammar rules for converting the LaTeX markup sequence of each expression into a parsing tree; then, we model the markup sequence prediction as a tree traverse process with a deep neural network. In this way, the proposed method can effectively describe the syntax context of expressions, alleviating the structure prediction errors of HMER. Experiments on three benchmark datasets demonstrate that our method achieves better recognition performance than prior arts. To further validate the effectiveness of our method, we create a large-scale dataset consisting of 100k handwritten mathematical expression images acquired from ten thousand writers. The source code, new dataset, and pre-trained models of this work will be publicly available.

preprint2022arXiv

Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design

An agent's functionality is largely determined by its design, i.e., skeletal structure and joint attributes (e.g., length, size, strength). However, finding the optimal agent design for a given function is extremely challenging since the problem is inherently combinatorial and the design space is prohibitively large. Additionally, it can be costly to evaluate each candidate design which requires solving for its optimal controller. To tackle these problems, our key idea is to incorporate the design procedure of an agent into its decision-making process. Specifically, we learn a conditional policy that, in an episode, first applies a sequence of transform actions to modify an agent's skeletal structure and joint attributes, and then applies control actions under the new design. To handle a variable number of joints across designs, we use a graph-based policy where each graph node represents a joint and uses message passing with its neighbors to output joint-specific actions. Using policy gradient methods, our approach enables joint optimization of agent design and control as well as experience sharing across different designs, which improves sample efficiency substantially. Experiments show that our approach, Transform2Act, outperforms prior methods significantly in terms of convergence speed and final performance. Notably, Transform2Act can automatically discover plausible designs similar to giraffes, squids, and spiders. Code and videos are available at https://sites.google.com/view/transform2act.

preprint2022arXiv

Unified Simulation, Perception, and Generation of Human Behavior

Understanding and modeling human behavior is fundamental to almost any computer vision and robotics applications that involve humans. In this thesis, we take a holistic approach to human behavior modeling and tackle its three essential aspects -- simulation, perception, and generation. Throughout the thesis, we show how the three aspects are deeply connected and how utilizing and improving one aspect can greatly benefit the other aspects. We also discuss the lessons learned and our vision for what is next for human behavior modeling.

preprint2021arXiv

A Practical Solution for SAR Despeckling With Adversarial Learning Generated Speckled-to-Speckled Images

In this letter, we aim to address a synthetic aperture radar (SAR) despeckling problem with the necessity of neither clean (speckle-free) SAR images nor independent speckled image pairs from the same scene, and a practical solution for SAR despeckling (PSD) is proposed. First, an adversarial learning framework is designed to generate speckled-to-speckled (S2S) image pairs from the same scene in the situation where only single speckled SAR images are available. Then, the S2S SAR image pairs are employed to train a modified despeckling Nested-UNet model using the Noise2Noise (N2N) strategy. Moreover, an iterative version of the PSD method (PSDi) is also presented. Experiments are conducted on both synthetic speckled and real SAR data to demonstrate the superiority of the proposed methods compared with several state-of-the-art methods. The results show that our methods can reach a good tradeoff between feature preservation and speckle suppression.

preprint2021arXiv

AnchorFace: An Anchor-based Facial Landmark Detector Across Large Poses

Facial landmark localization aims to detect the predefined points of human faces, and the topic has been rapidly improved with the recent development of neural network based methods. However, it remains a challenging task when dealing with faces in unconstrained scenarios, especially with large pose variations. In this paper, we target the problem of facial landmark localization across large poses and address this task based on a split-and-aggregate strategy. To split the search space, we propose a set of anchor templates as references for regression, which well addresses the large variations of face poses. Based on the prediction of each anchor template, we propose to aggregate the results, which can reduce the landmark uncertainty due to the large poses. Overall, our proposed approach, named AnchorFace, obtains state-of-the-art results with extremely efficient inference speed on four challenging benchmarks, i.e. AFLW, 300W, Menpo, and WFLW dataset. Code will be available at https://github.com/nothingelse92/AnchorFace.

preprint2021arXiv

Elastomeric Nematic Colloids, Colloidal Crystals and Microstructures with Complex Topology

Control of physical behaviors of nematic colloids and colloidal crystals has been demonstrated by tuning particle shape, topology, chirality and surface charging. However, the capability of altering physical behaviors of such soft matter systems by changing particle shape and the ensuing responses to external stimuli has remained elusive. We fabricated genus-one nematic elastomeric colloidal ring-shaped particles and various microstructures using two-photon photopolymerization. Nematic ordering within both the nano-printed particle and the surrounding medium leads to anisotropic responses and actuation when heated. With the thermal control, elastomeric microstructures are capable of changing from genus-one to genus-zero surface topology. Using these particles as building blocks, we investigated elastomeric colloidal crystals immersed within a liquid crystal fluid, which exhibit crystallographic symmetry transformations. Our findings may lead to colloidal crystals responsive to a large variety of external stimuli, including electric fields and light. Pre-designed response of elastomeric nematic colloids, including changes of colloidal surface topology and lattice symmetry, are of interest for both fundamental research and applications.

preprint2020arXiv

AutoPose: Searching Multi-Scale Branch Aggregation for Pose Estimation

We present AutoPose, a novel neural architecture search(NAS) framework that is capable of automatically discovering multiple parallel branches of cross-scale connections towards accurate and high-resolution 2D human pose estimation. Recently, high-performance hand-crafted convolutional networks for pose estimation show growing demands on multi-scale fusion and high-resolution representations. However, current NAS works exhibit limited flexibility on scale searching, they dominantly adopt simplified search spaces of single-branch architectures. Such simplification limits the fusion of information at different scales and fails to maintain high-resolution representations. The presentedAutoPose framework is able to search for multi-branch scales and network depth, in addition to the cell-level microstructure. Motivated by the search space, a novel bi-level optimization method is presented, where the network-level architecture is searched via reinforcement learning, and the cell-level search is conducted by the gradient-based method. Within 2.5 GPU days, AutoPose is able to find very competitive architectures on the MS COCO dataset, that are also transferable to the MPII dataset. Our code is available at https://github.com/VITA-Group/AutoPose.

preprint2020arXiv

Colloidal interactions and unusual crystallization versus de-mixing of elastic multipoles formed by gold mesoflowers

Colloidal interactions in nematic liquid crystals can be described as interactions between elastic multipoles that depend on particle shape, topology, chirality, boundary conditions and induced topological defects. Here, we describe a nematic colloidal system consisting of mesostructures of gold capable of inducing elastic multipoles of different order. Elastic monopoles are formed by relatively large asymmetric mesoflower particles, for which gravity and elastic torque balancing yields monopole-type interactions. High-order multipoles are instead formed by smaller mesoflowers with a myriad of shapes corresponding to multipoles of different orders, consistent with our computer simulations based on free energy minimization. We reveal unexpected many-body interactions in this colloidal system, ranging from de-mixing of elastic monopoles to a zoo of unusual colloidal crystals formed by high-order multipoles like hexadecapoles. Our findings show that gold mesoflowers may serve as a designer toolkit for engineering colloidal interaction and self-assembly, potentially exceeding that in atomic and molecular systems.

preprint2020arXiv

Critical behavior of the insulator-to-metal transition in Te-hyperdoped Si

Hyperdoping Si with chalcogens is a topic of great interest due to the strong sub-bandgap absorption exhibited by the resulting material, which can be exploited to develop broadband room-temperature infrared photodetectors using fully Si-compatible technology. Here, we report on the critical behavior of the impurity-driven insulator-to-metal transition in Te-hyperdoped Si layers fabricated via ion implantation followed by nanosecond pulsed-laser melting. Electrical transport measurements reveal an insulator-to-metal transition, which is also confirmed and understood by density functional theory calculations. We demonstrate that the metallic phase is governed by a power law dependence of the conductivity at temperatures below 25 K, whereas the conductivity in the insulating phase is well described by a variable-range hopping mechanism with a Coulomb gap at temperatures in the range of 2-50 K. These results show that the electron wave-function in the vicinity of the transition is strongly affected by the disorder and the electron-electron interaction.

preprint2020arXiv

DLow: Diversifying Latent Flows for Diverse Human Motion Prediction

Deep generative models are often used for human motion prediction as they are able to model multi-modal data distributions and characterize diverse human behavior. While much care has been taken into designing and learning deep generative models, how to efficiently produce diverse samples from a deep generative model after it has been trained is still an under-explored problem. To obtain samples from a pretrained generative model, most existing generative human motion prediction methods draw a set of independent Gaussian latent codes and convert them to motion samples. Clearly, this random sampling strategy is not guaranteed to produce diverse samples for two reasons: (1) The independent sampling cannot force the samples to be diverse; (2) The sampling is based solely on likelihood which may only produce samples that correspond to the major modes of the data distribution. To address these problems, we propose a novel sampling method, Diversifying Latent Flows (DLow), to produce a diverse set of samples from a pretrained deep generative model. Unlike random (independent) sampling, the proposed DLow sampling method samples a single random variable and then maps it with a set of learnable mapping functions to a set of correlated latent codes. The correlated latent codes are then decoded into a set of correlated samples. During training, DLow uses a diversity-promoting prior over samples as an objective to optimize the latent mappings to improve sample diversity. The design of the prior is highly flexible and can be customized to generate diverse motions with common features (e.g., similar leg motion but diverse upper-body motion). Our experiments demonstrate that DLow outperforms state-of-the-art baseline methods in terms of sample diversity and accuracy. Our code is released on the project page: https://www.ye-yuan.com/dlow.

preprint2020arXiv

Efficient Non-Line-of-Sight Imaging from Transient Sinograms

Non-line-of-sight (NLOS) imaging techniques use light that diffusely reflects off of visible surfaces (e.g., walls) to see around corners. One approach involves using pulsed lasers and ultrafast sensors to measure the travel time of multiply scattered light. Unlike existing NLOS techniques that generally require densely raster scanning points across the entirety of a relay wall, we explore a more efficient form of NLOS scanning that reduces both acquisition times and computational requirements. We propose a circular and confocal non-line-of-sight (C2NLOS) scan that involves illuminating and imaging a common point, and scanning this point in a circular path along a wall. We observe that (1) these C2NLOS measurements consist of a superposition of sinusoids, which we refer to as a transient sinogram, (2) there exists computationally efficient reconstruction procedures that transform these sinusoidal measurements into 3D positions of hidden scatterers or NLOS images of hidden objects, and (3) despite operating on an order of magnitude fewer measurements than previous approaches, these C2NLOS scans provide sufficient information about the hidden scene to solve these different NLOS imaging tasks. We show results from both simulated and real C2NLOS scans.

preprint2020arXiv

Elastic colloidal monopoles and reconfigurable self-assembly in liquid crystals

Monopole-like electrostatic interactions are ubiquitous in biology and condensed matter, but they are often screened by counter-ions and cannot be switched from attractive to repulsive. In colloidal science, where the prime goal is to develop colloidal particles that mimic and exceed the diversity and length-scales of atomic and molecular assembly, electrostatically charged particles cannot change the sign of their surface charge or transform from monopoles to higher-order multipoles. In liquid-crystal colloids, elastic interactions between particles arise to minimize the free energy associated with elastic distortions in the long-range alignment of rod-like molecules around the particles. In dipolar, quadrupolar and hexadecapolar nematic colloids, the symmetries of such elastic distortions mimic both electrostatic multipoles and the outmost occupied electron shells of atoms. Electric and magnetic switching, spontaneous transformations and optical control of elastic multipoles, as well as their interactions with topological defects and surface boundary conditions, have been demonstrated in such colloids. However, it has long been understood that elastic monopoles should relax to uniform or higher-order multipole states because of the elastic torques that they induce. Here we develop nematic colloids with strong elastic monopole moments and with elastic torques balanced by optical torques exerted by ambient light. We demonstrate the monopole-to-quadrupole reconfiguration of these colloidal particles by unstructured light, which resembles the driving of atoms between the ground state and various excited states. We show that the sign of the elastic monopoles can be switched, and that like-charged monopoles attract whereas oppositely charged ones repel, unlike in electrostatics. We also demonstrate the out-of-equilibrium dynamic assembly of these colloidal particles.

preprint2020arXiv

End-to-End 3D Multi-Object Tracking and Trajectory Forecasting

3D multi-object tracking (MOT) and trajectory forecasting are two critical components in modern 3D perception systems. We hypothesize that it is beneficial to unify both tasks under one framework to learn a shared feature representation of agent interaction. To evaluate this hypothesis, we propose a unified solution for 3D MOT and trajectory forecasting which also incorporates two additional novel computational units. First, we employ a feature interaction technique by introducing Graph Neural Networks (GNNs) to capture the way in which multiple agents interact with one another. The GNN is able to model complex hierarchical interactions, improve the discriminative feature learning for MOT association, and provide socially-aware context for trajectory forecasting. Second, we use a diversity sampling function to improve the quality and diversity of our forecasted trajectories. The learned sampling function is trained to efficiently extract a variety of outcomes from a generative trajectory distribution and helps avoid the problem of generating many duplicate trajectory samples. We show that our method achieves state-of-the-art performance on the KITTI dataset. Our project website is at http://www.xinshuoweng.com/projects/GNNTrkForecast.

preprint2020arXiv

Exact Single-Source SimRank Computation on Large Graphs

SimRank is a popular measurement for evaluating the node-to-node similarities based on the graph topology. In recent years, single-source and top-$k$ SimRank queries have received increasing attention due to their applications in web mining, social network analysis, and spam detection. However, a fundamental obstacle in studying SimRank has been the lack of ground truths. The only exact algorithm, Power Method, is computationally infeasible on graphs with more than $10^6$ nodes. Consequently, no existing work has evaluated the actual trade-offs between query time and accuracy on large real-world graphs. In this paper, we present ExactSim, the first algorithm that computes the exact single-source and top-$k$ SimRank results on large graphs. With high probability, this algorithm produces ground truths with a rigorous theoretical guarantee. We conduct extensive experiments on real-world datasets to demonstrate the efficiency of ExactSim. The results show that ExactSim provides the ground truth for any single-source SimRank query with a precision up to 7 decimal places within a reasonable query time.

preprint2020arXiv

Generative Hybrid Representations for Activity Forecasting with No-Regret Learning

Automatically reasoning about future human behaviors is a difficult problem but has significant practical applications to assistive systems. Part of this difficulty stems from learning systems' inability to represent all kinds of behaviors. Some behaviors, such as motion, are best described with continuous representations, whereas others, such as picking up a cup, are best described with discrete representations. Furthermore, human behavior is generally not fixed: people can change their habits and routines. This suggests these systems must be able to learn and adapt continuously. In this work, we develop an efficient deep generative model to jointly forecast a person's future discrete actions and continuous motions. On a large-scale egocentric dataset, EPIC-KITCHENS, we observe our method generates high-quality and diverse samples while exhibiting better generalization than related generative models. Finally, we propose a variant to continually learn our model from streaming data, observe its practical effectiveness, and theoretically justify its learning efficiency.

preprint2020arXiv

Keeping Designers in the Loop: Communicating Inherent Algorithmic Trade-offs Across Multiple Objectives

Artificial intelligence algorithms have been used to enhance a wide variety of products and services, including assisting human decision making in high-stakes contexts. However, these algorithms are complex and have trade-offs, notably between prediction accuracy and fairness to population subgroups. This makes it hard for designers to understand algorithms and design products or services in a way that respects users' goals, values, and needs. We proposed a method to help designers and users explore algorithms, visualize their trade-offs, and select algorithms with trade-offs consistent with their goals and needs. We evaluated our method on the problem of predicting criminal defendants' likelihood to re-offend through (i) a large-scale Amazon Mechanical Turk experiment, and (ii) in-depth interviews with domain experts. Our evaluations show that our method can help designers and users of these systems better understand and navigate algorithmic trade-offs. This paper contributes a new way of providing designers the ability to understand and control the outcomes of algorithmic systems they are creating.

preprint2020arXiv

On Deep Unsupervised Active Learning

Unsupervised active learning has attracted increasing attention in recent years, where its goal is to select representative samples in an unsupervised setting for human annotating. Most existing works are based on shallow linear models by assuming that each sample can be well approximated by the span (i.e., the set of all linear combinations) of certain selected samples, and then take these selected samples as representative ones to label. However, in practice, the data do not necessarily conform to linear models, and how to model nonlinearity of data often becomes the key point to success. In this paper, we present a novel Deep neural network framework for Unsupervised Active Learning, called DUAL. DUAL can explicitly learn a nonlinear embedding to map each input into a latent space through an encoder-decoder architecture, and introduce a selection block to select representative samples in the the learnt latent space. In the selection block, DUAL considers to simultaneously preserve the whole input patterns as well as the cluster structure of data. Extensive experiments are performed on six publicly available datasets, and experimental results clearly demonstrate the efficacy of our method, compared with state-of-the-arts.

preprint2020arXiv

Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation

We describe a method for 3D human pose estimation from transient images (i.e., a 3D spatio-temporal histogram of photons) acquired by an optical non-line-of-sight (NLOS) imaging system. Our method can perceive 3D human pose by `looking around corners' through the use of light indirectly reflected by the environment. We bring together a diverse set of technologies from NLOS imaging, human pose estimation and deep reinforcement learning to construct an end-to-end data processing pipeline that converts a raw stream of photon measurements into a full 3D human pose sequence estimate. Our contributions are the design of data representation process which includes (1) a learnable inverse point spread function (PSF) to convert raw transient images into a deep feature vector; (2) a neural humanoid control policy conditioned on the transient image feature and learned from interactions with a physics simulator; and (3) a data synthesis and augmentation strategy based on depth data that can be transferred to a real-world NLOS imaging system. Our preliminary experiments suggest that our method is able to generalize to real-world NLOS measurement to estimate physically-valid 3D human poses.

preprint2020arXiv

Self-assembled nematic colloidal motors powered by light

Biological motors are marvels of nature that inspire creation of their synthetic counterparts with comparable nanoscale dimensions, high efficiency and diverse functions. Molecular motors have been synthesized, but obtaining nanomotors through self-assembly remains challenging. Here we describe a self-assembled colloidal motor with a repetitive light-driven rotation of transparent micro-particles immersed in a liquid crystal and powered by a continuous exposure to unstructured ~1 nW light. A monolayer of azobenzene molecules defines how the liquid crystal's optical axis mechanically couples to the particle's surface, as well as how they jointly rotate as the light's polarization changes. The rotating particle twists the liquid crystal, which changes polarization of traversing light. The resulting feedback mechanism yields a continuous opto-mechanical cycle and drives the unidirectional particle spinning, with handedness and frequency robustly controlled by polarization and intensity of light. Our findings may lead to opto-mechanical devices and colloidal machines compatible with liquid crystal display technology.

preprint2020arXiv

Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training

Many real-world applications have to tackle the Positive-Unlabeled (PU) learning problem, i.e., learning binary classifiers from a large amount of unlabeled data and a few labeled positive examples. While current state-of-the-art methods employ importance reweighting to design various risk estimators, they ignored the learning capability of the model itself, which could have provided reliable supervision. This motivates us to propose a novel Self-PU learning framework, which seamlessly integrates PU learning and self-training. Self-PU highlights three "self"-oriented building blocks: a self-paced training algorithm that adaptively discovers and augments confident positive/negative examples as the training proceeds; a self-calibrated instance-aware loss; and a self-distillation scheme that introduces teacher-students learning as an effective regularization for PU learning. We demonstrate the state-of-the-art performance of Self-PU on common PU learning benchmarks (MNIST and CIFAR-10), which compare favorably against the latest competitors. Moreover, we study a real-world application of PU learning, i.e., classifying brain images of Alzheimer's Disease. Self-PU obtains significantly improved results on the renowned Alzheimer's Disease Neuroimaging Initiative (ADNI) database over existing methods. The code is publicly available at: https://github.com/TAMU-VITA/Self-PU.

preprint2020arXiv

Semi-Supervised Cervical Dysplasia Classification With Learnable Graph Convolutional Network

Cervical cancer is the second most prevalent cancer affecting women today. As the early detection of cervical carcinoma relies heavily upon screening and pre-clinical testing, digital cervicography has great potential as a primary or auxiliary screening tool, especially in low-resource regions due to its low cost and easy access. Although an automated cervical dysplasia detection system has been desirable, traditional fully-supervised training of such systems requires large amounts of annotated data which are often labor-intensive to collect. To alleviate the need for much manual annotation, we propose a novel graph convolutional network (GCN) based semi-supervised classification model that can be trained with fewer annotations. In existing GCNs, graphs are constructed with fixed features and can not be updated during the learning process. This limits their ability to exploit new features learned during graph convolution. In this paper, we propose a novel and more flexible GCN model with a feature encoder that adaptively updates the adjacency matrix during learning and demonstrate that this model design leads to improved performance. Our experimental results on a cervical dysplasia classification dataset show that the proposed framework outperforms previous methods under a semi-supervised setting, especially when the labeled samples are scarce.

preprint2020arXiv

SiamFC++: Towards Robust and Accurate Visual Tracking with Target Estimation Guidelines

Visual tracking problem demands to efficiently perform robust classification and accurate target state estimation over a given target at the same time. Former methods have proposed various ways of target state estimation, yet few of them took the particularity of the visual tracking problem itself into consideration. After a careful analysis, we propose a set of practical guidelines of target state estimation for high-performance generic object tracker design. Following these guidelines, we design our Fully Convolutional Siamese tracker++ (SiamFC++) by introducing both classification and target state estimation branch(G1), classification score without ambiguity(G2), tracking without prior knowledge(G3), and estimation quality score(G4). Extensive analysis and ablation studies demonstrate the effectiveness of our proposed guidelines. Without bells and whistles, our SiamFC++ tracker achieves state-of-the-art performance on five challenging benchmarks(OTB2015, VOT2018, LaSOT, GOT-10k, TrackingNet), which proves both the tracking and generalization ability of the tracker. Particularly, on the large-scale TrackingNet dataset, SiamFC++ achieves a previously unseen AUC score of 75.4 while running at over 90 FPS, which is far above the real-time requirement. Code and models are available at: https://github.com/MegviiDetection/video_analyst .

preprint2020arXiv

State-Aware Tracker for Real-Time Video Object Segmentation

In this work, we address the task of semi-supervised video object segmentation(VOS) and explore how to make efficient use of video property to tackle the challenge of semi-supervision. We propose a novel pipeline called State-Aware Tracker(SAT), which can produce accurate segmentation results with real-time speed. For higher efficiency, SAT takes advantage of the inter-frame consistency and deals with each target object as a tracklet. For more stable and robust performance over video sequences, SAT gets awareness for each state and makes self-adaptation via two feedback loops. One loop assists SAT in generating more stable tracklets. The other loop helps to construct a more robust and holistic target representation. SAT achieves a promising result of 72.3% J&F mean with 39 FPS on DAVIS2017-Val dataset, which shows a decent trade-off between efficiency and accuracy. Code will be released at github.com/MegviiDetection/video_analyst.

preprint2020arXiv

UG$^{2+}$ Track 2: A Collective Benchmark Effort for Evaluating and Advancing Image Understanding in Poor Visibility Environments

The UG$^{2+}$ challenge in IEEE CVPR 2019 aims to evoke a comprehensive discussion and exploration about how low-level vision techniques can benefit the high-level automatic visual recognition in various scenarios. In its second track, we focus on object or face detection in poor visibility enhancements caused by bad weathers (haze, rain) and low light conditions. While existing enhancement methods are empirically expected to help the high-level end task, that is observed to not always be the case in practice. To provide a more thorough examination and fair comparison, we introduce three benchmark sets collected in real-world hazy, rainy, and low-light conditions, respectively, with annotate objects/faces annotated. To our best knowledge, this is the first and currently largest effort of its kind. Baseline results by cascading existing enhancement and detection models are reported, indicating the highly challenging nature of our new data as well as the large room for further technical innovations. We expect a large participation from the broad research community to address these challenges together.

preprint2020arXiv

Uncertainty Quantification for Deep Context-Aware Mobile Activity Recognition and Unknown Context Discovery

Activity recognition in wearable computing faces two key challenges: i) activity characteristics may be context-dependent and change under different contexts or situations; ii) unknown contexts and activities may occur from time to time, requiring flexibility and adaptability of the algorithm. We develop a context-aware mixture of deep models termed the α-\b{eta} network coupled with uncertainty quantification (UQ) based upon maximum entropy to enhance human activity recognition performance. We improve accuracy and F score by 10% by identifying high-level contexts in a data-driven way to guide model development. In order to ensure training stability, we have used a clustering-based pre-training in both public and in-house datasets, demonstrating improved accuracy through unknown context discovery.

preprint2019arXiv

A Novel GAN-based Fault Diagnosis Approach for Imbalanced Industrial Time Series

This paper proposes a novel fault diagnosis approach based on generative adversarial networks (GAN) for imbalanced industrial time series where normal samples are much larger than failure cases. We combine a well-designed feature extractor with GAN to help train the whole network. Aimed at obtaining data distribution and hidden pattern in both original distinguishing features and latent space, the encoder-decoder-encoder three-sub-network is employed in GAN, based on Deep Convolution Generative Adversarial Networks (DCGAN) but without Tanh activation layer and only trained on normal samples. In order to verify the validity and feasibility of our approach, we test it on rolling bearing data from Case Western Reserve University and further verify it on data collected from our laboratory. The results show that our proposed approach can achieve excellent performance in detecting faulty by outputting much larger evaluation scores.

preprint2019arXiv

Wasserstein Distance based Deep Adversarial Transfer Learning for Intelligent Fault Diagnosis

The demand of artificial intelligent adoption for condition-based maintenance strategy is astonishingly increased over the past few years. Intelligent fault diagnosis is one critical topic of maintenance solution for mechanical systems. Deep learning models, such as convolutional neural networks (CNNs), have been successfully applied to fault diagnosis tasks for mechanical systems and achieved promising results. However, for diverse working conditions in the industry, deep learning suffers two difficulties: one is that the well-defined (source domain) and new (target domain) datasets are with different feature distributions; another one is the fact that insufficient or no labelled data in target domain significantly reduce the accuracy of fault diagnosis. As a novel idea, deep transfer learning (DTL) is created to perform learning in the target domain by leveraging information from the relevant source domain. Inspired by Wasserstein distance of optimal transport, in this paper, we propose a novel DTL approach to intelligent fault diagnosis, namely Wasserstein Distance based Deep Transfer Learning (WD-DTL), to learn domain feature representations (generated by a CNN based feature extractor) and to minimize the distributions between the source and target domains through adversarial training. The effectiveness of the proposed WD-DTL is verified through 3 transfer scenarios and 16 transfer fault diagnosis experiments of both unsupervised and supervised (with insufficient labelled data) learning. We also provide a comprehensive analysis of the network visualization of those transfer tasks.

preprint2017arXiv

On Identification of Distribution Grids

Large-scale integration of distributed energy resources into residential distribution feeders necessitates careful control of their operation through power flow analysis. While the knowledge of the distribution system model is crucial for this type of analysis, it is often unavailable or outdated. The recent introduction of synchrophasor technology in low-voltage distribution grids has created an unprecedented opportunity to learn this model from high-precision, time-synchronized measurements of voltage and current phasors at various locations. This paper focuses on joint estimation of model parameters (admittance values) and operational structure of a poly-phase distribution network from the available telemetry data via the lasso, a method for regression shrinkage and selection. We propose tractable convex programs capable of tackling the low rank structure of the distribution system and develop an online algorithm for early detection and localization of critical events that induce a change in the admittance matrix. The efficacy of these techniques is corroborated through power flow studies on four three-phase radial distribution systems serving real household demands.

preprint2016arXiv

Edge pinning and transformation of defect lines induced by faceted colloidal rings in nematic liquid crystals

Nematic colloids exhibit a large diversity of topological defects and structures induced by colloidal particles in the orientationally ordered liquid crystal host fluids. These defects and field configurations define elastic interactions and medium-mediated self-assembly, as well as serve as model systems in exploiting the richness of interactions between topologies and geometries of colloidal surfaces, nematic fields, and topological singularities induced by particles in the nematic bulk and at nematic-colloidal interfaces. Here we demonstrate formation of quarter-strength surface-pinned disclinations, as well as a large variety of director field configurations with splitting and reconnections of singular defect lines, prompted by colloidal particles with sharp edges and size large enough to define strong boundary conditions. Using examples of faceted ring-shaped particles of genus g = 1, we explore transformation of defect lines as they migrate between locations in the bulk of the nematic host to edge-pinned locations at the surfaces of particles and vice versa, showing that this behavior is compliant with topological constraints defined by mathematical theorems. We discuss how transformation of bulk and surface defect lines induced by faceted colloids can enrich the diversity of elasticity-mediated colloidal interactions and how these findings may impinge on prospects of their controlled reconfigurable self-assembly in nematic hosts.

preprint2016arXiv

Event Detection and Localization in Distribution Grids with Phasor Measurement Units

The recent introduction of synchrophasor technology into power distribution systems has given impetus to various monitoring, diagnostic, and control applications, such as system identification and event detection, which are crucial for restoring service, preventing outages, and managing equipment health. Drawing on the existing framework for inferring topology and admittances of a power network from voltage and current phasor measurements, this paper proposes an online algorithm for event detection and localization in unbalanced three-phase distribution systems. Using a convex relaxation and a matrix partitioning technique, the proposed algorithm is capable of identifying topology changes and attributing them to specific categories of events. The performance of this algorithm is evaluated on a standard test distribution feeder with synthesized loads, and it is shown that a tripped line can be detected and localized in an accurate and timely fashion, highlighting its potential for real-world applications.

preprint2016arXiv

Intrinsic diamagnetism in the Weyl semimetal TaAs

We investigate the magnetic properties of TaAs, a prototype Weyl semimetal. TaAs crystals show weak diamagnetism with magnetic susceptibility of about -7 * 10^{-7} emu/(g*Oe) at 5 K. A general feature is the appearance of a minimum at around 185 K in magnetization measurements as a function of temperature. No phase transition is observed in the temperature range between 5 K and 400 K. The magnetic properties indicate that the intrinsic Fermi level in TaAs is not located at the Weyl nodes, in agreement with the theory prediction.

preprint2016arXiv

Precise tuning of the Curie temperature of (Ga,Mn)As-based magnetic semiconductors by hole compensation: Support for valence-band ferromagnetism

For the prototype diluted ferromagnetic semiconductor (Ga,Mn)As, there is a fundamental concern about the electronic states near the Fermi level, i.e., whether the Fermi level resides in a well-separated impurity band derived from Mn doping (impurity-band model) or in the valence band that is already merged with the Mn-derived impurity band (valence-band model). We investigate this question by carefully shifting the Fermi level by means of carrier compensation. We use helium-ion implantation, a standard industry technology, to precisely compensate the hole doping of GaAs-based diluted ferromagnetic semiconductors while keeping the Mn concentration constant. We monitor the change of Curie temperature ($T_C$) and conductivity. For a broad range of samples including (Ga,Mn)As and (Ga,Mn)(As,P) with various Mn and P concentrations, we observe a smooth decrease of $T_C$ with carrier compensation over a wide temperature range while the conduction is changed from metallic to insulating. The existence of $T_C$ below 10\,K is also confirmed in heavily compensated samples. Our experimental results are naturally explained within the valence-band picture.

preprint2016arXiv

Properties of massive star-forming clumps with infall motions

In this work, we aim to characterise high-mass clumps with infall motions. We selected 327 clumps from the Millimetre Astronomy Legacy Team 90-GHz (MALT90) survey, and identified 100 infall candidates. Combined with the results of He et al. (2015), we obtained a sample of 732 high-mass clumps, including 231 massive infall candidates and 501 clumps where infall is not detected. Objects in our sample were classified as pre-stellar, proto-stellar, HII or photo-dissociation region (PDR). The detection rates of the infall candidates in the pre-stellar, proto-stellar, HII and PDR stages are 41.2%, 36.6%, 30.6% and 12.7%, respectively. The infall candidates have a higher H$_{2}$ column density and volume density compared with the clumps where infall is not detected at every stage. For the infall candidates, the median values of the infall rates at the pre-stellar, proto-stellar, HII and PDR stages are 2.6$\times$10$^{-3}$, 7.0$\times$10$^{-3}$, 6.5$\times$10$^{-3}$ and 5.5$\times$10$^{-3}$ M$_\odot$ yr$^{-1}$, respectively. These values indicate that infall candidates at later evolutionary stages are still accumulating material efficiently. It is interesting to find that both infall candidates and clumps where infall is not detected show a clear trend of increasing mass from the pre-stellar to proto-stellar, and to the HII stages. The power indices of the clump mass function (ClMF) are 2.04$\pm$0.16 and 2.17$\pm$0.31 for the infall candidates and clumps where infall is not detected, respectively, which agree well with the power index of the stellar initial mass function (2.35) and the cold Planck cores (2.0).

preprint2016arXiv

Radiation Studies for the Target Station of the MOMENT

The discovery of the neutrino mixing angle $θ_{13}$ opens new opportunities for the discovery of the leptonic CP violation for high intensity neutrino beams. MOMENT a future neutrino facility with a high-power proton beam of 15 MW from a continuous-wave linac is focused on that discovery. The high power of the proton beam causes extreme radiation conditions for the facility and especially for the target station where the pion capture system of five superconducting solenoids is located. In this paper initial studies are performed for the effects of the radiation on the solenoid structure and the area surrounding it. A concept cooling system is also proposed.

preprint2016arXiv

Review Networks for Caption Generation

We propose a novel extension of the encoder-decoder framework, called a review network. The review network is generic and can enhance any existing encoder- decoder model: in this paper, we consider RNN decoders with both CNN and RNN encoders. The review network performs a number of review steps with attention mechanism on the encoder hidden states, and outputs a thought vector after each review step; the thought vectors are used as the input of the attention mechanism in the decoder. We show that conventional encoder-decoders are a special case of our framework. Empirically, we show that our framework improves over state-of- the-art encoder-decoder systems on the tasks of image captioning and source code captioning.

preprint2016arXiv

Suppressing the cellular breakdown in silicon supersaturated with titanium

Hyper doping Si with up to 6 at.% Ti in solid solution was performed by ion implantation followed by pulsed laser annealing and flash lamp annealing. In both cases, the implanted Si layer can be well recrystallized by liquid phase epitaxy and solid phase epitaxy, respectively. Cross-sectional transmission electron microscopy of Ti-implanted Si after liquid phase epitaxy shows the so-called growth interface breakdown or cellular breakdown owing to the occurrence of constitutional supercooling in the melt. The appearance of cellular breakdown prevents further recrystallization. However, the out-diffusion and cellular breakdown can be effectively suppressed by solid phase epitaxy during flash lamp annealing due to the high velocity of amorphous-crystalline interface and the low diffusion velocity for Ti in the solid phase.

preprint2016arXiv

Topological nanocolloids with facile electric switching of plasmonic properties

Combining topology and plasmonics paradigms in nanocolloidal systems may enable new means of pre-engineering desired composite material properties. Here we design and realize orientationally ordered assemblies of noble metal nanoparticles with genus-one topology and unusual long-range ordering mediated by their interactions with the surrounding nematic fluid host. Facile electric switching of these composites is reminiscent to that of pristine liquid crystals (LCs), but provides a means of reconfiguring the nanoparticle assembly and thus also the ensuing composite medium's optical properties. Our findings may lead to formation of new molecular-colloidal soft matter phases with unusual optical properties as well as optical metamaterials.

preprint2016arXiv

Track segment finding with CGEM-IT and matching to tracks in ODC

The relative differences in coordinates of Cylindrical-Gas-Electron-Multiplier-Detector-based Inner Tracker (CGEM-IT) clusters are studied to search for track segments in CGEM-IT. With the full simulation of single muon track samples, clear patterns are found and parameterized for the correct cluster combinations. The cluster combinations satisfying the patterns are selected as track segment candidates in CGEM-IT with an efficiency higher than 99%. The parameters of the track segments are obtained by a helix fitting. Some chi-squared quantities, evaluating the differences in track parameters between the track segments in CGEM-IT and the tracks found in Outer-Drift-Chamber (ODC), are calculated and used to match them. Proper chi-squared requirements are determined as a function of transverse momentum and the matching efficiency is found reasonable.

preprint2015arXiv

A Sparse Bayesian Approach to the Identification of Nonlinear State-Space Systems

This technical note considers the identification of nonlinear discrete-time systems with additive process noise but without measurement noise. In particular, we propose a method and its associated algorithm to identify the system nonlinear functional forms and their associated parameters from a limited number of time-series data points. For this, we cast this identification problem as a sparse linear regression problem and take a Bayesian viewpoint to solve it. As such, this approach typically leads to nonconvex optimisations. We propose a convexification procedure relying on an efficient iterative re-weighted $\ell_1$-minimisation algorithm that uses general sparsity inducing priors on the parameters of the system and marginal likelihood maximisation. Using this approach, we also show how convex constraints on the parameters can be easily added to our proposed iterative re-weighted $\ell_1$-minimisation algorithm. In the supplementary material \cite{appendix}, we illustrate the effectiveness of the proposed identification method on two classical systems in biology and physics, namely, a genetic repressilator network and a large scale network of interconnected Kuramoto oscillators.

preprint2015arXiv

Identifying Biochemical Reaction Networks From Heterogeneous Datasets

In this paper, we propose a new method to identify biochemical reaction networks (i.e. both reactions and kinetic parameters) from heterogeneous datasets. Such datasets can contain (a) data from several replicates of an experiment performed on a biological system; (b) data measured from a biochemical network subjected to different experimental conditions, for example, changes/perturbations in biological inductions, temperature, gene knock-out, gene over-expression, etc. Simultaneous integration of various datasets to perform system identification has the potential to avoid non-identifiability issues typically arising when only single datasets are used.

preprint2015arXiv

Infall Motions in Massive Star-Forming Regions: Results from Years 1 & 2 of the MALT90 Survey

Massive star-forming regions with observed infall motions are good sites for studying the birth of massive stars. In this paper, 405 compact sources have been extracted from the APEX Telescope Large Area Survey of the Galaxy (ATLASGAL) compact sources that also have been observed in the Millimetre Astronomy Legacy Team 90 GHz (MALT90) survey during Years 1 and 2. These observations are complemented with Spitzer GLIMPSE/MIPSGAL mid-IR survey data to help classify the elected star-forming clumps into three evolutionary stages: pre-stellar, proto-stellar and UCHII regions. The results suggest that 0.05 g cm$^{-2}$ is a reliable empirical lower bound for the clump surface densities required for massive-star formation to occur. The optically thick HCO$^{+}$(1-0) and HNC(1-0) lines, as well as the optically thin N$_{2}$H$^{+}$(1-0) line were used to search for infall motions toward these sources. By analyzing the asymmetries of the optically thick HCO$^{+}$(1-0) and HNC(1-0) lines and the mapping observations of HCO$^{+}$(1-0), a total of 131 reliable infall candidates have been identified. The HCO$^{+}$(1-0) line shows the highest occurrence of obvious asymmetric features, suggesting that it may be a better infall motion tracer than other lines such as HNC(1-0). The detection rates of infall candidates toward pre-stellar, proto-stellar and UCHII clumps are 0.3452, 0.3861 and 0.2152, respectively. The relatively high detection rate of infall candidates toward UCHII clumps indicates that many UCHII regions are still accreting matter. The peak column densities and masses of the infall candidates, in general, display a increasing trend with progressing evolutionary stages. However, the rough estimates of the mass infall rate show no obvious variation with evolutionary stage.

preprint2015arXiv

Study of cluster reconstruction and track fitting algorithms for CGEM-IT at BESIII

Considering the aging effects of existing Inner Drift Chamber (IDC) of BES\uppercase\expandafter{\romannumeral3}, a GEM based inner tracker is proposed to be designed and constructed as an upgrade candidate for IDC. This paper introduces a full simulation package of CGEM-IT with a simplified digitization model, describes the development of the softwares for cluster reconstruction and track fitting algorithm based on Kalman filter method for CGEM-IT. Preliminary results from the reconstruction algorithms are obtained using a Monte Carlo sample of single muon events in CGEM-IT.

preprint2015arXiv

Study of the Tracking Method and Expected Performance of the Silicon Pixel Inner Tracker Applied in BESIII

The inner drift chamber of the BESIII is encountering serious aging problem after five year's running. For the first layer, the decrease in gas gain is about 26% from 2009 to 2013. The upgrade of the inner tracking detector has become an urgent problem for the BESIII experiment. An inner tracker using CMOS pixel sensors is an important candidate because of its great advantages on spatial resolution and radiation hardness. In order to carry out a Monte Carlo study on the expected performance, a Geant4-based full simulation for the silicon pixel detector has been implemented. The tracking method combining the silicon pixel inner tracker and outer drift chamber has been studied and a preliminary reconstruction software was developed. The Monte Carlo study shows that the performances including momentum resolution, vertex resolution and the tracking efficiency are significantly improved due to the good spatial resolution and moderate material budget of the silicon pixel detector.

preprint2015arXiv

Study of tracking efficiency and its systematic uncertainty from $J/ψ\to p \overline{p} π^+ π^-$ at BESIII

Based on $J/ψ$ events collected with the BESIII detector, with corresponding Monte Carlo samples, the tracking efficiency and its systematic uncertainty are studied using a control sample of $J/ψ\to p \overline p π^+ π^-$. Validation methods and different factors influencing the tracking efficiency are presented in detail. The tracking efficiency and its systematic uncertainty for protons and pions with the transverse momentum and polar angle dependence are also discussed.

preprint2014arXiv

H2CO and H110α observations towards NH3 sources

We observed the H2CO(110-111) absorption lines and H110α radio recombination lines (RRL) toward 180 NH3 sources using the Nanshan 25-m radio telescope. In our observation, 138 sources were found to have H2CO lines and 36 have H110α RRLs. Among the 138 detected H2CO sources, 38 sources were first detected. The detection rates of H2CO have a better correlation with extinction than with background continuum radiation. Line center velocities of H2CO and NH3 agree well. The line width ratios of H2CO and NH3 are generally larger than 1 and are similar to that of 13CO. The correlation between column densities of H2CO and extinction is better than that between NH3 and extinction. These line width relation and column density relation indicate H2CO is distributed on a larger scale than that of NH3, being similar to the regions of 13CO. The abundance ratios between NH3 and H2CO were found to be different in local clouds and other clouds.

preprint2014arXiv

MOMENT: a muon-decay medium-baseline neutrino beam facility

Neutrino beam with about 300 MeV in energy, high-flux and medium baseline is considered a rational choice for measuring CP violation before the more powerful Neutrino Factory will be built. Following this concept, a unique neutrino beam facility based on muon-decayed neutrinos is proposed. The facility adopts a continuous-wave proton linac of 1.5 GeV and 10 mA as the proton driver, which can deliver an extremely high beam power of 15 MW. Instead of pion-decayed neutrinos, unprecedentedly intense muon-decayed neutrinos are used for better background discrimination. The schematic design for the facility is presented here, including the proton driver, the assembly of a mercury-jet target and capture superconducting solenoids, a pion/muon beam transport line, a long muon decay channel of about 600 m and the detector concept. The physics prospects and the technical challenges are also discussed.

preprint2014arXiv

Network Reconstruction from Intrinsic Noise

This paper considers the problem of inferring an unknown network of dynamical systems driven by unknown, intrinsic, noise inputs. Equivalently we seek to identify direct causal dependencies among manifest variables only from observations of these variables. For linear, time-invariant systems of minimal order, we characterise under what conditions this problem is well posed. We first show that if the transfer matrix from the inputs to manifest states is minimum phase, this problem has a unique solution irrespective of the network topology. This is equivalent to there being only one valid spectral factor (up to a choice of signs of the inputs) of the output spectral density. If the assumption of phase-minimality is relaxed, we show that the problem is characterised by a single Algebraic Riccati Equation (ARE), of dimension determined by the number of latent states. The number of solutions to this ARE is an upper bound on the number of solutions for the network. We give necessary and sufficient conditions for any two dynamical networks to have equal output spectral density, which can be used to construct all equivalent networks. Extensive simulations quantify the number of solutions for a range of problem sizes. For a slightly simpler case, we also provide an algorithm to construct all equivalent networks from the output spectral density.

preprint2014arXiv

On minimal realisations of dynamical structure functions

Motivated by the fact that transfer functions do not contain structural information about networks, dynamical structure functions were introduced to capture causal relationships between measured nodes in networks. From the dynamical structure functions, a) we show that the actual number of hidden states can be larger than the number of hidden states estimated from the corresponding transfer function; b) we can obtain partial information about the true state-space equation, which cannot in general be obtained from the transfer function. Based on these properties, this paper proposes algorithms to find minimal realisations for a given dynamical structure function. This helps to estimate the minimal number of hidden states, to better understand the complexity of the network, and to identify potential targets for new measurements.

preprint2014arXiv

Study of the calibration of X-T relation for the BESIII drift chamber

This paper introduces the calibration of the time-to-distance relation for the BESIII drift chamber. The parameterization of the time-to-distance relation is presented. The studies of left-right asymmetry and the variation with the entrance angle are performed. The impact of dead channels on the time-to-distance relation is given special attention in order to reduce the shifts of the measured momenta for the tracks passing near dead cells. Finally we present the spatial resolution (123 μm) for barrel Bhabha events (|cosθ|<0.8) from J/ψ data taken in 2012.

preprint2013arXiv

A detailed study of the high-mass clump interacting with the bubble N10

We performed a detailed study of the high-mass clump interacting with bubble N10 based on the spectral lines $^{12}CO(3-2)$, $HCO^+(4-3)$, $N_2H^+(4-3)$ and $CH_3OH(7(0,7)-6(0,6))$ and continuum emission data at 450 $μ$m and 850 $μ$m released on CADC and Spitzer data. Blue-shifted optically thick line $^{12}CO (3-2)$ seems to indicate that the outer envelope of the high-mass clump is still falling toward the center. Detection of $CH_3OH(7(0,7)-6(0,6))$ suggests that a hot core has formed around YSO N10-7. And position-velocity diagram of $N_2H^+ (4-3)$ indicates the cold dense core of the clump has not been destroyed by the star formation activities. The mass of N10-7 is about 27.44 $M_\odot$. The ratio $HCO^+(4-3)/N_2H^+ (4-3)$ in the outer part of the clump is larger than that in the inner part of it. The reason may be that the CO abundance relative to $N_2H^+ (4-3)$ increased in the outer part of the high-mass clump, more $N_2H^+ (4-3)$ were converted into $HCO^+(4-3)$.

preprint2013arXiv

An extended segment pattern dictionary for pattern matching tracking algorithm at BESIII

A pattern matching based tracking algorithm, named MdcPatRec, is used for the reconstruction of charged tracks in the drift chamber of the BESIII detector. This paper addresses the shortage of segment finding in MdcPatRec algorithm. An extended segment construction scheme and the corresponding pattern dictionary are presented. Evaluation with Monte-Carlo and experimental data show that the new method can achieve higher efficiency for low transverse momentum tracks.

preprint2013arXiv

Distributed privacy-preserving network size computation: A system-identification based method

In this study, we propose an algorithm for computing the network size of communicating agents. The algorithm is distributed: a) it does not require a leader selection; b) it only requires local exchange of information, and; c) its design can be implemented using local information only, without any global information about the network. It is privacy-preserving, namely it does not require to propagate identifying labels. This algorithm is based on system identification, and more precisely on the identification of the order of a suitably-constructed discrete-time linear time-invariant system over some finite field. We provide a probabilistic guarantee for any randomly picked node to correctly compute the number of nodes in the network. Moreover, numerical implementation has been taken into account to make the algorithm applicable to networks of hundreds of nodes, and therefore make the algorithm applicable in real-world sensor or robotic networks. We finally illustrate our results in simulation and conclude the paper with discussions on how our technique differs from a previously-known strategy based on statistical inference.

preprint2012arXiv

Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases

Many studies have been conducted on seeking the efficient solution for subgraph similarity search over certain (deterministic) graphs due to its wide application in many fields, including bioinformatics, social network analysis, and Resource Description Framework (RDF) data management. All these works assume that the underlying data are certain. However, in reality, graphs are often noisy and uncertain due to various factors, such as errors in data extraction, inconsistencies in data integration, and privacy preserving purposes. Therefore, in this paper, we study subgraph similarity search on large probabilistic graph databases. Different from previous works assuming that edges in an uncertain graph are independent of each other, we study the uncertain graphs where edges' occurrences are correlated. We formally prove that subgraph similarity search over probabilistic graphs is #P-complete, thus, we employ a filter-and-verify framework to speed up the search. In the filtering phase,we develop tight lower and upper bounds of subgraph similarity probability based on a probabilistic matrix index, PMI. PMI is composed of discriminative subgraph features associated with tight lower and upper bounds of subgraph isomorphism probability. Based on PMI, we can sort out a large number of probabilistic graphs and maximize the pruning capability. During the verification phase, we develop an efficient sampling algorithm to validate the remaining candidates. The efficiency of our proposed solutions has been verified through extensive experiments.

preprint2012arXiv

Minimal realization of the dynamical structure function and its application to network reconstruction

Network reconstruction, i.e., obtaining network structure from data, is a central theme in systems biology, economics and engineering. In some previous work, we introduced dynamical structure functions as a tool for posing and solving the problem of network reconstruction between measured states. While recovering the network structure between hidden states is not possible since they are not measured, in many situations it is important to estimate the minimal number of hidden states in order to understand the complexity of the network under investigation and help identify potential targets for measurements. Estimating the minimal number of hidden states is also crucial to obtain the simplest state-space model that captures the network structure and is coherent with the measured data. This paper characterizes minimal order state-space realizations that are consistent with a given dynamical structure function by exploring properties of dynamical structure functions and developing an algorithm to explicitly obtain such a minimal realization.

preprint2012arXiv

Reconstruction of Arbitrary Biochemical Reaction Networks: A Compressive Sensing Approach

Reconstruction of biochemical reaction networks is a central topic in systems biology which raises crucial theoretical challenges in system identification. Nonlinear Ordinary Differential Equations (ODEs) that involve polynomial and rational functions are typically used to model biochemical reaction networks. Such nonlinear models make the problem of determining the connectivity of biochemical networks from time-series experimental data quite difficult. In this paper, we present a network reconstruction algorithm that can deal with model descriptions under the form of polynomial and rational functions. Rather than identifying the parameters of linear or nonlinear ODEs characterised by pre-defined equation structures, our methodology allows us to determine the nonlinear ODEs structure together with their associated reaction constants. To solve the network reconstruction problem, we cast it as a Compressive Sensing (CS) problem and use Bayesian Sparse Learning (BSL) algorithms as an efficient way to obtain its solution.

preprint2012arXiv

Robust Network Reconstruction in Polynomial Time

This paper presents an efficient algorithm for robust network reconstruction of Linear Time-Invariant (LTI) systems in the presence of noise, estimation errors and unmodelled nonlinearities. The method here builds on previous work on robust reconstruction to provide a practical implementation with polynomial computational complexity. Following the same experimental protocol, the algorithm obtains a set of structurally-related candidate solutions spanning every level of sparsity. We prove the existence of a magnitude bound on the noise, which if satisfied, guarantees that one of these structures is the correct solution. A problem-specific model-selection procedure then selects a single solution from this set and provides a measure of confidence in that solution. Extensive simulations quantify the expected performance for different levels of noise and show that significantly more noise can be tolerated in comparison to the original method.

Ye Yuan

What is connected

Connect this record

See the researcher in context

Building this map preview

75 published item(s)

MINER: Mining Multimodal Internal Representation for Efficient Retrieval

AGG: Amortized Generative 3D Gaussians for Single Image to 3D

From creep to flow: Granular materials under cyclic shear

A deep learning-based remaining useful life prediction approach for bearings

A General End-to-end Diagnosis Framework for Manufacturing Systems

A Nonlinear PID-Enhanced Adaptive Latent Factor Analysis Model

A Piecewise Learning Framework for Control of Unknown Nonlinear Systems with Stability Guarantees

A Sampling Theorem for Exact Identification of Continuous-time Nonlinear Dynamical Systems

Adaptive Latent Factor Analysis via Generalized Momentum-Incorporated Particle Swarm Optimization

Causal Effect Estimation using Variational Information Bottleneck

Edge-based Local Push for Personalized PageRank

From Universal Humanoid Control to Automatic Physically Valid Character Creation

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras

Learning Deep Representation with Energy-Based Self-Expressiveness for Subspace Clustering

Novel total hip surgery robotic system based on self-localization and optical measurement

On Almost Sure Convergence Rates of Stochastic Gradient Methods

Online No-regret Model-Based Meta RL for Personalized Navigation

SearchMorph:Multi-scale Correlation Iterative Network for Deformable Registration

Symbolic Expression Transformer: A Computer Vision Approach for Symbolic Regression

Syntax-Aware Network for Handwritten Mathematical Expression Recognition

Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design

Unified Simulation, Perception, and Generation of Human Behavior

A Practical Solution for SAR Despeckling With Adversarial Learning Generated Speckled-to-Speckled Images

AnchorFace: An Anchor-based Facial Landmark Detector Across Large Poses

Elastomeric Nematic Colloids, Colloidal Crystals and Microstructures with Complex Topology

AutoPose: Searching Multi-Scale Branch Aggregation for Pose Estimation

Colloidal interactions and unusual crystallization versus de-mixing of elastic multipoles formed by gold mesoflowers

Critical behavior of the insulator-to-metal transition in Te-hyperdoped Si

DLow: Diversifying Latent Flows for Diverse Human Motion Prediction

Efficient Non-Line-of-Sight Imaging from Transient Sinograms

Elastic colloidal monopoles and reconfigurable self-assembly in liquid crystals

End-to-End 3D Multi-Object Tracking and Trajectory Forecasting

Exact Single-Source SimRank Computation on Large Graphs

Generative Hybrid Representations for Activity Forecasting with No-Regret Learning

Keeping Designers in the Loop: Communicating Inherent Algorithmic Trade-offs Across Multiple Objectives

On Deep Unsupervised Active Learning

Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation

Self-assembled nematic colloidal motors powered by light

Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training

Semi-Supervised Cervical Dysplasia Classification With Learnable Graph Convolutional Network

SiamFC++: Towards Robust and Accurate Visual Tracking with Target Estimation Guidelines

State-Aware Tracker for Real-Time Video Object Segmentation

UG$^{2+}$ Track 2: A Collective Benchmark Effort for Evaluating and Advancing Image Understanding in Poor Visibility Environments

Uncertainty Quantification for Deep Context-Aware Mobile Activity Recognition and Unknown Context Discovery

A Novel GAN-based Fault Diagnosis Approach for Imbalanced Industrial Time Series

Wasserstein Distance based Deep Adversarial Transfer Learning for Intelligent Fault Diagnosis

On Identification of Distribution Grids

Edge pinning and transformation of defect lines induced by faceted colloidal rings in nematic liquid crystals

Event Detection and Localization in Distribution Grids with Phasor Measurement Units

Intrinsic diamagnetism in the Weyl semimetal TaAs

Precise tuning of the Curie temperature of (Ga,Mn)As-based magnetic semiconductors by hole compensation: Support for valence-band ferromagnetism

Properties of massive star-forming clumps with infall motions

Radiation Studies for the Target Station of the MOMENT

Review Networks for Caption Generation

Suppressing the cellular breakdown in silicon supersaturated with titanium

Topological nanocolloids with facile electric switching of plasmonic properties

Track segment finding with CGEM-IT and matching to tracks in ODC

A Sparse Bayesian Approach to the Identification of Nonlinear State-Space Systems

Identifying Biochemical Reaction Networks From Heterogeneous Datasets

Infall Motions in Massive Star-Forming Regions: Results from Years 1 & 2 of the MALT90 Survey

Study of cluster reconstruction and track fitting algorithms for CGEM-IT at BESIII

Study of the Tracking Method and Expected Performance of the Silicon Pixel Inner Tracker Applied in BESIII

Study of tracking efficiency and its systematic uncertainty from $J/ψ\to p \overline{p} π^+ π^-$ at BESIII

H2CO and H110α observations towards NH3 sources

MOMENT: a muon-decay medium-baseline neutrino beam facility

Network Reconstruction from Intrinsic Noise

On minimal realisations of dynamical structure functions

Study of the calibration of X-T relation for the BESIII drift chamber

A detailed study of the high-mass clump interacting with the bubble N10

An extended segment pattern dictionary for pattern matching tracking algorithm at BESIII

Distributed privacy-preserving network size computation: A system-identification based method

Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases

Minimal realization of the dynamical structure function and its application to network reconstruction

Reconstruction of Arbitrary Biochemical Reaction Networks: A Compressive Sensing Approach