Source author record

Yu Xiang

Yu Xiang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Robotics quant-ph Machine Learning Information Theory eess.SP math.IT Artificial Intelligence Cryptography and Security math-ph math.MP Methodology Computational Engineering, Finance, and Science Databases eess.IV Graphics math.CA math.OC physics.flu-dyn

Catalog footprint

What is connected

34works

19topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Decentralized Conformal Novelty Detection via Quantized Model Exchange

This work studies decentralized novelty detection with global false discovery rate (FDR) control across heterogeneous composite null distributions, without sharing the raw data due to privacy and bandwidth considerations. We propose a framework based on the exchange of quantized surrogate models, allowing independent agents to share low-precision representations of locally learned non-conformity score functions. We prove that evaluating data against these quantized composite scores preserves conditional exchangeability, providing rigorous finite-sample guarantees for global FDR control. Empirical studies on synthetic datasets confirm our theoretical results, demonstrating that the proposed approach maintains competitive statistical power while drastically reducing the communication cost.

preprint2026arXiv

Non-commutativity as a Universal Characterization for Enhanced Quantum Metrology

A central challenge in quantum metrology is to effectively harness quantum resources to surpass classical precision bounds. Although recent studies suggest that the indefinite causal order may enable sensitivities to attain the super-Heisenberg scaling, the physical origins of such enhancements remain elusive. Here, we introduce the nilpotency index $\mathcal{K}$, which quantifies the depth of non-commutativity between operators during the encoding process, can act as a fundamental parameter governing quantum-enhanced sensing. We show that a finite $\mathcal{K}$ yields an enhanced scaling of root-mean-square error as $N^{-(1+\mathcal{K})}$. Meanwhile, the requirement for indefinite causal order arises only when the nested commutators become constant. Remarkably, in the limit $\mathcal{K} \to \infty$, exponential precision scaling $N^{-1}e^{-N}$ is achievable. We propose experimentally feasible protocols implementing these mechanisms, providing a systematic pathway towards practical quantum-enhanced metrology.

preprint2024arXiv

Filtering one-way Einstein-Podolsky-Rosen steering

Einstein-Podolsky-Rosen (EPR) steering, a fundamental concept of quantum nonlocality, describes one observer's capability to remotely affect another distant observer's state by local measurements. Unlike quantum entanglement and Bell nonlocality, both associated with the symmetric quantum correlation, EPR steering depicts the unique asymmetric property of quantum nonlocality. With the local filter operation in which some system components are discarded, quantum nonlocality can be distilled to enhance the nonlocal correlation, and even the hidden nonlocality can be activated. However, asymmetric quantum nonlocality in the filter operation still lacks a well-rounded investigation, especially considering the discarded parts where quantum nonlocal correlations may still exist with probabilities. Here, in both theory and experiment, we investigate the effect of reusing the discarded particles from local filter. We observe all configurations of EPR steering simultaneously and other intriguing evolution of asymmetric quantum nonlocality, such as reversing the direction of one-way EPR steering. This work provides a perspective to answer "What is the essential role of utilizing quantum steering as a resource?", and demonstrates a practical toolbox for manipulating asymmetric quantum systems with significant potential applications in quantum information tasks.

preprint2022arXiv

A Manifold-based Airfoil Geometric-feature Extraction and Discrepant Data Fusion Learning Method

Geometrical shape of airfoils, together with the corresponding flight conditions, are crucial factors for aerodynamic performances prediction. The obtained airfoils geometrical features in most existing approaches (e.g., geometrical parameters extraction, polynomial description and deep learning) are in Euclidean space. State-of-the-art studies showed that curves or surfaces of an airfoil formed a manifold in Riemannian space. Therefore, the features extracted by existing methods are not sufficient to reflect the geometric-features of airfoils. Meanwhile, flight conditions and geometric features are greatly discrepant with different types, the relevant knowledge of the influence of these two factors that on final aerodynamic performances predictions must be evaluated and learned to improve prediction accuracy. Motivated by the advantages of manifold theory and multi-task learning, we propose a manifold-based airfoil geometric-feature extraction and discrepant data fusion learning method (MDF) to extract geometric-features of airfoils in Riemannian space (we call them manifold-features) and further fuse the manifold-features with flight conditions to predict aerodynamic performances. Experimental results show that our method could extract geometric-features of airfoils more accurately compared with existing methods, that the average MSE of re-built airfoils is reduced by 56.33%, and while keeping the same predicted accuracy level of CL, the MSE of CD predicted by MDF is further reduced by 35.37%.

preprint2022arXiv

A Multi-Characteristic Learning Method with Micro-Doppler Signatures for Pedestrian Identification

The identification of pedestrians using radar micro-Doppler signatures has become a hot topic in recent years. In this paper, we propose a multi-characteristic learning (MCL) model with clusters to jointly learn discrepant pedestrian micro-Doppler signatures and fuse the knowledge learned from each cluster into final decisions. Time-Doppler spectrogram (TDS) and signal statistical features extracted from FMCW radar, as two categories of micro-Doppler signatures, are used in MCL to learn the micro-motion information inside pedestrians' free walking patterns. The experimental results show that our model achieves a higher accuracy rate and is more stable for pedestrian identification than other studies, which make our model more practical.

preprint2022arXiv

Characterizing Multipartite Non-Gaussian Entanglement for Three-Mode Spontaneous Parametric Down-Conversion Process

Very recently, strongly non-Gaussian states have been observed via a direct three-mode spontaneous parametric down-conversion in a superconducting cavity [Phys. Rev. X 10, 011011 (2020)]. The created multi-photon non-Gaussian correlations are attractive and useful for various quantum information tasks. However, how to detect and classify multipartite non-Gaussian entanglement has not yet been completely understood. Here, we present an experimentally practical method to characterize continuous-variable multipartite non-Gaussian entanglement, by introducing a class of nonlinear squeezing parameters involving accessible higher-order moments of phase-space quadratures. As these parameters can depend on arbitrary operators, we consider their analytical optimization over a set of practical measurements, in order to detect different classes of multipartite non-Gaussian entanglement ranging from fully separable to fully inseparable. We demonstrate that the nonlinear squeezing parameters act as an excellent approximation to the quantum Fisher information within accessible third-order moments. The level of the nonlinear squeezing quantifies the metrological advantage provided by those entangled states. Moreover, by analyzing the above mentioned experiment, we show that our method can be readily used to confirm fully inseparable tripartite non-Gaussian entangled states by performing a limited number of measurements without requiring full knowledge of the quantum state.

preprint2022arXiv

Differentially Private Variable Selection via the Knockoff Filter

The knockoff filter, recently developed by Barber and Candes, is an effective procedure to perform variable selection with a controlled false discovery rate (FDR). We propose a private version of the knockoff filter by incorporating Gaussian and Laplace mechanisms, and show that variable selection with controlled FDR can be achieved. Simulations demonstrate that our setting has reasonable statistical power.

preprint2022arXiv

Experimental demonstration of remotely creating Wigner negativity via quantum steering

Non-Gaussian states with Wigner negativity are of particular interest in quantum technology due to their potential applications in quantum computing and quantum metrology. However, how to create such states at a remote location remains a challenge, which is important for efficiently distributing quantum resource between distant nodes in a network. Here, we experimentally prepare optical non-Gaussian state with negative Wigner function at a remote node via local non-Gaussian operation and shared Gaussian entangled state existing quantum steering. By performing photon subtraction on one mode, Wigner negativity is created in the remote target mode. We show that the Wigner negativity is sensitive to loss on the target mode, but robust to loss on the mode performing photon subtraction. This experiment confirms the connection between the remotely created Wigner negativity and quantum steering. As an application, we present that the generated non-Gaussian state exhibits metrological power in quantum phase estimation.

preprint2022arXiv

Few-shot Single-view 3D Reconstruction with Memory Prior Contrastive Network

3D reconstruction of novel categories based on few-shot learning is appealing in real-world applications and attracts increasing research interests. Previous approaches mainly focus on how to design shape prior models for different categories. Their performance on unseen categories is not very competitive. In this paper, we present a Memory Prior Contrastive Network (MPCN) that can store shape prior knowledge in a few-shot learning based 3D reconstruction framework. With the shape memory, a multi-head attention module is proposed to capture different parts of a candidate shape prior and fuse these parts together to guide 3D reconstruction of novel categories. Besides, we introduce a 3D-aware contrastive learning method, which can not only complement the retrieval accuracy of memory network, but also better organize image features for downstream tasks. Compared with previous few-shot 3D reconstruction methods, MPCN can handle the inter-class variability without category annotations. Experimental results on a benchmark synthetic dataset and the Pascal3D+ real-world dataset show that our model outperforms the current state-of-the-art methods significantly.

preprint2022arXiv

HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers

We introduce a new simulation benchmark "HandoverSim" for human-to-robot object handovers. To simulate the giver's motion, we leverage a recent motion capture dataset of hand grasping of objects. We create training and evaluation environments for the receiver with standardized protocols and metrics. We analyze the performance of a set of baselines and show a correlation with a real-world evaluation. Code is open sourced at https://handover-sim.github.io.

preprint2022arXiv

Hierarchical Policies for Cluttered-Scene Grasping with Latent Plans

6D grasping in cluttered scenes is a longstanding problem in robotic manipulation. Open-loop manipulation pipelines may fail due to inaccurate state estimation, while most end-to-end grasping methods have not yet scaled to complex scenes with obstacles. In this work, we propose a new method for end-to-end learning of 6D grasping in cluttered scenes. Our hierarchical framework learns collision-free target-driven grasping based on partial point cloud observations. We learn an embedding space to encode expert grasping plans during training and a variational autoencoder to sample diverse grasping trajectories at test time. Furthermore, we train a critic network for plan selection and an option classifier for switching to an instance grasping policy through hierarchical reinforcement learning. We evaluate our method and compare against several baselines in simulation, as well as demonstrate that our latent planning can generalize to real-world cluttered-scene grasping tasks. Our videos and code can be found at https://sites.google.com/view/latent-grasping .

preprint2022arXiv

Lower Bounds on the Error Probability for Invariant Causal Prediction

It is common practice to collect observations of feature and response pairs from different environments. A natural question is how to identify features that have consistent prediction power across environments. The invariant causal prediction framework proposes to approach this problem through invariance, assuming a linear model that is invariant under different environments. In this work, we make an attempt to shed light on this framework by connecting it to the Gaussian multiple access channel problem. Specifically, we incorporate optimal code constructions and decoding methods to provide lower bounds on the error probability. We illustrate our findings by various simulation settings.

preprint2022arXiv

NeuralGrasps: Learning Implicit Representations for Grasps of Multiple Robotic Hands

We introduce a neural implicit representation for grasps of objects from multiple robotic hands. Different grasps across multiple robotic hands are encoded into a shared latent space. Each latent vector is learned to decode to the 3D shape of an object and the 3D shape of a robotic hand in a grasping pose in terms of the signed distance functions of the two 3D shapes. In addition, the distance metric in the latent space is learned to preserve the similarity between grasps across different robotic hands, where the similarity of grasps is defined according to contact regions of the robotic hands. This property enables our method to transfer grasps between different grippers including a human hand, and grasp transfer has the potential to share grasping skills between robots and enable robots to learn grasping skills from humans. Furthermore, the encoded signed distance functions of objects and grasps in our implicit representation can be used for 6D object pose estimation with grasping contact optimization from partial point clouds, which enables robotic grasping in the real world.

preprint2022arXiv

Quantum Steering: Practical Challenges and Perspectives

Einstein-Rosen-Podolsky (EPR) steering or quantum steering describes the "spooky-action-at-a-distance" that one party is able to remotely alter the states of the other if they share a certain entangled state. Generally, it admits an operational interpretation as the task of verifying entanglement without trust in the steering party's devices, making it lying intermediate between Bell nonlocality and entanglement. Together with the asymmetrical nature, quantum steering has attracted a considerable interest from theoretical and experimental sides over past decades. In this Perspective, we present a brief overview of the EPR steering with emphasis on recent progress, discuss current challenges, opportunities and propose various future directions. We look to the future which directs research to a larger-scale level beyond massless and microscopic systems to reveal steering of higher dimensionality, and to build up steered networks composed of multiple parties.

preprint2022arXiv

TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information

Deep neural networks based object detectors have shown great success in a variety of domains like autonomous vehicles, biomedical imaging, etc. It is known that their success depends on a large amount of data from the domain of interest. While deep models often perform well in terms of overall accuracy, they often struggle in performance on rare yet critical data slices. For example, data slices like "motorcycle at night" or "bicycle at night" are often rare but very critical slices for self-driving applications and false negatives on such rare slices could result in ill-fated failures and accidents. Active learning (AL) is a well-known paradigm to incrementally and adaptively build training datasets with a human in the loop. However, current AL based acquisition functions are not well-equipped to tackle real-world datasets with rare slices, since they are based on uncertainty scores or global descriptors of the image. We propose TALISMAN, a novel framework for Targeted Active Learning or object detectIon with rare slices using Submodular MutuAl iNformation. Our method uses the submodular mutual information functions instantiated using features of the region of interest (RoI) to efficiently target and acquire data points with rare slices. We evaluate our framework on the standard PASCAL VOC07+12 and BDD100K, a real-world self-driving dataset. We observe that TALISMAN outperforms other methods by in terms of average precision on rare slices, and in terms of mAP.

preprint2021arXiv

Deterministic distribution of multipartite entanglement and steering in a quantum network by separable states

As two valuable quantum resources, Einstein-Podolsky-Rosen entanglement and steering play important roles in quantum-enhanced communication protocols. Distributing such quantum resources among multiple remote users in a network is a crucial precondition underlying various quantum tasks. We experimentally demonstrate the deterministic distribution of two- and three-mode Gaussian entanglement and steering by transmitting separable states in a network consisting of a quantum server and multiple users. In our experiment, entangled states are not prepared solely by the quantum server, but are created among independent users during the distribution process. More specifically, the quantum server prepares separable squeezed states and applies classical displacements on them before spreading out, and users simply perform local beam-splitter operations and homodyne measurements after they receive separable states. We show that the distributed Gaussian entanglement and steerability are robust against channel loss. Furthermore, one-way Gaussian steering is achieved among users that is useful for further directional or highly asymmetric quantum information processing.

preprint2021arXiv

iCaps: Iterative Category-level Object Pose and Shape Estimation

This paper proposes a category-level 6D object pose and shape estimation approach iCaps, which allows tracking 6D poses of unseen objects in a category and estimating their 3D shapes. We develop a category-level auto-encoder network using depth images as input, where feature embeddings from the auto-encoder encode poses of objects in a category. The auto-encoder can be used in a particle filter framework to estimate and track 6D poses of objects in a category. By exploiting an implicit shape representation based on signed distance functions, we build a LatentNet to estimate a latent representation of the 3D shape given the estimated pose of an object. Then the estimated pose and shape can be used to update each other in an iterative way. Our category-level 6D object pose and shape estimation pipeline only requires 2D detection and segmentation for initialization. We evaluate our approach on a publicly available dataset and demonstrate its effectiveness. In particular, our method achieves comparably high accuracy on shape estimation.

preprint2021arXiv

Learning Composable Behavior Embeddings for Long-horizon Visual Navigation

Learning high-level navigation behaviors has important implications: it enables robots to build compact visual memory for repeating demonstrations and to build sparse topological maps for planning in novel environments. Existing approaches only learn discrete, short-horizon behaviors. These standalone behaviors usually assume a discrete action space with simple robot dynamics, thus they cannot capture the intricacy and complexity of real-world trajectories. To this end, we propose Composable Behavior Embedding (CBE), a continuous behavior representation for long-horizon visual navigation. CBE is learned in an end-to-end fashion; it effectively captures path geometry and is robust to unseen obstacles. We show that CBE can be used to performing memory-efficient path following and topological mapping, saving more than an order of magnitude of memory than behavior-less approaches.

preprint2021arXiv

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation

Segmenting unseen objects in cluttered scenes is an important skill that robots need to acquire in order to perform tasks in new environments. In this work, we propose a new method for unseen object instance segmentation by learning RGB-D feature embeddings from synthetic data. A metric learning loss function is utilized to learn to produce pixel-wise feature embeddings such that pixels from the same object are close to each other and pixels from different objects are separated in the embedding space. With the learned feature embeddings, a mean shift clustering algorithm can be applied to discover and segment unseen objects. We further improve the segmentation accuracy with a new two-stage clustering algorithm. Our method demonstrates that non-photorealistic synthetic RGB and depth images can be used to learn feature embeddings that transfer well to real-world images for unseen object instance segmentation.

preprint2021arXiv

Quantification of Wigner Negativity Remotely Generated via Einstein-Podolsky-Rosen Steering

Wigner negativity, as a well-known indicator of nonclassicality, plays an essential role in quantum computing and simulation using continuous-variable systems. Recently, it has been proven that Einstein-Podolsky-Rosen steering is a prerequisite to generate Wigner negativity between two remote modes. Motivated by the demand of real-world quantum network, here we investigate the shareability of generated Wigner negativity in the multipartite scenario from a quantitative perspective. By establishing a monogamy relation akin to the generalized Coffman-Kundu-Wootters inequality, we show that the amount of Wigner negativity cannot be freely distributed among different modes. Moreover, for photon subtraction -- one of the main experimentally realized non-Gaussian operations -- we provide a general method to quantify the remotely generated Wigner negativity. With this method, we find that there is no direct quantitative relation between the Gaussian steerability and the amount of generated Wigner negativity. Our results pave the way for exploiting Wigner negativity as a valuable resource for numerous quantum information protocols based on non-Gaussian scenario.

preprint2020arXiv

Flow Field Reconstructions with GANs based on Radial Basis Functions

Nonlinear sparse data regression and generation have been a long-term challenge, to cite the flow field reconstruction as a typical example. The huge computational cost of computational fluid dynamics (CFD) makes it much expensive for large scale CFD data producing, which is the reason why we need some cheaper ways to do this, of which the traditional reduced order models (ROMs) were promising but they couldn't generate a large number of full domain flow field data (FFD) to realize high-precision flow field reconstructions. Motivated by the problems of existing approaches and inspired by the success of the generative adversarial networks (GANs) in the field of computer vision, we prove an optimal discriminator theorem that the optimal discriminator of a GAN is a radial basis function neural network (RBFNN) while dealing with nonlinear sparse FFD regression and generation. Based on this theorem, two radial basis function-based GANs (RBF-GAN and RBFC-GAN), for regression and generation purposes, are proposed. Three different datasets are applied to verify the feasibility of our models. The results show that the performance of the RBF-GAN and the RBFC-GAN are better than that of GANs/cGANs by means of both the mean square error (MSE) and the mean square percentage error (MSPE). Besides, compared with GANs/cGANs, the stability of the RBF-GAN and the RBFC-GAN improve by 34.62% and 72.31%, respectively. Consequently, our proposed models can be used to generate full domain FFD from limited and sparse datasets, to meet the requirement of high-precision flow field reconstructions.

preprint2020arXiv

Information Laundering for Model Privacy

In this work, we propose information laundering, a novel framework for enhancing model privacy. Unlike data privacy that concerns the protection of raw data information, model privacy aims to protect an already-learned model that is to be deployed for public use. The private model can be obtained from general learning methods, and its deployment means that it will return a deterministic or random response for a given input query. An information-laundered model consists of probabilistic components that deliberately maneuver the intended input and output for queries to the model, so the model's adversarial acquisition is less likely. Under the proposed framework, we develop an information-theoretic principle to quantify the fundamental tradeoffs between model utility and privacy leakage and derive the optimal design.

preprint2020arXiv

LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation

Current 6D object pose estimation methods usually require a 3D model for each object. These methods also require additional training in order to incorporate new objects. As a result, they are difficult to scale to a large number of objects and cannot be directly applied to unseen objects. We propose a novel framework for 6D pose estimation of unseen objects. We present a network that reconstructs a latent 3D representation of an object using a small number of reference views at inference time. Our network is able to render the latent 3D representation from arbitrary views. Using this neural renderer, we directly optimize for pose given an input image. By training our network with a large number of 3D shapes for reconstruction and rendering, our network generalizes well to unseen objects. We present a new dataset for unseen object pose estimation--MOPED. We evaluate the performance of our method for unseen object pose estimation on MOPED as well as the ModelNet and LINEMOD datasets. Our method performs competitively to supervised methods that are trained on those objects. Code and data is available at https://keunhong.com/publications/latentfusion/.

preprint2020arXiv

Manipulation Trajectory Optimization with Online Grasp Synthesis and Selection

In robot manipulation, planning the motion of a robot manipulator to grasp an object is a fundamental problem. A manipulation planner needs to generate a trajectory of the manipulator arm to avoid obstacles in the environment and plan an end-effector pose for grasping. While trajectory planning and grasp planning are often tackled separately, how to efficiently integrate the two planning problems remains a challenge. In this work, we present a novel method for joint motion and grasp planning. Our method integrates manipulation trajectory optimization with online grasp synthesis and selection, where we apply online learning techniques to select goal configurations for grasping, and introduce a new grasp synthesis algorithm to generate grasps online. We evaluate our planning approach and demonstrate that our method generates robust and efficient motion plans for grasping in cluttered scenes. Our video can be found at https://www.youtube.com/watch?v=LIcACf8YkGU .

preprint2020arXiv

Quasi-Fine-Grained Uncertainty Relations

Nonlocality, which is the key feature of quantum theory, has been linked with the uncertainty principle by fine-grained uncertainty relations, by considering combinations of outcomes for different measurements. However, this approach assumes that information about the system to be fine-grained is local, and does not present an explicitly computable bound. Here, we generalize above approach to general quasi-fine-grained uncertainty relations (QFGURs) which applies in the presence of quantum memory and provides conspicuously computable bounds to quantitatively link the uncertainty to entanglement and Einstein-Podolsky-Rosen (EPR) steering, respectively. Moreover, our QFGURs provide a framework to unify three important forms of uncertainty relations, i.e., universal uncertainty relations, uncertainty principle in the presence of quantum memory, and fine-grained uncertainty relation. This result gives a direct significance to the uncertainty principle, and allows us to determine whether a quantum measurement exhibits typical quantum correlations, meanwhile, it reveals a fundamental connection between basic elements of quantum theory, specifically, uncertainty measures, combined outcomes for different measurements, quantum memory, entanglement and EPR steering.

preprint2020arXiv

Scaling Local Control to Large-Scale Topological Navigation

Visual topological navigation has been revitalized recently thanks to the advancement of deep learning that substantially improves robot perception. However, the scalability and reliability issue remain challenging due to the complexity and ambiguity of real world images and mechanical constraints of real robots. We present an intuitive solution to show that by accurately measuring the capability of a local controller, large-scale visual topological navigation can be achieved while being scalable and robust. Our approach achieves state-of-the-art results in trajectory following and planning in large-scale environments. It also generalizes well to real robots and new environments without retraining or finetuning.

preprint2020arXiv

Self-supervised 6D Object Pose Estimation for Robot Manipulation

To teach robots skills, it is crucial to obtain data with supervision. Since annotating real world data is time-consuming and expensive, enabling robots to learn in a self-supervised way is important. In this work, we introduce a robot system for self-supervised 6D object pose estimation. Starting from modules trained in simulation, our system is able to label real world images with accurate 6D object poses for self-supervised learning. In addition, the robot interacts with objects in the environment to change the object configuration by grasping or pushing objects. In this way, our system is able to continuously collect data and improve its pose estimation modules. We show that the self-supervised learning improves object segmentation and 6D pose estimation performance, and consequently enables the system to grasp objects more reliably. A video showing the experiments can be found at https://youtu.be/W1Y0Mmh1Gd8.

preprint2020arXiv

The Best of Both Modes: Separately Leveraging RGB and Depth for Unseen Object Instance Segmentation

In order to function in unstructured environments, robots need the ability to recognize unseen novel objects. We take a step in this direction by tackling the problem of segmenting unseen object instances in tabletop environments. However, the type of large-scale real-world dataset required for this task typically does not exist for most robotic settings, which motivates the use of synthetic data. We propose a novel method that separately leverages synthetic RGB and synthetic depth for unseen object instance segmentation. Our method is comprised of two stages where the first stage operates only on depth to produce rough initial masks, and the second stage refines these masks with RGB. Surprisingly, our framework is able to learn from synthetic RGB-D data where the RGB is non-photorealistic. To train our method, we introduce a large-scale synthetic dataset of random objects on tabletops. We show that our method, trained on this dataset, can produce sharp and accurate masks, outperforming state-of-the-art methods on unseen object instance segmentation. We also show that our method can segment unseen objects for robot grasping. Code, models and video can be found at https://rse-lab.cs.washington.edu/projects/unseen-object-instance-segmentation/.

preprint2019arXiv

Versatile Multipartite Einstein-Podolsky-Rosen Steering via a Quantum Frequency Comb

Multipartite Einstein-Podolsky-Rosen steering is an essential resource for quantum communication networks where the reliability of equipment at all of the nodes cannot be fully trusted. Here, we present experimental generation of a highly versatile and flexible repository of multipartite steering using an optical frequency comb and ultrafast pulse shaping. Simply modulating the optical spectral resolution of the detection system using the pulse shaper, this scheme is able to produce on-demand 4, 8 and 16-mode Gaussian steering without changing the photonics architecture. We find that the steerability increases with higher spectral resolution. For 16-mode state, we identify as many as 65534 possible bipartition steering existing in this intrinsic multimode quantum resource, and demonstrate that the prepared state steerability is robust to mode losses. Moreover, we verify four types of monogamy relations of Gaussian steering and demonstrate strong violation for one of them. Our method offers a powerful foundation for constructing quantum networks in real-world scenario.

preprint2015arXiv

A Coarse-to-Fine Model for 3D Pose Estimation and Sub-category Recognition

Despite the fact that object detection, 3D pose estimation, and sub-category recognition are highly correlated tasks, they are usually addressed independently from each other because of the huge space of parameters. To jointly model all of these tasks, we propose a coarse-to-fine hierarchical representation, where each level of the hierarchy represents objects at a different level of granularity. The hierarchical representation prevents performance loss, which is often caused by the increase in the number of parameters (as we consider more tasks to model), and the joint modelling enables resolving ambiguities that exist in independent modelling of these tasks. We augment PASCAL3D+ dataset with annotations for these tasks and show that our hierarchical model is effective in joint modelling of object detection, 3D pose estimation, and sub-category recognition.

preprint2015arXiv

Deep Metric Learning via Lifted Structured Feature Embedding

Learning the distance metric between pairs of examples is of great importance for learning and visual recognition. With the remarkable success from the state of the art convolutional neural networks, recent works have shown promising results on discriminatively training the networks to learn semantic feature embeddings where similar examples are mapped close to each other and dissimilar examples are mapped farther apart. In this paper, we describe an algorithm for taking full advantage of the training batches in the neural network training by lifting the vector of pairwise distances within the batch to the matrix of pairwise distances. This step enables the algorithm to learn the state of the art feature embedding by optimizing a novel structured prediction objective on the lifted problem. Additionally, we collected Online Products dataset: 120k images of 23k classes of online products for metric learning. Our experiments on the CUB-200-2011, CARS196, and Online Products datasets demonstrate significant improvement over existing deep feature embedding methods on all experimented embedding sizes with the GoogLeNet network.

preprint2013arXiv

Saari's Conjecture and Variational Minimal Solutions for $N$-Body Problems

In this paper, we will prove Saari's conjecture in a particular case by using a arithmetic fact, and then, apply it to prove that for any given positive masses, the variational minimal solutions of the N-body problem in ${\mathbb{R}}^2$ are precisely a relative equilibrium solution whose configuration minimizes the function $IU^2$ in ${\mathbb{R}}^2$.

preprint2011arXiv

Gaussian Channel with Noisy Feedback and Peak Energy Constraint

Optimal coding over the additive white Gaussian noise channel under the peak energy constraint is studied when there is noisy feedback over an orthogonal additive white Gaussian noise channel. As shown by Pinsker, under the peak energy constraint, the best error exponent for communicating an M-ary message, M >= 3, with noise-free feedback is strictly larger than the one without feedback. This paper extends Pinsker's result and shows that if the noise power in the feedback link is sufficiently small, the best error exponent for conmmunicating an M-ary message can be strictly larger than the one without feedback. The proof involves two feedback coding schemes. One is motivated by a two-stage noisy feedback coding scheme of Burnashev and Yamamoto for binary symmetric channels, while the other is a linear noisy feedback coding scheme that extends Pinsker's noise-free feedback coding scheme. When the feedback noise power $α$ is sufficiently small, the linear coding scheme outperforms the two-stage (nonlinear) coding scheme, and is asymptotically optimal as $α$ tends to zero. By contrast, when $α$ is relatively larger, the two-stage coding scheme performs better.

preprint2011arXiv

Twisted Angles for Central Configurations Formed By Two Twisted Regular Polygons

In this paper, we study the necessary conditions and sufficient conditions for the twisted angles of the central configurations formed by two twisted regular polygons, specially, we prove that for the 2N-body problems, the twisted angles must be$θ=0 {or} θ=π/N$.

Yu Xiang

What is connected

Connect this record

See the researcher in context

Building this map preview

34 published item(s)

Decentralized Conformal Novelty Detection via Quantized Model Exchange

Non-commutativity as a Universal Characterization for Enhanced Quantum Metrology

Filtering one-way Einstein-Podolsky-Rosen steering

A Manifold-based Airfoil Geometric-feature Extraction and Discrepant Data Fusion Learning Method

A Multi-Characteristic Learning Method with Micro-Doppler Signatures for Pedestrian Identification

Characterizing Multipartite Non-Gaussian Entanglement for Three-Mode Spontaneous Parametric Down-Conversion Process

Differentially Private Variable Selection via the Knockoff Filter

Experimental demonstration of remotely creating Wigner negativity via quantum steering

Few-shot Single-view 3D Reconstruction with Memory Prior Contrastive Network

HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers

Hierarchical Policies for Cluttered-Scene Grasping with Latent Plans

Lower Bounds on the Error Probability for Invariant Causal Prediction

NeuralGrasps: Learning Implicit Representations for Grasps of Multiple Robotic Hands

Quantum Steering: Practical Challenges and Perspectives

TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information

Deterministic distribution of multipartite entanglement and steering in a quantum network by separable states

iCaps: Iterative Category-level Object Pose and Shape Estimation

Learning Composable Behavior Embeddings for Long-horizon Visual Navigation

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation

Quantification of Wigner Negativity Remotely Generated via Einstein-Podolsky-Rosen Steering

Flow Field Reconstructions with GANs based on Radial Basis Functions

Information Laundering for Model Privacy

LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation

Manipulation Trajectory Optimization with Online Grasp Synthesis and Selection

Quasi-Fine-Grained Uncertainty Relations

Scaling Local Control to Large-Scale Topological Navigation

Self-supervised 6D Object Pose Estimation for Robot Manipulation

The Best of Both Modes: Separately Leveraging RGB and Depth for Unseen Object Instance Segmentation

Versatile Multipartite Einstein-Podolsky-Rosen Steering via a Quantum Frequency Comb

A Coarse-to-Fine Model for 3D Pose Estimation and Sub-category Recognition

Deep Metric Learning via Lifted Structured Feature Embedding

Saari's Conjecture and Variational Minimal Solutions for $N$-Body Problems

Gaussian Channel with Noisy Feedback and Peak Energy Constraint

Twisted Angles for Central Configurations Formed By Two Twisted Regular Polygons