Researcher profile

Yifei Shi

Yifei Shi contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
9works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

PartDexTOG: Generating Dexterous Task-Oriented Grasping via Language-driven Part Analysis

Task-oriented grasping is a crucial yet challenging task in robotic manipulation. Despite the recent progress, few existing methods address task-oriented grasping with dexterous hands. Dexterous hands provide better precision and versatility, enabling robots to perform task-oriented grasping more effectively. In this paper, we argue that part analysis can enhance dexterous grasping by providing detailed information about the object's functionality. We propose PartDexTOG, a method that generates dexterous task-oriented grasps via language-driven part analysis. Taking a 3D object and a manipulation task represented by language as input, the method first generates the category-level and part-level grasp descriptions w.r.t the manipulation task by LLMs. Then, a category-part conditional diffusion model is developed to generate a dexterous grasp for each part, respectively, based on the generated descriptions. To select the most plausible combination of grasp and corresponding part from the generated ones, we propose a measure of geometric consistency between grasp and part. We show that our method greatly benefits from the open-world knowledge reasoning on object parts by LLMs, which naturally facilitates the learning of grasp generation on objects with different geometry and for different manipulation tasks. Our method ranks top on the OakInk-shape dataset over all previous methods, improving the Penetration Volume, the Grasp Displace, and the P-FID over the state-of-the-art by $3.58\%$, $2.87\%$, and $41.43\%$, respectively. Notably, it demonstrates good generality in handling novel categories and tasks.

preprint2022arXiv

Blockchain-assisted Undisclosed IIoT Vulnerabilities Trusted Sharing Protection with Dynamic Token

With the large-scale deployment of industrial internet of things (IIoT) devices, the number of vulnerabilities that threaten IIoT security is also growing dramatically, including a mass of undisclosed IIoT vulnerabilities that lack mitigation measures. Coordination Vulnerabilities Disclosure (CVD) is one of the most popular vulnerabilities sharing solutions, in which some security workers (SWs) can develop undisclosed vulnerabilities patches together. However, CVD assumes that sharing participants (SWs) are all honest, and thus offering chances for dishonest SWs to leak undisclosed IIoT vulnerabilities. To combat such threats, we propose an Undisclosed IIoT Vulnerabilities Trusted Sharing Protection (UIV-TSP) scheme with dynamic token. In this article, a dynamic token is an implicit access credential for an SW to acquire an undisclosed vulnerability information, which is only held by the system and constantly updated as the SW access. Meanwhile, the latest updated token can be stealthily sneaked into the acquired information as the traceability token. Once the undisclosed vulnerability information leaves the SW host, the embedded self-destruct program will be automatically triggered to prevent leaks since the destination MAC address in the traceability token has changed. To quickly distinguish dishonest SWs, trust mechanism is adopted to evaluate the trust value of SWs. Moreover, we design a blockchain-assisted continuous logs storage method to achieve the tamper-proofing of dynamic token and the transparency of undisclosed IIoT vulnerabilities sharing. The simulation results indicate that our proposed scheme is resilient to suppress dishonest SWs and protect the IoT undisclosed vulnerabilities effectively.

preprint2022arXiv

RayMVSNet: Learning Ray-based 1D Implicit Fields for Accurate Multi-View Stereo

Learning-based multi-view stereo (MVS) has by far centered around 3D convolution on cost volumes. Due to the high computation and memory consumption of 3D CNN, the resolution of output depth is often considerably limited. Different from most existing works dedicated to adaptive refinement of cost volumes, we opt to directly optimize the depth value along each camera ray, mimicking the range (depth) finding of a laser scanner. This reduces the MVS problem to ray-based depth optimization which is much more light-weight than full cost volume optimization. In particular, we propose RayMVSNet which learns sequential prediction of a 1D implicit field along each camera ray with the zero-crossing point indicating scene depth. This sequential modeling, conducted based on transformer features, essentially learns the epipolar line search in traditional multi-view stereo. We also devise a multi-task learning for better optimization convergence and depth accuracy. Our method ranks top on both the DTU and the Tanks \& Temples datasets over all previous learning-based methods, achieving overall reconstruction score of 0.33mm on DTU and f-score of 59.48% on Tanks & Temples.

preprint2022arXiv

Recurrent 3D Attentional Networks for End-to-End Active Object Recognition

Active vision is inherently attention-driven: The agent actively selects views to attend in order to fast achieve the vision task while improving its internal representation of the scene being observed. Inspired by the recent success of attention-based models in 2D vision tasks based on single RGB images, we propose to address the multi-view depth-based active object recognition using attention mechanism, through developing an end-to-end recurrent 3D attentional network. The architecture takes advantage of a recurrent neural network (RNN) to store and update an internal representation. Our model, trained with 3D shape datasets, is able to iteratively attend to the best views targeting an object of interest for recognizing it. To realize 3D view selection, we derive a 3D spatial transformer network which is differentiable for training with backpropagation, achieving much faster convergence than the reinforcement learning employed by most existing attention-based models. Experiments show that our method, with only depth input, achieves state-of-the-art next-best-view performance in time efficiency and recognition accuracy.

preprint2020arXiv

SymmetryNet: Learning to Predict Reflectional and Rotational Symmetries of 3D Shapes from Single-View RGB-D Images

We study the problem of symmetry detection of 3D shapes from single-view RGB-D images, where severely missing data renders geometric detection approach infeasible. We propose an end-to-end deep neural network which is able to predict both reflectional and rotational symmetries of 3D objects present in the input RGB-D image. Directly training a deep model for symmetry prediction, however, can quickly run into the issue of overfitting. We adopt a multi-task learning approach. Aside from symmetry axis prediction, our network is also trained to predict symmetry correspondences. In particular, given the 3D points present in the RGB-D image, our network outputs for each 3D point its symmetric counterpart corresponding to a specific predicted symmetry. In addition, our network is able to detect for a given shape multiple symmetries of different types. We also contribute a benchmark of 3D symmetry detection based on single-view RGB-D images. Extensive evaluation on the benchmark demonstrates the strong generalization ability of our method, in terms of high accuracy of both symmetry axis prediction and counterpart estimation. In particular, our method is robust in handling unseen object instances with large variation in shape, multi-symmetry composition, as well as novel object categories.

preprint2016arXiv

Resonant inelastic x-ray scattering as a probe of band structure effects in cuprates

We analyze within quasi-particle theory a recent resonant inelastic x-ray scattering (RIXS) experiment on $\mathrm{YBa_2Cu_3O_{6+x}}$ with the incoming photon energy detuned at several values from the resonance maximum [Minola et al., Phys. Rev. Lett. 114, 217003 (2015)]. Surprisingly, the data shows much weaker dependence on detuning than expected from recent measurements on a different cuprate superconductor, $\mathrm{Bi_2Sr_2CuO_{6+x}}$ [Guarise et al., Nat. Commun. 5, 5760 (2014)]. We demonstrate here, that this discrepancy, originally attributed to collective magnetic excitations, can be understood in terms of the differences between the band structures of these materials. We find good agreement between theory and experiment over a large range of dopings, both in the underdoped and in the overdoped regime. Moreover, we demonstrate that the RIXS signal depends sensitively on excitations at energies well above the Fermi surface, that are inaccessible to traditionally used band structure probes, such as angle-resolved photemisson spectroscopy. This makes RIXS a powerful probe of band structure, not suffering from surface preparation problems and small sample sizes, making it potentially applicable to a number of cuprate materials.

preprint2016arXiv

Superconducting pairing in resonant inelastic X-ray scattering

We develop a method to study the effect of the superconducting transition on resonant inelastic X-ray scattering (RIXS) signal in superconductors with an order parameter with an arbitrary symmetry within a quasiparticle approach. As an example, we compare the direct RIXS signal below and above the superconducting transition for p-wave type order parameters. For a p-wave order parameter with a nodal line, we show that, counterintuitively, the effect of the gap is most noticeable for momentum transfers in the nodal direction. This phenomenon may be naturally explained as a type of nesting effect.

preprint2012arXiv

Full counting statistics and the Edgeworth series for matrix product states

We consider full counting statistics of spin in matrix product states. In particular, we study the approach to gaussian distribution for magnetization. We derive the asymptotic corrections to the central limit theorem for magnetization distribution for finite but large blocks in analogy to the Edgeworth series. We also show how central limit theorem like behavior is modified for certain states with topological characteristics such as the AKLT state.

preprint2011arXiv

Boson pairing and unusual criticality in a generalized XY model

We discuss the unusual critical behavior of a generalized XY model containing both 2π-periodic and π-periodic couplings between sites. The presence of vortices and half-vortices allows for single-particle condensate and pair-condensate phases. Using a field theoretic formulation and worm algorithm Monte Carlo simulations, we show that in two dimensions it is possible for the system to pass directly from the disordered (high temperature) phase to the single particle (quasi)-condensate via an Ising transition, a situation reminiscent of the `deconfined criticality' scenario.