Source author record

Di Tian

Di Tian appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.str-el eess.SP Robotics

Catalog footprint

What is connected

2works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Tactile-based Multimodal Fusion in Embodied Intelligence: A Survey of Vision, Language, and Contact-Driven Paradigms

Tactile sensing is a fundamental modality for embodied intelligence, offering unique and direct feedback on contact geometry, material properties, and interaction dynamics that remote sensors cannot replace. However, unimodal tactile perception is inherently limited by its sparse spatial coverage and lack of global semantic context. With the recent explosion in deep learning and large language models, integrating tactile with vision and language has become essential to bridge physical interaction with semantic reasoning, leading to the emergence of Multimodal Tactile Fusion. Despite rapid progress, the existing researches remain fragmented across disparate datasets, sensing modalities, and tasks, lacking a unified theoretical framework. To address this gap, this paper provides a comprehensive survey of multimodal tactile fusion research up to the first quarter of 2026. We propose a hierarchical taxonomy that organizes the field into two primary dimensions: multimodal datasets and multimodal methods. On the data side, we categorize resources ranging from Tactile-Vision datasets, Tactile-Language datasets, Tactile-Vision-Language datasets, and Tactile-Vision-Other datasets. On the method side, we structure prior work into three core pillars: (1) Multimodal Perception and Recognition, which focuses on object understanding and grasp prediction; (2) Cross-Modal Generation, focusing on bidirectional translation between tactile, vision, and text; and (3) Multimodal Interaction, emphasizing feedback control and language-guided manipulation. Furthermore, we summarize representative tactile sensing hardware, review commonly used evaluation metrics and benchmark settings, and discuss current challenges and promising future directions.

preprint2015arXiv

X-ray scattering study of pyrochlore iridates: crystal structure, electronic and magnetic excitations

We have investigated the structural, electronic, and magnetic properties of the pyrochlore iridates Eu2Ir2O7 and Pr2Ir2O7 using a combination of resonant elastic x-ray scattering, x-ray powder diffraction, and resonant inelastic x-ray scattering (RIXS). The structural parameters of Eu2Ir2O7 have been examined as a function of temperature and applied pressure, with a particular emphasis on regions of the phase diagram where electronic and magnetic phase transitions have been reported. We find no evidence of crystal symmetry change over the range of temperatures (~6 to 300 K) and pressures (~0.1 to 17 GPa) studied. We have also investigated the electronic and magnetic excitations in single crystal samples of Eu2Ir2O7 and Pr2Ir2O7 using high resolution Ir L3-edge RIXS. In spite of very different ground state properties, we find these materials exhibit qualitatively similar excitation spectra, with crystal field excitations at ~3-5 eV, spin-orbit excitations at ~0.5-1 eV, and broad low-lying excitations below ~0.15 eV. In Eu2Ir2O7 we observe highly damped magnetic excitations at ~45 meV, which display significant momentum dependence. We compare these results with recent dynamical structure factor calculations.