Researcher profile

Zhiwei Hu

Zhiwei Hu contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
10 works
0 followers
7 topics
4 close collaborators

Published work

10 published items

preprint · 2022 · arXiv

1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation

The task of referring video object segmentation (RVOS) aims to segment the object in the frames of a given video to which the referring expressions refer. Previous methods adopt multi-stage approaches and design complex pipelines to obtain promising results. Recently, end-to-end methods based on the Transformer have proved their superiority. In this work, we draw on the advantages of the above methods to provide a simple and effective pipeline for RVOS. Firstly, we improve the state-of-the-art one-stage method ReferFormer to obtain mask sequences that are strongly correlated with language descriptions. Secondly, based on a reliable and high-quality keyframe, we leverage the superior performance of a video object segmentation model to further enhance the quality and temporal consistency of the mask results. Our single model reaches 70.3 J&F on the Referring Youtube-VOS validation set and 63.0 on the test set. After ensembling, we achieve 64.1 on the final leaderboard, ranking 1st place in the CVPR2022 Referring Youtube-VOS challenge. Code will be available at https://github.com/Zhiweihhh/cvpr2022-rvos-challenge.git.

preprint · 2022 · arXiv

A ferrotoroidic candidate with well-separated spin chains

The search for novel quasi-one-dimensional (1D) materials is an important aspect of materials science. A toroidal moment, the order parameter of ferrotoroidic order, can be generated by a head-to-tail configuration of magnetic moments. It has been theoretically proposed that a 1D dimerized, antiferromagnetic-like spin chain hosts ferrotoroidicity, with a toroidal moment composed of only two antiparallel spins. Here, we report a ferrotoroidic candidate, Ba6Cr2S10, that realizes this theoretical spin-chain model. The structure consists of unique dimerized face-sharing CrS6 octahedral chains along the c axis. An antiferromagnetic-like ordering at ~10 K breaks both space- and time-reversal symmetries, and the magnetic point group mm'2' allows three ferroic orders in Ba6Cr2S10: (anti)ferromagnetic, ferroelectric and ferrotoroidic. Our investigation reveals that Ba6Cr2S10 is a rare ferrotoroidic candidate with quasi-1D spin chains, which can be considered a starting point for further exploration of the physics and applications of ferrotoroidicity.

preprint · 2022 · arXiv

Deeply Interleaved Two-Stream Encoder for Referring Video Segmentation

Referring video segmentation aims to segment the video object described by a language expression. To address this task, we first design a two-stream encoder to extract CNN-based visual features and transformer-based linguistic features hierarchically, and a vision-language mutual guidance (VLMG) module is inserted into the encoder multiple times to promote the hierarchical and progressive fusion of multi-modal features. Compared with existing multi-modal fusion methods, this two-stream encoder takes into account the multi-granularity linguistic context and realizes deep interleaving between modalities with the help of VLMG. To promote temporal alignment between frames, we further propose a language-guided multi-scale dynamic filtering (LMDF) module to strengthen temporal coherence, which uses language-guided spatial-temporal features to generate a set of position-specific dynamic filters that update the features of the current frame more flexibly and effectively. Extensive experiments on four datasets verify the effectiveness of the proposed model.

preprint · 2022 · arXiv

Type-aware Embeddings for Multi-Hop Reasoning over Knowledge Graphs

Multi-hop reasoning over real-life knowledge graphs (KGs) is a highly challenging problem, as traditional subgraph matching methods cannot deal with noise and missing information. To address this problem, a promising approach has recently been introduced, based on jointly embedding logical queries and KGs into a low-dimensional space to identify answer entities. However, existing proposals ignore critical semantic knowledge inherently available in KGs, such as type information. To leverage type information, we propose a novel TypE-aware Message Passing (TEMP) model, which enhances the entity and relation representations in queries, and simultaneously improves generalization, deductive and inductive reasoning. Remarkably, TEMP is a plug-and-play model that can be easily incorporated into existing embedding-based models to improve their performance. Extensive experiments on three real-world datasets demonstrate TEMP's effectiveness.

preprint · 2022 · arXiv

Visual Subtitle Feature Enhanced Video Outline Generation

With the tremendously increasing number of videos, there is a great demand for techniques that help people quickly navigate to the video segments they are interested in. However, current work on video understanding mainly focuses on video content summarization, while little effort has been made to explore the structure of a video. Inspired by textual outline generation, we introduce a novel video understanding task, namely video outline generation (VOG). This task is defined to contain two sub-tasks: (1) segmenting the video according to the content structure and then (2) generating a heading for each segment. To learn and evaluate VOG, we annotate a 10k+ dataset, called DuVOG. Specifically, we use OCR tools to recognize subtitles of videos. Then annotators are asked to divide subtitles into chapters and title each chapter. In videos, highlighted text tends to be the headline since it is more likely to attract attention. Therefore we propose a Visual Subtitle feature Enhanced video outline generation model (VSENet), which takes as input the textual subtitles together with their visual font sizes and positions. We consider the VOG task as a sequence tagging problem that extracts spans where the headings are located and then rewrites them to form the final outlines. Furthermore, based on the similarity between video outlines and textual outlines, we use a large number of articles with chapter headings to pretrain our model. Experiments on DuVOG show that our model largely outperforms other baseline methods, achieving an F1-score of 77.1 at the video segmentation level and a ROUGE-L_F0.5 of 85.0 at the headline generation level.

preprint · 2022 · arXiv

Youling: an AI-Assisted Lyrics Creation System

Recently, a variety of neural models have been proposed for lyrics generation. However, most previous work completes the generation process in a single pass with little human intervention. We believe that lyrics creation is a creative process centered on human intelligence. AI should play a role as an assistant in the lyrics creation process, where human interactions are crucial for high-quality creation. This paper demonstrates Youling, an AI-assisted lyrics creation system, designed to collaborate with music creators. In the lyrics generation process, Youling supports a traditional one-pass full-text generation mode as well as an interactive generation mode, which allows users to select satisfactory sentences from generated candidates conditioned on the preceding context. The system also provides a revision module which enables users to revise undesired sentences or words of lyrics repeatedly. In addition, Youling allows users to use multifaceted attributes to control the content and format of generated lyrics. The demo video of the system is available at https://youtu.be/DFeNpHk0pm4.

preprint · 2021 · arXiv

Magnetic Frustration in a Zeolite

Zeolites are so well known in real-world applications and after decades of scientific study that they hardly need any introduction: their importance in chemistry cannot be overemphasized. Here we add to the remarkable properties that they display by reporting our discovery that the simplest zeolite, sodalite, when doped with Cr3+ in the β-cage, is a frustrated magnet. Soft X-ray absorption spectroscopy and magnetic measurements reveal that the Cr present is Cr(III). Cr(III), with its isotropic 3d3 valence electron configuration, is well known as the basis for many geometrically frustrated magnets, but it is especially surprising that a material like the Ca8Al12Cr2O29 zeolite is a frustrated magnet. This finding illustrates the value of exploring the properties of even well-known materials families.

preprint · 2021 · arXiv

Possible multi-orbital ground state in CeCu$_2$Si$_2$

The crystal-field ground state wave function of CeCu$_2$Si$_2$ has been investigated with linearly polarized $M$-edge x-ray absorption spectroscopy from 250mK to 250K, thus covering the superconducting ($T_{\text{c}}$=0.6K), the Kondo ($T_{\text{K}}\approx$20K) as well as the Curie-Weiss regime. The comparison with full-multiplet calculations shows that the temperature dependence of the experimental linear dichroism is well explained with a $Γ_7^{(1)}$ crystal-field ground state and the thermal population of excited states at around 30meV. The crystal-field scheme does not change throughout the entire temperature range, thus making the scenario of orbital switching unlikely. Spectroscopic evidence for the presence of the Ce 4$f^0$ configuration in the ground state is consistent with the possibility of a multi-orbital character of the ground state. We estimate from the Kondo temperature and crystal-field splitting energies that several percent of the higher-lying $Γ_6$ and $Γ_7^{(2)}$ crystal-field states are mixed into the primarily $Γ_7^{(1)}$ ground state. This estimate is also supported by renormalized band-structure calculations that use the experimentally determined crystal-field scheme.

preprint · 2020 · arXiv

Room-temperature ferrimagnetism of anti-site-disordered Ca2MnOsO6

Room-temperature ferrimagnetism was discovered for the anti-site-disordered perovskite Ca2MnOsO6 with Tc = 305 K. Ca2MnOsO6 crystallizes into an orthorhombic structure with a space group of Pnma, in which Mn and Os share the oxygen-coordinated-octahedral site at an equal ratio without a noticeable ordered arrangement. The material is electrically semiconducting with variable-range-hopping behavior. X-ray absorption spectroscopy confirmed the trivalent state of the Mn and the pentavalent state of the Os. X-ray magnetic circular dichroism spectroscopy reveals that the Mn and Os magnetic moments are aligned antiferromagnetically, thereby classifying the material as a ferrimagnet which is in accordance with band structure calculations. It is intriguing that the magnetic signal of the Os is very weak, and that the observed total magnetic moment is primarily due to the Mn. The Tc = 305 K is the second highest in the material category of so-called disordered ferromagnets such as CaRu1-xMnxO3, SrRu1-xCrxO3, and CaIr1-xMnxO3, and hence, may support the development of spintronic oxides with relaxed requirements concerning the anti-site disorder of the magnetic ions.

preprint · 2019 · arXiv

High-pressure synthesis and spin glass behavior of a Mn/Ir disordered quadruple perovskite CaCu3Mn2Ir2O12

A new 3d-5d hybridized quadruple perovskite oxide, CaCu3Mn2Ir2O12, was synthesized by high-pressure and high-temperature methods. The Rietveld structure analysis reveals that the compound crystallizes in an AA'3B4O12-type perovskite structure with space group Im-3, where the Ca and Cu are 1:3 ordered at fixed atomic positions. At the B site, the 3d Mn and the 5d Ir ions are distributed in a disordered manner owing to the rare equal +4 charge state of both ions, as determined by X-ray absorption spectroscopy. The competing antiferromagnetic and ferromagnetic interactions among Cu2+, Mn4+, and Ir4+ ions give rise to spin glass behavior, which follows a conventional dynamical slowing down model.