Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
55works
0followers
26topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

55 published item(s)

preprint2026arXiv

A Multi-Task Embedder For Retrieval Augmented LLMs

LLMs confront inherent limitations in terms of its knowledge, memory, and action. The retrieval augmentation stands as a vital mechanism to address these limitations, which brings in useful information from external sources to augment the LLM. However, existing retrieval methods encounter two pressing issues. On one hand, the general retrievers are not properly optimized for retrieval augmentation hence exhibit limited effectiveness; on the other hand, the task-specific retrievers excel in the targeted retrieval augmentation scenario, while lack the versatility to handle diverse scenarios. In this work, we propose \textbf{LLM-Embedder} for the unified support of diverse retrieval augmentation scenarios. Our method presents three technical contributions. Firstly, we introduce a new \textit{reward formulation}, namely {rank-aware reward}. It exploits the ranking position of the desired output among $N$ sampled outputs from the LLM, which leads to fine-grained and robust computation of reward from the LLM's feedback. Secondly, we design a novel \textit{distillation objective}, called graded distillation. It incorporates both the absolute value and the relative order of the reward for more sufficient utilization of the LLM's feedback. Thirdly, we systematically optimize the \textit{multi-task learning}, which effectively unifies the multiple retrieval functionalities into one model. In our experiment, LLM-Embedder notably improves the LLM's performances in various downstream tasks, and outperforms both general and task-specific retrievers with a substantial advantage.

preprint2026arXiv

Cryogenic interface-state filling and tunneling mechanisms in strained Ge/SiGe heterostructures

Traps at the semiconductor-oxide interface are considered as a major source of instability in strained Ge/SiGe quantum devices, yet the quantified study of their cryogenic behavior remains limited. In this work, we investigate interface-state trapping using Hall-bar field-effect transistors fabricated on strained Ge/SiGe heterostructures. Combining transport measurements with long-term stabilization and Schrödinger-Poisson modelling, we reconstruct the gradual filling process of interface states at cryogenic condition. Using the calculated valence band profiles, we further evaluate the tunneling current density between the quantum well and the semiconductor-oxide interface. Our calculation demonstrates that the total tunneling current is consistent with a crossover from trap-assisted-tunneling-dominated transport to Fowler-Nordheim-tunneling-dominated transport under different gate bias regimes. These results refine the conventional Fowler-Nordheim-based picture of interface trapping in strained Ge/SiGe heterostructures and provide guidelines for improving Ge-based quantum device performance by improving barrier crystalline qualities and reducing dislocation-related trap densities.

preprint2026arXiv

MemoBrain: Executive Memory as an Agentic Brain for Reasoning

Complex reasoning in tool-augmented agent frameworks is inherently long-horizon, causing reasoning traces and transient tool artifacts to accumulate and strain the bounded working context of large language models. Without explicit memory mechanisms, such accumulation disrupts logical continuity and undermines task alignment. This positions memory not as an auxiliary efficiency concern, but as a core component for sustaining coherent, goal-directed reasoning over long horizons. We propose MemoBrain, an executive memory model for tool-augmented agents that constructs a dependency-aware memory over reasoning steps, capturing salient intermediate states and their logical relations. Operating as a co-pilot alongside the reasoning agent, MemoBrain organizes reasoning progress without blocking execution and actively manages the working context. Specifically, it prunes invalid steps, folds completed sub-trajectories, and preserves a compact, high-salience reasoning backbone under a fixed context budget. Together, these mechanisms enable explicit cognitive control over reasoning trajectories rather than passive context accumulation. We evaluate MemoBrain on challenging long-horizon benchmarks, including GAIA, WebWalker, and BrowseComp-Plus, demonstrating consistent improvements over strong baselines.

preprint2026arXiv

MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval

Accurately locating key moments within long videos is crucial for solving long video understanding (LVU) tasks. However, existing benchmarks are either severely limited in terms of video length and task diversity, or they focus solely on the end-to-end LVU performance, making them inappropriate for evaluating whether key moments can be accurately accessed. To address this challenge, we propose MomentSeeker, a novel benchmark for long-video moment retrieval (LMVR), distinguished by the following features. First, it is created based on long and diverse videos, averaging over 1200 seconds in duration and collected from various domains, e.g., movie, anomaly, egocentric, and sports. Second, it covers a variety of real-world scenarios in three levels: global-level, event-level, object-level, covering common tasks like action recognition, object localization, and causal reasoning, etc. Third, it incorporates rich forms of queries, including text-only queries, image-conditioned queries, and video-conditioned queries. On top of MomentSeeker, we conduct comprehensive experiments for both generation-based approaches (directly using MLLMs) and retrieval-based approaches (leveraging video retrievers). Our results reveal the significant challenges in long-video moment retrieval in terms of accuracy and efficiency, despite improvements from the latest long-video MLLMs and task-specific fine-tuning. We have publicly released MomentSeeker(https://yhy-2000.github.io/MomentSeeker/) to facilitate future research in this area.

preprint2026arXiv

Novel GPU Boruta algorithms for feature selection from high-dimensional data

Most feature selection algorithms, especially wrapper methods, run inefficiently on CPU based platforms because of their high computational complexity. This inefficiency makes them unsuitable for processing large scale datasets. To address this challenge, the present study proposed two GPU accelerated versions of the Boruta feature selection procedure, in which Boruta-Permut relies on permutation based feature importance and Boruta-TreeImp employs importance based on impurity reduction. To evaluate these methods we conducted experiments on both a self constructed dataset and several publicly available datasets. The experimental results show that the proposed GPU accelerated algorithms greatly improve computational efficiency while preserving feature selection accuracy comparable to the original Boruta algorithm. In our analysis we also observe that the impurity reduction based version can overestimate the importance of some features. Overall these findings suggest that performing Boruta feature selection on GPUs offers an effective and cost efficient solution for large scale data analysis, which is a good deal.

preprint2026arXiv

PrivCode: When Code Generation Meets Differential Privacy

Large language models (LLMs) have presented outstanding performance in code generation and completion. However, fine-tuning these models on private datasets can raise privacy and proprietary concerns, such as the leakage of sensitive personal information. Differentially private (DP) code generation provides theoretical guarantees for protecting sensitive code by generating synthetic datasets that preserve statistical properties while reducing privacy leakage concerns. However, DP code generation faces significant challenges due to the strict syntactic dependencies and the privacy-utility trade-off. We propose PrivCode, the first DP synthesizer specifically designed for code datasets. It incorporates a two-stage framework to improve both privacy and utility. In the first stage, termed "privacy-sanitizing", PrivCode generates DP-compliant synthetic code by training models using DP-SGD while introducing syntactic information to preserve code structure. The second stage, termed "utility-boosting", fine-tunes a larger pre-trained LLM on the synthetic privacy-free code to mitigate the utility loss caused by DP, enhancing the utility of the generated code. Extensive experiments on four LLMs show that PrivCode generates higher-utility code across various testing tasks under four benchmarks. The experiments also confirm its ability to protect sensitive data under varying privacy budgets. We provide the replication package at the anonymous link.

preprint2026arXiv

Video-Browser: Towards Agentic Open-web Video Browsing

The evolution of autonomous agents is redefining information seeking, transitioning from passive retrieval to proactive, open-ended web research. However, a significant modality gap remains in processing the web's most dynamic and information-dense modality: video. In this paper, we first formalize the task of Agentic Video Browsing and introduce Video-BrowseComp, a benchmark evaluating open-ended agentic browsing tasks that enforce a mandatory dependency on videos. We observe that current paradigms struggle to reconcile the scale of open-ended video exploration with the need for fine-grained visual verification. Direct visual inference (e.g., RAG) maximizes perception but incurs prohibitive context costs, while text-centric summarization optimizes efficiency but often misses critical visual details required for accurate grounding. To address this, we propose Video-Browser, a novel agent leveraging Pyramidal Perception, filtering with cheap metadata and zooming in with expensive visual perception only when necessary. Experiments demonstrate that our approach achieves a 37.5% relative improvement while reducing token consumption by 58.3% compared to Direct visual inference, establishing a foundation for verifiable open-web video research. We open-source all codes, benchmark at {https://anonymous.4open.science/r/VideoBrowser} and {https://github.com/chrisx599/Video-Browser}.

preprint2025arXiv

Collaborative Low-Rank Adaptation for Pre-Trained Vision Transformers

Low-rank adaptation (LoRA) has achieved remarkable success in fine-tuning pre-trained vision transformers for various downstream tasks. Existing studies mainly focus on exploring more parameter-efficient strategies or more effective representation learning schemes. However, these methods either sacrifice fine-tuning performance or introduce excessive trainable parameters, failing to strike a balance between learning performance and parameter efficiency. To address this problem, we propose a novel tuning method named collaborative low-rank adaptation (CLoRA) in this paper. CLoRA consists of base-space sharing and sample-agnostic diversity enhancement (SADE) components. To maintain parameter efficiency while expanding the learning capacity of low-rank modules (LRMs), base-space sharing allows all LRMs to share a set of down/up-projection spaces. In CLoRA, the low-rank matrices obtained from the shared spaces collaboratively construct each LRM. Since the representations extracted by these matrices may contain redundant information, SADE is employed to regularize the similarities among them to encourage diverse representations in the training process. We conduct extensive experiments on widely used image and point cloud datasets to evaluate the performance of CLoRA. Experimental results demonstrate that CLoRA strikes a better balance between learning performance and parameter efficiency, while requiring the fewest GFLOPs for point cloud analysis, compared with the state-of-the-art methods.

preprint2025arXiv

Few-Shot-Based Modular Image-to-Video Adapter for Diffusion Models

Diffusion models (DMs) have recently achieved impressive photorealism in image and video generation. However, their application to image animation remains limited, even when trained on large-scale datasets. Two primary challenges contribute to this: the high dimensionality of video signals leads to a scarcity of training data, causing DMs to favor memorization over prompt compliance when generating motion; moreover, DMs struggle to generalize to novel motion patterns not present in the training set, and fine-tuning them to learn such patterns, especially using limited training data, is still under-explored. To address these limitations, we propose Modular Image-to-Video Adapter (MIVA), a lightweight sub-network attachable to a pre-trained DM, each designed to capture a single motion pattern and scalable via parallelization. MIVAs can be efficiently trained on approximately ten samples using a single consumer-grade GPU. At inference time, users can specify motion by selecting one or multiple MIVAs, eliminating the need for prompt engineering. Extensive experiments demonstrate that MIVA enables more precise motion control while maintaining, or even surpassing, the generation quality of models trained on significantly larger datasets.

preprint2024arXiv

Physics-informed Machine Learning for Battery Pack Thermal Management

With the popularity of electric vehicles, the demand for lithium-ion batteries is increasing. Temperature significantly influences the performance and safety of batteries. Battery thermal management systems can effectively control the temperature of batteries; therefore, the performance and safety can be ensured. However, the development process of battery thermal management systems is time-consuming and costly due to the extensive training dataset needed by data-driven models requiring enormous computational costs for finite element analysis. Therefore, a new approach to constructing surrogate models is needed in the era of AI. Physics-informed machine learning enforces the physical laws in surrogate models, making it the perfect candidate for estimating battery pack temperature distribution. In this study, we first developed a 21700 battery pack indirect liquid cooling system with cold plates on the top and bottom with thermal paste surrounding the battery cells. Then, the simplified finite element model was built based on experiment results. Due to the high coolant flow rate, the cold plates can be considered as constant temperature boundaries, while battery cells are the heat sources. The physics-informed convolutional neural network served as a surrogate model to estimate the temperature distribution of the battery pack. The loss function was constructed considering the heat conduction equation based on the finite difference method. The physics-informed loss function helped the convergence of the training process with less data. As a result, the physics-informed convolutional neural network showed more than 15 percents improvement in accuracy compared to the data-driven method with the same training data.

preprint2023arXiv

Async-fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level

In-memory key-value stores (IMKVSes) serve many online applications because of their efficiency. To support data backup, popular industrial IMKVSes periodically take a point-in-time snapshot of the in-memory data with the system call fork. However, this mechanism can result in latency spikes for queries arriving during the snapshot period because fork leads the engine into the kernel mode in which the engine is out-of-service for queries. In contrast to existing research focusing on optimizing snapshot algorithms, we optimize the fork operation to address the latency spikes problem from the operating system (OS) level, while keeping the data persistent mechanism in IMKVSes unchanged. Specifically, we first conduct an in-depth study to reveal the impact of the fork operation as well as the optimization techniques on query latency. Based on findings in the study, we propose Async-fork to offload the work of copying the page table from the engine (the parent process) to the child process as copying the page table dominates the execution time of fork. To keep data consistent between the parent and the child, we design the proactive synchronization strategy. Async-fork is implemented in the Linux kernel and deployed into the online Redis database in public clouds. Our experiment results show that compared with the default fork method in OS, Async-fork reduces the tail latency of queries arriving during the snapshot period by 81.76% on an 8GB instance and 99.84% on a 64GB instance.

preprint2023arXiv

Emergent Electronic Kagome Lattice in Correlated Charge-Density-Wave State of 1T-TaS$_2$

Quantum materials with tunable correlated and/or topological electronic states, such as the electronic Kagome lattice, provide an ideal platform to study the exotic quantum properties. However, the real-space investigations on the correlated electronic Kagome lattice have been rarely reported. Herein, we report on the electronic Kagome lattice emerging in the correlated charge-density-wave (CDW) state of 1T-TaS$_2$ at ~200 K via variable-temperature scanning tunneling microscopy (VT-STM). This emergent Kagome lattice can be considered a fractional electron-filling superstructure with reduced translational and rotational symmetries, confirmed by STM measurements and density functional theory simulations. The characteristic band structure and density of states of this electronic Kagome lattice are further explored based on theoretical calculations. Our results demonstrate a self-organized electronic Kagome lattice from the correlated CDW state via the effective tuning parameter of temperature and provide a platform to directly explore the interplay of correlated electrons and topological physics.

preprint2023arXiv

Time-aware Hyperbolic Graph Attention Network for Session-based Recommendation

Session-based Recommendation (SBR) is to predict users' next interested items based on their previous browsing sessions. Existing methods model sessions as graphs or sequences to estimate user interests based on their interacted items to make recommendations. In recent years, graph-based methods have achieved outstanding performance on SBR. However, none of these methods consider temporal information, which is a crucial feature in SBR as it indicates timeliness or currency. Besides, the session graphs exhibit a hierarchical structure and are demonstrated to be suitable in hyperbolic geometry. But few papers design the models in hyperbolic spaces and this direction is still under exploration. In this paper, we propose Time-aware Hyperbolic Graph Attention Network (TA-HGAT) - a novel hyperbolic graph neural network framework to build a session-based recommendation model considering temporal information. More specifically, there are three components in TA-HGAT. First, a hyperbolic projection module transforms the item features into hyperbolic space. Second, the time-aware graph attention module models time intervals between items and the users' current interests. Third, an evolutionary loss at the end of the model provides an accurate prediction of the recommended item based on the given timestamp. TA-HGAT is built in a hyperbolic space to learn the hierarchical structure of session graphs. Experimental results show that the proposed TA-HGAT has the best performance compared to ten baseline models on two real-world datasets.

preprint2022arXiv

1T-FeS$_2$$:$ a new type of two-dimensional metallic ferromagnet

Discovery of intrinsic two-dimensional (2D) magnetic materials is crucial for understanding the fundamentals of 2D magnetism and realizing next-generation magnetoelectronic and magneto-optical devices. Although significant efforts have been devoted to identifying 2D magnetism by exfoliating bulk magnetic layered materials, seldom studies are performed to synthesize ultra-thin magnetic materials directly for non-layered magnetic materials. Here, we report the successful synthesis of a new type of theoretically proposed 2D metallic ferromagnet 1T FeS2, through the molten-salt-assisted chemical vapor deposition (CVD) method. The long-range 2D ferromagnetic order is confirmed by the observation of a large anomalous Hall effect (AHE) and a hysteretic magnetoresistance. The experimentally detected out-of-plane ferromagnetic ordering is theoretically suported with Stoner criterion. Our findings open up new possibilities to search novel 2D ferromagnets in non-layered compounds and render opportunities for realizing realistic ultra-thin spintronic devices.

preprint2022arXiv

A Mutually Reinforced Framework for Pretrained Sentence Embeddings

The lack of labeled data is a major obstacle to learning high-quality sentence embeddings. Recently, self-supervised contrastive learning (SCL) is regarded as a promising way to address this problem. However, the existing works mainly rely on hand-crafted data annotation heuristics to generate positive training samples, which not only call for domain expertise and laborious tuning, but are also prone to the following unfavorable cases: 1) trivial positives, 2) coarse-grained positives, and 3) false positives. As a result, the self-supervision's quality can be severely limited in reality. In this work, we propose a novel framework InfoCSE to address the above problems. Instead of relying on annotation heuristics defined by humans, it leverages the sentence representation model itself and realizes the following iterative self-supervision process: on one hand, the improvement of sentence representation may contribute to the quality of data annotation; on the other hand, more effective data annotation helps to generate high-quality positive samples, which will further improve the current sentence representation model. In other words, the representation learning and data annotation become mutually reinforced, where a strong self-supervision effect can be derived. Extensive experiments are performed based on three benchmark datasets, where notable improvements can be achieved against the existing SCL-based methods.

preprint2022arXiv

A Novel Underwater Image Enhancement and Improved Underwater Biological Detection Pipeline

For aquaculture resource evaluation and ecological environment monitoring, automatic detection and identification of marine organisms is critical. However, due to the low quality of underwater images and the characteristics of underwater biological, a lack of abundant features may impede traditional hand-designed feature extraction approaches or CNN-based object detection algorithms, particularly in complex underwater environment. Therefore, the goal of this paper is to perform object detection in the underwater environment. This paper proposed a novel method for capturing feature information, which adds the convolutional block attention module (CBAM) to the YOLOv5 backbone. The interference of underwater creature characteristics on object characteristics is decreased, and the output of the backbone network to object information is enhanced. In addition, the self-adaptive global histogram stretching algorithm (SAGHS) is designed to eliminate the degradation problems such as low contrast and color loss caused by underwater environmental information to better restore image quality. Extensive experiments and comprehensive evaluation on the URPC2021 benchmark dataset demonstrate the effectiveness and adaptivity of our methods. Beyond that, this paper conducts an exhaustive analysis of the role of training data on performance.

preprint2022arXiv

Ada-Ranker: A Data Distribution Adaptive Ranking Paradigm for Sequential Recommendation

A large-scale recommender system usually consists of recall and ranking modules. The goal of ranking modules (aka rankers) is to elaborately discriminate users' preference on item candidates proposed by recall modules. With the success of deep learning techniques in various domains, we have witnessed the mainstream rankers evolve from traditional models to deep neural models. However, the way that we design and use rankers remains unchanged: offline training the model, freezing the parameters, and deploying it for online serving. Actually, the candidate items are determined by specific user requests, in which underlying distributions (e.g., the proportion of items for different categories, the proportion of popular or new items) are highly different from one another in a production environment. The classical parameter-frozen inference manner cannot adapt to dynamic serving circumstances, making rankers' performance compromised. In this paper, we propose a new training and inference paradigm, termed as Ada-Ranker, to address the challenges of dynamic online serving. Instead of using parameter-frozen models for universal serving, Ada-Ranker can adaptively modulate parameters of a ranker according to the data distribution of the current group of item candidates. We first extract distribution patterns from the item candidates. Then, we modulate the ranker by the patterns to make the ranker adapt to the current data distribution. Finally, we use the revised ranker to score the candidate list. In this way, we empower the ranker with the capacity of adapting from a global model to a local model which better handles the current task.

preprint2022arXiv

Asymmetric Fraunhofer pattern in Josephson junctions from heterodimensional superlattice V$_5$S$_8$

Introduction of spin-orbit coupling (SOC) in a Josephson junction (JJ) gives rise to unusual Josephson effects. We investigate JJs based on a newly discovered heterodimensional superlattice V$_5$S$_8$ with a special form of SOC. The unique homointerface of our JJs enables elimination of extrinsic effects due to interfaces and disorder. We observe asymmetric Fraunhofer patterns with respect to both the perpendicular magnetic field and the current. The asymmetry is influenced by an in-plane magnetic field. Analysis of the pattern points to a nontrivial spatial distribution of the Josephson current that is intrinsic to the SOC in V$_5$S$_8$.

preprint2022arXiv

Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding

Generalized text representations are the foundation of many natural language understanding tasks. To fully utilize the different corpus, it is inevitable that models need to understand the relevance among them. However, many methods ignore the relevance and adopt a single-channel model (a coarse paradigm) directly for all tasks, which lacks enough rationality and interpretation. In addition, some existing works learn downstream tasks by stitches skill block(a fine paradigm), which might cause irrationalresults due to its redundancy and noise. Inthis work, we first analyze the task correlation through three different perspectives, i.e., data property, manual design, and model-based relevance, based on which the similar tasks are grouped together. Then, we propose a hierarchical framework with a coarse-to-fine paradigm, with the bottom level shared to all the tasks, the mid-level divided to different groups, and the top-level assigned to each of the tasks. This allows our model to learn basic language properties from all tasks, boost performance on relevant tasks, and reduce the negative impact from irrelevant tasks. Our experiments on 13 benchmark datasets across five natural language understanding tasks demonstrate the superiority of our method.

preprint2022arXiv

Direct Light Orbital Angular Momentum Detection in Mid-Infrared based on Type-II Weyl Semimetal TaIrTe4

The capability of direct photocurrent detection of orbital angular momentum (OAM) of light has recently been realized with topological Weyl semimetal, but limited to near infrared wavelength range. The extension of direct OAM detection to midinfrared, a wavelength range that plays important role in a vast range of applications, such as autonomous driving, night vision and motion detection, is challenging and has not yet been realized. This is because most studies of photocurrent responses are not sensitive to the phase information and the photo response is usually very poor in the mid-infrared. In this study, we designed a photodetector based on Type-II Weyl semimetal tantalum iridium tellurides with designed electrode geometries for direct detection of the topological charge of OAM through orbital photogalvanic effect. Our results indicate helical phase gradient of light can be distinguished by a current winding around the optical beam axis with a magnitude proportional to its quantized OAM mode number. The topological enhanced response at mid-infrared of TaIrTe4 further help overcome the low responsivity issues and finally render the direct orbital angular momentum detection capability in mid-infrared. Our study enables on-chip integrated OAM detection, and thus OAM sensitive focal plane arrays in mid-infrared. Such capability triggers new route to explore applications of light carrying OAM, especially that it can crucially promote the performance of many mid-infrared imaging related applications, such as intricate target recognition and night vision.

preprint2022arXiv

Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings

Vector quantization (VQ) based ANN indexes, such as Inverted File System (IVF) and Product Quantization (PQ), have been widely applied to embedding based document retrieval thanks to the competitive time and memory efficiency. Originally, VQ is learned to minimize the reconstruction loss, i.e., the distortions between the original dense embeddings and the reconstructed embeddings after quantization. Unfortunately, such an objective is inconsistent with the goal of selecting ground-truth documents for the input query, which may cause severe loss of retrieval quality. Recent works identify such a defect, and propose to minimize the retrieval loss through contrastive learning. However, these methods intensively rely on queries with ground-truth documents, whose performance is limited by the insufficiency of labeled data. In this paper, we propose Distill-VQ, which unifies the learning of IVF and PQ within a knowledge distillation framework. In Distill-VQ, the dense embeddings are leveraged as "teachers", which predict the query's relevance to the sampled documents. The VQ modules are treated as the "students", which are learned to reproduce the predicted relevance, such that the reconstructed embeddings may fully preserve the retrieval result of the dense embeddings. By doing so, Distill-VQ is able to derive substantial training signals from the massive unlabeled data, which significantly contributes to the retrieval quality. We perform comprehensive explorations for the optimal conduct of knowledge distillation, which may provide useful insights for the learning of VQ based ANN index. We also experimentally show that the labeled data is no longer a necessity for high-quality vector quantization, which indicates Distill-VQ's strong applicability in practice.

preprint2022arXiv

Effect of Stacking Order on the Electronic State of 1T-TaS$_2$

New theoretical proposals and experimental findings on transition metal dichalcogenide 1T-TaS$_2$ have revived interests in its possible Mott insulating state. We perform a comprehensive scanning tunneling microscopy and spectroscopy experiment on different single-step areas in pristine 1T-TaS$_2$. After accurately determining the relative displacement of Star-of-David super-lattices in two layers, we find different stacking orders can correspond to the similar large-gap spectrum on the upper terrace. When the measurement is performed away from the step edge, the large gap spectrum can always be maintained. The stacking order seems rarely disturb the large-gap spectrum in the ideal bulk material. We conclude that the large insulating gap is from the single-layer property, which is a correlation-induced Mott gap based on the single-band Hubbard model. Specific stacking orders can perturb the state and induce a small-gap or metallic spectrum for a limited area around the step edge, which we attribute to a surface and edge phenomenon. Our work provides more evidence about the surface electronic state and deepens our understanding of the Mott insulating state in 1T-TaS$_2$.

preprint2022arXiv

Enhanced versatility of table-top X-rays from van der Waals structures

Van der Waals (vdW) materials have attracted much interest for their myriad unique electronic, mechanical and thermal properties. In particular, they are promising candidates for monochromatic, table-top X-ray sources. This work reveals that the versatility of the table-top vdW X-ray source goes beyond what has been demonstrated so far. By introducing a tilt angle between the vdW structure and the incident electron beam, it is theoretically and experimentally shown that the accessible photon energy range is more than doubled. This allows for greater versatility in real-time tuning of the vdW X-ray source. Furthermore, this work shows that the accessible photon energy range is maximized by simultaneously controlling both the electron energy and the vdW structure tilt. These results should pave the way for highly tunable, compact X-ray sources, with potential applications including hyperspectral X-ray fluoroscopy and X-ray quantum optics.

preprint2022arXiv

Error-Aware Spatial Ensembles for Video Frame Interpolation

Video frame interpolation~(VFI) algorithms have improved considerably in recent years due to unprecedented progress in both data-driven algorithms and their implementations. Recent research has introduced advanced motion estimation or novel warping methods as the means to address challenging VFI scenarios. However, none of the published VFI works considers the spatially non-uniform characteristics of the interpolation error (IE). This work introduces such a solution. By closely examining the correlation between optical flow and IE, the paper proposes novel error prediction metrics that partition the middle frame into distinct regions corresponding to different IE levels. Building upon this IE-driven segmentation, and through the use of novel error-controlled loss functions, it introduces an ensemble of spatially adaptive interpolation units that progressively processes and integrates the segmented regions. This spatial ensemble results in an effective and computationally attractive VFI solution. Extensive experimentation on popular video interpolation benchmarks indicates that the proposed solution outperforms the current state-of-the-art (SOTA) in applications of current interest.

preprint2022arXiv

Extensibility of Hohenberg-Kohn Theorem to general quantum systems

Hohenberg-Kohn (HK) theorem is a cornerstone of modern electronic structure calculations. For interacting electrons, given that the internal part of the Hamiltonian ($\hat H_{int}$), containing the kinetic energy and Couloumb interaction of electrons, has a fixed form, the theorem states that when the electrons are subject to an external electrostatic field, the ground-state density can inversely determine the field, and thus the full Hamiltonian completely. For a general quantum system, a HK-type Hamiltonian in the form of $\hat H_{hk}\{g_i\}=\hat H_{int}+\sum_i g_i \hat O_i$ can always be defined, by grouping those terms with fixed or preknown coefficients into $\hat H_{int}$, and factorizing the remaining as superposition of a set of Hermitian operators $\{\hat O_i\}$. We ask whether the HK theorem can be extended, so that the ground-state expectation values of $\{\hat O_i\}$ as the generalized density can in principle be used as the fundamental variables determining all the properties of the system. We show that the question can be addressed by introducing the concept of generalized density correlation matrix (GDCM) defined with respect to the $\{\hat O_i\}$ operators. The invertibility of the GDCM represents a mathematically rigorous and practically useful criterion for the extension of HK theorem to be valid. We apply this criterion to several representative systems, including the quantum Ising dimer, the frustration-free systems, N-level quantum systems with fixed inter-level transition amplitude and tunable level energies, and a fermionic Hubbard chain with inhomogeneous on-site interactions. We suggest that for a finite-size system, finding an invertible GDCM under one single $\{g_i\}$ configuration is typically sufficient to establish the generic extensibility of the HK theorem in the entire parameter space.

preprint2022arXiv

Fighting Sybils in Airdrops

Airdrop is a crucial concept in tokenomics. Startups of decentralized applications (DApps) reward early supporters by airdropping newly issued tokens up to a certain amount as a free giveaway. This naturally induces greedy hackers, called Sybils, to create multiple accounts for more shares. Most airdrops have prerequisites for qualification, in which utilizing these DApps is unsurprisingly the principal. One particular characteristic of DApps is to implement users' interactions with them in the form of token transfer transactions or smart contract calling transactions on public blockchains. We argue that these individual transactions could reveal underlying signatures of their sending accounts. Specifically, accounts controlled by the same Sybil may exhibit some common behaviors. A careful analysis of Sybil's behaviors shows that accounts controlled by the same Sybil may produce similar DApp activities and regular token transfer patterns. We model the transactions as graphs by representing accounts as vertices and transactions as edges. When multiple accounts receive tokens from the same Sybil to conduct interactions with DApps, we inspect the graphs for these activities and patterns to detect suspicious accounts. We demonstrate the effectiveness of the proposed method in a recent airdrop by presenting the suspicious accounts controlled by Sybils. All the detected accounts exhibit similar interaction activities and regular transfer patterns.

preprint2022arXiv

GateFormer: Speeding Up News Feed Recommendation with Input Gated Transformers

News feed recommendation is an important web service. In recent years, pre-trained language models (PLMs) have been intensively applied to improve the recommendation quality. However, the utilization of these deep models is limited in many aspects, such as lack of explainability and being incompatible with the existing inverted index systems. Above all, the PLMs based recommenders are inefficient, as the encoding of user-side information will take huge computation costs. Although the computation can be accelerated with efficient transformers or distilled PLMs, it is still not enough to make timely recommendations for the active users, who are associated with super long news browsing histories. In this work, we tackle the efficient news recommendation problem from a distinctive perspective. Instead of relying on the entire input (i.e., the collection of news articles a user ever browsed), we argue that the user's interest can be fully captured merely with those representative keywords. Motivated by this, we propose GateFormer, where the input data is gated before feeding into transformers. The gating module is made personalized, lightweight and end-to-end learnable, such that it may perform accurate and efficient filtering of informative user input. GateFormer achieves highly impressive performances in experiments, where it notably outperforms the existing acceleration approaches in both accuracy and efficiency. We also surprisingly find that even with over 10-fold compression of the original input, GateFormer is still able to maintain on-par performances with the SOTA methods.

preprint2022arXiv

Homointerface planar Josephson junction based on inverse proximity effect

The quality of a superconductor-normal metal-superconductor Josephson junction (JJ) depends crucially on the transparency of the superconductor-normal metal (S/N) interface. We demonstrate a technique for fabricating planar JJs with perfect S/N interfaces. The technique utilizes a strong inverse proximity effect discovered in Al/V$_5$S$_8$ bilayers, by which the Al layer is driven into the resistive state. The highly transparent S/N homointerface and the peculiar normal metal enable the flow of Josephson supercurrent across a 2.9 $μ$m long weak link. Moreover, our JJ exhibits a giant critical current and a large product of the critical current and the normal state resistance.

preprint2022arXiv

Layer Imbalance Aware Multiplex Network Embedding

Multiplex network embedding is an effective technique to jointly learn the low-dimensional representations of nodes across network layers. However, the number of edges among layers may vary significantly. This data imbalance will lead to performance degradation especially on the sparse layer due to learning bias and the adverse effects of irrelevant or conflicting data in other layers. In this paper, a Layer Imbalance Aware Multiplex Network Embedding (LIAMNE) method is proposed where the edges in auxiliary layers are under-sampled based on the node similarity in the embedding space of the target layer to achieve balanced edge distribution and to minimize noisy relations that are less relevant to the target layer. Real-world datasets with different degrees of layer imbalance are used for experimentation. The results demonstrate that LIAMNE significantly outperforms several state-of-the-art multiplex network embedding methods in link prediction on the target layer. Meantime, the comprehensive representation of the entire multiplex network is not compromised by the sampling method as evaluated by its performance on the node classification task.

preprint2022arXiv

Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?

Recent years have witnessed great progress on applying pre-trained language models, e.g., BERT, to information retrieval (IR) tasks. Hyperlinks, which are commonly used in Web pages, have been leveraged for designing pre-training objectives. For example, anchor texts of the hyperlinks have been used for simulating queries, thus constructing tremendous query-document pairs for pre-training. However, as a bridge across two web pages, the potential of hyperlinks has not been fully explored. In this work, we focus on modeling the relationship between two documents that are connected by hyperlinks and designing a new pre-training objective for ad-hoc retrieval. Specifically, we categorize the relationships between documents into four groups: no link, unidirectional link, symmetric link, and the most relevant symmetric link. By comparing two documents sampled from adjacent groups, the model can gradually improve its capability of capturing matching signals. We propose a progressive hyperlink predication ({PHP}) framework to explore the utilization of hyperlinks in pre-training. Experimental results on two large-scale ad-hoc retrieval datasets and six question-answering datasets demonstrate its superiority over existing pre-training methods.

preprint2022arXiv

Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval

Ad-hoc search calls for the selection of appropriate answers from a massive-scale corpus. Nowadays, the embedding-based retrieval (EBR) becomes a promising solution, where deep learning based document representation and ANN search techniques are allied to handle this task. However, a major challenge is that the ANN index can be too large to fit into memory, given the considerable size of answer corpus. In this work, we tackle this problem with Bi-Granular Document Representation, where the lightweight sparse embeddings are indexed and standby in memory for coarse-grained candidate search, and the heavyweight dense embeddings are hosted in disk for fine-grained post verification. For the best of retrieval accuracy, a Progressive Optimization framework is designed. The sparse embeddings are learned ahead for high-quality search of candidates. Conditioned on the candidate distribution induced by the sparse embeddings, the dense embeddings are continuously learned to optimize the discrimination of ground-truth from the shortlisted candidates. Besides, two techniques: the contrastive quantization and the locality-centric sampling are introduced for the learning of sparse and dense embeddings, which substantially contribute to their performances. Thanks to the above features, our method effectively handles massive-scale EBR with strong advantages in accuracy: with up to +4.3% recall gain on million-scale corpus, and up to +17.5% recall gain on billion-scale corpus. Besides, Our method is applied to a major sponsored search platform with substantial gains on revenue (+1.95%), Recall (+1.01%) and CTR (+0.49%). Our code is available at https://github.com/microsoft/BiDR.

preprint2022arXiv

Reconcile the Bulk Metallic and Surface Insulating state in 1T-TaSe$_2$

The transition metal dichalcogenides 1T-TaS$_2$ and 1T-TaSe$_2$ have been extensively studied for the complicated correlated electronic properties. The origin of different surface electronic states remains controversial. We apply scanning tunneling microscopy and spectroscopy to restudy the surface electronic state of bulk 1T-TaSe$_2$. Both insulating and metallic states are identified in different areas of the same sample. The insulating state is similar to that in 1T-TaS$_2$, concerning both the dI/dV spectrum and the orbital texture. With further investigations in single-step areas, the discrepancy of electronic states is found to be associated with different stacking orders. The insulating state is most possibly a single-layer property, modulated to a metallic state in some particular stacking orders. Both the metallic and large-gap insulating spectra, together with their corresponding stacking orders, are dominant in 1T-TaSe$_2$. The connected metallic areas lead to the metallic transport behavior. We then reconcile the bulk metallic and surface insulating state in 1T-TaSe$_2$. The rich phenomena in 1T-TaSe$_2$ deepen our understanding of the correlated electronic state in bulk 1T-TaSe$_2$ and 1T-TaS$_2$.

preprint2022arXiv

Reinforcement Routing on Proximity Graph for Efficient Recommendation

We focus on Maximum Inner Product Search (MIPS), which is an essential problem in many machine learning communities. Given a query, MIPS finds the most similar items with the maximum inner products. Methods for Nearest Neighbor Search (NNS) which is usually defined on metric space don't exhibit the satisfactory performance for MIPS problem since inner product is a non-metric function. However, inner products exhibit many good properties compared with metric functions, such as avoiding vanishing and exploding gradients. As a result, inner product is widely used in many recommendation systems, which makes efficient Maximum Inner Product Search a key for speeding up many recommendation systems. Graph based methods for NNS problem show the superiorities compared with other class methods. Each data point of the database is mapped to a node of the proximity graph. Nearest neighbor search in the database can be converted to route on the proximity graph to find the nearest neighbor for the query. This technique can be used to solve MIPS problem. Instead of searching the nearest neighbor for the query, we search the item with maximum inner product with query on the proximity graph. In this paper, we propose a reinforcement model to train an agent to search on the proximity graph automatically for MIPS problem if we lack the ground truths of training queries. If we know the ground truths of some training queries, our model can also utilize these ground truths by imitation learning to improve the agent's search ability. By experiments, we can see that our proposed mode which combines reinforcement learning with imitation learning shows the superiorities over the state-of-the-art methods

preprint2022arXiv

Strong Neel ordering and luminescence correlation in a two-dimensional antiferromagnet

Magneto-optical effect has been widely used in light modulation, optical sensing and information storage. Recently discovered two-dimensional (2D) van der Waals layered magnets are considered as promising platforms for investigating novel magneto-optical phenomena and devices, due to the long-range magnetic ordering down to atomically-thin thickness, rich species and tunable properties. However, majority 2D antiferromagnets suffer from low luminescence efficiency which hinders their magneto-optical investigations and applications. Here, we uncover strong light-magnetic ordering interactions in 2D antiferromagnetic MnPS3 utilizing a newly-emerged near-infrared photoluminescence (PL) mode far below its intrinsic bandgap. This ingap PL mode shows strong correlation with the Neel ordering and persists down to monolayer thickness. Combining the DFT, STEM and XPS, we illustrate the origin of the PL mode and its correlation with Neel ordering, which can be attributed to the oxygen ion-mediated states. Moreover, the PL strength can be further tuned and enhanced using ultraviolet-ozone treatment. Our studies offer an effective approach to investigate light-magnetic ordering interactions in 2D antiferromagnetic semiconductors.

preprint2022arXiv

Towards Generalizable Semantic Product Search by Text Similarity Pre-training on Search Click Logs

Recently, semantic search has been successfully applied to e-commerce product search and the learned semantic space(s) for query and product encoding are expected to generalize to unseen queries or products. Yet, whether generalization can conveniently emerge has not been thoroughly studied in the domain thus far. In this paper, we examine several general-domain and domain-specific pre-trained Roberta variants and discover that general-domain fine-tuning does not help generalization, which aligns with the discovery of prior art. Proper domain-specific fine-tuning with clickstream data can lead to better model generalization, based on a bucketed analysis of a publicly available manual annotated query-product pair da

preprint2022arXiv

Uni-Retriever: Towards Learning The Unified Embedding Based Retriever in Bing Sponsored Search

Embedding based retrieval (EBR) is a fundamental building block in many web applications. However, EBR in sponsored search is distinguished from other generic scenarios and technically challenging due to the need of serving multiple retrieval purposes: firstly, it has to retrieve high-relevance ads, which may exactly serve user's search intent; secondly, it needs to retrieve high-CTR ads so as to maximize the overall user clicks. In this paper, we present a novel representation learning framework Uni-Retriever developed for Bing Search, which unifies two different training modes knowledge distillation and contrastive learning to realize both required objectives. On one hand, the capability of making high-relevance retrieval is established by distilling knowledge from the ``relevance teacher model''. On the other hand, the capability of making high-CTR retrieval is optimized by learning to discriminate user's clicked ads from the entire corpus. The two training modes are jointly performed as a multi-objective learning process, such that the ads of high relevance and CTR can be favored by the generated embeddings. Besides the learning strategy, we also elaborate our solution for EBR serving pipeline built upon the substantially optimized DiskANN, where massive-scale EBR can be performed with competitive time and memory efficiency, and accomplished in high-quality. We make comprehensive offline and online experiments to evaluate the proposed techniques, whose findings may provide useful insights for the future development of EBR systems. Uni-Retriever has been mainstreamed as the major retrieval path in Bing's production thanks to the notable improvements on the representation and EBR serving quality.

preprint2021arXiv

All at Once: Temporally Adaptive Multi-Frame Interpolation with Advanced Motion Modeling

Recent advances in high refresh rate displays as well as the increased interest in high rate of slow motion and frame up-conversion fuel the demand for efficient and cost-effective multi-frame video interpolation solutions. To that regard, inserting multiple frames between consecutive video frames are of paramount importance for the consumer electronics industry. State-of-the-art methods are iterative solutions interpolating one frame at the time. They introduce temporal inconsistencies and clearly noticeable visual artifacts. Departing from the state-of-the-art, this work introduces a true multi-frame interpolator. It utilizes a pyramidal style network in the temporal domain to complete the multi-frame interpolation task in one-shot. A novel flow estimation procedure using a relaxed loss function, and an advanced, cubic-based, motion model is also used to further boost interpolation accuracy when complex motion segments are encountered. Results on the Adobe240 dataset show that the proposed method generates visually pleasing, temporally consistent frames, outperforms the current best off-the-shelf method by 1.57db in PSNR with 8 times smaller model and 7.7 times faster. The proposed method can be easily extended to interpolate a large number of new frames while remaining efficient because of the one-shot mechanism.

preprint2021arXiv

BALM: Bundle Adjustment for Lidar Mapping

A local Bundle Adjustment (BA) on a sliding window of keyframes has been widely used in visual SLAM and proved to be very effective in lowering the drift. But in lidar SLAM, BA method is hardly used because the sparse feature points (e.g., edge and plane) make the exact point matching impossible. In this paper, we formulate the lidar BA as minimizing the distance from a feature point to its matched edge or plane. Unlike the visual SLAM (and prior plane adjustment method in lidar SLAM) where the feature has to be co-determined along with the pose, we show that the feature can be analytically solved and removed from the BA, the resultant BA is only dependent on the scan poses. This greatly reduces the optimization scale and allows large-scale dense plane and edge features to be used. To speedup the optimization, we derive the analytical derivatives of the cost function, up to second order, in closed form. Moreover, we propose a novel adaptive voxelization method to search feature correspondence efficiently. The proposed formulations are incorporated into a LOAM back-end for map refinement. Results show that, although as a back-end, the local BA can be solved very efficiently, even in real-time at 10Hz when optimizing 20 scans of point-cloud. The local BA also considerably lowers the LOAM drift. Our implementation of the BA optimization and LOAM are open-sourced to benefit the community.

preprint2021arXiv

Dynamic Graph Collaborative Filtering

Dynamic recommendation is essential for modern recommender systems to provide real-time predictions based on sequential data. In real-world scenarios, the popularity of items and interests of users change over time. Based on this assumption, many previous works focus on interaction sequences and learn evolutionary embeddings of users and items. However, we argue that sequence-based models are not able to capture collaborative information among users and items directly. Here we propose Dynamic Graph Collaborative Filtering (DGCF), a novel framework leveraging dynamic graphs to capture collaborative and sequential relations of both items and users at the same time. We propose three update mechanisms: zero-order 'inheritance', first-order 'propagation', and second-order 'aggregation', to represent the impact on a user or item when a new interaction occurs. Based on them, we update related user and item embeddings simultaneously when interactions occur in turn, and then use the latest embeddings to make recommendations. Extensive experiments conducted on three public datasets show that DGCF significantly outperforms the state-of-the-art dynamic recommendation methods up to 30. Our approach achieves higher performance when the dataset contains less action repetition, indicating the effectiveness of integrating dynamic collaborative information.

preprint2021arXiv

Heterogeneous Similarity Graph Neural Network on Electronic Health Records

Mining Electronic Health Records (EHRs) becomes a promising topic because of the rich information they contain. By learning from EHRs, machine learning models can be built to help human experts to make medical decisions and thus improve healthcare quality. Recently, many models based on sequential or graph models are proposed to achieve this goal. EHRs contain multiple entities and relations and can be viewed as a heterogeneous graph. However, previous studies ignore the heterogeneity in EHRs. On the other hand, current heterogeneous graph neural networks cannot be simply used on an EHR graph because of the existence of hub nodes in it. To address this issue, we propose Heterogeneous Similarity Graph Neural Network (HSGNN) analyze EHRs with a novel heterogeneous GNN. Our framework consists of two parts: one is a preprocessing method and the other is an end-to-end GNN. The preprocessing method normalizes edges and splits the EHR graph into multiple homogeneous graphs while each homogeneous graph contains partial information of the original EHR graph. The GNN takes all homogeneous graphs as input and fuses all of them into one graph to make a prediction. Experimental results show that HSGNN outperforms other baselines in the diagnosis prediction task.

preprint2021arXiv

Robust W-GAN-Based Estimation Under Wasserstein Contamination

Robust estimation is an important problem in statistics which aims at providing a reasonable estimator when the data-generating distribution lies within an appropriately defined ball around an uncontaminated distribution. Although minimax rates of estimation have been established in recent years, many existing robust estimators with provably optimal convergence rates are also computationally intractable. In this paper, we study several estimation problems under a Wasserstein contamination model and present computationally tractable estimators motivated by generative adversarial networks (GANs). Specifically, we analyze properties of Wasserstein GAN-based estimators for location estimation, covariance matrix estimation, and linear regression and show that our proposed estimators are minimax optimal in many scenarios. Finally, we present numerical results which demonstrate the effectiveness of our estimators.

preprint2021arXiv

Training Large-Scale News Recommenders with Pretrained Language Models in the Loop

News recommendation calls for deep insights of news articles' underlying semantics. Therefore, pretrained language models (PLMs), like BERT and RoBERTa, may substantially contribute to the recommendation quality. However, it's extremely challenging to have news recommenders trained together with such big models: the learning of news recommenders requires intensive news encoding operations, whose cost is prohibitive if PLMs are used as the news encoder. In this paper, we propose a novel framework, {SpeedyFeed}, which efficiently trains PLMs-based news recommenders of superior quality. SpeedyFeed is highlighted for its light-weighted encoding pipeline, which gives rise to three major advantages. Firstly, it makes the intermedia results fully reusable for the training workflow, which removes most of the repetitive but redundant encoding operations. Secondly, it improves the data efficiency of the training workflow, where non-informative data can be eliminated from encoding. Thirdly, it further saves the cost by leveraging simplified news encoding and compact news representation. Extensive experiments show that SpeedyFeed leads to more than 100$\times$ acceleration of the training process, which enables big models to be trained efficiently and effectively over massive user data. The well-trained PLMs-based model from SpeedyFeed demonstrates highly competitive performance, where it outperforms the state-of-the-art news recommenders with significant margins. SpeedyFeed is also a model-agnostic framework, which is potentially applicable to a wide spectrum of content-based recommender systems; therefore, the whole framework is open-sourced to facilitate the progress in related areas.

preprint2021arXiv

Understanding the flat band in 1T-TaS2 using a rotated basis

Electronic flat bands serve as a unique platform to achieve strongly-correlated phases. The emergence of a flat band around the Fermi level in 1T-TaS$_2$ in accompany with the development of a $\sqrt{13}\times\sqrt{13}$ charge density wave (CDW) superlattice has long been noticed experimentally, but a transparent theoretical understanding remains elusive. We show that without CDW, the primary feature of the $1\times1$ bands can be fitted by a simple trigonometric function, and physically understood by choosing a rotated $\tilde{t}_{2g}$ basis with the principle axes aligning to the tilted TaS$_6$ octahedron. Using this basis, we trace the band evolution in the $\sqrt{13}\times\sqrt{13}$ superlattice by progressively including different CDW effects. We point out that CDW strongly rehybridizes the three $\tilde{t}_{2g}$ orbitals, which leads to the formation of a well-localized molecular orbital and spawns the flat band.

preprint2020arXiv

Efficient Growth and Characterization of One Dimensional Transition Metal Tellurides Inside Carbon Nanotubes

Atomically thin one dimensional (1D) van der Waals wires of transition metal monochalocogenides (TMMs) have been anticipated as promising building blocks for integrated nanoelectronics. While reliable production of TMM nanowires has eluded scientists over the past few decades, we finally demonstrated a bottom up fabrication of MoTe nanowires inside carbon nanotubes (CNTs). Still, the current synthesis method is based on vacuum annealing of reactive MoTe2, and limits access to a variety of TMMs. Here we report an expanded framework for high yield synthesis of the 1D tellurides including WTe, an unprecedented family of TMMs. Experimental and theoretical analyses revealed that the choice of suitable metal oxides as a precursor provides useful yield for their characterization. These TMM nanowires exhibit a significant optical absorption in the visible light region. More important, electronic properties of CNTs can be tuned by encapsulating different TMM nanowires.

preprint2020arXiv

Electric field manipulation enhanced by strong spin-orbit coupling: promoting rare-earth ions as qubits

Quantum information processing based on magnetic ions are considered potential candidates for applications because they can be modified and scaled up by a variety of chemical methods. For these systems to achieve individual spin addressability and high energy efficiency, we exploited the electric field as a tool to manipulate their quantum behaviours, functioning via spin-orbit coupling. A Ce:YAG single crystal was employed due to that rare-earth ions have strong spin-orbit coupling and with considerations regarding the dynamics and the symmetry requirements. The Stark effect of the Ce3+ ion was observed and measured. When demonstrated as a quantum phase gate, the electric field manipulation exhibited high efficiency which allowed up to 57 π/2 operations before decoherence with optimized field directions. It was also utilized to carry out quantum bang-bang control, as a method of dynamic decoupling, and the refined Deutsch-Jozsa algorithm. Our experiments highlighted rare-earth ions as potentially applicable qubits since they offer enhanced spin-electric coupling which enables high-efficiency quantum manipulation.

preprint2020arXiv

Electronic nematicity in FeSe: a first-principles perspective

Electronic nematicity is an important order in most iron-based superconductors, and FeSe represents a unique example, in which nematicity disentangles from spin ordering. It is commonly perceived that this property arises from strong electronic correlation, which can not be properly captured by density functional theory (DFT). Here, we show that by properly considering the paramagnetic condition and carefully searching the energy landscape with symmetry-preconditioned wavefunctions, two nematic solutions stand out at either the DFT+$U$ or hybrid functional level, both of which are lower in energy than the symmetric solution. The ground-state band structure and Fermi surface can be well compared with the recent experimental results. Symmetry analysis assigns these two new solutions to the $B_{1g}$ and $E_u$ irreducible representations of the D$_{4h}$ point group. While the $B_{1g}$ Ising nematicity has been widely discussed in the context of vestigial stripe antiferromagnetic order, the two-component $E_u$ vector nematicity is beyond previous theoretical discussion. Distinct from the $B_{1g}$ order, the $E_u$ order features mixing of the Fe $d$-orbitals and inversion symmetry breaking, which lead to striking experimental consequences, e.g. missing of an electron pocket.

preprint2020arXiv

Giant renormalization of correlation strength in 1T-TaS2 by lattice vibration

The lattice thermodynamics of a 1T-TaS2 layer, e.g. the spontaneous formation of a sqrt13*sqrt13 commensurate charge density wave (CCDW) and vibrations around the equilibrium position, is calculated by ab initio molecular dynamics. Based on that, we examine how the ground-state electronic structure is renormalized by lattice temperature. We show that the band gap within the density functional theory plus onsite-U correction shrinks by half when the temperature raises from 0 K to 200 K. The gap size reduction is one order of magnitude larger than the temperature variation in energy. This giant temperature dependence is closely related to the CCDW-triggered Mottness in 1T-TaS2, and is expected to result in unconventional thermodynamic properties.

preprint2020arXiv

Low-cost Retina-like Robotic Lidars Based on Incommensurable Scanning

High performance lidars are essential in autonomous robots such as self-driving cars, automated ground vehicles and intelligent machines. Traditional mechanical scanning lidars offer superior performance in autonomous vehicles, but the potential mass application is limited by the inherent manufacturing difficulty. We propose a robotic lidar sensor based on incommensurable scanning that allows straightforward mass production and adoption in autonomous robots. Some unique features are additionally permitted by this incommensurable scanning. Similar to the fovea in human retina, this lidar features a peaked central angular density, enabling in applications that prefers eye-like attention. The incommensurable scanning method of this lidar could also provide a much higher resolution than conventional lidars which is beneficial in robotic applications such as sensor calibration. Examples making use of these advantageous features are demonstrated.

preprint2020arXiv

Many-body effect in optical properties of monolayer molybdenum diselenide

Excitons in monolayer transition metal dichalcogenide (TMD) provide a paradigm of composite Boson in 2D system. This letter reports a photoluminescence and reflectance study of excitons in monolayer molybdenum diselenide (MoSe2) with electrostatic gating. We observe the repulsive and attractive Fermi polaron modes of the band edge exciton, its excited state and the spin-off excitons. Our data validate the polaronic behavior of excitonic states in the system quantitatively where the simple three-particle trion model is insufficient to explain.

preprint2020arXiv

Quantum phase interference in a fullerene-based molecular qutrit

High spin magnetic molecules are promising candidates for quantum information processing because they intrinsically have multiple sublevels for information storage and computational operations. However, due to their susceptibility to the environment and limitation from the selection rule, the arbitrary control of the quantum state of a multilevel system on a molecular and electron spin basis has not been realized. Here we exploit the photoexcited triplet of C70 as a molecular electron spin qutrit. After the system was initialized by photoexcitation, we prepared it into representative three-level superposition states characteristic of the qutrit, measured their density matrices, and showed the interference of the quantum phases in the superposition. The interference pattern is further interpreted as a map of evolution through time under different conditions.

preprint2020arXiv

Renormalization of the Mott gap by lattice entropy: The case of 1T-TaS2

In many transition-metal oxides and dichalcogenides, the electronic and lattice degrees of freedom are strongly coupled, giving rise to remarkable phenomena, such as metal-insulator transition (MIT) and charge-density wave (CDW) order. We study this interplay by tracing the instant electronic structure under ab initio molecular dynamics. Applying this method to a 1T-TaS2 layer, we show that the CDW-triggered Mott gap undergoes a continuous reduction as the lattice temperature raises, despite a nearly constant CDW amplitude. Before the CDW order undergoes a sharp first-order transition around the room temperature, the dynamical CDW fluctuation already shrinks the Mott gap size by half. The gap size reduction is one order of magnitude larger than the lattice temperature variation. Our calculation not only provides an important clue to understand the thermodynamics behavior in 1T-TaS2, but also demonstrates a general approach to quantify the lattice entropy effect in MIT.

preprint2019arXiv

In-plane ordering of O vacancies in a high-Tc cuprate superconductor with compressed Cu-O octahedrons: a first-principles cluster expansion study

A recently discovered high-Tc cuprate superconductor Ba2CuO$_{4-δ}$ exhibits exceptional Jahn-Teller distortion, wherein the CuO6 octahedrons are compressed along the c axis. As a consequence, the O vacancies prefer to reside in the CuO2 plane, but the exact structure is not known. By combining first-principles total energy calculation with the automated structure inversion method, the effective cluster interactions of O vacancies are mapped out. Around $δ$=0.8, where the 73K superconductivity was observed experimentally, we predict that the ordered O vacancies slice the CuO2 plane into not only 1D chains and but also two-leg ladders. A Monte Carlo simulation is performed based on the effective cluster interaction model, showing that such an ordering pattern is stable up to ~900 K. Our results put forth a concrete structural basis to discuss the underlying superconducting mechanism.

preprint2019arXiv

Phase evolution and superconductivity enhancement in Se-substituted MoTe$_2$ thin films

The strong spin$-$orbit coupling (SOC) and numerous crystal phases in few$-$layer transition metal dichalcogenides (TMDCs) MX$_2$ (M$=$W, Mo, and X$=$Te, Se, S) has led to a variety of novel physics, such as Ising superconductivity and quantum spin Hall effect realized in monolayer 2H$-$ and Td$-$MX$_2$, respectively. Consecutive tailoring of the MX$_2$ structure from 2H to Td phase may realize the long$-$sought topological superconductivity in one material system by incorporating superconductivity and quantum spin Hall effect together. In this work, by combing Raman spectrum, X-ray photoelectron spectrum (XPS), scanning transmission electron microscopy imaging (STEM) as well as electrical transport measurements, we demonstrate that a consecutively structural phase transitions from Td to 1T$'$ to 2H polytype can be realized as the Se-substitution concentration increases. More importantly, the Se$-$substitution has been found to notably enhance the superconductivity of the MoTe$_2$ thin film, which is interpreted as the introduction of the two$-$band superconductivity. The chemical constituent induced phase transition offers a new strategy to study the s$_{+-}$ superconductivity and the possible topological superconductivity as well as to develop phase$-$sensitive devices based on MX$_2$ materials.

preprint2019arXiv

Phase-controllable growth of ultrathin 2D magnetic FeTe crystals

Two-dimensional (2D) magnets with intrinsic ferromagnetic/antiferromagnetic (FM/AFM) ordering are highly desirable for future spintronics devices. However, the synthesis of 2D magnetic crystals, especially the direct growth on SiO2/Si substrate, is just in its infancy. Here, we report a chemical vapor deposition (CVD)-based rational growth approach for the synthesis of ultrathin FeTe crystals with controlled structural and magnetic phases. By precisely optimizing the growth temperature (Tgrowth), FeTe nanoplates with either layered tetragonal or non-layered hexagonal phase can be controlled with high-quality. The two controllable phases lead to square and triangular morphologies with a thickness down to 3.6 and 2.8 nm, respectively. More importantly, transport measurements reveal that tetragonal FeTe is antiferromagnetic with a Neel temperature (TN) about 71.8 K, while hexagonal FeTe is ferromagnetic with a Curie temperature (TC) around 220 K. Theoretical calculations indicate that the ferromagnetic order in hexagonal FeTe is originated from a concomitant lattice distortion and the spin-lattice coupling. This study represents a major step forward in the CVD growth of 2D magnetic materials on SiO2/Si substrates and highlights on their potential applications in the future spintronic devices.

preprint2017arXiv

Triangulated Surface Denoising using High Order Regularization with Dynamic Weights

Recovering high quality surfaces from noisy triangulated surfaces is a fundamental important problem in geometry processing. Sharp features including edges and corners can not be well preserved in most existing denoising methods except the recent total variation (TV) and $\ell_0$ regularization methods. However, these two methods have suffered producing staircase artifacts in smooth regions. In this paper, we first introduce a second order regularization method for restoring a surface normal vector field, and then propose a new vertex updating scheme to recover the desired surface according to the restored surface normal field. The proposed model can preserve sharp features and simultaneously suppress the staircase effects in smooth regions which overcomes the drawback of the first order models. In addition, the new vertex updating scheme can prevent ambiguities introduced in existing vertex updating methods. Numerically, the proposed high order model is solved by the augmented Lagrangian method with a dynamic weighting strategy. Intensive numerical experiments on a variety of surfaces demonstrate the superiority of our method by visually and quantitatively.