Source author record

Han Li

Han Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

29works

27topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Action-Aware Generative Sequence Modeling for Short Video Recommendation

With the rapid development of the Internet, users have increasingly higher expectations for the recommendation accuracy of online content consumption platforms. However, short videos often contain diverse segments, and users may not hold the same attitude toward all of them. Traditional binary-classification recommendation models, which treat a video as a single holistic entity, face limitations in accurately capturing such nuanced preferences. Considering that user consumption is a temporal process, this paper demonstrates that the timing of user actions can represent diverse intentions through statistical analysis and examination of action patterns. Based on this insight, we propose a novel modeling paradigm: Action-Aware Generative Sequence Network (A2Gen), which refines user actions along the temporal dimension and chains them into sequences for unified processing and prediction. First, we introduce the Context-aware Attention Module (CAM) to model action sequences enriched with item-specific contextual features. Building upon this, we develop the Hierarchical Sequence Encoder (HSE) to learn temporal action patterns from users' historical actions. Finally, through leveraging CAM, we design a module for action sequence generation: the Action-seq Autoregressive Generator (AAG). Extensive offline experiments on the Kuaishou's dataset and the Tmall public dataset demonstrate the superiority of our proposed model. Furthermore, through large-scale online A/B testing deployed on Kuaishou's platform, our model achieves significant improvements over baseline methods in multi-task prediction by leveraging sequential information. Specifically, it yields increases of 0.34% in user watch time, 8.1% in interaction rate, and 0.162% in overall user retention (LifeTime-7), leading to successful deployment across all traffic, serving over 400 million users every day.

preprint2024arXiv

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

In recent years, text-to-video retrieval methods based on CLIP have experienced rapid development. The primary direction of evolution is to exploit the much wider gamut of visual and textual cues to achieve alignment. Concretely, those methods with impressive performance often design a heavy fusion block for sentence (words)-video (frames) interaction, regardless of the prohibitive computation complexity. Nevertheless, these approaches are not optimal in terms of feature utilization and retrieval efficiency. To address this issue, we adopt multi-granularity visual feature learning, ensuring the model's comprehensiveness in capturing visual content features spanning from abstract to detailed levels during the training phase. To better leverage the multi-granularity features, we devise a two-stage retrieval architecture in the retrieval phase. This solution ingeniously balances the coarse and fine granularity of retrieval content. Moreover, it also strikes a harmonious equilibrium between retrieval effectiveness and efficiency. Specifically, in training phase, we design a parameter-free text-gated interaction block (TIB) for fine-grained video representation learning and embed an extra Pearson Constraint to optimize cross-modal representation learning. In retrieval phase, we use coarse-grained video representations for fast recall of top-k candidates, which are then reranked by fine-grained video representations. Extensive experiments on four benchmarks demonstrate the efficiency and effectiveness. Notably, our method achieves comparable performance with the current state-of-the-art methods while being nearly 50 times faster.

preprint2023arXiv

The Security Analysis of Continuous-Variable Quantum Key Distribution under Limited Eavesdropping with Practical Fiber

Research on optimal eavesdropping models under practical conditions will help to evaluate realistic risk when employing quantum key distribution (QKD) system for secure information transmission. Intuitively, fiber loss will lead to the optical energy leaking to the environment, rather than harvested by the eavesdropper, which also limits the eavesdropping ability while improving the QKD system performance in practical use. However, defining the optimal eavesdropping model in the presence of lossy fiber is difficult because the channel is beyond the control of legitimate partners and the leaked signal is undetectable. Here we investigate how the fiber loss influences the eavesdropping ability based on a teleportation-based collective attack model which requires two distant stations and a shared entanglement source. We find that if the distributed entanglement is limited due to the practical loss, the optimal attack occurs when the two teleportation stations are merged to one and placed close to the transmitter site, which performs similar to the entangling-cloning attack but with a reduced wiretapping ratio. Assuming Eve uses the best available hollow-core fiber, the secret key rate in the practical environment can be 20%~40% higher than that under ideal eavesdropping. While if the entanglement distillation technology is mature enough to provide high quality of distributed entanglement, the two teleportation stations should be distantly separated for better eavesdropping performance, where the eavesdropping can even approach the optimal collective attack. Under the current level of entanglement purification technology, the unavoidable fiber loss can still greatly limit the eavesdropping ability as well as enhance the secret key rate and transmission distance of the realistic system, which promotes the development of QKD systems in practical application scenarios.

preprint2022arXiv

Absolute Quantification of sp$^{3}$ Defects in Semiconducting Single-Wall Carbon Nanotubes by Raman Spectroscopy

The functionalization of semiconducting single-wall carbon nanotubes (SWCNTs) with luminescent sp$^{3}$ defects creates red-shifted emission features in the near-infrared and boosts their photoluminescence quantum yields (PLQYs). While multiple synthetic routes for the selective introduction of sp$^{3}$ defects have been developed, a convenient metric to precisely quantify the number of defects on a SWCNT lattice is not available. Here, we present a direct and simple quantification protocol based on a linear correlation of the integrated Raman D/G$^{+}$ signal ratios and defect densities as extracted from PLQY measurements. Corroborated by a statistical analysis of single-nanotube emission spectra at cryogenic temperature, this method enables the quantitative evaluation of sp$^{3}$ defect densities in (6,5) SWCNTs with an error of $\pm$ 3 defects per micrometer and the determination of oscillator strengths for different defect types. The developed protocol requires only standard Raman spectroscopy and is independent of the defect configuration, dispersion solvent and nanotube length.

preprint2022arXiv

DNN Training Acceleration via Exploring GPGPU Friendly Sparsity

The training phases of Deep neural network~(DNN) consumes enormous processing time and energy. Compression techniques utilizing the sparsity of DNNs can effectively accelerate the inference phase of DNNs. However, it is hardly used in the training phase because the training phase involves dense matrix-multiplication using General-Purpose Computation on Graphics Processors (GPGPU), which endorse the regular and structural data layout. In this paper, we first propose the Approximate Random Dropout that replaces the conventional random dropout of neurons and synapses with a regular and online generated row-based or tile-based dropout patterns to eliminate the unnecessary computation and data access for the multilayer perceptron~(MLP) and long short-term memory~(LSTM). We then develop a SGD-based Search Algorithm that produces the distribution of row-based or tile-based dropout patterns to compensate for the potential accuracy loss. Moreover, aiming at the convolution neural network~(CNN) training acceleration, we first explore the importance and sensitivity of input feature maps; and then propose the sensitivity-aware dropout method to dynamically drop the input feature maps based on their sensitivity so as to achieve greater forward and backward training acceleration while reserving better NN accuracy. To facilitate DNN programming, we build a DNN training computation framework that unifies the proposed techniques in the software stack. As a result, the GPGPU only needs to support the basic operator -- matrix multiplication and can achieve significant performance improvement regardless of DNN model.

preprint2022arXiv

KPGT: Knowledge-Guided Pre-training of Graph Transformer for Molecular Property Prediction

Designing accurate deep learning models for molecular property prediction plays an increasingly essential role in drug and material discovery. Recently, due to the scarcity of labeled molecules, self-supervised learning methods for learning generalizable and transferable representations of molecular graphs have attracted lots of attention. In this paper, we argue that there exist two major issues hindering current self-supervised learning methods from obtaining desired performance on molecular property prediction, that is, the ill-defined pre-training tasks and the limited model capacity. To this end, we introduce Knowledge-guided Pre-training of Graph Transformer (KPGT), a novel self-supervised learning framework for molecular graph representation learning, to alleviate the aforementioned issues and improve the performance on the downstream molecular property prediction tasks. More specifically, we first introduce a high-capacity model, named Line Graph Transformer (LiGhT), which emphasizes the importance of chemical bonds and is mainly designed to model the structural information of molecular graphs. Then, a knowledge-guided pre-training strategy is proposed to exploit the additional knowledge of molecules to guide the model to capture the abundant structural and semantic information from large-scale unlabeled molecular graphs. Extensive computational tests demonstrated that KPGT can offer superior performance over current state-of-the-art methods on several molecular property prediction tasks.

preprint2022arXiv

Multi-node Acceleration for Large-scale GCNs

Limited by the memory capacity and compute power, singe-node graph convolutional neural network (GCN) accelerators cannot complete the execution of GCNs within a reasonable amount of time, due to the explosive size of graphs nowadays. Thus, large-scale GCNs call for a multi-node acceleration system (MultiAccSys) like TPU-Pod for large-scale neural networks. In this work, we aim to scale up single-node GCN accelerators to accelerate GCNs on large-scale graphs. We first identify the communication pattern and challenges of multi-node acceleration for GCNs on large-scale graphs. We observe that (1) coarse-grained communication patterns exist in the execution of GCNs in MultiAccSys, which introduces massive amount of redundant network transmissions and off-chip memory accesses; (2) overall, the acceleration of GCNs in MultiAccSys is bandwidth-bound and latency-tolerant. Guided by these two observations, we then propose MultiGCN, the first MultiAccSys for large-scale GCNs that trades network latency for network bandwidth. Specifically, by leveraging the network latency tolerance, we first propose a topology-aware multicast mechanism with a one put per multicast message-passing model to reduce transmissions and alleviate network bandwidth requirements. Second, we introduce a scatter-based round execution mechanism which cooperates with the multicast mechanism and reduces redundant off-chip memory accesses. Compared to the baseline MultiAccSys, MultiGCN achieves 4~12x speedup using only 28%~68% energy, while reducing 32% transmissions and 73% off-chip memory accesses on average. It not only achieves 2.5~8x speedup over the state-of-the-art multi-GPU solution, but also scales to large-scale graphs as opposed to single-node GCN accelerators.

preprint2022arXiv

MVP-Net: Multiple View Pointwise Semantic Segmentation of Large-Scale Point Clouds

Semantic segmentation of 3D point cloud is an essential task for autonomous driving environment perception. The pipeline of most pointwise point cloud semantic segmentation methods includes points sampling, neighbor searching, feature aggregation, and classification. Neighbor searching method like K-nearest neighbors algorithm, KNN, has been widely applied. However, the complexity of KNN is always a bottleneck of efficiency. In this paper, we propose an end-to-end neural architecture, Multiple View Pointwise Net, MVP-Net, to efficiently and directly infer large-scale outdoor point cloud without KNN or any complex pre/postprocessing. Instead, assumption-based space filling curves and multi-rotation of point cloud methods are introduced to point feature aggregation and receptive field expanding. Numerical experiments show that the proposed MVP-Net is 11 times faster than the most efficient pointwise semantic segmentation method RandLA-Net and achieves the same accuracy on the large-scale benchmark SemanticKITTI dataset.

preprint2022arXiv

SATr: Slice Attention with Transformer for Universal Lesion Detection

Universal Lesion Detection (ULD) in computed tomography plays an essential role in computer-aided diagnosis. Promising ULD results have been reported by multi-slice-input detection approaches which model 3D context from multiple adjacent CT slices, but such methods still experience difficulty in obtaining a global representation among different slices and within each individual slice since they only use convolution-based fusion operations. In this paper, we propose a novel Slice Attention Transformer (SATr) block which can be easily plugged into convolution-based ULD backbones to form hybrid network structures. Such newly formed hybrid backbones can better model long-distance feature dependency via the cascaded self-attention modules in the Transformer block while still holding a strong power of modeling local features with the convolutional operations in the original backbone. Experiments with five state-of-the-art methods show that the proposed SATr block can provide an almost free boost to lesion detection accuracy without extra hyperparameters or special network designs.

preprint2022arXiv

Spin Supersolidity in Nearly Ideal Easy-axis Triangular Quantum Antiferromagnet Na$_2$BaCo(PO$_4$)$_2$

Prototypical models and their material incarnations are cornerstones to the understanding of quantum magnetism. Here we show theoretically that the recently synthesized magnetic compound Na$_2$BaCo(PO$_4$)$_2$ (NBCP) is a rare, nearly ideal material realization of the $S=1/2$ triangular-lattice antiferromagnet with significant easy-axis spin exchange anisotropy. By combining the automatic parameter searching and tensor-network simulations, we establish a microscopic model description of this material with realistic model parameters, which can not only fit well the experimental thermodynamic data but also reproduce the measured magnetization curves without further adjustment of parameters. According to the established model, the NBCP hosts a spin supersolid state that breaks both the lattice translation symmetry and the spin rotational symmetry. Such a state is a spin analogue of the long-sought supersolid state, thought to exist in solid Helium and optical lattice systems, and share similar traits. The NBCP therefore represents an ideal material-based platform to explore the physics of supersolidity as well as its quantum and thermal melting.

preprint2022arXiv

Translucency and negative temperature-dependence for the slip length of water on graphene

Carbonous materials, such as graphene and carbon nanotube, have attracted tremendous attention in the fields of nanofluidics due to the slip at the interface between solid and liquid. The dependence of slip length for water on the types of supporting substrates and thickness of carbonous layer, which is critical for applications such as sustainable cooling of electronic devices, remains unknown. In this paper, using colloidal probe atomic force microscope, we measured the slip length of water on graphene ls supported by hydrophilic and hydrophobic substrates, i.e., SiO2 and octadecyltrimethoxysilane (OTS). The ls on single-layer graphene supported by SiO2 is found to be 1.6~1.9 nm, and by OTS is 8.5~0.9 nm. With the thickness of few-layer graphene increases to 3~4 layers, both ls gradually converge to the value of graphite (4.3~3.5 nm). Such thickness dependence is termed slip length translucency. Further, ls is found to decrease by about 70% with the temperature increases from 300 K to 350 K for 2-layer graphene supported by SiO2. These observations are explained by analysis based on Green-Kubo relation and McLachlan theory. Our results provide the first set of reference values for the slip length of water on supported few-layer graphene. They can not only serve as a direct experimental reference for solid-liquid interaction, but also provide guideline for the design of nanofluidics-based devices, for example the thermo-mechanical nanofluidic devices.

preprint2021arXiv

Direct Visualization of a Static Incommensurate Antiferromagnetic Order by Suppressing the Superconducting Phase Coherence in Fe-doped Bi2Sr2CaCu2O8+delta

In cuprate superconductors, due to strong electronic correlations, there are multiple intertwined orders which either coexist or compete with superconductivity. Among them the antiferromagnetic (AF) order is the most prominent one. In the region where superconductivity sets in, the long-range AF order is destroyed. Yet the residual short-range AF fluctuations are present up to a much higher doping and their role in the emergence of the superconducting phase is still highly debated. Here, by using a spin polarized scanning tunneling microscope, for the first time, we directly visualize an emergent incommensurate AF order in the nearby region of Fe impurities embedded in the optimally doped Bi2Sr2CaCu2O8+δ (Bi2212). Remarkably the Fe impurities suppress the superconducting coherence peaks with the gapped feature intact, but pin down the ubiquitous short-range incommensurate AF order. Our work shows an intimate relation between antiferromagnetism and superconductivity.

preprint2021arXiv

Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration

Current state-of-the-art large-scale conversational AI or intelligent digital assistant systems in industry comprises a set of components such as Automatic Speech Recognition (ASR) and Natural Language Understanding (NLU). For some of these systems that leverage a shared NLU ontology (e.g., a centralized intent/slot schema), there exists a separate skill routing component to correctly route a request to an appropriate skill, which is either a first-party or third-party application that actually executes on a user request. The skill routing component is needed as there are thousands of skills that can either subscribe to the same intent and/or subscribe to an intent under specific contextual conditions (e.g., device has a screen). Ensuring model robustness or resilience in the skill routing component is an important problem since skills may dynamically change their subscription in the ontology after the skill routing model has been deployed to production. We show how different modeling design choices impact the model robustness in the context of skill routing on a state-of-the-art commercial conversational AI system, specifically on the choices around data augmentation, model architecture, and optimization method. We show that applying data augmentation can be a very effective and practical way to drastically improve model robustness.

preprint2021arXiv

No observation of chiral flux current in the topological kagome metal CsV$_{3}$Sb$_{5}$

Compounds with kagome lattice usually host many exotic quantum states, including the quantum spin liquid, non-trivial topological Dirac bands and a strongly renormalized flat band, etc. Recently an interesting vanadium based kagome family $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, or Cs) was discovered, and these materials exhibit multiple interesting properties, including unconventional saddle-point driving charge density wave (CDW) state, superconductivity, etc. Furthermore, some experiments show anomalous Hall effect which inspires that there might be some chiral flux current states. Here we report scanning tunneling measurements by using spin-polarized tips. Although we have observed clearly the $2a_0\times2a_0$ CDW and $4a_0$ stripe orders, the well-designed experiments with refined spin-polarized tips do not reveal any trace of the chiral flux current phase in CsV$_3$Sb$_5$ within the limits of experimental accuracy. No observation of the local magnetic moment in our experiments may put an upper bound constraint on the magnitude of magnetic moments induced by the possible chiral loop current which has a time-reversal symmetry breaking along $c$-axis in CsV$_{3}$Sb$_{5}$.

preprint2020arXiv

A Deep Prediction Network for Understanding Advertiser Intent and Satisfaction

For e-commerce platforms such as Taobao and Amazon, advertisers play an important role in the entire digital ecosystem: their behaviors explicitly influence users' browsing and shopping experience; more importantly, advertiser's expenditure on advertising constitutes a primary source of platform revenue. Therefore, providing better services for advertisers is essential for the long-term prosperity for e-commerce platforms. To achieve this goal, the ad platform needs to have an in-depth understanding of advertisers in terms of both their marketing intents and satisfaction over the advertising performance, based on which further optimization could be carried out to service the advertisers in the correct direction. In this paper, we propose a novel Deep Satisfaction Prediction Network (DSPN), which models advertiser intent and satisfaction simultaneously. It employs a two-stage network structure where advertiser intent vector and satisfaction are jointly learned by considering the features of advertiser's action information and advertising performance indicators. Experiments on an Alibaba advertisement dataset and online evaluations show that our proposed DSPN outperforms state-of-the-art baselines and has stable performance in terms of AUC in the online environment. Further analyses show that DSPN not only predicts advertisers' satisfaction accurately but also learns an explainable advertiser intent, revealing the opportunities to optimize the advertising performance further.

preprint2020arXiv

Bounding Maps for Universal Lesion Detection

Universal Lesion Detection (ULD) in computed tomography plays an essential role in computer-aided diagnosis systems. Many detection approaches achieve excellent results for ULD using possible bounding boxes (or anchors) as proposals. However, empirical evidence shows that using anchor-based proposals leads to a high false-positive (FP) rate. In this paper, we propose a box-to-map method to represent a bounding box with three soft continuous maps with bounds in x-, y- and xy- directions. The bounding maps (BMs) are used in two-stage anchor-based ULD frameworks to reduce the FP rate. In the 1 st stage of the region proposal network, we replace the sharp binary ground-truth label of anchors with the corresponding xy-direction BM hence the positive anchors are now graded. In the 2 nd stage, we add a branch that takes our continuous BMs in x- and y- directions for extra supervision of detailed locations. Our method, when embedded into three state-of-the-art two-stage anchor-based detection methods, brings a free detection accuracy improvement (e.g., a 1.68% to 3.85% boost of sensitivity at 4 FPs) without extra inference time.

preprint2020arXiv

Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

In E-commerce, advertising is essential for merchants to reach their target users. The typical objective is to maximize the advertiser's cumulative revenue over a period of time under a budget constraint. In real applications, an advertisement (ad) usually needs to be exposed to the same user multiple times until the user finally contributes revenue (e.g., places an order). However, existing advertising systems mainly focus on the immediate revenue with single ad exposures, ignoring the contribution of each exposure to the final conversion, thus usually falls into suboptimal solutions. In this paper, we formulate the sequential advertising strategy optimization as a dynamic knapsack problem. We propose a theoretically guaranteed bilevel optimization framework, which significantly reduces the solution space of the original optimization space while ensuring the solution quality. To improve the exploration efficiency of reinforcement learning, we also devise an effective action space reduction approach. Extensive offline and online experiments show the superior performance of our approaches over state-of-the-art baselines in terms of cumulative revenue.

preprint2020arXiv

Explicit Result on Equivalence of Rational Quadratic Forms Avoiding Primes

Given a pair of regular quadratic forms over $\mathbb{Q}$ which are in the same genus and a finite set of primes $P$, we show that there is an effective way to determine a rational equivalence between these two quadratic forms which are integral over every prime in $P$. This answers one of the principal questions posed by Conway and Sloane in their book {\em Sphere packings, lattices and groups}, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], Vol 290, Springer-Verlag, New York, 1999; page 402.

preprint2020arXiv

Kosterlitz-Thouless Melting of Magnetic Order in the Triangular Quantum Ising Material TmMgGaO$_4$

Frustrated magnets host the promises of material realizations of new paradigm of quantum matter, while direct comparison of unbiased model calculations with experimental measurements is still very challenging. Here, we design and implement a protocol of employing many-body computation methodologies for accurate model calculation -- both equilibrium and dynamical properties -- of a frustrated rare-earth magnet TmMgGaO$_4$ (TMGO), which perfectly explains the corresponding experimental findings. Our results confirm TMGO is an ideal realization of triangular-lattice Ising model with an intrinsic transverse field. The magnetic order of TMGO is predicted to melt through two successive Kosterlitz-Thouless (KT) phase transitions, with a floating KT phase in between. The dynamical spectra calculated suggest remnant images of a vanishing magnetic stripe order that represent vortex-antivortex pairs, resembling rotons in a superfluid helium film. TMGO therefore constitutes a rare quantum magnet for realizing KT physics and we further propose experimental detections of its intriguing properties.

preprint2020arXiv

Learning Optimal Tree Models Under Beam Search

Retrieving relevant targets from an extremely large target set under computational limits is a common challenge for information retrieval and recommendation systems. Tree models, which formulate targets as leaves of a tree with trainable node-wise scorers, have attracted a lot of interests in tackling this challenge due to their logarithmic computational complexity in both training and testing. Tree-based deep models (TDMs) and probabilistic label trees (PLTs) are two representative kinds of them. Though achieving many practical successes, existing tree models suffer from the training-testing discrepancy, where the retrieval performance deterioration caused by beam search in testing is not considered in training. This leads to an intrinsic gap between the most relevant targets and those retrieved by beam search with even the optimally trained node-wise scorers. We take a first step towards understanding and analyzing this problem theoretically, and develop the concept of Bayes optimality under beam search and calibration under beam search as general analyzing tools for this purpose. Moreover, to eliminate the discrepancy, we propose a novel algorithm for learning optimal tree models under beam search. Experiments on both synthetic and real data verify the rationality of our theoretical analysis and demonstrate the superiority of our algorithm compared to state-of-the-art methods.

preprint2020arXiv

Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising

Bipartite b-matching is fundamental in algorithm design, and has been widely applied into economic markets, labor markets, etc. These practical problems usually exhibit two distinct features: large-scale and dynamic, which requires the matching algorithm to be repeatedly executed at regular intervals. However, existing exact and approximate algorithms usually fail in such settings due to either requiring intolerable running time or too much computation resource. To address this issue, we propose \texttt{NeuSearcher} which leverages the knowledge learned from previously instances to solve new problem instances. Specifically, we design a multichannel graph neural network to predict the threshold of the matched edges weights, by which the search region could be significantly reduced. We further propose a parallel heuristic search algorithm to iteratively improve the solution quality until convergence. Experiments on both open and industrial datasets demonstrate that \texttt{NeuSearcher} can speed up 2 to 3 times while achieving exactly the same matching solution compared with the state-of-the-art approximation approaches.

preprint2020arXiv

Learning to Infer User Hidden States for Online Sequential Advertising

To drive purchase in online advertising, it is of the advertiser's great interest to optimize the sequential advertising strategy whose performance and interpretability are both important. The lack of interpretability in existing deep reinforcement learning methods makes it not easy to understand, diagnose and further optimize the strategy. In this paper, we propose our Deep Intents Sequential Advertising (DISA) method to address these issues. The key part of interpretability is to understand a consumer's purchase intent which is, however, unobservable (called hidden states). In this paper, we model this intention as a latent variable and formulate the problem as a Partially Observable Markov Decision Process (POMDP) where the underlying intents are inferred based on the observable behaviors. Large-scale industrial offline and online experiments demonstrate our method's superior performance over several baselines. The inferred hidden states are analyzed, and the results prove the rationality of our inference.

preprint2020arXiv

Routing of valley photons in a WS2 monolayer via delocalized Bloch modes of in-plane inversion-symmetry broken photonic crystal slabs

The valleys of two-dimensional transition metal dichalcogenides (TMDCs) offer a new degree of freedom for information processing. To take advantage of this valley degree of freedom, on one hand, it is feasible to control valleys by utilizing different external stimuli like optical and electric fields. On the other hand, nanostructures are also used to separate the valleys by near field coupling. However, for both above methods, either required low-temperature environment or low degree of coherence properties limit their further applications. Here, we demonstrate all-dielectric photonic crystal (PhC) slabs without in-plane inversion symmetry (C2 symmetry) could separate and route valley photons in a WS2 monolayer at room temperature. Coupling with circularly polarized photonic Bloch modes of such PhC slabs, valley photons emitted by a WS2 monolayer are routed directionally and efficiently separated in the far field. In addition, the far-field emission is directionally enhanced and with long-distance spatial coherence property.

preprint2016arXiv

Effective equidistribution of translates of maximal horospherical measures in the space of lattices

Recently Mohammadi and Salehi-Golsefidy gave necessary and sufficient conditions under which certain translates of homogeneous measures converge, and they determined the limiting measures in the cases of convergence. The class of measures they considered includes the maximal horospherical measures. In this paper we prove the corresponding effective equidistribution results in the space of unimodular lattices. We also prove the corresponding results for probability measures with absolutely continuous densities in rank two and three. Then we address the problem of determining the error terms in two counting problems also considered by Mohammadi and Salehi-Golsefidy. In the first problem, we determine an error term for counting the number of lifts of a closed horosphere from an irreducible, finite-volume quotient of the space of positive definite $n\times n$ matrices of determinant one that intersect a ball with large radius. In the second problem, we determine a logarithmic error term for the Manin conjecture of a flag variety over $\mathbb{Q}$.

preprint2015arXiv

Creation of high mobility two-dimensional electron gases via strain induced polarization at an otherwise nonpolar complex oxide interface

The discovery of two-dimensional electron gases (2DEGs) in SrTiO3-based heterostructures provides new opportunities for nanoelectronics. Herein, we create a new type of oxide 2DEG by the epitaxial-strain-induced polarization at an otherwise nonpolar perovskite-type interface of CaZrO3/SrTiO3. Remarkably, this heterointerface is atomically sharp, and exhibits a high electron mobility exceeding 60,000 cm2V-1s-1 at low temperatures. The 2DEG carrier density exhibits a critical dependence on the film thickness, in good agreement with the polarization induced 2DEG scheme.

preprint2013arXiv

Design of a 325MHz Beta=0.12 superconducting single spoke cavity for China-ADS

Twelve superconducting single spoke cavities whose Beta is 0.12 (Spoke012) operating at 325MHz, are adopted in Injector I for China-ADS linac. This type of spoke cavity is believed to be one of the key challenges for its very low geometry Beta. So far, the prototype cavity has been designed, fabricated and tested successfully. The design work was finished by using CST-MWS and ANSYS software for the RF and mechanical properties optimization. This paper presents the details of the final design for Spoke012 prototype cavity.

preprint2013arXiv

Effective Limit Distribution of the Frobenius Numbers

The Frobenius number $F(\ba)$ of a lattice point $\ba$ in $\R^d$ with positive coprime coordinates, is the largest integer which can $not$ be expressed as a non-negative integer linear combination of the coordinates of $\ba$. Marklof in \cite{M} proved the existence of the limit distribution of the Frobenius numbers, when $\ba$ is taken to be random in an enlarging domain in $\R^d$. We will show that if the domain has piecewise smooth boundary, the error term for the convergence of the distribution functions is at most a polynomial in the enlarging factor.

preprint2013arXiv

Spectral gap actions and invariant states

We define spectral gap actions of discrete groups on von Neumann algebras and study their relations with invariant states. We will show that a finitely generated ICC group $Γ$ is inner amenable if and only if there exist more than one inner invariant states on the group von Neumann algebra $L(Γ)$. Moreover, a countable discrete group $Γ$ has property $(T)$ if and only if for any action $α$ of $Γ$ on a von Neumann algebra $N$, every $α$-invariant state on $N$ is a weak-$^*$-limit of a net of normal $α$-invariant states.

preprint2012arXiv

Efficient Superimposition Recovering Algorithm

In this article, we address the issue of recovering latent transparent layers from superimposition images. Here, we assume we have the estimated transformations and extracted gradients of latent layers. To rapidly recover high-quality image layers, we propose an Efficient Superimposition Recovering Algorithm (ESRA) by extending the framework of accelerated gradient method. In addition, a key building block (in each iteration) in our proposed method is the proximal operator calculating. Here we propose to employ a dual approach and present our Parallel Algorithm with Constrained Total Variation (PACTV) method. Our recovering method not only reconstructs high-quality layers without color-bias problem, but also theoretically guarantees good convergence performance.

Han Li

What is connected

Connect this record

See the researcher in context

Building this map preview

29 published item(s)

Action-Aware Generative Sequence Modeling for Short Video Recommendation

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

The Security Analysis of Continuous-Variable Quantum Key Distribution under Limited Eavesdropping with Practical Fiber

Absolute Quantification of sp$^{3}$ Defects in Semiconducting Single-Wall Carbon Nanotubes by Raman Spectroscopy

DNN Training Acceleration via Exploring GPGPU Friendly Sparsity

KPGT: Knowledge-Guided Pre-training of Graph Transformer for Molecular Property Prediction

Multi-node Acceleration for Large-scale GCNs

MVP-Net: Multiple View Pointwise Semantic Segmentation of Large-Scale Point Clouds

SATr: Slice Attention with Transformer for Universal Lesion Detection

Spin Supersolidity in Nearly Ideal Easy-axis Triangular Quantum Antiferromagnet Na$_2$BaCo(PO$_4$)$_2$

Translucency and negative temperature-dependence for the slip length of water on graphene

Direct Visualization of a Static Incommensurate Antiferromagnetic Order by Suppressing the Superconducting Phase Coherence in Fe-doped Bi2Sr2CaCu2O8+delta

Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration

No observation of chiral flux current in the topological kagome metal CsV$_{3}$Sb$_{5}$

A Deep Prediction Network for Understanding Advertiser Intent and Satisfaction

Bounding Maps for Universal Lesion Detection

Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

Explicit Result on Equivalence of Rational Quadratic Forms Avoiding Primes

Kosterlitz-Thouless Melting of Magnetic Order in the Triangular Quantum Ising Material TmMgGaO$_4$

Learning Optimal Tree Models Under Beam Search

Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising

Learning to Infer User Hidden States for Online Sequential Advertising

Routing of valley photons in a WS2 monolayer via delocalized Bloch modes of in-plane inversion-symmetry broken photonic crystal slabs

Effective equidistribution of translates of maximal horospherical measures in the space of lattices

Creation of high mobility two-dimensional electron gases via strain induced polarization at an otherwise nonpolar complex oxide interface

Design of a 325MHz Beta=0.12 superconducting single spoke cavity for China-ADS

Effective Limit Distribution of the Frobenius Numbers

Spectral gap actions and invariant states

Efficient Superimposition Recovering Algorithm