Source author record

Jian Xu

Jian Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

Catalog footprint

What is connected

66 works

38 topics

4 close collaborators

preprint 2026 arXiv

On the Approximation Complexity of Matrix Product Operator Born Machines

Matrix product operator Born machines (MPO-BMs) are tractable tensor-network models for probabilistic modeling, but their efficient approximation capability remains unclear. We characterize this boundary from both negative and positive perspectives. First, we prove that KL approximation is NP-hard for MPO-BMs in the continuous setting, ruling out universal efficient approximation in the worst case. Second, for score-based variational inference, we show that, under locality and spectral-gap conditions on the loss-induced Hamiltonian, structured targets (e.g., path-graph Markov random fields) admit MPO-BM approximations with polynomial bond dimension and provable KL guarantees. Third, under the same locality structure, we prove that polynomially many score queries suffice to estimate the induced Hamiltonian and obtain such guarantees. Our results provide a theoretical characterization of when MPO-BMs are fundamentally hard to approximate and when they become efficiently learnable.

preprint 2022 arXiv

3D large-scale fused silica microfluidic chips enabled by hybrid laser microfabrication for continuous-flow UV photochemical synthesis

We demonstrate a hybrid laser microfabrication approach, which combines the technical merits of ultrafast laser-assisted chemical etching and carbon dioxide laser-induced in-situ melting, for centimeter-scale and bonding-free fabrication of complex 3D hollow microstructures in fused silica glass. With the developed approach, large-scale fused silica microfluidic chips with integrated 3D cascaded micromixing units can be reliably manufactured. High-performance on-chip mixing and continuous-flow photochemical synthesis under UV LED irradiation at ~280 nm were demonstrated using the manufactured chip, indicating a powerful capability for versatile fabrication of highly transparent all-glass microfluidic reactors for on-chip photochemical synthesis.

preprint 2022 arXiv

A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising

In online advertising, auto-bidding has become an essential tool for advertisers to optimize their preferred ad performance metrics by simply expressing high-level campaign objectives and constraints. Previous works designed auto-bidding tools from a single-agent view, without modeling the mutual influence between agents. In this paper, we instead consider this problem from a distributed multi-agent perspective, and propose a general Multi-Agent reinforcement learning framework for Auto-Bidding, namely MAAB, to learn the auto-bidding strategies. First, we investigate the competition and cooperation relation among auto-bidding agents, and propose a temperature-regularized credit assignment to establish a mixed cooperative-competitive paradigm. By carefully making a competition and cooperation trade-off among agents, we can reach an equilibrium state that guarantees not only individual advertisers' utility but also the system performance (i.e., social welfare). Second, to avoid the potential collusion behaviors of bidding low prices underlying the cooperation, we further propose bar agents to set a personalized bidding bar for each agent, and thereby alleviate the revenue degradation due to the cooperation. Third, to deploy MAAB in a large-scale advertising system with millions of advertisers, we propose a mean-field approach. By grouping advertisers with the same objective as a mean auto-bidding agent, the interactions among the large-scale advertisers are greatly simplified, making it practical to train MAAB efficiently. Extensive experiments on an offline industrial dataset and the Alibaba advertising platform demonstrate that our approach outperforms several baseline methods in terms of social welfare and revenue.

preprint 2022 arXiv

AMCAD: Adaptive Mixed-Curvature Representation based Advertisement Retrieval System

Graph embedding based retrieval has become one of the most popular techniques in the information retrieval community and search engine industry. The classical paradigm mainly relies on flat Euclidean geometry. In recent years, hyperbolic (negative curvature) and spherical (positive curvature) representation methods have shown their superiority in capturing hierarchical and cyclic data structures, respectively. However, in industrial scenarios such as e-commerce sponsored search platforms, the large-scale heterogeneous query-item-advertisement interaction graphs often have multiple structures coexisting. Existing methods either consider only a single geometry space or combine several spaces manually, and are therefore neither capable nor flexible enough to model the complexity and heterogeneity of real scenarios. To tackle this challenge, we present a web-scale Adaptive Mixed-Curvature ADvertisement retrieval system (AMCAD) to automatically capture the complex and heterogeneous graph structures in non-Euclidean spaces. Specifically, entities are represented in adaptive mixed-curvature spaces, where the types and curvatures of the subspaces are trained to be optimal combinations. Besides, an attentive edge-wise space projector is designed to model the similarities between heterogeneous nodes according to local graph structures and the relation types. Moreover, to deploy AMCAD in Taobao, one of the largest e-commerce platforms with hundreds of millions of users, we design an efficient two-layer online retrieval framework for the task of graph based advertisement retrieval. Extensive evaluations on real-world datasets and A/B tests on online traffic are conducted to illustrate the effectiveness of the proposed system.

preprint 2022 arXiv

An Information-theoretic Method for Collaborative Distributed Learning with Limited Communication

In this paper, we study the information transmission problem under the distributed learning framework, where each worker node is permitted to transmit only an $m$-dimensional statistic to improve the learning results of the target node. Specifically, we evaluate the corresponding expected population risk (EPR) under the regime of large sample sizes. We prove that the performance can be enhanced since the transmitted statistics contribute to estimating the underlying distribution under the mean square error measured by the EPR norm matrix. Accordingly, the transmitted statistics correspond to the eigenvectors of this matrix, and the desired transmission allocates these eigenvectors among the statistics such that the EPR is minimal. Moreover, we provide the analytical solution of the desired statistics for single-node and two-node transmission, where a geometrical interpretation is given to explain the eigenvector selection. For the general case, an efficient algorithm that can output the allocation solution is developed based on the node partitions.

preprint 2022 arXiv

Artificial Intelligence Enables Real-Time and Intuitive Control of Prostheses via Nerve Interface

Objective: The next-generation prosthetic hand that moves and feels like a real hand requires a robust neural interconnection between human minds and machines. Methods: Here we present a neuroprosthetic system to demonstrate that principle by employing an artificial intelligence (AI) agent to translate the amputee's movement intent through a peripheral nerve interface. The AI agent is designed based on the recurrent neural network (RNN) and can simultaneously decode six degrees of freedom (DOF) from multichannel nerve data in real time. The decoder's performance is characterized in motor decoding experiments with three human amputees. Results: First, we show the AI agent enables amputees to intuitively control a prosthetic hand with individual finger and wrist movements with up to 97-98% accuracy. Second, we demonstrate the AI agent's real-time performance by measuring the reaction time and information throughput in a hand gesture matching task. Third, we investigate the AI agent's long-term use and show the decoder's robust predictive performance over a 16-month implant duration. Conclusion & significance: Our study demonstrates the potential of AI-enabled nerve technology, underlying the next generation of dexterous and intuitive prosthetic hands.

preprint 2022 arXiv

Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction

Alleviating the delayed feedback problem is of crucial importance for conversion rate (CVR) prediction in online advertising. Previous delayed feedback modeling methods use an observation window to balance the trade-off between waiting for accurate labels and consuming fresh feedback. Moreover, to estimate CVR upon the freshly observed but biased distribution with fake negatives, importance sampling is widely used to reduce the distribution bias. While effective, we argue that previous approaches falsely treat fake negative samples as real negatives during the importance weighting and have not fully utilized the observed positive samples, leading to suboptimal performance. In this work, we propose a new method, DElayed Feedback modeling with UnbiaSed Estimation (DEFUSE), which aims to correct the importance weights of the immediate positive, the fake negative, the real negative, and the delay positive samples, respectively, at a finer granularity. Specifically, we propose a two-step optimization approach that first infers the probability of fake negatives among observed negatives before applying importance sampling. To fully exploit the ground-truth immediate positives from the observed distribution, we further develop a bi-distribution modeling framework to jointly model the unbiased immediate positives and the biased delay conversions. Experimental results on both public and our industrial datasets validate the superiority of DEFUSE. Codes are available at https://github.com/ychen216/DEFUSE.git.

preprint 2022 arXiv

FedHAP: Federated Hashing with Global Prototypes for Cross-silo Retrieval

Deep hashing has been widely applied in large-scale data retrieval due to its superior retrieval efficiency and low storage cost. However, data are often scattered in data silos with privacy concerns, so performing centralized data storage and retrieval is not always possible. Leveraging the concept of federated learning (FL) to perform deep hashing is a recent research trend. However, existing frameworks mostly rely on the aggregation of the local deep hashing models, which are trained by performing similarity learning with local skewed data only. Therefore, they cannot work well for non-IID clients in a real federated environment. To overcome these challenges, we propose a novel federated hashing framework that enables participating clients to jointly train the shared deep hashing model by leveraging the prototypical hash codes for each class. Globally, the transmission of global prototypes with only one prototypical hash code per class will minimize the impact of communication cost and privacy risk. Locally, the use of global prototypes is maximized by jointly training a discriminator network and the local hashing network. Extensive experiments on benchmark datasets are conducted to demonstrate that our method can significantly improve the performance of the deep hashing model in federated environments with non-IID data distributions.

preprint 2022 arXiv

Impression Allocation and Policy Search in Display Advertising

In online display advertising, guaranteed contracts and real-time bidding (RTB) are two major ways for a publisher to sell impressions. For large publishers, simultaneously selling impressions through both guaranteed contracts and in-house RTB has become a popular choice. Generally speaking, a publisher needs to derive an impression allocation strategy between guaranteed contracts and RTB to maximize its overall outcome (e.g., revenue and/or impression quality). However, deriving the optimal strategy is not a trivial task, e.g., the strategy should encourage incentive compatibility in RTB and tackle common challenges in real-world applications such as unstable traffic patterns (e.g., changes in impression volume and bid landscape). In this paper, we formulate impression allocation as an auction problem where each guaranteed contract submits virtual bids for individual impressions. With this formulation, we derive the optimal bidding functions for the guaranteed contracts, which result in the optimal impression allocation. In order to address the unstable traffic pattern challenge and achieve the optimal overall outcome, we propose a multi-agent reinforcement learning method to adjust the bids from each guaranteed contract, which is simple, scalable, and converges efficiently. The experiments conducted on real-world datasets demonstrate the effectiveness of our method.

preprint 2022 arXiv

Learning Disentangled Representations for Counterfactual Regression via Mutual Information Minimization

Learning individual-level treatment effect is a fundamental problem in causal inference and has received increasing attention in many areas, especially in the user growth area which concerns many internet companies. Recently, disentangled representation learning methods that decompose covariates into three latent factors, including instrumental, confounding and adjustment factors, have witnessed great success in treatment effect estimation. However, it remains an open problem how to learn the underlying disentangled factors precisely. Specifically, previous methods fail to obtain independent disentangled factors, which is a necessary condition for identifying treatment effect. In this paper, we propose Disentangled Representations for Counterfactual Regression via Mutual Information Minimization (MIM-DRCFR), which uses a multi-task learning framework to share information when learning the latent factors and incorporates MI minimization learning criteria to ensure the independence of these factors. Extensive experiments including public benchmarks and real-world industrial user growth datasets demonstrate that our method performs much better than state-of-the-art methods.

preprint 2022 arXiv

Leaving No One Behind: A Multi-Scenario Multi-Task Meta Learning Approach for Advertiser Modeling

Advertisers play an essential role in many e-commerce platforms like Taobao and Amazon. Fulfilling their marketing needs and supporting their business growth is critical to the long-term prosperity of platform economies. However, compared with extensive studies on user modeling such as click-through rate predictions, much less attention has been paid to advertisers, especially in terms of understanding their diverse demands and performance. Different from user modeling, advertiser modeling generally involves many kinds of tasks (e.g., predictions of advertisers' expenditure, active-rate, or total impressions of promoted products). In addition, major e-commerce platforms often provide multiple marketing scenarios (e.g., Sponsored Search, Display Ads, Live Streaming Ads), while advertisers' behavior tends to be dispersed among many of them. This raises the necessity of multi-task and multi-scenario consideration in comprehensive advertiser modeling, which faces the following challenges: first, one model per scenario or per task simply doesn't scale; second, it is particularly hard to model new or minor scenarios with limited data samples; third, inter-scenario correlations are complicated and may vary given different tasks. To tackle these challenges, we propose a multi-scenario multi-task meta learning approach (M2M) which simultaneously predicts multiple tasks in multiple advertising scenarios.

preprint 2022 arXiv

Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

Passage retrieval is a fundamental task in information retrieval (IR) research, which has drawn much attention recently. For English, the availability of large-scale annotated datasets (e.g., MS MARCO) and the emergence of deep pre-trained language models (e.g., BERT) have resulted in a substantial improvement of existing passage retrieval systems. However, for Chinese, especially in specific domains, passage retrieval systems are still immature because high-quality annotated datasets are limited in scale. Therefore, in this paper, we present a novel multi-domain Chinese dataset for passage retrieval (Multi-CPR). The dataset is collected from three different domains: E-commerce, Entertainment video and Medical. Each dataset contains millions of passages and a certain amount of human-annotated query-passage related pairs. We implement various representative passage retrieval methods as baselines. We find that the performance of retrieval models trained on a general-domain dataset inevitably decreases on a specific domain. Nevertheless, a passage retrieval system built on an in-domain annotated dataset can achieve significant improvement, which demonstrates the necessity of domain-labeled data for further optimization. We hope the release of the Multi-CPR dataset can serve as a benchmark for Chinese passage retrieval in specific domains and also advance future studies.

preprint 2022 arXiv

Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation From Monocular RGB Image

Recently, RGBD-based category-level 6D object pose estimation has achieved promising improvement in performance; however, the requirement of depth information prohibits broader applications. To relieve this problem, this paper proposes a novel approach named Object Level Depth reconstruction Network (OLD-Net), which takes only RGB images as input for category-level 6D object pose estimation. We propose to directly predict object-level depth from a monocular RGB image by deforming the category-level shape prior into object-level depth and the canonical NOCS representation. Two novel modules, named Normalized Global Position Hints (NGPH) and Shape-aware Decoupled Depth Reconstruction (SDDR), are introduced to learn high-fidelity object-level depth and delicate shape representations. Finally, the 6D object pose is solved by aligning the predicted canonical representation with the back-projected object-level depth. Extensive experiments on the challenging CAMERA25 and REAL275 datasets indicate that our model, though simple, achieves state-of-the-art performance.

preprint 2022 arXiv

TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed

Continuous normalizing flows (CNFs) construct invertible mappings between an arbitrary complex distribution and an isotropic Gaussian distribution using Neural Ordinary Differential Equations (neural ODEs). They have not been tractable on large datasets due to the incremental complexity of neural ODE training. Optimal Transport theory has been applied to regularize the dynamics of the ODE to speed up training in recent works. In this paper, a temporal optimization is proposed by optimizing the evolutionary time for forward propagation of the neural ODE training. In this approach, we optimize the network weights of the CNF alternately with evolutionary time by coordinate descent. Furthermore, with temporal regularization, stability of the evolution is ensured. This approach can be used in conjunction with the original regularization approach. We have experimentally demonstrated that the proposed approach can significantly accelerate training without sacrificing performance over baseline models.

preprint 2022 arXiv

Unconventional steady states and topological phases in an open two-level non-Hermitian system

Decoherence and non-Hermiticity are two different effects in open quantum systems. Both of them have triggered many interesting phenomena. In this paper, we theoretically study an open two-level non-Hermitian system coupled to a dissipative environment by solving the vectorized Lindblad equation. This scheme provides us with a powerful framework to address widespread open systems with gain, loss and dissipation. Our results show that there exists a new class of exceptional points (EPs) and steady states due to the interplay between non-Hermiticity and decoherence. Furthermore, we also demonstrate new types of topological properties of eigenstates with zero real part of the eigenvalues ($\mathrm{Re}[\lambda]=0$), which correspond to Fermi arcs. It is revealed that the eigenstates located in the Fermi arc regime have a topological phase $|\pi/2|$ which is totally unaffected by the dissipative environment. Our results provide a promising approach for further uncovering and understanding the intriguing properties of non-Hermitian open systems.

preprint 2022 arXiv

Visual Encoding and Debiasing for CTR Prediction

Extracting expressive visual features is crucial for accurate Click-Through-Rate (CTR) prediction in visual search advertising systems. Current commercial systems use off-the-shelf visual encoders to facilitate fast online service. However, the extracted visual features are coarse-grained and/or biased. In this paper, we present a visual encoding framework for CTR prediction to overcome these problems. The framework is based on contrastive learning, which pulls positive pairs closer and pushes negative pairs apart in the visual feature space. To obtain fine-grained visual features, we present contrastive learning supervised by click-through data to fine-tune the visual encoder. To reduce sample selection bias, we first train the visual encoder offline by leveraging both unbiased self-supervision and click supervision signals. Second, we incorporate a debiasing network in the online CTR predictor to adjust the visual features by contrasting high-impression items with selected items with lower impressions. We deploy the framework in the visual sponsored search system at Alibaba. Offline experiments on billion-scale datasets and online experiments demonstrate that the proposed framework can make accurate and unbiased predictions.

preprint 2021 arXiv

Computation Resource Allocation Solution in Recommender Systems

Recommender systems rely heavily on increasing computation resources to improve their business goals. By deploying computation-intensive models and algorithms, these systems are able to infer user interests and exhibit certain ads or commodities from the candidate set to maximize their business goals. However, such systems face two challenges in achieving their goals. On the one hand, facing massive online requests, computation-intensive models and algorithms are pushing their computation resources to the limit. On the other hand, the response time of these systems is strictly limited to a short period, e.g., 300 milliseconds in our real system, which is also being exhausted by the increasingly complex models and algorithms. In this paper, we propose the computation resource allocation solution (CRAS) that maximizes the business goal with limited computation resources and response time. We comprehensively illustrate the problem and formulate it as an optimization problem with multiple constraints, which can be broken down into independent sub-problems. To solve the sub-problems, we propose the revenue function to facilitate the theoretical analysis, and obtain the optimal computation resource allocation strategy. To address the applicability issues, we devise the feedback control system to help our strategy constantly adapt to the changing online environment. The effectiveness of our method is verified by extensive experiments based on the real dataset from Taobao.com. We also deploy our method in the display advertising system of Alibaba. The online results show that our computation resource allocation solution achieves significant business goal improvement without any increase in computation cost, which demonstrates the efficacy of our method in real industrial practice.

preprint 2021 arXiv

Optimizing Multiple Performance Metrics with Deep GSP Auctions for E-commerce Advertising

In e-commerce advertising, the ad platform usually relies on auction mechanisms to optimize different performance metrics, such as user experience, advertiser utility, and platform revenue. However, most of the state-of-the-art auction mechanisms only focus on optimizing a single performance metric, e.g., either social welfare or revenue, and are not suitable for e-commerce advertising with various, dynamic, difficult to estimate, and even conflicting performance metrics. In this paper, we propose a new mechanism called Deep GSP auction, which leverages deep learning to design new rank score functions within the celebrated GSP auction framework. These new rank score functions are implemented via deep neural network models under the constraints of monotone allocation and smooth transition. The requirement of monotone allocation ensures that the Deep GSP auction has good game-theoretic properties, while the requirement of smooth transition guarantees that advertiser utilities do not fluctuate too much when the auction mechanism switches among candidate mechanisms to achieve different optimization objectives. We deployed the proposed mechanisms in a leading e-commerce ad platform and conducted comprehensive experimental evaluations with both offline simulations and online A/B tests. The results demonstrated the effectiveness of the Deep GSP auction compared to the state-of-the-art auction mechanisms.
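
For orientation, the classical generalized second-price (GSP) framework that this abstract builds on can be sketched as follows. This is a minimal illustration, not the paper's mechanism: the rank score here is the textbook hand-written product of bid and a quality factor, standing in for the learned rank score functions the abstract describes. The function name `gsp_auction` and the `quality` input are hypothetical.

```python
def gsp_auction(bids, quality, n_slots):
    """Classical GSP with rank score = bid * quality (an assumed, textbook score).

    bids, quality: dicts mapping advertiser id -> positive float.
    Returns a list of (advertiser, price_per_click) for the filled slots.
    """
    # Rank advertisers by descending rank score.
    ranked = sorted(bids, key=lambda a: bids[a] * quality[a], reverse=True)
    results = []
    for pos, adv in enumerate(ranked[:n_slots]):
        if pos + 1 < len(ranked):
            # Pay the smallest bid that would still keep this position:
            # the next advertiser's rank score divided by own quality.
            nxt = ranked[pos + 1]
            price = bids[nxt] * quality[nxt] / quality[adv]
        else:
            # No competitor below: price falls to zero (no reserve price here).
            price = 0.0
        results.append((adv, price))
    return results
```

Because the score is increasing in the bid, raising a bid can never lower an advertiser's position, which is the monotone allocation property the abstract refers to; each winner's price also never exceeds its own bid.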

preprint 2020 arXiv

A Deep Prediction Network for Understanding Advertiser Intent and Satisfaction

For e-commerce platforms such as Taobao and Amazon, advertisers play an important role in the entire digital ecosystem: their behaviors explicitly influence users' browsing and shopping experience; more importantly, advertisers' expenditure on advertising constitutes a primary source of platform revenue. Therefore, providing better services for advertisers is essential for the long-term prosperity of e-commerce platforms. To achieve this goal, the ad platform needs to have an in-depth understanding of advertisers in terms of both their marketing intents and satisfaction with the advertising performance, based on which further optimization can be carried out to serve the advertisers in the correct direction. In this paper, we propose a novel Deep Satisfaction Prediction Network (DSPN), which models advertiser intent and satisfaction simultaneously. It employs a two-stage network structure where the advertiser intent vector and satisfaction are jointly learned by considering the features of advertisers' action information and advertising performance indicators. Experiments on an Alibaba advertisement dataset and online evaluations show that our proposed DSPN outperforms state-of-the-art baselines and has stable performance in terms of AUC in the online environment. Further analyses show that DSPN not only predicts advertisers' satisfaction accurately but also learns an explainable advertiser intent, revealing opportunities to optimize the advertising performance further.

preprint 2020 arXiv

A Deep Recurrent Survival Model for Unbiased Ranking

Position bias is a critical problem in information retrieval when dealing with implicit yet biased user feedback data. Unbiased ranking methods typically rely on causality models and debias the user feedback through inverse propensity weighting. While practical, these methods still suffer from two major problems. First, when inferring a user click, the impact of the contextual information, such as documents that have been examined, is often ignored. Second, only the position bias is considered, while other issues resulting from user browsing behaviors are overlooked. In this paper, we propose an end-to-end Deep Recurrent Survival Ranking (DRSR), a unified framework to jointly model a user's various behaviors, to (i) consider the rich contextual information in the ranking list; and (ii) address the hidden issues underlying user behaviors, i.e., to mine observation patterns in queries without any click (non-click queries), and to model tracking logs which cannot truly reflect the user browsing intents (untrusted observation). Specifically, we adopt a recurrent neural network to model the contextual information and estimate the conditional likelihood of user feedback at each position. We then incorporate survival analysis techniques with the probability chain rule to mathematically recover the unbiased joint probability of one user's various behaviors. DRSR can be easily incorporated with both point-wise and pair-wise learning objectives. Extensive experiments over two large-scale industrial datasets demonstrate the significant performance gains of our model compared with state-of-the-art methods.

preprint 2020 arXiv

Anapole mediated giant photothermal nonlinearity in nanostructured silicon

Featuring a plethora of electric and magnetic Mie resonances, high-index dielectric nanostructures offer a versatile platform to concentrate light-matter interactions at the nanoscale. By integrating unique features of far-field scattering control and near-field concentration from radiationless anapole states, here, we demonstrate a giant photothermal nonlinearity in single subwavelength-sized silicon nanodisks. The nanoscale energy concentration and consequent near-field enhancements mediated by the anapole mode yield a reversible nonlinear scattering with a large modulation depth and a broad dynamic range, unveiling a record-high nonlinear index change up to 0.5 at mild incident light intensities on the order of MW/cm². The observed photothermal nonlinearity showcases three orders of magnitude enhancement compared with that of unstructured bulk silicon, and is nearly one order of magnitude higher than that through the radiative electric dipolar mode. Such nonlinear scattering can empower distinctive point spread functions in confocal reflectance imaging, offering the potential for far-field localization of nanostructured Si with an accuracy approaching 40 nm. Our findings shed new light on active silicon photonics based on optical anapoles.

preprint 2020 arXiv

Building a PubMed knowledge graph

PubMed is an essential resource for the medical domain, but useful concepts are either difficult to extract or ambiguous, which has significantly hindered knowledge discovery. To address this issue, we constructed a PubMed knowledge graph (PKG) by extracting bio-entities from 29 million PubMed abstracts, disambiguating author names, integrating funding data through the National Institutes of Health (NIH) ExPORTER, collecting affiliation history and educational background of authors from ORCID, and identifying fine-grained affiliation data from MapAffil. Through the integration of the credible multi-source data, we could create connections among the bio-entities, authors, articles, affiliations, and funding. Data validation revealed that the BioBERT deep learning method of bio-entity extraction significantly outperformed the state-of-the-art models based on the F1 score (by 0.51%), with the author name disambiguation (AND) achieving an F1 score of 98.09%. PKG can trigger broader innovations, not only enabling us to measure scholarly impact, knowledge usage, and knowledge transfer, but also assisting us in profiling authors and organizations based on their connections with bio-entities. The PKG is freely available on Figshare (https://figshare.com/s/6327a55355fc2c99f3a2, simplified version that excludes PubMed raw data) and the TACC website (http://er.tacc.utexas.edu/datasets/ped, full version).

preprint2020arXiv

Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

In E-commerce, advertising is essential for merchants to reach their target users. The typical objective is to maximize the advertiser's cumulative revenue over a period of time under a budget constraint. In real applications, an advertisement (ad) usually needs to be exposed to the same user multiple times until the user finally contributes revenue (e.g., places an order). However, existing advertising systems mainly focus on the immediate revenue with single ad exposures, ignoring the contribution of each exposure to the final conversion, and thus usually fall into suboptimal solutions. In this paper, we formulate the sequential advertising strategy optimization as a dynamic knapsack problem. We propose a theoretically guaranteed bilevel optimization framework, which significantly reduces the solution space of the original optimization problem while ensuring the solution quality. To improve the exploration efficiency of reinforcement learning, we also devise an effective action space reduction approach. Extensive offline and online experiments show the superior performance of our approaches over state-of-the-art baselines in terms of cumulative revenue.

preprint2020arXiv

Field-weighted Factorization Machines for Click-Through Rate Prediction in Display Advertising

Click-through rate (CTR) prediction is a critical task in online display advertising. The data involved in CTR prediction are typically multi-field categorical data, i.e., every feature is categorical and belongs to one and only one field. One of the interesting characteristics of such data is that features from one field often interact differently with features from other fields. Recently, Field-aware Factorization Machines (FFMs) have been among the best performing models for CTR prediction by explicitly modeling such differences. However, the number of parameters in FFMs is on the order of feature number times field number, which is unacceptable in real-world production systems. In this paper, we propose Field-weighted Factorization Machines (FwFMs) to model the different feature interactions between different fields in a much more memory-efficient way. Our experimental evaluations show that FwFMs can achieve competitive prediction performance with as few as 4% of the parameters of FFMs. When using the same number of parameters, FwFMs can bring 0.92% and 0.47% AUC lift over FFMs on two real CTR prediction data sets.
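The memory saving described in this abstract can be illustrated with a minimal sketch. It assumes, as an interpretation rather than a quotation of the paper, that each feature keeps a single embedding and that each pair of fields shares one scalar weight that rescales the dot product of the two embeddings. The function name, feature names, and all numeric values are hypothetical.

```python
from itertools import combinations


def fwfm_interaction(active, field_of, emb, r):
    """Sum of field-weighted pairwise interactions for one sample.

    active   : list of active (one-hot) feature ids, so each x_i equals 1
    field_of : dict mapping feature id -> field id
    emb      : dict mapping feature id -> embedding vector (list of floats)
    r        : dict mapping (field_a, field_b), field_a < field_b -> scalar weight
    """
    total = 0.0
    for i, j in combinations(active, 2):
        # One scalar per field pair replaces the per-field embeddings of an FFM.
        fa, fb = sorted((field_of[i], field_of[j]))
        dot = sum(a * b for a, b in zip(emb[i], emb[j]))
        total += dot * r[(fa, fb)]
    return total


# Hypothetical sample with three fields: publisher (0), advertiser (1), gender (2).
field_of = {"pub=espn": 0, "adv=nike": 1, "gender=m": 2}
emb = {"pub=espn": [1.0, 0.0], "adv=nike": [1.0, 1.0], "gender=m": [0.0, 2.0]}
r = {(0, 1): 0.5, (0, 2): 0.0, (1, 2): 2.0}

score = fwfm_interaction(list(field_of), field_of, emb, r)
```

In this sketch the parameter count grows with the number of features (one embedding each) plus the number of field pairs, instead of with the product of feature number and field number.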

preprint2020arXiv

Generalized bioinspired approach to a daytime radiative cooling "skin"

Energy-saving cooling materials with strong operability are desirable towards sustainable thermal management. Inspired by the cooperative thermo-optical effect in the fur of polar bears, we develop a flexible and reusable cooling skin via laminating a polydimethylsiloxane film with a highly-scattering polyethylene aerogel. Owing to its high porosity of 97.9% and tailored pore size of 3.8 ± 1.4 micrometers, superior solar reflectance of 0.96 and high transparency to irradiated thermal energy of 0.8 can be achieved at a thickness of 2.7 mm. Combined with low thermal conductivity of 0.032 W/m/K of the aerogel, the cooling skin exerts midday sub-ambient temperature drops of 5-6 degrees in a metropolitan environment, with an estimated limit of 14 degrees under ideal service conditions. We envision that this generalized bilayer approach will construct a bridge from night-time to daytime radiative cooling and pave the way for economical, scalable, flexible and reusable cooling materials.

preprint2020arXiv

GmFace: A Mathematical Model for Face Image Representation Using Multi-Gaussian

Establishing mathematical models is a ubiquitous and effective method to understand the objective world. Due to complex physiological structures and dynamic behaviors, mathematical representation of the human face is an especially challenging task. A mathematical model for face image representation called GmFace is proposed in the form of a multi-Gaussian function in this paper. The model utilizes the advantages of the two-dimensional Gaussian function, which provides a symmetric bell surface with a shape that can be controlled by parameters. The GmNet is then designed using Gaussian functions as neurons, with parameters that correspond to each of the parameters of GmFace in order to transform the problem of GmFace parameter solving into a network optimization problem of GmNet. The face modeling process can be described by the following steps: (1) GmNet initialization; (2) feeding GmNet with face image(s); (3) training GmNet until convergence; (4) drawing out the parameters of GmNet (the same as those of GmFace); (5) recording the face model GmFace. Furthermore, using GmFace, several face image transformation operations can be realized mathematically through simple parameter computation.
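The idea of representing an image as a sum of two-dimensional Gaussians, and of transforming it by editing parameters, can be sketched as follows. This is only an illustration under stated assumptions: each component is taken to be axis-aligned with an amplitude, a center, and two widths, which may differ from the parameterization used in the paper, and fitting the parameters (the role of GmNet) is not shown. All names and values are hypothetical.

```python
import math


def gaussian2d(x, y, amp, cx, cy, sx, sy):
    """Axis-aligned 2D Gaussian bell with amplitude amp, center (cx, cy), widths (sx, sy)."""
    return amp * math.exp(-((x - cx) ** 2 / (2 * sx ** 2) + (y - cy) ** 2 / (2 * sy ** 2)))


def multi_gaussian(x, y, components):
    """Intensity at (x, y) modeled as a sum of Gaussian components."""
    return sum(gaussian2d(x, y, *c) for c in components)


def translate(components, dx, dy):
    """Shift the modeled image by editing only the component centers."""
    return [(amp, cx + dx, cy + dy, sx, sy) for amp, cx, cy, sx, sy in components]


# Two hypothetical components: (amp, cx, cy, sx, sy).
model = [(1.0, 0.0, 0.0, 1.0, 1.0), (0.5, 2.0, 0.0, 1.0, 1.0)]
shifted = translate(model, 3.0, 1.0)
```

Here a translation of the whole image reduces to adding an offset to each center, which is one simple instance of a transformation carried out through parameter computation.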

preprint2020arXiv

Great Chiral Fluorescence from Optical Duality Silver Nanostructures Enabled by 3D Laser Printing

Featured by prominent flexibility and fidelity in producing sophisticated stereoscopic structures transdimensionally, the three-dimensional (3D) laser printing technique has vastly extended the toolkit for delivering diverse functional devices. Yet chiral nanoemitters heavily resorting to artificial structures that manifest efficient emission and tightly confined light-matter interactions simultaneously remain alluring but dauntingly challenging for this technique at this moment. In this work, we assert that chiral photoluminescence is implemented from silver nanostructures of optical duality in one go via a twofold three-dimensional laser printing scheme. Such a laser printing protocol allows the highly desired duality by simultaneously producing uniformly distributed fluorescent silver nanoclusters and aggregated plasmonic silver nanoparticles to tightly confine chiral interactions at the nanoscale. A fabricated helical emitter with a helix diameter of 550 nm exhibits a record-high luminescence anisotropic factor with an absolute value up to 0.58, which is two orders of magnitude greater than that of fluorescent chiral silver clusters. This method holds great promise for future versatile applications in chiroptical nanodevices.

preprint2020arXiv

Inverse scattering transform for the Kundu-Eckhaus Equation with nonzero boundary condition

In this paper, we consider the initial value problem for both the defocusing and focusing Kundu-Eckhaus (KE) equations with non-zero boundary conditions (NZBCs) at infinity by the inverse scattering transform (IST) method. The solutions of the KE equation with NZBCs can be reconstructed in terms of the solution of an associated $2 \times 2$ matrix Riemann-Hilbert problem (RHP). In our formulation, both the direct and the inverse problems are posed in terms of a suitable uniformization variable which allows us to develop the IST on the standard complex plane instead of a two-sheeted Riemann surface or the cut plane with discontinuities along the cuts. Furthermore, on the one hand, we obtain the N-soliton solutions with simple poles of the defocusing and focusing KE equations with the NZBCs; in particular, the explicit one-soliton solutions are given in detail. We also prove that the scattering data $a(ζ)$ of the defocusing KE equation can only have simple zeros. On the other hand, we also obtain the soliton solutions with double poles of the focusing KE equation with NZBCs, and we show that the double pole solutions can be viewed as some proper limit of the two simple pole soliton solutions. Some dynamical behaviors and typical collisions of the soliton solutions of both the defocusing and focusing KE equations are shown graphically.

preprint2020arXiv

Joint and Progressive Subspace Analysis (JPSA) with Spatial-Spectral Manifold Alignment for Semi-Supervised Hyperspectral Dimensionality Reduction

Conventional nonlinear subspace learning techniques (e.g., manifold learning) usually introduce some drawbacks in explainability (explicit mapping) and cost-effectiveness (linearization), generalization capability (out-of-sample), and representability (spatial-spectral discrimination). To overcome these shortcomings, a novel linearized subspace analysis technique with spatial-spectral manifold alignment is developed for a semi-supervised hyperspectral dimensionality reduction (HDR), called joint and progressive subspace analysis (JPSA). The JPSA learns a high-level, semantically meaningful, joint spatial-spectral feature representation from hyperspectral data by 1) jointly learning latent subspaces and a linear classifier to find an effective projection direction favorable for classification; 2) progressively searching several intermediate states of subspaces to approach an optimal mapping from the original space to a potentially more discriminative subspace; 3) spatially and spectrally aligning manifold structure in each learned latent subspace in order to preserve the same or similar topological property between the compressed data and the original data. A simple but effective classifier, i.e., nearest neighbor (NN), is explored as a potential application for validating the algorithm performance of different HDR approaches. Extensive experiments are conducted to demonstrate the superiority and effectiveness of the proposed JPSA on two widely-used hyperspectral datasets: Indian Pines (92.98%) and the University of Houston (86.09%) in comparison with previous state-of-the-art HDR methods. The demo of this basic work (i.e., ECCV2018) is openly available at https://github.com/danfenghong/ECCV2018_J-Play.

preprint2020arXiv

Learning Optimal Tree Models Under Beam Search

Retrieving relevant targets from an extremely large target set under computational limits is a common challenge for information retrieval and recommendation systems. Tree models, which formulate targets as leaves of a tree with trainable node-wise scorers, have attracted a lot of interest in tackling this challenge due to their logarithmic computational complexity in both training and testing. Tree-based deep models (TDMs) and probabilistic label trees (PLTs) are two representative kinds. Though achieving many practical successes, existing tree models suffer from the training-testing discrepancy, where the retrieval performance deterioration caused by beam search in testing is not considered in training. This leads to an intrinsic gap between the most relevant targets and those retrieved by beam search with even the optimally trained node-wise scorers. We take a first step towards understanding and analyzing this problem theoretically, and develop the concept of Bayes optimality under beam search and calibration under beam search as general analytical tools for this purpose. Moreover, to eliminate the discrepancy, we propose a novel algorithm for learning optimal tree models under beam search. Experiments on both synthetic and real data verify the rationality of our theoretical analysis and demonstrate the superiority of our algorithm compared to state-of-the-art methods.
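The gap between the most relevant leaves and those returned by beam search can be seen on a toy tree. The sketch below is a generic level-wise beam search, not the training algorithm proposed in the paper; the tree, the node scores, and the function name are hypothetical.

```python
def beam_search(children, score, root, beam_size):
    """Return the leaves reached by level-wise beam search from root.

    children  : dict mapping node -> list of child nodes (leaves are absent)
    score     : callable mapping node -> node-wise relevance score
    beam_size : number of nodes kept at each level
    """
    beam = [root]
    leaves = []
    while beam:
        candidates = []
        for node in beam:
            kids = children.get(node, [])
            if kids:
                candidates.extend(kids)
            else:
                leaves.append(node)  # a node without children is a retrieved target
        # Keep only the top-scoring children; everything else is pruned for good.
        candidates.sort(key=score, reverse=True)
        beam = candidates[:beam_size]
    return leaves


# Hypothetical tree: the best leaf "b1" sits under the lower-scored internal node "B".
children = {"root": ["A", "B"], "A": ["a1", "a2"], "B": ["b1", "b2"]}
scores = {"A": 0.9, "B": 0.8, "a1": 0.3, "a2": 0.2, "b1": 0.95, "b2": 0.1}

narrow = beam_search(children, scores.get, "root", beam_size=1)
wide = beam_search(children, scores.get, "root", beam_size=2)
```

With a beam of size 1 the subtree under "B" is pruned at the first level, so the highest-scoring leaf is never visited; widening the beam to 2 recovers it in this example.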

preprint2020arXiv

Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising

Bipartite b-matching is fundamental in algorithm design, and has been widely applied to economic markets, labor markets, etc. These practical problems usually exhibit two distinct features: large-scale and dynamic, which require the matching algorithm to be repeatedly executed at regular intervals. However, existing exact and approximate algorithms usually fail in such settings due to either requiring intolerable running time or too many computational resources. To address this issue, we propose NeuSearcher, which leverages the knowledge learned from previously solved instances to solve new problem instances. Specifically, we design a multichannel graph neural network to predict the threshold of the matched edge weights, by which the search region could be significantly reduced. We further propose a parallel heuristic search algorithm to iteratively improve the solution quality until convergence. Experiments on both open and industrial datasets demonstrate that NeuSearcher achieves a 2 to 3 times speedup while achieving exactly the same matching solution compared with the state-of-the-art approximation approaches.

preprint2020arXiv

Learning to Infer User Hidden States for Online Sequential Advertising

To drive purchases in online advertising, it is of great interest to the advertiser to optimize the sequential advertising strategy, whose performance and interpretability are both important. The lack of interpretability in existing deep reinforcement learning methods makes it difficult to understand, diagnose and further optimize the strategy. In this paper, we propose our Deep Intents Sequential Advertising (DISA) method to address these issues. The key part of interpretability is to understand a consumer's purchase intent which is, however, unobservable (called hidden states). In this paper, we model this intention as a latent variable and formulate the problem as a Partially Observable Markov Decision Process (POMDP) where the underlying intents are inferred based on the observable behaviors. Large-scale industrial offline and online experiments demonstrate our method's superior performance over several baselines. The inferred hidden states are analyzed, and the results prove the rationality of our inference.

preprint2020arXiv

Real-Time Cardiac Cine MRI with Residual Convolutional Recurrent Neural Network

Real-time cardiac cine MRI does not require ECG gating in the data acquisition and is more useful for patients who cannot hold their breath or have abnormal heart rhythms. However, to achieve fast image acquisition, real-time cine commonly acquires highly undersampled data, which imposes a significant challenge for MRI image reconstruction. We propose a residual convolutional RNN for real-time cardiac cine reconstruction. To the best of our knowledge, this is the first work applying a deep learning approach to Cartesian real-time cardiac cine reconstruction. Based on the evaluation from radiologists, our deep learning model shows superior performance to compressed sensing.

preprint2020arXiv

Top-Down Shape Abstraction Based on Greedy Pole Selection

Motivated by the fact that the medial axis transform is able to encode nearly the complete shape, we propose to use as few medial balls as possible to approximate the original volume enclosed by the boundary surface. We progressively select new medial balls, in a top-down style, to enlarge the region spanned by the existing medial balls. The key spirit of the selection strategy is to encourage large medial balls while imposing given geometric constraints. We further propose a speedup technique based on a provable observation that the intersection of medial balls implies the adjacency of power cells (in the sense of the power crust). We further elaborate the selection rules in combination with two closely related applications. One application is to develop an easy-to-use ball-stick modeling system that helps non-professional users to quickly build a shape with only balls and wires, but any penetration between two medial balls must be suppressed. The other application is to generate porous structures with convex, compact (with a high isoperimetric quotient) and shape-aware pores where two adjacent spherical pores may have penetration as long as the mechanical rigidity can be well preserved.

preprint2019arXiv

Freeform microfluidic networks encapsulated in laser printed three-dimensional macro-scale glass objects

Large-scale microfluidic microsystems with complex three-dimensional (3D) configurations are highly in demand by both fundamental research and industrial application, holding the potential for fostering a wide range of innovative applications such as lab-on-a-chip and organ-on-a-chip as well as continuous-flow manufacturing of fine chemicals. However, freeform fabrication of such systems remains challenging for most of the current fabrication techniques in terms of fabrication resolution, flexibility, and achievable footprint size. Here, we report ultrashort pulse laser microfabrication of freeform microfluidic circuits with high aspect ratios and tunable diameters embedded in 3D printed glass objects. We achieve uniform microfluidic channel diameter by carefully distributing a string of extra access ports along the microfluidic channels for avoiding the over-etching in the thin microfluidic channels. After the chemical etching is completed, the extra access ports are sealed using carbon dioxide laser induced localized glass melting. We demonstrate a model hand of fused silica with a size of ~3 cm × 2.7 cm × 1.1 cm in which the whole blood vessel system is encapsulated.

preprint2019arXiv

Self-limited Growth of an Oxyhydroxide Phase at the Fe3O4(001) Surface in Liquid and Ambient Pressure Water

Atomic-scale investigations of metal oxide surfaces exposed to aqueous environments are vital to understand degradation phenomena (e.g. dissolution and corrosion) as well as the performance of these materials in applications. Here, we utilize a new experimental setup for the UHV-compatible dosing of liquids to explore the stability of the Fe3O4(001)-c(2x2) surface following exposure to liquid and ambient pressure water. X-ray photoelectron spectroscopy (XPS) and low energy electron diffraction (LEED) data show that extensive hydroxylation causes the surface to revert to a bulk-like (1x1) termination. However, scanning tunnelling microscopy (STM) images reveal a more complex situation, with the slow growth of an oxyhydroxide phase, which ultimately saturates at approximately 40% coverage. We conclude that the new material contains OH groups from dissociated water coordinated to Fe cations extracted from subsurface layers, and that the surface passivates once the surface oxygen lattice is saturated with H because no further dissociation can take place. The resemblance of the STM images to those acquired in previous electrochemical STM (EC-STM) studies leads us to believe a similar structure exists at the solid-electrolyte interface during immersion at pH 7.

preprint2018arXiv

Polarization-insensitive space-selective etching in fused silica induced by picosecond laser irradiation

It is well known that when fused silica is irradiated with focused femtosecond laser beams, space-selective chemical etching can be achieved. The etching rate depends sensitively on the polarization of the laser. Surprisingly, we observe that by chirping the Fourier-transform-limited femtosecond laser pulses to picosecond pulses, the polarization dependence of the etching rate disappears, whereas an efficient etching rate can still be maintained. Observation with a scanning electron microscope reveals that the chirped pulses can induce interconnected nanocracks in the irradiated areas, which facilitates efficient introduction of the etchant into the microchannel. The reported technology is of great use for fabrication of three-dimensional (3D) microfluidic systems and glass-based 3D printing.

preprint2016arXiv

Lift-Based Bidding in Ad Selection

Real-time bidding (RTB) has become one of the largest online advertising markets in the world. Today the bid price per ad impression is typically decided by the expected value of how it can lead to a desired action event (e.g., registering an account or placing a purchase order) to the advertiser. However, this industry-standard approach to deciding the bid price does not consider the actual effect of the ad shown to the user, which should be measured based on the performance lift among users who have been or have not been exposed to a certain treatment of ads. In this paper, we propose a new bidding strategy and prove that if the bid price is decided based on the performance lift rather than absolute performance value, advertisers can actually gain more action events. We describe the modeling methodology to predict the performance lift and demonstrate the actual performance gain through a blind A/B test with real ad campaigns in an industry-leading Demand-Side Platform (DSP). We also discuss the relationship between attribution models and bidding strategies. We prove that, to move the DSPs to bid based on performance lift, they should be rewarded according to the relative performance lift they contribute.
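The contrast between bidding on absolute performance and bidding on lift can be sketched in a few lines. This is a simplified illustration, assuming that lift is the difference between the action probability with and without the ad and that the bid is proportional to it; the scaling constant, function names, and probabilities are hypothetical and are not taken from the paper.

```python
def value_based_bid(p_with_ad, action_value):
    """Bid on the absolute expected value of showing the ad."""
    return p_with_ad * action_value


def lift_based_bid(p_with_ad, p_without_ad, action_value, scale=1.0):
    """Bid on the incremental effect of the ad (assumed form: scale * lift * value)."""
    lift = max(p_with_ad - p_without_ad, 0.0)  # never pay for a negative effect
    return scale * lift * action_value


# Hypothetical users: (probability of action if shown the ad, probability if not shown).
likely_buyer = (0.05, 0.045)   # would probably convert anyway
persuadable = (0.02, 0.002)    # converts mostly because of the ad

value_bids = [value_based_bid(p, 100.0) for p, _ in (likely_buyer, persuadable)]
lift_bids = [lift_based_bid(p, q, 100.0) for p, q in (likely_buyer, persuadable)]
```

In this toy setting the value-based rule bids more for the user who would convert anyway, while the lift-based rule bids more for the user whose behavior the ad actually changes.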

preprint2016arXiv

Long-time asymptotics for the short pulse equation

In this paper, we analyze the long-time behavior of the solution of the initial value problem (IVP) for the short pulse (SP) equation. As the SP equation is a completely integrable system, which possesses a Wadati-Konno-Ichikawa (WKI)-type Lax pair, we formulate a $2\times 2$ matrix Riemann-Hilbert problem for this IVP by using the inverse scattering method. Since the spectral variable $k$ is of the same order in the WKI-type Lax pair, we construct the solution of this IVP parametrically in the new scale $(y,t)$, whereas the original scale $(x,t)$ is given in terms of functions in the new scale, in terms of the solution of this Riemann-Hilbert problem. However, by employing the nonlinear steepest descent method of Deift and Zhou for oscillatory Riemann-Hilbert problems, we can get the explicit leading order asymptotic of the solution of the short pulse equation in the original scale $(x,t)$ as time $t$ goes to infinity.

preprint2016arXiv

Representing higher-order dependencies in networks

To ensure the correctness of network analysis methods, the network (as the input) has to be a sufficiently accurate representation of the underlying data. However, when representing sequential data from complex systems such as global shipping traffic or web clickstream traffic as networks, conventional network representations that implicitly assume the Markov property (first-order dependency) can quickly become limiting. This assumption holds that when movements are simulated on the network, the next movement depends only on the current node, discounting the fact that the movement may depend on several previous steps. However, we show that data derived from many complex systems can show up to fifth-order dependencies. In these cases, the oversimplifying assumption of the first-order network representation can lead to inaccurate network analysis results. To address this problem, we propose the Higher-Order Network (HON) representation that can discover and embed variable orders of dependencies in a network representation. Through a comprehensive empirical evaluation and analysis, we establish several desirable characteristics of HON, including accuracy, scalability, and direct compatibility with the existing suite of network analysis methods. We illustrate how HON can be applied to a broad variety of tasks, such as random walking, clustering, and ranking, and we demonstrate that by using it as input, HON yields more accurate results without any modification to these tasks.
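The limitation of the first-order assumption can be shown by comparing next-step distributions conditioned on one versus two previous nodes. The sketch below only counts transitions in raw paths; it is not the HON construction algorithm from the paper, and the paths and function name are hypothetical.

```python
from collections import Counter, defaultdict


def next_step_distributions(paths, order):
    """Empirical next-node distribution conditioned on the last `order` nodes."""
    counts = defaultdict(Counter)
    for path in paths:
        for t in range(order, len(path)):
            context = tuple(path[t - order:t])
            counts[context][path[t]] += 1
    return {
        ctx: {nxt: c / sum(ctr.values()) for nxt, c in ctr.items()}
        for ctx, ctr in counts.items()
    }


# Hypothetical trajectories: where a walker goes after C depends on where it came from.
paths = [["A", "C", "D"], ["A", "C", "D"], ["B", "C", "E"], ["B", "C", "E"]]

first_order = next_step_distributions(paths, order=1)
second_order = next_step_distributions(paths, order=2)
```

Conditioning on the current node alone makes the step after C look like an even split between D and E, whereas conditioning on the previous node as well shows that each trajectory is fully determined.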

preprint2015arXiv

Initial-boundary value problem for integrable nonlinear evolution equations with $3\times 3$ Lax pairs on the interval

We present an approach for analyzing initial-boundary value problems formulated on the finite interval ($0\le x\le L$, where $L$ is a positive constant) for integrable equations whose Lax pairs involve $3\times 3$ matrices. Boundary value problems for integrable nonlinear evolution PDEs can be analyzed by the unified method introduced by Fokas and developed by him and his collaborators. In this paper, we show that the solution can be expressed in terms of the solution of a $3\times 3$ Riemann-Hilbert problem. The relevant jump matrices are explicitly given in terms of the three matrix-valued spectral functions $s(k)$, $S(k)$ and $S_L(k)$, which in turn are defined in terms of the initial values, boundary values at $x=0$ and boundary values at $x=L$, respectively. However, these spectral functions are not independent; they satisfy a global relation. Here, we show that the characterization of the unknown boundary values in terms of the given initial and boundary data is explicitly described for a nonlinear evolution PDE defined on the interval. Also, we show that in the limit when the length of the interval tends to infinity, the relevant formulas reduce to the analogous formulas obtained for the case of boundary value problems formulated on the half-line.

preprint2015arXiv

Interferometric detection of Chern numbers in topological optical lattices

Topological states of matter have emerged as a new type of quantum phase, which can be distinguished by their associated topological invariants, e.g., Chern numbers. Currently, there is increasing interest in the physical detection of the newly predicted topological phases. Here, we propose an interferometric approach to directly measure the Chern number in a topological optical lattice via detecting the associated Zak phase. We show that this interferometric approach can distinguish Zak phases of plus or minus 2π from 0 in the first Brillouin zone, and thus provides a new tool to directly detect the Chern number of topological systems. In addition, we demonstrate that this method is feasible under realistic experimental conditions and widely applicable for many systems. Finally, this scheme can be readily generalized to detect high Chern number systems.

preprint2015arXiv

Large n-limit for Random matrices with External Source with 3 eigenvalues

In this paper, we analyze the large n-limit for random matrices with an external source with three distinct eigenvalues. We confine ourselves to the Hermite case, where the three distinct eigenvalues are $-a,0,a$. For the case $a^2>3$, we establish the universal behavior of local eigenvalue correlations in the limit $n\rightarrow \infty$, which is known from unitarily invariant random matrix models. Thus, local eigenvalue correlations are expressed in terms of the sine kernel in the bulk and in terms of the Airy kernel at the edge of the spectrum. The result can be obtained by analyzing a $4\times 4$ Riemann-Hilbert problem via the nonlinear steepest descent method.

preprint2015arXiv

Smart Pacing for Effective Online Ad Campaign Optimization

In targeted online advertising, advertisers seek to maximize campaign performance under delivery constraints within a budget schedule. Most advertisers typically prefer to impose the delivery constraint to spend budget smoothly over time in order to reach a wider range of audiences and have a sustainable impact. Since lots of impressions are traded through public auctions for online advertising today, the liquidity makes price elasticity and bid landscape between demand and supply change quite dynamically. Therefore, it is challenging to perform smooth pacing control and maximize campaign performance simultaneously. In this paper, we propose a smart pacing approach in which the delivery pace of each campaign is learned from both offline and online data to achieve smooth delivery and optimal performance goals. The implementation of the proposed approach in a real DSP system is also presented. Experimental evaluations on both real online ad campaigns and offline simulations show that our approach can effectively improve campaign performance and achieve delivery goals.

preprint2015arXiv

The GLM representation of the global relation for the two-component nonlinear Schrödinger equation on the interval

In a previous work, we showed that the solution of the initial-boundary value problem for the two-component nonlinear Schrödinger equation on the finite interval can be expressed in terms of the solution of a $3\times 3$ Riemann-Hilbert problem. The relevant jump matrices are explicitly given in terms of the three matrix-valued spectral functions $s(k)$, $S(k)$ and $S_L(k)$, which in turn are defined in terms of the initial values, boundary values at $x=0$ and boundary values at $x=L$, respectively. However, for a well-posed problem, only part of the boundary values can be prescribed; the remaining boundary data cannot be independently specified, but are determined by the so-called global relation. Here, we use a Gelfand-Levitan-Marchenko representation to derive an expression for the generalized Dirichlet-to-Neumann map that characterizes the unknown boundary values in the physical domain. This differs from the approach used in the previous work, which analyzed the global relation in the spectral domain. We also show that these two representations are equivalent.

preprint2015arXiv

The Ostrovsky-Vakhnenko equation on the half-line: a Riemann-Hilbert approach

We analyze an initial-boundary value problem for the Ostrovsky-Vakhnenko equation on the half-line. This equation can be viewed as the short wave model for the Degasperis-Procesi (DP) equation. We show that the solution $u(x,t)$ can be recovered from its initial and boundary values via the solution of a $3\times 3$ vector Riemann-Hilbert problem formulated in the complex plane of a spectral parameter $z$.

preprint2015arXiv

The unified transform method for the Sasa-Satsuma equation on the interval

We present a Riemann-Hilbert problem formalism for the initial-boundary value problem for the Sasa-Satsuma (SS) equation on the finite interval. Assuming that the solution exists, we show that this solution can be expressed in terms of the solution of a $3\times 3$ Riemann-Hilbert problem. The relevant jump matrices are explicitly given in terms of the three matrix-valued spectral functions $s(k)$, $S(k)$ and $S_L(k)$, which in turn are defined in terms of the initial values, boundary values at $x=0$ and boundary values at $x=L$, respectively. However, for a well-posed problem, only part of the boundary values can be prescribed; the remaining boundary data cannot be independently specified, but are determined by the so-called global relation. Here, we analyze the global relation to characterize the unknown boundary values in terms of the given initial and boundary data.

preprint2014arXiv

Global quantum discord in infinite quantum spin chains

In this paper, we study global quantum discord (GQD) in infinite-size spin chains. For this purpose, in the framework of matrix product states (MPSs), we propose an effective procedure to calculate GQD (denoted as Gn) for consecutive n-site subchains in infinite chains. For a spin-1/2 three-body interaction model, whose ground state can be exactly expressed as MPSs, we use the procedure to study Gn with n up to $24$. Then for a spin-1/2 XXZ chain, we first use the infinite time-evolving block decimation (iTEBD) algorithm to obtain the approximate wavefunction in the form of MPSs, and then compute Gn with n up to $18$. In both models, Gn shows an interesting linear growth with increasing n, that is, Gn = k*n+b. Moreover, in non-critical regions the slope $k$ of Gn converges very fast, while in critical regions it converges relatively slowly, and the behaviors are explained in a clear physical picture with the short-range and long-range correlations. Based on these results, we propose to use Gn/n to describe the global correlations in infinite chains. Gn/n has twofold physical meanings. Firstly, it can be regarded as "global discord per site", very similar to "energy per site" or "magnetization per site" in quantum magnetic systems. Secondly, Gn/n (when n is large enough) describes the quantum correlation between a single site and an (n-1)-site block. Then we successfully apply our theory to an exactly soluble infinite-size spin XY chain which is beyond the matrix product formula, and the Hamiltonian can reduce to the transverse-field Ising model and the XX model. The relation between GQD and quantum phase transitions in these models is discussed.
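The step from linear growth to a per-site quantity can be made explicit. Assuming the linear form Gn = k*n+b holds for large n, the short derivation below shows why Gn/n approaches the slope; reading $k$ as the large-$n$ value of the "global discord per site" is an interpretation added here, not a statement quoted from the abstract.

```latex
G_n = k\,n + b
\quad\Longrightarrow\quad
\frac{G_n}{n} = k + \frac{b}{n}
\;\longrightarrow\; k
\qquad (n \to \infty).
```

The offset $b$ contributes only a correction of order $1/n$, which is consistent with the remark that Gn/n becomes meaningful when n is large enough.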

preprint2014arXiv

Multi-partite quantum nonlocality and Bell-type inequalities in an infinite-order quantum phase transition of the one-dimensional spin-1/2 XXZ chain

In this paper, combining the infinite time-evolving block decimation (iTEBD) algorithm with Bell-type inequalities, we investigate multi-partite quantum nonlocality in an infinite one-dimensional quantum spin-1/2 XXZ system. A high hierarchy of multipartite nonlocality can be observed in the gapless phase of the model, whereas only the lowest hierarchy of multipartite nonlocality is observed in most regions of the gapped anti-ferromagnetic phase. Thereby, Bell-type inequalities disclose different correlation structures in the two phases of the system. Furthermore, at the infinite-order QPT (or Kosterlitz-Thouless QPT) point of the model, the correlation measures always show a local minimum value, regardless of the length of the subchains. It indicates that a relatively low hierarchy of multi-partite nonlocality would be observed at the infinite-order QPT point in a Bell-type experiment. The result is in contrast to the existing results of the second-order QPT in the one-dimensional XY model, where multi-partite nonlocality with the hierarchy has been observed. Thus, multi-partite nonlocality provides us with an alternative perspective to distinguish between these two kinds of QPTs. Reliable clues for the existence of tripartite quantum entanglement have also been found.

preprint2013arXiv

Hierarchical dynamics for system-bath coherence correlation spectrum

We propose a quasi-particle description for the hierarchical equations of motion formalism for quantum dissipative dynamics systems. Not only does it provide an alternative mathematical means to the existing formalism, but the new protocol also explicitly clarifies the physical meanings of the auxiliary density operators and their relations to the full statistics of the solvation bath variables. Combining it with standard linear response theory, we further construct the hierarchical dynamics formalism for the correlated spectrum of system-bath coherence. We evaluate the spectrum matrix for a demonstrative spin-boson system-bath model. While each diagonal element of the spectrum matrix describes the system or the solvation bath correlation, the off-diagonal elements characterize the correlation between system and bath solvation dynamics.

preprint2013arXiv

Leading-order temporal asymptotics of the Fokas-Lenells Equation without solitons

We use the Deift-Zhou method to obtain, in the solitonless sector, the leading-order asymptotics of the solution to the Cauchy problem of the Fokas-Lenells equation as $t\to+\infty$ on the full line.

preprint2013arXiv

Long-time asymptotic for the derivative nonlinear Schrödinger equation with step-like initial value

We consider the Cauchy problem for the Gerdjikov-Ivanov (GI) type of the derivative nonlinear Schrödinger (DNLS) equation $$iq_t+q_{xx}-iq^2\bar{q}_x+\frac{1}{2}|q|^4{q}=0$$ with step-like initial data: $q(x,0)=0$ for $x\le 0$ and $q(x,0)=Ae^{-2iBx}$ for $x>0$, where $A>0$ and $B\in \mathbb{R}$ are constants. The paper aims at studying the long-time asymptotics of the solution to this problem. We show that there are four regions in the half-plane $-\infty<x<\infty$, $t>0$, where the asymptotics has qualitatively different forms: a slowly decaying self-similar wave of Zakharov-Manakov type for $x>-4tB$, a plane wave region: $x<-4t(B+\sqrt{2A^2(B+\frac{A^2}{4})})$, an elliptic region: $-4t(B+\sqrt{2A^2(B+\frac{A^2}{4})})<x<-4tB$. The main tool is the asymptotic analysis of an associated matrix Riemann-Hilbert problem.

preprint2013arXiv

Multi-hump solitary waves of nonlinear Dirac equation

This paper concentrates on a (1+1)-dimensional nonlinear Dirac (NLD) equation with a general self-interaction, being a linear combination of the scalar, pseudoscalar, vector and axial vector self-interactions to the power of the integer $k+1$. The solitary wave solutions to the NLD equation are analytically derived, and the upper bounds of the hump number in the charge, energy and momentum densities for the solitary waves are proved in theory. The results show that: (1) for a given integer $k$, the hump number in the charge density is not bigger than $4$, while that in the energy density is not bigger than $3$; (2) those upper bounds can only be achieved in the situation of higher nonlinearity, namely, $k\in\{5,6,7,\cdots \}$ for the charge density and $k\in\{3,5,7,\cdots\}$ for the energy density; (3) the momentum density has the same multi-hump structure as the energy density; (4) more than two humps (resp. one hump) in the charge (resp. energy) density can only happen under the linear combination of the pseudoscalar self-interaction and at least one of the scalar and vector (or axial vector) self-interactions. Our results on the multi-hump structure will be interesting in the interaction dynamics for the NLD solitary waves.

preprint2013arXiv

Numerical methods for nonlinear Dirac equation

This paper presents a review of the current state of the art of numerical methods for the nonlinear Dirac (NLD) equation. Several methods are extended and proposed for the (1+1)-dimensional NLD equation with the scalar and vector self-interaction and analyzed in terms of accuracy and time reversibility as well as the conservation of the discrete charge, energy and linear momentum. Those methods are the Crank-Nicolson (CN) schemes, the linearized CN schemes, the odd-even hopscotch scheme, the leapfrog scheme, a semi-implicit finite difference scheme, and the exponential operator splitting (OS) schemes. The nonlinear subproblems resulting from the OS schemes are analytically solved by fully exploiting the local conservation laws of the NLD equation. The effectiveness of the various numerical methods, with special focus on the error growth and the computational cost, is illustrated on two numerical experiments and compared to two high-order accurate Runge-Kutta discontinuous Galerkin methods. Theoretical and numerical comparisons show that the high-order accurate OS schemes may compete well with the other numerical schemes discussed here in terms of accuracy and efficiency. A fourth-order accurate OS scheme is further applied to investigating the interaction dynamics of the NLD solitary waves under the scalar and vector self-interaction. The results show that the interaction dynamics of two NLD solitary waves depend on the exponent power of the self-interaction in the NLD equation; collapse happens after the collision of two equal one-humped NLD solitary waves under the cubic vector self-interaction, in contrast to collapse-free scattering in the corresponding quadratic case.

preprint2013arXiv

The Fokas method to the Sasa-Satsuma equation on the half-line

We present a Riemann-Hilbert problem formalism for the initial-boundary value problem for the Sasa-Satsuma (SS) equation $iq_T+\frac{1}{2}q_{XX}+|q|^2q+i\varepsilon (q_{XXX}+6|q|^2q_X+3q(|q|^2)_X)=0$ on the half-line. We also analyze the global relation in this paper.

preprint2013arXiv

The unified method for the three-wave equation on the half-line

We present a Riemann-Hilbert problem formalism for the initial-boundary value problem for the three-wave equation \[p_{ij,t}-\frac{b_i-b_j}{a_i-a_j}p_{ij,x}+\sum_k\left(\frac{b_k-b_j}{a_k-a_j}-\frac{b_i-b_k}{a_i-a_k}\right)p_{ik}p_{kj}=0,\quad i,j,k=1,2,3,\] on the half-line.

preprint2013arXiv

Two parameters scaling approach to Anderson localization of weekly interacting BEC

We numerically study the Anderson localization of a weakly interacting Bose-Einstein condensate in a one-dimensional disordered potential. We show that two parameters are needed to completely describe such a system, whose density profile can be described by the sum of two exponential functions. This is a new attempt at a precise description of systems with an interplay of disorder and interaction.

preprint2012arXiv

Long-time asymptotic for the derivative nonlinear Schrödinger equation with decaying initial value

We present a new Riemann-Hilbert problem formalism for the initial value problem for the derivative nonlinear Schrödinger (DNLS) equation on the line. We show that the solution of this initial value problem can be obtained from the solution of an associated Riemann-Hilbert problem. This new Riemann-Hilbert problem for the DNLS equation allows us to use the nonlinear steepest-descent/stationary phase method, or Deift-Zhou method, to derive the long-time asymptotics for the DNLS equation on the line.

preprint2012arXiv

The derivative nonlinear Schrodinger equation on the interval

We use the Fokas method to analyze the derivative nonlinear Schrödinger (DNLS) equation $iq_t(x,t)=-q_{xx}(x,t)+(r q^2)_x$ on the interval $[0,L]$. Assuming that the solution $q(x,t)$ exists, we show that it can be represented in terms of the solution of a matrix Riemann-Hilbert problem formulated in the plane of the complex spectral parameter $\x$. This problem has explicit $(x,t)$ dependence, and it has jumps across $\{\x \in \mathbb{C}\,|\,\operatorname{Im}{\x^4}=0 \}$. The relevant jump matrices are explicitly given in terms of the spectral functions $\{a(\x),b(\x)\},\{A(\x),B(\x)\}$, and $\{\ca({\x}),\cb(\x)\}$, which in turn are defined in terms of the initial data $q_0(x)=q(x,0)$, the boundary data $g_0(t)=q(0,t),g_1(t)=q_x(0,t)$, and the other boundary values $f_0(t)=q(L,t),f_1(t)=q_x(L,t)$. The spectral functions are not independent, but related by a compatibility condition, the so-called global relation.

preprint2011arXiv

Advancing hierarchical equations of motion for efficient evaluation of coherent two-dimensional spectroscopy

To advance hierarchical equations of motion as a standard theory for quantum dissipative dynamics, we put forward a mixed Heisenberg-Schrödinger scheme with a block-matrix implementation for efficient evaluation of nonlinear optical response functions. The new approach is also integrated with the optimized hierarchical theory and a numerical filtering algorithm. Different configurations of coherent two-dimensional spectroscopy of model excitonic dimer systems are investigated, with a focus on the effects of intermolecular transfer coupling and bi-exciton interaction.

preprint2011arXiv

Optimized hierarchical equations of motion for Drude dissipation

The hierarchical equations of motion theory for Drude dissipation is optimized, with a convenient convergence criterion proposed in advance of numerical propagations. The theoretical construction is on the basis of a Padé spectrum decomposition that has been qualified to be the best sum-over-poles scheme for the quantum distribution function. The resulting hierarchical dynamics under the a priori convergence criterion are exemplified with a benchmark spin-boson system, and also with the transient absorption and two-dimensional spectroscopy of a model exciton dimer system.

preprint2011arXiv

Technology ready use of single layer graphene as a transparent electrode for hybrid photovoltaic devices

Graphene has been used recently as a replacement for indium tin oxide (ITO) for the transparent electrode of an organic photovoltaic device. Due to its limited supply, ITO is considered as a limiting factor for the commercialization of organic solar cells. We explored the use of large-area graphene grown on copper by chemical vapor deposition (CVD) and then transferred to a glass substrate as an alternative transparent electrode. The transferred film was shown by scanning Raman spectroscopy measurements to consist of >90% single layer graphene. Optical spectroscopy measurements showed that the layer-transferred graphene has an optical absorbance of 1.23% at a wavelength of 532 nm. We fabricated organic hybrid solar cells utilizing this material as an electrode and compared their performance with ITO devices fabricated using the same procedure. We demonstrated power conversion efficiency up to 3.98%, higher than that of the ITO device (3.86%), showing that layer-transferred graphene promises to be a high quality, low-cost, flexible material for transparent electrodes in solar cell technology.

preprint2009arXiv

Exact quantum dissipative dynamics under external time-dependent fields driving

An exact and nonperturbative quantum master equation can be constructed via the calculus on path integrals. It results in hierarchical equations of motion for the reduced density operator. Also involved is a set of well-defined auxiliary density operators that resolve not just the system-bath coupling strength but also memory. In this work, we scale these auxiliary operators individually to achieve a uniform error tolerance, as set by the reduced density operator. An efficient propagator is then proposed for the hierarchical Liouville-space dynamics of quantum dissipation. Numerically exact studies are carried out on the dephasing effect on population transfer in the simple stimulated Raman adiabatic passage scheme. We also assess the applicability of several perturbative theories to the present system of study.

preprint2009arXiv

Hierarchical quantum master equation with semiclassical Drude dissipation

We propose a nonperturbative quantum dissipation theory, in terms of a hierarchical quantum master equation. It may be applied with a great degree of confidence to various dynamics systems in condensed phases. The theoretical development is rooted in an improved semiclassical treatment of the Drude bath, beyond the conventional high-temperature approximations. The new theory is a simple modification of, but an important improvement over, the conventional stochastic Liouville equation theory, without extra numerical cost. Its broad range of validity and applicability is extensively demonstrated with two-level electron transfer model systems, where the new theory can be considered as the modified Zusman equation. We also present a criterion, which depends only on the system-bath coupling strength, characteristic bath memory time, and temperature, to estimate the performance of the hierarchical quantum master equation.

preprint2009arXiv

Hierarchical theory of quantum dissipation: Partial fraction decomposition scheme

We propose a partial fraction decomposition scheme for the construction of the hierarchical equations of motion theory for bosonic quantum dissipation systems. The expansion of the Bose-Einstein function in this scheme shows properties similar to those it shows for the Fermi function. The performance of the resulting quantum dissipation theory is exemplified with spin-boson systems. In all cases we have tested, the new theory performs much better, about an order of magnitude faster, than the best available conventional theory based on the Matsubara spectral decomposition scheme.

preprint2009arXiv

Implementation of local and high-fidelity quantum conditional phase gates in a scalable two-dimensional ion trap

We propose a scheme to implement high-fidelity conditional phase gates on a pair of trapped ions immersed in a two-dimensional Coulomb crystal, using the interaction mediated by all axial modes without side-band addressing. We show through numerical calculations that only local modes can be excited to achieve entangling gates through shaping the laser beams, so that the complexity of the quantum gate does not increase with the size of the system. These results suggest a promising approach for the realization of large-scale fault-tolerant quantum computation in two-dimensional trap architectures.

66 published item(s)

On the Approximation Complexity of Matrix Product Operator Born Machines

3D large-scale fused silica microfluidic chips enabled by hybrid laser microfabrication for continuous-flow UV photochemical synthesis

A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising

AMCAD: Adaptive Mixed-Curvature Representation based Advertisement Retrieval System

An Information-theoretic Method for Collaborative Distributed Learning with Limited Communication

Artificial Intelligence Enables Real-Time and Intuitive Control of Prostheses via Nerve Interface

Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction

FedHAP: Federated Hashing with Global Prototypes for Cross-silo Retrieval

Impression Allocation and Policy Search in Display Advertising

Learning Disentangled Representations for Counterfactual Regression via Mutual Information Minimization

Leaving No One Behind: A Multi-Scenario Multi-Task Meta Learning Approach for Advertiser Modeling

Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation From Monocular RGB Image

TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed

Unconventional steady states and topological phases in an open two-level non-Hermitian system

Visual Encoding and Debiasing for CTR Prediction

Computation Resource Allocation Solution in Recommender Systems

Optimizing Multiple Performance Metrics with Deep GSP Auctions for E-commerce Advertising

A Deep Prediction Network for Understanding Advertiser Intent and Satisfaction

A Deep Recurrent Survival Model for Unbiased Ranking

Anapole mediated giant photothermal nonlinearity in nanostructured silicon

Building a PubMed knowledge graph

Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

Field-weighted Factorization Machines for Click-Through Rate Prediction in Display Advertising

Generalized bioinspired approach to a daytime radiative cooling "skin"

GmFace: A Mathematical Model for Face Image Representation Using Multi-Gaussian

Great Chiral Fluorescence from Optical Duality Silver Nanostructures Enabled by 3D Laser Printing

Inverse scattering transform for the Kundu-Eckhaus Equation with nonzero boundary condition

Joint and Progressive Subspace Analysis (JPSA) with Spatial-Spectral Manifold Alignment for Semi-Supervised Hyperspectral Dimensionality Reduction

Learning Optimal Tree Models Under Beam Search

Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising

Learning to Infer User Hidden States for Online Sequential Advertising

Real-Time Cardiac Cine MRI with Residual Convolutional Recurrent Neural Network

Top-Down Shape Abstraction Based on Greedy Pole Selection

Freeform microfluidic networks encapsulated in laser printed three-dimensional macro-scale glass objects

Self-limited Growth of an Oxyhydroxide Phase at the Fe3O4(001) Surface in Liquid and Ambient Pressure Water

Polarization-insensitive space-selective etching in fused silica induced by picosecond laser irradiation

Lift-Based Bidding in Ad Selection

Long-time asymptotics for the short pulse equation

Representing higher-order dependencies in networks

Initial-boundary value problem for integrable nonlinear evolution equations with $3\times 3$ Lax pairs on the interval

Interferometric detection of Chern numbers in topological optical lattices

Large n-limit for Random matrices with External Source with 3 eigenvalues

Smart Pacing for Effective Online Ad Campaign Optimization

The GLM representation of the global relation for the two-component nonlinear Schrödinger equation on the interval

The Ostrovsky-Vakhnenko equation on the half-line: a Riemann-Hilbert approach

The unified transform method for the Sasa-Satsuma equation on the interval

Global quantum discord in infinite quantum spin chains

Multi-partite quantum nonlocality and Bell-type inequalities in an infinite-order quantum phase transition of the one-dimensional spin-1/2 XXZ chain

Hierarchical dynamics for system-bath coherence correlation spectrum

Leading-order temporal asymptotics of the Fokas-Lenells Equation without solitons

Long-time asymptotic for the derivative nonlinear Schrödinger equation with step-like initial value

Multi-hump solitary waves of nonlinear Dirac equation

Numerical methods for nonlinear Dirac equation

The Fokas method to the Sasa-Satsuma equation on the half-line

The unified method for the three-wave equation on the half-line

Two parameters scaling approach to Anderson localization of weekly interacting BEC

Long-time asymptotic for the derivative nonlinear Schrödinger equation with decaying initial value

The derivative nonlinear Schrodinger equation on the interval

Advancing hierarchical equations of motion for efficient evaluation of coherent two-dimensional spectroscopy

Optimized hierarchical equations of motion for Drude dissipation

Technology ready use of single layer graphene as a transparent electrode for hybrid photovoltaic devices

Exact quantum dissipative dynamics under external time-dependent fields driving

Hierarchical quantum master equation with semiclassical Drude dissipation

Hierarchical theory of quantum dissipation: Partial fraction decomposition scheme

Implementation of local and high-fidelity quantum conditional phase gates in a scalable two-dimensional ion trap