Source author record

Junqi Wang

Junqi Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning cond-mat.soft hep-ph hep-th math.OC Multiagent Systems

Catalog footprint

What is connected

5works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2021arXiv

Distributionally-Constrained Policy Optimization via Unbalanced Optimal Transport

We consider constrained policy optimization in Reinforcement Learning, where the constraints are in form of marginals on state visitations and global action executions. Given these distributions, we formulate policy optimization as unbalanced optimal transport over the space of occupancy measures. We propose a general purpose RL objective based on Bregman divergence and optimize it using Dykstra's algorithm. The approach admits an actor-critic algorithm for when the state or action space is large, and only samples from the marginals are available. We discuss applications of our approach and provide demonstrations to show the effectiveness of our algorithm.

preprint2021arXiv

Efficient Discretizations of Optimal Transport

Obtaining solutions to Optimal Transportation (OT) problems is typically intractable when the marginal spaces are continuous. Recent research has focused on approximating continuous solutions with discretization methods based on i.i.d. sampling, and has proven convergence as the sample size increases. However, obtaining OT solutions with large sample sizes requires intensive computation effort, that can be prohibitive in practice. In this paper, we propose an algorithm for calculating discretizations with a given number of points for marginal distributions, by minimizing the (entropy-regularized) Wasserstein distance, and result in plans that are comparable to those obtained with much larger numbers of i.i.d. samples. Moreover, a local version of such discretizations which is parallelizable for large scale applications is proposed. We prove bounds for our approximation and demonstrate performance on a wide range of problems.

preprint2020arXiv

Sequential Cooperative Bayesian Inference

Cooperation is often implicitly assumed when learning from other agents. Cooperation implies that the agent selecting the data, and the agent learning from the data, have the same goal, that the learner infer the intended hypothesis. Recent models in human and machine learning have demonstrated the possibility of cooperation. We seek foundational theoretical results for cooperative inference by Bayesian agents through sequential data. We develop novel approaches analyzing consistency, rate of convergence and stability of Sequential Cooperative Bayesian Inference (SCBI). Our analysis of the effectiveness, sample efficiency and robustness show that cooperation is not only possible in specific instances but theoretically well-founded in general. We discuss implications for human-human and human-machine cooperation.

preprint2014arXiv

Lattice Boltzmann kinetic modeling and simulation of thermal liquid-vapor system

We present a highly efficient lattice Boltzmann (LB) kinetic model for thermal liquid-vapor system. Three key components are as beow: (i) a discrete velocity model by Kataoka \emph{et al.} [Phys. Rev. E \textbf{69}, 035701(R)(2004)]; (ii) a forcing term $I_{i}$ aiming to describe the interfacial stress and recover the van der Waals equation of state by Gonnella \emph{et al.} [Phys. Rev. E \textbf{76}, 036703 (2007)]; and (iii) a Windowed Fast Fourier Transform (WFFT) scheme and its inverse by our group [Phys. Rev. E \textbf{84}, 046715 (2011)] for solving the spatial derivatives, together with a second-order Runge-Kutta (RK) finite difference scheme for solving the temporal derivative in the LB equation. The model is verified and validated by well-known benchmark tests. The results recovered from the present model are well consistent with previous ones[Phys. Rev. E \textbf{84}, 046715 (2011)] or theoretical analysis. The usage of less discrete velocities, high-order RK algorithm and WFFT scheme with 16th-order in precision makes the model more efficient by about $10$ times and more accurate than the original one.

preprint2009arXiv

BCFW Recursion Relation with Nonzero Boundary Contribution

The appearance of BCFW on-shell recursion relation has deepen our understanding of quantum field theory, especially the one with gauge boson and graviton. To be able to write the BCFW recursion relation, the knowledge of boundary contributions is needed. So far, most applications have been constrained to the cases where the boundary contribution is zero. In this paper, we show that for some theories, although there is no proper deformation to annihilate the boundary contribution, its effects can be analyzed in simple way, thus we do able to write down the BCFW recursion relation with boundary contributions. The examples we will present in this paper include the lambda-phi-four theory and Yukawa coupling between fermions and scalars.