Researcher profile

Xia Han

Xia Han contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
2topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2022arXiv

Choquet regularization for reinforcement learning

We propose \emph{Choquet regularizers} to measure and manage the level of exploration for reinforcement learning (RL), and reformulate the continuous-time entropy-regularized RL problem of Wang et al. (2020, JMLR, 21(198)) in which we replace the differential entropy used for regularization with a Choquet regularizer. We derive the Hamilton--Jacobi--Bellman equation of the problem, and solve it explicitly in the linear--quadratic (LQ) case via maximizing statically a mean--variance constrained Choquet regularizer. Under the LQ setting, we derive explicit optimal distributions for several specific Choquet regularizers, and conversely identify the Choquet regularizers that generate a number of broadly used exploratory samplers such as $ε$-greedy, exponential, uniform and Gaussian.

preprint2022arXiv

Risk Concentration and the Mean-Expected Shortfall Criterion

Expected Shortfall (ES, also known as CVaR) is the most important coherent risk measure in finance, insurance, risk management, and engineering. Recently, Wang and Zitikis (2021) put forward four economic axioms for portfolio risk assessment and provide the first economic axiomatic foundation for the family of ES. In particular, the axiom of no reward for concentration (NRC) is arguably quite strong, which imposes an additive form of the risk measure on portfolios with a certain dependence structure. We move away from the axiom of NRC by introducing the notion of concentration aversion, which does not impose any specific form of the risk measure. It turns out that risk measures with concentration aversion are functions of ES and the expectation. Together with the other three standard axioms of monotonicity, translation invariance and lower semicontinuity, concentration aversion uniquely characterizes the family of ES. In addition, we establish an axiomatic foundation for the problem of mean-ES portfolio selection and new explicit formulas for convex and consistent risk measures. Finally, we provide an economic justification for concentration aversion via a few axioms on the attitude of a regulator towards dependence structures.