Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

Truth or Tribe: How In-group Favoritism Prioritize Facts in Persona Agents

In-group favoritism refers to the phenomena of favoring members of one's in-group over out-group members and is widely observed in numerous social cooperative behaviors. Recently, in-group favoritism biases have also been identified in generative language models. However, whether the in-group favoritism exists when persona agents are faced with contradicting information (e.g., misinformation), and how to mitigate the adverse effects of in-group favoritism biases in persona agents have been understudied. To address these problems, we propose a Truth or Tribe simulation framework to study the agent cooperation within the spread of contradicting information through a triadic interaction paradigm, and conduct controlled trials to evaluate the primary moderating factors. Extensive results showcase that persona agents display strong in-group favoritism, accepting incorrect answers from identity-similar peers at much higher rates than from dissimilar peers. In-group favoritism continues to emerge in defeasible reasoning contexts where no absolute truth exists, and it intensifies as cognitive complexity increases. Furthermore, three intervention strategies--Identity-Blind Instruction, Structured Counterfactual Reasoning, and Heterogeneous Perspective Ensemble--are proposed to mitigate the in-group favoritism.

preprint2024arXiv

CscK metrics near the canonical class

Let $X$ be a Kähler manifold with semi-ample canonical bundle $K_X$. It is proved by Jian-Shi-Song that for any Kähler class $γ$, there exists $δ>0$ such that for all $t\in (0, δ)$ there exists a unique cscK metric $g_t$ in $K_X+ t γ$. In this paper, we prove that $\{ (X, g_t) \}_{ t\in (0, δ)} $ have uniformly bounded Kähler potentials, volume forms and diameters. As a consequence, these metric spaces are pre-compact in the Gromov-Hausdorff sense.

preprint2022arXiv

Dynamical evolution in the D1D5 CFT

It is interesting to ask: how does the radial space direction emerge from the CFT in gauge-gravity duality? In this context we resolve a long-standing puzzle with the gravity duals of two classes of states in the D1D5 CFT. For each class the CFT states are in the untwisted sector, suggesting that the energy gap should be $1/R_y$ where $R_y$ is the radius of the circle on which the D1D5 CFT is compactified. For one class of states, the gravity dual indeed has exactly this gap, while for the other class, the gravity dual has a very deep throat, leading to an energy gap much smaller than $1/R_y$. We resolve this puzzle by showing that for the latter class of states, perturbing the CFT off its free point leads to the formation of a band structure in the CFT. We also explain why such a band structure does not arise for the first class of states. Thus for the case where a deep throat emerges in the gravity description, the dynamics of falling down this throat is described in the CFT as a sequential `hopping' between states all of which have the same energy at the free point; this hopping amplitude converts an integer spaced spectrum into a closely spaced band of energy levels.

preprint2022arXiv

Efficient Parallel Graph Trimming by Arc-Consistency

Given a large data graph, trimming techniques can reduce the search space by removing vertices without outgoing edges. One application is to speed up the parallel decomposition of graphs into strongly connected components (SCC decomposition), which is a fundamental step for analyzing graphs. We observe that graph trimming is essentially a kind of arc-consistency problem, and AC-3, AC-4, and AC-6 are the most relevant arc-consistency algorithms for application to graph trimming. The existing parallel graph trimming methods require worst-case $\mathcal O(nm)$ time and worst-case $\mathcal O(n)$ space for graphs with $n$ vertices and $m$ edges. We call these parallel AC-3-based as they are much like the AC-3 algorithm. In this work, we propose AC-4-based and AC-6-based trimming methods. That is, AC-4-based trimming has an improved worst-case time of $\mathcal O(n+m)$ but requires worst-case space of $\mathcal O(n+m)$; compared with AC-4-based trimming, AC-6-based has the same worst-case time of $\mathcal O(n+m)$ but an improved worst-case space of $\mathcal O(n)$. We parallelize the AC-4-based and AC-6-based algorithms to be suitable for shared-memory multi-core machines. The algorithms are designed to minimize synchronization overhead. For these algorithms, we also prove the correctness and analyze time complexities with the work-depth model. In experiments, we compare these three parallel trimming algorithms over a variety of real and synthetic graphs. Specifically, for the maximum number of traversed edges per worker by using 16 workers, AC-3-based traverses up to 58.3 and 36.5 times more edges than AC-6-based trimming and AC-4-based trimming, respectively.

preprint2022arXiv

Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review

The intelligent dialogue system, aiming at communicating with humans harmoniously with natural language, is brilliant for promoting the advancement of human-machine interaction in the era of artificial intelligence. With the gradually complex human-computer interaction requirements (e.g., multimodal inputs, time sensitivity), it is difficult for traditional text-based dialogue system to meet the demands for more vivid and convenient interaction. Consequently, Visual Context Augmented Dialogue System (VAD), which has the potential to communicate with humans by perceiving and understanding multimodal information (i.e., visual context in images or videos, textual dialogue history), has become a predominant research paradigm. Benefiting from the consistency and complementarity between visual and textual context, VAD possesses the potential to generate engaging and context-aware responses. For depicting the development of VAD, we first characterize the concepts and unique features of VAD, and then present its generic system architecture to illustrate the system workflow. Subsequently, several research challenges and representative works are detailed investigated, followed by the summary of authoritative benchmarks. We conclude this paper by putting forward some open issues and promising research trends for VAD, e.g., the cognitive mechanisms of human-machine dialogue under cross-modal dialogue context, and knowledge-enhanced cross-modal semantic interaction.

preprint2022arXiv

Green's functions and complex Monge-Ampère equations

Uniform $L^1$ and lower bounds are obtained for the Green's function on compact Kähler manifolds. Unlike in the classic theorem of Cheng-Li for Riemannian manifolds, the lower bounds do not depend directly on the Ricci curvature, but only on integral bounds for the volume form and certain of its derivatives. In particular, a uniform lower bound for the Green's function on Kähler manifolds is obtained which depends only on a lower bound for the scalar curvature and on an $L^q$ norm for the volume form for some $q>1$. The proof relies on auxiliary Monge-Ampère equations, and is fundamentally non-linear. The lower bounds for the Green's function imply in turn $C^1$ and $C^2$ estimates for complex Monge-Ampère equations with a sharper dependence on the function on the right hand side.

preprint2022arXiv

IDNP: Interest Dynamics Modeling using Generative Neural Processes for Sequential Recommendation

Recent sequential recommendation models rely increasingly on consecutive short-term user-item interaction sequences to model user interests. These approaches have, however, raised concerns about both short- and long-term interests. (1) {\it short-term}: interaction sequences may not result from a monolithic interest, but rather from several intertwined interests, even within a short period of time, resulting in their failures to model skip behaviors; (2) {\it long-term}: interaction sequences are primarily observed sparsely at discrete intervals, other than consecutively over the long run. This renders difficulty in inferring long-term interests, since only discrete interest representations can be derived, without taking into account interest dynamics across sequences. In this study, we address these concerns by learning (1) multi-scale representations of short-term interests; and (2) dynamics-aware representations of long-term interests. To this end, we present an \textbf{I}nterest \textbf{D}ynamics modeling framework using generative \textbf{N}eural \textbf{P}rocesses, coined IDNP, to model user interests from a functional perspective. IDNP learns a global interest function family to define each user's long-term interest as a function instantiation, manifesting interest dynamics through function continuity. Specifically, IDNP first encodes each user's short-term interactions into multi-scale representations, which are then summarized as user context. By combining latent global interest with user context, IDNP then reconstructs long-term user interest functions and predicts interactions at upcoming query timestep. Moreover, IDNP can model such interest functions even when interaction sequences are limited and non-consecutive. Extensive experiments on four real-world datasets demonstrate that our model outperforms state-of-the-arts on various evaluation metrics.

preprint2022arXiv

Simplified Algorithms for Order-Based Core Maintenance

Graph analytics attract much attention from both research and industry communities. Due to the linear time complexity, the $k$-core decomposition is widely used in many real-world applications such as biology, social networks, community detection, ecology, and information spreading. In many such applications, the data graphs continuously change over time. The changes correspond to edge insertion and removal. Instead of recomputing the $k$-core, which is time-consuming, we study how to maintain the $k$-core efficiently. That is, when inserting or deleting an edge, we need to identify the affected vertices by searching for more vertices. The state-of-the-art order-based method maintains an order, the so-called $k$-order, among all vertices, which can significantly reduce the searching space. However, this order-based method is complicated for understanding and implementation, and its correctness is not formally discussed. In this work, we propose a simplified order-based approach by introducing the classical Order Data Structure to maintain the $k$-order, which significantly improves the worst-case time complexity for both edge insertion and removal algorithms. Also, our simplified method is intuitive to understand and implement; it is easy to argue the correctness formally. Additionally, we discuss a simplified batch insertion approach. The experiments evaluate our simplified method over 12 real and synthetic graphs with billions of vertices. Compared with the existing method, our simplified approach achieves high speedups up to 7.7x and 9.7x for edge insertion and removal, respectively.

preprint2022arXiv

Uniform entropy and energy bounds for fully non-linear equations

Energy bounds which are uniform in the background metric are obtained from upper bounds for entropy-like quantities. The argument is based on auxiliary Monge-Ampère equations involving sublevel sets, and bypasses the Alexandrov-Bakelman-Pucci maximum principle. In particular, it implies uniform $L^\infty$ bounds for systems coupling a fully non-linear equation to its linearization, generalizing the cscK equation.

preprint2021arXiv

AdaSpring: Context-adaptive and Runtime-evolutionary Deep Model Compression for Mobile Applications

There are many deep learning (e.g., DNN) powered mobile and wearable applications today continuously and unobtrusively sensing the ambient surroundings to enhance all aspects of human lives. To enable robust and private mobile sensing, DNN tends to be deployed locally on the resource-constrained mobile devices via model compression. The current practice either hand-crafted DNN compression techniques, i.e., for optimizing DNN-relative performance (e.g., parameter size), or on-demand DNN compression methods, i.e., for optimizing hardware-dependent metrics (e.g., latency), cannot be locally online because they require offline retraining to ensure accuracy. Also, none of them have correlated their efforts with runtime adaptive compression to consider the dynamic nature of the deployment context of mobile applications. To address those challenges, we present AdaSpring, a context-adaptive and self-evolutionary DNN compression framework. It enables the runtime adaptive DNN compression locally online. Specifically, it presents the ensemble training of a retraining-free and self-evolutionary network to integrate multiple alternative DNN compression configurations (i.e., compressed architectures and weights). It then introduces the runtime search strategy to quickly search for the most suitable compression configurations and evolve the corresponding weights. With evaluation on five tasks across three platforms and a real-world case study, experiment outcomes show that AdaSpring obtains up to 3.1x latency reduction, 4.2 x energy efficiency improvement in DNNs, compared to hand-crafted compression techniques, while only incurring <= 6.2ms runtime-evolution latency.

preprint2021arXiv

Deep Learning for Sensor-based Human Activity Recognition: Overview, Challenges and Opportunities

The vast proliferation of sensor devices and Internet of Things enables the applications of sensor-based activity recognition. However, there exist substantial challenges that could influence the performance of the recognition system in practical scenarios. Recently, as deep learning has demonstrated its effectiveness in many areas, plenty of deep methods have been investigated to address the challenges in activity recognition. In this study, we present a survey of the state-of-the-art deep learning methods for sensor-based human activity recognition. We first introduce the multi-modality of the sensory data and provide information for public datasets that can be used for evaluation in different challenge tasks. We then propose a new taxonomy to structure the deep methods by challenges. Challenges and challenge-related deep methods are summarized and analyzed to form an overview of the current research progress. At the end of this work, we discuss the open issues and provide some insights for future directions.

preprint2020arXiv

Classification of blow-up and global existence of solutions to an initial $\textrm{Neumann}$ problem

The aim of this paper is to apply the modified potential well method and some new differential inequalities to study the asymptotic behavior of solutions to the initial homogeneous $\hbox{Neumann}$ problem of a nonlinear diffusion equation driven by the $p(x)$-\hbox{Laplace} operator. Complete classification of global existence and blow-up in finite time of solutions is given when the initial data satisfies different conditions. Roughly speaking, we obtain a threshold result for the solution to exist globally or to blow up in finite time when the initial energy is subcritical and critical, respectively. Further, the decay rate of the $L^2$ norm is also obtained for global solutions. Sufficient conditions for the existence of global and blow-up solutions are also provided for supercritical initial energy. At last, we give two-sided estimates of asymptotic behavior when the diffusion term dominates the source. This is a continuation of our previous work \cite{GG}.

preprint2020arXiv

Kahler-Einstein Metrics and Eigenvalue Gaps

The existence of Kahler-Einstein metrics on a Fano manifold is characterized in terms of a uniform gap between 0 and the first positive eigenvalue of the Cauchy-Riemann operator on smooth vector fields. It is also characterized by a similar gap between 0 and the first positive eigenvalue for Hamiltonian vector fields. The underlying tool is a compactness criterion for suitably bounded subsets of the space of Kahler potentials which implies a positive gap.

preprint2020arXiv

On the degeneration of asymptotically conical Calabi-Yau metrics

We study the degenerations of asymptotically conical Ricci-flat Kähler metrics as the Kähler class degenerates to a semi-positive class. We show that under appropriate assumptions, the Ricci-flat Kähler metrics converge to a incomplete smooth Ricci-flat Kähler metric away from a compact subvariety. As a consequence, we construct singular Calabi-Yau metrics with asymptotically conical behaviour at infinity on certain quasi-projective varieties and we show that the metric geometry of these singular metrics are homeomorphic to the topology of the singular variety. Finally, we will apply our results to study several classes of examples of geometric transitions between Calabi-Yau manifolds.

preprint2020arXiv

Towards information-rich, logical text generation with knowledge-enhanced neural models

Text generation system has made massive promising progress contributed by deep learning techniques and has been widely applied in our life. However, existing end-to-end neural models suffer from the problem of tending to generate uninformative and generic text because they cannot ground input context with background knowledge. In order to solve this problem, many researchers begin to consider combining external knowledge in text generation systems, namely knowledge-enhanced text generation. The challenges of knowledge enhanced text generation including how to select the appropriate knowledge from large-scale knowledge bases, how to read and understand extracted knowledge, and how to integrate knowledge into generation process. This survey gives a comprehensive review of knowledge-enhanced text generation systems, summarizes research progress to solving these challenges and proposes some open issues and research directions.

preprint2019arXiv

Lifting of level-1 states in the D1D5 CFT

The D1D5 CFT has a large set of states that are supersymmetric at the `free&#39; orbifold point in moduli space. When we perturb away from this point, some of these states join into long multiplets and lift in energy, while others remain supersymmetric. The count of unlifted states can be bounded below by an index, but the index does not yield the pattern of lifting; i.e., which states join into a long multiplet and how much this multiplet lifts. In this paper we consider the simple case of the D1D5 CFT where the orbifold CFT is a sigma model with targets space $(T^4)^2/S_2$ and consider states at energy level $1$. There are $2688$ states at this level. The lifted states form a triplet of long multiplets, and we compute their lift at second order in perturbation theory. Half the members of the long multiplet are in the untwisted sector and half are in the twisted sector. This and other similar studies should help in the understanding of fuzzball states that describe extremal holes, since CFT sectors with low twist describe shallow throats in the dual gravity solution while sectors with high twist describe deep throats.

preprint2019arXiv

Lifting of states in 2-dimensional $N=4$ supersymmetric CFTs

We consider states of the D1-D5 CFT where only the left-moving sector is excited. As we deform away from the orbifold point, some of these states will remain BPS while others can `lift&#39;. The lifting can be computed by a path integral containing two twist deformations; however, the relevant 4-point amplitude cannot be computed explicitly in many cases. We analyze an older proposal by Gava and Narain where the lift can be computed in terms of a finite number of 3-point functions. A direct Hamiltonian decomposition of the path integral involves an infinite number of 3-point functions, as well the first order correction to the starting state. We note that these corrections to the state account for the infinite number of 3-point functions arising from higher energy states, and one can indeed express the path-integral result in terms of a finite number of 3-point functions involving only the leading order states that are degenerate. The first order correction to the supercharge $\bar G^{(1)}$ gets replaced by a projection $\bar G^{(P)}$; this projected operator can also be used to group the states into multiplets whose members have the same lifting.

preprint2018arXiv

Smart City Development with Urban Transfer Learning

Nowadays, the smart city development levels of different cities are still unbalanced. For a large number of cities which just started development, the governments will face a critical cold-start problem: &#39;how to develop a new smart city service with limited data?&#39;. To address this problem, transfer learning can be leveraged to accelerate the smart city development, which we term the urban transfer learning paradigm. This article investigates the common process of urban transfer learning, aiming to provide city planners and relevant practitioners with guidelines on how to apply this novel learning paradigm. Our guidelines include common transfer strategies to take, general steps to follow, and case studies in public safety, transportation management, etc. We also summarize a few research opportunities and expect this article can attract more researchers to study urban transfer learning.