Researcher profile

Xiangyu Zhou

Xiangyu Zhou contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2025arXiv

Not All Tokens Are Meant to Be Forgotten

Large Language Models (LLMs), pre-trained on massive text corpora, exhibit remarkable human-level language understanding, reasoning, and decision-making abilities. However, they tend to memorize unwanted information, such as private or copyrighted content, raising significant privacy and legal concerns. Unlearning has emerged as a promising solution, but existing methods face a significant challenge of over-forgetting. This issue arises because they indiscriminately suppress the generation of all the tokens in forget samples, leading to a substantial loss of model utility. To overcome this challenge, we introduce the Targeted Information Forgetting (TIF) framework, which consists of (1) a flexible targeted information identifier designed to differentiate between unwanted words (UW) and general words (GW) in the forget samples, and (2) a novel Targeted Preference Optimization approach that leverages Logit Preference Loss to unlearn unwanted information associated with UW and Preservation Loss to retain general information in GW, effectively improving the unlearning process while mitigating utility degradation. Extensive experiments on the TOFU and MUSE benchmarks demonstrate that the proposed TIF framework enhances unlearning effectiveness while preserving model utility and achieving state-of-the-art results.

preprint2022arXiv

Linear isometric invariants of bounded domains

We introduce two new conditions for bounded domains, namely $A^p$-completeness and boundary blow down type, and show that, for two bounded domains $D_1$ and $D_2$ that are $A^p$-complete and not of boundary blow down type, if there exists a linear isometry from $A^p(D_1)$ to $A^{p}(D_2)$ for some real number $p>0$ with $p\neq $ even integers, then $D_1$ and $D_2$ must be holomorphically equivalent, where for a domain $D$, $A^p(D)$ denotes the space of $L^p$ holomorphic functions on $D$.

preprint2022arXiv

Synthesizing Analytical SQL Queries from Computation Demonstration

Analytical SQL is widely used in modern database applications and data analysis. However, its partitioning and grouping operators are challenging for novice users. Unfortunately, programming by example, shown effective on standard SQL, are less attractive because examples for analytical queries are more laborious to solve by hand. To make demonstrations easier to create, we designed a new end-user specification, programming by computation demonstration, that allows the user to demonstrate the task using a (possibly incomplete) cell-level computation trace. This specification is exploited in a new abstraction-based synthesis algorithm to prove that a partially formed query cannot be completed to satisfy the specification, allowing us to prune the search space. We implemented our approach in a tool named Sickle and tested it on 80 real-world analytical SQL tasks. Results show that even from small demonstrations, Sickle can solve 76 tasks, in 12.8 seconds on average, while the prior approaches can solve only 60 tasks and are on average 22.5x slower. Our user study with 13 participants reveals that our specification increases user efficiency and confidence on challenging tasks.

preprint2021arXiv

On the restriction formula

Let $φ$ be a quasi-psh function on a complex manifold $X$ and let $S\subset X$ be a complex submanifold. Then the multiplier ideal sheaves $\mathcal{I}(φ|_S)\subset\mathcal{I}(φ)|_{S}$ and the complex singularity exponents $c_{x}\left(φ|_{S}\right)\leqslant c_{x}(φ)$ by Ohsawa-Takegoshi $L^{2}$ extension theorem. An interesting question is to know whether it is possible to get equalities in the above formulas. In the present article, we show that the answer is positive when $S$ is chosen outside a measure zero set in a suitable projective space.

preprint2020arXiv

Positivity of holomorphic vector bundles in terms of $L^p$-conditions of $\bar\partial$

We study the positivity properties of Hermitian (or even Finsler) holomorphic vector bundles in terms of $L^p$-estimates of $\bar\partial$ and $L^p$-extensions of holomorphic objects. To this end, we introduce four conditions, called the optimal $L^p$-estimate condition, the multiple coarse $L^p$-estimate condition, the optimal $L^p$-extension condition, and the multiple coarse $L^p$-extension condition, for a Hermitian (or Finsler) vector bundle $(E,h)$. The main result of the present paper is to give a characterization of the Nakano positivity of $(E,h)$ via the optimal $L^2$-estimate condition. We also show that $(E,h)$ is Griffiths positive if it satisfies the multiple coarse $L^p$-estimate condition for some $p>1$, the optimal $L^p$-extension condition, or the multiple coarse $L^p$-extension condition for some $p>0$. These results can be roughly viewed as converses of Hörmander's $L^2$-estimate of $\bar\partial$ and Ohsawa-Takegoshi type extension theorems. As an application of the main result, we get a totally different method to Nakano positivity of direct image sheaves of twisted relative canonical bundles associated to holomorphic families of complex manifolds.