Researcher profile

You Wu

You Wu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

The Probabilistic Transformer (PT) establishes that the Transformer's self-attention plus its feed-forward block is mathematically equivalent to Mean-Field Variational Inference (MFVI) on a Conditional Random Field (CRF). Under this equivalence the Transformer ceases to be a black-box neural network and becomes a programmable factor graph: graph topology, factor potentials, and the message-passing schedule are all explicit and inspectable primitives that can be engineered. PT was originally developed for natural language and in this report we investigate its potential for time series. We first lift PT into the Spatial-Temporal Probabilistic Transformer (ST-PT) to repair PT's missing channel axis and weak per-step semantics, and adopt ST-PT as a shared cornerstone backbone. We then identify three distinct properties that PT/ST-PT offers as a factor-graph model and derive three Research Questions, one per property, that probe how each property can be exploited in time series: RQ1. The graph topology and potentials are direct programmable primitives. Can this be used to inject symbolic time-series priors into ST-PT through structural graph modifications, especially under data scarcity and noise? RQ2. The CRF's factor matrices are the operator's potentials. Can an external condition program these factor matrices on a per-sample basis, so that conditional generation becomes structural rather than feature-level modulation of a fixed one? RQ3. Each MFVI iteration is a Bayesian posterior update on the factor graph. Can this turn the latent transition of latent-space AutoRegressive (AR) forecasting from an opaque MLP into a principled posterior update, and can a CRF teacher distill its latents into the AR student to counter cumulative error? We give one empirical study per question. Together, these three studies position ST-PT as a programmable framework for time-series modeling.

preprint2022arXiv

A Generalization of Array Codes with Local Properties and Efficient Encoding/Decoding

A maximum distance separable (MDS) array code is composed of $m\times (k+r)$ arrays such that any $k$ out of $k+r$ columns suffice to retrieve all the information symbols. Expanded-Blaum-Roth (EBR) codes and Expanded-Independent-Parity (EIP) codes are two classes of MDS array codes that can repair any one symbol in a column by locally accessing some other symbols within the column, where the number of symbols $m$ in a column is a prime number. By generalizing the constructions of EBR and EIP codes, we propose new MDS array codes, such that any one symbol can be locally recovered and the number of symbols in a column can be not only a prime number but also a power of an odd prime number. Also, we present an efficient encoding/decoding method for the proposed generalized EBR (GEBR) and generalized EIP (GEIP) codes based on the LU factorization of a Vandermonde matrix. We show that the proposed decoding method has less computational complexity than existing methods. Furthermore, we show that the proposed GEBR codes have both a larger minimum symbol distance and a larger recovery ability of erased lines for some parameters when compared to EBR codes. We show that EBR codes can recover any $r$ erased lines of a slope for any parameter $r$, which was an open problem in [2].

preprint2022arXiv

AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant

A long-standing goal of intelligent assistants such as AR glasses/robots has been to assist users in affordance-centric real-world scenarios, such as "how can I run the microwave for 1 minute?". However, there is still no clear task definition and suitable benchmarks. In this paper, we define a new task called Affordance-centric Question-driven Task Completion, where the AI assistant should learn from instructional videos to provide step-by-step help in the user's view. To support the task, we constructed AssistQ, a new dataset comprising 531 question-answer samples from 100 newly filmed instructional videos. We also developed a novel Question-to-Actions (Q2A) model to address the AQTC task and validate it on the AssistQ dataset. The results show that our model significantly outperforms several VQA-related baselines while still having large room for improvement. We expect our task and dataset to advance Egocentric AI Assistant's development. Our project page is available at: https://showlab.github.io/assistq/.

preprint2022arXiv

Identification of new classical Be stars from the LAMOST MRS survey

Be stars are B-type main-sequence stars that display broad Balmer emission lines in their spectra. Identification of Be population is essential to further examine the formation and evolutionary models. We report the detection of classical Be (CBe) stars from observations with the Large sky Area Multi-Object fiber Spectroscopic Telescope Medium Resolution Survey of Date Release 7 (LAMOST MRS DR7). We used a deep convolutional neural network, the ResNet, with an 18-layer module to examine the morphology of the H alpha profile. We identified 1,162 candidate Be stars from the collection of 2,260,387 spectra for 789,918 stars in the database. The ResNet network achieves a Be star classification accuracy of 99.5%. Among the detections, 151 of these are prior known Be stars cross-matched from the literature. By applying a three-step test, we identified 183 new CBe stars. We find that 41 CBe stars are members of known open clusters. Based upon an investigation of the kinematics of the identified CBe stars from the Gaia EDR3 astrometric solutions, we identified 16 new runaways. These new identifications will provide a reference for future follow-ups to further investigate their physical properties.

preprint2021arXiv

Autofocusing Self-Imaging: The Symmetric Pearcey Talbot-like Effect

The Talbot like effect of symmetric Pearcey beams (SPBs) is presented numerically and experimentally in the free space. Owing to the Talbot like effect, the SPBs have the property of periodic and multiple autofocusing. Meanwhile, the focal positions and focal times of SPBs are controlled by the beam shift factor and the distribution factors. What is more, the beam shift factor can also affect the Talbot-like effect and the Talbot period. Therefore, several tiny optical bottles can be generated under the appropriate parameter setting. It is believed that the results can diversify the application of the Talbot effect.

preprint2020arXiv

Generating Representative Headlines for News Stories

Millions of news articles are published online every day, which can be overwhelming for readers to follow. Grouping articles that are reporting the same event into news stories is a common way of assisting readers in their news consumption. However, it remains a challenging research problem to efficiently and effectively generate a representative headline for each story. Automatic summarization of a document set has been studied for decades, while few studies have focused on generating representative headlines for a set of articles. Unlike summaries, which aim to capture most information with least redundancy, headlines aim to capture information jointly shared by the story articles in short length, and exclude information that is too specific to each individual article. In this work, we study the problem of generating representative headlines for news stories. We develop a distant supervision approach to train large-scale generation models without any human annotation. This approach centers on two technical components. First, we propose a multi-level pre-training framework that incorporates massive unlabeled corpus with different quality-vs.-quantity balance at different levels. We show that models trained within this framework outperform those trained with pure human curated corpus. Second, we propose a novel self-voting-based article attention layer to extract salient information shared by multiple articles. We show that models that incorporate this layer are robust to potential noises in news stories and outperform existing baselines with or without noises. We can further enhance our model by incorporating human labels, and we show our distant supervision approach significantly reduces the demand on labeled data.

preprint2020arXiv

The merger history of primordial-black-hole binaries

As a candidate of dark matter, primordial black holes (PBHs) have attracted more and more attentions as they could be possible progenitors of the heavy binary black holes (BBHs) observed by LIGO/Virgo. Accurately estimating the merger rate of PBH binaries will be crucial to reconstruct the mass distribution of PBHs. It was pointed out the merger history of PBHs may shift the merger rate distribution depending on the mass function of PBHs. In this paper, we use 10 BBH events from LIGO/Virgo O1 and O2 observing runs to constrain the merger rate distribution of PBHs by accounting the effect of merger history. It is found that the second merger process makes subdominant contribution to the total merger rate, and hence the merger history effect can be safely neglected.

preprint2019arXiv

Hot subdwarf B stars with neutron star components II: Binary population synthesis

Context: Subdwarf B stars (sdBs) play a crucial role in stellar evolution, asteroseismology, and far-UV radiation of early-type galaxies, and have been intensively studied with observation and theory. It has theoretically been predicted that sdBs with neutron star (NS) companions exist in the Galaxy, but none have been discovered yet. This remains a puzzle in this field. In a previous study (hereafter Paper I), we have studied the formation channels of sdB+NS binaries from main-sequence (MS) stars plus NS binaries by establishing a model grid, but it is still unclear how these binaries consisting of MS stars and NS binaries came to be in the first place. Aims: We systematically study the formation of sdB+NS binaries from their original zero-age main-sequence progenitors. We bridge the gap left by our previous study in this way. We obtain the statistical population properties of sdB+NS binaries and provide some guidance for observational efforts. Methods: We first used Hurley's rapid binary evolution code BSE to evolve 10^7 primordial binaries to the point where the companions of NS+MS, NS+Hertzsprung gap (HG) star, and NS+Giant Branch (GB) star binaries have just filled their Roche lobes. Next, we injected these binaries into the model grid we developed in Paper I to obtain the properties of the sdB+NS populations. We adopted two prescriptions of NS natal kicks. Different values of common-envelope ejection efficiency were chosen to examine the effect of common-envelope evolution on the results. Conclusions: Most sdB+NS binaries are located in the Galactic disk with small RV semi-amplitudes. SdB+NS binaries with large RV semi-amplitudes are expected to be strong GWR sources, some of which could be detected by LISA in the future.