Source author record

You Wu

You Wu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Artificial Intelligence, astro-ph.SR, astro-ph.CO, astro-ph.GA, astro-ph.HE, Computation and Language, Computer Vision, hep-th, Information Retrieval, Information Theory, Machine Learning, math.IT, physics.flu-dyn, physics.optics

Catalog footprint

What is connected

10 works

14 topics

4 close collaborators

preprint · 2026 · arXiv

Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

The Probabilistic Transformer (PT) establishes that the Transformer's self-attention plus its feed-forward block is mathematically equivalent to Mean-Field Variational Inference (MFVI) on a Conditional Random Field (CRF). Under this equivalence the Transformer ceases to be a black-box neural network and becomes a programmable factor graph: graph topology, factor potentials, and the message-passing schedule are all explicit and inspectable primitives that can be engineered. PT was originally developed for natural language and in this report we investigate its potential for time series. We first lift PT into the Spatial-Temporal Probabilistic Transformer (ST-PT) to repair PT's missing channel axis and weak per-step semantics, and adopt ST-PT as a shared cornerstone backbone. We then identify three distinct properties that PT/ST-PT offers as a factor-graph model and derive three Research Questions, one per property, that probe how each property can be exploited in time series: RQ1. The graph topology and potentials are direct programmable primitives. Can this be used to inject symbolic time-series priors into ST-PT through structural graph modifications, especially under data scarcity and noise? RQ2. The CRF's factor matrices are the operator's potentials. Can an external condition program these factor matrices on a per-sample basis, so that conditional generation becomes structural rather than feature-level modulation of a fixed one? RQ3. Each MFVI iteration is a Bayesian posterior update on the factor graph. Can this turn the latent transition of latent-space AutoRegressive (AR) forecasting from an opaque MLP into a principled posterior update, and can a CRF teacher distill its latents into the AR student to counter cumulative error? We give one empirical study per question. Together, these three studies position ST-PT as a programmable framework for time-series modeling.
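As a rough illustration of the kind of computation the abstract refers to, the following is a minimal sketch of generic, textbook mean-field variational inference on a pairwise CRF. It is not the PT or ST-PT model: the function names, the toy chain graph, and the shared compatibility matrix are all assumptions made for this example. Each iteration aggregates neighbour beliefs through a weight matrix and renormalises with a softmax, which is the structural resemblance to attention that the abstract describes.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def mean_field(unary, adjacency, compat, n_iters=10):
    """Parallel mean-field updates for a pairwise CRF (illustrative only).

    unary:     (n, K) log unary potentials
    adjacency: (n, n) edge weights with a zero diagonal
    compat:    (K, K) log compatibility, compat[k, l] scores label k at a
               node against label l at a neighbour
    """
    q = softmax(unary)  # initialise beliefs from the unary potentials
    for _ in range(n_iters):
        # messages[i, k] = sum_j adjacency[i, j] * sum_l q[j, l] * compat[k, l]
        messages = adjacency @ q @ compat.T
        q = softmax(unary + messages)
    return q

# Hypothetical toy problem: a 4-node chain with 2 labels.
unary = np.array([[2.0, 0.0], [0.0, 0.0], [0.0, 0.0], [0.0, 0.0]])
adjacency = np.zeros((4, 4))
for i in range(3):
    adjacency[i, i + 1] = adjacency[i + 1, i] = 1.0
compat = np.eye(2)  # neighbours are rewarded for sharing a label

q = mean_field(unary, adjacency, compat)
print(q.round(3))
```

Only the first node has evidence for label 0, yet the attractive pairwise potential propagates that preference down the chain over successive iterations.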

preprint · 2022 · arXiv

A Generalization of Array Codes with Local Properties and Efficient Encoding/Decoding

A maximum distance separable (MDS) array code is composed of $m\times (k+r)$ arrays such that any $k$ out of $k+r$ columns suffice to retrieve all the information symbols. Expanded-Blaum-Roth (EBR) codes and Expanded-Independent-Parity (EIP) codes are two classes of MDS array codes that can repair any one symbol in a column by locally accessing some other symbols within the column, where the number of symbols $m$ in a column is a prime number. By generalizing the constructions of EBR and EIP codes, we propose new MDS array codes, such that any one symbol can be locally recovered and the number of symbols in a column can be not only a prime number but also a power of an odd prime number. Also, we present an efficient encoding/decoding method for the proposed generalized EBR (GEBR) and generalized EIP (GEIP) codes based on the LU factorization of a Vandermonde matrix. We show that the proposed decoding method has less computational complexity than existing methods. Furthermore, we show that the proposed GEBR codes have both a larger minimum symbol distance and a larger recovery ability of erased lines for some parameters when compared to EBR codes. We show that EBR codes can recover any $r$ erased lines of a slope for any parameter $r$, which was an open problem in [2].
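The MDS property stated above (any $k$ of the $k+r$ columns suffice) can be illustrated with a much simpler scalar code. The sketch below is a toy Reed-Solomon-style code over GF(7) built from a Vandermonde system. It is not an EBR, EIP, GEBR, or GEIP array code, and it decodes with plain Gaussian elimination rather than the LU-based method the abstract proposes. The field size, the data symbols, and all function names are assumptions chosen for illustration.

```python
from itertools import combinations

P = 7  # toy prime field GF(7); hypothetical parameter

def encode(data, n):
    """Evaluate the polynomial with coefficients `data` at x = 1..n over GF(P)."""
    return [sum(c * pow(x, j, P) for j, c in enumerate(data)) % P
            for x in range(1, n + 1)]

def solve_mod(A, b):
    """Solve A x = b over GF(P) by Gauss-Jordan elimination (A nonsingular)."""
    n = len(A)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if M[r][col] % P)
        M[col], M[pivot] = M[pivot], M[col]
        inv = pow(M[col][col], -1, P)  # modular inverse (Python 3.8+)
        M[col] = [(v * inv) % P for v in M[col]]
        for r in range(n):
            if r != col and M[r][col]:
                f = M[r][col]
                M[r] = [(a - f * c) % P for a, c in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]

def decode(points, values, k):
    """Recover the k data symbols from any k surviving (point, value) pairs."""
    vandermonde = [[pow(x, j, P) for j in range(k)] for x in points]
    return solve_mod(vandermonde, values)

k, r = 2, 2
data = [3, 5]
codeword = encode(data, k + r)

# Try every choice of k surviving symbols out of k + r.
recovered = all(
    decode([i + 1 for i in idx], [codeword[i] for i in idx], k) == data
    for idx in combinations(range(k + r), k)
)
print(codeword, recovered)
```

Because the evaluation points are distinct, every $k \times k$ Vandermonde submatrix is invertible, which is why any $k$ symbols determine the data.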

preprint · 2022 · arXiv

AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant

A long-standing goal of intelligent assistants such as AR glasses/robots has been to assist users in affordance-centric real-world scenarios, such as "how can I run the microwave for 1 minute?". However, there is still no clear task definition and suitable benchmarks. In this paper, we define a new task called Affordance-centric Question-driven Task Completion, where the AI assistant should learn from instructional videos to provide step-by-step help in the user's view. To support the task, we constructed AssistQ, a new dataset comprising 531 question-answer samples from 100 newly filmed instructional videos. We also developed a novel Question-to-Actions (Q2A) model to address the AQTC task and validate it on the AssistQ dataset. The results show that our model significantly outperforms several VQA-related baselines while still having large room for improvement. We expect our task and dataset to advance Egocentric AI Assistant's development. Our project page is available at: https://showlab.github.io/assistq/.

preprint · 2022 · arXiv

Identification of new classical Be stars from the LAMOST MRS survey

Be stars are B-type main-sequence stars that display broad Balmer emission lines in their spectra. Identifying the Be population is essential for further examining formation and evolutionary models. We report the detection of classical Be (CBe) stars from observations with the Large sky Area Multi-Object fiber Spectroscopic Telescope Medium Resolution Survey of Data Release 7 (LAMOST MRS DR7). We used a deep convolutional neural network, the ResNet, with an 18-layer module to examine the morphology of the H alpha profile. We identified 1,162 candidate Be stars from the collection of 2,260,387 spectra for 789,918 stars in the database. The ResNet network achieves a Be star classification accuracy of 99.5%. Among the detections, 151 are previously known Be stars cross-matched from the literature. By applying a three-step test, we identified 183 new CBe stars. We find that 41 CBe stars are members of known open clusters. Based upon an investigation of the kinematics of the identified CBe stars from the Gaia EDR3 astrometric solutions, we identified 16 new runaways. These new identifications will provide a reference for future follow-ups to further investigate their physical properties.

preprint · 2021 · arXiv

Autofocusing Self-Imaging: The Symmetric Pearcey Talbot-like Effect

The Talbot-like effect of symmetric Pearcey beams (SPBs) is demonstrated numerically and experimentally in free space. Owing to the Talbot-like effect, SPBs exhibit periodic, multiple autofocusing. The focal positions and focal times of SPBs are controlled by the beam shift factor and the distribution factors. Moreover, the beam shift factor also affects the Talbot-like effect and the Talbot period. As a result, several tiny optical bottles can be generated under appropriate parameter settings. It is believed that the results can diversify the applications of the Talbot effect.

preprint · 2020 · arXiv

Generating Representative Headlines for News Stories

Millions of news articles are published online every day, which can be overwhelming for readers to follow. Grouping articles that report the same event into news stories is a common way of assisting readers in their news consumption. However, it remains a challenging research problem to efficiently and effectively generate a representative headline for each story. Automatic summarization of a document set has been studied for decades, while few studies have focused on generating representative headlines for a set of articles. Unlike summaries, which aim to capture the most information with the least redundancy, headlines aim to capture, in a short length, the information jointly shared by the story articles, and to exclude information that is too specific to any individual article. In this work, we study the problem of generating representative headlines for news stories. We develop a distant supervision approach to train large-scale generation models without any human annotation. This approach centers on two technical components. First, we propose a multi-level pre-training framework that incorporates a massive unlabeled corpus with a different quality-vs.-quantity balance at each level. We show that models trained within this framework outperform those trained with a purely human-curated corpus. Second, we propose a novel self-voting-based article attention layer to extract salient information shared by multiple articles. We show that models that incorporate this layer are robust to potential noise in news stories and outperform existing baselines with or without noise. We can further enhance our model by incorporating human labels, and we show that our distant supervision approach significantly reduces the demand for labeled data.

preprint · 2020 · arXiv

The merger history of primordial-black-hole binaries

As a dark matter candidate, primordial black holes (PBHs) have attracted increasing attention because they could be progenitors of the heavy binary black holes (BBHs) observed by LIGO/Virgo. Accurately estimating the merger rate of PBH binaries will be crucial for reconstructing the mass distribution of PBHs. It has been pointed out that the merger history of PBHs may shift the merger rate distribution, depending on the mass function of PBHs. In this paper, we use 10 BBH events from the LIGO/Virgo O1 and O2 observing runs to constrain the merger rate distribution of PBHs while accounting for the effect of merger history. We find that the second merger process makes a subdominant contribution to the total merger rate, and hence the merger history effect can be safely neglected.

preprint · 2019 · arXiv

Hot subdwarf B stars with neutron star components II: Binary population synthesis

Context: Subdwarf B stars (sdBs) play a crucial role in stellar evolution, asteroseismology, and far-UV radiation of early-type galaxies, and have been intensively studied with observation and theory. It has theoretically been predicted that sdBs with neutron star (NS) companions exist in the Galaxy, but none have been discovered yet. This remains a puzzle in this field. In a previous study (hereafter Paper I), we studied the formation channels of sdB+NS binaries from main-sequence (MS) star plus NS binaries by establishing a model grid, but it remains unclear how these MS+NS binaries formed in the first place.

Aims: We systematically study the formation of sdB+NS binaries from their original zero-age main-sequence progenitors, bridging the gap left by our previous study. We obtain the statistical population properties of sdB+NS binaries and provide some guidance for observational efforts.

Methods: We first used Hurley's rapid binary evolution code BSE to evolve 10^7 primordial binaries to the point where the companions of NS+MS, NS+Hertzsprung gap (HG) star, and NS+Giant Branch (GB) star binaries have just filled their Roche lobes. Next, we injected these binaries into the model grid we developed in Paper I to obtain the properties of the sdB+NS populations. We adopted two prescriptions of NS natal kicks. Different values of common-envelope ejection efficiency were chosen to examine the effect of common-envelope evolution on the results.

Conclusions: Most sdB+NS binaries are located in the Galactic disk and have small radial-velocity (RV) semi-amplitudes. SdB+NS binaries with large RV semi-amplitudes are expected to be strong gravitational-wave radiation (GWR) sources, some of which could be detected by LISA in the future.
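The Roche-lobe filling condition used as the stopping point in the Methods is commonly evaluated with Eggleton's (1983) approximation for the effective Roche-lobe radius, $R_L/a = 0.49\,q^{2/3} / (0.6\,q^{2/3} + \ln(1 + q^{1/3}))$, where $q$ is the donor-to-companion mass ratio. The sketch below applies that formula. The function names and the example masses, separation, and radius are hypothetical, and whether the study applies exactly this criterion is an assumption here.

```python
import math

def roche_lobe_radius(separation, m_donor, m_companion):
    """Effective Roche-lobe radius of the donor (Eggleton 1983 approximation).

    The result has the same length unit as `separation`.
    """
    q = m_donor / m_companion
    q13 = q ** (1.0 / 3.0)
    q23 = q13 * q13
    return separation * 0.49 * q23 / (0.6 * q23 + math.log(1.0 + q13))

def fills_roche_lobe(radius, separation, m_donor, m_companion):
    """True when the donor's radius reaches or exceeds its Roche-lobe radius."""
    return radius >= roche_lobe_radius(separation, m_donor, m_companion)

# Hypothetical system: 3.0 Msun donor, 1.4 Msun neutron star, 20 Rsun separation.
r_lobe = roche_lobe_radius(20.0, 3.0, 1.4)
print(f"Roche-lobe radius: {r_lobe:.2f} Rsun")
print("Donor of 10 Rsun fills its lobe:", fills_roche_lobe(10.0, 20.0, 3.0, 1.4))
```

The ratio $R_L/a$ depends only on the mass ratio, so a donor fills its lobe either by expanding (as on the Hertzsprung gap or giant branch) or because the orbit shrinks.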

preprint · 2014 · arXiv

Vector Condensate and AdS Soliton Instability Induced by a Magnetic Field

We continue to study the holographic p-wave superconductor model in the Einstein-Maxwell-complex vector field theory with a non-minimal coupling between the complex vector field and the Maxwell field. In this paper we work in the AdS soliton background which describes a conformal field theory in the confined phase and focus on the probe approximation. We find that an applied magnetic field can lead to the condensate of the vector field and the AdS soliton instability. As a result, a vortex lattice structure forms in the spatial directions perpendicular to the applied magnetic field. As a comparison, we also discuss the vector condensate in the Einstein-SU(2) Yang-Mills theory and find that in the setup of the present paper, the Einstein-Maxwell-complex vector field model is a generalization of the SU(2) model in the sense that the vector field has a general mass and gyromagnetic ratio.

preprint · 2011 · arXiv

Karman constant and accurate mean flow prediction in a turbulent pipe

The Karman constant κ, widely used in atmospheric science and engineering turbulence modelling and proposed by Prandtl in 1925 and von Karman in 1930 to describe the mean velocity of a turbulent wall-bounded flow, leads to a logarithmic profile in an overlap region near the wall. For over eighty years, its value was believed to be ~0.41. More recently, however, many have argued that it is not a constant, because of measured variations in different flows and at different Reynolds numbers (Re). Here, a multi-layer analytic theory is shown to lead to a re-interpretation of κ as a global constant for both the overlap region and the outer flow, and to yield a new method for its measurement. The newly determined value is 0.45 for both channel and pipe. It is shown that this new κ, together with other wall constants, yields 99% accuracy in the prediction of mean velocity data at all points in high-Re (up to 40 million) pipe flow. The theory also describes finite-Re effects and reveals a transition at the friction Reynolds number Re_τ = 5000. An accurate model for the prediction of turbulent transport in canonical pipe and channel flows is achieved here, and we propose that the model is valid for a wide class of turbulent flows.
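To make the role of κ concrete, the classical logarithmic law for the overlap region reads $U^+ = \frac{1}{\kappa}\ln y^+ + B$ in wall units. The sketch below evaluates only this textbook log law for the two κ values quoted above. It does not reproduce the paper's multi-layer theory, and the additive constant `B_ASSUMED` is a hypothetical placeholder, not a value taken from the paper.

```python
import math

def log_law_velocity(y_plus, kappa, b):
    """Mean velocity in wall units from the classical log law: U+ = ln(y+)/kappa + B."""
    return math.log(y_plus) / kappa + b

B_ASSUMED = 5.0  # hypothetical additive constant, chosen only for illustration

for kappa in (0.41, 0.45):
    u_plus = log_law_velocity(1000.0, kappa, B_ASSUMED)
    print(f"kappa = {kappa}: U+ at y+ = 1000 is {u_plus:.2f}")
```

With the additive constant held fixed, a larger κ gives a shallower logarithmic slope, so the predicted velocity far from the wall is sensitive to the chosen value.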
