Source author record

Pei Wu

Pei Wu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.soc-ph Computational Complexity Computer Vision Discrete Mathematics hep-ph Information Retrieval math.PR physics.atom-ph physics.chem-ph physics.optics physics.plasm-ph Social and Information Networks

Catalog footprint

What is connected

6works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding

While Multimodal Large Language Models (MLLMs) excel at single-image understanding, they exhibit significantly degraded performance in multi-image reasoning scenarios. Multi-image reasoning presents fundamental challenges including complex inter-relationships between images and scattered critical information across image sets. Inspired by human cognitive processes, we propose the Cognition-Inspired Meta-Action Framework (CINEMA), a novel approach that decomposes multi-image reasoning into five structured meta-actions: Global, Focus, Hint, Think, and Answer which explicitly modeling the sequential cognitive steps humans naturally employ. For cold-start training, we introduce a Retrieval-Based Tree Sampling strategy that generates high-quality meta-action trajectories to bootstrap the model with reasoning patterns. During reinforcement learning, we adopt a two-stage paradigm: an exploration phase with Diversity-Preserving Strategy to avoid entropy collapse, followed by an annealed exploitation phase with DAPO to gradually strengthen exploitation. To train our model, we construct a dataset of 57k cold-start and 58k reinforcement learning instances spanning multi-image, multi-frame, and single-image tasks. We conduct extensive evaluations on multi-image reasoning benchmarks, video understanding benchmarks, and single-image benchmarks, achieving competitive state-of-the-art performance on several key benchmarks. Our model surpasses GPT-4o on the MUIR and MVMath benchmarks and notably outperforms specialized video reasoning models on video understanding benchmarks, demonstrating the effectiveness and generalizability of our human cognition-inspired reasoning framework.

preprint2022arXiv

An Optimal "It Ain't Over Till It's Over" Theorem

We study the probability of Boolean functions with small max influence to become constant under random restrictions. Let $f$ be a Boolean function such that the variance of $f$ is $Ω(1)$ and all its individual influences are bounded by $τ$. We show that when restricting all but a $ρ=\tildeΩ((\log(1/τ))^{-1})$ fraction of the coordinates, the restricted function remains nonconstant with overwhelming probability. This bound is essentially optimal, as witnessed by the tribes function $\mathrm{TRIBES}=\mathrm{AND}_{n/C\log n}\circ\mathrm{OR}_{C\log n}$. We extend it to an anti-concentration result, showing that the restricted function has nontrivial variance with probability $1-o(1)$. This gives a sharp version of the "it ain't over till it's over" theorem due to Mossel, O'Donnell, and Oleszkiewicz. Our proof is discrete, and avoids the use of the invariance principle. We also show two consequences of our above result: (i) As a corollary, we prove that for a uniformly random input $x$, the block sensitivity of $f$ at $x$ is $\tildeΩ(\log(1/τ))$ with probability $1-o(1)$. This should be compared with the implication of Kahn, Kalai, and Linial's result, which implies that the average block sensitivity of $f$ is $Ω(\log(1/τ))$. (ii) Combining our proof with a well-known result due to O'Donnell, Saks, Schramm, and Servedio, one can also conclude that: Restricting all but a $ρ=\tildeΩ(1/\sqrt{\log (1/τ) })$ fraction of the coordinates of a monotone function $f$, then the restricted function has decision tree complexity $Ω(τ^{-Θ(ρ)})$ with probability $Ω(1)$.

preprint2022arXiv

Decrypting material performance by wide-field femtosecond interferometric imaging of energy carrier evolution

Energy carrier evolution is crucial for material performance. Ultrafast microscopy has been widely applied to visualize the spatiotemporal evolution of energy carriers. However, direct imaging of small amounts of energy carriers on nanoscale remains difficult due to extremely weak transient signals. Here we present a method for ultrasensitive and high-throughput imaging of energy carrier evolution in space and time. This method combines femtosecond pump-probe techniques with interferometric scattering microscopy (iSCAT), named Femto-iSCAT. The interferometric principle and unique spatially-modulated contrast enhancement increase the transient image contrast by >2 orders of magnitude and enable the exploration of new science. We address three important and challenging problems: transport of different energy carriers at various interfaces, heterogeneous hot electron distribution and relaxation in single plasmonic resonators, and distinct structure-dependent edge state dynamics of carriers and excitons in optoelectronic semiconductors. Femto-iSCAT holds great potential as a universal tool for ultrasensitive imaging of energy carrier evolution in space and time.

preprint2013arXiv

Positronium decay in a circular polarized laser field

We calculate the lifetime of both the o-Ps and the p-Ps positronium annihilation decay $ Ps\toγγ$ in the strong circular polarized laser field. We take a strategy of the factorization to separate the effects caused by the Coulomb interaction and the strong laser field interaction. It is factorized in the time direction but not in the space direction. Our results show that in the laser with long wavelength and high intensity, the lifetimes of those Ps states are dramatically increased. For $\rm CO_2$ laser with $10\, μm$ wavelength and $10^{13} W/cm^2$ intensity, lifetime of the spin-single positronium is increased by $10^8$ times. Our result is consistent with those obtained by solving the Sch{ö}dinger equation. This effect may be useful for the high harmonic generation(HHG) effects provided with the Ps\cite{keitel2004}.

preprint2011arXiv

Effective Mechanism for Social Recommendation of News

Recommendation systems represent an important tool for news distribution on the Internet. In this work we modify a recently proposed social recommendation model in order to deal with no explicit ratings of users on news. The model consists of a network of users which continually adapts in order to achieve an efficient news traffic. To optimize network's topology we propose different stochastic algorithms that are scalable with respect to the network's size. Agent-based simulations reveal the features and the performance of these algorithms. To overcome the resultant drawbacks of each method we introduce two improved algorithms and show that they can optimize network's topology almost as fast and effectively as other not-scalable methods that make use of much more information.

preprint2010arXiv

Time-aware Collaborative Filtering with the Piecewise Decay Function

In this paper, we determine the appropriate decay function for item-based collaborative filtering (CF). Instead of intuitive deduction, we introduce the Similarity-Signal-to-Noise-Ratio (SSNR) to quantify the impacts of rated items on current recommendations. By measuring the variation of SSNR over time, drift in user interest is well visualized and quantified. Based on the trend changes of SSNR, the piecewise decay function is thus devised and incorporated to build our time-aware CF algorithm. Experiments show that the proposed algorithm strongly outperforms the conventional item-based CF algorithm and other time-aware algorithms with various decay functions.