Source author record

Weiwei Liu

Weiwei Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.optics Computer Vision cond-mat.str-el Artificial Intelligence astro-ph.SR Computation and Language Computer Science and Game Theory cond-mat.mtrl-sci cond-mat.supr-con Social and Information Networks

Catalog footprint

What is connected

13works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Long-context modeling is becoming a core capability of modern large vision-language models (LVLMs), enabling sustained context management across long-document understanding, video analysis, and multi-turn tool use in agentic workflows. Yet practical training recipes remain insufficiently explored, particularly for designing and balancing long-context data mixtures. In this work, we present a systematic study of long-context continued pre-training for LVLMs, extending a 7B model from 32K to 128K context with extensive ablations on long-document data. We first show that long-document VQA is substantially more effective than OCR transcription. Building on this observation, our ablations further yield three key findings: i) for sequence-length distribution, balanced data outperforms target-length-focused data (e.g., 128K), suggesting that long-context ability requires generalizable key-information retrieval across various lengths and positions; ii) retrieval remains the primary bottleneck, favoring retrieval-heavy mixtures with modest reasoning data for task diversity; and iii) pure long-document VQA largely preserves short-context capabilities, suggesting that instruction-formatted long data reduces the need for short-data mixing. Based on these findings, we introduce MMProLong, obtained by long-context continued pre-training from Qwen2.5-VL-7B with only a 5B-token budget. MMProLong improves long-document VQA scores by 7.1% and maintains strong performance at 256K and 512K contexts beyond its 128K training window, without additional training. It further generalizes to webpage-based multimodal needle retrieval, long-context vision-text compression, and long-video understanding without task-specific supervision. Overall, our study establishes a practical LongPT recipe and an empirical foundation for advancing long-context vision-language models.

preprint2022arXiv

EPPAC: Entity Pre-typing Relation Classification with Prompt AnswerCentralizing

Relation classification (RC) aims to predict the relationship between a pair of subject and object in a given context. Recently, prompt tuning approaches have achieved high performance in RC. However, existing prompt tuning approaches have the following issues: (1) numerous categories decrease RC performance; (2) manually designed prompts require intensive labor. To address these issues, a novel paradigm, Entity Pre-typing Relation Classification with Prompt Answer Centralizing(EPPAC) is proposed in this paper. The entity pre-tying in EPPAC is presented to address the first issue using a double-level framework that pre-types entities before RC and prompt answer centralizing is proposed to address the second issue. Extensive experiments show that our proposed EPPAC outperformed state-of-the-art approaches on TACRED and TACREV by 14.4% and 11.1%, respectively. The code is provided in the Supplementary Materials.

preprint2022arXiv

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

Optical character recognition (OCR) technology has been widely used in various scenes, as shown in Figure 1. Designing a practical OCR system is still a meaningful but challenging task. In previous work, considering the efficiency and accuracy, we proposed a practical ultra lightweight OCR system (PP-OCR), and an optimized version PP-OCRv2. In order to further improve the performance of PP-OCRv2, a more robust OCR system PP-OCRv3 is proposed in this paper. PP-OCRv3 upgrades the text detection model and text recognition model in 9 aspects based on PP-OCRv2. For text detector, we introduce a PAN module with large receptive field named LK-PAN, a FPN module with residual attention mechanism named RSE-FPN, and DML distillation strategy. For text recognizer, the base model is replaced from CRNN to SVTR, and we introduce lightweight text recognition network SVTR LCNet, guided training of CTC by attention, data augmentation strategy TextConAug, better pre-trained model by self-supervised TextRotNet, UDML, and UIM to accelerate the model and improve the effect. Experiments on real data show that the hmean of PP-OCRv3 is 5% higher than PP-OCRv2 under comparable inference speed. All the above mentioned models are open-sourced and the code is available in the GitHub repository PaddleOCR which is powered by PaddlePaddle.

preprint2020arXiv

Crystalline Electric-Field Excitations in Quantum Spin Liquids Candidate $NaYbSe_{2}$

Very recently we revealed a large family of triangular lattice quantum spin liquid candidates named rare-earth chalcogenides, which features a high-symmetry structure without structural/charge disorders and spin impurities, and may serve as an ideal platform exploring spin liquid physics. The knowledge of crystalline electric-field (CEF) excitations is an essential step to explore the fundamental magnetism of rare-earth spin systems. Here we employed inelastic neutron scattering (INS) and Raman scattering (RS) to carry out a comprehensive CFE investigation on $NaYbSe_{2}$, a promising representative of the family. By comparison with its nonmagnetic compound $NaLuSe_{2}$, we are able to identify the CEF excitations at 15.8, 24.3 and 30.5 meV at 5K. The selected cuts of the INS spectra are well re-produced with a large anisotropy of $g$ factors ($g_{ab}:g_{c}\sim3:1$). Further, the CEF excitations are explained well by our calculations based on the point charge model. Interestingly, $NaYbSe_{2}$ exhibits an unusual CEF shift to higher energies with increasing temperatures, and the Raman mode close to the first CEF excitation shows an anomalously large softening with decreasing temperatures. The absence of the anomalies in $NaLuSe_{2}$ clearly demonstrates a CEF-phonon coupling not reported in the family. It can be understood in term of the weaker electronegativity of Se. The fact that the smallest first CEF excitation in the sub-family of $NaYbCh_{2}$ is $\sim$ 180K (Ch=O, S, Se), guarantees that the sub-family can be strictly described with an effective S=1/2 picture at sufficiently low temperatures. Interestingly the CEF-phonon coupling revealed here may present alternative possibilities to manipulate the spin systems.

preprint2020arXiv

Opinion Maximization in Social Trust Networks

Social media sites are now becoming very important platforms for product promotion or marketing campaigns. Therefore, there is broad interest in determining ways to guide a site to react more positively to a product with a limited budget. However, the practical significance of the existing studies on this subject is limited for two reasons. First, most studies have investigated the issue in oversimplified networks in which several important network characteristics are ignored. Second, the opinions of individuals are modeled as bipartite states(e.g., support or not) in numerous studies, however, this setting is too strict for many real scenarios. In this study, we focus on social trust networks(STNs), which have the significant characteristics ignored in the previous studies. We generalized a famed continuous-valued opinion dynamics model for STNs, which is more consistent with real scenarios. We subsequently formalized two novel problems for solving the issue in STNs. Moreover, we developed two matrix-based methods for these two problems and experiments on real-world datasets to demonstrate the practical utility of our methods.

preprint2020arXiv

Pressure induced metallization and possible unconventional superconductivity in spin liquid $NaYbSe_{2}$

Beyond the conventional electron pairing mediated by phonons, high-temperature superconductivity in cuprates is believed to stem from quantum spin liquid (QSL). The unconventional superconductivity by doping a spin liquid/Mott insulator, is a long-sought goal but a principal challenge in condensed matter physics because of the lack of an ideal QSL platform. Here we report the pressure induced metallization and possible unconventional superconductivity in $NaYbSe_{2}$, which belongs to a large and ideal family of triangular lattice spin liquid we revealed recently and is evidenced to possess a QSL ground state. The charge gap of NaYbSe2 is gradually reduced by applying pressures, and at ~20 GPa the crystal jumps into a superconducting (SC) phase with Tc ~ 5.8 K even before the insulating gap is completely closed. The metallization is confirmed by further high-pressure experiments but the sign of superconductivity is not well repeated. No symmetry breaking accompanies the SC transition, as indicated by X-ray diffraction and low-temperature Raman experiments under high pressures. This intrinsically connects QSL and SC phases, and suggests an unconventional superconductivity developed from QSL. We further observed the magnetic-field-tuned superconductor-insulator transition which is analogous to that found in the underdoped cuprate superconductor $La_{2-x}Sr_{x}CuO_{4}$. The study is expected to inspire interest in exploring new types of superconductors and sheds light into the intriguing physics from a spin liquid/Mott insulator to a superconductor.

preprint2020arXiv

Spectral Domain Z-scan Technique

Characterizing the nonlinear optical properties of various materials plays a prerequisite role in nonlinear optics. Among different methods, the well-known Z-scan technique and the modified versions have been recognized as a simple and accurate method for measuring both the real and imaginary parts of the nonlinear refractive index. However, all the Z-scan methods based on detecting small beam variations put forward a severe restriction on the roughness of materials. Therefore, measuring nonlinear optical properties of highly scattering media still remain challenging. Inspired by the innovation of conventional Z-scan method that converting the wavefront phase shift to the easily measurable spatial pattern in far-field, the alternative spectral domain Z-scan technique was presented in this paper. It has a great potential for highly scattering medium, based on the scattering efficiency is insensitive to the wavelength for Mie scattering as the wavelengths are far smaller than the roughness. Moreover, to demonstrate the advantages of spectral domain Z-scan technique, the nonlinear refraction of polished slides and frosted slides was measured, which agrees well with previous reports.

preprint2016arXiv

Controllable light capsules employing modified Bessel-Gauss beams

We report, in theory and experiment, on a novel class of controlled light capsules with nearly perfect darkness, directly employing intrinsic properties of modified Bessel-Gauss beams. These beams are able to naturally create three-dimensional bottle-shaped region during propagation as long as the parameters are properly chosen. Remarkably, the optical bottle can be controlled to demonstrate various geometries through tuning the beam parameters, thereby leading to an adjustable light capsule. We provide a detailed insight into the theoretical origin and characteristics of the light capsule derived from modified Bessel-Gauss beams. Moreover, a binary digital micromirror device (DMD) based scheme is first employed to shape the bottle beams by precise amplitude and phase manipulation. Further, we demonstrate their ability for optical trapping of core-shell magnetic microparticles, which play a particular role in biomedical research, with holographic optical tweezers. Therefore, our observations provide a new route for generating and controlling bottle beams and will widen the potentials for micromanipulation of absorbing particles, aerosols or even individual atoms.

preprint2015arXiv

Partial eruption of a filament with twisting nonuniform fields

The eruption of the filament with the kink fashion is often regarded as a signature of the kink instability. However, the kink instability threshold for the filament magnetic structure has been not widely understood. Using the H-alpha observation from the New Vacuum Solar Telescope (NVST), we present a partial eruptive filament. In the eruption, a filament thread appeared to split from the middle portion of the filament and to break out in a kinklike fashion. During this period, the left filament material remained below, which erupted without the kinking motion later on. The coronal magnetic field lines associated with the filament are obtained from the nonlinear force-free field (NLFFF) extrapolations using the 12 minutes cadence vector magnetograms of the Helioseismic and Magnetic Imager (HMI) on board the Solar Dynamic Observatory (SDO). We studied the extrapolated field lines passing through the magnetic dips that are in good agreement with the observed filament. The field lines are non-uniformly twisted and appear to be made up by two twisted flux ropes winding about each other. One of them has higher twist than the other, and the highly twisted one has its dips aligned with the kinking eruptive thread at the beginning of its eruption. Before the eruption, moreover, the highly twisted flux rope was found to expand with the approximately constant field twist. In addition, the helicity flux maps deduced from the HMI magnetograms show that some helicity is injected into the overlying magnetic arcade, but no significant helicity is injected into the flux ropes. Accordingly, we suggest that the highly twisted flux rope became kink unstable when the instability threshold declined with the expansion of the flux rope.

preprint2015arXiv

Time-resolved shadowgraphs of transient plasma induced by spatiotemporally focused femtosecond laser pulses in fused silica glass

We report on experimental observations of formation and evolution of transient plasma produced in fused silica glass with spatiotemporally focused (STF) femtosecond laser pulses using a pump-probe shadow imaging technique. Surprisingly, the observation shows that the track of the plasma is significantly curved, which is attributed to an asymmetric density distribution of the transient plasma produced in the focal volume caused by the pulse front tilt of the STF laser field.

preprint2013arXiv

Laterally emitted surface second harmonic generation in a single ZnTe nanowire

We report a direct observation on the unique laterally emitted surface second harmonic generation (SHG) in a single ZnTe nanowire. The highly directional surface SHG signal that polarizes along the direction vertical to the nanowire growth axis, is significantly larger than the bulk SHG contribution, indicating a high efficiency of surface SHG. Two strong SHG peaks are observed on both sides of the nanowire surfaces in the far-field scanning images, which is further supported by FDTD simulations, demonstrating that the unique laterally emitted signal is ascribed to surface SHG in the ZnTe nanowire. The surface SHG in a single ZnTe nanowire with unique lateral emission and high conversion efficiency shows great potential applications in short-wavelength nanolasers, nonlinear microscopy and polarization dependent photonic integrating.

preprint2013arXiv

Terahertz imaging with sub-wavelength resolution by femtosecond laser filament in air

Terahertz (THz) imaging provides cutting edge technique in biology, medical sciences and non-destructive evaluation. However, due to the long wavelength of the THz wave, the obtained resolution of THz imaging is normally a few hundred microns and is much lower than that of the traditional optical imaging. We introduce a sub-wavelength resolution THz imaging technique which uses the THz radiation generated by a femtosecond laser filament in air as the probe. This method is based on the fact that the femtosecond laser filament forms a waveguide for the THz wave in air. The diameter of the THz beam, which propagates inside the filament, varies from 20 μm to 50 μm, which is significantly smaller than the wavelength of the THz wave. Using this highly spatially confined THz beam as the probe, THz imaging with resolution as high as 20 μm (~λ/38) can be realized.

preprint2013arXiv

Terahertz Wave Guiding by Femtosecond Laser Filament in Air

Femtosecond laser filament generates strong terahertz (THz) pulse in air. In this paper, THz pulse waveform generated by femtosecond laser filament has been experimentally investigated as a function of the length of the filament. Superluminal propagation of THz pulse has been uncovered, indicating that the filament creates a THz waveguide in air. Numerical simulation has confirmed that the waveguide is formed because of the radially non-uniform refractive index distribution inside the filament. The underlying physical mechanisms and the control techniques of this type THz pulse generation method might be revisited based on our findings. It might also potentially open a new approach for long-distance propagation of THz wave in air.

Weiwei Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

EPPAC: Entity Pre-typing Relation Classification with Prompt AnswerCentralizing

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

Crystalline Electric-Field Excitations in Quantum Spin Liquids Candidate $NaYbSe_{2}$

Opinion Maximization in Social Trust Networks

Pressure induced metallization and possible unconventional superconductivity in spin liquid $NaYbSe_{2}$

Spectral Domain Z-scan Technique

Controllable light capsules employing modified Bessel-Gauss beams

Partial eruption of a filament with twisting nonuniform fields

Time-resolved shadowgraphs of transient plasma induced by spatiotemporally focused femtosecond laser pulses in fused silica glass

Laterally emitted surface second harmonic generation in a single ZnTe nanowire

Terahertz imaging with sub-wavelength resolution by femtosecond laser filament in air

Terahertz Wave Guiding by Femtosecond Laser Filament in Air