Researcher profile

Weiwei Liu

Weiwei Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Long-context modeling is becoming a core capability of modern large vision-language models (LVLMs), enabling sustained context management across long-document understanding, video analysis, and multi-turn tool use in agentic workflows. Yet practical training recipes remain insufficiently explored, particularly for designing and balancing long-context data mixtures. In this work, we present a systematic study of long-context continued pre-training for LVLMs, extending a 7B model from 32K to 128K context with extensive ablations on long-document data. We first show that long-document VQA is substantially more effective than OCR transcription. Building on this observation, our ablations further yield three key findings: i) for sequence-length distribution, balanced data outperforms target-length-focused data (e.g., 128K), suggesting that long-context ability requires generalizable key-information retrieval across various lengths and positions; ii) retrieval remains the primary bottleneck, favoring retrieval-heavy mixtures with modest reasoning data for task diversity; and iii) pure long-document VQA largely preserves short-context capabilities, suggesting that instruction-formatted long data reduces the need for short-data mixing. Based on these findings, we introduce MMProLong, obtained by long-context continued pre-training from Qwen2.5-VL-7B with only a 5B-token budget. MMProLong improves long-document VQA scores by 7.1% and maintains strong performance at 256K and 512K contexts beyond its 128K training window, without additional training. It further generalizes to webpage-based multimodal needle retrieval, long-context vision-text compression, and long-video understanding without task-specific supervision. Overall, our study establishes a practical LongPT recipe and an empirical foundation for advancing long-context vision-language models.

preprint2022arXiv

EPPAC: Entity Pre-typing Relation Classification with Prompt AnswerCentralizing

Relation classification (RC) aims to predict the relationship between a pair of subject and object in a given context. Recently, prompt tuning approaches have achieved high performance in RC. However, existing prompt tuning approaches have the following issues: (1) numerous categories decrease RC performance; (2) manually designed prompts require intensive labor. To address these issues, a novel paradigm, Entity Pre-typing Relation Classification with Prompt Answer Centralizing(EPPAC) is proposed in this paper. The entity pre-tying in EPPAC is presented to address the first issue using a double-level framework that pre-types entities before RC and prompt answer centralizing is proposed to address the second issue. Extensive experiments show that our proposed EPPAC outperformed state-of-the-art approaches on TACRED and TACREV by 14.4% and 11.1%, respectively. The code is provided in the Supplementary Materials.

preprint2022arXiv

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

Optical character recognition (OCR) technology has been widely used in various scenes, as shown in Figure 1. Designing a practical OCR system is still a meaningful but challenging task. In previous work, considering the efficiency and accuracy, we proposed a practical ultra lightweight OCR system (PP-OCR), and an optimized version PP-OCRv2. In order to further improve the performance of PP-OCRv2, a more robust OCR system PP-OCRv3 is proposed in this paper. PP-OCRv3 upgrades the text detection model and text recognition model in 9 aspects based on PP-OCRv2. For text detector, we introduce a PAN module with large receptive field named LK-PAN, a FPN module with residual attention mechanism named RSE-FPN, and DML distillation strategy. For text recognizer, the base model is replaced from CRNN to SVTR, and we introduce lightweight text recognition network SVTR LCNet, guided training of CTC by attention, data augmentation strategy TextConAug, better pre-trained model by self-supervised TextRotNet, UDML, and UIM to accelerate the model and improve the effect. Experiments on real data show that the hmean of PP-OCRv3 is 5% higher than PP-OCRv2 under comparable inference speed. All the above mentioned models are open-sourced and the code is available in the GitHub repository PaddleOCR which is powered by PaddlePaddle.

preprint2020arXiv

Crystalline Electric-Field Excitations in Quantum Spin Liquids Candidate $NaYbSe_{2}$

Very recently we revealed a large family of triangular lattice quantum spin liquid candidates named rare-earth chalcogenides, which features a high-symmetry structure without structural/charge disorders and spin impurities, and may serve as an ideal platform exploring spin liquid physics. The knowledge of crystalline electric-field (CEF) excitations is an essential step to explore the fundamental magnetism of rare-earth spin systems. Here we employed inelastic neutron scattering (INS) and Raman scattering (RS) to carry out a comprehensive CFE investigation on $NaYbSe_{2}$, a promising representative of the family. By comparison with its nonmagnetic compound $NaLuSe_{2}$, we are able to identify the CEF excitations at 15.8, 24.3 and 30.5 meV at 5K. The selected cuts of the INS spectra are well re-produced with a large anisotropy of $g$ factors ($g_{ab}:g_{c}\sim3:1$). Further, the CEF excitations are explained well by our calculations based on the point charge model. Interestingly, $NaYbSe_{2}$ exhibits an unusual CEF shift to higher energies with increasing temperatures, and the Raman mode close to the first CEF excitation shows an anomalously large softening with decreasing temperatures. The absence of the anomalies in $NaLuSe_{2}$ clearly demonstrates a CEF-phonon coupling not reported in the family. It can be understood in term of the weaker electronegativity of Se. The fact that the smallest first CEF excitation in the sub-family of $NaYbCh_{2}$ is $\sim$ 180K (Ch=O, S, Se), guarantees that the sub-family can be strictly described with an effective S=1/2 picture at sufficiently low temperatures. Interestingly the CEF-phonon coupling revealed here may present alternative possibilities to manipulate the spin systems.

preprint2020arXiv

Opinion Maximization in Social Trust Networks

Social media sites are now becoming very important platforms for product promotion or marketing campaigns. Therefore, there is broad interest in determining ways to guide a site to react more positively to a product with a limited budget. However, the practical significance of the existing studies on this subject is limited for two reasons. First, most studies have investigated the issue in oversimplified networks in which several important network characteristics are ignored. Second, the opinions of individuals are modeled as bipartite states(e.g., support or not) in numerous studies, however, this setting is too strict for many real scenarios. In this study, we focus on social trust networks(STNs), which have the significant characteristics ignored in the previous studies. We generalized a famed continuous-valued opinion dynamics model for STNs, which is more consistent with real scenarios. We subsequently formalized two novel problems for solving the issue in STNs. Moreover, we developed two matrix-based methods for these two problems and experiments on real-world datasets to demonstrate the practical utility of our methods.

preprint2020arXiv

Pressure induced metallization and possible unconventional superconductivity in spin liquid $NaYbSe_{2}$

Beyond the conventional electron pairing mediated by phonons, high-temperature superconductivity in cuprates is believed to stem from quantum spin liquid (QSL). The unconventional superconductivity by doping a spin liquid/Mott insulator, is a long-sought goal but a principal challenge in condensed matter physics because of the lack of an ideal QSL platform. Here we report the pressure induced metallization and possible unconventional superconductivity in $NaYbSe_{2}$, which belongs to a large and ideal family of triangular lattice spin liquid we revealed recently and is evidenced to possess a QSL ground state. The charge gap of NaYbSe2 is gradually reduced by applying pressures, and at ~20 GPa the crystal jumps into a superconducting (SC) phase with Tc ~ 5.8 K even before the insulating gap is completely closed. The metallization is confirmed by further high-pressure experiments but the sign of superconductivity is not well repeated. No symmetry breaking accompanies the SC transition, as indicated by X-ray diffraction and low-temperature Raman experiments under high pressures. This intrinsically connects QSL and SC phases, and suggests an unconventional superconductivity developed from QSL. We further observed the magnetic-field-tuned superconductor-insulator transition which is analogous to that found in the underdoped cuprate superconductor $La_{2-x}Sr_{x}CuO_{4}$. The study is expected to inspire interest in exploring new types of superconductors and sheds light into the intriguing physics from a spin liquid/Mott insulator to a superconductor.

preprint2020arXiv

Spectral Domain Z-scan Technique

Characterizing the nonlinear optical properties of various materials plays a prerequisite role in nonlinear optics. Among different methods, the well-known Z-scan technique and the modified versions have been recognized as a simple and accurate method for measuring both the real and imaginary parts of the nonlinear refractive index. However, all the Z-scan methods based on detecting small beam variations put forward a severe restriction on the roughness of materials. Therefore, measuring nonlinear optical properties of highly scattering media still remain challenging. Inspired by the innovation of conventional Z-scan method that converting the wavefront phase shift to the easily measurable spatial pattern in far-field, the alternative spectral domain Z-scan technique was presented in this paper. It has a great potential for highly scattering medium, based on the scattering efficiency is insensitive to the wavelength for Mie scattering as the wavelengths are far smaller than the roughness. Moreover, to demonstrate the advantages of spectral domain Z-scan technique, the nonlinear refraction of polished slides and frosted slides was measured, which agrees well with previous reports.