Source author record

Yujie Wang

Yujie Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.soft Computer Vision Artificial Intelligence cond-mat.dis-nn Machine Learning cond-mat.mtrl-sci cond-mat.stat-mech Computation and Language cs.CY Distributed, Parallel, and Cluster Computing eess.SP eess.SY Information Theory math.IT physics.flu-dyn physics.optics Systems and Control

Catalog footprint

What is connected

23works

17topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images

We introduce AEGIS, A holistic benchmark for Evaluating forensic analysis of AI-Generated academic ImageS. Compared to existing benchmarks, AEGIS features three key advances: (1) Domain-Specific Complexity: covering seven academic categories with 39 fine-grained subtypes, exposing intrinsic forensic difficulty, where even GPT-5.1 reaches 48.80% overall performance and expert models achieve only limited localization accuracy (IoU 30.09%); (2) Diverse Forgery Simulations: modeling four prevalent academic forgery strategies across 25 generative models, with 11 yielding average forensic accuracy below 50%, showing that forensics lag behind generative advances; and (3) Multi-Dimensional Forensic Evaluation: jointly assessing detection, reasoning, and localization, revealing complementary strengths between model families, with multimodal large language models (MLLMs) at 84.74% accuracy in textual artifact recognition and expert detectors peaking at 79.54% accuracy in binary authenticity detection. By evaluating 25 leading MLLMs, nine expert models, and one unified multimodal understanding and generation model, AEGIS serves as a diagnostic testbed exposing fundamental limitations in academic image forensics.

preprint2026arXiv

Full-Time-Scale Power Management Strategy for Hybrid AC/DC/DS Microgrid with Dynamic Concatenation and Autonomous Frequency / Voltage Restorations

Hybrid AC/DC microgrids with distributed energy storage (DS) improve power reliability in remote areas. Existing power management methods either focus on steady-state power sharing or transient inertia support, but rarely combine both. They also often ignore frequency and voltage deviations caused by droop control, which can harm sensitive loads. To overcome these issues, this paper proposes a full-time-scale (FTS) power management strategy that unifies transient inertia sharing and steady-state power allocation through a novel dynamic concatenator. It also introduces autonomous frequency/voltage restoration to eliminate steady-state deviations in each subgrid. Additionally, a global equivalent circuit model (GECM) is developed to simplify system analysis and design. Experiments confirm that the approach maintains nominal frequency and voltage in steady state while enabling seamless transition between transient inertia support and proportional power sharing across all time scales.

preprint2023arXiv

From creep to flow: Granular materials under cyclic shear

Granular materials such as sand, powders, and grains are omnipresent in daily life, industrial applications, and earth-science [1]. When unperturbed, they form stable structures that resemble the ones of other amorphous solids like metallic and colloidal glasses [2]. It is commonly conjectured that all these amorphous materials show a universal mechanical response when sheared slowly, i.e., to have an elastic regime, followed by yielding [3]. Here we use X-ray tomography to determine the microscopic dynamics of a cyclically sheared granular system in three dimensions. Independent of the shear amplitude $Γ$, the sample shows a cross-over from creep to diffusive dynamics, indicating that granular materials have no elastic response and always yield, in stark contrast to other glasses. The overlap function [4] reveals that at large $Γ$ yielding is a simple cross-over phenomenon, while for small $Γ$ it shows features of a first order transition with a critical point at $Γ\approx 0.1$ at which one finds a pronounced slowing down and dynamical heterogeneity. Our findings are directly related to the surface roughness of granular particles which induces a micro-corrugation to the potential energy landscape, thus creating relaxation channels that are absent in simple glasses. These processes must be understood for reaching an understanding of the complex relaxation dynamics of granular systems.

preprint2022arXiv

BLCU-ICALL at SemEval-2022 Task 1: Cross-Attention Multitasking Framework for Definition Modeling

This paper describes the BLCU-ICALL system used in the SemEval-2022 Task 1 Comparing Dictionaries and Word Embeddings, the Definition Modeling subtrack, achieving 1st on Italian, 2nd on Spanish and Russian, and 3rd on English and French. We propose a transformer-based multitasking framework to explore the task. The framework integrates multiple embedding architectures through the cross-attention mechanism, and captures the structure of glosses through a masking language model objective. Additionally, we also investigate a simple but effective model ensembling strategy to further improve the robustness. The evaluation results show the effectiveness of our solution. We release our code at: https://github.com/blcuicall/SemEval2022-Task1-DM.

preprint2022arXiv

Channel Estimation for Wideband MmWave MIMO OFDM System Exploiting Block Sparsity

In this letter, we investigate time-domain channel estimation for wideband millimeter wave (mmWave) MIMO OFDM system. By transmitting frequency-domain pilot symbols as well as different beamforming vectors, we observe that the time-domain mmWave MIMO channels exhibit channel delay sparsity and especially block sparsity among different spatial directions. Then we propose a time-domain channel estimation exploiting block sparsity (TDCEBS) scheme, which always aims at finding the best nonzero block achieving the largest projection of the residue at each iterations. In particular, we evaluate the system performance using the QuaDRiGa which is recommended by 5G New Radio to generate wideband mmWave MIMO channels. The effectiveness of the proposed TDCEBS scheme is verified by the simulation results, as the proposed scheme outperforms the existing schemes.

preprint2022arXiv

DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection

The Mean-Teacher (MT) scheme is widely adopted in semi-supervised object detection (SSOD). In MT, the sparse pseudo labels, offered by the final predictions of the teacher (e.g., after Non Maximum Suppression (NMS) post-processing), are adopted for the dense supervision for the student via hand-crafted label assignment. However, the sparse-to-dense paradigm complicates the pipeline of SSOD, and simultaneously neglects the powerful direct, dense teacher supervision. In this paper, we attempt to directly leverage the dense guidance of teacher to supervise student training, i.e., the dense-to-dense paradigm. Specifically, we propose the Inverse NMS Clustering (INC) and Rank Matching (RM) to instantiate the dense supervision, without the widely used, conventional sparse pseudo labels. INC leads the student to group candidate boxes into clusters in NMS as the teacher does, which is implemented by learning grouping information revealed in NMS procedure of the teacher. After obtaining the same grouping scheme as the teacher via INC, the student further imitates the rank distribution of the teacher over clustered candidates through Rank Matching. With the proposed INC and RM, we integrate Dense Teacher Guidance into Semi-Supervised Object Detection (termed DTG-SSOD), successfully abandoning sparse pseudo labels and enabling more informative learning on unlabeled data. On COCO benchmark, our DTG-SSOD achieves state-of-the-art performance under various labelling ratios. For example, under 10% labelling ratio, DTG-SSOD improves the supervised baseline from 26.9 to 35.9 mAP, outperforming the previous best method Soft Teacher by 1.9 points.

preprint2022arXiv

INTERN: A New Learning Paradigm Towards General Vision

Enormous waves of technological innovations over the past several years, marked by the advances in AI technologies, are profoundly reshaping the industry and the society. However, down the road, a key challenge awaits us, that is, our capability of meeting rapidly-growing scenario-specific demands is severely limited by the cost of acquiring a commensurate amount of training data. This difficult situation is in essence due to limitations of the mainstream learning paradigm: we need to train a new model for each new scenario, based on a large quantity of well-annotated data and commonly from scratch. In tackling this fundamental problem, we move beyond and develop a new learning paradigm named INTERN. By learning with supervisory signals from multiple sources in multiple stages, the model being trained will develop strong generalizability. We evaluate our model on 26 well-known datasets that cover four categories of tasks in computer vision. In most cases, our models, adapted with only 10% of the training data in the target domain, outperform the counterparts trained with the full set of data, often by a significant margin. This is an important step towards a promising prospect where such a model with general vision capability can dramatically reduce our reliance on data, thus expediting the adoption of AI technologies. Furthermore, revolving around our new paradigm, we also introduce a new data system, a new architecture, and a new benchmark, which, together, form a general vision ecosystem to support its future development in an open and inclusive manner. See project website at https://opengvlab.shlab.org.cn .

preprint2022arXiv

Neural Implicit 3D Shapes from Single Images with Spatial Patterns

Neural implicit functions have achieved impressive results for reconstructing 3D shapes from single images. However, the image features for describing 3D point samplings of implicit functions are less effective when significant variations of occlusions, views, and appearances exist from the image. To better encode image features, we study a geometry-aware convolutional kernel to leverage geometric relationships of point samplings by the proposed \emph{spatial pattern}, i.e., a structured point set. Specifically, the kernel operates at 2D projections of 3D points from the spatial pattern. Supported by the spatial pattern, the 2D kernel encodes geometric information that is crucial for 3D reconstruction tasks, while traditional ones mainly consider appearance information. Furthermore, to enable the network to discover more adaptive spatial patterns for further capturing non-local contextual information, the kernel is devised to be deformable manipulated by a spatial pattern generator. Experimental results on both synthetic and real datasets demonstrate the superiority of the proposed method. Pre-trained models, codes, and data are available at https://github.com/yixin26/SVR-SP.

preprint2022arXiv

PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection

In this paper, we delve into two key techniques in Semi-Supervised Object Detection (SSOD), namely pseudo labeling and consistency training. We observe that these two techniques currently neglect some important properties of object detection, hindering efficient learning on unlabeled data. Specifically, for pseudo labeling, existing works only focus on the classification score yet fail to guarantee the localization precision of pseudo boxes; For consistency training, the widely adopted random-resize training only considers the label-level consistency but misses the feature-level one, which also plays an important role in ensuring the scale invariance. To address the problems incurred by noisy pseudo boxes, we design Noisy Pseudo box Learning (NPL) that includes Prediction-guided Label Assignment (PLA) and Positive-proposal Consistency Voting (PCV). PLA relies on model predictions to assign labels and makes it robust to even coarse pseudo boxes; while PCV leverages the regression consistency of positive proposals to reflect the localization quality of pseudo boxes. Furthermore, in consistency training, we propose Multi-view Scale-invariant Learning (MSL) that includes mechanisms of both label- and feature-level consistency, where feature consistency is achieved by aligning shifted feature pyramids between two images with identical content but varied scales. On COCO benchmark, our method, termed PSEudo labeling and COnsistency training (PseCo), outperforms the SOTA (Soft Teacher) by 2.0, 1.8, 2.0 points under 1%, 5%, and 10% labelling ratios, respectively. It also significantly improves the learning efficiency for SSOD, e.g., PseCo halves the training time of the SOTA approach but achieves even better performance. Code is available at https://github.com/ligang-cs/PseCo.

preprint2022arXiv

The Tree Loss: Improving Generalization with Many Classes

Multi-class classification problems often have many semantically similar classes. For example, 90 of ImageNet's 1000 classes are for different breeds of dog. We should expect that these semantically similar classes will have similar parameter vectors, but the standard cross entropy loss does not enforce this constraint. We introduce the tree loss as a drop-in replacement for the cross entropy loss. The tree loss re-parameterizes the parameter matrix in order to guarantee that semantically similar classes will have similar parameter vectors. Using simple properties of stochastic gradient descent, we show that the tree loss's generalization error is asymptotically better than the cross entropy loss's. We then validate these theoretical results on synthetic data, image data (CIFAR100, ImageNet), and text data (Twitter).

preprint2021arXiv

V2V-Based Task Offloading and Resource Allocation in Vehicular Edge Computing Networks

In the research and application of vehicle ad hoc networks (VANETs), it is often assumed that vehicles obtain cloud computing services by accessing to roadside units (RSUs). However, due to the problems of insufficient construction quantity, limited communication range and overload of calculation load of roadside units, the calculation mode relying only on vehicle to roadside units is difficult to deal with complex and changeable calculation tasks. In this paper, when the roadside unit is missing, the vehicle mobile unit is regarded as a natural edge computing node to make full use of the excess computing power of mobile vehicles and perform the offloading task of surrounding mobile vehicles in time. In this paper, the OPFTO framework is designed, an improved task allocation algorithm HGSA is proposed, and the pre-filtering process is designed with full consideration of the moving characteristics of vehicles. In addition, vehicle simulation experiments show that the proposed strategy has the advantages of low delay and high accuracy compared with other task scheduling strategies, which provides a reference scheme for the construction of Urban Intelligent Transportation in the future.

preprint2020arXiv

Arbitrary Polarization Conversion Dichroism Metasurfaces for All-in-One Full Poincaré Sphere Polarizers

The control of polarization, an essential property of light, is of wide scientific and technological interest. Polarizer is an indispensable optical element for direct polarization generations. Except common linear and circular polarizations, however, arbitrary polarization generation heavily resorts to bulky optical components by cascading linear polarizers and waveplates. Here, we present a general strategy for designing all-in-one full Poincaré sphere polarizers based on perfect arbitrary polarization conversion dichroism, and realize it in a monolayer all-dielectric metasurface. It allows preferential transmission and conversion of one polarization state locating at an arbitrary position of the Poincaré sphere to its handedness-flipped state, while completely blocking its orthogonal state. In contrast to previous work with limited flexibility to only linear or circular polarizations, our method manifests perfect dichroism close to 100% in theory and exceeding 90% in experiments for arbitrary polarization states. Leveraging this tantalizing dichroism, our demonstration of monolithic full Poincaré sphere polarization generators directly from unpolarized light can enormously extend the scope of meta-optics and dramatically push the state-of-the-art nanophotonic devices.

preprint2020arXiv

Connecting shear localization with the long-range correlated polarized stress fields in granular materials

One long-lasting puzzle in amorphous solids is shear localization, where local plastic deformation involves cooperative particle rearrangements in small regions of a few inter-particle distances, self-organizing into shear bands and eventually leading to the material failure. Understanding the connection between the structure and dynamics of amorphous solids is essential in physics, material sciences, geotechnical and civil engineering, and geophysics. Here we show a deep connection between shear localization and the intrinsic structures of internal stresses in an isotropically jammed granular material subject to shear. Specifically, we find strong (anti)correlations between the micro shear bands and two polarized stress fields along two directions of maximal shear. By exploring the tensorial characteristics and the rotational symmetry of force network, we reveal that such profound connection is a result of symmetry breaking by shear. Finally, we provide the solid experimental evidence of long-range correlated inherent shear stress in an isotropically jammed granular system.

preprint2020arXiv

DeGNN: Characterizing and Improving Graph Neural Networks with Graph Decomposition

Despite the wide application of Graph Convolutional Network (GCN), one major limitation is that it does not benefit from the increasing depth and suffers from the oversmoothing problem. In this work, we first characterize this phenomenon from the information-theoretic perspective and show that under certain conditions, the mutual information between the output after $l$ layers and the input of GCN converges to 0 exponentially with respect to $l$. We also show that, on the other hand, graph decomposition can potentially weaken the condition of such convergence rate, which enabled our analysis for GraphCNN. While different graph structures can only benefit from the corresponding decomposition, in practice, we propose an automatic connectivity-aware graph decomposition algorithm, DeGNN, to improve the performance of general graph neural networks. Extensive experiments on widely adopted benchmark datasets demonstrate that DeGNN can not only significantly boost the performance of corresponding GNNs, but also achieves the state-of-the-art performances.

preprint2020arXiv

DMCP: Differentiable Markov Channel Pruning for Neural Networks

Recent works imply that the channel pruning can be regarded as searching optimal sub-structure from unpruned networks. However, existing works based on this observation require training and evaluating a large number of structures, which limits their application. In this paper, we propose a novel differentiable method for channel pruning, named Differentiable Markov Channel Pruning (DMCP), to efficiently search the optimal sub-structure. Our method is differentiable and can be directly optimized by gradient descent with respect to standard task loss and budget regularization (e.g. FLOPs constraint). In DMCP, we model the channel pruning as a Markov process, in which each state represents for retaining the corresponding channel during pruning, and transitions between states denote the pruning process. In the end, our method is able to implicitly select the proper number of channels in each layer by the Markov process with optimized transitions. To validate the effectiveness of our method, we perform extensive experiments on Imagenet with ResNet and MobilenetV2. Results show our method can achieve consistent improvement than state-of-the-art pruning methods in various FLOPs settings. The code is available at https://github.com/zx55/dmcp

preprint2020arXiv

Friction-controlled entropy-stability competition in granular systems

Using cyclic shear to drive a two dimensional granular system, we determine the structural characteristics for different inter-particle friction coefficients. These characteristics are the result of a competition between mechanical stability and entropy, with the latter's effect increasing with friction. We show that a parameter-free maximum-entropy argument alone predicts an exponential cell order distribution, with excellent agreement with the experimental observation. We show that friction only tunes the mean cell order and, consequently, the exponential decay rate and the packing fraction. We further show that cells, which can be very large in such systems, are short-lived, implying that our systems are liquid-like rather than glassy.

preprint2020arXiv

Granular Segregation Mechanisms by Cyclic Shear

We present an X-ray tomography study of the segregation mechanisms of tracer particles in a three-dimensional cyclically sheared bi-disperse granular medium. Big tracers are dragged by convection to rise to the top surface and then remain trapped there due to the small downward convection cross-section, which leads to segregation. Additionally, we also find that the local structural up-down asymmetry due to arching effect around big tracers will induce the tracers to have a net upward displacement against its smaller neighbors, which is another mechanism for segregation.

preprint2020arXiv

Leidenfrost drop impact on inclined superheated substrates

In real applications, drops always impact on solid walls with various inclinations. For the oblique impact of a Leidenfrost drop, which has a vapor layer under its bottom surface to prevent its direct contact with the superheated substrate, the drop can nearly frictionlessly slide along the substrate accompanied by the spreading and the retracting. To individually study these processes, we experimentally observe ethanol drops impact on superheated inclined substrates using high-speed imaging from two different views synchronously. We first study the dynamic Leidenfrost temperature, which mainly depends on the normal Weber number ${We}_\perp $. Then the substrate temperature is set to be high enough to study the Leidenfrost drop behaviors. During the spreading process, drops always keep uniform. And the maximum spreading factor $D_m/D_0$ follows a power-law dependence on the large normal Weber number ${We}_\perp $ as $D_m/D_0 = \sqrt{We_\perp /12+2}$ for $We_\perp \geq 30$. During the retracting process, drops with low impact velocities become non-uniform due to the gravity effect. For the sliding process, the residence time of all studied drops is nearly a constant, which is not affected by the inclination and $We$ number. The frictionless vapor layer results in the dimensionless sliding distance $L/D_0$ follows a power-law dependence on the parallel Weber number $We_\parallel$ as $L/D_0 \propto We_\parallel^{1/2}$. Without direct contact with the substrate, the behaviors of drops can be separately determined by ${We}_\perp $ and $We_\parallel$. When the impact velocity is too high, the drop fragments into many tiny droplets, which is called the splashing phenomenon. The critical splashing criterion is found to be $We_\perp ^*\simeq$ 120 or $K_\perp = We_\perp Re_\perp^{1/2} \simeq$ 5300 in the current parameter regime.

preprint2020arXiv

Level statistics and Anderson delocalization in two-dimensional granular materials

Contrary to the theoretical predictions that all waves in two-dimensional disordered materials are localized, Anderson localization is observed only for sufficiently high frequencies in an isotropically jammed two-dimensional disordered granular packing of photoelastic disks. More specifically, we have performed an experiment in analyzing the level statistics of normal mode vibrations. We observe delocalized modes in the low-frequency boson-peak regime and localized modes in the high frequency regime with the crossover frequency just below the Debye frequency. We find that the level-distance distribution obeys Gaussian-Orthogonal-Ensemble (GOE) statistics, i.e. Wigner-Dyson distribution, in the boson-peak regime, whereas those in the high-frequency regime Poisson statistics is observed. The scenario is found to coincide with that of harmonic vibrational excitations in three-dimensional disordered solids.

preprint2020arXiv

X-ray tomography investigation of cyclically sheared granular materials

We perform combined X-ray tomography and shear force measurements on a cyclically sheared granular system with highly transient behaviors, and obtain the evolution of microscopic structures and the macroscopic shear force during the shear cycle. We explain the macroscopic behaviors of the system based on microscopic processes, including the particle level structural rearrangement and frictional contact variation. Specifically, we show how contact friction can induce large structural fluctuations and cause significant shear dilatancy effect for granular materials, and we also construct an empirical constitutive relationship for the macroscopic shear force.

preprint2015arXiv

The structural origin of the hard-sphere glass transition in granular packing

Glass transition is accompanied by a rapid growth of the structural relaxation time and a concomitant decrease of configurational entropy. It remains unclear whether the transition has a thermodynamic origin, and whether the dynamic arrest is associated with the growth of a certain static order. Using granular packing as a model hard-sphere glass, we show the glass transition as a thermodynamic phase transition with a "hidden" polytetrahedral order. This polytetrahedral order is spatially correlated with the slow dynamics. It is geometrically frustrated and has a peculiar fractal dimension. Additionally, as the packing fraction increases, its growth follows an entropy-driven nucleation process, similar to that of the random first-order transition theory. Our study essentially identifies a long-sought-after structural glass order in hard-sphere glasses.

preprint2014arXiv

Dynamic synchrotron X-ray imaging study of effective temperature in a vibrated granular medium

We present a dynamic synchrotron X-ray imaging study of the effective temperature $T_{eff}$ in a vibrated granular medium. By tracking the directed motion and the fluctuation dynamics of the tracers inside, we obtained $T_{eff}$ of the system using Einstein relation. We found that as the system unjams with increasing vibration intensities $Γ$, the structural relaxation time $τ$ increases substantially which can be fitted by an Arrhenius law using $T_{eff}$. And the characteristic energy scale of structural relaxation yielded by the Arrhenius fitting is $E = 0.21 \pm 0.02$ $pd^3$, where $p$ is the pressure and $d$ is the background particle diameter, which is consistent with those from hard sphere simulations in which the structural relaxation happens via the opening up of free volume against pressure.

preprint2013arXiv

Bridges in three-dimensional granular packings: experiments and simulations

In this letter, we present the first experimental study of bridge structures in three-dimensional dry granular packings. When bridges are small, they are predominantly 'linear', and have an exponential size distribution. Larger, predominantly 'complex' bridges, are confirmed to follow a power-law size distribution. Our experiments, which use X-ray tomography, are in good agreement with the simulations presented here, for the distribution of sizes, end-to-end lengths, base extensions and orientations of predominantly linear bridges. Quantitative differences between the present experiment and earlier simulations suggest that packing fraction is an important determinant of bridge structure.

Yujie Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

23 published item(s)

AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images

Full-Time-Scale Power Management Strategy for Hybrid AC/DC/DS Microgrid with Dynamic Concatenation and Autonomous Frequency / Voltage Restorations

From creep to flow: Granular materials under cyclic shear

BLCU-ICALL at SemEval-2022 Task 1: Cross-Attention Multitasking Framework for Definition Modeling

Channel Estimation for Wideband MmWave MIMO OFDM System Exploiting Block Sparsity

DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection

INTERN: A New Learning Paradigm Towards General Vision

Neural Implicit 3D Shapes from Single Images with Spatial Patterns

PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection

The Tree Loss: Improving Generalization with Many Classes

V2V-Based Task Offloading and Resource Allocation in Vehicular Edge Computing Networks

Arbitrary Polarization Conversion Dichroism Metasurfaces for All-in-One Full Poincaré Sphere Polarizers

Connecting shear localization with the long-range correlated polarized stress fields in granular materials

DeGNN: Characterizing and Improving Graph Neural Networks with Graph Decomposition

DMCP: Differentiable Markov Channel Pruning for Neural Networks

Friction-controlled entropy-stability competition in granular systems

Granular Segregation Mechanisms by Cyclic Shear

Leidenfrost drop impact on inclined superheated substrates

Level statistics and Anderson delocalization in two-dimensional granular materials

X-ray tomography investigation of cyclically sheared granular materials

The structural origin of the hard-sphere glass transition in granular packing

Dynamic synchrotron X-ray imaging study of effective temperature in a vibrated granular medium

Bridges in three-dimensional granular packings: experiments and simulations