Researcher profile

Jun Feng

Jun Feng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
17works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

17 published item(s)

preprint2026arXiv

A Conflict-aware Evidential Framework for Reliable Sleep Stage Classification

Multi-view learning has been widely applied for sleep stage classification using multi-modal data. However, existing methods typically assume that different modalities are well-aligned, which is often unattainable in real-world scenarios, thereby compromising the reliability of the staging results. In this paper, we propose ConfSleepNet, a conflict-aware evidential framework that dynamically resolves inter-view conflicts. The framework consists of multi-view evidence extraction and conflict-aware aggregation. In the first phase, it learns category-related evidence from different modalities, which represents the degree of support for individual sleep stages. Considering the inherent characteristics of varying modalities, we propose hybrid category structures for different modalities to promote more reasonable evidence learning. In the second phase, view-specific opinions, including prediction results and uncertainty, are constructed from the learned evidence. Notably, we propose a novel conflict-aware aggregation method that integrates these view-specific opinions into a reliable joint decision. This mechanism can effectively resolve conflicts among opinions and synthesize them into a reliable joint decision. Both theoretical analysis and experimental results demonstrate the effectiveness of ConfSleepNet in sleep staging tasks. The code is available at https://github.com/By4te/ConfSleepNet_ICML2026/.

preprint2026arXiv

UniShield: Unified Face Attack Detection via KG-Informed Multimodal Reasoning

Unified face attack detection (UAD) requires recognizing physical spoofing and digital forgery within a shared decision space, yet existing discriminative or prompt-based methods largely rely on appearance correlations and provide limited evidence-grounded reasoning. We propose UniShield, a knowledge-grounded multimodal reasoning framework for unified face attack defense. UniShield constructs a Face Attack Knowledge Graph (FAKG) that links attack categories to diagnostic visual cues and attack-conditioned relations, and uses it to synthesize 52,025 FAKG-QA examples for Attack-Graph Instruction Tuning (AGIT). To improve rationale consistency, we further introduce Graph-Consistent Reasoning Optimization (GCRO), a GRPO-based objective with a KG-consistency reward that encourages generated rationales to match graph-supported cues while penalizing incompatible claims. Experiments on our multimodal UAD benchmark show that UniShield achieves strong performance across binary, coarse-grained, and fine-grained protocols, with consistently high ACC and low HTER. These results suggest that structured attack knowledge can improve both detection accuracy and reasoning reliability over discriminative baselines and general-purpose MLLMs. Our code will be released at https://anonymous.4open.science/r/Unishield-A6A3/.

preprint2022arXiv

A Deep CNN Architecture with Novel Pooling Layer Applied to Two Sudanese Arabic Sentiment Datasets

Arabic sentiment analysis has become an important research field in recent years. Initially, work focused on Modern Standard Arabic (MSA), which is the most widely-used form. Since then, work has been carried out on several different dialects, including Egyptian, Levantine and Moroccan. Moreover, a number of datasets have been created to support such work. However, up until now, less work has been carried out on Sudanese Arabic, a dialect which has 32 million speakers. In this paper, two new publicly available datasets are introduced, the 2-Class Sudanese Sentiment Dataset (SudSenti2) and the 3-Class Sudanese Sentiment Dataset (SudSenti3). Furthermore, a CNN architecture, SCM, is proposed, comprising five CNN layers together with a novel pooling layer, MMA, to extract the best features. This SCM+MMA model is applied to SudSenti2 and SudSenti3 with accuracies of 92.75% and 84.39%. Next, the model is compared to other deep learning classifiers and shown to be superior on these new datasets. Finally, the proposed model is applied to the existing Saudi Sentiment Dataset and to the MSA Hotel Arabic Review Dataset with accuracies 85.55% and 90.01%.

preprint2022arXiv

A New Amharic Speech Emotion Dataset and Classification Benchmark

In this paper we present the Amharic Speech Emotion Dataset (ASED), which covers four dialects (Gojjam, Wollo, Shewa and Gonder) and five different emotions (neutral, fearful, happy, sad and angry). We believe it is the first Speech Emotion Recognition (SER) dataset for the Amharic language. 65 volunteer participants, all native speakers, recorded 2,474 sound samples, two to four seconds in length. Eight judges assigned emotions to the samples with high agreement level (Fleiss kappa = 0.8). The resulting dataset is freely available for download. Next, we developed a four-layer variant of the well-known VGG model which we call VGGb. Three experiments were then carried out using VGGb for SER, using ASED. First, we investigated whether Mel-spectrogram features or Mel-frequency Cepstral coefficient (MFCC) features work best for Amharic. This was done by training two VGGb SER models on ASED, one using Mel-spectrograms and the other using MFCC. Four forms of training were tried, standard cross-validation, and three variants based on sentences, dialects and speaker groups. Thus, a sentence used for training would not be used for testing, and the same for a dialect and speaker group. The conclusion was that MFCC features are superior under all four training schemes. MFCC was therefore adopted for Experiment 2, where VGGb and three other existing models were compared on ASED: RESNet50, Alex-Net and LSTM. VGGb was found to have very good accuracy (90.73%) as well as the fastest training time. In Experiment 3, the performance of VGGb was compared when trained on two existing SER datasets, RAVDESS (English) and EMO-DB (German) as well as on ASED (Amharic). Results are comparable across these languages, with ASED being the highest. This suggests that VGGb can be successfully applied to other languages. We hope that ASED will encourage researchers to experiment with other models for Amharic SER.

preprint2022arXiv

Invariant Content Synergistic Learning for Domain Generalization of Medical Image Segmentation

While achieving remarkable success for medical image segmentation, deep convolution neural networks (DCNNs) often fail to maintain their robustness when confronting test data with the novel distribution. To address such a drawback, the inductive bias of DCNNs is recently well-recognized. Specifically, DCNNs exhibit an inductive bias towards image style (e.g., superficial texture) rather than invariant content (e.g., object shapes). In this paper, we propose a method, named Invariant Content Synergistic Learning (ICSL), to improve the generalization ability of DCNNs on unseen datasets by controlling the inductive bias. First, ICSL mixes the style of training instances to perturb the training distribution. That is to say, more diverse domains or styles would be made available for training DCNNs. Based on the perturbed distribution, we carefully design a dual-branches invariant content synergistic learning strategy to prevent style-biased predictions and focus more on the invariant content. Extensive experimental results on two typical medical image segmentation tasks show that our approach performs better than state-of-the-art domain generalization methods.

preprint2022arXiv

Quantum Fisher information as a probe for Unruh thermality

A long-standing debate on Unruh effect is about its obscure thermal nature. In this Letter, we use quantum Fisher information (QFI) as an effective probe to explore the thermal nature of Unruh effect from both local and global perspectives. By resolving the full dynamics of UDW detector, we find that the QFI is a time-evolving function of detector's energy gap, Unruh temperature $T_U$ and particularities of background field, e.g., mass and spacetime dimensionality. We show that the asymptotic QFI whence detector arrives its equilibrium is solely determined by $T_U$, demonstrating the global side of Unruh thermality alluded by the KMS condition. We also show that the local side of Unruh effect, i.e., the different ways for the detector to approach the same thermal equilibrium, is encoded in the corresponding time-evolution of the QFI. In particular, we find that with massless scalar background the QFI has unique monotonicity in $n=3$ dimensional spacetime, and becomes non-monotonous for $n\neq3$ models where a local peak value exists at early time and for finite acceleration, indicating an enhanced precision of estimation on Unruh temperature at a relative low acceleration can be achieved. Once the field acquiring mass, the related QFI becomes significantly robust against the Unruh decoherence in the sense that its local peak sustains for a very long time. While coupling to a more massive background, the persistence can even be strengthened and the QFI possesses a larger maximal value. Such robustness of QFI can surely facilitate any practical quantum estimation task.

preprint2022arXiv

Scalar perturbation of gravitating double-kink solutions

In this letter, a two-dimensional (2D) gravity-scalar model is studied. This model supports interesting double-kink solutions, and the corresponding metric solutions can be derived analytically. Depending on a tunable parameter $c$, the metric can be symmetric or asymmetric. The Schrödinger-like equation for normal modes of the physical linear perturbation is derived. As $c$ varies, the effective potential can have one or two singular barriers. If $c$ is larger than a critical value, the zero mode will be normalizable, despite of the appearance of a strong repulsive singularity. The double-kink solution is always stable against linear perturbations.

preprint2022arXiv

Thermality of the Unruh effect with intermediate statistics

Utilizing quantum coherence monotone, we reexamine the thermal nature of the Unruh effect of an accelerating detector. We consider an UDW detector coupling to a n-dimensional conformal field in Minkowski spacetime, whose response spectrum generally exhibits an intermediate statistics of (1+1) anyon field. We find that the thermal nature of the Unruh effect guaranteed by KMS condition is characterized by a vanishing asymptotic quantum coherence. We show that the time-evolution of coherence monotone can distinguish the different thermalizing ways of the detector, which depends on the scaling dimension of the conformal primary field. In particular, for the conformal background with certain scaling dimension, we demonstrate that at fixed proper time a revival of coherence can occur even for growing Unruh decoherence. Finally, we show that coherence monotone has distinct dynamics under the Unruh decoherence and a thermal bath for a static observer.

preprint2021arXiv

Lookup subnet based Spatial Graph Convolutional neural Network

Convolutional Neural Networks(CNNs) has achieved remarkable performance breakthrough in Euclidean structure data. Recently, aggregation-transformation based Graph Neural networks(GNNs) gradually produce a powerful performance on non-Euclidean data. In this paper, we propose a cross-correlation based graph convolution method allowing to naturally generalize CNNs to non-Euclidean domains and inherit the excellent natures of CNNs, such as local filters, parameter sharing, flexible receptive field, etc. Meanwhile, it leverages dynamically generated convolution kernel and cross-correlation operators to address the shortcomings of prior methods based on aggregation-transformation or their approximations. Our method has achieved or matched popular state-of-the-art results across three established graph benchmarks: the Cora, Citeseer, and Pubmed citation network datasets.

preprint2020arXiv

Design of optimal illumination patterns in single-pixel imaging using image dictionaries

Single-pixel imaging (SPI) has a major drawback that many sequential illuminations are required for capturing one single image with long acquisition time. Basis illumination patterns such as Fourier patterns and Hadamard patterns can achieve much better imaging efficiency than random patterns. But the performance is still sub-optimal since the basis patterns are fixed and non-adaptive for varying object images. This Letter proposes a novel scheme for designing and optimizing the illumination patterns adaptively from an image dictionary by extracting the common image features using principal component analysis (PCA). Simulation and experimental results reveal that our proposed scheme outperforms conventional Fourier SPI in terms of imaging efficiency.

preprint2020arXiv

Does deep learning always outperform simple linear regression in optical imaging?

Deep learning has been extensively applied in many optical imaging applications in recent years. Despite the success, the limitations and drawbacks of deep learning in optical imaging have been seldom investigated. In this work, we show that conventional linear-regression-based methods can outperform the previously proposed deep learning approaches for two black-box optical imaging problems in some extent. Deep learning demonstrates its weakness especially when the number of training samples is small. The advantages and disadvantages of linear-regression-based methods and deep learning are analyzed and compared. Since many optical systems are essentially linear, a deep learning network containing many nonlinearity functions sometimes may not be the most suitable option.

preprint2020arXiv

Identification of New Assembly Mode in the Heliconical Nematic Phase via Tender Resonant X-ray Scattering

Helical structures are exciting and are utilized in numerous applications ranging from biotechnology to displays to medicine. Accurate description and understanding of resonance effects in helical structures provides crucial knowledge on molecular packing beyond positional ordering. We exam-ined the manifestation of resonance effects in a nematic phase with heliconical structure, the so called twist bend nematic (NTB) via tender resonant X-ray scattering (TReXS) at the sulfur K-edge. We demonstrate for the first time quantitatively that the energy dependence of the scattering peak in the NTB phase follows the energy dependence of the complex refractive indices measured by X-ray absorption. This allows us to identify a new self-assembly mode for specific sets of liquid crystal dimers in the NTB phase. We anticipate that new avenues in the exploration of complex orientational structures both in static as well as in dynamic modes induced by external stimuli will be pursued.

preprint2020arXiv

Stream-Flow Forecasting of Small Rivers Based on LSTM

Stream-flow forecasting for small rivers has always been of great importance, yet comparatively challenging due to the special features of rivers with smaller volume. Artificial Intelligence (AI) methods have been employed in this area for long, but improvement of forecast quality is still on the way. In this paper, we tried to provide a new method to do the forecast using the Long-Short Term Memory (LSTM) deep learning model, which aims in the field of time-series data. Utilizing LSTM, we collected the stream flow data from one hydrologic station in Tunxi, China, and precipitation data from 11 rainfall stations around to forecast the stream flow data from that hydrologic station 6 hours in the future. We evaluated the prediction results using three criteria: root mean square error (RMSE), mean absolute error (MAE), and coefficient of determination (R^2). By comparing LSTM's prediction with predictions of Support Vector Regression (SVR) and Multilayer Perceptions (MLP) models, we showed that LSTM has better performance, achieving RMSE of 82.007, MAE of 27.752, and R^2 of 0.970. We also did extended experiments on LSTM model, discussing influence factors of its performance.

preprint2019arXiv

Visual cryptography in single-pixel imaging

Two novel visual cryptography (VC) schemes are proposed by combining VC with single-pixel imaging (SPI) for the first time. It is pointed out that the overlapping of visual key images in VC is similar to the superposition of pixel intensities by a single-pixel detector in SPI. In the first scheme, QR-code VC is designed by using opaque sheets instead of transparent sheets. The secret image can be recovered when identical illumination patterns are projected onto multiple visual key images and a single detector is used to record the total light intensities. In the second scheme, the secret image is shared by multiple illumination pattern sequences and it can be recovered when the visual key patterns are projected onto identical items. The application of VC can be extended to more diversified scenarios by our proposed schemes.

preprint2012arXiv

Quantum correlations with vacuum ambiguity in de Sitter space

We study the quantum correlations of free scalar field with vacuum ambiguity of de Sitter space. We show the occurrence of degradation of quantum entanglement and quantum discord between field modes for inertial observer in curved space due to the radiation associated with cosmological horizon. In particular, we find that quantum correlations can be used to encode infinite de Sitter invariant vacua, which correspond to infinite set of possible physical worlds. This may provide a superselection rule of physical vacuum via quantum information tasks. We also discuss the simulation of such quantum effects of vacuum ambiguity in ion trap experiments.

preprint2011arXiv

Scalar products of the open XYZ chain with non-diagonal boundary terms

With the help of the F-basis provided by the Drinfeld twist or factorizing F-matrix of the eight-vertex solid-on-solid (SOS) model, we obtain the determinant representations of the scalar products of Bethe states for the open XYZ chain with non-diagonal boundary terms. By taking the on shell limit, we obtain the determinant representations (or Gaudin formula) of the norms of the Bethe states.