Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
21works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

21 published item(s)

preprint2026arXiv

Learning Generalizable Multimodal Representations for Software Vulnerability Detection

Source code and its accompanying comments are complementary yet naturally aligned modalities-code encodes structural logic while comments capture developer intent. However, existing vulnerability detection methods mostly rely on single-modality code representations, overlooking the complementary semantic information embedded in comments and thus limiting their generalization across complex code structures and logical relationships. To address this, we propose MultiVul, a multimodal contrastive framework that aligns code and comment representations through dual similarity learning and consistency regularization, augmented with diverse code-text pairs to improve robustness. Experiments on widely adopted DiverseVul and Devign datasets across four large language models (LLMs) (i.e., DeepSeek-Coder-6.7B, Qwen2.5-Coder-7B, StarCoder2-7B, and CodeLlama-7B) show that MultiVul achieves up to 27.07% F1 improvement over prompting-based methods and 13.37% over code-only Fine-Tuning, while maintaining comparable inference efficiency.

preprint2023arXiv

LaF: Labeling-Free Model Selection for Automated Deep Neural Network Reusing

Applying deep learning to science is a new trend in recent years which leads DL engineering to become an important problem. Although training data preparation, model architecture design, and model training are the normal processes to build DL models, all of them are complex and costly. Therefore, reusing the open-sourced pre-trained model is a practical way to bypass this hurdle for developers. Given a specific task, developers can collect massive pre-trained deep neural networks from public sources for re-using. However, testing the performance (e.g., accuracy and robustness) of multiple DNNs and recommending which model should be used is challenging regarding the scarcity of labeled data and the demand for domain expertise. In this paper, we propose a labeling-free (LaF) model selection approach to overcome the limitations of labeling efforts for automated model reusing. The main idea is to statistically learn a Bayesian model to infer the models' specialty only based on predicted labels. We evaluate LaF using 9 benchmark datasets including image, text, and source code, and 165 DNNs, considering both the accuracy and robustness of models. The experimental results demonstrate that LaF outperforms the baseline methods by up to 0.74 and 0.53 on Spearman's correlation and Kendall's $τ$, respectively.

preprint2023arXiv

MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation

Inspired by the great success of Deep Neural Networks (DNNs) in natural language processing (NLP), DNNs have been increasingly applied in source code analysis and attracted significant attention from the software engineering community. Due to its data-driven nature, a DNN model requires massive and high-quality labeled training data to achieve expert-level performance. Collecting such data is often not hard, but the labeling process is notoriously laborious. The task of DNN-based code analysis even worsens the situation because source code labeling also demands sophisticated expertise. Data augmentation has been a popular approach to supplement training data in domains such as computer vision and NLP. However, existing data augmentation approaches in code analysis adopt simple methods, such as data transformation and adversarial example generation, thus bringing limited performance superiority. In this paper, we propose a data augmentation approach MIXCODE that aims to effectively supplement valid training data, inspired by the recent advance named Mixup in computer vision. Specifically, we first utilize multiple code refactoring methods to generate transformed code that holds consistent labels with the original data. Then, we adapt the Mixup technique to mix the original code with the transformed code to augment the training data. We evaluate MIXCODE on two programming languages (Java and Python), two code tasks (problem classification and bug detection), four benchmark datasets (JAVA250, Python800, CodRep1, and Refactory), and seven model architectures (including two pre-trained models CodeBERT and GraphCodeBERT). Experimental results demonstrate that MIXCODE outperforms the baseline data augmentation approach by up to 6.24% in accuracy and 26.06% in robustness.

preprint2022arXiv

A Magnetic Flux Rope Configuration Derived by Optimization of Two-Spacecraft In-situ Measurements

Increasingly one interplanetary coronal mass ejection (ICME) structure can propagate across more than one spacecraft in the solar wind. This usually happens when two or more spacecraft are nearly radially aligned with a relatively small longitudinal separation angle from one another. This provides multi-point measurements of the same structure and enables better characterization and validation of modeling results of the structures embedded in these ICMEs. We report such an event during October 13-14, 2019 when the Solar TErrestrial RElations Observatory Ahead (STA) spacecraft and the Parker Solar Probe (PSP) crossed one ICME structure at two different locations with nominal separations in both heliocentric distances and the longitudinal angles. We first perform an optimal fitting to the STA in-situ measurements, based on an analytic quasi-three dimensional (3D) model, yielding a minimum reduced $χ^2=0.468$. Then we further apply the optimization approach by combining the magnetic field measurements from both spacecraft along their separate paths across the ICME structure. We find that the output based on the optimization (with the minimum reduced $χ^2=3.15$) of the combined two-spacecraft dataset yields a more consistent result, given the much improved agreement of the model output with PSP data. The result demonstrates a magnetic flux rope configuration with clear 3D spatial variations.

preprint2022arXiv

An efficient distributed scheduling algorithm for relay-assisted mmWave backhaul networks

In this paper, a novel distributed scheduling algorithm is proposed, which aims to efficiently schedule both the uplink and downlink backhaul traffic in the relay-assisted mmWave backhaul network with a tree topology. The handshaking of control messages, calculation of local schedules, and the determination of final valid schedule are all discussed. Simulation results show that the performance of the distributed algorithm can reach very close to the maximum traffic demand of the backhaul network, and it can also adapt to the dynamic traffic with sharp traffic demand change of small-cell BSs quickly and accurately.

preprint2022arXiv

Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment

Deep Neural Networks (DNNs) have gained considerable attention in the past decades due to their astounding performance in different applications, such as natural language modeling, self-driving assistance, and source code understanding. With rapid exploration, more and more complex DNN architectures have been proposed along with huge pre-trained model parameters. The common way to use such DNN models in user-friendly devices (e.g., mobile phones) is to perform model compression before deployment. However, recent research has demonstrated that model compression, e.g., model quantization, yields accuracy degradation as well as outputs disagreements when tested on unseen data. Since the unseen data always include distribution shifts and often appear in the wild, the quality and reliability of quantized models are not ensured. In this paper, we conduct a comprehensive study to characterize and help users understand the behaviors of quantized models. Our study considers 4 datasets spanning from image to text, 8 DNN architectures including feed-forward neural networks and recurrent neural networks, and 42 shifted sets with both synthetic and natural distribution shifts. The results reveal that 1) data with distribution shifts happen more disagreements than without. 2) Quantization-aware training can produce more stable models than standard, adversarial, and Mixup training. 3) Disagreements often have closer top-1 and top-2 output probabilities, and $Margin$ is a better indicator than the other uncertainty metrics to distinguish disagreements. 4) Retraining with disagreements has limited efficiency in removing disagreements. We opensource our code and models as a new benchmark for further studying the quantized models.

preprint2022arXiv

Compton scattering for photon and gluon in fixed-target collisions at AFTER@LHC

We calculate the Compton scattering for photon and gluon with the Klein-Nishina formula in fixed-target collisions by using the proton and lead beams at AFTER@LHC. In these collisions, we can investigate the particular case of Compton scattering at the partonic level, such as $γq\rightarrow qγ$, $γq\rightarrow qg$, $gq\rightarrow qγ$, and $gq\rightarrow qg$, that can help to check of the equivalent-photon approximation and understand the dynamics of hadron collisions at high energies, as well as probe the inner hadron structure.

preprint2022arXiv

GraphCode2Vec: Generic Code Embedding via Lexical and Program Dependence Analyses

Code embedding is a keystone in the application of machine learning on several Software Engineering (SE) tasks. To effectively support a plethora of SE tasks, the embedding needs to capture program syntax and semantics in a way that is generic. To this end, we propose the first self-supervised pre-training approach (called GraphCode2Vec) which produces task-agnostic embedding of lexical and program dependence features. GraphCode2Vec achieves this via a synergistic combination of code analysis and Graph Neural Networks. GraphCode2Vec is generic, it allows pre-training, and it is applicable to several SE downstream tasks. We evaluate the effectiveness of GraphCode2Vec on four (4) tasks (method name prediction, solution classification, mutation testing and overfitted patch classification), and compare it with four (4) similarly generic code embedding baselines (Code2Seq, Code2Vec, CodeBERT, GraphCodeBERT) and 7 task-specific, learning-based methods. In particular, GraphCode2Vec is more effective than both generic and task-specific learning-based baselines. It is also complementary and comparable to GraphCodeBERT (a larger and more complex model). We also demonstrate through a probing and ablation study that GraphCode2Vec learns lexical and program dependence features and that self-supervised pre-training improves effectiveness.

preprint2022arXiv

Quantitative Characterization of Magnetic Flux Rope Properties for Two Solar Eruption Events

In order to bridge the gap between heliospheric and solar observations of coronal mass ejections (CMEs), one of the key steps is to improve the understanding of their corresponding magnetic structures like the magnetic flux ropes (MFRs). But it remains a challenge to confirm the existence of a coherent MFR before or upon the CME eruption on the Sun and to quantitatively characterize the CME-MFR due to the lack of direct magnetic field measurement in the corona. In this study, we investigate the MFR structures, originating from two active regions (ARs), AR 11719 and AR 12158, and estimate their magnetic properties quantitatively. We perform the nonlinear force-free field extrapolations with preprocessed photospheric vector magnetograms. In addition, remote-sensing observations are employed to find indirect evidence of MFRs on the Sun and to analyze the time evolution of magnetic reconnection flux associated with the flare ribbons during the eruption. A coherent "pre-existing" MFR structure prior to the flare eruption is identified quantitatively for one event from the combined analysis of the extrapolation and observation. Then the characteristics of MFRs for two events on the Sun before and during the eruption, forming the CME-MFR, including the axial magnetic flux, field-line twist, and reconnection flux, are estimated and compared with the corresponding in situ modeling results. We find that the magnetic reconnection associated with the accompanying flares for both events injects significant amount of flux into the erupted CME-MFRs.

preprint2022arXiv

Towards the Maximum Traffic Demand and Throughput Supported by Relay-Assisted mmWave Backhaul Networks

This paper investigates the throughput performance issue of the relay-assisted mmWave backhaul network. The maximum traffic demand of small-cell base stations (BSs) and the maximum throughput at the macro-cell BS have been found in a tree-style backhaul network through linear programming under different network settings, which concern both the number of radio chains available on BSs and the interference relationship between logical links in the backhaul network. A novel interference model for the relay-assisted mmWave backhaul network in the dense urban environment is proposed, which demonstrates the limited interference footprint of mmWave directional communications. Moreover, a scheduling algorithm is developed to find the optimal scheduling for tree-style mmWave backhaul networks. Extensive numerical analysis and simulations are conducted to show and validate the network throughput performance and the scheduling algorithm.

preprint2022arXiv

Validation and interpretation of three-dimensional configuration of a magnetic cloud flux rope

One "strong" magnetic cloud (MC) with the magnetic field magnitude reaching $\sim$ 40 nT at 1 au during 2012 June 16-17 is examined in association with a pre-existing magnetic flux rope (MFR) identified on the Sun. The MC is characterized by a quasi-three dimensional (3D) flux rope model based on in situ measurements from the Wind spacecraft. The magnetic flux contents and other parameters are quantified. In addition, a correlative study with the corresponding measurements of the same structure crossed by the Venus Express (VEX) spacecraft at a heliocentric distance 0.7 au and with an angular separation $\sim 6^\circ$ in longitude is performed to validate the MC modeling results. The spatial variation between the Wind and VEX magnetic field measurements is attributed to the 3D configuration of the structure as featured by a knotted bundle of flux. The comparison of the magnetic flux contents between the MC and the source region on the Sun indicates that the 3D reconnection process accompanying an M1.9 flare may correspond to the magnetic reconnection between the field lines of the pre-existing MFR rooted in the opposite polarity footpoints. Such a process reduces the amount of the axial magnetic flux in the erupted flux rope, by approximately 50\%, in this case.

preprint2022arXiv

WCL-BBCD: A Contrastive Learning and Knowledge Graph Approach to Named Entity Recognition

Named Entity Recognition task is one of the core tasks of information extraction. Word ambiguity and word abbreviation are important reasons for the low recognition rate of named entities. In this paper, we propose a novel named entity recognition model WCL-BBCD (Word Contrastive Learning with BERT-BiLSTM-CRF-DBpedia), which incorporates the idea of contrastive learning. The model first trains the sentence pairs in the text, calculate similarity between sentence pairs, and fine-tunes BERT used for the named entity recognition task according to the similarity, so as to alleviate word ambiguity. Then, the fine-tuned BERT is combined with BiLSTM-CRF to perform the named entity recognition task. Finally, the recognition results are corrected in combination with prior knowledge such as knowledge graphs, so as to alleviate the low-recognition-rate problem caused by word abbreviations. The results of experimentals conducted on the CoNLL-2003 English dataset and OntoNotes V5 English dataset show that our model outperforms other similar models on.

preprint2021arXiv

Comparison of the Hall Magnetohydrodynamics and Magnetohydrodynamics evolution of a flaring solar active region

This work analyzes the Hall magnetohydrodynamics (HMHD) and magnetohydrodynamics (MHD) numerical simulations of a flaring solar active region as a testbed while idealizing the coronal Alfvén speed to be of two orders of magnitude lesser. HMHD supports faster magnetic reconnection and shows richer complexity in magnetic field line evolution compared to the MHD. The magnetic reconnections triggering the flare are explored by numerical simulations augmented with relevant multi-wavelength observations. The initial coronal magnetic field is constructed by non-force-free extrapolation of photospheric vector magnetic field. Magnetic structure involved in the flare is identified to be a flux rope, with its overlying magnetic field lines constituting the quasi-separatrix layers (QSLs) along with a three-dimensional null point and a null line. Compared to the MHD simulation, the HMHD simulation shows a higher and faster ascend of the rope together with the overlying field lines, which further reconnect at the QSL located higher up in the corona. The foot points of the field lines match better with the observations for the HMHD case with the central part of the flare ribbon located at the chromosphere. Additionally, field lines are found to rotate in a circular pattern in the HMHD, whereas no such rotation is seen in the MHD results. Interestingly, plasma is also observed to be rotating in a co-spatial chromospheric region, which makes the HMHD simulation more credible. Based on the aforementioned agreements, HMHD simulation is found to agree better with observations and, thus, opens up a novel avenue to explore.

preprint2021arXiv

Quantum state engineering using weak measurement

State preparation via postselected weak measurement in three wave mixing process is studied. Assuming the signal input mode prepared in a vacuum state, coherent state or squeezed vacuum state, separately, while the idler input prepared in weak coherent state and passing the medium characterized by the second-order nonlinear susceptibility. It is shown that when the single photon is detected at one of the output channels of idler beam's path, the signal output channel is prepared in single-photon Fock state, single-photon-added coherent state or single-photon-added squeezed vacuum state with very high fidelity, depending upon the input signal states and related controllable parameters. The properties including squeezing, signal amplification, second order correlation and Wigner functions of the weak measurement based output states are also investigated. Our scheme promising to provide alternate new effective method for producing useful nonclassical states in quantum information processing.

preprint2021arXiv

Small-scale magnetic flux ropes and their properties based on in-situ measurements from Parker Solar Probe

We report small-scale magnetic flux ropes via the Parker Solar Probe in situ measurements during the first six encounters and present additional analyses to supplement our prior work in Chen et al. 2021. These flux ropes are detected by the Grad-Shafranov-based algorithm with the duration and scale size ranging from 10 seconds to $\lesssim$1 hour and from a few hundred kilometers to 10$^{-3}$ au, respectively. They include both static structures and those with significant field-aligned plasma flows. Most structures tend to possess large cross helicity, while the residual energy distributes in wide ranges. We find that these dynamic flux ropes mostly propagate anti-sunward, with no preferential sign of magnetic helicity. The magnetic flux function follows a power law and is proportional to scale size. We also present case studies showing reconstructed two-dimensional (2D) configurations, which confirm that the static and dynamic flux ropes have the common configuration of spiral magnetic field lines (also streamlines). Moreover, the existence of such events hints at the interchange reconnection as a possible mechanism to generate flux rope-like structures near the Sun. Lastly, we summarize the major findings and discuss the possible correlation between these flux rope-like structures and turbulence due to the process of local Alfvenic alignment.

preprint2020arXiv

Effects of Cowling Resistivity in the Weakly-Ionized Chromosphere

The physics of the solar chromosphere is complex from both theoretical and modeling perspectives. The plasma temperature from the photosphere to corona increases from ~5,000 K to ~1 million K over a distance of only ~10,000 km from the chromosphere and the transition region. Certain regions of the solar atmosphere have sufficiently low temperature and ionization rates to be considered as weakly-ionized. In particular, this is true at the lower chromosphere. As a result, the Cowling resistivity is orders of magnitude greater than the Coulomb resistivity. Ohm's law therefore includes anisotropic dissipation. To evaluate the Cowling resistivity, we need to know the external magnetic field strength and to estimate the neutral fraction as a function of the bulk plasma density and temperature. In this study, we determine the magnetic field topology using the non-force-free field (NFFF) extrapolation technique based on SDO/HMI SHARP vector magnetogram data, and the stratified density and temperature profiles from the Maltby-M umbral core model for sunspots. We investigate the variation and effects of Cowling resistivity on heating and magnetic reconnection in the chromosphere as the flare-producing active region (AR) 11166 evolves. In particular, we analyze a C2.0 flare emerging from AR11166 and find a normalized reconnection rate of 0.051.

preprint2020arXiv

Effects of Radial Distances on Small-scale Magnetic Flux Ropes in the Solar Wind

Small-scale magnetic flux ropes (SFRs), in the solar wind, have been studied for decades. Statistical analysis utilizing various in situ spacecraft measurements is the main observational approach which helps investigate the generation and evolution of these small-scale structures. Based on the Grad-Shafranov (GS) reconstruction technique, we use the automated detection algorithm to build the databases of these small-scale structures via various spacecraft measurements at different heliocentric distances. We present the SFR properties including the magnetic field and plasma parameters at different radial distances from the sun near the ecliptic plane. It is found that the event occurrence rate is still in the order of a few hundreds per month, the duration and scale size distributions follow power laws, and the flux rope axis orientations are approximately centered around the local Parker spiral directions. In general, most SFR properties exhibit radial decays. In addition, with various databases established, we derive scaling laws for the changes of average field magnitude, event counts, and SFR scale sizes, with respect to the radial distances, ranging from $\sim$ 0.3 au for Helios to $\sim$ 7 au for the Voyager spacecraft. The implications of our results for comparisons with the relevant theoretical works and for the application to the Parker Solar Probe (PSP) mission are discussed.

preprint2020arXiv

How Does Magnetic Reconnection Drive the Early Stage Evolution of Coronal Mass Ejections?

Theoretically, CME kinematics are related to magnetic reconnection processes in the solar corona. However, the current quantitative understanding of this relationship is based on the analysis of only a handful of events. Here we report a statistical study of 60 CME-flare events from August 2010 to December 2013. We investigate kinematic properties of CMEs and magnetic reconnection in the low corona during the early phase of the eruptions, by combining limb observations from STEREO with simultaneous on-disk views from SDO. For a subset of 42 events with reconnection rate evaluated by the magnetic fluxes swept by the flare ribbons on the solar disk observed from SDO, we find a strong correlation between the peak CME acceleration and the peak reconnection rate. Also, the maximum velocities of relatively fast CMEs (> 600 km/s) are positively correlated with the reconnection flux, but no such correlation is found for slow CMEs. A time-lagged correlation analysis suggests that the distribution of the time lag of CME acceleration relative to reconnection rate exhibits three peaks, approximately 10 minutes apart, and on average, acceleration-lead events have smaller reconnection rates. We further compare the CME total mechanical energy with the estimated energy in the current sheet. The comparison suggests that, for small-flare events, reconnection in the current sheet alone is insufficient to fuel CMEs. Results from this study suggest that flare reconnection may dominate the acceleration of fast CMEs, but for events of slow CMEs and weak reconnection, other mechanisms may be more important.

preprint2020arXiv

LGNN: A Context-aware Line Segment Detector

We present a novel real-time line segment detection scheme called Line Graph Neural Network (LGNN). Existing approaches require a computationally expensive verification or postprocessing step. Our LGNN employs a deep convolutional neural network (DCNN) for proposing line segment directly, with a graph neural network (GNN) module for reasoning their connectivities. Specifically, LGNN exploits a new quadruplet representation for each line segment where the GNN module takes the predicted candidates as vertexes and constructs a sparse graph to enforce structural context. Compared with the state-of-the-art, LGNN achieves near real-time performance without compromising accuracy. LGNN further enables time-sensitive 3D applications. When a 3D point cloud is accessible, we present a multi-modal line segment classification technique for extracting a 3D wireframe of the environment robustly and efficiently.

preprint2020arXiv

Towards Characterizing Adversarial Defects of Deep Learning Software from the Lens of Uncertainty

Over the past decade, deep learning (DL) has been successfully applied to many industrial domain-specific tasks. However, the current state-of-the-art DL software still suffers from quality issues, which raises great concern especially in the context of safety- and security-critical scenarios. Adversarial examples (AEs) represent a typical and important type of defects needed to be urgently addressed, on which a DL software makes incorrect decisions. Such defects occur through either intentional attack or physical-world noise perceived by input sensors, potentially hindering further industry deployment. The intrinsic uncertainty nature of deep learning decisions can be a fundamental reason for its incorrect behavior. Although some testing, adversarial attack and defense techniques have been recently proposed, it still lacks a systematic study to uncover the relationship between AEs and DL uncertainty. In this paper, we conduct a large-scale study towards bridging this gap. We first investigate the capability of multiple uncertainty metrics in differentiating benign examples (BEs) and AEs, which enables to characterize the uncertainty patterns of input data. Then, we identify and categorize the uncertainty patterns of BEs and AEs, and find that while BEs and AEs generated by existing methods do follow common uncertainty patterns, some other uncertainty patterns are largely missed. Based on this, we propose an automated testing technique to generate multiple types of uncommon AEs and BEs that are largely missed by existing techniques. Our further evaluation reveals that the uncommon data generated by our method is hard to be defended by the existing defense techniques with the average defense success rate reduced by 35\%. Our results call for attention and necessity to generate more diverse data for evaluating quality assurance solutions of DL software.

preprint2019arXiv

Quantifying the Toroidal Flux of Pre-existing Flux Ropes of CMEs

In past decades, much progress has been achieved on the origin and evolution of coronal mass ejections (CMEs). In-situ observations of the counterparts of CMEs, especially magnetic clouds (MCs) near the Earth, have provided measurements of the structure and total flux of CME flux ropes. However, it has been difficult to measure these properties in the erupting CME flux rope, in particular in the pre-existing flux rope. In this work, we propose a model to estimate the toroidal flux of the pre-existing flux rope by subtracting the flux contributed by magnetic reconnection during the eruption from the flux measured in the MC. The flux by the reconnection is derived from geometric properties of two-ribbon flares based on a quasi-2D reconnection model. We then apply the model to four CME/flare events and find that the ratio of toroidal flux in the pre-existing flux rope to that of the associated MC lies in the range of 0.40--0.88. It indicates that the toroidal flux of the pre-existing flux rope has an important contribution to that of the CME flux rope and is usually at least as large as the flux arising from the eruption process for the selected events.