Source author record

Zhenyu Liu

Zhenyu Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Information Theory math.IT Machine Learning eess.SP Artificial Intelligence Computation and Language Computational Engineering, Finance, and Science Computer Vision math.OC physics.comp-ph physics.flu-dyn eess.AS math.NA Numerical Analysis Quantitative Methods Sound

Catalog footprint

What is connected

15works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

VulTriage: Triple-Path Context Augmentation for LLM-Based Vulnerability Detection

Automated vulnerability detection is a fundamental task in software security, yet existing learning-based methods still struggle to capture the structural dependencies, domain-specific vulnerability knowledge, and complex program semantics required for accurate detection. Recent Large Language Models (LLMs) have shown strong code understanding ability, but directly prompting them with raw source code often leads to missed vulnerabilities or false alarms, especially when vulnerable and benign functions differ only in subtle semantic details. To address this, we propose VulTriage, a triple-path context augmentation framework for LLM-based vulnerability detection. VulTriage enhances the LLM input through three complementary paths: a Control Path that extracts and verbalizes AST, CFG, and DFG information to expose control and data dependencies; a Knowledge Path that retrieves relevant CWE-derived vulnerability patterns and examples through hybrid dense--sparse retrieval; and a Semantic Path that summarizes the functional behavior of the code before the final judgment. These contexts are integrated into a unified instruction to guide the LLM toward more reliable vulnerability reasoning. Experiments on the PrimeVul pair test set show that VulTriage achieves state-of-the-art performance, outperforming existing deep learning and LLM-based baselines on key pair-wise and classification metrics. Further ablation studies verify the effectiveness of each path, and additional experiments on the Kotlin dataset demonstrate the generalization ability of VulTriage under low-resource and class-imbalanced settings. Our code is available at https://github.com/vinsontang1/VulTriage

preprint2022arXiv

Age of Information-based Scheduling for Wireless D2D Systems with a Deep Learning Approach

Device-to-device (D2D) links scheduling for avoiding excessive interference is critical to the success of wireless D2D communications. Most of the traditional scheduling schemes only consider the maximum throughput or fairness of the system and do not consider the freshness of information. In this paper, we propose a novel D2D links scheduling scheme to optimize an age of information (AoI) and throughput jointly scheduling problem when D2D links transmit packets under the last-come-first-serve policy with packet-replacement (LCFS-PR). It is motivated by the fact that the maximum throughput scheduling may reduce the activation probability of links with poor channel conditions, which results in terrible AoI performance. Specifically, We derive the expression of the overall average AoI and throughput of the network under the spatio-temporal interfering queue dynamics with the mean-field assumption. Moreover, a neural network structure is proposed to learn the mapping from the geographic location to the optimal scheduling parameters under a stationary randomized policy, where the scheduling decision can be made without estimating the channel state information(CSI) after the neural network is well-trained. To overcome the problem that implicit loss functions cannot be back-propagated, we derive a numerical solution of the gradient. Finally, numerical results reveal that the performance of the deep learning approach is close to that of a local optimal algorithm which has a higher computational complexity. The trade-off curve of AoI and throughput is also obtained, where the AoI tends to infinity when throughput is maximized.

preprint2022arXiv

Fiber bundle topology optimization for surface flows

This paper presents a topology optimization approach for the surface flows on variable design domains. Via this approach, the matching between the pattern of a surface flow and the 2-manifold used to define the pattern can be optimized, where the 2-manifold is implicitly defined on another fixed 2-manifold named as the base manifold. The fiber bundle topology optimization approach is developed based on the description of the topological structure of the surface flow by using the differential geometry concept of the fiber bundle. The material distribution method is used to achieve the evolution of the pattern of the surface flow. The evolution of the implicit 2-manifold is realized via a homeomorphous map. The design variable of the pattern of the surface flow and that of the implicit 2-manifold are regularized by two sequentially implemented surface-PDE filters. The two surface-PDE filters are coupled, because they are defined on the implicit 2-manifold and base manifold, respectively. The surface Navier-Stokes equations, defined on the implicit 2-manifold, are used to describe the surface flow. The fiber bundle topology optimization problem is analyzed using the continuous adjoint method implemented on the first-order Sobolev space. Several numerical examples have been provided to demonstrate this approach, where the combination of the viscous dissipation and pressure drop is used as the design objective.

preprint2022arXiv

Information Systems Dynamics: Foundations and Applications

This article firstly reviews and summarizes the rapid development of information technology, characterized by the close combination of computer and network communication, which leads to a series of investigations, including the analyses of the important role of a series of technological achievements in the context of information movement and application, the interrelationship between the real-world, information space and information system, and the integrated framework of the real-world and information system, and the modifications and improvements of the Xu's previous mathematical theory on information models, properties and metrics. Based on the mathematical foundations, eleven types of information measure efficacies and their distribution across information systems are put forward, and then the dynamics configurations of information systems are comprehensively analyzed, which constitutes the basic theoretical framework of information systems dynamics with general significance. Finally, Smart Court SoSs (System of Systems) Engineering Project of China are introduced as the exemplified application of the theoretical work, which aims at providing a reference for the analysis, design, development and evaluation of large-scale complex information systems.

preprint2022arXiv

Propagation Path Loss Models in Forest Scenario at 605 MHz

When signals propagate through forest areas, they will be affected by environmental factors such as vegetation. Different types of environments have different influences on signal attenuation. This paper analyzes the existing classical propagation path loss models and the model with excess loss caused by forest areas and then proposes a new short-range wireless channel propagation model, which can be applied to different types of forest environments. We conducted continuous-wave measurements at a center frequency of 605 MHz on predetermined routes in distinct types of forest areas and recorded the reference signal received power. Then, we use various path loss models to fit the measured data based on different vegetation types and distributions. Simulation results show that the proposed model has substantially smaller fitting errors with reasonable computational complexity, as compared with representative traditional counterparts.

preprint2022arXiv

Task-Balanced Distillation for Object Detection

Mainstream object detectors are commonly constituted of two sub-tasks, including classification and regression tasks, implemented by two parallel heads. This classic design paradigm inevitably leads to inconsistent spatial distributions between classification score and localization quality (IOU). Therefore, this paper alleviates this misalignment in the view of knowledge distillation. First, we observe that the massive teacher achieves a higher proportion of harmonious predictions than the lightweight student. Based on this intriguing observation, a novel Harmony Score (HS) is devised to estimate the alignment of classification and regression qualities. HS models the relationship between two sub-tasks and is seen as prior knowledge to promote harmonious predictions for the student. Second, this spatial misalignment will result in inharmonious region selection when distilling features. To alleviate this problem, a novel Task-decoupled Feature Distillation (TFD) is proposed by flexibly balancing the contributions of classification and regression tasks. Eventually, HD and TFD constitute the proposed method, named Task-Balanced Distillation (TBD). Extensive experiments demonstrate the considerable potential and generalization of the proposed method. Specifically, when equipped with TBD, RetinaNet with ResNet-50 achieves 41.0 mAP under the COCO benchmark, outperforming the recent FGD and FRS.

preprint2022arXiv

Training Enhancement of Deep Learning Models for Massive MIMO CSI Feedback with Small Datasets

Accurate downlink channel state information (CSI) is vital to achieving high spectrum efficiency in massive MIMO systems. Existing works on the deep learning (DL) model for CSI feedback have shown efficient compression and recovery in frequency division duplex (FDD) systems. However, practical DL networks require sizeable wireless CSI datasets during training to achieve high model accuracy. To address this labor-intensive problem, this work develops an efficient training enhancement solution of DL-based feedback architecture based on a modest dataset by exploiting the complex CSI features, and augmenting CSI dataset based on domain knowledge. We first propose a spherical CSI feedback network, SPTM2-ISTANet+, which employs the spherical normalization framework to mitigate the effect of path loss variation. We exploit the trainable measurement matrix and residual recovery structure to improve the encoding efficiency and recovery accuracy. For limited CSI measurements, we propose a model-driven lightweight and universal augmentation strategy based on decoupling CSI magnitude and phase information, applying the circular shift in angular-delay domain, and randomizing the CSI phase to approximate phase distribution. Test results demonstrate the efficacy and efficiency of the proposed training strategy and feedback architecture for accurate CSI feedback under limited measurements.

preprint2020arXiv

A Markovian Model-Driven Deep Learning Framework for Massive MIMO CSI Feedback

Forward channel state information (CSI) often plays a vital role in scheduling and capacity-approaching transmission optimization for massive multiple-input multiple-output (MIMO) communication systems. In frequency division duplex (FDD) massive MIMO systems, forwardlink CSI reconstruction at the transmitter relies critically on CSI feedback from receiving nodes and must carefully weigh the tradeoff between reconstruction accuracy and feedback bandwidth. Recent studies on the use of recurrent neural networks (RNNs) have demonstrated strong promises, though the cost of computation and memory remains high, for massive MIMO deployment. In this work, we exploit channel coherence in time to substantially improve the feedback efficiency. Using a Markovian model, we develop a deep convolutional neural network (CNN)-based framework MarkovNet to differentially encode forward CSI in time to effectively improve reconstruction accuracy. Furthermore, we explore important physical insights, including spherical normalization of input data and convolutional layers for feedback compression. We demonstrate substantial performance improvement and complexity reduction over the RNN-based work by our proposed MarkovNet to recover forward CSI estimates accurately. We explore additional practical consideration in feedback quantization, and show that MarkovNet outperforms RNN-based CSI estimation networks at a fraction of the computational cost.

preprint2020arXiv

A Novel Decision Tree for Depression Recognition in Speech

Depression is a common mental disorder worldwide which causes a range of serious outcomes. The diagnosis of depression relies on patient-reported scales and psychiatrist interview which may lead to subjective bias. In recent years, more and more researchers are devoted to depression recognition in speech , which may be an effective and objective indicator. This study proposes a new speech segment fusion method based on decision tree to improve the depression recognition accuracy and conducts a validation on a sample of 52 subjects (23 depressed patients and 29 healthy controls). The recognition accuracy are 75.8% and 68.5% for male and female respectively on gender-dependent models. It can be concluded from the data that the proposed decision tree model can improve the depression classification performance.

preprint2020arXiv

Human Activity Recognition based on Dynamic Spatio-Temporal Relations

Human activity, which usually consists of several actions, generally covers interactions among persons and or objects. In particular, human actions involve certain spatial and temporal relationships, are the components of more complicated activity, and evolve dynamically over time. Therefore, the description of a single human action and the modeling of the evolution of successive human actions are two major issues in human activity recognition. In this paper, we develop a method for human activity recognition that tackles these two issues. In the proposed method, an activity is divided into several successive actions represented by spatio temporal patterns, and the evolution of these actions are captured by a sequential model. A refined comprehensive spatio temporal graph is utilized to represent a single action, which is a qualitative representation of a human action incorporating both the spatial and temporal relations of the participant objects. Next, a discrete hidden Markov model is applied to model the evolution of action sequences. Moreover, a fully automatic partition method is proposed to divide a long-term human activity video into several human actions based on variational objects and qualitative spatial relations. Finally, a hierarchical decomposition of the human body is introduced to obtain a discriminative representation for a single action. Experimental results on the Cornell Activity Dataset demonstrate the efficiency and effectiveness of the proposed approach, which will enable long videos of human activity to be better recognized.

preprint2020arXiv

Multichannel CNN with Attention for Text Classification

Recent years, the approaches based on neural networks have shown remarkable potential for sentence modeling. There are two main neural network structures: recurrent neural network (RNN) and convolution neural network (CNN). RNN can capture long term dependencies and store the semantics of the previous information in a fixed-sized vector. However, RNN is a biased model and its ability to extract global semantics is restricted by the fixed-sized vector. Alternatively, CNN is able to capture n-gram features of texts by utilizing convolutional filters. But the width of convolutional filters restricts its performance. In order to combine the strengths of the two kinds of networks and alleviate their shortcomings, this paper proposes Attention-based Multichannel Convolutional Neural Network (AMCNN) for text classification. AMCNN utilizes a bi-directional long short-term memory to encode the history and future information of words into high dimensional representations, so that the information of both the front and back of the sentence can be fully expressed. Then the scalar attention and vectorial attention are applied to obtain multichannel representations. The scalar attention can calculate the word-level importance and the vectorial attention can calculate the feature-level importance. In the classification task, AMCNN uses a CNN structure to cpture word relations on the representations generated by the scalar and vectorial attention mechanism instead of calculating the weighted sums. It can effectively extract the n-gram features of the text. The experimental results on the benchmark datasets demonstrate that AMCNN achieves better performance than state-of-the-art methods. In addition, the visualization results verify the semantic richness of multichannel representations.

preprint2020arXiv

Peregrine: Network Localization and Navigation with Scalable Inference and Efficient Operation

Location-aware networks will enable new services and applications in fields such as autonomous driving, smart cities, and the Internet-of-Things. One promising solution for ubiquitous localization is network localization and navigation (NLN), where devices form a network that cooperatively localizes itself, reducing the infrastructure needed for accurate localization. This paper introduces a real-time NLN system named Peregrine, which combines distributed NLN algorithms with commercially available ultra-wideband (UWB) sensing and communication technology. The Peregrine software application, for the first time, integrates three NLN algorithms to jointly perform the tasks of localization and network operation in a technology agnostic manner, leveraging both spatial and temporal cooperation. Peregrine hardware is composed of low-cost, compact devices that comprise a microprocessor and a commercial UWB radio. This paper presents the design of the Peregrine system and characterizes the performance impact of each algorithmic component. Indoor experiments validate that our approach to realizing NLN is both reliable and scalable, and maintains sub-meter-level accuracy even in challenging indoor scenarios.

preprint2020arXiv

Text Classification based on Multi-granularity Attention Hybrid Neural Network

Neural network-based approaches have become the driven forces for Natural Language Processing (NLP) tasks. Conventionally, there are two mainstream neural architectures for NLP tasks: the recurrent neural network (RNN) and the convolution neural network (ConvNet). RNNs are good at modeling long-term dependencies over input texts, but preclude parallel computation. ConvNets do not have memory capability and it has to model sequential data as un-ordered features. Therefore, ConvNets fail to learn sequential dependencies over the input texts, but it is able to carry out high-efficient parallel computation. As each neural architecture, such as RNN and ConvNets, has its own pro and con, integration of different architectures is assumed to be able to enrich the semantic representation of texts, thus enhance the performance of NLP tasks. However, few investigation explores the reconciliation of these seemingly incompatible architectures. To address this issue, we propose a hybrid architecture based on a novel hierarchical multi-granularity attention mechanism, named Multi-granularity Attention-based Hybrid Neural Network (MahNN). The attention mechanism is to assign different weights to different parts of the input sequence to increase the computation efficiency and performance of neural models. In MahNN, two types of attentions are introduced: the syntactical attention and the semantical attention. The syntactical attention computes the importance of the syntactic elements (such as words or sentence) at the lower symbolic level and the semantical attention is used to compute the importance of the embedded space dimension corresponding to the upper latent semantics. We adopt the text classification as an exemplifying way to illustrate the ability of MahNN to understand texts.

preprint2020arXiv

Topology optimization of surface flows

This paper presents a topology optimization approach for surface flows, which can represent the viscous and incompressible fluidic motions at the solid/liquid and liquid/vapor interfaces. The fluidic motions on such material interfaces can be described by the surface Navier-Stokes equations defined on 2-manifolds or two-dimensional manifolds, where the elementary tangential calculus is implemented in terms of exterior differential operators expressed in a Cartesian system. Based on the topology optimization model for fluidic flows with porous medium filling the design domain, an artificial Darcy friction is added to the area force term of the surface Navier-Stokes equations and the physical area forces are penalized to eliminate their existence in the fluidic regions and to avoid the invalidity of the porous medium model. Topology optimization for steady and unsteady surface flows can be implemented by iteratively evolving the impermeability of the porous medium on the 2-manifolds, where the impermeability is interpolated by the material density derived from a design variable. The related partial differential equations are solved by using the surface finite element method. Numerical examples have been provided to demonstrate this topology optimization approach for surface flows, including the boundary velocity driven flows, area force driven flows and convection-diffusion flows.

preprint2019arXiv

Topology optimization on two-dimensional manifolds

This paper implements topology optimization on two-dimensional manifolds. In this paper, the material interpolation is implemented on a material parameter in the partial differential equation used to describe a physical field, when this physical field is defined on a two-dimensional manifold; the material density is used to formulate a mixed boundary condition of the physical field and implement the penalization between two different types of boundary conditions, when this physical field is defined on a three-dimensional domain with its boundary conditions defined on the two-dimensional manifold corresponding a surface or an interface of this three-dimensional domain. Based on the homeomorphic property of two-dimensional manifolds, typical two-dimensional manifolds, e.g., sphere, torus, Möbius strip and Klein bottle, are included in the numerical tests, which are provided for the problems on fluidic mechanics, heat transfer and electromagnetics.

Zhenyu Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

VulTriage: Triple-Path Context Augmentation for LLM-Based Vulnerability Detection

Age of Information-based Scheduling for Wireless D2D Systems with a Deep Learning Approach

Fiber bundle topology optimization for surface flows

Information Systems Dynamics: Foundations and Applications

Propagation Path Loss Models in Forest Scenario at 605 MHz

Task-Balanced Distillation for Object Detection

Training Enhancement of Deep Learning Models for Massive MIMO CSI Feedback with Small Datasets

A Markovian Model-Driven Deep Learning Framework for Massive MIMO CSI Feedback

A Novel Decision Tree for Depression Recognition in Speech

Human Activity Recognition based on Dynamic Spatio-Temporal Relations

Multichannel CNN with Attention for Text Classification

Peregrine: Network Localization and Navigation with Scalable Inference and Efficient Operation

Text Classification based on Multi-granularity Attention Hybrid Neural Network

Topology optimization of surface flows

Topology optimization on two-dimensional manifolds