Researcher profile

Mengyuan Li

Mengyuan Li contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 (Emerging) | Verification L1 | Unclaimed author
6 works
0 followers
8 topics
4 close collaborators

Published work

6 published items

Preprint, 2022, arXiv

CROSSLINE: Breaking "Security-by-Crash" based Memory Isolation in AMD SEV

AMD's Secure Encrypted Virtualization (SEV) is an emerging security feature on AMD processors that allows virtual machines to run on encrypted memory and perform confidential computing even with an untrusted hypervisor. This paper first demystifies SEV's improper use of the address space identifier (ASID) for controlling a VM's accesses to encrypted memory pages, cache lines, and TLB entries. We then present the CROSSLINE attacks, a novel class of attacks against SEV that allow the adversary to launch an attacker VM and change its ASID to that of the victim VM to impersonate the victim. We present two variants of CROSSLINE attacks: CROSSLINE V1 decrypts the victim's page tables or memory blocks that follow the format of a page table entry; CROSSLINE V2 constructs encryption and decryption oracles by executing instructions of the victim VM. We have successfully performed CROSSLINE attacks on SEV and SEV-ES processors.

Preprint, 2022, arXiv

Deep Multi-Scale U-Net Architecture and Label-Noise Robust Training Strategies for Histopathological Image Segmentation

Although the U-Net architecture has been extensively used for segmentation of medical images, we address two of its shortcomings in this work. Firstly, the accuracy of vanilla U-Net degrades when the target regions for segmentation exhibit significant variations in shape and size. Even though the U-Net already possesses some capability to analyze features at various scales, we propose to explicitly add multi-scale feature maps in each convolutional module of the U-Net encoder to improve segmentation of histology images. Secondly, the accuracy of a U-Net model also suffers when the annotations for supervised learning are noisy or incomplete. This can happen because it is inherently difficult for a human expert to identify and delineate all instances of a specific pathology precisely and accurately. We address this challenge by introducing auxiliary confidence maps that place less emphasis on the boundaries of the given target regions. Further, we utilize the bootstrapping properties of the deep network to address the missing annotation problem intelligently. In our experiments on a private dataset of breast cancer lymph nodes, where the primary task was to segment germinal centres and sinus histiocytosis, we observed substantial improvement over a U-Net baseline based on the two proposed augmentations.

Preprint, 2022, arXiv

Semi-overlap Functions and Novel Fuzzy Reasoning Algorithms with Applications

It is worth noting that a fuzzy conjunction and its corresponding fuzzy implication form a residual pair if and only if the conjunction is left-continuous. To obtain a more general result on residual implications induced by aggregation functions, we relax the definition of general overlap functions, more precisely by removing the right-continuity requirement, and introduce a new kind of aggregation function, called semi-overlap functions. Subsequently, we study some of their algebraic properties and investigate their corresponding residual implications. Moreover, several scholars have proposed methods for fuzzy modus ponens (FMP) problems, such as Zadeh's compositional rule of inference (CRI), Wang's triple I method (TIM), and the quintuple implication principle (QIP). Compared with the CRI and TIM methods, the QIP method has some advantages in solving FMP problems. In this paper, we further consider the QIP method for FMP problems and prove that it satisfies the reducibility of multiple-rules fuzzy reasoning. Finally, we propose a new classification algorithm based on semi-overlap functions and the QIP method, called the SO5I-FRC algorithm. In comparative tests, the average accuracy of the SO5I-FRC algorithm is higher than that of the FARC-HD algorithm. The experimental results indicate that semi-overlap functions and the QIP method have certain advantages and a wide range of applications in classification problems.
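The residual pair mentioned in this abstract can be illustrated with a small numerical sketch. It assumes the standard definition of the residual implication of a conjunction C, R(x, y) = sup{z in [0, 1] : C(x, z) <= y}, and approximates the supremum on a uniform grid. The function names and the two conjunctions used here (product and minimum) are illustrative stand-ins chosen for this sketch; they are not the semi-overlap functions constructed in the paper.

```python
def residual_implication(conj, x, y, steps=1000):
    """Approximate R(x, y) = sup{z in [0, 1] : conj(x, z) <= y} on a uniform grid.

    `conj` is any binary function on [0, 1]; `steps` controls grid resolution.
    Both names are hypothetical and used for illustration only.
    """
    best = 0.0
    for k in range(steps + 1):
        z = k / steps
        if conj(x, z) <= y:
            best = z  # grid is increasing, so the last hit is the largest one
    return best


def product(x, z):
    # Product conjunction; its residuum is the Goguen implication.
    return x * z


def minimum(x, z):
    # Minimum conjunction; its residuum is the Goedel implication.
    return min(x, z)


if __name__ == "__main__":
    print(residual_implication(product, 0.5, 0.25))
    print(residual_implication(minimum, 0.8, 0.3))
    print(residual_implication(product, 0.2, 0.7))
```

For the product conjunction, the sketch should approach y / x when x > y and return 1 when x <= y, which matches the closed form of the Goguen implication up to the grid resolution.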

Preprint, 2020, arXiv

Deep Learning Based FDD Non-Stationary Massive MIMO Downlink Channel Reconstruction

This paper proposes a model-driven deep learning-based downlink channel reconstruction scheme for frequency division duplexing (FDD) massive multi-input multi-output (MIMO) systems. The spatial non-stationarity, which is the key feature of future extremely large aperture massive MIMO systems, is considered. Instead of the channel matrix, the channel model parameters are learned by neural networks to reduce overhead and improve the accuracy of channel reconstruction. By viewing the channel as an image, we introduce You Only Look Once (YOLO), a powerful neural network for object detection, to enable rapid estimation of the model parameters, including the detection of angles and delays of the paths and the identification of visibility regions of the scatterers. The deep learning-based scheme avoids the complicated iterative process introduced by algorithm-based parameter extraction methods. A low-complexity algorithm-based refiner further refines the YOLO estimates toward high accuracy. Given the efficiency of model-driven deep learning and the combination of neural network and algorithm, the proposed scheme can rapidly and accurately reconstruct the non-stationary downlink channel. Moreover, the proposed scheme is also applicable to widely studied stationary systems and achieves reconstruction accuracy comparable to that of an algorithm-based method with greatly reduced time consumption.

Preprint, 2020, arXiv

Guiding Variational Response Generator to Exploit Persona

Leveraging persona information of users in Neural Response Generators (NRG) to perform personalized conversations has been considered an attractive and important topic in the research of conversational agents over the past few years. Despite the promising progress achieved by recent studies in this field, persona information tends to be incorporated into neural networks in the form of user embeddings, with the expectation that the persona can be involved via end-to-end learning. This paper proposes to incorporate the personality-related characteristics of human conversations into variational response generators, by designing a specific conditional variational autoencoder based deep model with two new regularization terms added to the loss function, so as to guide the optimization towards generating both persona-aware and relevant responses. Besides, to reasonably evaluate the performance of various persona modeling approaches, this paper further presents three direct persona-oriented metrics from different perspectives. The experimental results show that our proposed methodology can notably improve the performance of persona-aware response generation, and that the metrics are reasonable for evaluating the results.

Preprint, 2020, arXiv

Towards Non-task-specific Distillation of BERT via Sentence Representation Approximation

Recently, BERT has become an essential ingredient of various NLP deep models due to its effectiveness and universal-usability. However, the online deployment of BERT is often blocked by its large number of parameters and high computational cost. Plenty of studies show that knowledge distillation is efficient in transferring the knowledge from BERT into a model with fewer parameters. Nevertheless, current BERT distillation approaches mainly focus on task-specific distillation; such methodologies lead to the loss of the general semantic knowledge of BERT needed for universal-usability. In this paper, we propose a sentence representation approximating oriented distillation framework that can distill the pre-trained BERT into a simple LSTM based model without specifying tasks. Consistent with BERT, our distilled model is able to perform transfer learning via fine-tuning to adapt to any sentence-level downstream task. Besides, our model can further cooperate with task-specific distillation procedures. The experimental results on multiple NLP tasks from the GLUE benchmark show that our approach outperforms other task-specific distillation methods or even much larger models, i.e., ELMO, with substantially improved efficiency.
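The idea of approximating sentence representations can be sketched as a loss that pulls a student's sentence vector toward the teacher's vector for the same sentence. The sketch below assumes a mean squared error objective and uses plain Python lists as toy vectors. The abstract does not state the exact objective, so the loss choice, the function names, and the example values are all assumptions made for illustration rather than the paper's method.

```python
def mse(u, v):
    """Mean squared error between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    return sum((a - b) ** 2 for a, b in zip(u, v)) / len(u)


def approximation_loss(teacher_vecs, student_vecs):
    """Average per-sentence MSE between teacher and student sentence vectors.

    teacher_vecs: one vector per sentence from the large (teacher) model.
    student_vecs: one vector per sentence from the small (student) model.
    Both arguments are hypothetical inputs for this sketch.
    """
    if len(teacher_vecs) != len(student_vecs):
        raise ValueError("batches must contain the same number of sentences")
    per_sentence = [mse(t, s) for t, s in zip(teacher_vecs, student_vecs)]
    return sum(per_sentence) / len(per_sentence)


if __name__ == "__main__":
    # Toy values only; real sentence vectors would come from the two models.
    teacher = [[1.0, 0.0], [0.0, 2.0]]
    student = [[1.0, 0.0], [0.0, 0.0]]
    print(approximation_loss(teacher, student))
```

Because such a loss depends only on unlabeled sentences and the teacher's outputs, it needs no downstream task labels, which is consistent with the non-task-specific setting the abstract describes.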