Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
60works
0followers
46topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

60 published item(s)

preprint2026arXiv

A Unified Spoken Language Model with Injected Emotional-Attribution Thinking for Human-like Interaction

This paper presents a unified spoken language model for emotional intelligence, enhanced by a novel data construction strategy termed Injected Emotional-Attribution Thinking (IEAT). IEAT incorporates user emotional states and their underlying causes into the model's internal reasoning process, enabling emotion-aware reasoning to be internalized rather than treated as explicit supervision. The model is trained with a two-stage progressive strategy. The first stage performs speech-text alignment and emotional attribute modeling via self-distillation, while the second stage conducts end-to-end cross-modal joint optimization to ensure consistency between textual and spoken emotional expressions. Experiments on the Human-like Spoken Dialogue Systems Challenge (HumDial) Emotional Intelligence benchmark demonstrate that the proposed approach achieves top-ranked performance across emotional trajectory modeling, emotional reasoning, and empathetic response generation under both LLM-based and human evaluations.

preprint2026arXiv

Impact of Nuclear Reaction Rates on Calcium Production in Population III Stars: A Global Analysis

We investigate the sensitivity of calcium production to nuclear reaction rates of a 40 solar-mass Population III star using 1D multi-zone stellar models. A comprehensive nuclear reaction network was constructed, and all $(p,γ)$ and $(p,α)$ reaction rates were individually varied by a factor of 10 up and down, identifying 13 preliminary key reactions for calcium production. To propagate the reaction rate uncertainties on calcium production, two sets of Monte Carlo simulations were performed for these key reactions: one adopting STARLIB reaction rates and the other incorporating updated rates from recent experimental data and evaluations. Our results show that Monte Carlo simulations using the updated rates show good agreement with the observed calcium abundance of the extremely iron-poor star SMSS J031300.36-670839.3 within the 68% confidence interval predicted by the models. In contrast, the observed calcium abundance lies marginally outside the 68% C.I. when using the STARLIB rates. Spearman rank-order correlation analysis and SHAP values show that the $(p,γ)$ and $(p,α)$ reactions of F18 and F19 exhibit strong coupled effects on calcium production. These reaction-rate uncertainties need to be reduced to constrain the stellar model predictions. Our study provides insights for future nuclear physics experiments aimed at reducing reaction rate uncertainties in the nucleosynthesis of Population III Stars. Additionally, comparisons between 20 solar-mass and 40 solar-mass Population III stellar models confirm that the latter, with updated reaction rates, is more capable of reproducing the observed Ca abundance and [Ca/Mg] ratio.

preprint2026arXiv

OrchJail: Jailbreaking Tool-Calling Text-to-Image Agents by Orchestration-Guided Fuzzing

Tool-calling text-to-image (T2I) agents can plan and execute multi-step tool chains to accomplish complex generation and editing queries. However, this capability introduces a new safety attack surface: harmful outputs may arise from tool orchestration, where individually benign steps combine into unsafe results, making prompt-only jailbreak techniques insufficient. We present OrchJail, an orchestration-guided fuzzing framework for jailbreaking tool-calling T2I agents. Its core idea is to exploit high-risk tool-orchestration patterns: by learning from successful jailbreak tool-calling traces and their causal relationships to prompt wording, OrchJail directly guides the fuzzing search toward prompts that are more likely to trigger unsafe multi-step tool behaviors, rather than relying on surface-level textual perturbations. Extensive experiments demonstrate that OrchJail improves jailbreak effectiveness and efficiency across representative toolcalling T2I agents, achieving higher attack success rates, better image fidelity, and lower query costs, while remaining robust against common jailbreak defenses. Our work highlights tool orchestration as a critical, previously unexplored attack surface and provides a novel framework for uncovering safety risks in T2I agents.

preprint2026arXiv

Safety Anchor: Defending Harmful Fine-tuning via Geometric Bottlenecks

The safety alignment of Large Language Models (LLMs) remains vulnerable to Harmful Fine-tuning (HFT). While existing defenses impose constraints on parameters, gradients, or internal representations, we observe that they can be effectively circumvented under persistent HFT. Our analysis traces this failure to the inherent redundancy of the high-dimensional parameter space: attackers exploit optimization trajectories that are orthogonal to defense constraints to restore harmful capabilities while deceptively adhering to safety restrictions. To address this, we propose Safety Bottleneck Regularization (SBR). SBR shifts the defensive focus from the redundant parameter space to the unembedding layer, which serves as a geometric bottleneck. By anchoring the final hidden states of harmful queries to those of the safety-aligned model, SBR enables the model to maintain safe responses even under persistent HFT. Extensive experiments confirm SBR's effectiveness, demonstrating that utilizing just a single safety anchor is sufficient to reduce the Harmful Score to $<$10 while preserving competitive performance on benign downstream tasks.

preprint2026arXiv

Split CNN Inference on Networked Microcontrollers

Running deep neural networks on microcontroller units (MCUs) is severely constrained by limited memory resources. While TinyML techniques reduce model size and computation, they often fail in practice due to excessive peak Random Access Memory (RAM) usage during inference, dominated by intermediate activations. As a result, many models remain infeasible on standalone MCUs. In this work, we present a fine-grained split inference system for networked MCUs that enables collaborative inference of Convolutional Neural Networks (CNN) models across multiple devices. Our key insight is that breaking the memory bottleneck requires splitting inference at sub-layer granularity rather than at layer boundaries. We reinterpret pre-trained models to enable kernel-wise and neuron-wise partitioning, and distribute both model parameters and intermediate activations across multiple MCUs. A lightweight, resource-aware coordinator orchestrates the inference across MCU devices with heterogeneous resources. We implement the proposed system on a real testbed and evaluate it on up to 8 MCUs using MobileNetV2, a representative CNN model. Our experimental results show that CNN models infeasible on a single MCU can be executed across networked MCUs, reducing the per-MCU peak RAM usage while maintaining the practical end-to-end inference latency. All the source code of this work can be found here: https://github.com/shashsuresh/split-inference-on-MCUs.

preprint2026arXiv

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Spoken language models (SLMs) have advanced rapidly in recent years, accompanied by a growing number of evaluation benchmarks. However, most existing benchmarks emphasize task completion and capability scaling, while remaining poorly aligned with how users interact with SLMs in real-world spoken conversations. Effective spoken interaction requires not only accurate understanding of user intent and content, but also the ability to respond with appropriate interactional strategies. In this paper, we present TELEVAL, a dynamic, user-centered benchmark for evaluating SLMs in realistic Chinese spoken interaction scenarios. TELEVAL consolidates evaluation into two core aspects. Reliable Content Fulfillment assesses whether models can comprehend spoken inputs and produce semantically correct responses. Interactional Appropriateness evaluates whether models act as socially capable interlocutors, requiring them not only to generate human-like, colloquial responses, but also to implicitly incorporate paralinguistic cues for natural interaction. Experiments reveal that, despite strong performance on semantic and knowledge-oriented tasks, current SLMs still struggle to produce natural and interactionally appropriate responses, highlighting the need for more interaction-faithful evaluation.

preprint2026arXiv

Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval

This paper addresses the challenges of learning representations for recipes and food images in the cross-modal retrieval problem. As the relationship between a recipe and its cooked dish is cause-and-effect, treating a recipe as a text source describing the visual appearance of a dish for learning representation, as the existing approaches, will create bias misleading image-and-recipe similarity judgment. Specifically, a food image may not equally capture every detail in a recipe, due to factors such as the cooking process, dish presentation, and image-capturing conditions. The current representation learning tends to capture dominant visual-text alignment while overlooking subtle variations that determine retrieval relevance. In this paper, we model such bias in cross-modal representation learning using causal theory. The causal view of this problem suggests ingredients as one of the confounder sources and a simple backdoor adjustment can alleviate the bias. By causal intervention, we reformulate the conventional model for food-to-recipe retrieval with an additional term to remove the potential bias in similarity judgment. Based on this theory-informed formulation, we empirically prove the oracle performance of retrieval on the Recipe1M dataset to be MedR=1 across the testing data sizes of 1K, 10K, and even 50K. We also propose a plug-and-play neural module, which is essentially a multi-label ingredient classifier for debiasing. New state-of-the-art search performances are reported on the Recipe1M dataset.

preprint2023arXiv

Numerical simulation of the radiation force from transient acoustic fields: Application to laser-guided acoustic tweezers

Using pulsed acoustic waves could provide a superior selectivity for microscale acoustic tweezers. However, the theory for the radiation force of pulsed acoustic waves has only been recently derived and no numerical implementations are available. In this paper, we present a finite-element implementation of this model to simulate the transient acoustic radiation force on small spheres. We use the model to simulate laser-guided acoustic tweezers and optimize their performance. By enabling numerical simulations of the transient radiation force, this work may accelerate the rational design of pulse-based high-selectivity acoustic tweezers devices.

preprint2023arXiv

Significance of Strain Rate in Severe Plastic Deformation on Steady-State Microstructure and Strength

The microstructure and mechanical properties of materials saturate to steady states after severe plastic deformation (SPD). Despite the well-known effect of temperature on the steady-state microstructure, there is no general agreement on the significance of strain rate and the applicability of the Zener-Hollomon parameter in this regard. In this study, several pure metals (aluminum, copper, titanium, and iron) and a Cu-30Zn (wt%) brass alloy have been processed by a high-speed high-pressure torsion (HPT) equipment with controllable rotation speeds in the range of 0.06 to 60 rpm. It is found that crystallite/grain size, dislocation density, microhardness and shear stress at the steady state are reasonably rate-independent for the von Mises strain rates in the range of 0.004 to 20 s-1. Because both rates of grain refinement and of dynamic recrystallization are proportional to the strain rate, it is suggested that their balance, which determines the steady state, is rate-independent.

preprint2022arXiv

A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy Detection

In this paper, we introduce VCSL (Video Copy Segment Localization), a new comprehensive segment-level annotated video copy dataset. Compared with existing copy detection datasets restricted by either video-level annotation or small-scale, VCSL not only has two orders of magnitude more segment-level labelled data, with 160k realistic video copy pairs containing more than 280k localized copied segment pairs, but also covers a variety of video categories and a wide range of video duration. All the copied segments inside each collected video pair are manually extracted and accompanied by precisely annotated starting and ending timestamps. Alongside the dataset, we also propose a novel evaluation protocol that better measures the prediction accuracy of copy overlapping segments between a video pair and shows improved adaptability in different scenarios. By benchmarking several baseline and state-of-the-art segment-level video copy detection methods with the proposed dataset and evaluation metric, we provide a comprehensive analysis that uncovers the strengths and weaknesses of current approaches, hoping to open up promising directions for future works. The VCSL dataset, metric and benchmark codes are all publicly available at https://github.com/alipay/VCSL.

preprint2022arXiv

A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification

We propose a novel neural model compression strategy combining data augmentation, knowledge transfer, pruning, and quantization for device-robust acoustic scene classification (ASC). Specifically, we tackle the ASC task in a low-resource environment leveraging a recently proposed advanced neural network pruning mechanism, namely Lottery Ticket Hypothesis (LTH), to find a sub-network neural model associated with a small amount non-zero model parameters. The effectiveness of LTH for low-complexity acoustic modeling is assessed by investigating various data augmentation and compression schemes, and we report an efficient joint framework for low-complexity multi-device ASC, called \emph{Acoustic Lottery}. Acoustic Lottery could compress an ASC model up to $1/10^{4}$ and attain a superior performance (validation accuracy of 79.4% and Log loss of 0.64) compared to its not compressed seed model. All results reported in this work are based on a joint effort of four groups, namely GT-USTC-UKE-Tencent, aiming to address the &#34;Low-Complexity Acoustic Scene Classification (ASC) with Multiple Devices&#34; in the DCASE 2021 Challenge Task 1a.

preprint2022arXiv

A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification

In this paper, we propose two techniques, namely joint modeling and data augmentation, to improve system performances for audio-visual scene classification (AVSC). We employ pre-trained networks trained only on image data sets to extract video embedding; whereas for audio embedding models, we decide to train them from scratch. We explore different neural network architectures for joint modeling to effectively combine the video and audio modalities. Moreover, data augmentation strategies are investigated to increase audio-visual training set size. For the video modality the effectiveness of several operations in RandAugment is verified. An audio-video joint mixup scheme is proposed to further improve AVSC performances. Evaluated on the development set of TAU Urban Audio Visual Scenes 2021, our final system can achieve the best accuracy of 94.2% among all single AVSC systems submitted to DCASE 2021 Task 1b.

preprint2022arXiv

A TensorFlow Simulation Framework for Scientific Computing of Fluid Flows on Tensor Processing Units

A computational fluid dynamics (CFD) simulation framework for fluid-flow prediction is developed on the Tensor Processing Unit (TPU) platform. The TPU architecture is featured with accelerated dense matrix multiplication, large high bandwidth memory, and a fast inter-chip interconnect, making it attractive for high-performance scientific computing. The CFD framework solves the variable-density Navier-Stokes equation using a low-Mach approximation, and the governing equations are discretized by a finite-difference method on a collocated structured mesh. It uses the graph-based TensorFlow as the programming paradigm. The accuracy and performance of this framework is studied both numerically and analytically, specifically focusing on effects of TPU-native single precision floating point arithmetic. The algorithm and implementation are validated with canonical 2D and 3D Taylor-Green vortex simulations. To demonstrate the capability for simulating turbulent flows, simulations are conducted for two configurations, namely decaying homogeneous isotropic turbulence and a turbulent planar jet. Both simulations show good statistical agreement with reference solutions. The performance analysis shows a linear weak scaling and a superlinear strong scaling up to a full TPU v3 pod with 2048 cores.

preprint2022arXiv

A unified construction of vertex algebras from infinite-dimensional Lie algebras

In this paper, we give a unified construction of vertex algebras arising from infinite-dimensional Lie algebras, including the affine Kac-Moody algebras, Virasoro algebras, Heisenberg algebras and their higher rank analogs, orbifolds and deformations. We define a notion of what we call quasi vertex Lie algebra to unify these Lie algebras. Starting from any (maximal) quasi vertex Lie algebra $\mathfrak{g}$, we construct a corresponding vertex Lie algebra ${\mathfrak{g}}_0$, and establish a canonical isomorphism between the category of restricted $\mathfrak{g}$-modules and that of equivariant $ϕ$-coordinated quasi $V_{\mathfrak{g}_0}$-modules, where $V_{\mathfrak{g}_0}$ is the universal enveloping vertex algebra of ${\mathfrak{g}}_0$. This unified all the previous constructions of vertex algebras from infinite-dimensional Lie algebras and shed light on the way to associate vertex algebras with Lie algebras.

preprint2022arXiv

Automatic Comment Generation via Multi-Pass Deliberation

Deliberation is a common and natural behavior in human daily life. For example, when writing papers or articles, we usually first write drafts, and then iteratively polish them until satisfied. In light of such a human cognitive process, we propose DECOM, which is a multi-pass deliberation framework for automatic comment generation. DECOM consists of multiple Deliberation Models and one Evaluation Model. Given a code snippet, we first extract keywords from the code and retrieve a similar code fragment from a pre-defined corpus. Then, we treat the comment of the retrieved code as the initial draft and input it with the code and keywords into DECOM to start the iterative deliberation process. At each deliberation, the deliberation model polishes the draft and generates a new comment. The evaluation model measures the quality of the newly generated comment to determine whether to end the iterative process or not. When the iterative process is terminated, the best-generated comment will be selected as the target comment. Our approach is evaluated on two real-world datasets in Java (87K) and Python (108K), and experiment results show that our approach outperforms the state-of-the-art baselines. A human evaluation study also confirms the comments generated by DECOM tend to be more readable, informative, and useful.

preprint2022arXiv

BatchHL: Answering Distance Queries on Batch-Dynamic Networks at Scale

Many real-world applications operate on dynamic graphs that undergo rapid changes in their topological structure over time. However, it is challenging to design dynamic algorithms that are capable of supporting such graph changes efficiently. To circumvent the challenge, we propose a batch-dynamic framework for answering distance queries, which combines offline labelling and online searching to leverage the advantages from both sides - accelerating query processing through a partial distance labelling that is of limited size but provides a good approximation to bound online searches. We devise batch-dynamic algorithms to dynamize a distance labelling efficiently in order to reflect batch updates on the underlying graph. In addition to providing theoretical analysis for the correctness, labelling minimality, and computational complexity, we have conducted experiments on 14 real-world networks to empirically verify the efficiency and scalability of the proposed algorithms.

preprint2022arXiv

BugListener: Identifying and Synthesizing Bug Reports from Collaborative Live Chats

In community-based software development, developers frequently rely on live-chatting to discuss emergent bugs/errors they encounter in daily development tasks. However, it remains a challenging task to accurately record such knowledge due to the noisy nature of interleaved dialogs in live chat data. In this paper, we first formulate the task of identifying and synthesizing bug reports from community live chats, and propose a novel approach, named BugListener, to address the challenges. Specifically, BugListener automates three sub-tasks: 1) Disentangle the dialogs from massive chat logs by using a Feed-Forward neural network; 2) Identify the bug-report dialogs from separated dialogs by modeling the original dialog to the graph-structured dialog and leveraging the graph neural network to learn the contextual information; 3) Synthesize the bug reports by utilizing the TextCNN model and Transfer Learning network to classify the sentences into three groups: observed behaviors (OB), expected behaviors (EB), and steps to reproduce the bug (SR). BugListener is evaluated on six open source projects. The results show that: for bug report identification, BugListener achieves the average F1 of 74.21%, improving the best baseline by 10.37%; and for bug report synthesis task, BugListener could classify the OB, EB, and SR sentences with the F1 of 67.37%, 87.14%, and 65.03%, improving the best baselines by 7.21%, 7.38%, 5.30%, respectively. A human evaluation also confirms the effectiveness of BugListener in generating relevant and accurate bug reports. These demonstrate the significant potential of applying BugListener in community-based software development, for promoting bug discovery and quality improvement.

preprint2022arXiv

CardioID: Mitigating the Effects of Irregular Cardiac Signals for Biometric Identification

Cardiac patterns are being used to obtain hard-to-forge biometric signatures and have led to high accuracy in state-of-the-art (SoA) identification applications. However, this performance is obtained under controlled scenarios where cardiac signals maintain a relatively uniform pattern, facilitating the identification process. In this work, we analyze cardiac signals collected in more realistic (uncontrolled) scenarios and show that their high signal variability (i.e., irregularity) makes it harder to obtain stable and distinct user features. Furthermore, SoA usually fails to identify specific groups of users, rendering existing identification methods futile in uncontrolled scenarios. To solve these problems, we propose a framework with three novel properties. First, we design an adaptive method that achieves stable and distinct features by tailoring the filtering spectrum to each user. Second, we show that users can have multiple cardiac morphologies, offering us a much bigger pool of cardiac signals and users compared to SoA. Third, we overcome other distortion effects present in authentication applications with a multi-cluster approach and the Mahalanobis distance. Our evaluation shows that the average balanced accuracy (BAC) of SoA drops from above 90% in controlled scenarios to 75% in uncontrolled ones, while our method maintains an average BAC above 90% in uncontrolled scenarios.

preprint2022arXiv

Epipolar Focus Spectrum: A Novel Light Field Representation and Application in Dense-view Reconstruction

Existing light field representations, such as epipolar plane image (EPI) and sub-aperture images, do not consider the structural characteristics across the views, so they usually require additional disparity and spatial structure cues for follow-up tasks. Besides, they have difficulties dealing with occlusions or larger disparity scenes. To this end, this paper proposes a novel Epipolar Focus Spectrum (EFS) representation by rearranging the EPI spectrum. Different from the classical EPI representation where an EPI line corresponds to a specific depth, there is a one-to-one mapping from the EFS line to the view. Accordingly, compared to a sparsely-sampled light field, a densely-sampled one with the same field of view (FoV) leads to a more compact distribution of such linear structures in the double-cone-shaped region with the identical opening angle in its corresponding EFS. Hence the EFS representation is invariant to the scene depth. To demonstrate its effectiveness, we develop a trainable EFS-based pipeline for light field reconstruction, where a dense light field can be reconstructed by compensating the &#34;missing EFS lines&#34; given a sparse light field, yielding promising results with cross-view consistency, especially in the presence of severe occlusion and large disparity. Experimental results on both synthetic and real-world datasets demonstrate the validity and superiority of the proposed method over SOTA methods.

preprint2022arXiv

Guided Bug Crush: Assist Manual GUI Testing of Android Apps via Hint Moves

Mobile apps are indispensable for people&#39;s daily life. Complementing with automated GUI testing, manual testing is the last line of defence for app quality. However, the repeated actions and easily missing of functionalities make manual testing time-consuming and inefficient. Inspired by the game candy crush with flashy candies as hint moves for players, we propose an approach named NaviDroid for navigating testers via highlighted next operations for more effective and efficient testing. Within NaviDroid, we construct an enriched state transition graph with the triggering actions as the edges for two involved states. Based on it, we utilize the dynamic programming algorithm to plan the exploration path, and augment the GUI with visualized hints for testers to quickly explore untested activities and avoid duplicate explorations. The automated experiments demonstrate the high coverage and efficient path planning of NaviDroid and a user study further confirms its usefulness. The NaviDroid can help us develop more robust software that works in more mission-critical settings, not only by performing more thorough testing with the same effort that has been put in before, but also by integrating these techniques into different parts of development pipeline.

preprint2022arXiv

Identifying Emergent Leadership in OSS Projects Based on Communication Styles

In open source software (OSS) communities, existing leadership indicators are dominantly measured by code contribution or community influence. Recent studies on emergent leadership shed light on additional dimensions such as intellectual stimulation in collaborative communications. To that end, this paper proposes an automated approach, named iLead, to mine communication styles and identify emergent leadership behaviors in OSS communities, using issue comments data. We start with the construction of 6 categories of leadership behaviors based on existing leadership studies. Then, we manually label leadership behaviors in 10,000 issue comments from 10 OSS projects, and extract 304 heuristic linguistic patterns which represent different types of emergent leadership behaviors in flexible and concise manners. Next, an automated algorithm is developed to merge and consolidate different pattern sets extracted from multiple projects into a final pattern ranking list, which can be applied for the automatic leadership identification. The evaluation results show that iLead can achieve a median precision of 0.82 and recall of 0.78, outperforming ten machine/deep learning baselines. To demonstrate practical usefulness, we also conduct empirical analysis and human evaluation of the identified leadership behaviors from iLead. We argue that emergent leadership behaviors in issue discussion should be taken into consideration to broaden existing OSS leadership viewpoints. Practical insights on community building and leadership skill development are offered for OSS community and individual developers, respectively.

preprint2022arXiv

Knowledge Tracing: A Survey

Humans ability to transfer knowledge through teaching is one of the essential aspects for human intelligence. A human teacher can track the knowledge of students to customize the teaching on students needs. With the rise of online education platforms, there is a similar need for machines to track the knowledge of students and tailor their learning experience. This is known as the Knowledge Tracing (KT) problem in the literature. Effectively solving the KT problem would unlock the potential of computer-aided education applications such as intelligent tutoring systems, curriculum learning, and learning materials&#39; recommendation. Moreover, from a more general viewpoint, a student may represent any kind of intelligent agents including both human and artificial agents. Thus, the potential of KT can be extended to any machine teaching application scenarios which seek for customizing the learning experience for a student agent (i.e., a machine learning model). In this paper, we provide a comprehensive and systematic review for the KT literature. We cover a broad range of methods starting from the early attempts to the recent state-of-the-art methods using deep learning, while highlighting the theoretical aspects of models and the characteristics of benchmark datasets. Besides these, we shed light on key modelling differences between closely related methods and summarize them in an easy-to-understand format. Finally, we discuss current research gaps in the KT literature and possible future research and application directions.

preprint2022arXiv

Laser-guided acoustic tweezers

Acoustic tweezers can manipulate microscopic objects and cells independently of the optical, magnetic and electrical properties of the objects or their medium. However, because ultrasonic waves are attenuated within few millimeters, existing devices must synthesize intricate and powerful acoustic fields in a very narrow footprint immediately close to the manipulated object. Here we show that the design of microscale acoustic tweezers can be considerably simplified by taking advantage of the nonlinear nature of the acoustic trapping force. In our experiment, a featureless piezoelectric crystal coated with a photoacoustic conversion layer is hit by an electric pulse and a spatially modulated laser pulse to generate synchronized electro- and photo- acoustic waves. Interference between these waves creates a hybrid acoustic trapping force 30 times stronger than the laser pulse alone but with the same spatial information. By disentangling the tradeoff between spatial resolution and trapping force that has long held back the development of acoustic tweezers, this field hybridization strategy opens new avenues for cell manipulation in organ on chip and organ printing.

preprint2022arXiv

Machine Learning assisted excess noise suppression for continuous-variable quantum key distribution

Excess noise is a major obstacle to high-performance continuous-variable quantum key distribution (CVQKD), which is mainly derived from the amplitude attenuation and phase fluctuation of quantum signals caused by channel instability. Here, an excess noise suppression scheme based on equalization is proposed. In this scheme, the distorted signals can be corrected through equalization assisted by a neural network and pilot tone, relieving the pressure on the post-processing and eliminating the hardware cost. For a free-space channel with more intense fluctuation, a classification algorithm is added to classify the received variables, and then the distinctive equalization correction for different classes is carried out. The experimental results show that the scheme can suppress the excess noise to a lower level, and has a significant performance improvement. Moreover, the scheme also enables the system to cope with strong turbulence. It breaks the bottleneck of long-distance quantum communication and lays a foundation for the large-scale application of CVQKD.

preprint2022arXiv

Manifold Optimization Based Multi-user Rate Maximization Aided by Intelligent Reflecting Surface

In this work, two problems associated with a downlink multi-user system are considered with the aid of intelligent reflecting surface (IRS): weighted sum-rate maximization and weighted minimal-rate maximization. For the first problem, a novel DOuble Manifold ALternating Optimization (DOMALO) algorithm is proposed by exploiting the matrix manifold theory and introducing the beamforming matrix and reflection vector using complex sphere manifold and complex oblique manifold, respectively, which incorporate the inherent geometrical structure and the required constraint. A smooth double manifold alternating optimization (S-DOMALO) algorithm is then developed based on the Dinkelbach-type algorithm and smooth exponential penalty function for the second problem. Finally, possible cooperative beamforming gain between IRSs and the IRS phase shift with limited resolution is studied, providing a reference for practical implementation. Numerical results show that our proposed algorithms can significantly outperform the benchmark schemes.

preprint2022arXiv

NaviDroid: A Tool for Guiding Manual Android Testing via Hint Moves

Manual testing, as a complement to automated GUI testing, is the last line of defense for app quality especially in spotting usability and accessibility issues. However, the repeated actions and easy missing of some functionalities make manual testing time-consuming, labor-extensive and inefficient. Inspired by the game candy crush with flashy candies as hint moves for players, we develop a tool named NaviDroid for navigating human testers via highlighted next operations for more effective and efficient testing. Within NaviDroid, it constructs an enriched state transition graph (STG) with the trigger actions as the edges for two involved states. Based on the STG, NaviDroid utilizes the dynamic programming algorithm to plan the exploration path, and augment the run-time GUI with visualized hint moves for testers to quickly explore untested states and avoid duplication. The automated experiments demonstrate the high coverage and efficient path planning of NaviDroid. A user study further confirms its usefulness in the participants covering more states and activities, detecting more bugs within less time compared with the control group. NaviDroid demo video: https://youtu.be/lShFyg_nTA0.

preprint2022arXiv

Nighthawk: Fully Automated Localizing UI Display Issues via Visual Understanding

Graphical User Interface (GUI) provides a visual bridge between a software application and end users, through which they can interact with each other. With the upgrading of mobile devices and the development of aesthetics, the visual effects of the GUI are more and more attracting, and users pay more attention to the accessibility and usability of applications. However, such GUI complexity posts a great challenge to the GUI implementation. According to our pilot study of crowdtesting bug reports, display issues such as text overlap, component occlusion, missing image always occur during GUI rendering on different devices due to the software or hardware compatibility. They negatively influence the app usability, resulting in poor user experience. To detect these issues, we propose a fully automated approach, Nighthawk, based on deep learning for modelling visual information of the GUI screenshot. Nighthawk can detect GUIs with display issues and also locate the detailed region of the issue in the given GUI for guiding developers to fix the bug. At the same time, training the model needs a large amount of labeled buggy screenshots, which requires considerable manual effort to prepare them. We therefore propose a heuristic-based training data auto-generation method to automatically generate the labeled training data. The evaluation demonstrates that our Nighthawk can achieve average 0.84 precision and 0.84 recall in detecting UI display issues, average 0.59 AP and 0.60 AR in localizing these issues. We also evaluate Nighthawk with popular Android apps on Google Play and F-Droid, and successfully uncover 151 previously-undetected UI display issues with 75 of them being confirmed or fixed so far.

preprint2022arXiv

NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at a fixed bit-rate. Besides, the quality enhancement of Tracks 1 and 3 targets at improving the fidelity (PSNR), and Track 2 targets at enhancing the perceptual quality. The three tracks totally attract 482 registrations. In the test phase, 12 teams, 8 teams and 11 teams submitted the final results of Tracks 1, 2 and 3, respectively. The proposed methods and solutions gauge the state-of-the-art of video quality enhancement. The homepage of the challenge: https://github.com/RenYang-home/NTIRE21_VEnh

preprint2022arXiv

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results

This paper reviews the NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video. In this challenge, we proposed the LDV 2.0 dataset, which includes the LDV dataset (240 videos) and 95 additional videos. This challenge includes three tracks. Track 1 aims at enhancing the videos compressed by HEVC at a fixed QP. Track 2 and Track 3 target both the super-resolution and quality enhancement of HEVC compressed video. They require x2 and x4 super-resolution, respectively. The three tracks totally attract more than 600 registrations. In the test phase, 8 teams, 8 teams and 12 teams submitted the final results to Tracks 1, 2 and 3, respectively. The proposed methods and solutions gauge the state-of-the-art of super-resolution and quality enhancement of compressed video. The proposed LDV 2.0 dataset is available at https://github.com/RenYang-home/LDV_dataset. The homepage of this challenge (including open-sourced codes) is at https://github.com/RenYang-home/NTIRE22_VEnh_SR.

preprint2022arXiv

Stereo Unstructured Magnification: Multiple Homography Image for View Synthesis

This paper studies the problem of view synthesis with certain amount of rotations from a pair of images, what we called stereo unstructured magnification. While the multi-plane image representation is well suited for view synthesis with depth invariant, how to generalize it to unstructured views remains a significant challenge. This is primarily due to the depth-dependency caused by camera frontal parallel representation. Here we propose a novel multiple homography image (MHI) representation, comprising of a set of scene planes with fixed normals and distances. A two-stage network is developed for novel view synthesis. Stage-1 is an MHI reconstruction module that predicts the MHIs and composites layered multi-normal images along the normal direction. Stage-2 is a normal-blending module to find blending weights. We also derive an angle-based cost to guide the blending of multi-normal images by exploiting per-normal geometry. Compared with the state-of-the-art methods, our method achieves superior performance for view synthesis qualitatively and quantitatively, especially for cases when the cameras undergo rotations.

preprint2022arXiv

Tensor Decomposition based Personalized Federated Learning

Federated learning (FL) is a new distributed machine learning framework that can achieve reliably collaborative training without collecting users&#39; private data. However, due to FL&#39;s frequent communication and average aggregation strategy, they experience challenges scaling to statistical diversity data and large-scale models. In this paper, we propose a personalized FL framework, named Tensor Decomposition based Personalized Federated learning (TDPFed), in which we design a novel tensorized local model with tensorized linear layers and convolutional layers to reduce the communication cost. TDPFed uses a bi-level loss function to decouple personalized model optimization from the global model learning by controlling the gap between the personalized model and the tensorized local model. Moreover, an effective distributed learning strategy and two different model aggregation strategies are well designed for the proposed TDPFed framework. Theoretical convergence analysis and thorough experiments demonstrate that our proposed TDPFed framework achieves state-of-the-art performance while reducing the communication cost.

preprint2022arXiv

Trigonometric Lie algebras, affine Kac-Moody Lie algebras, and equivariant quasi modules for vertex algebras

In this paper, we study a family of infinite-dimensional Lie algebras $\widehat{X}_{S}$, where $X$ stands for the type: $A,B,C,D$, and $S$ is an abelian group, which generalize the $A,B,C,D$ series of trigonometric Lie algebras. Among the main results, we identify $\widehat{X}_{S}$ with what are called the covariant algebras of the affine Lie algebra $\widehat{\mathcal{L}_{S}}$ with respect to some automorphism groups, where $\mathcal{L}_{S}$ is an explicitly defined associative algebra viewed as a Lie algebra. We then show that restricted $\widehat{X}_{S}$-modules of level $\ell$ naturally correspond to equivariant quasi modules for affine vertex algebras related to $\mathcal{L}_{S}$. Furthermore, for any finite cyclic group $S$, we completely determine the structures of these four families of Lie algebras, showing that they are essentially affine Kac-Moody Lie algebras of certain types.

preprint2022arXiv

Where is Your App Frustrating Users?

User reviews of mobile apps provide a communication channel for developers to perceive user satisfaction. Many app features that users have problems with are usually expressed by key phrases such as &#34;upload pictures&#34;, which could be buried in the review texts. The lack of fine-grained view about problematic features could obscure the developers&#39; understanding of where the app is frustrating users, and postpone the improvement of the apps. Existing pattern-based approaches to extract target phrases suffer from low accuracy due to insufficient semantic understanding of the reviews, thus can only summarize the high-level topics/aspects of the reviews. This paper proposes a semantic-aware, fine-grained app review analysis approach (SIRA) to extract, cluster, and visualize the problematic features of apps. The main component of SIRA is a novel BERT+Attr-CRF model for fine-grained problematic feature extraction, which combines textual descriptions and review attributes to better model the semantics of reviews and boost the performance of the traditional BERT-CRF model. SIRA also clusters the extracted phrases based on their semantic relations and presents a visualization of the summaries. Our evaluation on 3,426 reviews from six apps confirms the effectiveness of SIRA in problematic feature extraction and clustering. We further conduct an empirical study with SIRA on 318,534 reviews of 18 popular apps to explore its potential application and examine its usefulness in real-world practice.

preprint2021arXiv

A Basis Approach to Surface Clustering

This paper presents a novel method for clustering surfaces. The proposal involves first using basis functions in a tensor product to smooth the data and thus reduce the dimension to a finite number of coefficients, and then using these estimated coefficients to cluster the surfaces via the k-means algorithm. An extension of the algorithm to clustering tensors is also discussed. We show that the proposed algorithm exhibits the property of strong consistency, with or without measurement errors, in correctly clustering the data as the sample size increases. Simulation studies suggest that the proposed method outperforms the benchmark k-means algorithm which uses the original vectorized data. In addition, an EGG real data example is considered to illustrate the practical application of the proposal.

preprint2021arXiv

A Highly Scalable Labelling Approach for Exact Distance Queries in Complex Networks

Answering exact shortest path distance queries is a fundamental task in graph theory. Despite a tremendous amount of research on the subject, there is still no satisfactory solution that can scale to billion-scale complex networks. Labelling-based methods are well-known for rendering fast response time to distance queries; however, existing works can only construct labelling on moderately large networks (million-scale) and cannot scale to large networks (billion-scale) due to their prohibitively large space requirements and very long preprocessing time. In this work, we present novel techniques to efficiently construct distance labelling and process exact shortest path distance queries for complex networks with billions of vertices and billions of edges. Our method is based on two ingredients: (i) a scalable labelling algorithm for constructing minimal distance labelling, and (ii) a querying framework that supports fast distance-bounded search on a sparsified graph. Thus, we first develop a novel labelling algorithm that can scale to graphs at the billion-scale. Then, we formalize a querying framework for exact distance queries, which combines our proposed highway cover distance labelling with distance-bounded searches to enable fast distance computation. To speed up the labelling construction process, we further propose a parallel labelling method that can construct labelling simultaneously for multiple landmarks. We evaluated the performance of the proposed methods on 12 real-world networks. The experiments show that the proposed methods can not only handle networks with billions of vertices, but also be up to 70 times faster in constructing labelling and save up to 90\% of labelling space. In particular, our method can answer distance queries on a billion-scale network of around 8B edges in less than 1ms, on average.

preprint2021arXiv

Deep Anti-aliasing of Whole Focal Stack Using Slice Spectrum

The paper aims at removing the aliasing effects of the whole focal stack generated from a sparse-sampled {4D} light field, while keeping the consistency across all the focal layers. We first explore the structural characteristics embedded in the focal stack slice and its corresponding frequency-domain representation, i.e., the Focal Stack Spectrum (FSS). We observe that the energy distribution of the FSS always resides within the same triangular area under different angular sampling rates, additionally the continuity of the Point Spread Function (PSF) is intrinsically maintained in the FSS. Based on these two observations, we propose a learning-based FSS reconstruction approach for one-time aliasing removing over the whole focal stack. Moreover, a novel conjugate-symmetric loss function is proposed for the optimization. Compared to previous works, our method avoids an explicit depth estimation, and can handle challenging large-disparity scenarios. Experimental results on both synthetic and real light field datasets show the superiority of the proposed approach for different scenes and various angular sampling rates.

preprint2021arXiv

Deep Learning Models for Predicting Wildfires from Historical Remote-Sensing Data

Identifying regions that have high likelihood for wildfires is a key component of land and forestry management and disaster preparedness. We create a data set by aggregating nearly a decade of remote-sensing data and historical fire records to predict wildfires. This prediction problem is framed as three machine learning tasks. Results are compared and analyzed for four different deep learning models to estimate wildfire likelihood. The results demonstrate that deep learning models can successfully identify areas of high fire likelihood using aggregated data about vegetation, weather, and topography with an AUC of 83%.

preprint2021arXiv

Efficient Maintenance of Distance Labelling for Incremental Updates in Large Dynamic Graphs

Finding the shortest path distance between an arbitrary pair of vertices is a fundamental problem in graph theory. A tremendous amount of research has been successfully attempted on this problem, most of which is limited to static graphs. Due to the dynamic nature of real-world networks, there is a pressing need to address this problem for dynamic networks undergoing changes. In this paper, we propose an \emph{online incremental} method to efficiently answer distance queries over very large dynamic graphs. Our proposed method incorporates incremental update operations, i.e. edge and vertex additions, into a highly scalable framework of answering distance queries. We theoretically prove the correctness of our method and the preservation of labelling minimality. We have also conducted extensive experiments on 12 large real-world networks to empirically verify the efficiency, scalability, and robustness of our method.

preprint2021arXiv

FIXME: Enhance Software Reliability with Hybrid Approaches in Cloud

With the promise of reliability in cloud, more enterprises are migrating to cloud. The process of continuous integration/deployment (CICD) in cloud connects developers who need to deliver value faster and more transparently with site reliability engineers (SREs) who need to manage applications reliably. SREs feed back development issues to developers, and developers commit fixes and trigger CICD to redeploy. The release cycle is more continuous than ever, thus the code to production is faster and more automated. To provide this higher level agility, the cloud platforms become more complex in the face of flexibility with deeper layers of virtualization. However, reliability does not come for free with all these complexities. Software engineers and SREs need to deal with wider information spectrum from virtualized layers. Therefore, providing correlated information with true positive evidences is critical to identify the root cause of issues quickly in order to reduce mean time to recover (MTTR), performance metrics for SREs. Similarity, knowledge, or statistics driven approaches have been effective, but with increasing data volume and types, an individual approach is limited to correlate semantic relations of different data sources. In this paper, we introduce FIXME to enhance software reliability with hybrid diagnosis approaches for enterprises. Our evaluation results show using hybrid diagnosis approach is about 17% better in precision. The results are helpful for both practitioners and researchers to develop hybrid diagnosis in the highly dynamic cloud environment.

preprint2021arXiv

Machine learning accelerated computational fluid dynamics

Numerical simulation of fluids plays an essential role in modeling many physical phenomena, such as weather, climate, aerodynamics and plasma physics. Fluids are well described by the Navier-Stokes equations, but solving these equations at scale remains daunting, limited by the computational cost of resolving the smallest spatiotemporal features. This leads to unfavorable trade-offs between accuracy and tractability. Here we use end-to-end deep learning to improve approximations inside computational fluid dynamics for modeling two-dimensional turbulent flows. For both direct numerical simulation of turbulence and large eddy simulation, our results are as accurate as baseline solvers with 8-10x finer resolution in each spatial dimension, resulting in 40-80x fold computational speedups. Our method remains stable during long simulations, and generalizes to forcing functions and Reynolds numbers outside of the flows where it is trained, in contrast to black box machine learning approaches. Our approach exemplifies how scientific computing can leverage machine learning and hardware accelerators to improve simulations without sacrificing accuracy or generalization.

preprint2021arXiv

Quadratic fractional solitons

We introduce a system combining the quadratic self-attractive or composite quadratic-cubic nonlinearity, acting in the combination with the fractional diffraction, which is characterized by its Lévy index $α$. The model applies to a gas of quantum particles moving by Lévy flights, with the quadratic term representing the Lee-Huang-Yang correction to the mean-field interactions. A family of fundamental solitons is constructed in a numerical form, while the dependence of its norm on the chemical potential characteristic is obtained in an exact analytical form. The family of \textit{quasi-Townes solitons}, appearing in the limit case of $α=1/2$, is investigated by means of a variational approximation. A nonlinear lattice, represented by spatially periodical modulation of the quadratic term, is briefly addressed too. The consideration of the interplay of competing quadratic (attractive) and cubic (repulsive) terms with a lattice potential reveals families of single-, double-, and triple-peak gap solitons (GSs) in two finite bandgaps. The competing nonlinearity gives rise to alternating regions of stability and instability of the GS, the stability intervals shrinking with the increase of the number of peaks in the GS.

preprint2021arXiv

Resolution of the paradox of the diamagnetic effect on the Kibble coil

Employing very simple electro-mechanical principles known from classical physics, the Kibble balance establishes a very precise and absolute link between quantum electrical standards and macroscopic mass or force measurements. The success of the Kibble balance, in both determining fundamental constants ($h$, $N_A$, $e$) and realizing a quasi-quantum mass in the 2019 newly revised International System of Units, relies on the perfection of Maxwell&#39;s equations and the symmetry they describe between Lorentz&#39;s force and Faraday&#39;s induction, a principle and a symmetry stunningly demonstrated in the weighing and velocity modes of Kibble balances to within $1\times10^{-8}$, with nothing but imperfect wires and magnets. However, recent advances in the understanding of the current effect in Kibble balances reveal a troubling paradox. A diamagnetic effect, a force that does not cancel between mass-on and mass-off measurement, is challenging balance maker&#39;s assumptions of symmetry at levels that are almost two orders of magnitude larger than the reported uncertainties. The diamagnetic effect, if it exists, shows up in weighing mode without a readily apparent reciprocal effect in the velocity mode, begging questions about systematic errors at the very foundation of the new measurement system. The hypothetical force is caused by the coil current changing the magnetic field, producing an unaccounted force that is systematically modulated with the weighing current. Here we show that this diamagnetic force exists, but the additional force does not change the equivalence between weighing and velocity measurements. We reveal the unexpected way that symmetry is preserved and show that for typical materials and geometries the total relative effect on the measurement is $\approx 1\times10^{-9}$.

preprint2021arXiv

The NPU System for the 2020 Personalized Voice Trigger Challenge

This paper describes the system developed by the NPU team for the 2020 personalized voice trigger challenge. Our submitted system consists of two independently trained subsystems: a small footprint keyword spotting (KWS) system and a speaker verification (SV) system. For the KWS system, a multi-scale dilated temporal convolutional (MDTC) network is proposed to detect wake-up word (WuW). For SV system, Write something here. The KWS predicts posterior probabilities of whether an audio utterance contains WuW and estimates the location of WuW at the same time. When the posterior probability ofWuW reaches a predefined threshold, the identity information of triggered segment is determined by the SV system. On evaluation dataset, our submitted system obtains detection costs of 0.081and 0.091 in close talking and far-field tasks, respectively.

preprint2020arXiv

A Simple Improvement for Permanent Magnet Systems for Kibble Balances: More Flat Field at Almost No Cost

Permanent magnets together with yokes to concentrate the magnetic flux into a cylindrical air-gap are widely employed in Kibble balances. These experiments require a uniform magnetic flux density along a vertical path, typically a substantial fraction of the length of the air-gap. Fringe fields that are present at both ends of the air-gap limit the region where the flux density does not change more than a certain relative fraction (here: $5\times 10^{-4}$) of the flux density in the center of the magnet system. By simply adding an iron ring with a rectangular cross-section to the inner yoke at each end of the air gap, the effects of the fringe fields can be counteracted, and, hence, the length of the region, where the flux density remains within a given tolerance band is increased. Compared to the alternative, employing a taller magnet, the proposed method yields a magnet system with an extended region of a uniform field without significantly increasing the mass of the magnet system. Potential applications include compact and table-top Kibble balances. We investigate possible adverse effects on the performance of the magnet system caused by the additional rings: magnetic field strength, coil-current effect, and a dependence of the radial field on the radial position in the field. No substantial disadvantage was found. Instead, the method presented here outperformed previously suggested methods to improve the radial dependence of the radial field, e.g., shorter outer yoke. In summary, adding rings to the inner yoke improves the uniformity of the field without a detrimental effect to function, cost, and form factor of the magnet system.

preprint2020arXiv

Beyond Symmetries : Anomalies in Transverse Ward--Takahashi Identities

Anomalies in transverse Ward--Takahashi identities are studied, allowing discussion of the feasibility of anomalies arising in general non-symmetry Ward--Takahashi identities. We adopt the popular Fujikawa&#39;s method and rigorous dimensional renormalization to verify the existence of transverse anomalies to one-loop order and any loop order, respectively. The arbitrariness of coefficients of transverse anomalies is revealed, and a way out is also proposed after relating transverse anomalies to Schwinger terms and comparing symmetry and non-symmetry anomalies. Papers that claim the non-existence of transverse anomalies are reviewed to find anomalies hidden in their approaches. The role played by transverse anomalies is discussed.

preprint2020arXiv

Chern-Simons quantum mechanics and fractional angular momentum in atom system

The model of a planar atom which possesses a non-vanishing electric dipole moment interacting with magnetic fields in a specific setting is studied. Energy spectra of this model and its reduced model, which is the limit of cooling down the atom to the negligible kinetic energy, are solved exactly. We show that %similar with the Chern-Simons quantum mechanics, energy spectra of the reduced model can not be obtained directly from the full ones by taking the same limit. In order to match them, we must regularize energy spectra of the full model when the limit of the negligible kinetic energy is taken. It is one of the characteristics of the Chern-Simons quantum mechanics. Besides it, the canonical angular momentum of the reduced model will take fractional values although the full model can only take integers. It means that it is possible to realize the Chern-Simons quantum mechanics and fractional angular momentum simultaneously by this model.

preprint2020arXiv

Chiral effective Lagrangian for heavy-light mesons from QCD

We derive the chiral effective Lagrangian for heavy-light mesons in the heavy quark limit from QCD under proper approximations. The low energy constants in the effective Lagrangian are expressed in terms of the light quark self-energy. With typical forms of the running coupling constant of QCD and the quark self-energy obtained from Dyson-Schwinger equations as well as lattice QCD, we estimate the low energy constants in the model and the strong decay widths. A comparison with data and some discussions of the numerical results are presented.

preprint2020arXiv

DFNets: Spectral CNNs for Graphs with Feedback-Looped Filters

We propose a novel spectral convolutional neural network (CNN) model on graph structured data, namely Distributed Feedback-Looped Networks (DFNets). This model is incorporated with a robust class of spectral graph filters, called feedback-looped filters, to provide better localization on vertices, while still attaining fast convergence and linear memory requirements. Theoretically, feedback-looped filters can guarantee convergence w.r.t. a specified error bound, and be applied universally to any graph without knowing its structure. Furthermore, the propagation rule of this model can diversify features from the preceding layers to produce strong gradient flows. We have evaluated our model using two benchmark tasks: semi-supervised document classification on citation networks and semi-supervised entity classification on a knowledge graph. The experimental results show that our model considerably outperforms the state-of-the-art methods in both benchmark tasks over all datasets.

preprint2020arXiv

Energy Self-Sustainability in Full-Spectrum 6G

Full-spectrum ranging from sub 6 GHz to THz and visible light will be exploited in 6G in order to reach unprecedented key-performance-indicators (KPIs). However, extraordinary amount of energy will be consumed by network infrastructure, while functions of massively deployed Internet of Everything (IoE) devices are limited by embedded batteries. Therefore, energy self-sustainable 6G is proposed in this article. First of all, it may achieve network-wide energy efficiency by exploiting cell-free and airborne access networks as well as by implementing intelligent holographic environments. Secondly, by exploiting radio-frequency/visible light signals for providing on-demand wireless power transfer (WPT) and for enabling passive backscatter communication, ``zero-energy&#39;&#39; devices may become a reality. Furthermore, IoE devices actively adapt their transceivers for better performance to a dynamic environment. This article aims to provide a first glance at primary designing principles of energy self-sustainable 6G.

preprint2020arXiv

Extended affine Lie algebras, vertex algebras, and reductive groups

In this paper, we explore natural connections among the representations of the extended affine Lie algebra $\widehat{sl_N}(\mathbb{C}_q)$ with $\mathbb{C}_q=\mathbb{C}_q[t_0^{\pm1},t_1^{\pm1}]$ an irrational quantum 2-torus, the simple affine vertex algebra $L_{\widehat{sl_{\infty}}}(\ell,0)$ with $\ell$ a positive integer, and Levi subgroups $G$ of $GL_\ell(\mathbb{C})$. First, we give a canonical isomorphism between the category of integrable restricted $\widehat{sl_N}(\mathbb{C}_q)$-modules of level $\ell$ and that of equivariant quasi $L_{\widehat{sl_{\infty}}}(\ell,0)$-modules. Second, we classify irreducible $\mathbb{N}$-graded equivariant quasi $L_{\widehat{sl_{\infty}}}(\ell,0)$-modules. Third, we establish a duality between irreducible $\mathbb{N}$-graded equivariant quasi $L_{\widehat{sl_{\infty}}}(\ell,0)$-modules and irreducible regular $G$-modules on certain fermionic Fock spaces. Fourth, we obtain an explicit realization of every irreducible $\mathbb{N}$-graded equivariant quasi $L_{\widehat{sl_{\infty}}}(\ell,0)$-module. Fifth, we completely determine the following branchings: 1 The branching from $L_{\widehat{sl_{\infty}}}(\ell,0)\otimes L_{\widehat{sl_{\infty}}}(\ell&#39;,0)$ to $L_{\widehat{sl_{\infty}}}(\ell+\ell&#39;,0)$ for quasi modules. 2 The branching from $\widehat{sl_N}(\mathbb{C}_q)$ to its Levi subalgebras. 3 The branching from $\widehat{sl_N}(\mathbb{C}_q)$ to its subalgebras $\widehat{sl_N}(\mathbb{C}_q[t_0^{\pm M_0},t_1^{\pm M_1}])$.

preprint2020arXiv

From $μ_0$ to $e$: A Survey of Major Impacts for Electrical Measurements in Recent SI Revision

A milestone revision of the International System of Units (SI) was made at the 26th General Conference on Weights and Measures that four of the seven SI base units, i.e. kilogram, ampere, kelvin, and mole, are redefined by fundamental physical constants of nature. The SI base unit founding the electrical measurement activities, i.e. ampere, is defined by fixing the numerical value of the elementary charge to $e=1.602\,176\,634\times10^{-19}$C. For electrical measurement, several major adjustments, mostly positive, are involved in this SI revision. In this paper, the main impacts of the new SI for electrical measurement activities are surveyed under the new framework.

preprint2020arXiv

Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition

Speaker recognition is a popular topic in biometric authentication and many deep learning approaches have achieved extraordinary performances. However, it has been shown in both image and speech applications that deep neural networks are vulnerable to adversarial examples. In this study, we aim to exploit this weakness to perform targeted adversarial attacks against the x-vector based speaker recognition system. We propose to generate inaudible adversarial perturbations achieving targeted white-box attacks to speaker recognition system based on the psychoacoustic principle of frequency masking. Specifically, we constrict the perturbation under the masking threshold of original audio, instead of using a common l_p norm to measure the perturbations. Experiments on Aishell-1 corpus show that our approach yields up to 98.5% attack success rate to arbitrary gender speaker targets, while retaining indistinguishable attribute to listeners. Furthermore, we also achieve an effective speaker attack when applying the proposed approach to a completely irrelevant waveform, such as music.

preprint2020arXiv

On parafermion vertex algebras of $\frak{sl}(2)_{-3/2}$ and $\frak{sl}(3)_{-3/2}$

We study parafermion vertex algebras $N_{-3/2}(\frak{sl}(2))$ and $N_{-3/2}(\frak{sl}(3))$. Using the isomorphism between $N_{-3/2}(\frak{sl}(3))$ and the logarithmic vertex algebra $\mathcal{W}^{0} (2)_{A_2} $ from [2], we show that these parafermion vertex algebras are infinite direct sums of irreducible modules for the Zamolodchikov algebra $\mathcal{W}(2,3)$ of central charge $c=-10$, and that $N_{-3/2}(\frak{sl}(3))$ is a direct sum of irreducible $N_{-3/2}(\frak{sl}(2))$-modules. As a byproduct, we prove certain conjectures about the vertex algebra $\mathcal{W}^0(p)_{A_2}$. We also obtain a vertex-algebraic proof of the irreducibility of a family of $\mathcal W(2,3)_{c}$ modules at $c=-10$.

preprint2020arXiv

Owl Eyes: Spotting UI Display Issues via Visual Understanding

Graphical User Interface (GUI) provides a visual bridge between a software application and end users, through which they can interact with each other. With the development of technology and aesthetics, the visual effects of the GUI are more and more attracting. However, such GUI complexity posts a great challenge to the GUI implementation. According to our pilot study of crowdtesting bug reports, display issues such as text overlap, blurred screen, missing image always occur during GUI rendering on different devices due to the software or hardware compatibility. They negatively influence the app usability, resulting in poor user experience. To detect these issues, we propose a novel approach, OwlEye, based on deep learning for modelling visual information of the GUI screenshot. Therefore, OwlEye can detect GUIs with display issues and also locate the detailed region of the issue in the given GUI for guiding developers to fix the bug. We manually construct a large-scale labelled dataset with 4,470 GUI screenshots with UI display issues and develop a heuristics-based data augmentation method for boosting the performance of our OwlEye. The evaluation demonstrates that our OwlEye can achieve 85% precision and 84% recall in detecting UI display issues, and 90% accuracy in localizing these issues. We also evaluate OwlEye with popular Android apps on Google Play and F-droid, and successfully uncover 57 previously-undetected UI display issues with 26 of them being confirmed or fixed so far.

preprint2020arXiv

PEL-BERT: A Joint Model for Protocol Entity Linking

Pre-trained models such as BERT are widely used in NLP tasks and are fine-tuned to improve the performance of various NLP tasks consistently. Nevertheless, the fine-tuned BERT model trained on our protocol corpus still has a weak performance on the Entity Linking (EL) task. In this paper, we propose a model that joints a fine-tuned language model with an RFC Domain Model. Firstly, we design a Protocol Knowledge Base as the guideline for protocol EL. Secondly, we propose a novel model, PEL-BERT, to link named entities in protocols to categories in Protocol Knowledge Base. Finally, we conduct a comprehensive study on the performance of pre-trained language models on descriptive texts and abstract concepts. Experimental results demonstrate that our model achieves state-of-the-art performance in EL on our annotated dataset, outperforming all the baselines.

preprint2020arXiv

Search for a generic heavy Higgs at the LHC

A generic heavy Higgs has both dim-4 and effective dim-6 interactions with the Standard Model (SM) particles. The former has been the focus of LHC searches in all major Higgs production channels, just as the SM one, but with negative results so far. If the heavy Higgs is connected with Beyond Standard Model (BSM) physics at a few TeV scale, its dim-6 operators will play a very important role - they significantly enhance the Higgs momentum, and reduce the SM background in a special phase space corner to a level such that a heavy Higgs emerges, which is not possible with dim-4 operators only. We focus on the associated VH production channel, where the effect of dim-6 operators is the largest and the SM background is the lowest. Main search regions for this type of signal are identified, and substructure variables of boosted jets are employed to enhance the signal from backgrounds. The parameter space of these operators are scanned over, and expected exclusion regions with 300 fb$^{-1}$ and 3 ab$^{-1}$ LHC data are shown, if no BSM is present. The strategy given in this paper will shed light on a heavy Higgs which may be otherwise hiding in the present and future LHC data.

preprint2020arXiv

Simulate anyons by cold atoms with induced electric dipole momentum

We show that it is possible to simulate an anyon by a trapped atom which possesses an induced electric dipole moment in the background of electromagnetic fields with a specific configuration. The electromagnetic fields we applied contain a magnetic and two electric fields. We find that when the atom is cooled down to the limit of the negligibly small kinetic energy, the atom behaves like an anyon because its angular momentum takes fractional values. The fractional part of the angular momentum is determined by both the magnetic and one of the electric fields. Roles two electromagnetic fields played are analyzed.

preprint2020arXiv

Terrain Visibility Graphs: Persistence is Not Enough

In this paper, we consider the Visibility Graph Recognition and Reconstruction problems in the context of terrains. Here, we are given a graph $G$ with labeled vertices $v_0, v_1, \ldots, v_{n-1}$ such that the labeling corresponds with a Hamiltonian path $H$. $G$ also may contain other edges. We are interested in determining if there is a terrain $T$ with vertices $p_0, p_1, \ldots, p_{n-1}$ such that $G$ is the visibility graph of $T$ and the boundary of $T$ corresponds with $H$. $G$ is said to be persistent if and only if it satisfies the so-called X-property and Bar-property. It is known that every &#34;pseudo-terrain&#34; has a persistent visibility graph and that every persistent graph is the visibility graph for some pseudo-terrain. The connection is not as clear for (geometric) terrains. It is known that the visibility graph of any terrain $T$ is persistent, but it has been unclear whether every persistent graph $G$ has a terrain $T$ such that $G$ is the visibility graph of $T$. There actually have been several papers that claim this to be the case (although no formal proof has ever been published), and recent works made steps towards building a terrain reconstruction algorithm for any persistent graph. In this paper, we show that there exists a persistent graph $G$ that is not the visibility graph for any terrain $T$. This means persistence is not enough by itself to characterize the visibility graphs of terrains, and implies that pseudo-terrains are not stretchable.

preprint2019arXiv

Realization of cyons and anyons by atoms

We propose theoretical schemes to realize the cyon and anyon by atoms which possess non-vanishing electric dipole moments. To realize a cyon, besides the atom, we need a magnetic field produced by a long magnetic-charged filament. To realize an anyon, however, apart from these we need a harmonic potential and an additional magnetic field produced by a uniformly distributed magnetic charges. We find that the atom will be an anyon when cooled down to the negligibly small kinetic energy limit. The relationship between our results and the previous ones is investigated from the electromagnetic duality.

preprint2018arXiv

Crowdtesting : When is The Party Over?

Trade-offs such as &#34;how much testing is enough&#34; are critical yet challenging project decisions in software engineering. Most existing approaches adopt risk-driven or value-based analysis to prioritize test cases and minimize test runs. However, none of these is applicable to the emerging crowd testing paradigm where task requesters typically have no control over online crowdworkers&#39;s dynamic behavior and uncertain performance. In current practice, deciding when to close a crowdtesting task is largely done by guesswork due to lack of decision support. This paper intends to fill this gap by introducing automated decision support for monitoring and determining appropriate time to close the crowdtesting tasks. First, this paper investigates the necessity and feasibility of close prediction of crowdtesting tasks based on industrial dataset. Then,it designs 8 methods for close prediction, based on various models including the bug trend, bug arrival model, capture-recapture model.Finally, the evaluation is conducted on 218 crowdtesting tasks from one of the largest crowdtesting platforms in China, and the results show that a median of 91% bugs can be detected with 49% saved cost.