Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
116works
0followers
57topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

116 published item(s)

preprint2026arXiv

ARCQuant: Boosting NVFP4 Quantization with Augmented Residual Channels for LLMs

The emergence of fine-grained numerical formats like NVFP4 presents new opportunities for efficient Large Language Model (LLM) inference. However, it is difficult to adapt existing Post-Training Quantization (PTQ) strategies to these formats: rotation-based methods compromise fine-grained block isolation; smoothing techniques struggle with significant 4-bit quantization errors; and mixed-precision approaches often conflict with hardware constraints on unified-precision computation. To address these challenges, we propose ARCQuant, a framework that boosts NVFP4 performance via Augmented Residual Channels. Distinct from methods that compromise block isolation or hardware uniformity, ARCQuant maintains a strictly unified NVFP4 format by augmenting the activation matrix with quantized residual channels. This design integrates the error compensation process directly into the matrix reduction dimension, enabling the use of standard, highly optimized GEMM kernels with minimal overhead. Theoretical analysis confirms that the worst-case error bound of our dual-stage NVFP4 quantization is comparable to that of standard 8-bit formats such as MXFP8. Extensive experiments on LLaMA and Qwen models demonstrate that ARCQuant achieves state-of-the-art accuracy, comparable to full-precision baselines in perplexity and downstream tasks. Furthermore, deployment on RTX 5090 and RTX PRO 6000 GPUs confirms practical benefits, achieving up to 3x speedup over FP16. Our code is available at https://github.com/actypedef/ARCQuant .

preprint2026arXiv

Brightest GRB flare observed in GRB 221009A: bridge the last gap between flare and prompt emission in GRB

Flares are usually observed during the afterglow phase of Gamma-Ray Bursts (GRBs) in soft X-ray, optical and radio bands, but rarely in gamma-ray band. Despite the extraordinary brightness, GECAM-C has accurately measured both the bright prompt emission and flare emission of GRB 221009A without instrumental effects, offering a good opportunity to study the relation between them. In this work, we present a comprehensive analysis of flare emission of GRB 221009A, which is composed of a series of flares. Among them, we identify an exceptionally bright flare with a record-breaking isotropic energy $E_{\rm iso} = 1.82 \times 10^{53}$ erg of GRB flares. It exhibits the highest peak energy ever detected in GRB flares, $E_{\rm peak} \sim 300$ keV, making it a genuine gamma-ray flare. It also shows rapid rise and decay timescales, significantly shorter than those of typical X-ray flares observed in soft X-ray or optical band, but comparable to those observed in prompt emissions. Despite these exceptional properties, the flare shares several common properties with typical GRB flares. We note that this is the first observation of a GRB flare in the keV-MeV band with sufficiently high temporal resolution and high statistics, which bridges the last gap between prompt emission and flare.

preprint2026arXiv

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

General reasoning represents a long-standing and formidable challenge in artificial intelligence. Recent breakthroughs, exemplified by large language models (LLMs) and chain-of-thought prompting, have achieved considerable success on foundational reasoning tasks. However, this success is heavily contingent upon extensive human-annotated demonstrations, and models' capabilities are still insufficient for more complex problems. Here we show that the reasoning abilities of LLMs can be incentivized through pure reinforcement learning (RL), obviating the need for human-labeled reasoning trajectories. The proposed RL framework facilitates the emergent development of advanced reasoning patterns, such as self-reflection, verification, and dynamic strategy adaptation. Consequently, the trained model achieves superior performance on verifiable tasks such as mathematics, coding competitions, and STEM fields, surpassing its counterparts trained via conventional supervised learning on human demonstrations. Moreover, the emergent reasoning patterns exhibited by these large-scale models can be systematically harnessed to guide and enhance the reasoning capabilities of smaller models.

preprint2026arXiv

Exact Relation Between Wehrl-Rényi Entropy and Many-Body Entanglement

Quantum entanglement is key to understanding correlations and emergent phenomena in quantum many-body systems. For $N$ qubits (distinguishable spin-$1/2$ particles) in a pure quantum state, many-body entanglement can be characterized by the purity of the reduced density matrix of a subsystem, defined as the trace of the square of this reduced density matrix. Nevertheless, this approach depends on the choice of subsystem. In this letter, we establish an exact relation between the Wehrl-Rényi entropy (WRE) $S_W^{(2)}$, which is the 2nd Rényi entropy of the Husimi function of the entire system, and the purities of all possible subsystems. Specifically, we prove the relation $e^{-S_W^{(2)}} = (6π)^{-N} \sum_A \mathrm{Tr}({{\hat ρ}_A}^2)$, where $A$ denotes a subsystem with reduced density matrix ${\hat ρ}_A$, and the summation runs over all $2^N$ possible subsystems. Furthermore, we show that the WRE can be experimentally measured via a concrete scheme. Therefore, the WRE is a subsystem-independent and experimentally measurable characterization of the overall entanglement in pure states of $N$ qubits. It can be applied to the study of strongly correlated spin systems, particularly those with all-to-all couplings that do not have a natural subsystem division, such as systems realized with natural atoms in optical tweezer arrays or superconducting quantum circuits. We also analytically derive the WRE for several representative many-body states, including Haar-random states, the Greenberger-Horne-Zeilinger (GHZ) state, and the W state.

preprint2026arXiv

Exploring the Challenge and Value of Deep Learning in Automated Skin Disease Diagnosis

Skin cancer is one of the most prevalent and deadly forms of cancer worldwide, highlighting the critical importance of early detection and diagnosis in improving patient outcomes. Deep learning (DL) has shown significant promise in enhancing the accuracy and efficiency of automated skin disease diagnosis, particularly in detecting and classifying skin lesions. However, several challenges remain for DL-based skin cancer diagnosis, including complex features, image noise, intra-class variation, inter-class similarity, and data imbalance. This review synthesizes recent research and discusses innovative approaches to address these challenges, such as data augmentation, hybrid models, and feature fusion. Furthermore, the review highlights the integration of DL models into clinical workflows, offering insights into the potential of deep learning to revolutionize skin disease diagnosis and improve clinical decision-making. This review uniquely integrates a PRISMA-based methodology with a challenge-oriented taxonomy, providing a systematic and transparent synthesis of recent deep learning advances for skin disease diagnosis. It further highlights emerging directions such as hybrid CNN-Transformer architectures and uncertainty-aware models, emphasizing its contribution to future dermatological AI research.

preprint2026arXiv

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

We present GLM-4.1V-Thinking, GLM-4.5V, and GLM-4.6V, a family of vision-language models (VLMs) designed to advance general-purpose multimodal understanding and reasoning. In this report, we share our key findings in the development of the reasoning-centric training framework. We first develop a capable vision foundation model with significant potential through large-scale pre-training, which arguably sets the upper bound for the final performance. We then propose Reinforcement Learning with Curriculum Sampling (RLCS) to unlock the full potential of the model, leading to comprehensive capability enhancement across a diverse range of tasks, including STEM problem solving, video understanding, content recognition, coding, grounding, GUI-based agents, and long document interpretation. In a comprehensive evaluation across 42 public benchmarks, GLM-4.5V achieves state-of-the-art performance on nearly all tasks among open-source models of similar size, and demonstrates competitive or even superior results compared to closed-source models such as Gemini-2.5-Flash on challenging tasks including Coding and GUI Agents. Meanwhile, the smaller GLM-4.1V-9B-Thinking remains highly competitive-achieving superior results to the much larger Qwen2.5-VL-72B on 29 benchmarks. We open-source both GLM-4.1V-9B-Thinking and GLM-4.5V. We further introduce the GLM-4.6V series, open-source multimodal models with native tool use and a 128K context window. A brief overview is available at https://z.ai/blog/glm-4.6v. Code, models and more information are released at https://github.com/zai-org/GLM-V.

preprint2026arXiv

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

We present GLM-5V-Turbo, a step toward native foundation models for multimodal agents. As foundation models are increasingly deployed in real environments, agentic capability depends not only on language reasoning, but also on the ability to perceive, interpret, and act over heterogeneous contexts such as images, videos, webpages, documents, GUIs. GLM-5V-Turbo is built around this objective: multimodal perception is integrated as a core component of reasoning, planning, tool use, and execution, rather than as an auxiliary interface to a language model. This report summarizes the main improvements behind GLM-5V-Turbo across model design, multimodal training, reinforcement learning, toolchain expansion, and integration with agent frameworks. These developments lead to strong performance in multimodal coding, visual tool use, and framework-based agentic tasks, while preserving competitive text-only coding capability. More importantly, our development process offers practical insights for building multimodal agents, highlighting the central role of multimodal perception, hierarchical optimization, and reliable end-to-end verification.

preprint2026arXiv

GPS-Synchronized Monitoring of Core-collapse Supernova Bursts with PandaX-4T via Coherent Elastic Neutrino Nuclear Scattering

The landmark detection of neutrinos from SN1987A marked the dawn of neutrino astrophysics. The neutrino burst provided essential insights into fundamental properties of neutrinos, and served as key probes of stellar evolution and supernova dynamics. The recent advancement in coherent elastic neutrino-nucleus scattering enables the detection of core-collapse supernova burst neutrinos using tonne-scale liquid xenon detectors originally designed for dark matter direct detection. Leveraging this capability, we developed and deployed an online supernova monitoring system for the PandaX-4T experiment. This system features a GPS module with millisecond-level timing precision, a low false-alarm rate, and high sensitivity to galactic core-collapse supernova explosion events. The methodology is robust, directly scalable, and planned for implementation in the next-generation PandaX-20T experiment.

preprint2026arXiv

MoHoBench: Assessing Honesty of Multimodal Large Language Models via Unanswerable Visual Questions

Recently Multimodal Large Language Models (MLLMs) have achieved considerable advancements in vision-language tasks, yet produce potentially harmful or untrustworthy content. Despite substantial work investigating the trustworthiness of language models, MMLMs' capability to act honestly, especially when faced with visually unanswerable questions, remains largely underexplored. This work presents the first systematic assessment of honesty behaviors across various MLLMs. We ground honesty in models' response behaviors to unanswerable visual questions, define four representative types of such questions, and construct MoHoBench, a large-scale MMLM honest benchmark, consisting of 12k+ visual question samples, whose quality is guaranteed by multi-stage filtering and human verification. Using MoHoBench, we benchmarked the honesty of 28 popular MMLMs and conducted a comprehensive analysis. Our findings show that: (1) most models fail to appropriately refuse to answer when necessary, and (2) MMLMs' honesty is not solely a language modeling issue, but is deeply influenced by visual information, necessitating the development of dedicated methods for multimodal honesty alignment. Therefore, we implemented initial alignment methods using supervised and preference learning to improve honesty behavior, providing a foundation for future work on trustworthy MLLMs. Our data and code can be found at https://github.com/yanxuzhu/MoHoBench.

preprint2026arXiv

On the Ultra-Long Gamma-Ray Transient GRB 250702B/EP250702

GRB 250702B/EP250702a is an interesting long-duration gamma-ray transient whose nature is in debate. To obtain a full picture in gamma-ray band, we implement a comprehensive targeted search of burst emission in a wide window of 30 days jointly with Insight-HXMT, GECAM and Fermi/GBM data within the ETJASMIN framework. In gamma-ray band, we find there is a 50-second precursor about 25 hours before the 4-hour main burst, which generally consists of 4 emission episodes. Remarkably, we find that the soft X-ray emission (after the main burst) decays as a power-law with start time aligning with the last episode of main emission and index of -5/3 perfectly consistent with the canonical prediction of fallback accretion. We conclude that the properties of precursor, main burst and the following soft X-ray emission strongly support the atypical collapsar Ultra-Long Gamma-Ray Burst (ULGRB) scenario rather than the Tidal Disruption Event (TDE), and all these gamma-ray and soft X-ray emission probably originate from relativistic jet whose luminosity is dominated by the fallback accretion rate during the death collapse of a supergiant star.

preprint2026arXiv

Post-Training Quantization of OpenPangu Models for Efficient Deployment on Atlas A2

Huawei's openPangu-Embedded-1B and openPangu-Embedded-7B are variants of the openPangu large language model, designed for efficient deployment on Ascend NPUs. The 7B variant supports three distinct Chain-of-Thought (CoT) reasoning paradigms, namely slow_think, auto_think, and no_think, while the 1B variant operates exclusively in the no_think mode, which employs condensed reasoning for higher efficiency. Although CoT reasoning enhances capability, the generation of extended reasoning traces introduces substantial memory and latency overheads, posing challenges for practical deployment on Ascend NPUs. This paper addresses these computational constraints by leveraging low-bit quantization, which transforms FP16 computations into more efficient integer arithmetic. We introduce a unified low-bit inference framework, supporting INT8 (W8A8) and W4A8 quantization, specifically optimized for openPangu-Embedded models on the Atlas A2. Our comprehensive evaluation on code generation benchmarks (HumanEval and MBPP) demonstrates the efficacy of this approach. INT8 quantization consistently preserves over 90\% of the FP16 baseline accuracy and achieves a 1.5x prefill speedup on the Atlas A2. Furthermore, W4A8 quantization significantly reduces memory consumption, albeit with a moderate trade-off in accuracy. These findings collectively indicate that low-bit quantization effectively facilitates efficient CoT reasoning on Ascend NPUs, maintaining high model fidelity.

preprint2026arXiv

SoulSeek: Exploring the Use of Social Cues in LLM-based Information Seeking

Social cues, which convey others' presence, behaviors, or identities, play a crucial role in human information seeking by helping individuals judge relevance and trustworthiness. However, existing LLM-based search systems primarily rely on semantic features, creating a misalignment with the socialized cognition underlying natural information seeking. To address this gap, we explore how the integration of social cues into LLM-based search influences users' perceptions, experiences, and behaviors. Focusing on social media platforms that are beginning to adopt LLM-based search, we integrate design workshops, the implementation of the prototype system (SoulSeek), a between-subjects study, and mixed-method analyses to examine both outcome- and process-level findings. The workshop informs the prototype's cue-integrated design. The study shows that social cues improve perceived outcomes and experiences, promote reflective information behaviors, and reveal limits of current LLM-based search. We propose design implications emphasizing better social-knowledge understanding, personalized cue settings, and controllable interactions.

preprint2026arXiv

Understanding Generalization through Decision Pattern Shift

Understanding why deep neural networks (DNNs) fail to generalize to unseen samples remains a long-standing challenge. Existing studies mainly examine changes in externally observable factors such as data, representations, or outputs, yet offer limited insight into how a model's internal decision mechanism evolves from training to test. To address this gap, we introduce Decision Pattern Shift (DPS), a new perspective that defines generalization through the stability of internal decision patterns and quantifies failure as their deviation from those learned during training. Specifically, we represent each sample's decision pattern as a GradCAM-based channel-contribution vector, which captures how feature channels collectively support a prediction, and we propose the DPS metric to measure its discrepancy from the class-average pattern. Empirical analyses across multiple datasets and architectures show that, (i) decision patterns form a highly structured, class-consistent space with strong intra-class cohesion and low inter-class confusion, enabling direct analysis of a model's decision logic; (ii) the DPS magnitude correlates linearly with the generalization gap (nearly all Pearson r > 0.8), revealing generalization as a systematic drift in the model's internal decision mechanism; (iii) the DPS spectrum organizes diverse generalization degradation scenarios (covering ideal generalization, in-distribution degradation, domain shift, out-of-distribution, and shortcut learning) into a continuous trajectory, providing a unified explanation of their failure modes. These findings open up new possibilities for early generalization-risk detection, failure-mode diagnosis, and channel-level defect localization.

preprint2025arXiv

A reduced-scale autonomous morphing vehicle prototype with enhanced aerodynamic efficiency

Road vehicles contribute to significant levels of greenhouse gas (GHG) emissions. A potential strategy for improving their aerodynamic efficiency and reducing emissions is through active adaptation of their exterior shapes to the aerodynamic environment. In this study, we present a reduced-scale morphing vehicle prototype capable of actively interacting with the aerodynamic environment to enhance fuel economy. Morphing is accomplished by retrofitting a deformable structure actively actuated by built-in motors. The morphing vehicle prototype is integrated with an optimization algorithm that can autonomously identify the structural shape that minimizes aerodynamic drag. The performance of the morphing vehicle prototype is investigated through an extensive experimental campaign in a large-scale wind tunnel facility. The autonomous optimization algorithm identifies an optimal morphing shape that can elicit an 8.5% reduction in the mean drag force. Our experiments provide a comprehensive dataset that validates the efficiency of shape morphing, demonstrating a clear and consistent decrease in the drag force as the vehicle transitions from a suboptimal to the optimal shape. Insights gained from experiments on scaled-down models provide valuable guidelines for the design of full-size morphing vehicles, which could lead to appreciable energy savings and reductions in GHG emissions. This study highlights the feasibility and benefits of real-time shape morphing under conditions representative of realistic road environments, paving the way for the realization of full-scale morphing vehicles with enhanced aerodynamic efficiency and reduced GHG emissions.

preprint2025arXiv

BIOME-Bench: A Benchmark for Biomolecular Interaction Inference and Multi-Omics Pathway Mechanism Elucidation from Scientific Literature

Multi-omics studies often rely on pathway enrichment to interpret heterogeneous molecular changes, but pathway enrichment (PE)-based workflows inherit structural limitations of pathway resources, including curation lag, functional redundancy, and limited sensitivity to molecular states and interventions. Although recent work has explored using large language models (LLMs) to improve PE-based interpretation, the lack of a standardized benchmark for end-to-end multi-omics pathway mechanism elucidation has largely confined evaluation to small, manually curated datasets or ad hoc case studies, hindering reproducible progress. To address this issue, we introduce BIOME-Bench, constructed via a rigorous four-stage workflow, to evaluate two core capabilities of LLMs in multi-omics analysis: Biomolecular Interaction Inference and end-to-end Multi-Omics Pathway Mechanism Elucidation. We develop evaluation protocols for both tasks and conduct comprehensive experiments across multiple strong contemporary models. Experimental results demonstrate that existing models still exhibit substantial deficiencies in multi-omics analysis, struggling to reliably distinguish fine-grained biomolecular relation types and to generate faithful, robust pathway-level mechanistic explanations.

preprint2025arXiv

Large Emotional World Model

World Models serve as tools for understanding the current state of the world and predicting its future dynamics, with broad application potential across numerous fields. As a key component of world knowledge, emotion significantly influences human decision-making. While existing Large Language Models (LLMs) have shown preliminary capability in capturing world knowledge, they primarily focus on modeling physical-world regularities and lack systematic exploration of emotional factors. In this paper, we first demonstrate the importance of emotion in understanding the world by showing that removing emotionally relevant information degrades reasoning performance. Inspired by theory of mind, we further propose a Large Emotional World Model (LEWM). Specifically, we construct the Emotion-Why-How (EWH) dataset, which integrates emotion into causal relationships and enables reasoning about why actions occur and how emotions drive future world states. Based on this dataset, LEWM explicitly models emotional states alongside visual observations and actions, allowing the world model to predict both future states and emotional transitions. Experimental results show that LEWM more accurately predicts emotion-driven social behaviors while maintaining comparable performance to general world models on basic tasks.

preprint2025arXiv

LUNCH: A Lightweight Unified Deep-Learning Framework for General Transients Classification in High-Energy Time-Domain Astronomy

The increasing data volume of high-energy space monitors necessitates real-time, automated transient classification for multi-messenger follow-up. Conventional methods rely on empirical features like hardness ratios and reliable localization, which are not always precisely available during early detection. We developed the Lightweight Unified Neural Classifier for High-energy Transients (LUNCH) - an end-to-end deep-learning framework that performs general transient classification directly from raw multi-band light curves, eliminating the need for background subtraction or source localization. Its dual-scale architecture fuses long- and short-scale temporal evolution adaptively. Evaluated on 15 years of Fermi/GBM triggers, the optimal model achieves 97.23% accuracy when trained on complete energy spectra. A lightweight version using only three broad energy bands retains 95.07% accuracy, demonstrating that coarse spectral information fused with temporal context enables robust discrimination. The system significantly outperforms the GBM in-flight classifier on three months of independent test data. Feature visualization reveals well-separated class clusters, confirming physical interpretability. LUNCH combines high accuracy, low computational cost, and instrument-agnostic inputs, offering a practical solution for real-time in-flight processing that enables timely triggers for immediate multi-wavelength and multi-messenger follow-up observations in future time-domain missions.

preprint2025arXiv

MiMo-Audio: Audio Language Models are Few-Shot Learners

Existing audio language models typically rely on task-specific fine-tuning to accomplish particular audio tasks. In contrast, humans are able to generalize to new audio tasks with only a few examples or simple instructions. GPT-3 has shown that scaling next-token prediction pretraining enables strong generalization capabilities in text, and we believe this paradigm is equally applicable to the audio domain. By scaling MiMo-Audio's pretraining data to over one hundred million of hours, we observe the emergence of few-shot learning capabilities across a diverse set of audio tasks. We develop a systematic evaluation of these capabilities and find that MiMo-Audio-7B-Base achieves SOTA performance on both speech intelligence and audio understanding benchmarks among open-source models. Beyond standard metrics, MiMo-Audio-7B-Base generalizes to tasks absent from its training data, such as voice conversion, style transfer, and speech editing. MiMo-Audio-7B-Base also demonstrates powerful speech continuation capabilities, capable of generating highly realistic talk shows, recitations, livestreaming and debates. At the post-training stage, we curate a diverse instruction-tuning corpus and introduce thinking mechanisms into both audio understanding and generation. MiMo-Audio-7B-Instruct achieves open-source SOTA on audio understanding benchmarks (MMSU, MMAU, MMAR, MMAU-Pro), spoken dialogue benchmarks (Big Bench Audio, MultiChallenge Audio) and instruct-TTS evaluations, approaching or surpassing closed-source models. Model checkpoints and full evaluation suite are available at https://github.com/XiaomiMiMo/MiMo-Audio.

preprint2025arXiv

Quantum Visual Word Sense Disambiguation: Unraveling Ambiguities Through Quantum Inference Model

Visual word sense disambiguation focuses on polysemous words, where candidate images can be easily confused. Traditional methods use classical probability to calculate the likelihood of an image matching each gloss of the target word, summing these to form a posterior probability. However, due to the challenge of semantic uncertainty, glosses from different sources inevitably carry semantic biases, which can lead to biased disambiguation results. Inspired by quantum superposition in modeling uncertainty, this paper proposes a Quantum Inference Model for Unsupervised Visual Word Sense Disambiguation (Q-VWSD). It encodes multiple glosses of the target word into a superposition state to mitigate semantic biases. Then, the quantum circuit is executed, and the results are observed. By formalizing our method, we find that Q-VWSD is a quantum generalization of the method based on classical probability. Building on this, we further designed a heuristic version of Q-VWSD that can run more efficiently on classical computing. The experiments demonstrate that our method outperforms state-of-the-art classical methods, particularly by effectively leveraging non-specialized glosses from large language models, which further enhances performance. Our approach showcases the potential of quantum machine learning in practical applications and provides a case for leveraging quantum modeling advantages on classical computers while quantum hardware remains immature.

preprint2024arXiv

Re-evaluating the Memory-balanced Pipeline Parallelism: BPipe

Pipeline parallelism is an essential technique in the training of large-scale Transformer models. However, it suffers from imbalanced memory consumption, leading to insufficient memory utilization. The BPipe technique was proposed to address this issue and has proven effective in the GPT-3 model. Nevertheless, our experiments have not yielded similar benefits for LLaMA training. Additionally, BPipe only yields negligible benefits for GPT-3 training when applying flash attention. We analyze the underlying causes of the divergent performance of BPipe on GPT-3 and LLaMA. Furthermore, we introduce a novel method to estimate the performance of BPipe.

preprint2024arXiv

Tunning the number of chiral edge channels in a fixed quantum anomalous Hall system

Quantum anomalous Hall (QAH) insulators exhibit chiral edge channels characterized by vanishing longitudinal conductance and quantized Hall conductance of Ce2/h, wherein the Chern number C is an integer equal to the number of the parallel chiral edge channels. These chiral edge channels conduct dissipationless transport in QAH insulators, making them pivotal for applications in low-consumption electronics and topological quantum computing. While the QAH effect with multiple chiral edge channels (i.e., C >1) has been demonstrated in multilayers consisting of magnetic topological insulators and normal insulators, the channel number remains fixed for a given sample. Here, we unveil the tunability of the number of chiral edge channels within a single QAH insulator device. By tuning the magnetization of individual layers within the multilayer system, Chern insulating states with different Chern numbers are unveiled. The tunable Chern number was corroborated by our theoretical calculations. Furthermore, we conducted layer-dependent calculations to elucidate the contribution of the Chern number from different layers in the multilayer. Our findings demonstrate an extra degree of freedom in manipulating the chiral edge channels in QAH insulators. This newfound tunability offers extra dimension for the implementation of the QAH-based multi-channel dissipationless transport.

preprint2023arXiv

A First Search for Solar $^8$B Neutrino in the PandaX-4T Experiment using Neutrino-Nucleus Coherent Scattering

A search for interactions from solar $^8$B neutrinos elastically scattering off xenon nuclei using PandaX-4T commissioning data is reported. The energy threshold of this search is further lowered compared with the previous search for dark matter, with various techniques utilized to suppress the background that emerges from data with the lowered threshold. A blind analysis is performed on the data with an effective exposure of 0.48 tonne$\cdot$year, and no significant excess of events is observed. Among results obtained using the neutrino-nucleus coherent scattering, our results give the best constraint on the solar $^8$B neutrino flux. We further provide a more stringent limit on the cross section between dark matter and nucleon in the mass range from 3 to 9 GeV/c$^2$.

preprint2023arXiv

AS-PD: An Arbitrary-Size Downsampling Framework for Point Clouds

Point cloud downsampling is a crucial pre-processing operation to downsample points in order to unify data size and reduce computational cost, to name a few. Recent research on point cloud downsampling has achieved great success which concentrates on learning to sample in a task-aware way. However, existing learnable samplers can not directly perform arbitrary-size downsampling, and assume the input size is fixed. In this paper, we introduce the AS-PD, a novel task-aware sampling framework that directly downsamples point clouds to any smaller size based on a sample-to-refine strategy. Given an input point cloud of arbitrary size, we first perform a task-agnostic pre-sampling on the input point cloud to a specified sample size. Then, we obtain the sampled set by refining the pre-sampled set to make it task-aware, driven by downstream task losses. The refinement is realized by adding each pre-sampled point with a small offset predicted by point-wise multi-layer perceptrons (MLPs). With the density encoding and proper training scheme, the framework can learn to adaptively downsample point clouds of different input sizes to arbitrary sample sizes. We evaluate sampled results for classification and registration tasks, respectively. The proposed AS-PD surpasses the state-of-the-art method in terms of downstream performance. Further experiments also show that our AS-PD exhibits better generality to unseen task models, implying that the proposed sampler is optimized to the task rather than a specified task model.

preprint2023arXiv

GraphTheta: A Distributed Graph Neural Network Learning System With Flexible Training Strategy

Graph neural networks (GNNs) have been demonstrated as a powerful tool for analyzing non-Euclidean graph data. However, the lack of efficient distributed graph learning systems severely hinders applications of GNNs, especially when graphs are big and GNNs are relatively deep. Herein, we present GraphTheta, the first distributed and scalable graph learning system built upon vertex-centric distributed graph processing with neural network operators implemented as user-defined functions. This system supports multiple training strategies and enables efficient and scalable big-graph learning on distributed (virtual) machines with low memory. To facilitate graph convolutions, GraphTheta puts forward a new graph learning abstraction named NN-TGAR to bridge the gap between graph processing and graph deep learning. A distributed graph engine is proposed to conduct the stochastic gradient descent optimization with a hybrid-parallel execution, and a new cluster-batched training strategy is supported. We evaluate GraphTheta using several datasets with network sizes ranging from small-, modest- to large-scale. Experimental results show that GraphTheta can scale well to 1,024 workers for training an in-house developed GNN on an industry-scale Alipay dataset of 1.4 billion nodes and 4.1 billion attributed edges, with a cluster of CPU virtual machines (dockers) of small memory each (5$\sim$12GB). Moreover, GraphTheta can outperform DistDGL by up to $2.02\times$, with better scalability, and GraphLearn by up to $30.56\times$. As for model accuracy, GraphTheta is capable of learning as good GNNs as existing frameworks. To the best of our knowledge, this work presents the largest edge-attributed GNN learning task in the literature.

preprint2023arXiv

Structural tuning magnetism and topology in a magnetic topological insulator

To date, the most widely-studied quantum anomalous Hall insulator (QAHI) platform is achieved by dilute doping of magnetic ions into thin films of the alloyed tetradymite topological insulator (TI) (Bi$_{1-x}$Sb$_x$)$_2$Te$_3$ (BST). In these films, long-range magnetic ordering of the transition metal substituants opens an exchange gap $Δ$ in the topological surface states, stabilizing spin-polarized, dissipationless edge channels with a nonzero Chern number $\mathcal{C}$. The long-range ordering of the spatially separated magnetic ions is itself mediated by electronic states in the host TI, leading to a sophisticated feedback between magnetic and electronic properties. Here we present a study of the electronic and magnetic response of a BST-based QAHI system to structural tuning via hydrostatic pressure. We identify a systematic closure of the topological gap under compressive strain accompanied by a simultaneous enhancement in the magnetic ordering strength. Combining these experimental results with first-principle calculations we identify structural deformation as a strong tuning parameter to traverse a rich topological phase space and modify magnetism in the magnetically doped BST system.

preprint2023arXiv

Towards Net-Zero Carbon Emissions in Network AI for 6G and Beyond

A global effort has been initiated to reduce the worldwide greenhouse gas (GHG) emissions, primarily carbon emissions, by half by 2030 and reach net-zero by 2050. The development of 6G must also be compliant with this goal. Unfortunately, developing a sustainable and net-zero emission systems to meet the users' fast growing demands on mobile services, especially smart services and applications, may be much more challenging than expected. Particularly, despite the energy efficiency improvement in both hardware and software designs, the overall energy consumption and carbon emission of mobile networks are still increasing at a tremendous speed. The growing penetration of resource-demanding AI algorithms and solutions further exacerbate this challenge. In this article, we identify the major emission sources and introduce an evaluation framework for analyzing the lifecycle of network AI implementations. A novel joint dynamic energy trading and task allocation optimization framework, called DETA, has been introduced to reduce the overall carbon emissions. We consider a federated edge intelligence-based network AI system as a case study to verify the effectiveness of our proposed solution. Experimental results based on a hardware prototype suggest that our proposed solution can reduce carbon emissions of network AI systems by up to 74.9%. Finally, open problems and future directions are discussed.

preprint2022arXiv

A Framework Based on Generational and Environmental Response Strategies for Dynamic Multi-objective Optimization

Due to the dynamics and uncertainty of the dynamic multi-objective optimization problems (DMOPs), it is difficult for algorithms to find a satisfactory solution set before the next environmental change, especially for some complex environments. One reason may be that the information in the environmental static stage can not be used well in the traditional framework. In this paper, a novel framework based on generational and environmental response strategies (FGERS) is proposed, in which response strategies are run both in the environmental change stage and the environmental static stage to obtain population evolution information of those both stages. Unlike in the traditional framework, response strategies are only run in the environmental change stage. For simplicity, the feed-forward center point strategy was chosen to be the response strategy in the novel dynamic framework (FGERS-CPS). FGERS-CPS is not only to predict change trend of the optimum solution set in the environmental change stage, but to predict the evolution trend of the population after several generations in the environmental static stage. Together with the feed-forward center point strategy, a simple memory strategy and adaptive diversity maintenance strategy were used to form the complete FGERS-CPS. On 13 DMOPs with various characteristics, FGERS-CPS was compared with four classical response strategies in the traditional framework. Experimental results show that FGERS-CPS is effective for DMOPs.

preprint2022arXiv

A Multi-Metric Latent Factor Model for Analyzing High-Dimensional and Sparse data

High-dimensional and sparse (HiDS) matrices are omnipresent in a variety of big data-related applications. Latent factor analysis (LFA) is a typical representation learning method that extracts useful yet latent knowledge from HiDS matrices via low-rank approximation. Current LFA-based models mainly focus on a single-metric representation, where the representation strategy designed for the approximation Loss function, is fixed and exclusive. However, real-world HiDS matrices are commonly heterogeneous and inclusive and have diverse underlying patterns, such that a single-metric representation is most likely to yield inferior performance. Motivated by this, we in this paper propose a multi-metric latent factor (MMLF) model. Its main idea is two-fold: 1) two vector spaces and three Lp-norms are simultaneously employed to develop six variants of LFA model, each of which resides in a unique metric representation space, and 2) all the variants are ensembled with a tailored, self-adaptive weighting strategy. As such, our proposed MMLF enjoys the merits originated from a set of disparate metric spaces all at once, achieving the comprehensive and unbiased representation of HiDS matrices. Theoretical study guarantees that MMLF attains a performance gain. Extensive experiments on eight real-world HiDS datasets, spanning a wide range of industrial and science domains, verify that our MMLF significantly outperforms ten state-of-the-art, shallow and deep counterparts.

preprint2022arXiv

A Search for the Cosmic Ray Boosted Sub-GeV Dark Matter at the PandaX-II Experiment

We report a novel search for the cosmic ray boosted dark matter using the 100~tonne$\cdot$day full data set of the PandaX-II detector located at the China Jinping Underground Laboratory. With the extra energy gained from the cosmic rays, sub-GeV dark matter particles can produce visible recoil signals in the detector. The diurnal modulations in rate and energy spectrum are utilized to further enhance the signal sensitivity. Our result excludes the dark matter-nucleon elastic scattering cross section between 10$^{-31}$cm$^{2}$ and 10$^{-28}$cm$^{2}$ for a dark matter masses from 0.1 MeV/$c^2$ to 0.1 GeV/$c^2$, with a large parameter space previously unexplored by experimental collaborations.

preprint2022arXiv

A search for two-component Majorana dark matter in a simplified model using the full exposure data of PandaX-II experiment

In the two-component Majorana dark matter model, one dark matter particle can scatter off the target nuclei, and turn into a slightly heavier component. In the framework of a simplified model with a vector boson mediator, both the tree-level and loop-level processes contribute to the signal in direct detection experiment. In this paper, we report the search results for such dark matter from PandaX-II experiment, using total data of the full 100.7 tonne$\cdot$day exposure. No significant excess is observed, so strong constraints on the combined parameter space of mediator mass and dark matter mass are derived. With the complementary search results from collider experiments, a large range of parameter space can be excluded.

preprint2022arXiv

Absolute Zero-Shot Learning

Considering the increasing concerns about data copyright and privacy issues, we present a novel Absolute Zero-Shot Learning (AZSL) paradigm, i.e., training a classifier with zero real data. The key innovation is to involve a teacher model as the data safeguard to guide the AZSL model training without data leaking. The AZSL model consists of a generator and student network, which can achieve date-free knowledge transfer while maintaining the performance of the teacher network. We investigate `black-box' and `white-box' scenarios in AZSL task as different levels of model security. Besides, we also provide discussion of teacher model in both inductive and transductive settings. Despite embarrassingly simple implementations and data-missing disadvantages, our AZSL framework can retain state-of-the-art ZSL and GZSL performance under the `white-box' scenario. Extensive qualitative and quantitative analysis also demonstrates promising results when deploying the model under `black-box' scenario.

preprint2022arXiv

AMinerGNN: Heterogeneous Graph Neural Network for Paper Click-through Rate Prediction with Fusion Query

Paper recommendation with user-generated keyword is to suggest papers that simultaneously meet user's interests and are relevant to the input keyword. This is a recommendation task with two queries, a.k.a. user ID and keyword. However, existing methods focus on recommendation according to one query, a.k.a. user ID, and are not applicable to solving this problem. In this paper, we propose a novel click-through rate (CTR) prediction model with heterogeneous graph neural network, called AMinerGNN, to recommend papers with two queries. Specifically, AMinerGNN constructs a heterogeneous graph to project user, paper, and keyword into the same embedding space by graph representation learning. To process two queries, a novel query attentive fusion layer is designed to recognize their importances dynamically and then fuse them as one query to build a unified and end-to-end recommender system. Experimental results on our proposed dataset and online A/B tests prove the superiority of AMinerGNN.

preprint2022arXiv

An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection

DeepFake based digital facial forgery is threatening the public media security, especially when lip manipulation has been used in talking face generation, the difficulty of fake video detection is further improved. By only changing lip shape to match the given speech, the facial features of identity is hard to be discriminated in such fake talking face videos. Together with the lack of attention on audio stream as the prior knowledge, the detection failure of fake talking face generation also becomes inevitable. Inspired by the decision-making mechanism of human multisensory perception system, which enables the auditory information to enhance post-sensory visual evidence for informed decisions output, in this study, a fake talking face detection framework FTFDNet is proposed by incorporating audio and visual representation to achieve more accurate fake talking face videos detection. Furthermore, an audio-visual attention mechanism (AVAM) is proposed to discover more informative features, which can be seamlessly integrated into any audio-visual CNN architectures by modularization. With the additional AVAM, the proposed FTFDNet is able to achieve a better detection performance on the established dataset (FTFDD). The evaluation of the proposed work has shown an excellent performance on the detection of fake talking face videos, which is able to arrive at a detection rate above 97%.

preprint2022arXiv

Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild

Talking face generation with great practical significance has attracted more attention in recent audio-visual studies. How to achieve accurate lip synchronization is a long-standing challenge to be further investigated. Motivated by xxx, in this paper, an AttnWav2Lip model is proposed by incorporating spatial attention module and channel attention module into lip-syncing strategy. Rather than focusing on the unimportant regions of the face image, the proposed AttnWav2Lip model is able to pay more attention on the lip region reconstruction. To our limited knowledge, this is the first attempt to introduce attention mechanism to the scheme of talking face generation. An extensive experiments have been conducted to evaluate the effectiveness of the proposed model. Compared to the baseline measured by LSE-D and LSE-C metrics, a superior performance has been demonstrated on the benchmark lip synthesis datasets, including LRW, LRS2 and LRS3.

preprint2022arXiv

Audio-visual speech separation based on joint feature representation with cross-modal attention

Multi-modal based speech separation has exhibited a specific advantage on isolating the target character in multi-talker noisy environments. Unfortunately, most of current separation strategies prefer a straightforward fusion based on feature learning of each single modality, which is far from sufficient consideration of inter-relationships between modalites. Inspired by learning joint feature representations from audio and visual streams with attention mechanism, in this study, a novel cross-modal fusion strategy is proposed to benefit the whole framework with semantic correlations between different modalities. To further improve audio-visual speech separation, the dense optical flow of lip motion is incorporated to strengthen the robustness of visual representation. The evaluation of the proposed work is performed on two public audio-visual speech separation benchmark datasets. The overall improvement of the performance has demonstrated that the additional motion network effectively enhances the visual representation of the combined lip images and audio signal, as well as outperforming the baseline in terms of all metrics with the proposed cross-modal fusion.

preprint2022arXiv

Botnets Breaking Transformers: Localization of Power Botnet Attacks Against the Distribution Grid

Traditional botnet attacks leverage large and distributed numbers of compromised internet-connected devices to target and overwhelm other devices with internet packets. With increasing consumer adoption of high-wattage internet-facing "smart devices", a new "power botnet" attack emerges, where such devices are used to target and overwhelm power grid devices with unusual load demand. We introduce a variant of this attack, the power-botnet weardown-attack, which does not intend to cause blackouts or short-term acute instability, but instead forces expensive mechanical components to activate more frequently, necessitating costly replacements / repairs. Specifically, we target the on-load tap-changer (OLTC) transformer, which uses a mechanical switch that responds to change in load demand. In our analysis and simulations, these attacks can halve the lifespan of an OLTC, or in the most extreme cases, reduce it to $2.5\%$ of its original lifespan. Notably, these power botnets are composed of devices not connected to the internal SCADA systems used to control power grids. This represents a new internet-based cyberattack that targets the power grid from the outside. To help the power system to mitigate these types of botnet attacks, we develop attack-localization strategies. We formulate the problem as a supervised machine learning task to locate the source of power botnet attacks. Within a simulated environment, we generate the training and testing dataset to evaluate several machine learning algorithm based localization methods, including SVM, neural network and decision tree. We show that decision-tree based classification successfully identifies power botnet attacks and locates compromised devices with at least $94\%$ improvement of accuracy over a baseline "most-frequent" classifier.

preprint2022arXiv

Chromium-Doped Bismuth Antimony Telluride for Future Quantum Hall Resistance Standards

Since 2017, epitaxial graphene has been the base material for the US national standard for resistance. A future avenue of research within electrical metrology is to remove the need for strong magnetic fields, as is currently the case for devices exhibiting the quantum Hall effect. The quantum Hall effect is just one of many research endeavours that revolve around recent quantum physical phenomena like composite fermions, charge density waves, and topological properties [1-2]. New materials, like magnetically doped topological insulators (MTIs), offer access to the quantum anomalous Hall effect, which in its ideal form, could become a future resistance standard needing only a small permanent magnet to activate a quantized resistance value [3-5]. Furthermore, these devices could operate at zero-field for measurements, making the dissemination of the ohm more economical and portable. Here we present results on precision measurements of the h/e2 quantized plateau of Cr-Doped (BixSb1-x)2Te3 and give them context by comparing them to modern graphene-based resistance standards. Ultimately, MTI-based devices could be combined in a single system with magnetic-field-averse Josephson voltage standards to obtain an alternative quantum current standard.

preprint2022arXiv

CODE: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking

Expert finding, a popular service provided by many online websites such as Expertise Finder, LinkedIn, and AMiner, is beneficial to seeking candidate qualifications, consultants, and collaborators. However, its quality is suffered from lack of ample sources of expert information. This paper employs AMiner as the basis with an aim at linking any external experts to the counterparts on AMiner. As it is infeasible to acquire sufficient linkages from arbitrary external sources, we explore the problem of zero-shot expert linking. In this paper, we propose CODE, which first pre-trains an expert linking model by contrastive learning on AMiner such that it can capture the representation and matching patterns of experts without supervised signals, then it is fine-tuned between AMiner and external sources to enhance the models transferability in an adversarial manner. For evaluation, we first design two intrinsic tasks, author identification and paper clustering, to validate the representation and matching capability endowed by contrastive learning. Then the final external expert linking performance on two genres of external sources also implies the superiority of the adversarial fine-tuning method. Additionally, we show the online deployment of CODE, and continuously improve its online performance via active learning.

preprint2022arXiv

Direct imaging of polymer filaments pulled from rebounding drops

Polymer filaments form the foundation of biology from cell scaffolding to DNA. Their study and fabrication play an important role in a wide range of processes from tissue engineering to molecular machines. We present a simple method to deposit stretched polymer fibers between micro-pillars. This occurs when a polymeric drop impacts on and rebounds from an inclined superhydrophobic substrate. It wets the top of the pillars and pulls out liquid filaments which are stretched and can attach to adjacent pillars leaving minuscule threads, with the solvent evaporating to leave the exposed polymers. We use high-speed video at the microscale to characterize the most robust filament-forming configurations, by varying the impact velocity, substrate structure and inclination angle, as well as the PEO-polymer concentration. Impacts onto plant leaves or randomized nano-structured surface leads to the formation of a branched structure, through filament mergers at the free surface of the drop. SEM shows the deposition of filament bundles which are thinner than those formed by evaporation or rolling drops. Raman spectroscopy identifies mode B stretched DNA filaments from aqueous-solution droplets.

preprint2022arXiv

Exchange-biased quantum anomalous Hall effect

The quantum anomalous Hall (QAH) effect is characterized by a dissipationless chiral edge state with a quantized Hall resistance at zero magnetic field. Manipulating the QAH state is of great importance in both the understanding of topological quantum physics and the implementation of dissipationless electronics. Here, we realized the QAH effect in the magnetic topological insulator Cr-doped (Bi,Sb)2Te3 (CBST) grown on an uncompensated antiferromagnetic insulator Al-doped Cr2O3. Through polarized neutron reflectometry (PNR), we find a strong exchange coupling between CBST and Al-Cr2O3 surface spins fixing interfacial magnetic moments perpendicular to the film plane. The interfacial coupling results in an exchange-biased QAH effect. We further demonstrate that the magnitude and sign of the exchange bias can be effectively controlled using a field training process to set the magnetization of the Al-Cr2O3 layer. Our work demonstrates the use of the exchange bias effect to effectively manipulate the QAH state, opening new possibilities in QAH-based spintronics.

preprint2022arXiv

Graph Contrastive Learning for Anomaly Detection

Graph-based anomaly detection has been widely used for detecting malicious activities in real-world applications. Existing attempts to address this problem have thus far focused on structural feature engineering or learning in the binary classification regime. In this work, we propose to leverage graph contrastive coding and present the supervised GraphCAD model for contrasting abnormal nodes with normal ones in terms of their distances to the global context (e.g., the average of all nodes). To handle scenarios with scarce labels, we further enable GraphCAD as a self-supervised framework by designing a graph corrupting strategy for generating synthetic node labels. To achieve the contrastive objective, we design a graph neural network encoder that can infer and further remove suspicious links during message passing, as well as learn the global context of the input graph. We conduct extensive experiments on four public datasets, demonstrating that 1) GraphCAD significantly and consistently outperforms various advanced baselines and 2) its self-supervised version without fine-tuning can achieve comparable performance with its fully supervised version.

preprint2022arXiv

Hardness Results for Laplacians of Simplicial Complexes via Sparse-Linear Equation Complete Gadgets

We study linear equations in combinatorial Laplacians of $k$-dimensional simplicial complexes ($k$-complexes), a natural generalization of graph Laplacians. Combinatorial Laplacians play a crucial role in homology and are a central tool in topology. Beyond this, they have various applications in data analysis and physical modeling problems. It is known that nearly-linear time solvers exist for graph Laplacians. However, nearly-linear time solvers for combinatorial Laplacians are only known for restricted classes of complexes. This paper shows that linear equations in combinatorial Laplacians of 2-complexes are as hard to solve as general linear equations. More precisely, for any constant $c \geq 1$, if we can solve linear equations in combinatorial Laplacians of 2-complexes up to high accuracy in time $\tilde{O}((\# \text{ of nonzero coefficients})^c)$, then we can solve general linear equations with polynomially bounded integer coefficients and condition numbers up to high accuracy in time $\tilde{O}((\# \text{ of nonzero coefficients})^c)$. We prove this by a nearly-linear time reduction from general linear equations to combinatorial Laplacians of 2-complexes. Our reduction preserves the sparsity of the problem instances up to poly-logarithmic factors.

preprint2022arXiv

Hardness Results for Weaver's Discrepancy Problem

Marcus, Spielman and Srivastava (Annals of Mathematics 2014) solved the Kadison--Singer Problem by proving a strong form of Weaver's conjecture: they showed that for all $α> 0$ and all lists of vectors of norm at most $\sqrtα$ whose outer products sum to the identity, there exists a signed sum of those outer products with operator norm at most $\sqrt{8 α} + 2 α.$ We prove that it is NP-hard to distinguish such a list of vectors for which there is a signed sum that equals the zero matrix from those in which every signed sum has operator norm at least $κ\sqrtα$, for some absolute constant $κ> 0.$ Thus, it is NP-hard to construct a signing that is a constant factor better than that guaranteed to exist. For $α= 1/4$, we prove that it is NP-hard to distinguish whether there is a signed sum that equals the zero matrix from the case in which every signed sum has operator norm at least $1/4$.

preprint2022arXiv

Interference Between Molecular and Photon Field-Mediated Electron Transfer Coupling Pathways in Cavities

Cavity polaritonics is capturing the imagination of the chemistry community because of the novel opportunities it creates to direct chemistry. Electron transfer (ET) reactions are among the simplest reactions, and they also underpin bioenergetics. As such, new conceptual strategies to manipulate and direct electron flow at the nanoscale are of wide-ranging interest in biochemistry, energy science, bio-inspired materials science, and chemistry. We show that optical cavities can modulate electron transfer pathway interferences and ET rates in donor-bridge-acceptor (DBA) systems. We derive the rate for DBA electron transfer systems when they are coupled with cavity photon fields (which may be off- or on-resonance with a molecular electronic transition), emphasizing novel cavity-induced pathway interferences with the molecular electronic coupling pathways, as these interferences allow a new kind of ET rate tuning. We also examined the ET kinetics for both low and high cavity frequency regimes as the light-matter coupling strength is varied. The interference between the cavity-induced and intrinsic molecular coupling pathway interference is defined by the cavity properties, including the cavity frequency and the light-matter coupling interaction strength. Thus, manipulating the cavity-induced interferences with the chemical coupling pathways offers new strategies to direct charge flow at the nanoscale.

preprint2022arXiv

Kernelized Similarity Learning and Embedding for Dynamic Texture Synthesis

Dynamic texture (DT) exhibits statistical stationarity in the spatial domain and stochastic repetitiveness in the temporal dimension, indicating that different frames of DT possess a high similarity correlation that is critical prior knowledge. However, existing methods cannot effectively learn a promising synthesis model for high-dimensional DT from a small number of training data. In this paper, we propose a novel DT synthesis method, which makes full use of similarity prior knowledge to address this issue. Our method bases on the proposed kernel similarity embedding, which not only can mitigate the high-dimensionality and small sample issues, but also has the advantage of modeling nonlinear feature relationship. Specifically, we first raise two hypotheses that are essential for DT model to generate new frames using similarity correlation. Then, we integrate kernel learning and extreme learning machine into a unified synthesis model to learn kernel similarity embedding for representing DT. Extensive experiments on DT videos collected from the internet and two benchmark datasets, i.e., Gatech Graphcut Textures and Dyntex, demonstrate that the learned kernel similarity embedding can effectively exhibit the discriminative representation for DT. Accordingly, our method is capable of preserving the long-term temporal continuity of the synthesized DT sequences with excellent sustainability and generalization. Meanwhile, it effectively generates realistic DT videos with fast speed and low computation, compared with the state-of-the-art methods. The code and more synthesis videos are available at our project page https://shiming-chen.github.io/Similarity-page/Similarit.html.

preprint2022arXiv

LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment

Table structure recognition is a challenging task due to the various structures and complicated cell spanning relations. Previous methods handled the problem starting from elements in different granularities (rows/columns, text regions), which somehow fell into the issues like lossy heuristic rules or neglect of empty cell division. Based on table structure characteristics, we find that obtaining the aligned bounding boxes of text region can effectively maintain the entire relevant range of different cells. However, the aligned bounding boxes are hard to be accurately predicted due to the visual ambiguities. In this paper, we aim to obtain more reliable aligned bounding boxes by fully utilizing the visual information from both text regions in proposed local features and cell relations in global features. Specifically, we propose the framework of Local and Global Pyramid Mask Alignment, which adopts the soft pyramid mask learning mechanism in both the local and global feature maps. It allows the predicted boundaries of bounding boxes to break through the limitation of original proposals. A pyramid mask re-scoring module is then integrated to compromise the local and global information and refine the predicted boundaries. Finally, we propose a robust table structure recovery pipeline to obtain the final structure, in which we also effectively solve the problems of empty cells locating and division. Experimental results show that the proposed method achieves competitive and even new state-of-the-art performance on several public benchmarks.

preprint2022arXiv

Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

Active speaker detection and speech enhancement have become two increasingly attractive topics in audio-visual scenario understanding. According to their respective characteristics, the scheme of independently designed architecture has been widely used in correspondence to each single task. This may lead to the representation learned by the model being task-specific, and inevitably result in the lack of generalization ability of the feature based on multi-modal modeling. More recent studies have shown that establishing cross-modal relationship between auditory and visual stream is a promising solution for the challenge of audio-visual multi-task learning. Therefore, as a motivation to bridge the multi-modal associations in audio-visual tasks, a unified framework is proposed to achieve target speaker detection and speech enhancement with joint learning of audio-visual modeling in this study.

preprint2022arXiv

Low Radioactive Material Screening and Background Control for the PandaX-4T Experiment

PandaX-4T is a ton-scale dark matter direct detection experiment using a dual-phase TPC technique at the China Jinping Underground Laboratory. Various ultra-low background technologies have been developed and applied to material screening for PandaX-4T, including HPGe gamma spectroscopy, ICP-MS, NAA, radon emanation measurement system, krypton assay station, and alpha detection system. Low background materials were selected to assemble the detector. Surface treatment procedures were investigated to further suppress radioactive background. Combining measured results and Monte Carlo simulation, the total material background rates of PandaX-4T in the energy region of 1-25 keV$\rm{}_{ee}$ are estimated to be (9.9 $\pm$ 1.9) $\times \ 10^{-3}$ mDRU for electron recoil and (2.8 $\pm$ 0.6) $\times \ 10^{-4}$ mDRU for nuclear recoil. In addition, $^{nat}$Kr in the detector is estimated to be <8 ppt.

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

Measurement of carbon finance level and exploration of its influencing factors

Faced with increasingly severe environmental problems, carbon trading markets and related financial activities aiming at limiting carbon dioxide emissions are booming. Considering the complexity and urgency of carbon market, it is necessary to construct an effective evaluation index system. This paper selected carbon finance index as a composite indicator. Taking Beijing, Shanghai, and Guangdong as examples, we adopted the classic method of multiple criteria decision analysis (MCDA) to analyze the composite indicator. Potential impact factors were screened extensively and calculated through normalization, weighting by coefficient of variation and different aggregation methods. Under the measurement of Shannon-Spearman Measure, the method with the least loss of information was used to obtain the carbon finance index (CFI) of the pilot areas. Through panel model analysis, we found that company size, the number of patents per 10,000 people and the proportion of new energy generation were the factors with significant influence. Based on the research, corresponding suggestions were put forward for different market entities. Hopefully, this research will contribute to the steady development of the national carbon market.

preprint2022arXiv

Neutron-induced nuclear recoil background in the PandaX-4T experiment

Neutron-induced nuclear recoil background is critical to the dark matter searches in the PandaX-4T liquid xenon experiment. This paper studies the feature of neutron background in liquid xenon and evaluates their contribution in the single scattering nuclear recoil events through three methods. The first method is fully Monte Carlo simulation based. The last two are data-driven methods that also use the multiple scattering signals and high energy signals in the data, respectively. In the PandaX-4T commissioning data with an exposure of 0.63 tonne-year, all these methods give a consistent result that there are $1.15\pm0.57$ neutron-induced background in dark matter signal region within an approximated nuclear recoil energy window between 5 and 100 keV.

preprint2022arXiv

Personalized Image Aesthetics Assessment with Rich Attributes

Personalized image aesthetics assessment (PIAA) is challenging due to its highly subjective nature. People&#39;s aesthetic tastes depend on diversified factors, including image characteristics and subject characters. The existing PIAA databases are limited in terms of annotation diversity, especially the subject aspect, which can no longer meet the increasing demands of PIAA research. To solve the dilemma, we conduct so far, the most comprehensive subjective study of personalized image aesthetics and introduce a new Personalized image Aesthetics database with Rich Attributes (PARA), which consists of 31,220 images with annotations by 438 subjects. PARA features wealthy annotations, including 9 image-oriented objective attributes and 4 human-oriented subjective attributes. In addition, desensitized subject information, such as personality traits, is also provided to support study of PIAA and user portraits. A comprehensive analysis of the annotation data is provided and statistic study indicates that the aesthetic preferences can be mirrored by proposed subjective attributes. We also propose a conditional PIAA model by utilizing subject information as conditional prior. Experimental results indicate that the conditional PIAA model can outperform the control group, which is also the first attempt to demonstrate how image aesthetics and subject characters interact to produce the intricate personalized tastes on image aesthetics. We believe the database and the associated analysis would be useful for conducting next-generation PIAA study. The project page of PARA can be found at: https://cv-datasets.institutecv.com/#/data-sets.

preprint2022arXiv

Phase Optimization for Massive IRS-aided Two-way Relay Network

In this paper, with the help of an intelligent reflecting surface (IRS), the source (S) and destination (D) exchange information through the two-way decode-and-forward relay (TW-DFR). We mainly focus on the phase optimization of IRS to improve the system rate performance. Firstly, a maximizing receive power sum (Max-RPS) method is proposed via eigenvalue decomposition (EVD) with an appreciable rate enhancement, which is called Max-RPS-EVD. To further achieve a higher rate, a method of maximizing minimum rate (Max-Min-R) is proposed with high complexity. To reduce its complexity, a low-complexity method of maximizing the sum rate (Max-SR) via general power iterative (GPI) is proposed, which is called Max-SR-GPI. Simulation results show that the proposed three methods outperform the case of random phase method, especially the proposed Max-SR-GPI method is the best one achieving at least 20\% rate gain over random phase. Additionally, it is also proved the optimal rate can be achieved when TW-DFR and IRS are located in the middle of S and D.

preprint2022arXiv

Power Allocation for IRS-aided Two-way Decode-and-Forward Relay Wireless Network

In this paper, an intelligent reflecting surface (IRS)-aided two-way decode-and-forward (DF) relay wireless network is considered, where two users exchange information via IRS and DF relay. To enhance the sum rate performance, three power allocation (PA) strategies are proposed. Firstly, a method of maximizing sum rate (Max-SR) is proposed to jointly optimize the PA factors of user U1, user U2 and relay station (RS). To further improve the sum rate performance, two high-performance schemes, namely maximizing minimum sum rate (Max-Min-SR) and maximizing sum rate with rate constraint (Max-SR-RC), are presented. Simulation results show that the proposed three methods outperform the equal power allocation (EPA) method in terms of sum rate performance. In particular, the highest performance gain achieved by Max-SR-RC method is up to 45.2% over EPA. Furthermore, it is verified that the total power and random shadow variable Xσ have a substantial impact on the sum rate performance.

preprint2022arXiv

Quasi-periodic oscillations of the X-ray burst from the magnetar SGR J1935+2154 and associated with the fast radio burst FRB 200428

The origin(s) and mechanism(s) of fast radio bursts (FRBs), which are short radio pulses from cosmological distances, have remained a major puzzle since their discovery. We report a strong Quasi-Periodic Oscillation(QPO) of 40 Hz in the X-ray burst from the magnetar SGR J1935+2154 and associated with FRB 200428, significantly detected with the Hard X-ray Modulation Telescope (Insight-HXMT) and also hinted by the Konus-Wind data. QPOs from magnetar bursts have only been rarely detected; our 3.4 sigma (p-value is 2.9e-4) detection of the QPO reported here reveals the strongest QPO signal observed from magnetars (except in some very rare giant flares), making this X-ray burst unique among magnetar bursts. The two X-ray spikes coinciding with the two FRB pulses are also among the peaks of the QPO. Our results suggest that at least some FRBs are related to strong oscillation processes of neutron stars. We also show that we may overestimate the significance of the QPO signal and underestimate the errors of QPO parameters if QPO exists only in a fraction of the time series of a X-ray burst which we use to calculate the Leahy-normalized periodogram.

preprint2022arXiv

Rapid Phase Ambiguity Elimination Methods for DOA Estimator via Hybrid Massive MIMO Receive Array

For a sub-connected hybrid multiple-input multiple-output (MIMO) receiver with $K$ subarrays and $N$ antennas, there exists a challenging problem of how to rapidly remove phase ambiguity in only single time-slot. First, a DOA estimator of maximizing received power (Max-RP) is proposed to find the maximum value of $K$-subarray output powers, where each subarray is in charge of one sector, and the center angle of the sector corresponding to the maximum output is the estimated true DOA. To make an enhancement on precision, Max-RP plus quadratic interpolation (Max-RP-QI) method is designed. In the proposed Max-RP-QI, a quadratic interpolation scheme is adopted to interpolate the three DOA values corresponding to the largest three receive powers of Max-RP. Finally, to achieve the CRLB, a Root-MUSIC plus Max-RP-QI scheme is developed. Simulation results show that the proposed three methods eliminate the phase ambiguity during one time-slot and also show low-computational-complexities. In particular, the proposed Root-MUSIC plus Max-RP-QI scheme can reach the CRLB, and the proposed Max-RP and Max-RP-QI are still some performance losses $2dB\thicksim4dB$ compared to the CRLB.

preprint2022arXiv

Resilience in Industrial Internet of Things Systems: A Communication Perspective

Industrial Internet of Things is an ultra-large-scale system that is much more sophisticated and fragile than conventional industrial platforms. The effective management of such a system relies heavily on the resilience of the network, especially the communication part. Imperative as resilient communication is, there is not enough attention from literature and a standardized framework is still missing. In awareness of these, this paper intends to provide a systematic overview of resilience in IIoT with a communication perspective, aiming to answer the questions of why we need it, what it is, how to enhance it, and where it can be applied. Specifically, we emphasize the urgency of resilience studies via examining existing literature and analyzing malfunction data from a real satellite communication system. Resilience-related concepts and metrics, together with standardization efforts are then summarized and discussed, presenting a basic framework for analyzing the resilience of the system before, during, and after disruptive events. On the basis of the framework, key resilience concerns associated with the design, deployment, and operation of IIoT are briefly described to shed light on the methods for resilience enhancement. Promising resilient applications in different IIoT sectors are also introduced to highlight the opportunities and challenges in practical implementations.

preprint2022arXiv

Signal Shaping for Semantic Communication Systems with A Few Message Candidates

Semantic communications target to reliably convey the semantic meaning of messages. It is different from existing communication systems focusing on reliable bit transmission. To achieve the goal of semantic communications, we propose a signal shaping method by minimizing the semantic loss, which is measured by the pretrained bidirectional encoder representation from transformers (BERT) model. The signal set optimization problem for semantic communication systems with a few message candidates is investigated. We propose an efficient projected gradient descent method to solve the problem and prove its convergence. Simulation results show that the proposed method outperforms existing signal shaping methods in minimizing the semantic loss.

preprint2022arXiv

Solutions of the Schrödinger equation for anisotropic dipole-dipole interaction plus isotropic van der Waals interaction

By generalizing Bo Gao&#39;s approach [Phys. Rev. A 58, 1728 (1998)] for solving the Schrödinger equation for an isotropic van der Waals (vdW) potential to the systems with a multi-scale anisotropic long-range interaction, we derive the solutions for the Schrödinger equation for an anisotropic dipole-dipole interaction plus an isotropic attractive vdW potential, i.e., ${C_d(1-3\cos^2θ)}/{r^3}-{C_6}/{r^6}$, which is projected to the subspace with angular momentum $l\leq l_{\rm cut}$, with $l_{\rm cut}$ being an arbitrary angular-momentum cutoff. Here $θ$ is the polar angle of the coordinate $\boldsymbol{r}$ and $r=|\boldsymbol{r}|$. The asymptotic behaviors of these solutions for $r\rightarrow 0$ and $r\rightarrow \infty$ are obtained. These results can be used in the research of collisions and chemical reactions between ultra-cold polar molecules in a static electric field. Our approach to derive the solutions can be applied to the systems with a general long-range potential $\sum_{λ= 2}^{λ_{\rm max}} {V_λ(θ,φ)}/{r^λ}$, with $φ$ being the azimuthal angle of $\boldsymbol{r}$, and thus can be used in various problems on molecule-molecule interaction.

preprint2022arXiv

Stock Trading Optimization through Model-based Reinforcement Learning with Resistance Support Relative Strength

Reinforcement learning (RL) is gaining attention by more and more researchers in quantitative finance as the agent-environment interaction framework is aligned with decision making process in many business problems. Most of the current financial applications using RL algorithms are based on model-free method, which still faces stability and adaptivity challenges. As lots of cutting-edge model-based reinforcement learning (MBRL) algorithms mature in applications such as video games or robotics, we design a new approach that leverages resistance and support (RS) level as regularization terms for action in MBRL, to improve the algorithm&#39;s efficiency and stability. From the experiment results, we can see RS level, as a market timing technique, enhances the performance of pure MBRL models in terms of various measurements and obtains better profit gain with less riskiness. Besides, our proposed method even resists big drop (less maximum drawdown) during COVID-19 pandemic period when the financial market got unpredictable crisis. Explanations on why control of resistance and support level can boost MBRL is also investigated through numerical experiments, such as loss of actor-critic network and prediction error of the transition dynamical model. It shows that RS indicators indeed help the MBRL algorithms to converge faster at early stage and obtain smaller critic loss as training episodes increase.

preprint2022arXiv

Subgraph Neighboring Relations Infomax for Inductive Link Prediction on Knowledge Graphs

Inductive link prediction for knowledge graph aims at predicting missing links between unseen entities, those not shown in training stage. Most previous works learn entity-specific embeddings of entities, which cannot handle unseen entities. Recent several methods utilize enclosing subgraph to obtain inductive ability. However, all these works only consider the enclosing part of subgraph without complete neighboring relations, which leads to the issue that partial neighboring relations are neglected, and sparse subgraphs are hard to be handled. To address that, we propose Subgraph Neighboring Relations Infomax, SNRI, which sufficiently exploits complete neighboring relations from two aspects: neighboring relational feature for node feature and neighboring relational path for sparse subgraph. To further model neighboring relations in a global way, we innovatively apply mutual information (MI) maximization for knowledge graph. Experiments show that SNRI outperforms existing state-of-art methods by a large margin on inductive link prediction task, and verify the effectiveness of exploring complete neighboring relations in a global way to characterize node features and reason on sparse subgraphs.

preprint2022arXiv

Test suite effectiveness metric evaluation: what do we know and what should we do?

Comparing test suite effectiveness metrics has always been a research hotspot. However, prior studies have different conclusions or even contradict each other for comparing different test suite effectiveness metrics. The problem we found most troubling to our community is that researchers tend to oversimplify the description of the ground truth they use. For example, a common expression is that &#34;we studied the correlation between real faults and the metric to evaluate (MTE)&#34;. However, the meaning of &#34;real faults&#34; is not clear-cut. As a result, there is a need to scrutinize the meaning of &#34;real faults&#34;. Without this, it will be half-knowledgeable with the conclusions. To tackle this challenge, we propose a framework ASSENT (evAluating teSt Suite EffectiveNess meTrics) to guide the follow-up research. In nature, ASSENT consists of three fundamental components: ground truth, benchmark test suites, and agreement indicator. First, materialize the ground truth for determining the real order in effectiveness among test suites. Second, generate a set of benchmark test suites and derive their ground truth order in effectiveness. Third, for the benchmark test suites, generate the MTE order in effectiveness by the metric to evaluate (MTE). Finally, calculate the agreement indicator between the two orders. Under ASSENT, we are able to compare the accuracy of different test suite effectiveness metrics. We apply ASSENT to evaluate representative test suite effectiveness metrics, including mutation score metrics and code coverage metrics. Our results show that, based on the real faults, mutation score and subsuming mutation score are the best metrics to quantify test suite effectiveness. Meanwhile, by using mutants instead of real faults, MTEs will be overestimated by more than 20% in values.

preprint2022arXiv

Text/Speech-Driven Full-Body Animation

Due to the increasing demand in films and games, synthesizing 3D avatar animation has attracted much attention recently. In this work, we present a production-ready text/speech-driven full-body animation synthesis system. Given the text and corresponding speech, our system synthesizes face and body animations simultaneously, which are then skinned and rendered to obtain a video stream output. We adopt a learning-based approach for synthesizing facial animation and a graph-based approach to animate the body, which generates high-quality avatar animation efficiently and robustly. Our results demonstrate the generated avatar animations are realistic, diverse and highly text/speech-correlated.

preprint2022arXiv

Three-dimensional Propagation of the Global EUV Wave associated with a solar eruption on 2021 October 28

We present a case study for the global extreme ultraviolet (EUV) wave and its chromospheric counterpart `Moreton-Ramsey wave&#39; associated with the second X-class flare in Solar Cycle 25 and a halo coronal mass ejection (CME). The EUV wave was observed in the H$α$ and EUV passbands with different characteristic temperatures. In the 171 Å and 193/195 Å images, the wave propagates circularly with an initial velocity of 600-720 km s$^{-1}$ and a deceleration of 110-320 m s$^{-2}$. The local coronal plasma is heated from log(T/K)=5.9 to log(T/K)=6.2 during the passage of the wavefront. The H$α$ and 304 Å images also reveal signatures of wave propagation with a velocity of 310-540 km s$^{-1}$. With multi-wavelength and dual-perspective observations, we found that the wavefront likely propagates forwardly inclined to the solar surface with a tilt angle of ~53.2$^{\circ}$. Our results suggest that this EUV wave is a fast-mode magnetohydrodynamic wave or shock driven by the expansion of the associated CME, whose wavefront is likely a dome-shaped structure that could impact the upper chromosphere, transition region and corona.

preprint2022arXiv

TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents

Recently, automatically extracting information from visually rich documents (e.g., tickets and resumes) has become a hot and vital research topic due to its widespread commercial value. Most existing methods divide this task into two subparts: the text reading part for obtaining the plain text from the original document images and the information extraction part for extracting key contents. These methods mainly focus on improving the second, while neglecting that the two parts are highly correlated. This paper proposes a unified end-to-end information extraction framework from visually rich documents, where text reading and information extraction can reinforce each other via a well-designed multi-modal context block. Specifically, the text reading part provides multi-modal features like visual, textual and layout features. The multi-modal context block is developed to fuse the generated multi-modal features and even the prior knowledge from the pre-trained language model for better semantic representation. The information extraction part is responsible for generating key contents with the fused context features. The framework can be trained in an end-to-end trainable manner, achieving global optimization. What is more, we define and group visually rich documents into four categories across two dimensions, the layout and text type. For each document category, we provide or recommend the corresponding benchmarks, experimental settings and strong baselines for remedying the problem that this research area lacks the uniform evaluation standard. Extensive experiments on four kinds of benchmarks (from fixed layout to variable layout, from full-structured text to semi-unstructured text) are reported, demonstrating the proposed method&#39;s effectiveness. Data, source code and models are available.

preprint2022arXiv

Two-Commodity Flow is Equivalent to Linear Programming under Nearly-Linear Time Reductions

We give a nearly-linear time reduction that encodes any linear program as a 2-commodity flow problem with only a small blow-up in size. Under mild assumptions similar to those employed by modern fast solvers for linear programs, our reduction causes only a polylogarithmic multiplicative increase in the size of the program and runs in nearly-linear time. Our reduction applies to high-accuracy approximation algorithms and exact algorithms. Given an approximate solution to the 2-commodity flow problem, we can extract a solution to the linear program in linear time with only a polynomial factor increase in the error. This implies that any algorithm that solves the 2-commodity flow problem can solve linear programs in essentially the same time. Given a directed graph with edge capacities and two source-sink pairs, the goal of the 2-commodity flow problem is to maximize the sum of the flows routed between the two source-sink pairs subject to edge capacities and flow conservation. A 2-commodity flow can be directly written as a linear program, and thus we establish a nearly-tight equivalence between these two classes of problems. Our proof follows the outline of Itai&#39;s polynomial-time reduction of a linear program to a 2-commodity flow problem (JACM&#39;78). Itai&#39;s reduction shows that exactly solving 2-commodity flow and exactly solving linear programming are polynomial-time equivalent. We improve Itai&#39;s reduction to nearly preserve the problem representation size in each step. In addition, we establish an error bound for approximately solving each intermediate problem in the reduction, and show that the accumulated error is polynomially bounded. We remark that our reduction does not run in strongly polynomial time and that it is open whether 2-commodity flow and linear programming are equivalent in strongly polynomial time.

preprint2022arXiv

Universal energy-dependent pseudopotential for the two-body problem of confined ultracold atoms

The two-body scattering amplitude and energy spectrum of confined ultracold atoms are of fundamental importance for studies of ultracold atom physics. For many systems, one can efficiently calculate these quantities via the zero-range Huang-Yang pseudopotential (HYP), in which the interatomic interaction is characterized by the scattering length $a$. Furthermore, when the scattering length is dependent on the kinetic energy $\varepsilon_{\rm r}$ of two-atom relative motion, the results are applicable for a broad energy region. However, when the free Hamiltonian of atomic internal state does not commute with the inter-atomic interaction, or the center-of-mass (c.m.) motion is coupled to the relative motion, the generalization of this technique is still lacking. We solve this problem and construct a reasonable energy-dependent multi-channel HYP, which is characterized by a &#34;scattering length operator&#34; ${\hat a}_{\rm eff}$. Here ${\hat a}_{\rm eff}$ is an operator for atomic internal states and c.m. motion, and depends on both the total two-atom energy and the external field as well as the trapping parameter. The effects from the internal-state or c.m.-relative motion coupling can be self-consistently taken into account by ${\hat a}_{\rm eff}$. We further show a method based on the quantum defect theory, with which ${\hat a}_{\rm eff}$ can be analytically derived for systems with van der Waals inter-atomic interaction. To demonstrate our method, we calculate the spectrum of two ultracold fermionic alkaline-earth-like atoms confined in an optical lattice. By comparing our results with the recent experimental measurements for two Yb173 atoms and two Yb171 atoms, we calibrate the scattering lengths $a_{\pm}$ with respect to anti-symmetric and symmetric nuclear-spin states to be $a_{+}=2012(19)a_{0}$ and $a_{-}=193(4)a_{0}$ for Yb173, and $a_{+}=232(3)a_{0}$ and $a_{-}=372(1)a_{0}$ for Yb171.

preprint2021arXiv

A Feature Fusion-Net Using Deep Spatial Context Encoder and Nonstationary Joint Statistical Model for High Resolution SAR Image Classification

Convolutional neural networks (CNNs) have been applied to learn spatial features for high-resolution (HR) synthetic aperture radar (SAR) image classification. However, there has been little work on integrating the unique statistical distributions of SAR images which can reveal physical properties of terrain objects, into CNNs in a supervised feature learning framework. To address this problem, a novel end-to-end supervised classification method is proposed for HR SAR images by considering both spatial context and statistical features. First, to extract more effective spatial features from SAR images, a new deep spatial context encoder network (DSCEN) is proposed, which is a lightweight structure and can be effectively trained with a small number of samples. Meanwhile, to enhance the diversity of statistics, the nonstationary joint statistical model (NS-JSM) is adopted to form the global statistical features. Specifically, SAR images are transformed into the Gabor wavelet domain and the produced multi-subbands magnitudes and phases are modeled by the log-normal and uniform distribution. The covariance matrix is further utilized to capture the inter-scale and intra-scale nonstationary correlation between the statistical subbands and make the joint statistical features more compact and distinguishable. Considering complementary advantages, a feature fusion network (Fusion-Net) base on group compression and smooth normalization is constructed to embed the statistical features into the spatial features and optimize the fusion feature representation. As a result, our model can learn the discriminative features and improve the final classification performance. Experiments on four HR SAR images validate the superiority of the proposed method over other related algorithms.

preprint2021arXiv

A review of artificial intelligence methods combined with Raman spectroscopy to identify the composition of substances

In general, most of the substances in nature exist in mixtures, and the noninvasive identification of mixture composition with high speed and accuracy remains a difficult task. However, the development of Raman spectroscopy, machine learning, and deep learning techniques have paved the way for achieving efficient analytical tools capable of identifying mixture components, making an apparent breakthrough in the identification of mixtures beyond the traditional chemical analysis methods. This article summarizes the work of Raman spectroscopy in identifying the composition of substances as well as provides detailed reviews on the preprocessing process of Raman spectroscopy, the analysis methods and applications of artificial intelligence. This review summarizes the work of Raman spectroscopy in identifying the composition of substances and reviews the preprocessing process of Raman spectroscopy, the analysis methods and applications of artificial intelligence. Finally, the advantages and disadvantages and development prospects of Raman spectroscopy are discussed in detail.

preprint2021arXiv

An Intelligent Self-driving Truck System For Highway Transportation

Recently, there have been many advances in autonomous driving society, attracting a lot of attention from academia and industry. However, existing works mainly focus on cars, extra development is still required for self-driving truck algorithms and models. In this paper, we introduce an intelligent self-driving truck system. Our presented system consists of three main components, 1) a realistic traffic simulation module for generating realistic traffic flow in testing scenarios, 2) a high-fidelity truck model which is designed and evaluated for mimicking real truck response in real-world deployment, 3) an intelligent planning module with learning-based decision making algorithm and multi-mode trajectory planner, taking into account the truck&#39;s constraints, road slope changes, and the surrounding traffic flow. We provide quantitative evaluations for each component individually to demonstrate the fidelity and performance of each part. We also deploy our proposed system on a real truck and conduct real world experiments which shows our system&#39;s capacity of mitigating sim-to-real gap. Our code is available at https://github.com/InceptioResearch/IITS

preprint2021arXiv

D2A U-Net: Automatic Segmentation of COVID-19 Lesions from CT Slices with Dilated Convolution and Dual Attention Mechanism

Coronavirus Disease 2019 (COVID-19) has caused great casualties and becomes almost the most urgent public health events worldwide. Computed tomography (CT) is a significant screening tool for COVID-19 infection, and automated segmentation of lung infection in COVID-19 CT images will greatly assist diagnosis and health care of patients. However, accurate and automatic segmentation of COVID-19 lung infections remains to be challenging. In this paper we propose a dilated dual attention U-Net (D2A U-Net) for COVID-19 lesion segmentation in CT slices based on dilated convolution and a novel dual attention mechanism to address the issues above. We introduce a dilated convolution module in model decoder to achieve large receptive field, which refines decoding process and contributes to segmentation accuracy. Also, we present a dual attention mechanism composed of two attention modules which are inserted to skip connection and model decoder respectively. The dual attention mechanism is utilized to refine feature maps and reduce semantic gap between different levels of the model. The proposed method has been evaluated on open-source dataset and outperforms cutting edges methods in semantic segmentation. Our proposed D2A U-Net with pretrained encoder achieves a Dice score of 0.7298 and recall score of 0.7071. Besides, we also build a simplified D2A U-Net without pretrained encoder to provide a fair comparison with other models trained from scratch, which still outperforms popular U-Net family models with a Dice score of 0.7047 and recall score of 0.6626. Our experiment results have shown that by introducing dilated convolution and dual attention mechanism, the number of false positives is significantly reduced, which improves sensitivity to COVID-19 lesions and subsequently brings significant increase to Dice score.

preprint2021arXiv

Dark Matter Search Results from the PandaX-4T Commissioning Run

We report the first dark matter search results using the commissioning data from PandaX-4T. Using a time projection chamber with 3.7-tonne of liquid xenon target and an exposure of 0.63 tonne$\cdot$year, 1058 candidate events are identified within an approximate nuclear recoil energy window between 5 and 100 keV. No significant excess over background is observed. Our data set a stringent limit to the dark matter-nucleon spin-independent interactions, with a lowest excluded cross section (90% C.L.) of $3.8\times10^{-47} $cm$^2$ at a dark matter mass of 30 GeV/$c^2$.

preprint2021arXiv

Data Engagement Reconsidered: A Study of Automatic Stress Tracking Technology in Use

In today&#39;s fast-paced world, stress has become a growing health concern. While more automatic stress tracking technologies have recently become available on wearable or mobile devices, there is still a limited understanding of how they are actually used in everyday life. This paper presents an empirical study of automatic stress-tracking technologies in use in China, based on semi-structured interviews with 17 users. The study highlights three challenges of stress-tracking data engagement that prevent effective technology usage: the lack of immediate awareness, the lack of pre-required knowledge, and the lack of corresponding communal support. Drawing on the stress-tracking practices uncovered in the study, we bring these issues to the fore, and unpack assumptions embedded in related works on self-tracking and how data engagement is approached. We end by calling for a reconsideration of data engagement as part of self-tracking practices with technologies rather than simply looking at the user interface.

preprint2021arXiv

Distributed and Asynchronous Operational Optimization of Networked Microgrids

Smart programmable microgrids (SPM) is an emerging technology for making microgrids more software-defined and less hardware-independent such that converting distributed energy resources (DERs) to networked community microgrids becomes affordable, autonomic, and secure. As one of the cornerstones of SPM, this paper pioneers a concept of software-defined operation optimization for networked microgrids, where operation objectives, grid connection, and DER participation will be defined by software and plug-and-play, and can be quickly reconfigured, based on the development of modularized and tightened models and a novel asynchronous price-based decomposition-and-coordination method. Key contributions include: (1) design the architecture of the operational optimization of networked microgrids which can be readily implemented to ensure the programmability of islanded microgrids in solving the distributed optimization models, (2) realize a novel discrete model of droop controller, and (3) introduce a powerful distributed and asynchronous method Distributed and Asynchronous Surrogate Lagrangian Relaxation (DA-SLR) to efficiently coordinate microgrids asynchronously. Two case studies are tested to demonstrate the efficiency of developed DA-SLR, and specifically, the testing results show the superiority of DA-SLR as compared to previous methods such as ADMM.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2021arXiv

Light yield and field dependence measurement in PandaX-II dual-phase xenon detector

The dual-phase xenon time projection chamber (TPC) is one of the most sensitive detector technology for dark matter direct search, where the energy deposition of incoming particle can be converted into photons and electrons through xenon excitation and ionization. The detector response to signal energy deposition varies significantly with the electric field in liquid xenon. We study the detector&#39;s light yield and its dependence on the electric field in the PandaX-II dual-phase detector containing 580~kg liquid xenon in the sensitive volume. From our measurements, the light yield at electric fields from 0~V/cm to 317~V/cm is obtained for energy depositions up to 236~keV.

preprint2021arXiv

Material design with the van der Waals stacking of bismuth-halide chains realizing a higher-order topological insulator

The van der Waals (vdW) materials with low dimensions have been extensively studied as a platform to generate exotic quantum properties. Advancing this view, a great deal of attention is currently paid to topological quantum materials with vdW structures. Here, we provide a new concept of designing topological materials by the vdW stacking of quantum spin Hall insulators (QSHIs). Most interestingly, a slight shift of inversion center in the unit cell caused by a modification of stacking is found to induce the topological variation from a trivial insulator to a higher-order topological insulator (HOTI). Based on that, we present the first experimental realization of a HOTI by investigating a bismuth bromide Bi4Br4 with angle-resolved photoemission spectroscopy (ARPES). The unique feature in bismuth halides capable of selecting various topology only by differently stacking chains, combined with the great advantage of the vdW structure, offers a fascinating playground for engineering topologically non-trivial edge-states toward future spintronics applications.

preprint2021arXiv

Mesoscopic Transport of Quantum Anomalous Hall Effect in Sub-Micron Size Regime

The quantum anomalous Hall (QAH) effect has been demonstrated in two-dimensional topological insulator systems incorporated with ferromagnetism. However, a comprehensive understanding of mesoscopic transport in sub-micron QAH devices has yet been established. Here we fabricated miniaturized QAH devices with channel widths down to 600 nm, where the QAH features are still preserved. A back-scattering channel is formed in narrow QAH devices through percolative hopping between 2D compressible puddles. Large resistance fluctuations are observed in narrow devices near the coercive field, which is associated with collective interference between intersecting paths along domain walls when the device geometry is smaller than the phase coherence length $L_ϕ$. Through measurement of size-dependent breakdown current, we confirmed that the chiral edge states are confined at the physical boundary with its width on the order of Fermi wavelength.

preprint2021arXiv

Mutant reduction evaluation: what is there and what is missing?

Background. Many mutation reduction strategies, which aim to reduce the number of mutants, have been proposed. Problem. It is important to measure the ability of a mutation reduction strategy to maintain test suite effectiveness evaluation. However, existing evaluation indicators are unable to measure the &#34;order-preserving ability&#34;. Objective. We aim to propose evaluation indicators to measure the &#34;order-preserving ability&#34; of a mutation reduction strategy, which is important but missing in our community. Method. Given a test suite on a Software Under Test (SUT) with a set of original mutants, we leverage the test suite to generate a group of test suites that have a partial order relationship in fault detecting potential. When evaluating a reduction strategy, we first construct two partial order relationships among the generated test suites in terms of mutation score, one with the original mutants and another with the reduced mutants. Then, we measure the extent to which the two partial order relationships are consistent. The more consistent the two partial order relationships are, the stronger the Order Preservation (OP) of the mutation reduction strategy is, and the more effective the reduction strategy is. Furthermore, we propose Effort-aware Relative Order Preservation (EROP) to measure how much gain a mutation reduction strategy can provide compared with a random reduction strategy. Result. The experimental results show that OP and EROP are able to efficiently measure the &#34;order-preserving ability&#34; of a mutation reduction strategy. As a result, they have a better ability to distinguish various mutation reduction strategies compared with the existing evaluation indicators. Conclusion. We suggest, for the researchers, that OP and EROP should be used to measure the effectiveness of a mutant reduction strategy.

preprint2021arXiv

Neuro-Reachability of Networked Microgrids

A neural ordinary differential equations network (ODE-Net)-enabled reachability method (Neuro-Reachability) is devised for the dynamic verification of networked microgrids (NMs) with unidentified subsystems and heterogeneous uncertainties. Three new contributions are presented: 1) An ODENet-enabled dynamic model discovery approach is devised to construct the data-driven state-space model which preserves the nonlinear and differential structure of the NMs system; 2) A physics-data-integrated (PDI) NMs model is established, which empowers various NM analytics; and 3) A conformance-empowered reachability analysis is developed to enhance the reliability of the PDI-driven dynamic verification. Extensive case studies demonstrate the efficacy of the ODE-Net-enabled method in microgrid dynamic model discovery, and the effectiveness of the Neuro-Reachability approach in verifying the NMs dynamics under multiple uncertainties and various operational scenarios.

preprint2021arXiv

Selective observation of surface and bulk bands in polar WTe2 by laser-based spin- and angle-resolved photoemission spectroscopy

The electronic state of WTe2, a candidate of type-II Weyl semimetal, is investigated by using laser-based spin- and angle-resolved photoemission spectroscopy (SARPES). We prepare the pair of WTe2 samples, one with (001) surface and the other with (00-1) surface, by &#34;sandwich method&#34;, and measure the band structures of each surface separately. The Fermi arcs are observed on both surfaces. We identify that the Fermi arcs on the two surfaces are both originating from surface states. We further find a surface resonance band, which connects with the Fermi-arc band, forming a Dirac-cone-like band dispersion. Our results indicate that the bulk electron and hole bands are much closer in momentum space than band calculations.

preprint2021arXiv

WIMP Dark Matter Hidden behind its Companion

The WIMP dark matter (DM) hypothesis now is in an awkward position, owing to the stronger and stronger exclusion from DM direct detection. In this article we design a mechanism to evade this constraint.The idea is simple. DM has a companion, and they are both charged under the DM protecting symmetry G; they admit the trilinear coupling DM-DM-companion, so the latter provides a portal to the standard model (SM) via, for instance, the coupling to Higgs doublet.Then, DM semi-annihilates into the companion to arrive correct relic density, without leaving DM-nucleon scattering signal. The idea can be realized for ZN symmetric models with N >2.We stress that this mechanism has the characteristics of co-annihilation, and as a matter of fact its effect becomes necessary near or above the TeV region. This means that it may be difficult to detect our dark matter directly or indirectly.

preprint2020arXiv

Analytical solution for the spectrum of two ultracold atoms in a completely anisotropic confinement

We study the system of two ultracold atoms in a three-dimensional (3D) or two-dimensional (2D) completely anisotropic harmonic trap. We derive the algebraic equation J_{3D}(E) = 1/a_{3D} (J_{2D}(E) = ln a_{2D}) for the eigen-energy E of this system in the 3D (2D) case, with a_{3D} and a_{2D} being the corresponding s-wave scattering lengths, and provide the analytical expressions of the functions J_{3D}(E) and J_{2D}(E). In previous researches this type of equation was obtained for spherically or axially symmetric harmonic traps (T. Busch, et. al., Found. Phys. 28, 549 (1998); Z. Idziaszek and T. Calarco, Phys. Rev. A 74, 022712 (2006)). However, for our cases with a completely anisotropic trap, only the equation for the ground-state energy of some cases has been derived (J. Liang and C. Zhang, Phys. Scr. 77, 025302 (2008)). Our results in this work are applicable for arbitrary eigen-energy of this system, and can be used for the studies of dynamics and thermal-dynamics of interacting ultracold atoms in this trap, e.g., the calculation of the 2nd virial coefficient or the evolution of two-body wave functions. In addition, our approach for the derivation of the above equations can also be used for other two-body problems of ultracold atoms.

preprint2020arXiv

Approximate Reconstruction of Torsional Potential Energy Surface based on Voronoi Tessellation

Torsional modes within a complex molecule containing various functional groups are often strongly coupled so that the harmonic approximation and one-dimensional torsional treatment are inaccurate to evaluate their partition functions. A family of multi-structural approximation methods have been proposed and applied in recent years to deal with the torsional anharmonicity.However, these methods approximate the exact &#34;almost periodic&#34; potential energy as a summation of local periodic functions with symmetric barrier positions and heights. In the present theoretical study, we illustrated that the approximation is inaccurate when torsional modes present non-uniformly distributed local minima. Thereby, we proposed an improved method to reconstruct approximate potential to replace the periodic potential by using information of the local minima and their Voronoi tessellation. First, we established asymmetric barrier heights by introducing two periodicity parameters and assuming that the exact barrier positions are at the boundaries of Voronoi cells. Second, we used multiplicatively weighted Voronoi tessellation to refine the barrier heights and positions by defining a structure-related distance metric. The proposed method has been tested for a few higher-dimensional cases, all of which show promising improved accuracy.

preprint2020arXiv

Calculation of Feynman loop integration and phase-space integration via auxiliary mass flow

We extend the auxiliary-mass-flow (AMF) method originally developed for Feynman loop integration to calculate integrals involving also phase-space integration. Flow of the auxiliary mass from the boundary ($\infty$) to the physical point ($0^+$) is obtained by numerically solving differential equations with respective to the auxiliary mass. For problems with two or more kinematical invariants, the AMF method can be combined with traditional differential equation method by providing systematical boundary conditions and highly nontrivial self-consistent check. The method is described in detail with a pedagogical example of $e^+e^-\rightarrow γ^* \rightarrow t\bar{t}+X$ at NNLO. We show that the AMF method can systematically and efficiently calculate integrals to high precision.

preprint2020arXiv

Deep Reinforcement Learning (DRL): Another Perspective for Unsupervised Wireless Localization

Location is key to spatialize internet-of-things (IoT) data. However, it is challenging to use low-cost IoT devices for robust unsupervised localization (i.e., localization without training data that have known location labels). Thus, this paper proposes a deep reinforcement learning (DRL) based unsupervised wireless-localization method. The main contributions are as follows. (1) This paper proposes an approach to model a continuous wireless-localization process as a Markov decision process (MDP) and process it within a DRL framework. (2) To alleviate the challenge of obtaining rewards when using unlabeled data (e.g., daily-life crowdsourced data), this paper presents a reward-setting mechanism, which extracts robust landmark data from unlabeled wireless received signal strengths (RSS). (3) To ease requirements for model re-training when using DRL for localization, this paper uses RSS measurements together with agent location to construct DRL inputs. The proposed method was tested by using field testing data from multiple Bluetooth 5 smart ear tags in a pasture. Meanwhile, the experimental verification process reflected the advantages and challenges for using DRL in wireless localization.

preprint2020arXiv

Developing the radium measurement system for the water Cherenkov detector of the Jiangmen Underground Neutrino Observatory

The Jiangmen Underground Neutrino Observatory is proposed to determine neutrino mass hierarchy using a 20~ktonne liquid scintillator detector. Strict radio-purity requirements have been put forward for all the components of the detector. According to the MC simulation results, the radon dissolved in the water Cherenkov detector should be below 200~mBq/m$^3$. Radium, the progenitor of radon, should also be taken seriously into account. In order to measure the radium concentration in water, a radium measurement system, which consists of a radium extraction system, a radon emanation chamber and a radon concentration measurement system, has been developed. In this paper, the updated radon concentration in gas measurement system with a one-day-measurement sensitivity of $\sim$5~mBq/m$^3$, the detail of the development of the radium concentration in water measurement system with a sensitivity of $\sim$23~mBq/m$^3$ as well as the measurement results of Daya Bay water samples will be presented.

preprint2020arXiv

Diffeomorphic Shape Matching by Operator Splitting in 3D Cardiology Imaging

We develop an operator splitting approach to solve diffeomorphic matching problems for sequences of surfaces in three-dimensional space. The goal is to smoothly match, at a very fast rate, finite sequences of observed 3D-snapshots extracted from movies recording the smooth dynamic deformations of &#34;soft&#34; surfaces. We have implemented our algorithms in a proprietary software installed at The Methodist Hospital (Cardiology) to monitor mitral valve strain through computer analysis of noninvasive patients&#39; echocardiographies.

preprint2020arXiv

Enabling Cyberattack-Resilient Load Forecasting through Adversarial Machine Learning

In the face of an increasingly broad cyberattack surface, cyberattack-resilient load forecasting for electric utilities is both more necessary and more challenging than ever. In this paper, we propose an adversarial machine learning (AML) approach, which can respond to a wide range of attack behaviors without detecting outliers. It strikes a balance between enhancing a system&#39;s robustness against cyberattacks and maintaining a reasonable degree of forecasting accuracy when there is no attack. Attack models and configurations for the adversarial training were selected and evaluated to achieve the desired level of performance in a simulation study. The results validate the effectiveness and excellent performance of the proposed method.

preprint2020arXiv

Encoding word order in complex embeddings

Sequential word order is important when processing text. Currently, neural networks (NNs) address this by modeling word position using position embeddings. The problem is that position embeddings capture the position of individual words, but not the ordered relationship (e.g., adjacency or precedence) between individual word positions. We present a novel and principled solution for modeling both the global absolute positions of words and their order relationships. Our solution generalizes word embeddings, previously defined as independent vectors, to continuous word functions over a variable (position). The benefit of continuous functions over variable positions is that word representations shift smoothly with increasing positions. Hence, word representations in different positions can correlate with each other in a continuous function. The general solution of these functions is extended to complex-valued domain due to richer representations. We extend CNN, RNN and Transformer NNs to complex-valued versions to incorporate our complex embedding (we make all code available). Experiments on text classification, machine translation and language modeling show gains over both classical word embeddings and position-enriched word embeddings. To our knowledge, this is the first work in NLP to link imaginary numbers in complex-valued representations to concrete meanings (i.e., word order).

preprint2020arXiv

Enhanced Microgrid Power Flow Incorporating Hierarchical Control

An enhanced microgrid power flow (EMPF) is devised to incorporate hierarchical control effects. The new contributions are threefold: 1) an advanced-hierarchical-control-based Newton approach is established to accurately assess power sharing and voltage regulation effects; 2) a modified Jacobian matrix is derived to incorporate droop control and various secondary control modes; and 3) the secondary adjustment is calculated on top of the droop-control-based power flow results to ensure a robust Newton solution. Case studies validate that EMPF is efficacious and efficient and can serve as a powerful tool for microgrid operation and monitoring, especially for those highly meshed microgrids in urban areas.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO&#39;s features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO&#39;s potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

High Temperature Virial Expansion to Universal Quench Dynamics

High temperature virial expansion is a powerful tool in equilibrium statistical mechanics. In this letter we generalize the high temperature virial expansion approach to treat far-from-equilibrium quench dynamics. As an application of our framework, we study the dynamics of a Bose gas quenched from non-interacting to unitarity, and we compare our theoretical results with unexplained experimental results by the Cambridge group [Eigen et al., Nature 563, 221 (2018)]. We show that, during the quench dynamics, the momentum distribution decreases for low-momentum part with $k<k^*$, and increases for high-momentum part with $k>k^*$, where $k^*$ is a characteristic momentum scale separating the low- and the high-momentum regimes. We determine the universal value of $k^*λ$ that agrees perfectly with the experiment, with $λ$ being the thermal de Broglie wave length. We also find a jump of the half-way relaxation time across $k^*λ$ and the non-monotonic behavior of energy distribution, both of which agree with the experiment. Finally, we address the issue whether the long-time steady state thermalizes or not, and we find that this state does thermalize except for the very high momentum tail with $kλ\gg 1$. Our framework can also be applied to quench dynamics in other systems.

preprint2020arXiv

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

Reinforcement learning agents usually learn from scratch, which requires a large number of interactions with the environment. This is quite different from the learning process of human. When faced with a new task, human naturally have the common sense and use the prior knowledge to derive an initial policy and guide the learning process afterwards. Although the prior knowledge may be not fully applicable to the new task, the learning process is significantly sped up since the initial policy ensures a quick-start of learning and intermediate guidance allows to avoid unnecessary exploration. Taking this inspiration, we propose knowledge guided policy network (KoGuN), a novel framework that combines human prior suboptimal knowledge with reinforcement learning. Our framework consists of a fuzzy rule controller to represent human knowledge and a refine module to fine-tune suboptimal prior knowledge. The proposed framework is end-to-end and can be combined with existing policy-based reinforcement learning algorithm. We conduct experiments on both discrete and continuous control tasks. The empirical results show that our approach, which combines human suboptimal knowledge and RL, achieves significant improvement on learning efficiency of flat RL algorithms, even with very low-performance human prior knowledge.

preprint2020arXiv

Magnetic topological insulator MnBi6Te10 with zero-field ferromagnetic state and gapped Dirac surface states

Magnetic topological insulators (TIs) with nontrivial topological electronic structure and broken time-reversal symmetry exhibit various exotic topological quantum phenomena. The realization of such exotic phenomena at high temperature is one of central topics in this area. We reveal that MnBi6Te10 is a magnetic TI with an antiferromagnetic ground state below 10.8 K whose nontrivial topology is manifested by Dirac-like surface states. The ferromagnetic axion insulator state with Z4 = 2 emerges once spins polarized at field as low as 0.1 T, accompanied with saturated anomalous Hall resistivity up to 10 K. Such a ferromagnetic state is preserved even external field down to zero at 2 K. Theoretical calculations indicate that the few-layer ferromagnetic MnBi6Te10 is also topologically nontrivial with a non-zero Chern number. Angle-resolved photoemission spectroscopy experiments further reveal three types of Dirac surface states arising from different terminations on the cleavage surfaces, one of which has insulating behavior with an energy gap of ~ 28 meV at the Dirac point. These outstanding features suggest that MnBi6Te10 is a promising system to realize various topological quantum effects at zero field and high temperature.

preprint2020arXiv

Magnetically Induced Optical Transparency With Ultra-Narrow Spectrum

Magnetically induced optical transparency (MIOT) is a technique to realize the narrow transmission spectrum in a cavity quantum electric dynamics (cavity QED) system, which is demonstrated in the recent experiment of cold 88Sr atoms in an optical cavity [Phys. Rev. Lett. 118, 263601 (2017)]. In this experiment, MIOT induces a new narrow transmission window for the probe beam, which is highly immune to the fluctuation of the cavity mode frequency. The linewidth of this transmission window approaches the decay rate of the electronic 3P1 state (about 2pi*7.5kHz) and is much less than the uncertainty of the cavity mode frequency (about 2pi*150kHz). In this work, we propose an approach to further reduce the linewidth of this MIOT-induced transmission window, with the help of two Raman beams which couples the electronic 3P1 state to the3S1state, and the3S1state to the 3P0 state, respectively. With this approach, one can reduce the transmission linewidth by orders of magnitude. Moreover, the peak value of the relative transmission power or the transmission rate of the probe beam is almost unchanged by the Raman beams. Our results are helpful for the study of precision measurement and other quantum optical processes based on cavity quantum electronic dynamics (cavity-QED).

preprint2020arXiv

NbO2-based memristive neurons for burst-based perceptron

Neuromorphic computing using spike-based learning has broad prospects in reducing computing power. Memristive neurons composed with two locally active memristors have been used to mimic the dynamical behaviors of biological neurons. In this work, the dynamic operating conditions of NbO2-based memristive neurons and their transformation boundaries between the spiking and the bursting are comprehensively investigated. Furthermore, the underlying mechanism of bursting is analyzed and the controllability of the number of spikes during each burst period is demonstrated. Finally, pattern classification and information transmitting in a perceptron neural network by using the number of spikes per bursting period to encode information is proposed. The results show a promising approach for the practical implementation of neuristor in spiking neural networks.

preprint2020arXiv

Noise Reduction Technique for Raman Spectrum using Deep Learning Network

In a normal indoor environment, Raman spectrum encounters noise often conceal spectrum peak, leading to difficulty in spectrum interpretation. This paper proposes deep learning (DL) based noise reduction technique for Raman spectroscopy. The proposed DL network is developed with several training and test sets of noisy Raman spectrum. The proposed technique is applied to denoise and compare the performance with different wavelet noise reduction methods. Output signal-to-noise ratio (SNR), root-mean-square error (RMSE) and mean absolute percentage error (MAPE) are the performance evaluation index. It is shown that output SNR of the proposed noise reduction technology is 10.24 dB greater than that of the wavelet noise reduction method while the RMSE and the MAPE are 292.63 and 10.09, which are much better than the proposed technique.

preprint2020arXiv

Non-axisymmetric flow characteristics in Head-on Collision of Spinning Droplets

Effects of spinning motion on the bouncing and coalescence between a spinning droplet and a non-spinning droplet undergoing the head-on collision were numerically studied by using a Volume-of-Fluid method. A prominent discovery is that the spinning droplet can induce significant non-axisymmetric flow features for the head-on collision of equal-size droplets composed of the same liquid. Specifically, a non-axisymmetric bouncing was observed, and it is caused by the conversion of the spinning angular momentum into the orbital angular momentum. This process is accompanied by the rotational kinetic energy loss due to the interaction between the rotational and radial flows of the droplets. A non-axisymmetric internal flow and a delayed separation after temporary coalescence were also observed, and they are caused by the enhanced interface oscillation and internal-flow-induced viscous dissipation. The spinning motion can also promote the mass interminglement of droplets because the locally non-uniform mass exchange occurs at the early collision stage by non-axisymmetric flow and is further stretched along the filament at later collision stages. In addition, it is found that the non-axisymmetric flow features increase with increasing the orthogonality of the initial translational motion and the spinning motion of droplets.

preprint2020arXiv

Observation and control of the weak topological insulator state in ZrTe5

A quantum spin Hall insulator hosts topological states at the one-dimensional edge, along which backscattering by nonmagnetic impurities is strictly prohibited and dissipationless current flows. Its 3D analogue, a weak topological insulator (WTI), possesses similar quasi-1D topological states confined at side surfaces of crystals. The enhanced confinement could provide a route for dissipationless current and better advantages for applications relative to the widely studied strong topological insulators. However, the topological side surface is usually not cleavable and is thus hard to observe by angle-resolved photoemission spectroscopy (ARPES), which has hindered the revealing of the electronic properties of WTIs. Here, we visualize the topological surface states of the WTI candidate ZrTe5 for the first time by spin and angle-resolved photoemission spectroscopy: a quasi-1D band with spin-momentum locking was revealed on the side surface. We further demonstrate that the bulk band gap in ZrTe5 is controlled by strain to the crystal, realizing a more stabilized WTI state or an ideal Dirac semimetal state depending on the direction of the external strain. The highly directional spin-current and the tunable band gap we found in ZrTe5 will provide an excellent platform for applications.

preprint2020arXiv

On skin friction in wall-bounded turbulence

In this paper, we derive mathematical formulas for the skin friction coefficient in wall-bounded turbulence based on the Reynolds averaged streamwise momentum equation and the total stress. Specially, with the theoretical or empirical relation of the total stress, the skin friction coefficient is expressed in terms of the mean velocity and the Reynolds shear stress in an arbitrary wall-normal region $[h_0, h_1]$. The formulas are validated using the direct numerical simulation data of turbulent channel and boundary layer flows, and the results show that our formulas estimate the skin friction coefficient very accurately with an error less than $2\%$. We believe that the present integral formula can be used to determine the skin friction in turbulent channel and boundary layer flows at high Reynolds numbers where the near wall statistics are very difficult to measure accurately.

preprint2020arXiv

Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures: A Machine Learning Based Approach

This article presents an automatic approach to quickly derive a good solution for hardware resource partition and task granularity for task-based parallel applications on heterogeneous many-core architectures. Our approach employs a performance model to estimate the resulting performance of the target application under a given resource partition and task granularity configuration. The model is used as a utility to quickly search for a good configuration at runtime. Instead of hand-crafting an analytical model that requires expert insights into low-level hardware details, we employ machine learning techniques to automatically learn it. We achieve this by first learning a predictive model offline using training programs. The learnt model can then be used to predict the performance of any unseen program at runtime. We apply our approach to 39 representative parallel applications and evaluate it on two representative heterogeneous many-core platforms: a CPU-XeonPhi platform and a CPU-GPU platform. Compared to the single-stream version, our approach achieves, on average, a 1.6x and 1.1x speedup on the XeonPhi and the GPU platform, respectively. These results translate to over 93% of the performance delivered by a theoretically perfect predictor.

preprint2020arXiv

Quantum-Secure Microgrid

Existing microgrid communication relies on classical public key systems, which are vulnerable to attacks from quantum computers. This paper uses quantum key distribution (QKD) to solve these quantum-era microgrid challenges. Specifically, this paper makes the following novel contributions: 1) it offers a QKD-based microgrid communication architecture for microgrids; 2) it shows how to build a quantum-secure microgrid testbed in an RTDS environment; 3) it develops a key pool sharing (KPS) strategy to improve the cyberattack resilience of the QKD-based microgrid; and 4) it analyzes the impacts of critical QKD parameters with the testbed. Test results provide insightful resources for building a quantum-secure microgrid.

preprint2020arXiv

Reflecting Modulation

Reconfigurable intelligent surface (RIS) has emerged as a promising technique for future wireless communication networks. How to reliably transmit information in a RIS-based communication system arouses much interest. This paper proposes a reflecting modulation (RM) scheme for RIS-based communications, where both the reflecting patterns and transmit signals can carry information. Depending on that the transmitter and RIS jointly or independently deliver information, RM is further classified into two categories: jointly mapped RM (JRM) and separately mapped RM (SRM). JRM and SRM are naturally superior to existing schemes, because the transmit signal vectors, reflecting patterns, and bit mapping methods of JRM and SRM are more flexibly designed. To enhance transmission reliability, this paper proposes a discrete optimization-based joint signal mapping, shaping, and reflecting (DJMSR) design for JRM and SRM to minimize the bit error rate (BER) with a given transmit signal candidate set and a given reflecting pattern candidate set. To further improve the performance, this paper optimizes multiple reflecting patterns and their associated transmit signal sets in continuous fields for JRM and SRM. Numerical results show that JRM and SRM with the proposed system optimization methods considerably outperform existing schemes in BER.

preprint2020arXiv

Revisiting Regex Generation for Modeling Industrial Applications by Incorporating Byte Pair Encoder

Regular expression is important for many natural language processing tasks especially when used to deal with unstructured and semi-structured data. This work focuses on automatically generating regular expressions and proposes a novel genetic algorithm to deal with this problem. Different from the methods which generate regular expressions from character level, we first utilize byte pair encoder (BPE) to extract some frequent items, which are then used to construct regular expressions. The fitness function of our genetic algorithm contains multi objectives and is solved based on evolutionary procedure including crossover and mutation operation. In the fitness function, we take the length of generated regular expression, the maximum matching characters and samples for positive training samples, and the minimum matching characters and samples for negative training samples into consideration. In addition, to accelerate the training process, we do exponential decay on the population size of the genetic algorithm. Our method together with a strong baseline is tested on 13 kinds of challenging datasets. The results demonstrate the effectiveness of our method, which outperforms the baseline on 10 kinds of data and achieves nearly 50 percent improvement on average. By doing exponential decay, the training speed is approximately 100 times faster than the methods without using exponential decay. In summary, our method possesses both effectiveness and efficiency, and can be implemented for the industry application.

preprint2020arXiv

ScenarioSA: A Large Scale Conversational Database for Interactive Sentiment Analysis

Interactive sentiment analysis is an emerging, yet challenging, subtask of the sentiment analysis problem. It aims to discover the affective state and sentimental change of each person in a conversation. Existing sentiment analysis approaches are insufficient in modelling the interactions among people. However, the development of new approaches are critically limited by the lack of labelled interactive sentiment datasets. In this paper, we present a new conversational emotion database that we have created and made publically available, namely ScenarioSA. We manually label 2,214 multi-turn English conversations collected from natural contexts. In comparison with existing sentiment datasets, ScenarioSA (1) covers a wide range of scenarios; (2) describes the interactions between two speakers; and (3) reflects the sentimental evolution of each speaker over the course of a conversation. Finally, we evaluate various state-of-the-art algorithms on ScenarioSA, demonstrating the need of novel interactive sentiment analysis models and the potential of ScenarioSA to facilitate the development of such models.

preprint2020arXiv

Signal Shaping for Non-Uniform Beamspace Modulated mmWave Hybrid MIMO Communications

This paper investigates adaptive signal shaping methods for millimeter wave (mmWave) multiple-input multiple-output (MIMO) communications based on the maximizing the minimum Euclidean distance (MMED) criterion. In this work, we utilize the indices of analog precoders to carry information and optimize the symbol vector sets used for each analog precoder activation state. Specifically, we firstly propose a joint optimization based signal shaping (JOSS) approach, in which the symbol vector sets used for all analog precoder activation states are jointly optimized by solving a series of quadratically constrained quadratic programming (QCQP) problems. JOSS exhibits good performance, however, with a high computational complexity. To reduce the computational complexity, we then propose a full precoding based signal shaping (FPSS) method and a diagonal precoding based signal shaping (DPSS) method, where the full or diagonal digital precoders for all analog precoder activation states are optimized by solving two small-scale QCQP problems. Simulation results show that the proposed signal shaping methods can provide considerable performance gain in reliability in comparison with existing mmWave transmission solutions.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.

preprint2020arXiv

TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling

Transformer has been widely-used in many Natural Language Processing (NLP) tasks and the scaled dot-product attention between tokens is a core module of Transformer. This attention is a token-wise design and its complexity is quadratic to the length of sequence, limiting its application potential for long sequence tasks. In this paper, we propose a dimension-wise attention mechanism based on which a novel language modeling approach (namely TensorCoder) can be developed. The dimension-wise attention can reduce the attention complexity from the original $O(N^2d)$ to $O(Nd^2)$, where $N$ is the length of the sequence and $d$ is the dimensionality of head. We verify TensorCoder on two tasks including masked language modeling and neural machine translation. Compared with the original Transformer, TensorCoder not only greatly reduces the calculation of the original model but also obtains improved performance on masked language modeling task (in PTB dataset) and comparable performance on machine translation tasks.

preprint2020arXiv

The liquid argon detector and measurement of SiPM array at liquid argon temperature

Particle detectors based on liquid argon (LAr) have recently become recognized as an extremely attractive technology for the direct detection of dark matter as well as the measurement of coherent elastic neutrino-nucleus scattering (CE$ν$NS). The Chinese argon group at Institute of High Energy Physics has been studying the LAr detector technology and a LAr detector has been operating steadily. A program of using a dual phase LAr detector to measure the CE$ν$NS at Taishang Nuclear Power Plant has been proposed and the R\&D work is ongoing. Considering the requirements of ultra-low radio-purity and high photon collection efficiency, SiPMs will be a good choice and will be used in the detector. In this proceeding, an introduction of the LAr detector and the measurement results of SiPM array at LAr temperature will be presented.

preprint2020arXiv

Thermal Rectification of Solid-liquid Phase Change Thermal Diode under the Effect of Supercooling

Thermal rectification ratios of solid-liquid phase change thermal diodes (SL-PCTDs) are not sustainable beyond a certain temperature bias, necessitating reasonably compatible designs. Manipulating the heat transfer in the forward and reverse directions of the SL-PCTD is of great practical interest to achieve persistent performance. Herein, a SL-PCTD prototype is investigated under the supercooling effect of a phase change material (calcium chloride hexahydrate). In the forward direction, supercooling effect plays a significant role in sustaining natural convection far below the melting temperature (30 C), which leads to robust heat transfer within a temperature bias of 10-40 C. While in the reverse direction, dislodging the supercooling via manual supercooling release (MSR) within a large temperature range of 30-7 C tends to greatly inhibit heat transfer. As a consequence, thermal rectification ratio of 2.95 is achieved, which is competitively sustainable at large temperature bias compared to the SL-PCTD without supercooling effect. To address the effect of supercooling on thermal rectification comprehensively, thermal resistance approach is applied to theoretically model the SL-PCTD, which shows a good agreement with experimental data. Such a device with supercooling control in only one direction is further named as the SL-PCTD with gating functionality, where manipulating supercooling effect of phase change material opens up a feasible avenue for enabling generalized thermal diode.

preprint2020arXiv

Tight-Binding Kondo Model and Spin-Exchange Collision Rate of Alkaline-Earth Atoms in a Mixed-Dimensional Optical Lattice

We study the two-body problem of the ultracold fermionic alkaline-earth (like) atoms in the electronic $^1$S$_0$ state ($g$-state) and $^3$P$_0$ state ($e$-state), which are confined in a quasi-one-dimensional (quasi-1D) tube. In addition, in the axial direction, the $g$-atom experience a 1D optical lattice and the $e$-atom is localized by a harmonic potential. In this work, we propose two appropriate tight-binding models, which are applicable for the cases that the odd-wave scattering between the $g$- and $e$-atom is negligible or not, respectively. We further give a microscopic derivation for the inter-atomic interaction parameters of these tight-binding models, by exactly calculating the low-energy inter-atomic scattering amplitude. Our results show that, as one can predict, these interaction parameters can be efficiently controlled by the confinement potentials. We further exam the simple &#34;projection approximation&#34; with which one derives the interaction parameters by directly projecting the 3D Huang-Yang pseudopotential on the ground state of the confinement and the lowest band of the optical lattice. We find that one should be very careful about determining the interaction parameters in the tight-binding models. Furthermore, we calculate the spin-exchanging rate, which dependents on the incident quasi-momentum $k$ of the $g$-atom, for the recent experiment (L. Riegger, {\it et. al.,} Phys. Rev. Lett. {\bf 120}, 143601 (2018)) of $^{173}$Yb atoms in this quasi-(1+0)D system, and study finite-momentum effect in this experiment. Our results show that in this system the finite-momentum effect of the $g$-atom is very significant, and the momentum of the $g$-atoms in this experiment may be pretty high (already in the second Brillouin zone of the optical lattice)

preprint2020arXiv

Universal Free Energy Landscape Produces Efficient and Reversible Electron Bifurcation

For decades, it was unknown how electron bifurcating systems in Nature prevented energy-wasting short-circuiting reactions that have large driving forces, so synthetic electron bifurcating molecular machines could not be designed and built. The underpinning free energy landscapes for electron bifurcation were also enigmatic. We predict that a simple and universal free energy landscape enables electron bifurcation, and we show that it enables high-efficiency bifurcation with limited short-circuiting (the EB-scheme). The landscape relies on steep free energy slopes in the two redox branches to insulate against short-circuiting without relying on nuanced changes in the microscopic rate constants for the short-circuiting reactions. The EB-scheme thus provides a blueprint for future campaigns to establish synthetic electron bifurcating machines.

preprint2019arXiv

Interaction Control of Ultracold Alkaline-Earth Atoms

Ultracold alkaline-earth atoms have now been widely explored for precision measurements and quantum simulation. Because of its unique atomic structure, alkaline earth atoms possess great advantages for quantum simulation and studying quantum many-body matters, such as simulating synthetic gauge field, Kondo physics and $SU(N)$ physics. To fully explore the potential of ultracold alkaline-earth atoms, these systems also need to be equipped with the capability of tuning the inter-atomic interaction to the strongly interacting regime. Recently several theoretical proposals and experimental demonstrations have shown that both spin-independent and spin-exchanging interaction can be tuned to resonance. In this perspective, we will review these progress and discuss the new opportunities brought by these interaction control tools for future quantum simulation studies with ultracold alkaline-earth atoms.

preprint2019arXiv

Universal Dynamics of a Degenerate Bose Gas Quenched to Unitarity

Motivated by an unexpected experimental observation from the Cambridge group, [Eigen {\it et al.,} Nature {\bf563}, 221 (2018)], we study the evolution of the momentum distribution of a degenerate Bose gas quenched from the weakly interacting to the unitarity regime. For the two-body problem, we establish a relation that connects the momentum distribution at a long time to a sub-leading term in the initial wave function. For the many-body problem, we employ the time-dependent Bogoliubov variational wave function and find that, in certain momentum regimes, the momentum distribution at long times displays the same exponential behavior found by the experiment. Moreover, we find that this behavior is universal and independent of the short-range details of the interaction potential. Consistent with the relation found in the two-body problem, we also numerically show that this exponential form is hidden in the same sub-leading term of the Bogoliubov wave function in the initial stages. Our results establish a consistent picture to understand the universal dynamics observed in the Cambridge experiment.

preprint2018arXiv

Dynamic characterization of cellulose nanofibrils in sheared and extended semi-dilute dispersions

New materials made through controlled assembly of dispersed cellulose nanofibrils (CNF) has the potential to develop into biobased competitors to some of the highest performing materials today. The performance of these new cellulose materials depends on how easily CNF alignment can be controlled with hydrodynamic forces, which are always in competition with a different process driving the system towards isotropy, called rotary diffusion. In this work, we present a flow-stop experiment using polarized optical microscopy (POM) to study the rotary diffusion of CNF dispersions in process relevant flows and concentrations. This is combined with small angle X-ray scattering (SAXS) experiments to analyze the true orientation distribution function (ODF) of the flowing fibrils. It is found that the rotary diffusion process of CNF occurs at multiple time scales, where the fastest scale seems to be dependent on the deformation history of the dispersion before the stop. At the same time, the hypothesis that rotary diffusion is dependent on the initial ODF does not hold as the same distribution can result in different diffusion time scales. The rotary diffusion is found to be faster in flows dominated by shear compared to pure extensional flows. Furthermore, the experimental setup can be used to quickly characterize the dynamic properties of flowing CNF and thus aid in determining the quality of the dispersion and its usability in material processes.