Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
37works
0followers
20topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

37 published item(s)

preprint2026arXiv

FED-FSTQ: Fisher-Guided Token Quantization for Communication-Efficient Federated Fine-Tuning of LLMs on Edge Devices

Federated fine-tuning provides a practical route to adapt large language models (LLMs) on edge devices without centralizing private data, yet in mobile deployments the training wall-clock is often bottlenecked by straggler-limited uplink communication under heterogeneous bandwidth and intermittent participation. Although parameter-efficient fine-tuning (PEFT) reduces trainable parameters, per-round payloads remain prohibitive in non-IID regimes, where uniform compression can discard rare but task-critical signals. We propose Fed-FSTQ, a Fisher-guided token quantization system primitive for communication-efficient federated LLM fine-tuning. Fed-FSTQ employs a lightweight Fisher proxy to estimate token sensitivity, coupling importance-aware token selection with non-uniform mixed-precision quantization to allocate higher fidelity to informative evidence while suppressing redundant transmission. The method is model-agnostic, serves as a drop-in module for standard federated PEFT pipelines, e.g., LoRA, without modifying the server aggregation rule, and supports bandwidth-heterogeneous clients via compact sparse message packing. Experiments on multilingual QA and medical QA under non-IID partitions show that Fed-FSTQ reduces cumulative uplink traffic required to reach a fixed quality threshold by 46x relative to a standard LoRA baseline, and improves end-to-end wall-clock time-to-accuracy by 52%. Furthermore, enabling Fisher-guided token reduction at inference yields up to a 1.55x end-to-end speedup on NVIDIA Jetson-class edge devices, demonstrating deployability under tight resource constraints.

preprint2023arXiv

The diagnostic utility of endocytoscopy for the detection of esophageal lesions: a systematic review and meta-analysis

Objective: To systematically evaluate the value of endocytoscopy (ECS) in the diagnosis of early esophageal cancer (EC). Methods: Pubmed, Ovid and EMbase databases were searched to collect diagnostic tests of ECS assisted diagnosis of early EC. The retrieval time was from the establishment of the database to August 2022. Review manager 5.4, Stata 16.0 and Meta-Disc 1.4 were used for meta-analysis after two researchers independently screened literature, extracted data and evaluated the bias risk of included studies. Results: A total of 7 studies were included, including 520 lesions. Meta-analysis results showed that the combined sensitivity(SE), specificity(SP), positive likelihood ratio (PLR), negative likelihood ratio (NLR), diagnostic odds ratio (DOR) and positive posterior probability (PPP) of ECS screening for early EC were 0.95[95%CI: 0.84, 0.98], 0.92 [95%CI: 0.83, 0.96], 11.8 [95%CI: 5.3, 26.1], 0.06 [95%CI: 0.02, 0.18], 203 [95%CI: 50, 816], and 75%, respectively. The area (AUC) under the receiver Operating Characteristic curve (SROC) was 0.98[95%CI: 0.96, 0.99]. Conclusions: Current evidence suggests that ECS can be used as an effective screening tool for early EC. Due to the limited number and quality of included studies, it is imperative to conduct more high-quality studies to verify the above conclusions.

preprint2022arXiv

2020 CATARACTS Semantic Segmentation Challenge

Surgical scene segmentation is essential for anatomy and instrument localization which can be further used to assess tissue-instrument interactions during a surgical procedure. In 2017, the Challenge on Automatic Tool Annotation for cataRACT Surgery (CATARACTS) released 50 cataract surgery videos accompanied by instrument usage annotations. These annotations included frame-level instrument presence information. In 2020, we released pixel-wise semantic annotations for anatomy and instruments for 4670 images sampled from 25 videos of the CATARACTS training set. The 2020 CATARACTS Semantic Segmentation Challenge, which was a sub-challenge of the 2020 MICCAI Endoscopic Vision (EndoVis) Challenge, presented three sub-tasks to assess participating solutions on anatomical structure and instrument segmentation. Their performance was assessed on a hidden test set of 531 images from 10 videos of the CATARACTS test set.

preprint2022arXiv

A Flow-based Distributed Trading Mechanism in Regional Electricity Market with Energy Hub

The concept of Energy Hub (EH) has been emerged to accommodate renewable energy sources in a multi-energy system to deploy the synergies between electricity and other energy sources. However, the market mechanisms for the integration of the EHs into the energy markets are not sufficiently elaborated. This paper proposes a flow-based two-level distributed trading mechanism in the regional electricity market with EH. At the lower level, the regional system operator coordinates the regional grids transactions in two markets, the local energy market with EH and the wholesale market of the upstream grid. Every nodal agent as an independent stakeholder leverages price discrepancy to cross arbitrage from different markets. At the upper level, the EH is a third player intending to maximize profit from trading in the regional electricity market and gas market. The regional electricity market clearing problem is formulated as a mathematical program with equilibrium constraints, for which we develop an ADMM-based distributed algorithm to obtain the equilibrium solution. The DC power flow is decomposed into optimization problems for the regional system operator and agents at different nodes, which can be solved in a distributed manner to achieve global optimality without violating the privacy of players. Case studies based on a realistic regional grid verify the effectiveness of the proposed algorithm and show that the mechanism is effective in decomposing power flow and increasing energy efficiency.

preprint2022arXiv

Anchor Graph Structure Fusion Hashing for Cross-Modal Similarity Search

Cross-modal hashing still has some challenges needed to address: (1) most existing CMH methods take graphs as input to model data distribution. These methods omit to consider the correlation of graph structure among multiple modalities; (2) most existing CMH methods ignores considering the fusion affinity among multi-modalities data; (3) most existing CMH methods relax the discrete constraints to solve the optimization objective, significantly degrading the retrieval performance. To solve the above limitations, we propose a novel Anchor Graph Structure Fusion Hashing (AGSFH). AGSFH constructs the anchor graph structure fusion matrix from different anchor graphs of multiple modalities with the Hadamard product, which can fully exploit the geometric property of underlying data structure. Based on the anchor graph structure fusion matrix, AGSFH attempts to directly learn an intrinsic anchor graph, where the structure of the intrinsic anchor graph is adaptively tuned so that the number of components of the intrinsic graph is exactly equal to the number of clusters. Besides, AGSFH preserves the anchor fusion affinity into the common binary Hamming space. Furthermore, a discrete optimization framework is designed to learn the unified binary codes. Extensive experimental results on three public social datasets demonstrate the superiority of AGSFH.

preprint2022arXiv

Discrimination-Based Double Auction for Maximizing Social Welfare in the Electricity and Heating Market Considering Privacy Preservation

This paper proposes a doubled-sided auction mechanism with price discrimination for social welfare (SW) maximization in the electricity and heating market. In this mechanism, energy service providers (ESPs) submit offers and load aggregators (LAs) submit bids to an energy trading center (ETC) to maximize their utility; in turn, the selfless ETC as an auctioneer leverages dis-criminatory price weights to regulate the behaviors of ESPs and LAs, which combines the individual benefits of each stakeholder with the overall social welfare to achieve the global optimum. Nash games are employed to describe the interactions between players with the same market role. Theoretically, we first prove the existence and uniqueness of the Nash equilibrium; then, considering the requirement of game players to preserve privacy, a distributed algorithm based on the alternating direction method of multipliers is developed to implement distributed bidding and analytical target cascading algorithm is applied to reach the balance of demand and supply. We validated the proposed mechanism using case studies on a city-level distribution system. The results indicated that the achieved SW improved by 4%-15% compared with other mechanisms, and also verified the effectiveness of the distributed algorithm.

preprint2022arXiv

Efficient Argument Structure Extraction with Transfer Learning and Active Learning

The automation of extracting argument structures faces a pair of challenges on (1) encoding long-term contexts to facilitate comprehensive understanding, and (2) improving data efficiency since constructing high-quality argument structures is time-consuming. In this work, we propose a novel context-aware Transformer-based argument structure prediction model which, on five different domains, significantly outperforms models that rely on features or only encode limited contexts. To tackle the difficulty of data annotation, we examine two complementary methods: (i) transfer learning to leverage existing annotated data to boost model performance in a new target domain, and (ii) active learning to strategically identify a small amount of samples for annotation. We further propose model-independent sample acquisition strategies, which can be generalized to diverse domains. With extensive experiments, we show that our simple-yet-effective acquisition strategies yield competitive results against three strong comparisons. Combined with transfer learning, substantial F1 score boost (5-25) can be further achieved during the early iterations of active learning across domains.

preprint2022arXiv

Extensive Study of Multiple Deep Neural Networks for Complex Random Telegraph Signals

Time-fluctuating signals are ubiquitous and diverse in many physical, chemical, and biological systems, among which random telegraph signals (RTSs) refer to a series of instantaneous switching events between two discrete levels from single-particle movements. Reliable RTS analyses are crucial prerequisite to identify underlying mechanisms related to performance sensitivity. When numerous levels partake, complex patterns of multilevel RTSs occur, making their quantitative analysis exponentially difficult, hereby systematic approaches are found elusive. Here, we present a three-step analysis protocol via progressive knowledge-transfer, where the outputs of early step are passed onto a subsequent step. Especially, to quantify complex RTSs, we build three deep neural network architectures that can process temporal data well and demonstrate the model accuracy extensively with a large dataset of different RTS types affected by controlling background noise size. Our protocol offers structured schemes to quantify complex RTSs from which meaningful interpretation and inference can ensue.

preprint2022arXiv

HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization

Document structure is critical for efficient information consumption. However, it is challenging to encode it efficiently into the modern Transformer architecture. In this work, we present HIBRIDS, which injects Hierarchical Biases foR Incorporating Document Structure into the calculation of attention scores. We further present a new task, hierarchical question-summary generation, for summarizing salient content in the source document into a hierarchy of questions and summaries, where each follow-up question inquires about the content of its parent question-summary pair. We also annotate a new dataset with 6,153 question-summary hierarchies labeled on long government reports. Experiment results show that our model produces better question-summary hierarchies than comparisons on both hierarchy quality and content coverage, a finding also echoed by human judges. Additionally, our model improves the generation of long-form summaries from lengthy government reports and Wikipedia articles, as measured by ROUGE scores.

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

Simulating nuclear and electronic quantum effects in enzymes

An accurate treatment of the structures and dynamics that lead to enhanced chemical reactivity in enzymes requires explicit treatment of both electronic and nuclear quantum effects. The former can be captured in ab initio molecular dynamics (AIMD) simulations while the latter can be included by performing ab initio path integral molecular dynamics (AI-PIMD) simulations. Both AIMD and AI-PIMD simulations have traditionally been computationally prohibitive for large enzymatic systems. Recent developments in streaming computer architectures and new algorithms to accelerate path integral simulations now make these simulations practical for biological systems, allowing elucidation of enzymatic reactions in unprecedented detail. In this chapter, we summarize these recent developments and discuss practical considerations for applying AIMD and AI-PIMD simulations to enzymes.

preprint2022arXiv

Spatio-Temporal Coupling Controlled Laser for Electron Acceleration

Limited by the difficulty in acceleration synchronization, it has been a long-term challenge for on-chip dielectric laser-based accelerators (DLA) to bridge the gap between non-relativistic and relativistic regimes. Here, we propose a DLA based on a spatio-temporal coupling (STC) controlled laser pulse, which enables the acceleration of a non-relativistic electron to a sub-MeV level in a single acceleration structure (chirped spatial grating). It provides high precision temporal and spatial tuning of the driving laser via the dispersion manipulation, leading to a synchronous acceleration of the velocity increasing electrons over a large energy range. Additionally, the STC scheme is a general method and can be extended to driving fields of other wavelengths such as terahertz pulses. Our results bring new possibilities to MeV-scale portable electron sources and table-top acceleration experiments.

preprint2022arXiv

Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model

In spoken language understanding (SLU), what the user says is converted to his/her intent. Recent work on end-to-end SLU has shown that accuracy can be improved via pre-training approaches. We revisit ideas presented by Lugosch et al. using speech pre-training and three-module modeling; however, to ease construction of the end-to-end SLU model, we use as our phoneme module an open-source acoustic-phonetic model from a DNN-HMM hybrid automatic speech recognition (ASR) system instead of training one from scratch. Hence we fine-tune on speech only for the word module, and we apply multi-target learning (MTL) on the word and intent modules to jointly optimize SLU performance. MTL yields a relative reduction of 40% in intent-classification error rates (from 1.0% to 0.6%). Note that our three-module model is a streaming method. The final outcome of the proposed three-module modeling approach yields an intent accuracy of 99.4% on FluentSpeech, an intent error rate reduction of 50% compared to that of Lugosch et al. Although we focus on real-time streaming methods, we also list non-streaming methods for comparison.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2021arXiv

MD-MTL: An Ensemble Med-Multi-Task Learning Package for DiseaseScores Prediction and Multi-Level Risk Factor Analysis

While many machine learning methods have been used for medical prediction and risk factor analysis on healthcare data, most prior research has involved single-task learning (STL) methods. However, healthcare research often involves multiple related tasks. For instance, implementation of disease scores prediction and risk factor analysis in multiple subgroups of patients simultaneously and risk factor analysis at multi-levels synchronously. In this paper, we developed a new ensemble machine learning Python package based on multi-task learning (MTL), referred to as the Med-Multi-Task Learning (MD-MTL) package and applied it in predicting disease scores of patients, and in carrying out risk factor analysis on multiple subgroups of patients simultaneously. Our experimental results on two datasets demonstrate the utility of the MD-MTL package, and show the advantage of MTL (vs. STL), when analyzing data that is organized into different categories (tasks, which can be various age groups, different levels of disease severity, etc.).

preprint2020arXiv

A uniqueness theorem for asymptotically cylindrical shrinking Ricci solitons

We prove that a shrinking gradient Ricci soliton which agrees to infinite order at spatial infinity with one of the standard cylindrical metrics on $S^k\times \RR^{n-k}$ for $k\geq 2$ along some end must be isometric to the cylinder on that end. When the underlying manifold is complete, it must be globally isometric either to the cylinder or (when $k=n-1$) to its $\ZZ_2$-quotient.

preprint2020arXiv

Annotation-Efficient Learning for Medical Image Segmentation based on Noisy Pseudo Labels and Adversarial Learning

Despite that deep learning has achieved state-of-the-art performance for medical image segmentation, its success relies on a large set of manually annotated images for training that are expensive to acquire. In this paper, we propose an annotation-efficient learning framework for segmentation tasks that avoids annotations of training images, where we use an improved Cycle-Consistent Generative Adversarial Network (GAN) to learn from a set of unpaired medical images and auxiliary masks obtained either from a shape model or public datasets. We first use the GAN to generate pseudo labels for our training images under the implicit high-level shape constraint represented by a Variational Auto-encoder (VAE)-based discriminator with the help of the auxiliary masks, and build a Discriminator-guided Generator Channel Calibration (DGCC) module which employs our discriminator&#39;s feedback to calibrate the generator for better pseudo labels. To learn from the pseudo labels that are noisy, we further introduce a noise-robust iterative learning method using noise-weighted Dice loss. We validated our framework with two situations: objects with a simple shape model like optic disc in fundus images and fetal head in ultrasound images, and complex structures like lung in X-Ray images and liver in CT images. Experimental results demonstrated that 1) Our VAE-based discriminator and DGCC module help to obtain high-quality pseudo labels. 2) Our proposed noise-robust learning method can effectively overcome the effect of noisy pseudo labels. 3) The segmentation performance of our method without using annotations of training images is close or even comparable to that of learning from human annotations.

preprint2020arXiv

Asymmetric Correlation Quantization Hashing for Cross-modal Retrieval

Due to the superiority in similarity computation and database storage for large-scale multiple modalities data, cross-modal hashing methods have attracted extensive attention in similarity retrieval across the heterogeneous modalities. However, there are still some limitations to be further taken into account: (1) most current CMH methods transform real-valued data points into discrete compact binary codes under the binary constraints, limiting the capability of representation for original data on account of abundant loss of information and producing suboptimal hash codes; (2) the discrete binary constraint learning model is hard to solve, where the retrieval performance may greatly reduce by relaxing the binary constraints for large quantization error; (3) handling the learning problem of CMH in a symmetric framework, leading to difficult and complex optimization objective. To address above challenges, in this paper, a novel Asymmetric Correlation Quantization Hashing (ACQH) method is proposed. Specifically, ACQH learns the projection matrixs of heterogeneous modalities data points for transforming query into a low-dimensional real-valued vector in latent semantic space and constructs the stacked compositional quantization embedding in a coarse-to-fine manner for indicating database points by a series of learnt real-valued codeword in the codebook with the help of pointwise label information regression simultaneously. Besides, the unified hash codes across modalities can be directly obtained by the discrete iterative optimization framework devised in the paper. Comprehensive experiments on diverse three benchmark datasets have shown the effectiveness and rationality of ACQH.

preprint2020arXiv

Coronary Artery Segmentation in Angiographic Videos Using A 3D-2D CE-Net

Coronary angiography is an indispensable assistive technique for cardiac interventional surgery. Segmentation and extraction of blood vessels from coronary angiography videos are very essential prerequisites for physicians to locate, assess and diagnose the plaques and stenosis in blood vessels. This article proposes a new video segmentation framework that can extract the clearest and most comprehensive coronary angiography images from a video sequence, thereby helping physicians to better observe the condition of blood vessels. This framework combines a 3D convolutional layer to extract spatial--temporal information from a video sequence and a 2D CE--Net to accomplish the segmentation task of an image sequence. The input is a few continuous frames of angiographic video, and the output is a mask of segmentation result. From the results of segmentation and extraction, we can get good segmentation results despite the poor quality of coronary angiography video sequences.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO&#39;s features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO&#39;s potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

Intrinsic current drive by electromagnetic electron drift wave turbulence in tokamak pedestal region

The local intrinsic parallel current density driven by electron drift wave (DW) turbulence including electromagnetic (EM) effects is analytically studied. The scalings of the ratios of intrinsic current density driven by residual turbulent flux and by turbulent source to the bootstrap current density with electron density and temperature are predicted to be $T_e^{3/4}T_i/n_e$ and $T_eT_i/n_e$, respectively. Based on the typical parameters in DIII-D pedestal region, the local intrinsic current density driven by both the residual turbulent flux and the turbulent source is negligible. However, despite the negligible turbulent source driven current, the residual turbulent flux driven local intrinsic current density by EM DW turbulence can reach about 66% of the bootstrap current density for ITER pedestal parameters due to much lower collisionality in ITER than in DIII-D. Moreover, the contributions from adiabatic ES parts, non-adiabatic ES parts and non-adiabatic EM parts of the plasma response to electromagnetic fluctuations are analyzed. It is found that there exists strong cancelation between non-adiabatic ES response and the non-adiabatic EM response for the ITER pedestal case, and thus the kinetic stress contributed by the adiabatic ES response of parallel electron pressure dominates the intrinsic current drive. This is different from the ES electron DW case. Therefore, the EM effects on turbulence driven intrinsic current density should be carefully considered in the future reactor with high ratio of electron pressure to the magnetic pressure and steep pressure profile.

preprint2020arXiv

Intrinsic current drive by electromagnetic electron temperature gradient turbulence in tokamak plasmas

The mean parallel current density evolution equation is presented using electromagnetic (EM) gyrokinetic equation. There exist two types of intrinsic current driving mechanisms resulted from EM electron temperature gradient (ETG) turbulence. The first type is the divergence of residual turbulent flux including a residual stress-like term and a kinetic stress-like term. The second type is named as residual turbulent source, which is driven by the correlation between density and parallel electric field fluctuations. The intrinsic current density driven by the residual turbulent source is negligible as compared to that driven by the residual turbulent flux. The ratio of intrinsic current density driven by EM ETG turbulence to the background bootstrap current density is estimated. The local intrinsic current density driven by the residual turbulent flux for mesoscale variation of turbulent flux can reach about 80% of the bootstrap current density in the core region of ITER standard scenario, but there is no net intrinsic current on a global scale. Based on this, the local intrinsic current driven by EM micro-turbulence and its effects on local modification of the profile of safety factor may be needed to be carefully taken into account in the future device with high beta_e which is the ratio between electron pressure to the magnetic pressure.

preprint2020arXiv

Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward

Sequence-to-sequence models for abstractive summarization have been studied extensively, yet the generated summaries commonly suffer from fabricated content, and are often found to be near-extractive. We argue that, to address these issues, the summarizer should acquire semantic interpretation over input, e.g., via structured representation, to allow the generation of more informative summaries. In this paper, we present ASGARD, a novel framework for Abstractive Summarization with Graph-Augmentation and semantic-driven RewarD. We propose the use of dual encoders---a sequential document encoder and a graph-structured encoder---to maintain the global context and local characteristics of entities, complementing each other. We further design a reward based on a multiple choice cloze test to drive the model to better capture entity interactions. Results show that our models produce significantly higher ROUGE scores than a variant without knowledge graph as input on both New York Times and CNN/Daily Mail datasets. We also obtain better or comparable performance compared to systems that are fine-tuned from large pretrained language models. Human judges further rate our model outputs as more informative and containing fewer unfaithful errors.

preprint2020arXiv

Quantum kinetic energy and isotope fractionation in aqueous ionic solutions

At room temperature, the quantum contribution to the kinetic energy of a water molecule exceeds the classical contribution by an order of magnitude. The quantum kinetic energy (QKE) of a water molecule is modulated by its local chemical environment and leads to uneven partitioning of isotopes between different phases in thermal equilibrium, which would not occur if the nuclei behaved classically. In this work, we use ab initio path integral simulations to show that QKEs of the water molecules and the equilibrium isotope fractionation ratios of the oxygen and hydrogen isotopes are sensitive probes of the hydrogen bonding structures in aqueous ionic solutions. In particular, we demonstrate how the QKE of water molecules in path integral simulations can be decomposed into translational, rotational and vibrational degrees of freedom, and use them to determine the impact of solvation on different molecular motions. By analyzing the QKEs and isotope fractionation ratios, we show how the addition of the Na$^+$, Cl$^-$ and HPO$_4^{2-}$ ions perturbs the competition between quantum effects in liquid water and impacts their local solvation structures.

preprint2020arXiv

Reducing the X-ray radiation exposure frequency in cardio-angiography via deep-learning based video interpolation

Cardiac coronary angiography is a major technology to assist doctors during cardiac interventional surgeries. Under the exposure of X-ray radiation, doctors inject contrast agents through catheters to determine the position and status of coronary vessels in real time. To get a coronary angiography video with a high frame rate, the doctor needs to increase the exposure frequency and intensity of the X-ray. This will inevitably increase the X-ray harm to both patients and surgeons. In this work, we innovatively utilize a deep-learning based video interpolation algorithm to interpolate coronary angiography videos. Moreover, we establish a new coronary angiography image dataset ,which contains 95,039 triplets images to retrain the video interpolation network model. Using the retrained network we synthesize high frame rate coronary angiography video from the low frame rate coronary angiography video. The average peak signal to noise ratio(PSNR) of those synthesized video frames reaches 34dB. Extensive experiment results demonstrate the feasibility of using the video frame interpolation algorithm to synthesize continuous and clear high frame rate coronary angiography video. With the help of this technology, doctors can significantly reduce exposure frequency and intensity of the X-ray during coronary angiography.

preprint2020arXiv

Relative expander entropy in the presence of a two-sided obstacle and applications

We study a notion of relative entropy motivated by self-expanders of mean curvature flow. In particular, we obtain the existence of this quantity for arbitrary hypersurfaces trapped between two disjoint self-expanders asymptotic to the same cone. This allows us to begin to develop the variational theory for the relative entropy functional for the associated obstacle problem. We also obtain a version of the forward monotonicity formula for mean curvature flow proposed by Ilmanen.

preprint2020arXiv

Robust Medical Instrument Segmentation Challenge 2019

Intraoperative tracking of laparoscopic instruments is often a prerequisite for computer and robotic-assisted interventions. While numerous methods for detecting, segmenting and tracking of medical instruments based on endoscopic video images have been proposed in the literature, key limitations remain to be addressed: Firstly, robustness, that is, the reliable performance of state-of-the-art methods when run on challenging images (e.g. in the presence of blood, smoke or motion artifacts). Secondly, generalization; algorithms trained for a specific intervention in a specific hospital should generalize to other interventions or institutions. In an effort to promote solutions for these limitations, we organized the Robust Medical Instrument Segmentation (ROBUST-MIS) challenge as an international benchmarking competition with a specific focus on the robustness and generalization capabilities of algorithms. For the first time in the field of endoscopic image processing, our challenge included a task on binary segmentation and also addressed multi-instance detection and segmentation. The challenge was based on a surgical data set comprising 10,040 annotated images acquired from a total of 30 surgical procedures from three different types of surgery. The validation of the competing methods for the three tasks (binary segmentation, multi-instance detection and multi-instance segmentation) was performed in three different stages with an increasing domain gap between the training and the test data. The results confirm the initial hypothesis, namely that algorithm performance degrades with an increasing domain gap. While the average detection and segmentation quality of the best-performing algorithms is high, future research should concentrate on detection and segmentation of small, crossing, moving and transparent instrument(s) (parts).

preprint2020arXiv

Tackling Ordinal Regression Problem for Heterogeneous Data: Sparse and Deep Multi-Task Learning Approaches

Many real-world datasets are labeled with natural orders, i.e., ordinal labels. Ordinal regression is a method to predict ordinal labels that finds a wide range of applications in data-rich domains, such as natural, health and social sciences. Most existing ordinal regression approaches work well for independent and identically distributed (IID) instances via formulating a single ordinal regression task. However, for heterogeneous non-IID instances with well-defined local geometric structures, e.g., subpopulation groups, multi-task learning (MTL) provides a promising framework to encode task (subgroup) relatedness, bridge data from all tasks, and simultaneously learn multiple related tasks in efforts to improve generalization performance. Even though MTL methods have been extensively studied, there is barely existing work investigating MTL for heterogeneous data with ordinal labels. We tackle this important problem via sparse and deep multi-task approaches. Specifically, we develop a regularized multi-task ordinal regression (MTOR) model for smaller datasets and a deep neural networks based MTOR model for large-scale datasets. We evaluate the performance using three real-world healthcare datasets with applications to multi-stage disease progression diagnosis. Our experiments indicate that the proposed MTOR models markedly improve the prediction performance comparing with single-task ordinal regression models.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.

preprint2020arXiv

Tilted-pulse-front schemes for terahertz generation

High energy single- to few-cycle terahertz pulses enable the exploration of the frontier of science, such as electron acceleration, strong-field physics, and spectroscopy. One important method of generating such terahertz pulses is to use the tilted-pulse-front (TPF) technique. However, due to the non-collinear phase-matching, the large angular dispersion leads to a spatial and temporal break-up of the optical pump, limiting the terahertz generation efficiency and reducing the few-cycle character of the generated terahertz fields. To reduce the effects caused by angular dispersion, multiple schemes with discrete pulse-front-tilt have been suggested. We propose a terahertz generation scheme, where a spatio-temporally chirped (STC) pump pulse is utilized. With our 2D+1 numerical model, we perform a systematic comparative study of the conventional TPF scheme, three discrete TPF schemes, and the STC scheme. This model predicts smaller optimal interaction length and significantly smaller conversion efficiency compared to simple 1D models. We conclude that the STC scheme delivers spatially homogeneous few-cycle terahertz pulses with the highest conversion efficiency. Additionally, given a short interaction length, the discrete TPF schemes cannot outperform the continuous ones. In general, this work gives guidance to choose the most appropriate setup for a given terahertz experiment

preprint2020arXiv

Vision-Based Fall Event Detection in Complex Background Using Attention Guided Bi-directional LSTM

Fall event detection, as one of the greatest risks to the elderly, has been a hot research issue in the solitary scene in recent years. Nevertheless, there are few researches on the fall event detection in complex background. Different from most conventional background subtraction methods which depend on background modeling, Mask R-CNN method based on deep learning technique can clearly extract the moving object in noise background. We further propose an attention guided Bi-directional LSTM model for the final fall event detection. To demonstrate the efficiency, the proposed method is verified in the public dataset and self-build dataset. Evaluation of the algorithm performances in comparison with other state-of-the-art methods indicates that the proposed design is accurate and robust, which means it is suitable for the task of fall event detection in complex situation.

preprint2020arXiv

XREF: Entity Linking for Chinese News Comments with Supplementary Article Reference

Automatic identification of mentioned entities in social media posts facilitates quick digestion of trending topics and popular opinions. Nonetheless, this remains a challenging task due to limited context and diverse name variations. In this paper, we study the problem of entity linking for Chinese news comments given mentions&#39; spans. We hypothesize that comments often refer to entities in the corresponding news article, as well as topics involving the entities. We therefore propose a novel model, XREF, that leverages attention mechanisms to (1) pinpoint relevant context within comments, and (2) detect supporting entities from the news article. To improve training, we make two contributions: (a) we propose a supervised attention loss in addition to the standard cross entropy, and (b) we develop a weakly supervised training scheme to utilize the large-scale unlabeled corpus. Two new datasets in entertainment and product domains are collected and annotated for experiments. Our proposed method outperforms previous methods on both datasets.

preprint2019arXiv

Cluster-wise Unsupervised Hashing for Cross-Modal Similarity Search

Large-scale cross-modal hashing similarity retrieval has attracted more and more attention in modern search applications such as search engines and autopilot, showing great superiority in computation and storage. However, current unsupervised cross-modal hashing methods still have some limitations: (1)many methods relax the discrete constraints to solve the optimization objective which may significantly degrade the retrieval performance;(2)most existing hashing model project heterogenous data into a common latent space, which may always lose sight of diversity in heterogenous data;(3)transforming real-valued data point to binary codes always results in abundant loss of information, producing the suboptimal continuous latent space. To overcome above problems, in this paper, a novel Cluster-wise Unsupervised Hashing (CUH) method is proposed. Specifically, CUH jointly performs the multi-view clustering that projects the original data points from different modalities into its own low-dimensional latent semantic space and finds the cluster centroid points and the common clustering indicators in its own low-dimensional space, and learns the compact hash codes and the corresponding linear hash functions. An discrete optimization framework is developed to learn the unified binary codes across modalities under the guidance cluster-wise code-prototypes. The reasonableness and effectiveness of CUH is well demonstrated by comprehensive experiments on diverse benchmark datasets.

preprint2019arXiv

Full 3D+1 modelling of the tilted-pulse-front setups for single-cycle terahertz generation

The tilted-pulse-front setup utilizing a diffraction grating is one of the most successful methods to generate single- to few-cycle terahertz pulses. However, the generated terahertz pulses have a large spatial inhomogeneity, due to the noncollinear phase matching condition and the asymmetry of the prism-shaped nonlinear crystal geometry, especially when pushing for high optical-to-terahertz conversion efficiency. A 3D+1 (x,y,z,t) numerical model is necessary in order to fully investigate the terahertz generation problem in the tilted-pulse-front scheme. We compare in detail the differences between 1D+1, 2D+1 and 3D+1 models. The simulations show that the size of the optical beam in the pulse-front-tilt plane sensitively affects the spatio-temporal properties of the terahertz electric field. The terahertz electric field is found to have a strong spatial dependence such that a few-cycle pulse is only generated near the apex of the prism. The part of the beam farther from the apex contains a large fraction of the energy but has a waveform that deviates from a few-cycle. This strong spatial dependence must be accounted for when using the terahertz pulses for strong-field physics and carrier-envelope-phase sensitive experiments such as terahertz acceleration, coherent control of antiferromagnetic spin waves and terahertz high-harmonic generation.