Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
31works
0followers
23topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

31 published item(s)

preprint2026arXiv

Capability Conditioned Scaffolding for Professional Human LLM Collaboration

Large language model personalization typically adapts outputs to user preferences and style but does not account for differences in user evaluation capacity across domains of expertise. This limitation can encourage Professional Domain Drift, where users rely on AI generated reasoning in domains they cannot reliably evaluate. We introduce Capability Conditioned Scaffolding, a typed framework that partitions expertise into strong, mixed, and weak domains and conditions intervention behavior on structured capability profiles. A pilot evaluation across multiple MMLU subsets and four LLM substrates shows consistent profile conditioned intervention behavior, including categorical inversion under profile swapping and selective activation in mixed domain risk zones. These findings suggest that capability aware scaffolding can support more reliable professional human AI collaboration beyond stylistic personalization.

preprint2026arXiv

Deformations of Chow groups via cyclic homology

Let $X$ be a smooth projective variety over an arbitrary field $k$ of characteristic zero. We explore infinitesimal deformations of the Chow group $CH^{p}(X)$ via its formal completion $\widehat{CH}^{p}$, a functor defined on the category of local augmented Artinian $k$-algebras. Under a natural vanishing condition on Hodge cohomology groups, for certain augmented graded Artinian $k$-algebras $A$ with the maximal ideal $m_{A}$, we prove that \[ \widehat{CH}^{p}(A) \cong H^{p}(X, Ω^{p-1}_{X/ k})\otimes_{k}m_{A}. \]This extends earlier results of Bloch and others from the case where $k$ is algebraic over $\mathbb{Q}$ to arbitrary fields of characteristic zero,and gives a partial affirmative answer to a general question linking the pro-representability of Chow groups to a specific set of Hodge-theoretic vanishing conditions.

preprint2026arXiv

HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking

As an essential component of autonomous driving systems, high-definition (HD) maps provide rich and precise environmental information for auto-driving scenarios; however, existing methods, which primarily rely on query-based detection frameworks to directly model map elements or implicitly propagate queries over time, often struggle to maintain consistent temporal perception outcomes. These inconsistencies pose significant challenges to the stability and reliability of real-world autonomous driving and map data collection systems. To address this limitation, we propose a novel end-to-end tracking framework for global map construction by temporally tracking map elements' historical trajectories. Firstly, instance-level historical rasterization map representation is designed to explicitly store previous perception results, which can control and maintain different global instances' history information in a fine-grained way. Secondly, we introduce a Map-Trajectory Prior Fusion module within this tracking framework, leveraging historical priors for tracked instances to improve temporal smoothness and continuity. Thirdly, we propose a global perspective metric to evaluate the quality of temporal geometry construction in HD maps, filling the gap in current metrics for assessing global geometric perception results. Substantial experiments on the nuScenes and Argoverse2 datasets demonstrate that the proposed method outperforms state-of-the-art (SOTA) methods in both single-frame and temporal metrics. The project page is available at: https://yj772881654.github.io/HisTrackMap.

preprint2026arXiv

Pro-representability of Chow groups and Hodge numbers

Let $k$ be an algebraic field extension of $\mathbb{Q}$ and let $X$ be a smooth projective variety over $k$ of dimension $d \geq 2$. We study the pro-representability of the Chow group $CH^{p}(X)$ with $2 \leq p \leq d$. When certain Hodge numbers of $X$ vanish, namely, $H^{p}(X,Ω^{i}_{X/k})=H^{p+1}(X,Ω^{i}_{X/k})= \cdots =H^{2p-1-i}(X,Ω^{i}_{X/k})=0$ for $i$ such that $0 \leq i \leq p-2$, we prove that the formal completion $\widehat{CH}^{p}(A)$ of $CH^{p}(X)$ at a local augmented Artinian $k$-algebra $A$ with the maximal ideal $m_{A}$ satisfies \[ \widehat{CH}^{p}(A) \cong H^{p}(X, Ω^{p-1}_{X/ k})\otimes_{k}m_{A}. \]This provides a unified cohomological criterion for the pro-representability of the functor $\widehat{CH}^{p}$, generalizing earlier work by Bloch, Stienstra, and Mackall for $p=2$ and $p=3$. Our result reveals an intrinsic connection between the deformation theory of algebraic cycles and the Hodge structure of $X$.

preprint2024arXiv

First Principles based High-precision Modelling and Identification of Piezoelectric Fast Steering Mirror

We establish a high-precision composite model for a piezoelectric fast steering mirror (PFSM) using a Hammerstein structure. A novel asymmetric Bouc-Wen model is proposed to describe the nonlinear rate-independent hysteresis, while a dynamic model is derived to represent the linear rate-dependent component. By analyzing the physical process from the displacement of the piezoelectric actuator to the angle of the PFSM, cross-axis coupling is modeled based on first principles. Given the dynamic isolation of each module on different frequency scales, a step-by-step method for model parameter identification is carried out. Finally, experimental results demonstrate that the identified parameters can accurately represent the hysteresis, creep, and mechanical dynamic characteristics of the PFSM. Furthermore, by comparing the outputs of the identified model with the real PFSM under different excitation signals, the effectiveness of the proposed dual-input dual-output composite model is validated.

preprint2024arXiv

High-Accuracy Model Predictive Control with Inverse Hysteresis for High-Speed Trajectory Tracking of Piezoelectric Fast Steering Mirror

Piezoelectric fast steering mirrors (PFSM) are widely utilized in beam precision-pointing systems but encounter considerable challenges in achieving high-precision tracking of fast trajectories due to nonlinear hysteresis and mechanical dual-axis cross-coupling. This paper proposes a model predictive control (MPC) approach integrated with a hysteresis inverse based on the Hammerstein modeling structure of the PFSM. The MPC is designed to decouple the rate-dependent dual-axis linear components, with an augmented error integral variable introduced in the state space to eliminate steady-state errors. Moreover, proofs of zero steady-state error and disturbance rejection are provided. The hysteresis inverse model is then cascaded to compensate for the rate-independent nonlinear components. Finally, PFSM tracking experiments are conducted on step, sinusoidal, triangular, and composite trajectories. Compared to traditional model-free and existing model-based controllers, the proposed method significantly enhances tracking accuracy, demonstrating superior tracking performance and robustness to frequency variations. These results offer valuable insights for engineering applications.

preprint2023arXiv

Artificial intelligence for diagnosing and predicting survival of patients with renal cell carcinoma: Retrospective multi-center study

Background: Clear cell renal cell carcinoma (ccRCC) is the most common renal-related tumor with high heterogeneity. There is still an urgent need for novel diagnostic and prognostic biomarkers for ccRCC. Methods: We proposed a weakly-supervised deep learning strategy using conventional histology of 1752 whole slide images from multiple centers. Our study was demonstrated through internal cross-validation and external validations for the deep learning-based models. Results: Automatic diagnosis for ccRCC through intelligent subtyping of renal cell carcinoma was proved in this study. Our graderisk achieved aera the curve (AUC) of 0.840 (95% confidence interval: 0.805-0.871) in the TCGA cohort, 0.840 (0.805-0.871) in the General cohort, and 0.840 (0.805-0.871) in the CPTAC cohort for the recognition of high-grade tumor. The OSrisk for the prediction of 5-year survival status achieved AUC of 0.784 (0.746-0.819) in the TCGA cohort, which was further verified in the independent General cohort and the CPTAC cohort, with AUC of 0.774 (0.723-0.820) and 0.702 (0.632-0.765), respectively. Cox regression analysis indicated that graderisk, OSrisk, tumor grade, and tumor stage were found to be independent prognostic factors, which were further incorporated into the competing-risk nomogram (CRN). Kaplan-Meier survival analyses further illustrated that our CRN could significantly distinguish patients with high survival risk, with hazard ratio of 5.664 (3.893-8.239, p < 0.0001) in the TCGA cohort, 35.740 (5.889-216.900, p < 0.0001) in the General cohort and 6.107 (1.815 to 20.540, p < 0.0001) in the CPTAC cohort. Comparison analyses conformed that our CRN outperformed current prognosis indicators in the prediction of survival status, with higher concordance index for clinical prognosis.

preprint2022arXiv

Adversarial Contrastive Self-Supervised Learning

Recently, learning from vast unlabeled data, especially self-supervised learning, has been emerging and attracted widespread attention. Self-supervised learning followed by the supervised fine-tuning on a few labeled examples can significantly improve label efficiency and outperform standard supervised training using fully annotated data. In this work, we present a novel self-supervised deep learning paradigm based on online hard negative pair mining. Specifically, we design a student-teacher network to generate multi-view of the data for self-supervised learning and integrate hard negative pair mining into the training. Then we derive a new triplet-like loss considering both positive sample pairs and mined hard negative sample pairs. Extensive experiments demonstrate the effectiveness of the proposed method and its components on ILSVRC-2012.

preprint2022arXiv

Bloch-Ogus theorem,cyclic homology and deformation of Chow groups

Using Bloch-Ogus theorem and Chern character from K-theory to cyclic homology, we answer a question of Green and Griffiths on extending Bloch formula. Moreover, we construct a map from local Hilbert functor to local cohomology. With suitable assumptions, we use this map to answer a question of Bloch on constructing a natural transformation from local Hilbert functor to cohomological Chow groups.

preprint2022arXiv

Chern character, semi-regularity map and obstructions

Using Chern character, we construct a natural transformation from the local Hilbert functor to a functor of Artin rings defined from Hochschild homology, which allows us to reconstruct the semi-regularity map and the infinitesimal Abel-Jacobi map. Combining that construction of the semi-regularity map with obstruction theory of functors of Artin rings, we give a different proof of a theorem of Bloch stating that the semi-regularity map annihilates certain obstructions to embedded deformations of a closed subvariety which is a locally complete intersection.

preprint2022arXiv

Do Prompts Solve NLP Tasks Using Natural Language?

Thanks to the advanced improvement of large pre-trained language models, prompt-based fine-tuning is shown to be effective on a variety of downstream tasks. Though many prompting methods have been investigated, it remains unknown which type of prompts are the most effective among three types of prompts (i.e., human-designed prompts, schema prompts and null prompts). In this work, we empirically compare the three types of prompts under both few-shot and fully-supervised settings. Our experimental results show that schema prompts are the most effective in general. Besides, the performance gaps tend to diminish when the scale of training data grows large.

preprint2022arXiv

Exploring Runtime Decision Support for Trauma Resuscitation

AI-based recommender systems have been successfully applied in many domains (e.g., e-commerce, feeds ranking). Medical experts believe that incorporating such methods into a clinical decision support system may help reduce medical team errors and improve patient outcomes during treatment processes (e.g., trauma resuscitation, surgical processes). Limited research, however, has been done to develop automatic data-driven treatment decision support. We explored the feasibility of building a treatment recommender system to provide runtime next-minute activity predictions. The system uses patient context (e.g., demographics and vital signs) and process context (e.g., activities) to continuously predict activities that will be performed in the next minute. We evaluated our system on a pre-recorded dataset of trauma resuscitation and conducted an ablation study on different model variants. The best model achieved an average F1-score of 0.67 for 61 activity types. We include medical team feedback and discuss the future work.

preprint2022arXiv

FINT: Field-aware INTeraction Neural Network For CTR Prediction

As a critical component for online advertising and marking, click-through rate (CTR) prediction has draw lots of attentions from both industry and academia field. Recently, the deep learning has become the mainstream methodological choice for CTR. Despite of sustainable efforts have been made, existing approaches still pose several challenges. On the one hand, high-order interaction between the features is under-explored. On the other hand, high-order interactions may neglect the semantic information from the low-order fields. In this paper, we proposed a novel prediction method, named FINT, that employs the Field-aware INTeraction layer which captures high-order feature interactions while retaining the low-order field information. To empirically investigate the effectiveness and robustness of the FINT, we perform extensive experiments on the three realistic databases: KDD2012, Criteo and Avazu. The obtained results demonstrate that the FINT can significantly improve the performance compared to the existing methods, without increasing the amount of computation required. Moreover, the proposed method brought about 2.72\% increase to the advertising revenue of a big online video app through A/B testing. To better promote the research in CTR field, we released our code as well as reference implementation at: https://github.com/zhishan01/FINT.

preprint2022arXiv

Generating Privacy-Preserving Process Data with Deep Generative Models

Process data with confidential information cannot be shared directly in public, which hinders the research in process data mining and analytics. Data encryption methods have been studied to protect the data, but they still may be decrypted, which leads to individual identification. We experimented with different models of representation learning and used the learned model to generate synthetic process data. We introduced an adversarial generative network for process data generation (ProcessGAN) with two Transformer networks for the generator and the discriminator. We evaluated ProcessGAN and traditional models on six real-world datasets, of which two are public and four are collected in medical domains. We used statistical metrics and supervised learning scores to evaluate the synthetic data. We also used process mining to discover workflows for the authentic and synthetic datasets and had medical experts evaluate the clinical applicability of the synthetic workflows. We found that ProcessGAN outperformed traditional sequential models when trained on small authentic datasets of complex processes. ProcessGAN better represented the long-range dependencies between the activities, which is important for complicated processes such as the medical processes. Traditional sequential models performed better when trained on large data of simple processes. We conclude that ProcessGAN can generate a large amount of sharable synthetic process data indistinguishable from authentic data.

preprint2022arXiv

Investigating Non-local Features for Neural Constituency Parsing

Thanks to the strong representation power of neural encoders, neural chart-based parsers have achieved highly competitive performance by using local features. Recently, it has been shown that non-local features in CRF structures lead to improvements. In this paper, we investigate injecting non-local features into the training process of a local span-based parser, by predicting constituent n-gram non-local patterns and ensuring consistency between non-local patterns and local constituents. Results show that our simple method gives better results than the self-attentive parser on both PTB and CTB. Besides, our method achieves state-of-the-art BERT-based performance on PTB (95.92 F1) and strong performance on CTB (92.31 F1). Our parser also achieves better or competitive performance in multilingual and zero-shot cross-domain settings compared with the baseline.

preprint2022arXiv

Microscopic study of optically-stable, coherent color centers in diamond generated by high-temperature annealing

Single color centers in solid have emerged as promising physical platforms for quantum information science. Creating these centers with excellent quantum properties is a key foundation for further technological developments. In particular, the microscopic understanding of the spin bath environments is the key to engineer color centers for quantum control. In this work, we propose and demonstrate a distinct high-temperature annealing (HTA) approach for creating high-quality nitrogen vacancy (NV) centers in implantation-free diamonds. Simultaneously using the created NV centers as probes for their local environment we verify that no damage was microscopically induced by the HTA. Nearly all single NV centers created in ultra-low-nitrogen-concentration membranes possess stable and Fourier-transform-limited optical spectra. Furthermore, HTA strongly reduces noise sources naturally grown in ensemble samples, and leads to more than three-fold improvements of decoherence time and sensitivity. We also verify that the vacancy activation and defect reformation, especially H3 and P1 centers, can explain the reconfiguration between spin baths and color centers. This novel approach will become a powerful tool in vacancy-based quantum technology.

preprint2022arXiv

Monitoring AGNs with H$β$ Asymmetry. III. Long-term Reverberation Mapping Results of 15 Palomar-Green Quasars

In this third paper of the series reporting on the reverberation mapping (RM) campaign of active galactic nuclei with asymmetric H$β$ emission-line profiles, we present results for 15 Palomar-Green (PG) quasars using spectra obtained between the end of 2016 to May 2021. This campaign combines long time spans with relatively high cadence. For 8 objects, both the time lags obtained from the entire light curves and the measurements from individual observing seasons are provided. Reverberation mapping of 9 of our targets has been attempted for the first time, while the results for 6 others can be compared with previous campaigns. We measure the H$β$ time lags over periods of years and estimate their black hole masses. The long duration of the campaign enables us to investigate their broad line region (BLR) geometry and kinematics for different years by using velocity-resolved lags, which demonstrate signatures of diverse BLR geometry and kinematics. The BLR geometry and kinematics of individual objects are discussed. In this sample, the BLR kinematics of Keplerian/virialized motion and inflow is more common than outflow.

preprint2022arXiv

Multi-task Pre-training Language Model for Semantic Network Completion

Semantic networks, such as the knowledge graph, can represent the knowledge leveraging the graph structure. Although the knowledge graph shows promising values in natural language processing, it suffers from incompleteness. This paper focuses on knowledge graph completion by predicting linkage between entities, which is a fundamental yet critical task. Semantic matching is a potential solution as it can deal with unseen entities, which the translational distance based methods struggle with. However, to achieve competitive performance as translational distance based methods, semantic matching based methods require large-scale datasets for the training purpose, which are typically unavailable in practical settings. Therefore, we employ the language model and introduce a novel knowledge graph architecture named LP-BERT, which contains two main stages: multi-task pre-training and knowledge graph fine-tuning. In the pre-training phase, three tasks are taken to drive the model to learn the relationship from triples by predicting either entities or relations. While in the fine-tuning phase, inspired by contrastive learning, we design a triple-style negative sampling in a batch, which greatly increases the proportion of negative sampling while keeping the training time almost unchanged. Furthermore, we propose a new data augmentation method utilizing the inverse relationship of triples to improve the performance and robustness of the model. To demonstrate the effectiveness of our method, we conduct extensive experiments on three widely-used datasets, WN18RR, FB15k-237, and UMLS. The experimental results demonstrate the superiority of our methods, and our approach achieves state-of-the-art results on WN18RR and FB15k-237 datasets. Significantly, Hits@10 indicator is improved by 5% from previous state-of-the-art result on the WN18RR dataset while reaching 100% on the UMLS dataset.

preprint2022arXiv

Nuclear Norm Maximization Based Curiosity-Driven Learning

To handle the sparsity of the extrinsic rewards in reinforcement learning, researchers have proposed intrinsic reward which enables the agent to learn the skills that might come in handy for pursuing the rewards in the future, such as encouraging the agent to visit novel states. However, the intrinsic reward can be noisy due to the undesirable environment&#39;s stochasticity and directly applying the noisy value predictions to supervise the policy is detrimental to improve the learning performance and efficiency. Moreover, many previous studies employ $\ell^2$ norm or variance to measure the exploration novelty, which will amplify the noise due to the square operation. In this paper, we address aforementioned challenges by proposing a novel curiosity leveraging the nuclear norm maximization (NNM), which can quantify the novelty of exploring the environment more accurately while providing high-tolerance to the noise and outliers. We conduct extensive experiments across a variety of benchmark environments and the results suggest that NNM can provide state-of-the-art performance compared with previous curiosity methods. On 26 Atari games subset, when trained with only intrinsic reward, NNM achieves a human-normalized score of 1.09, which doubles that of competitive intrinsic rewards-based approaches. Our code will be released publicly to enhance the reproducibility.

preprint2022arXiv

SimCC: a Simple Coordinate Classification Perspective for Human Pose Estimation

The 2D heatmap-based approaches have dominated Human Pose Estimation (HPE) for years due to high performance. However, the long-standing quantization error problem in the 2D heatmap-based methods leads to several well-known drawbacks: 1) The performance for the low-resolution inputs is limited; 2) To improve the feature map resolution for higher localization precision, multiple costly upsampling layers are required; 3) Extra post-processing is adopted to reduce the quantization error. To address these issues, we aim to explore a brand new scheme, called \textit{SimCC}, which reformulates HPE as two classification tasks for horizontal and vertical coordinates. The proposed SimCC uniformly divides each pixel into several bins, thus achieving \emph{sub-pixel} localization precision and low quantization error. Benefiting from that, SimCC can omit additional refinement post-processing and exclude upsampling layers under certain settings, resulting in a more simple and effective pipeline for HPE. Extensive experiments conducted over COCO, CrowdPose, and MPII datasets show that SimCC outperforms heatmap-based counterparts, especially in low-resolution settings by a large margin.

preprint2022arXiv

Thin Accretion Disk onto slowly rotating black holes in Einstein-Æther theory

The accretion disk is formed by particles moving in closed orbits around a compact object, whose physical properties and the electromagnetic radiation characteristics are determined by the space-time geometry around the compact object. In this paper, we study the physical properties and the optical appearance of the electromagnetic radiation emitted from a thin accretion disk around the two types of the black hole solution in Einstein-Æther theory. We investigate in detail the effects of the æther field on the energy flux, temperature distribution, and electromagnetic spectrum of the disk in the two types of slowly rotating Einstein-Æther black holes. Then we plot the ray-traced redshifted image as well as the intensity and polarization profile of a lensed accretion disk around the two types of Einstein-Æther black holes. We found that from the image simulation, the æther field only has a certain effect on the central shadow area of the accretion disk.

preprint2022arXiv

Unified Visual Transformer Compression

Vision transformers (ViTs) have gained popularity recently. Even without customized image operators such as convolutions, ViTs can yield competitive performance when properly trained on massive data. However, the computational overhead of ViTs remains prohibitive, due to stacking multi-head self-attention modules and else. Compared to the vast literature and prevailing success in compressing convolutional neural networks, the study of Vision Transformer compression has also just emerged, and existing works focused on one or two aspects of compression. This paper proposes a unified ViT compression framework that seamlessly assembles three effective techniques: pruning, layer skipping, and knowledge distillation. We formulate a budget-constrained, end-to-end optimization framework, targeting jointly learning model weights, layer-wise pruning ratios/masks, and skip configurations, under a distillation loss. The optimization problem is then solved using the primal-dual algorithm. Experiments are conducted with several ViT variants, e.g. DeiT and T2T-ViT backbones on the ImageNet dataset, and our approach consistently outperforms recent competitors. For example, DeiT-Tiny can be trimmed down to 50\% of the original FLOPs almost without losing accuracy. Codes are available online:~\url{https://github.com/VITA-Group/UVC}.

preprint2021arXiv

SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification

Recently, x-vector has been a successful and popular approach for speaker verification, which employs a time delay neural network (TDNN) and statistics pooling to extract speaker characterizing embedding from variable-length utterances. Improvement upon the x-vector has been an active research area, and enormous neural networks have been elaborately designed based on the x-vector, eg, extended TDNN (E-TDNN), factorized TDNN (F-TDNN), and densely connected TDNN (D-TDNN). In this work, we try to identify the optimal architectures from a TDNN based search space employing neural architecture search (NAS), named SpeechNAS. Leveraging the recent advances in the speaker recognition, such as high-order statistics pooling, multi-branch mechanism, D-TDNN and angular additive margin softmax (AAM) loss with a minimum hyper-spherical energy (MHE), SpeechNAS automatically discovers five network architectures, from SpeechNAS-1 to SpeechNAS-5, of various numbers of parameters and GFLOPs on the large-scale text-independent speaker recognition dataset VoxCeleb1. Our derived best neural network achieves an equal error rate (EER) of 1.02% on the standard test set of VoxCeleb1, which surpasses previous TDNN based state-of-the-art approaches by a large margin. Code and trained weights are in https://github.com/wentaozhu/speechnas.git

preprint2020arXiv

Diverse electronic and magnetic properties of CrS2 enabling novel strain-controlled 2D lateral heterostructure spintronic devices

Lateral heterostructures of two-dimensional (2D) materials, integrating different phases or materials into a single piece of nanosheet, have attracted intensive research interests in the past few years for high-performance electronic and optoelectronic devices. It also holds promises to significantly improve the performance and enable new functions of spintronic devices. It is imperative to have a 2D material possessing diverse electronic and magnetic properties that are required in spintronics. In this work, using density functional theory calculations, we surveyed all IV, V and VI group transition metal dichalcogenides (TMDs) and discovered that CrS2 has the most diverse electronic and magnetic properties: antiferromagnetic (AFM) metallic 1T phase, nonmagnetic (NM) semiconductor 2H phase, and ferromagnetic (FM) semiconductor 1T_prime phase with a Curie temperature of ~1000 K. More interestingly, we found that a tensile or compressive strain could turn 1T_prime phase into a spin-up or spin-down half metal. Such a unique feature enables designing strain-controlled spintronic devices using a single piece of CrS2 crystal with improved energy efficiency, which remains a challenge in miniaturization of spintronic devices. In-depth analysis attributed the unique strain tunability to the interplay between strain-induced lattice deformation and different spatial orientation of the spin-up/spin-down electronic orbitals. A prototypical design of a simple spin-valve logic device operated by strain is also presented.

preprint2020arXiv

Event Arguments Extraction via Dilate Gated Convolutional Neural Network with Enhanced Local Features

Event Extraction plays an important role in information-extraction to understand the world. Event extraction could be split into two subtasks: one is event trigger extraction, the other is event arguments extraction. However, the F-Score of event arguments extraction is much lower than that of event trigger extraction, i.e. in the most recent work, event trigger extraction achieves 80.7%, while event arguments extraction achieves only 58%. In pipelined structures, the difficulty of event arguments extraction lies in its lack of classification feature, and the much higher computation consumption. In this work, we proposed a novel Event Extraction approach based on multi-layer Dilate Gated Convolutional Neural Network (EE-DGCNN) which has fewer parameters. In addition, enhanced local information is incorporated into word features, to assign event arguments roles for triggers predicted by the first subtask. The numerical experiments demonstrated significant performance improvement beyond state-of-art event extraction approaches on real-world datasets. Further analysis of extraction procedure is presented, as well as experiments are conducted to analyze impact factors related to the performance improvement.

preprint2020arXiv

In-Situ Studies of Stress Environment in Amorphous Solids Using Negatively Charged Nitrogen Vacancy Centers in Nanodiamond

Amorphous solids, which show characteristic differences from crystals, are common in daily usage. Glasses, gels, and polymers are familiar examples, and polymers are particularly important in terms of their role in construction and crafting. Previous studies have mainly focused on the bulk properties of polymeric products, and the local properties are less discussed. Here, we designed a distinctive protocol using the negatively charged nitrogen vacancy center in nanodiamond to study properties inside polymeric products in situ. Choosing the curing of poly dimethylsiloxane and the polymerization of cyanoacrylate as subjects of investigation, we measured the time dependence of local pressure and strain in the materials during the chemical processes. From the measurements, we were able to probe the local shear stress inside the two polymeric substances in situ. By regarding the surprisingly large shear stress as the internal tension, we attempted to provide a microscopic explanation for the ultimate tensile strength of a bulk solid. Our current methodology is applicable to any kind of transparent amorphous solids with the stress in the order of MPa and to the study of in situ properties in nanoscale. With better apparatus, we expect the limit can be pushed to sub-MPa scale.

preprint2020arXiv

Leveraging Adversarial Training in Self-Learning for Cross-Lingual Text Classification

In cross-lingual text classification, one seeks to exploit labeled data from one language to train a text classification model that can then be applied to a completely different language. Recent multilingual representation models have made it much easier to achieve this. Still, there may still be subtle differences between languages that are neglected when doing so. To address this, we present a semi-supervised adversarial training process that minimizes the maximal loss for label-preserving input perturbations. The resulting model then serves as a teacher to induce labels for unlabeled target language samples that can be used during further adversarial training, allowing us to gradually adapt our model to the target language. Compared with a number of strong baselines, we observe significant gains in effectiveness on document and intent classification for a diverse set of languages.

preprint2020arXiv

Probing local pressure environment in anvil cells with nitrogen vacancy (NV-) centers in diamond

Important discoveries have frequently been made through the studies of matter under high pressure. The conditions of the pressure environment are important for the interpretation of the experimental results. Due to various restrictions inside the pressure cell, detailed information relevant to the pressure environment, such as the pressure distribution, can be hard to obtain experimentally. Here we present the study of pressure distributions inside the pressure medium under different experimental conditions with NV centers in diamond particles as the sensor. These studies not only show a good spatial resolution, wide temperature and pressure working ranges, compatibility of the existing pressure cell design with the new method, but also demonstrate the usefulness to measure with these sensors as the pressure distribution is sensitive to various factors. The method and the results will benefit many disciplines such as material research and phase transitions in fluid dynamics.

preprint2020arXiv

SK-Unet: an Improved U-net Model with Selective Kernel for the Segmentation of Multi-sequence Cardiac MR

In the clinical environment, myocardial infarction (MI) as one com-mon cardiovascular disease is mainly evaluated based on the late gadolinium enhancement (LGE) cardiac magnetic resonance images (CMRIs). The auto-matic segmentations of left ventricle (LV), right ventricle (RV), and left ven-tricular myocardium (LVM) in the LGE CMRIs are desired for the aided diag-nosis in clinic. To accomplish this segmentation task, this paper proposes a modified U-net architecture by combining multi-sequence CMRIs, including the cine, LGE, and T2-weighted CMRIs. The cine and T2-weighted CMRIs are used to assist the segmentation in the LGE CMRIs. In this segmentation net-work, the squeeze-and-excitation residual (SE-Res) and selective kernel (SK) modules are inserted in the down-sampling and up-sampling stages, respective-ly. The SK module makes the obtained feature maps more informative in both spatial and channel-wise space, and attains more precise segmentation result. The utilized dataset is from the MICCAI challenge (MS-CMRSeg 2019), which is acquired from 45 patients including three CMR sequences. The cine and T2-weighted CMRIs acquired from 35 patients and the LGE CMRIs acquired from 5 patients are labeled. Our method achieves the mean dice score of 0.922 (LV), 0.827 (LVM), and 0.874 (RV) in the LGE CMRIs.

preprint2020arXiv

Spherical Accretion Flow onto General Parameterized Spherically Symmetric Black Hole Spacetimes

The transonic phenomenon of black hole accretion and the existence of the photon sphere are the characteristics of strong gravitational fields near a black hole horizon. In this work, we study spherical accretion flow onto a general parametrized spherically symmetric black hole spacetimes. For this purpose, we analyze the accretion process of various perfect fluids, such as the isothermal fluid of ultra-stiff, ultra-relativistic, and sub-relativistic types and polytropic fluid, respectively. The influences of extra parameters beyond the Schwarzschild black hole in the general parameterized spherically symmetric black hole on the flow behaviors of the above-mentioned test fluids are studied in detail. In addition, by studying the accretion of ideal photon gas, we further discuss the correspondence between the sonic radius of accreting photon gas and the photon sphere for the general parameterized spherically symmetric black hole. Some possible future extensions of our analysis are also discussed.

preprint2018arXiv

Quantum sensing of local magnetic field texture in strongly correlated electron systems under extreme conditions

An important feature of strong correlated electron systems is the tunability between interesting ground states such as unconventional superconductivity and exotic magnetism. Pressure is a clean, continuous and systematic tuning parameter. However, due to the restricted accessibility introduced by high-pressure devices, compatible magnetic field sensors with sufficient sensitivity are rare. This greatly limits the detections and detailed studies of pressure-induced phenomena. Here, we utilize nitrogen vacancy (NV) centers in diamond as a powerful, spatially-resolved vector field sensor for material research under pressure at cryogenic temperatures. Using a single crystal of BaFe2(As0:59P0:41)2 as an example, we extract the superconducting transition temperature (Tc), the local magnetic field profile in the Meissner state and the critical fields (Hc1 and Hc2). The method developed in this work will become a unique tool for tuning, probing and understanding quantum many body systems.