Source author record

Sheng Li

Sheng Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

60works

29topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

DRNet: All-in-One Image Restoration via Prior-Guided Dynamic Reparameterization

All-in-one image restoration aims to handle diverse degradations within a single model. However, existing methods often suffer from three key limitations: 1) per-input computational overhead from dynamic degradation estimation; 2) optimization challenges due to task heterogeneity; and 3) inefficient, frequency-agnostic encoder designs. To overcome these, we introduce the Dynamic Reparameterization Network (DRNet), a novel framework operating on an initialization-stage reconfiguration paradigm that fundamentally eliminates per-input overhead. At its core, a Dynamic Reparameterization MLP (DRMLP) guided by a Task-Specific Modulator (TSM), which effectively mitigates task heterogeneity by orchestrating both specific restoration goals and a versatile general-purpose mode within a unified architecture. Furthermore, we incorporate a Continuous Wavelet Transform Encoder (CWTE) that explicitly leverages frequency characteristics via wavelet decomposition for a lightweight yet powerful design. Extensive experiments demonstrate that DRNet achieves state-of-the-art performance across five restoration tasks with superior parameter efficiency. Crucially, it showcases unique flexibility, excelling as both a highly competitive foundation model for blind restoration and a top-performing user-guided specialist.

preprint2025arXiv

Observation of robust one-dimensional edge channels in a three-dimensional quantum spin Hall insulator

Topologically protected edge channels show prospects for quantum devices. They have been found experimentally in two-dimensional (2D) quantum spin Hall insulators (QSHIs), weak topological insulators and higher-order topological insulators (HOTIs), but the number of materials realizing these topologies is still quite limited. Here, we provide evidence for topological edge states within a novel topology named three-dimensional (3D) QSHIs. Its topology originates solely from a nonzero $S_z$ spin Chern number for each $k_z$ plane of the crystal and is realized in bulk $α$-Bi$_4$I$_4$ with trivial symmetry indicators, as we show by density functional theory calculations. We experimentally observe the related edge states at each type of monolayer and bilayer step of this material by scanning tunneling microscopy. Consistently, the edge states are neither interrupted, nor backscattered by defects at the step edges corroborating their helical character as expected from the nontrivial topology. Furthermore, two individual edge channels are directly observed at bilayer steps without visible interaction gap opening, demonstrating the robustness of these edge modes against vertical stacking. Our results establish $α$-Bi$_4$I$_4$ as the first material realization of a 3D QSHI whose definition goes beyond the scope of topological symmetry indicators, and provide a pathway for realizing nearly-quantized spin Hall conductivity per unit cell in a bulk crystal.

preprint2024arXiv

From Covert Hiding to Visual Editing: Robust Generative Video Steganography

Traditional video steganography methods are based on modifying the covert space for embedding, whereas we propose an innovative approach that embeds secret message within semantic feature for steganography during the video editing process. Although existing traditional video steganography methods display a certain level of security and embedding capacity, they lack adequate robustness against common distortions in online social networks (OSNs). In this paper, we introduce an end-to-end robust generative video steganography network (RoGVS), which achieves visual editing by modifying semantic feature of videos to embed secret message. We employ face-swapping scenario to showcase the visual editing effects. We first design a secret message embedding module to adaptively hide secret message into the semantic feature of videos. Extensive experiments display that the proposed RoGVS method applied to facial video datasets demonstrate its superiority over existing video and image steganography techniques in terms of both robustness and capacity.

preprint2024arXiv

Object-oriented backdoor attack against image captioning

Backdoor attack against image classification task has been widely studied and proven to be successful, while there exist little research on the backdoor attack against vision-language models. In this paper, we explore backdoor attack towards image captioning models by poisoning training data. Assuming the attacker has total access to the training dataset, and cannot intervene in model construction or training process. Specifically, a portion of benign training samples is randomly selected to be poisoned. Afterwards, considering that the captions are usually unfolded around objects in an image, we design an object-oriented method to craft poisons, which aims to modify pixel values by a slight range with the modification number proportional to the scale of the current detected object region. After training with the poisoned data, the attacked model behaves normally on benign images, but for poisoned images, the model will generate some sentences irrelevant to the given image. The attack controls the model behavior on specific test images without sacrificing the generation performance on benign test images. Our method proves the weakness of image captioning models to backdoor attack and we hope this work can raise the awareness of defending against backdoor attack in the image captioning field.

preprint2024arXiv

PROMPT-IML: Image Manipulation Localization with Pre-trained Foundation Models Through Prompt Tuning

Deceptive images can be shared in seconds with social networking services, posing substantial risks. Tampering traces, such as boundary artifacts and high-frequency information, have been significantly emphasized by massive networks in the Image Manipulation Localization (IML) field. However, they are prone to image post-processing operations, which limit the generalization and robustness of existing methods. We present a novel Prompt-IML framework. We observe that humans tend to discern the authenticity of an image based on both semantic and high-frequency information, inspired by which, the proposed framework leverages rich semantic knowledge from pre-trained visual foundation models to assist IML. We are the first to design a framework that utilizes visual foundation models specially for the IML task. Moreover, we design a Feature Alignment and Fusion module to align and fuse features of semantic features with high-frequency features, which aims at locating tampered regions from multiple perspectives. Experimental results demonstrate that our model can achieve better performance on eight typical fake image datasets and outstanding robustness.

preprint2023arXiv

Trojaning semi-supervised learning model via poisoning wild images on the web

Wild images on the web are vulnerable to backdoor (also called trojan) poisoning, causing machine learning models learned on these images to be injected with backdoors. Most previous attacks assumed that the wild images are labeled. In reality, however, most images on the web are unlabeled. Specifically, we study the effects of unlabeled backdoor images under semi-supervised learning (SSL) on widely studied deep neural networks. To be realistic, we assume that the adversary is zero-knowledge and that the semi-supervised learning model is trained from scratch. Firstly, we find the fact that backdoor poisoning always fails when poisoned unlabeled images come from different classes, which is different from poisoning the labeled images. The reason is that the SSL algorithms always strive to correct them during training. Therefore, for unlabeled images, we implement backdoor poisoning on images from the target class. Then, we propose a gradient matching strategy to craft poisoned images such that their gradients match the gradients of target images on the SSL model, which can fit poisoned images to the target class and realize backdoor injection. To the best of our knowledge, this may be the first approach to backdoor poisoning on unlabeled images of trained-from-scratch SSL models. Experiments show that our poisoning achieves state-of-the-art attack success rates on most SSL algorithms while bypassing modern backdoor defenses.

preprint2022arXiv

A DTCWT-SVD Based Video Watermarking resistant to frame rate conversion

Videos can be easily tampered, copied and redistributed by attackers for illegal and monetary usage. Such behaviors severely jeopardize the interest of content owners. Despite huge efforts made in digital video watermarking for copyright protection, typical distortions in video transmission including signal attacks, geometric attacks and temporal synchronization attacks can still easily erase the embedded signal. Among them, temporal synchronization attacks which include frame dropping, frame insertion and frame rate conversion is one of the most prevalent attacks. To address this issue, we present a new video watermarking based on joint Dual-Tree Cosine Wavelet Transformation (DTCWT) and Singular Value Decomposition (SVD), which is resistant to frame rate conversion. We first extract a set of candidate coefficient by applying SVD decomposition after DTCWT transform. Then, we simulate the watermark embedding by adjusting the shape of candidate coefficient. Finally, we perform group-level watermarking that includes moderate temporal redundancy to resist temporal desynchronization attacks. Extensive experimental results show that the proposed scheme is more resilient to temporal desynchronization attacks and performs better than the existing blind video watermarking schemes.

preprint2022arXiv

Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized Activations

Artificial neural networks (ANNs), originally inspired by biological neural networks (BNNs), have achieved remarkable successes in many tasks such as visual representation learning. However, whether there exists semantic correlations/connections between the visual representations in ANNs and those in BNNs remains largely unexplored due to both the lack of an effective tool to link and couple two different domains, and the lack of a general and effective framework of representing the visual semantics in BNNs such as human functional brain networks (FBNs). To answer this question, we propose a novel computational framework, Synchronized Activations (Sync-ACT), to couple the visual representation spaces and semantics between ANNs and BNNs in human brain based on naturalistic functional magnetic resonance imaging (nfMRI) data. With this approach, we are able to semantically annotate the neurons in ANNs with biologically meaningful description derived from human brain imaging for the first time. We evaluated the Sync-ACT framework on two publicly available movie-watching nfMRI datasets. The experiments demonstrate a) the significant correlation and similarity of the semantics between the visual representations in FBNs and those in a variety of convolutional neural networks (CNNs) models; b) the close relationship between CNN's visual representation similarity to BNNs and its performance in image classification tasks. Overall, our study introduces a general and effective paradigm to couple the ANNs and BNNs and provides novel insights for future studies such as brain-inspired artificial intelligence.

preprint2022arXiv

Emergence of insulating ferrimagnetism and perpendicular magnetic anisotropy in 3d-5d perovskite oxide composite films for insulator spintronic

Magnetic insulators with strong perpendicular magnetic anisotropy (PMA) play a key role in exploring pure spin current phenomena and developing ultralow-dissipation spintronic devices, thereby it is highly desirable to develop new material platforms. Here we report epitaxial growth of La2/3Sr1/3MnO3 (LSMO)-SrIrO3 (SIO) composite oxide films (LSMIO) with different crystalline orientations fabricated by sequential two-target ablation process using pulsed laser deposition. The LSMIO films exhibit high crystalline quality with homogeneous mixture of LSMO and SIO at atomic level. Ferrimagnetic and insulating transport characteristics are observed, with the temperature-dependent electric resistivity well fitted by Mott variable-range-hopping model. Moreover, the LSMIO films show strong PMA. Through further constructing all perovskite oxide heterostructures of the ferrimagnetic insulator LSMIO and a strong spin-orbital coupled SIO layer, pronounced spin Hall magnetoresistance (SMR) and spin Hall-like anomalous Hall effect (SH-AHE) were observed. These results illustrate the potential application of the ferrimagnetic insulator LSMIO in developing all-oxide ultralow-dissipation spintronic devices.

preprint2022arXiv

Exploring Depth Information for Face Manipulation Detection

Face manipulation detection has been receiving a lot of attention for the reliability and security of the face images. Recent studies focus on using auxiliary information or prior knowledge to capture robust manipulation traces, which are shown to be promising. As one of the important face features, the face depth map, which has shown to be effective in other areas such as the face recognition or face detection, is unfortunately paid little attention to in literature for detecting the manipulated face images. In this paper, we explore the possibility of incorporating the face depth map as auxiliary information to tackle the problem of face manipulation detection in real world applications. To this end, we first propose a Face Depth Map Transformer (FDMT) to estimate the face depth map patch by patch from a RGB face image, which is able to capture the local depth anomaly created due to manipulation. The estimated face depth map is then considered as auxiliary information to be integrated with the backbone features using a Multi-head Depth Attention (MDA) mechanism that is newly designed. Various experiments demonstrate the advantage of our proposed method for face manipulation detection.

preprint2022arXiv

Fusion of Self-supervised Learned Models for MOS Prediction

We participated in the mean opinion score (MOS) prediction challenge, 2022. This challenge aims to predict MOS scores of synthetic speech on two tracks, the main track and a more challenging sub-track: out-of-domain (OOD). To improve the accuracy of the predicted scores, we have explored several model fusion-related strategies and proposed a fused framework in which seven pretrained self-supervised learned (SSL) models have been engaged. These pretrained SSL models are derived from three ASR frameworks, including Wav2Vec, Hubert, and WavLM. For the OOD track, we followed the 7 SSL models selected on the main track and adopted a semi-supervised learning method to exploit the unlabeled data. According to the official analysis results, our system has achieved 1st rank in 6 out of 16 metrics and is one of the top 3 systems for 13 out of 16 metrics. Specifically, we have achieved the highest LCC, SRCC, and KTAU scores at the system level on main track, as well as the best performance on the LCC, SRCC, and KTAU evaluation metrics at the utterance level on OOD track. Compared with the basic SSL models, the prediction accuracy of the fused system has been largely improved, especially on OOD sub-track.

preprint2022arXiv

Generative Steganography Network

Steganography usually modifies cover media to embed secret data. A new steganographic approach called generative steganography (GS) has emerged recently, in which stego images (images containing secret data) are generated from secret data directly without cover media. However, existing GS schemes are often criticized for their poor performances. In this paper, we propose an advanced generative steganography network (GSN) that can generate realistic stego images without using cover images. We firstly introduce the mutual information mechanism in GS, which helps to achieve high secret extraction accuracy. Our model contains four sub-networks, i.e., an image generator ($G$), a discriminator ($D$), a steganalyzer ($S$), and a data extractor ($E$). $D$ and $S$ act as two adversarial discriminators to ensure the visual quality and security of generated stego images. $E$ is to extract the hidden secret from generated stego images. The generator $G$ is flexibly constructed to synthesize either cover or stego images with different inputs. It facilitates covert communication by concealing the function of generating stego images in a normal generator. A module named secret block is designed to hide secret data in the feature maps during image generation, with which high hiding capacity and image fidelity are achieved. In addition, a novel hierarchical gradient decay (HGD) skill is developed to resist steganalysis detection. Experiments demonstrate the superiority of our work over existing methods.

preprint2022arXiv

Hierarchical Capsule Prediction Network for Marketing Campaigns Effect

Marketing campaigns are a set of strategic activities that can promote a business's goal. The effect prediction for marketing campaigns in a real industrial scenario is very complex and challenging due to the fact that prior knowledge is often learned from observation data, without any intervention for the marketing campaign. Furthermore, each subject is always under the interference of several marketing campaigns simultaneously. Therefore, we cannot easily parse and evaluate the effect of a single marketing campaign. To the best of our knowledge, there are currently no effective methodologies to solve such a problem, i.e., modeling an individual-level prediction task based on a hierarchical structure with multiple intertwined events. In this paper, we provide an in-depth analysis of the underlying parse tree-like structure involved in the effect prediction task and we further establish a Hierarchical Capsule Prediction Network (HapNet) for predicting the effects of marketing campaigns. Extensive results based on both the synthetic data and real data demonstrate the superiority of our model over the state-of-the-art methods and show remarkable practicability in real industrial applications.

preprint2022arXiv

High-Capacity Framework for Reversible Data Hiding in Encrypted Image Using Pixel Predictions and Entropy Encoding

While the existing vacating room before encryption (VRBE) based schemes can achieve decent embedding rate, the payloads of the existing vacating room after encryption (VRAE) based schemes are relatively low. To address this issue, this paper proposes a generalized framework for high-capacity RDHEI for both VRBE and VRAE cases. First, an efficient embedding room generation algorithm (ERGA) is designed to produce large embedding room by using pixel prediction and entropy encoding. Then, we propose two RDHEI schemes, one for VRBE, another for VRAE. In the VRBE scenario, the image owner generates the embedding room with ERGA and encrypts the preprocessed image by using the stream cipher with two encryption keys. Then, the data hider locates the embedding room and embeds the encrypted additional data. In the VRAE scenario, the cover image is encrypted by an improved block modulation and permutation encryption algorithm, where the spatial redundancy in the plain-text image is largely preserved. Then, the data hider applies ERGA on the encrypted image to generate the embedding room and conducts data embedding. For both schemes, the receivers with different authentication keys can respectively conduct error-free data extraction and/or error-free image recovery. The experimental results show that the two proposed schemes outperform many state-of-the-art RDHEI arts. Besides, the schemes can ensure high security level, where the original image can be hardly discovered from the encrypted version before and after data hiding by the unauthorized user.

preprint2022arXiv

Image Generation Network for Covert Transmission in Online Social Network

Online social networks have stimulated communications over the Internet more than ever, making it possible for secret message transmission over such noisy channels. In this paper, we propose a Coverless Image Steganography Network, called CIS-Net, that synthesizes a high-quality image directly conditioned on the secret message to transfer. CIS-Net is composed of four modules, namely, the Generation, Adversarial, Extraction, and Noise Module. The receiver can extract the hidden message without any loss even the images have been distorted by JPEG compression attacks. To disguise the behaviour of steganography, we collected images in the context of profile photos and stickers and train our network accordingly. As such, the generated images are more inclined to escape from malicious detection and attack. The distinctions from previous image steganography methods are majorly the robustness and losslessness against diverse attacks. Experiments over diverse public datasets have manifested the superior ability of anti-steganalysis.

preprint2022arXiv

Learning Infomax and Domain-Independent Representations for Causal Effect Inference with Real-World Data

The foremost challenge to causal inference with real-world data is to handle the imbalance in the covariates with respect to different treatment options, caused by treatment selection bias. To address this issue, recent literature has explored domain-invariant representation learning based on different domain divergence metrics (e.g., Wasserstein distance, maximum mean discrepancy, position-dependent metric, and domain overlap). In this paper, we reveal the weaknesses of these strategies, i.e., they lead to the loss of predictive information when enforcing the domain invariance; and the treatment effect estimation performance is unstable, which heavily relies on the characteristics of the domain distributions and the choice of domain divergence metrics. Motivated by information theory, we propose to learn the Infomax and Domain-Independent Representations to solve the above puzzles. Our method utilizes the mutual information between the global feature representations and individual feature representations, and the mutual information between feature representations and treatment assignment predictions, in order to maximally capture the common predictive information for both treatment and control groups. Moreover, our method filters out the influence of instrumental and irrelevant variables, and thus it effectively increases the predictive ability of potential outcomes. Experimental results on both the synthetic and real-world datasets show that our method achieves state-of-the-art performance on causal effect inference. Moreover, our method exhibits reliable prediction performances when facing data with different characteristics of data distributions, complicated variable types, and severe covariate imbalance.

preprint2022arXiv

Multi-Task Adversarial Learning for Treatment Effect Estimation in Basket Trials

Estimating treatment effects from observational data provides insights about causality guiding many real-world applications such as different clinical study designs, which are the formulations of trials, experiments, and observational studies in medical, clinical, and other types of research. In this paper, we describe causal inference for application in a novel clinical design called basket trial that tests how well a new drug works in patients who have different types of cancer that all have the same mutation. We propose a multi-task adversarial learning (MTAL) method, which incorporates feature selection multi-task representation learning and adversarial learning to estimate potential outcomes across different tumor types for patients sharing the same genetic mutation but having different tumor types. In our paper, the basket trial is employed as an intuitive example to present this new causal inference setting. This new causal inference setting includes, but is not limited to basket trials. This setting has the same challenges as the traditional causal inference problem, i.e., missing counterfactual outcomes under different subgroups and treatment selection bias due to confounders. We present the practical advantages of our MTAL method for the analysis of synthetic basket trial data and evaluate the proposed estimator on two benchmarks, IHDP and News. The results demonstrate the superiority of our MTAL method over the competing state-of-the-art methods.

preprint2022arXiv

Multimodal Fake News Detection via CLIP-Guided Learning

Multimodal fake news detection has attracted many research interests in social forensics. Many existing approaches introduce tailored attention mechanisms to guide the fusion of unimodal features. However, how the similarity of these features is calculated and how it will affect the decision-making process in FND are still open questions. Besides, the potential of pretrained multi-modal feature learning models in fake news detection has not been well exploited. This paper proposes a FND-CLIP framework, i.e., a multimodal Fake News Detection network based on Contrastive Language-Image Pretraining (CLIP). Given a targeted multimodal news, we extract the deep representations from the image and text using a ResNet-based encoder, a BERT-based encoder and two pair-wise CLIP encoders. The multimodal feature is a concatenation of the CLIP-generated features weighted by the standardized cross-modal similarity of the two modalities. The extracted features are further processed for redundancy reduction before feeding them into the final classifier. We introduce a modality-wise attention module to adaptively reweight and aggregate the features. We have conducted extensive experiments on typical fake news datasets. The results indicate that the proposed framework has a better capability in mining crucial features for fake news detection. The proposed FND-CLIP can achieve better performances than previous works, i.e., 0.7\%, 6.8\% and 1.3\% improvements in overall accuracy on Weibo, Politifact and Gossipcop, respectively. Besides, we justify that CLIP-based learning can allow better flexibility on multimodal feature selection.

preprint2022arXiv

NeuralSound: Learning-based Modal Sound Synthesis With Acoustic Transfer

We present a novel learning-based modal sound synthesis approach that includes a mixed vibration solver for modal analysis and an end-to-end sound radiation network for acoustic transfer. Our mixed vibration solver consists of a 3D sparse convolution network and a Locally Optimal Block Preconditioned Conjugate Gradient module (LOBPCG) for iterative optimization. Moreover, we highlight the correlation between a standard modal vibration solver and our network architecture. Our radiation network predicts the Far-Field Acoustic Transfer maps (FFAT Maps) from the surface vibration of the object. The overall running time of our learning method for any new object is less than one second on a GTX 3080 Ti GPU while maintaining a high sound quality close to the ground truth that is computed using standard numerical methods. We also evaluate the numerical accuracy and perceptual accuracy of our sound synthesis approach on different objects corresponding to various materials.

preprint2022arXiv

Robust Watermarking for Video Forgery Detection with Improved Imperceptibility and Robustness

Videos are prone to tampering attacks that alter the meaning and deceive the audience. Previous video forgery detection schemes find tiny clues to locate the tampered areas. However, attackers can successfully evade supervision by destroying such clues using video compression or blurring. This paper proposes a video watermarking network for tampering localization. We jointly train a 3D-UNet-based watermark embedding network and a decoder that predicts the tampering mask. The perturbation made by watermark embedding is close to imperceptible. Considering that there is no off-the-shelf differentiable video codec simulator, we propose to mimic video compression by ensembling simulation results of other typical attacks, e.g., JPEG compression and blurring, as an approximation. Experimental results demonstrate that our method generates watermarked videos with good imperceptibility and robustly and accurately locates tampered areas within the attacked version.

preprint2021arXiv

Cooperative control of perpendicular magnetic anisotropy via crystal structure and orientation in single-crystal flexible SrRuO3 membranes

Flexible magnetic materials with robust and controllable perpendicular magnetic anisotropy (PMA) are highly desirable for developing flexible high-performance spintronic devices. However, it is still challenge to fabricate PMA films through current techniques of direct deposition on polymers. Here, we report a facile method for synthesizing single-crystal freestanding SrRuO3 (SRO) membranes with controlled crystal structure and orientation using water-soluble Ca3-xSrxAl2O6 sacrificial layers. Through cooperative effect of crystal structure and orientation engineering, flexible SrRuO3 membranes reveal highly tunable magnetic anisotropy from in-plane to our-of-plane with a remarkable PMA energy of 7.34*106 erg/cm3. Based on the first-principles calculations, it reveals that the underlying mechanism of PMA modulation is intimately correlated with structure-controlled Ru 4d-orbital occupation, as well as the spin-orbital matrix element differences, dependent on the crystal orientation. In addition, there are no obvious changes of the magnetism after 10,000 bending cycles, indicating an excellent magnetism reliability in the prepared films. This work provides a feasible approach to prepare the flexible oxide films with strong and controllable PMA.

preprint2021arXiv

Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) requires coordination to efficiently solve certain tasks. Fully centralized control is often infeasible in such domains due to the size of joint action spaces. Coordination graph based formalization allows reasoning about the joint action based on the structure of interactions. However, they often require domain expertise in their design. This paper introduces the deep implicit coordination graph (DICG) architecture for such scenarios. DICG consists of a module for inferring the dynamic coordination graph structure which is then used by a graph neural network based module to learn to implicitly reason about the joint actions or values. DICG allows learning the tradeoff between full centralization and decentralization via standard actor-critic methods to significantly improve coordination for domains with large number of agents. We apply DICG to both centralized-training-centralized-execution and centralized-training-decentralized-execution regimes. We demonstrate that DICG solves the relative overgeneralization pathology in predatory-prey tasks as well as outperforms various MARL baselines on the challenging StarCraft II Multi-agent Challenge (SMAC) and traffic junction environments.

preprint2021arXiv

Enhanced Superconductivity in the Se-substituted 1T-PdTe$_2$

Two-dimensional transition metal dichalcogenide PdTe$_2$ recently attracts much attention due to its phase coexistence of type-II Dirac semimetal and type-I superconductivity. Here we report a 67 % enhancement of superconducting transition temperature in the 1T-PdSeTe in comparison to that of PdTe2 through partial substitution of Te atoms by Se. The superconductivity has been unambiguously confirmed by the magnetization, resistivity and specific heat measurements. 1T-PdSeTe shows type-II superconductivity with large anisotropy and non-bulk superconductivity nature with volume fraction ~ 20 % estimated from magnetic and heat capacity measurements. 1T-PdSeTe expands the family of superconducting transition metal dichalcogenides and thus provides additional insights for understanding superconductivity and topological physics in the 1T-PdTe$_2$ system

preprint2021arXiv

Lateral modulation of magnetic anisotropy in tricolor 3d-5d oxide superlattices

Manipulating magnetic anisotropy (MA) purposefully in transition metal oxides (TMOs) enables the development of oxide-based spintronic devices with practical applications. Here, we report a pathway to reversibly switch the lateral magnetic easy-axis via interfacial oxygen octahedral coupling (OOC) effects in 3d-5d tricolor superlattices, i.e. [SrIrO3,mRTiO3,SrIrO3,2La0.67Sr0.33MnO3]10 (RTiO3: SrTiO3 and CaTiO3). In the heterostructures, the anisotropy energy (MAE) is enhanced over one magnitude to ~106 erg/cm3 compared to La0.67Sr0.33MnO3 films. Moreover, the magnetic easy-axis is reversibly reoriented between (100)- and (110)-directions by changing the RTiO3. Using first-principles density functional theory calculations, we find that the SrIrO3 owns a large single-ion anisotropy due to its strong spin-orbit interaction. This anisotropy can be reversibly controlled by the OOC, then reorient the easy-axis of the superlattices. Additionally, it enlarges the MAE of the films via the cooperation with a robust orbital hybridization between the Ir and Mn atoms. Our results indicate that the tricolor superlattices consisting of 3d and 5d oxides provide a powerful platform to study the MA and develop oxide-based spintronic devices.

preprint2021arXiv

Learning Emergent Discrete Message Communication for Cooperative Reinforcement Learning

Communication is a important factor that enables agents work cooperatively in multi-agent reinforcement learning (MARL). Most previous work uses continuous message communication whose high representational capacity comes at the expense of interpretability. Allowing agents to learn their own discrete message communication protocol emerged from a variety of domains can increase the interpretability for human designers and other agents.This paper proposes a method to generate discrete messages analogous to human languages, and achieve communication by a broadcast-and-listen mechanism based on self-attention. We show that discrete message communication has performance comparable to continuous message communication but with much a much smaller vocabulary size.Furthermore, we propose an approach that allows humans to interactively send discrete messages to agents.

preprint2021arXiv

Searching for Fast Model Families on Datacenter Accelerators

Neural Architecture Search (NAS), together with model scaling, has shown remarkable progress in designing high accuracy and fast convolutional architecture families. However, as neither NAS nor model scaling considers sufficient hardware architecture details, they do not take full advantage of the emerging datacenter (DC) accelerators. In this paper, we search for fast and accurate CNN model families for efficient inference on DC accelerators. We first analyze DC accelerators and find that existing CNNs suffer from insufficient operational intensity, parallelism, and execution efficiency. These insights let us create a DC-accelerator-optimized search space, with space-to-depth, space-to-batch, hybrid fused convolution structures with vanilla and depthwise convolutions, and block-wise activation functions. On top of our DC accelerator optimized neural architecture search space, we further propose a latency-aware compound scaling (LACS), the first multi-objective compound scaling method optimizing both accuracy and latency. Our LACS discovers that network depth should grow much faster than image size and network width, which is quite different from previous compound scaling results. With the new search space and LACS, our search and scaling on datacenter accelerators results in a new model series named EfficientNet-X. EfficientNet-X is up to more than 2X faster than EfficientNet (a model series with state-of-the-art trade-off on FLOPs and accuracy) on TPUv3 and GPUv100, with comparable accuracy. EfficientNet-X is also up to 7X faster than recent RegNet and ResNeSt on TPUv3 and GPUv100.

preprint2020arXiv

A Survey on Causal Inference

Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics, for decades. Nowadays, estimating causal effect from observational data has become an appealing research direction owing to the large amount of available data and low budget requirement, compared with randomized controlled trials. Embraced with the rapidly developed machine learning area, various causal effect estimation methods for observational data have sprung up. In this survey, we provide a comprehensive review of causal inference methods under the potential outcome framework, one of the well known causal inference framework. The methods are divided into two categories depending on whether they require all three assumptions of the potential outcome framework or not. For each category, both the traditional statistical methods and the recent machine learning enhanced methods are discussed and compared. The plausible applications of these methods are also presented, including the applications in advertising, recommendation, medicine and so on. Moreover, the commonly used benchmark datasets as well as the open-source codes are also summarized, which facilitate researchers and practitioners to explore, evaluate and apply the causal inference methods.

preprint2020arXiv

Analysis of Fleet Management and Network Design for On-Demand Urban Air Mobility Operations

A significant challenge in estimating operational feasibility of Urban Air Mobility (UAM) missions lies in understanding how choices in design impact the performance of a complex system-of-systems. This work examines the ability of the UAM ecosystem and the operations within it to meet a variety of demand profiles that may emerge in the coming years. We perform a set of simulation driven feasibility and scalability analyses based on UAM operational models with the goal of estimating capacity and throughput for a given set of parameters that represent an operational UAM ecosystem. UAM ecosystem design guidelines, vehicle constraints, and effective operational policies can be drawn from our analysis. Results show that, while critical for enabling UAM, the performance of the UAM ecosystem is robust to variations in ground infrastructure and fleet design decisions, while being sensitive to decisions for fleet and traffic management policies. We show that so long as the ecosystem design parameters for ground infrastructure and fleet design fall within a sensible range, the performance of the UAM ecosystem is affected by the policies used to manage the UAM traffic.

preprint2020arXiv

Cross-scale Attention Model for Acoustic Event Classification

A major advantage of a deep convolutional neural network (CNN) is that the focused receptive field size is increased by stacking multiple convolutional layers. Accordingly, the model can explore the long-range dependency of features from the top layers. However, a potential limitation of the network is that the discriminative features from the bottom layers (which can model the short-range dependency) are smoothed out in the final representation. This limitation is especially evident in the acoustic event classification (AEC) task, where both short- and long-duration events are involved in an audio clip and needed to be classified. In this paper, we propose a cross-scale attention (CSA) model, which explicitly integrates features from different scales to form the final representation. Moreover, we propose the adoption of the attention mechanism to specify the weights of local and global features based on the spatial and temporal characteristics of acoustic events. Using mathematic formulations, we further reveal that the proposed CSA model can be regarded as a weighted residual CNN (ResCNN) model when the ResCNN is used as a backbone model. We tested the proposed model on two AEC datasets: one is an urban AEC task, and the other is an AEC task in smart car environments. Experimental results show that the proposed CSA model can effectively improve the performance of current state-of-the-art deep learning algorithms.

preprint2020arXiv

Effect of isotope disorder on the Raman spectra of cubic boron arsenide

Boron arsenide (c-BAs) is at the forefront of research on ultrahigh thermal conductivity materials. We present a Raman scattering study of isotopically tailored cubic boron arsenide single crystals for 11 isotopic compositions spanning the range from nearly pure c-$^{10}$BAs to nearly pure c-$^{11}$BAs. Our results provide insights on the effects of strong mass disorder on optical phonons and the appearance of two-mode behavior in the Raman spectra of mixed crystals. Strong isotope disorder also relaxes the one-phonon Raman selection rules, resulting in disorder-activated Raman scattering by acoustic phonons.

preprint2020arXiv

Learning Robust Data Representation: A Knowledge Flow Perspective

It is always demanding to learn robust visual representation for various learning problems; however, this learning and maintenance process usually suffers from noise, incompleteness or knowledge domain mismatch. Thus, robust representation learning by removing noisy features or samples, complementing incomplete data, and mitigating the distribution difference becomes the key. Along this line of research, low-rank modeling has been widely-applied to solving representation learning challenges. This survey covers the topic from a knowledge flow perspective in terms of: (1) robust knowledge recovery, (2) robust knowledge transfer, and (3) robust knowledge fusion, centered around several major applications. First of all, we deliver a unified formulation for robust knowledge discovery given single dataset. Second, we discuss robust knowledge transfer and fusion given multiple datasets with different knowledge flows, followed by practical challenges, model variations, and remarks. Finally, we highlight future research of robust knowledge discovery for incomplete, unbalance, large-scale data analysis. This would benefit AI community from literature review to future direction.

preprint2020arXiv

Non-Abelian Aharonov-Bohm Caging in Photonic Lattices

Aharonov-Bohm (AB) caging is the localization effect in translational-invariant lattices due to destructive interference induced by penetrated magnetic fields. While current research focuses mainly on the case of Abelian AB caging, here we go beyond and develop the non-Abelian AB caging concept by considering the particle localization in a 1D multi-component rhombic lattice with non-Abelian background gauge field. In contrast to its Abelian counterpart, the non-Abelian AB cage depends on both the form of the nilpotent interference matrix and the initial state of the lattice. This phenomena is the consequence of the non-Abelian nature of the gauge potential and thus has no Abelian analog. We further propose a circuit quantum electrodynamics realization of the proposed physics, in which the required non-Abelian gauge field can be synthesized by the parametric conversion method, and the non-Abelian AB caging can be unambiguously demonstrated through the pumping and the steady-state measurements of only a few sites on the lattice. Requiring only currently available technique, our proposal can be readily tested in experiment and may pave a new route towards the investigation of exotic photonic quantum fluids.

preprint2020arXiv

Novel polymorphic phase of BaCu2As2: impact of flux for new phase formation in crystal growth

In this work, we have thoroughly studied the effects of flux composition and temperature on the crystal growth of the BaCu2As2 compound. While Pb and CuAs self-flux produce the well-known α-phase ThCr2Si2-type structure (Z=2), a new polymorphic phase of BaCu2As2 (\b{eta} phase) with a much larger c lattice parameter (Z=10), which could be considered an intergrowth of the ThCr2Si2- and CaBe2Ge2-type structures, has been discovered via Sn flux growth. We have characterized this structure through single-crystal X-ray diffraction, transmission electron microscopy (TEM), and scanning transmission electron microscopy (STEM) studies. Furthermore, we compare this new polymorphic intergrowth structure with the α-phase BaCu2As2 (ThCr2Si2 type with Z=2) and the \b{eta}-phase BaCu2Sb2 (intergrowth of ThCr2Si2 and CaBe2Ge2 types with Z=6), both with the same space group I4/mmm. Electrical transport studies reveal p-type carriers and magnetoresistivity up to 22% at 5 K and under a magnetic field of 7 T. Our work suggests a new route for the discovery of new polymorphic structures through flux and temperature control during material synthesis.

preprint2020arXiv

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud

3D vehicle detection based on point cloud is a challenging task in real-world applications such as autonomous driving. Despite significant progress has been made, we observe two aspects to be further improved. First, the semantic context information in LiDAR is seldom explored in previous works, which may help identify ambiguous vehicles. Second, the distribution of point cloud on vehicles varies continuously with increasing depths, which may not be well modeled by a single model. In this work, we propose a unified model SegVoxelNet to address the above two problems. A semantic context encoder is proposed to leverage the free-of-charge semantic segmentation masks in the bird's eye view. Suspicious regions could be highlighted while noisy regions are suppressed by this module. To better deal with vehicles at different depths, a novel depth-aware head is designed to explicitly model the distribution differences and each part of the depth-aware head is made to focus on its own target detection range. Extensive experiments on the KITTI dataset show that the proposed method outperforms the state-of-the-art alternatives in both accuracy and efficiency with point cloud as input only.

preprint2020arXiv

Stereotype-Free Classification of Fictitious Faces

Equal Opportunity and Fairness are receiving increasing attention in artificial intelligence. Stereotyping is another source of discrimination, which yet has been unstudied in literature. GAN-made faces would be exposed to such discrimination, if they are classified by human perception. It is possible to eliminate the human impact on fictitious faces classification task by the use of statistical approaches. We present a novel approach through penalized regression to label stereotype-free GAN-generated synthetic unlabeled images. The proposed approach aids labeling new data (fictitious output images) by minimizing a penalized version of the least squares cost function between realistic pictures and target pictures.

preprint2020arXiv

Voice-Indistinguishability: Protecting Voiceprint in Privacy-Preserving Speech Data Release

With the development of smart devices, such as the Amazon Echo and Apple's HomePod, speech data have become a new dimension of big data. However, privacy and security concerns may hinder the collection and sharing of real-world speech data, which contain the speaker's identifiable information, i.e., voiceprint, which is considered a type of biometric identifier. Current studies on voiceprint privacy protection do not provide either a meaningful privacy-utility trade-off or a formal and rigorous definition of privacy. In this study, we design a novel and rigorous privacy metric for voiceprint privacy, which is referred to as voice-indistinguishability, by extending differential privacy. We also propose mechanisms and frameworks for privacy-preserving speech data release satisfying voice-indistinguishability. Experiments on public datasets verify the effectiveness and efficiency of the proposed methods.

preprint2016arXiv

Parallelizing Word2Vec in Multi-Core and Many-Core Architectures

Word2vec is a widely used algorithm for extracting low-dimensional vector representations of words. State-of-the-art algorithms including those by Mikolov et al. have been parallelized for multi-core CPU architectures, but are based on vector-vector operations with "Hogwild" updates that are memory-bandwidth intensive and do not efficiently use computational resources. In this paper, we propose "HogBatch" by improving reuse of various data structures in the algorithm through the use of minibatching and negative sample sharing, hence allowing us to express the problem using matrix multiply operations. We also explore different techniques to distribute word2vec computation across nodes in a compute cluster, and demonstrate good strong scalability up to 32 nodes. The new algorithm is particularly suitable for modern multi-core/many-core architectures, especially Intel's latest Knights Landing processors, and allows us to scale up the computation near linearly across cores and nodes, and process hundreds of millions of words per second, which is the fastest word2vec implementation to the best of our knowledge.

preprint2016arXiv

Parallelizing Word2Vec in Shared and Distributed Memory

Word2Vec is a widely used algorithm for extracting low-dimensional vector representations of words. It generated considerable excitement in the machine learning and natural language processing (NLP) communities recently due to its exceptional performance in many NLP applications such as named entity recognition, sentiment analysis, machine translation and question answering. State-of-the-art algorithms including those by Mikolov et al. have been parallelized for multi-core CPU architectures but are based on vector-vector operations that are memory-bandwidth intensive and do not efficiently use computational resources. In this paper, we improve reuse of various data structures in the algorithm through the use of minibatching, hence allowing us to express the problem using matrix multiply operations. We also explore different techniques to distribute word2vec computation across nodes in a compute cluster, and demonstrate good strong scalability up to 32 nodes. In combination, these techniques allow us to scale up the computation near linearly across cores and nodes, and process hundreds of millions of words per second, which is the fastest word2vec implementation to the best of our knowledge.

preprint2015arXiv

Nodal Superconducting Gap in $β$-FeS

Low temperature specific heat has been measured in superconductor $β$-FeS with T$_c$ = 4.55 K. It is found that the low temperature electronic specific heat C$_e$/T can be fitted to a linear relation in the low temperature region, but fails to be described by an exponential relation as expected by an s-wave gap. We try fittings to the data with different gap structures and find that a model with one or two nodal gaps can fit the data. Under a magnetic field, the field induced specific heat $Δγ$=[C$_e$(H)-C$_e$(0)]/T shows the Volovik relation $Δγ_e(H)\propto \sqrt{H}$, suggesting the presence of nodal gap(s) in this material.

preprint2015arXiv

Pressure Induced Enhancement of Superconductivity in LaRu2P2

To explore new superconductors beyond the copper-based and iron-based systems is very important. The Ru element locates just below the Fe in the periodic table and behaves like the Fe in many ways. One of the common thread to induce high temperature superconductivity is to introduce moderate correlation into the system. In this paper, we report the significant enhancement of superconducting transition temperature from 3.84K to 5.77K by using a pressure only of 1.74 GPa in LaRu2P2 which has an iso-structure of the iron-based 122 superconductors. The ab-initio calculation shows that the superconductivity in LaRu2P2 at ambient pressure can be explained by the McMillan's theory with strong electron-phonon coupling. However, it is difficult to interpret the significant enhancement of Tc versus pressure within this picture. Detailed analysis of the pressure induced evolution of resistivity and upper critical field Hc2(T) reveals that the increases of Tc with pressure may be accompanied by the involvement of extra electronic correlation effect. This suggests that the Ru-based system has some commonality as the Fe-based superconductors.

preprint2014arXiv

Assessing Technical Performance in Differential Gene Expression Experiments with External Spike-in RNA Control Ratio Mixtures

There is a critical need for standard approaches to assess, report, and compare the technical performance of genome-scale differential gene expression experiments. We assess technical performance with a proposed "standard" dashboard of metrics derived from analysis of external spike-in RNA control ratio mixtures. These control ratio mixtures with defined abundance ratios enable assessment of diagnostic performance of differentially expressed transcript lists, limit of detection of ratio (LODR) estimates, and expression ratio variability and measurement bias. The performance metrics suite is applicable to analysis of a typical experiment, and here we also apply these metrics to evaluate technical performance among laboratories. An interlaboratory study using identical samples shared amongst 12 laboratories with three different measurement processes demonstrated generally consistent diagnostic power across 11 laboratories. Ratio measurement variability and bias were also comparable amongst laboratories for the same measurement process. Different biases were observed for measurement processes using different mRNA enrichment protocols.

preprint2014arXiv

Observational constraints on tachyon and DBI inflation

We present a systematic method for evaluation of perturbation observables in non-canonical single-field inflation models within the slow-roll approximation, which allied with field redefinitions enables predictions to be established for a wide range of models. We use this to investigate various non-canonical inflation models, including Tachyon inflation and DBI inflation. The Lambert $W$ function will be used extensively in our method for the evaluation of observables. In the Tachyon case, in the slow-roll approximation the model can be approximated by a canonical field with a redefined potential, which yields predictions in better agreement with observations than the canonical equivalents. For DBI inflation models we consider contributions from both the scalar potential and the warp geometry. In the case of a quartic potential, we find a formula for the observables under both non-relativistic and relativistic behaviour of the scalar DBI inflaton. For a quadratic potential we find two branches in the non-relativistic case, determined by the competition of model parameters, while for the relativistic case we find consistency with results already in the literature. We present a comparison to the latest Planck satellite observations. Most of the non-canonical models we investigate, including the Tachyon, are better fits to data than canonical models with the same potential, but we find that DBI models in the slow-roll regime have difficulty in matching the data.

preprint2014arXiv

Power Law Like Correlation between Condensation Energy and Superconducting Transition Temperatures in Iron Pnictide/Chalcogenide Superconductors: Beyond the BCS Understanding

Superconducting condensation energy $U_0^{int}$ has been determined by integrating the electronic entropy in various iron pnictide/chalcogenide superconducting systems. It is found that $U_0^{int}\propto T_c^n$ with $n$ = 3 to 4, which is in sharp contrast to the simple BCS prediction $U_0^{BCS}=1/2N_FΔ_s^2$ with $N_F$ the quasiparticle density of states at the Fermi energy, $Δ_s$ the superconducting gap. A similar correlation holds if we compute the condensation energy through $U_0^{cal}=3γ_n^{eff}Δ_s^2/4π^2k_B^2$ with $γ_n^{eff}$ the effective normal state electronic specific heat coefficient. This indicates a general relationship $γ_n^{eff} \propto T_c^m$ with $m$ = 1 to 2, which is not predicted by the BCS scheme. A picture based on quantum criticality is proposed to explain this phenomenon.

preprint2014arXiv

Pressure Tuned Enhancement of Superconductivity and Change of Ground State Properties in LaO0.5F0.5BiSe2 Single Crystals

By using a hydrostatic pressure, we have successfully tuned the ground state and superconductivity in LaO0.5F0.5BiSe2 single crystals. It is found that, with the increase of pressure, the original superconducting phase with Tc about 3.5 K can be tuned to a state with lower Tc, and then a new superconducting phase with Tc about 6.5 K emerges. Accompanied by this crossover, the ground state is switched from a semiconducting state to a metallic one. Accordingly, the normal state resistivity also shows a nonmonotonic change with the external pressure. Furthermore, by applying a magnetic field, the new superconducting state under pressure with Tc about 6.5 K is suppressed, and the normal state reveals a weak semiconducting feature again. These results illustrate a non-trivial relationship between the normal state property and superconductivity in this newly discovered superconducting system.

preprint2013arXiv

Adaptive Frequency Domain Detectors for SC-FDE in Multiuser DS-UWB Systems with Structured Channel Estimation and Direct Adaptation

In this paper, we propose two adaptive detection schemes based on single-carrier frequency domain equalization (SC-FDE) for multiuser direct-sequence ultra-wideband (DS-UWB) systems, which are termed structured channel estimation (SCE) and direct adaptation (DA). Both schemes use the minimum mean square error (MMSE) linear detection strategy and employ a cyclic prefix. In the SCE scheme, we perform the adaptive channel estimation in the frequency domain and implement the despreading in the time domain after the FDE. In this scheme, the MMSE detection requires the knowledge of the number of users and the noise variance. For this purpose, we propose simple algorithms for estimating these parameters. In the DA scheme, the interference suppression task is fulfilled with only one adaptive filter in the frequency domain and a new signal expression is adopted to simplify the design of such a filter. Least-mean squares (LMS), recursive least squares (RLS) and conjugate gradient (CG) adaptive algorithms are then developed for both schemes. A complexity analysis compares the computational complexity of the proposed algorithms and schemes, and simulation results for the downlink illustrate their performance.

preprint2013arXiv

Blind Adaptive Reduced-Rank Detectors for DS-UWB Systems Based on Joint Iterative Optimization and the Constrained Constant Modulus Criterion

A novel linear blind adaptive receiver based on joint iterative optimization (JIO) and the constrained constant modulus (CCM) design criterion is proposed for interference suppression in direct-sequence ultra-wideband (DS-UWB) systems. The proposed blind receiver consists of two parts, a transformation matrix that performs dimensionality reduction and a reduced-rank filter that produces the output. In the proposed receiver, the transformation matrix and the reduced-rank filter are updated jointly and iteratively to minimize the constant modulus (CM) cost function subject to a constraint. Adaptive implementations for the JIO receiver are developed by using the normalized stochastic gradient (NSG) and recursive least-squares (RLS) algorithms. In order to obtain a low-complexity scheme, the columns of the transformation matrix with the RLS algorithm are updated individually. Blind channel estimation algorithms for both versions (NSG and RLS) are implemented. Assuming the perfect timing, the JIO receiver only requires the spreading code of the desired user and the received data. Simulation results show that both versions of the proposed JIO receivers have excellent performance in suppressing the inter-symbol interference (ISI) and multiple access interference (MAI) with a low complexity.

preprint2013arXiv

Frequency-Domain Group-based Shrinkage Estimators for UWB Systems

In this work, we propose low-complexity adaptive biased estimation algorithms, called group-based shrinkage estimators (GSEs), for parameter estimation and interference suppression scenarios with mechanisms to automatically adjust the shrinkage factors. The proposed estimation algorithms divide the target parameter vector into a number of groups and adaptively calculate one shrinkage factor for each group. GSE schemes improve the performance of the conventional least squares (LS) estimator in terms of the mean-squared error (MSE), while requiring a very modest increase in complexity. An MSE analysis is presented which indicates the lower bounds of the GSE schemes with different group sizes. We prove that our proposed schemes outperform the biased estimation with only one shrinkage factor and the best performance of GSE can be obtained with the maximum number of groups. Then, we consider an application of the proposed algorithms to single-carrier frequency-domain equalization (SC-FDE) of direct-sequence ultra-wideband (DS-UWB) systems, in which the structured channel estimation (SCE) algorithm and the frequency domain receiver employ the GSE. The simulation results show that the proposed algorithms significantly outperform the conventional unbiased estimator in the analyzed scenarios.

preprint2013arXiv

Impurity effect and suppression to superconductivity in Na(Fe$_{0.97-x}$Co$_{0.03}$T$_x$)As (T=Cu, Mn)

We report the successful growth and the impurity scattering effect of single crystals of Na(Fe$_{0.97-x}$Co$_{0.03}$T$_x$)As (T=Cu, Mn). The temperature dependence of DC magnetization at high magnetic fields is measured for different concentrations of Cu and Mn. Detailed analysis based on the Curie-Weiss law indicates that the Cu doping weakens the average magnetic moments, while doping Mn enhances the local magnetic moments greatly, suggesting that the former may be non- or very weak magnetic impurities, and the latter give rise to magnetic impurities. However, it is found that both doping Cu and Mn will enhance the residual resistivity and suppress the superconductivity at the same rate in the low doping region, being consistent with the prediction of the S$^{\pm}$ model. For the Cu-doped system, the superconductivity is suppressed completely at a residual resistivity $ρ_0$ = 0.87 m$Ω$ cm at which a strong localization effect is observed. However, in the case of Mn doping, the behavior of suppression to \emph{T}$_{c}$ changes from a fast speed to a slow one and keeps superconductive even up to a residual resistivity of 2.86 m$Ω$ cm. Clearly the magnetic Mn impurities are even not as detrimental as the non- or very weak magnetic Cu impurities to superconductivity in the high doping regime.

preprint2013arXiv

Inhomogeneous Reheating Scenario with DBI fields

We discuss a new mechanism which can be responsible for the origin of the primordial perturbation in inflationary models, the inhomogeneous DBI reheating scenario. Light DBI fields fluctuate during inflation, and finally create the density perturbations through modulation of the inflation decay rate. In this note, we investigate the curvature perturbation and its non-Gaussianity from this new mechanism. Presenting generalized expressions for them, we show that the curvature perturbation not only depends on the particular process of decay but is also dependent on the sound speed $c_s$ from the DBI action. More interestingly we find that the non-Gaussianity parameter $f_{NL}$ is independent of $c_s$. As an application we exemplify some decay processes which give a viable and detectable non-Gaussianity. Finally we find a possible connection between our model and the DBI-Curvaton mechanism.

preprint2013arXiv

Linear Reduced-Rank Interference Suppression for DS-UWB Systems Using Switched Approximations of Adaptive Basis Functions

In this work, we propose a novel low-complexity reduced-rank scheme and consider its application to linear interference suppression in direct-sequence ultra-wideband (DS-UWB) systems. Firstly, we investigate a generic reduced-rank scheme that jointly optimizes a projection vector and a reduced-rank filter by using the minimum mean-squared error (MMSE) criterion. Then a low-complexity scheme, denoted switched approximation of adaptive basis functions (SAABF), is proposed. The SAABF scheme is an extension of the generic scheme, in which the complexity reduction is achieved by using a multi-branch framework to simplify the structure of the projection vector. Adaptive implementations for the SAABF scheme are developed by using least-mean squares (LMS) and recursive least-squares (RLS) algorithms. We also develop algorithms for selecting the branch number and the model order of the SAABF scheme. Simulations show that in the scenarios with severe inter-symbol interference (ISI) and multiple access interference (MAI), the proposed SAABF scheme has fast convergence and remarkable interference suppression performance with low complexity.

preprint2012arXiv

Distinct behaviors of suppression to superconductivity in $LaRu_3Si_2$ induced by Fe and Co dopants

In the superconductor LaRu$_3$Si$_2$ with the Kagome lattice of Ru, we have successfully doped the Ru with Fe and Co atoms. Contrasting behaviors of suppression to superconductivity is discovered between the Fe and the Co dopants: Fe-impurities can suppress the superconductivity completely at a doping level of only 3%, while the superconductivity is suppressed slowly with the Co dopants. A systematic magnetization measurements indicate that the doped Fe impurities lead to spin-polarized electrons yielding magnetic moments with the magnitude of 1.6 $μ_B$\ per Fe, while the electrons given by the Co dopants have the same density of states for spin-up and spin-down leading to much weaker magnetic moments. It is the strong local magnetic moments given by the Fe-dopants that suppress the superconductivity. The band structure calculation further supports this conclusion.

preprint2012arXiv

Multi-Band Exotic Superconductivity in the New Superconductor Bi4O4S3

Resistivity, Hall effect and magnetization have been investigated on the new superconductor Bi4O4S3. A weak insulating behavior has been induced in the normal state when the superconductivity is suppressed. Hall effect measurements illustrate clearly a multiband feature dominated by electron charge carriers, which is further supported by the magnetoresistance data. Interestingly, a kink appears on the temperature dependence of resistivity at about 4 K at all high magnetic fields when the bulk superconductivity is completely suppressed. This kink can be well traced back to the upper critical field Hc2(T) in the low field region, and is explained as the possible evidence of residual Cooper pairs on the one dimensional chains.

preprint2012arXiv

Observational constraints on K-inflation models

We extend the ModeCode software of Mortonson, Peiris and Easther to enable numerical computation of perturbations in K-inflation models, where the scalar field no longer has a canonical kinetic term. Focussing on models where the kinetic and potential terms can be separated into a sum, we compute slow-roll predictions for various models and use these to verify the numerical code. A Markov chain Monte Carlo analysis is then used to impose constraints from WMAP7 data on the addition of a term quadratic in the kinetic energy to the Lagrangian of simple chaotic inflation models. For a quadratic potential, the data do not discriminate against addition of such a term, while for a quartic (λϕ^4) potential inclusion of such a term is actually favoured. Overall, constraints on such a term from present data are found to be extremely weak.

preprint2012arXiv

Superconductivity Appears in the Vicinity of an Insulating-Like Behavior in CeO$_{1-x}$F$_{x}$BiS$_{2}$

Resistive and magnetization properties have been measured in BiS$_2$-based samples CeO$_{1-x}$F$_{x}$BiS$_{2}$ with a systematic substitution of O with F (0 $<$ x $<$ 0.6). In contrast to the band structure calculations, it is found that the parent phase of CeOBiS$_2$ is a bad metal, instead of an band insulator. By doping electrons into the system, it is surprising to find that superconductivity appears together with an insulating normal state. This evolution is clearly different from the cuprate and the iron pnictide systems, and is interpreted as approaching the von Hove singularity. Furthermore, ferromagnetism which may arise from the Ce moments, has been observed in the low temperature region in all samples, suggesting the co-existence of superconductivity and ferromagnetism in the superconducting samples.

preprint2012arXiv

Unexpected weak spatial variation of local density of sates induced by individual Co impurity atoms in Na(Fe{1-x}Cox)As as revealed by scanning tunneling spectroscopy

We use spatially resolved scanning tunneling spectroscopy in Na(Fe{1-x}Cox)As to investigate the impurity effect induced by Co dopants. The Co impurities are successfully identified, and the spatial distributions of local density of state at different energies around these impurities are investigated. It is found that the spectrum shows negligible spatial variation at different positions near the Co impurity, although there is a continuum of the in-gap states which lifts the zero-bias conductance to a finite value. Our results put constraints on the S+- and S++ models and sharpen the debate on the role of scattering potentials induced by the Co dopants.

preprint2011arXiv

Anomalous Properties in the Normal and Superconducting States of LaRu$_3$Si$_2$

Superconductivity in LaRu$_3$Si$_2$ with the honeycomb structure of Ru atoms has been investigated. It is found that the normal state specific heat C/T exhibits a deviation from the Debye model down to the lowest temperature. A relation $C/T = γ_n+βT^2-ATlnT$ which concerns the electron correlations can fit the data very well. The suppression to the superconductivity by the magnetic field is not the mean-field like, which is associated well with the observation of strong superconducting fluctuations. The field dependence of the induced quasiparticle density of states measured by the low temperature specific heat shows a non-linear feature, indicating the significant contributions given by the delocalized quasiparticles.

preprint2010arXiv

Quantum logical gates with four-level SQUIDs coupled to a superconducting resonator

We propose a way for realizing a two-qubit controlled phase gate with superconducting quantum interference devices (SQUIDs) coupled to a superconducting resonator. In this proposal, the two lowest levels of each SQUID serve as the logical states and two intermediate levels of each SQUID are used for the gate realization. We show that neither adjustment of SQUID level spacings during the gate operation nor uniformity in SQUID parameters is required by this proposal. In addition, this proposal does not require the adiabatic passage or a second-order detuning and thus the gate is much faster.

preprint2009arXiv

Inflation in a Web

In a given path with multiple branches, in principle, it can be expected that there are some fork points, where one branch is bifurcated into different branches, or various branches converge into one or several branches. In this paper, it is showed that if there is a web formed by such branches in a given field space, in which each branch can be responsible for a period of slow roll inflation, a multiverse separated by domain wall network will come into being, some of which might corresponds to our observable universe. We discuss this scenario and show possible observations of a given observer at late time.

preprint2009arXiv

Integrating fluctuations into distribution of resources in transportation networks

We propose a resource distribution strategy to reduce the average travel time in a transportation network given a fixed generation rate. Suppose that there are essential resources to avoid congestion in the network as well as some extra resources. The strategy distributes the essential resources by the average loads on the vertices and integrates the fluctuations of the instantaneous loads into the distribution of the extra resources. The fluctuations are calculated with the assumption of unlimited resources, where the calculation is incorporated into the calculation of the average loads without adding to the time complexity. Simulation results show that the fluctuation-integrated strategy provides shorter average travel time than a previous distribution strategy while keeping similar robustness. The strategy is especially beneficial when the extra resources are scarce and the network is heterogeneous and lowly loaded.

preprint2006arXiv

Evolving Network With Different Edges

We proposed an evolving network model constituted by the same nodes but different edges. The competition between nodes and different links were introduced. Scale free properties have been found in this model by continuum theory. Different network topologies can be generated by some tunable parameters. Simulation results consolidate the prediction.

Sheng Li

What is connected

Connect this record

See the researcher in context

Building this map preview

60 published item(s)

DRNet: All-in-One Image Restoration via Prior-Guided Dynamic Reparameterization

Observation of robust one-dimensional edge channels in a three-dimensional quantum spin Hall insulator

From Covert Hiding to Visual Editing: Robust Generative Video Steganography

Object-oriented backdoor attack against image captioning

PROMPT-IML: Image Manipulation Localization with Pre-trained Foundation Models Through Prompt Tuning

Trojaning semi-supervised learning model via poisoning wild images on the web

A DTCWT-SVD Based Video Watermarking resistant to frame rate conversion

Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized Activations

Emergence of insulating ferrimagnetism and perpendicular magnetic anisotropy in 3d-5d perovskite oxide composite films for insulator spintronic

Exploring Depth Information for Face Manipulation Detection

Fusion of Self-supervised Learned Models for MOS Prediction

Generative Steganography Network

Hierarchical Capsule Prediction Network for Marketing Campaigns Effect

High-Capacity Framework for Reversible Data Hiding in Encrypted Image Using Pixel Predictions and Entropy Encoding

Image Generation Network for Covert Transmission in Online Social Network

Learning Infomax and Domain-Independent Representations for Causal Effect Inference with Real-World Data

Multi-Task Adversarial Learning for Treatment Effect Estimation in Basket Trials

Multimodal Fake News Detection via CLIP-Guided Learning

NeuralSound: Learning-based Modal Sound Synthesis With Acoustic Transfer

Robust Watermarking for Video Forgery Detection with Improved Imperceptibility and Robustness

Cooperative control of perpendicular magnetic anisotropy via crystal structure and orientation in single-crystal flexible SrRuO3 membranes

Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning

Enhanced Superconductivity in the Se-substituted 1T-PdTe$_2$

Lateral modulation of magnetic anisotropy in tricolor 3d-5d oxide superlattices

Learning Emergent Discrete Message Communication for Cooperative Reinforcement Learning

Searching for Fast Model Families on Datacenter Accelerators

A Survey on Causal Inference

Analysis of Fleet Management and Network Design for On-Demand Urban Air Mobility Operations

Cross-scale Attention Model for Acoustic Event Classification

Effect of isotope disorder on the Raman spectra of cubic boron arsenide

Learning Robust Data Representation: A Knowledge Flow Perspective

Non-Abelian Aharonov-Bohm Caging in Photonic Lattices

Novel polymorphic phase of BaCu2As2: impact of flux for new phase formation in crystal growth

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud

Stereotype-Free Classification of Fictitious Faces

Voice-Indistinguishability: Protecting Voiceprint in Privacy-Preserving Speech Data Release

Parallelizing Word2Vec in Multi-Core and Many-Core Architectures

Parallelizing Word2Vec in Shared and Distributed Memory

Nodal Superconducting Gap in $β$-FeS

Pressure Induced Enhancement of Superconductivity in LaRu2P2

Assessing Technical Performance in Differential Gene Expression Experiments with External Spike-in RNA Control Ratio Mixtures

Observational constraints on tachyon and DBI inflation

Power Law Like Correlation between Condensation Energy and Superconducting Transition Temperatures in Iron Pnictide/Chalcogenide Superconductors: Beyond the BCS Understanding

Pressure Tuned Enhancement of Superconductivity and Change of Ground State Properties in LaO0.5F0.5BiSe2 Single Crystals

Adaptive Frequency Domain Detectors for SC-FDE in Multiuser DS-UWB Systems with Structured Channel Estimation and Direct Adaptation

Blind Adaptive Reduced-Rank Detectors for DS-UWB Systems Based on Joint Iterative Optimization and the Constrained Constant Modulus Criterion

Frequency-Domain Group-based Shrinkage Estimators for UWB Systems

Impurity effect and suppression to superconductivity in Na(Fe$_{0.97-x}$Co$_{0.03}$T$_x$)As (T=Cu, Mn)

Inhomogeneous Reheating Scenario with DBI fields

Linear Reduced-Rank Interference Suppression for DS-UWB Systems Using Switched Approximations of Adaptive Basis Functions

Distinct behaviors of suppression to superconductivity in $LaRu_3Si_2$ induced by Fe and Co dopants

Multi-Band Exotic Superconductivity in the New Superconductor Bi4O4S3

Observational constraints on K-inflation models

Superconductivity Appears in the Vicinity of an Insulating-Like Behavior in CeO$_{1-x}$F$_{x}$BiS$_{2}$

Unexpected weak spatial variation of local density of sates induced by individual Co impurity atoms in Na(Fe{1-x}Cox)As as revealed by scanning tunneling spectroscopy

Anomalous Properties in the Normal and Superconducting States of LaRu$_3$Si$_2$

Quantum logical gates with four-level SQUIDs coupled to a superconducting resonator

Inflation in a Web

Integrating fluctuations into distribution of resources in transportation networks

Evolving Network With Different Edges