Source author record

Liang Chen

Liang Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.HE Computer Vision Machine Learning Artificial Intelligence Computation and Language hep-ph astro-ph.CO eess.IV eess.SP cond-mat.mtrl-sci cond-mat.str-el Distributed, Parallel, and Cluster Computing math.NA math.OC physics.ins-det physics.optics quant-ph Systems and Control cond-mat.dis-nn cond-mat.other eess.SY Information Retrieval Networking and Internet Architecture Numerical Analysis Quantitative Methods Social and Information Networks Software Engineering astro-ph.GA astro-ph.SR Computational Engineering, Finance, and Science cond-mat.mes-hall hep-th Information Theory math.DS math.IT Multimedia Neurons and Cognition nucl-ex physics.app-ph Biological Physics Computational Complexity cond-mat cond-mat.quant-gas cond-mat.soft cond-mat.supr-con Cryptography and Security Data Structures and Algorithms eess.AS math.AP Neural and Evolutionary Computing nlin.AO Other Computer Science Performance physics.atom-ph physics.chem-ph physics.comp-ph Populations and Evolution Sound supr-con

Catalog footprint

What is connected

129works

59topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A geometry aware framework enhances noninvasive mapping of whole human brain dynamics

Non-invasive electrophysiology lacks methods that accurately reconstruct whole-brain spatiotemporal dynamics while incorporating individual cortical geometry, leaving current electroencephalography and magnetoencephalography source imaging limited by simplistic or biologically implausible priors. Here, we show that embedding participant-specific Geometric Basis Functions (GBFs), eigenmodes derived from each individual's cortical surface, provides a powerful anatomic constraint that resolves the inverse problem and improves reconstruction fidelity. The method reconstructs neural sources as linear combinations of geometric basis functions, thereby aligning source estimates with the geometric organization of neural dynamics. We validate GBF across the Meta-Source Benchmark, task-evoked data, resting-state networks, intracranial stimulation, and epilepsy data. The results demonstrate that GBF yields high localization accuracy and captures fast spatiotemporal dynamics consistent with anatomical pathways. These findings suggest that both spontaneous and evoked whole-brain activity can be described by hundreds of geometric modes, providing a compact yet accurate representation of neural sources. By linking cortical geometry to electrophysiological dynamics, GBF offers a versatile source imaging tool for both scientific and clinical applications.

preprint2026arXiv

An Enigmatic PeVatron in an Area around HII Region G35.6$-$0.5

Identifying Galactic PeVatrons (PeV particle accelerators) from the ultra-high-energy (UHE, >100 TeV) $γ$-ray sources plays a crucial role in revealing the origin of Galactic cosmic rays. The UHE source 1LHAASO J1857+0203u is suggested to be associated with HESS J1858+020, which may be attributed to the possible PeVatron candidate supernova remnant (SNR) G35.6$-$0.4 or HII region G35.6$-$0.5. We perform detailed analysis on the very-high-energy and UHE $γ$-ray emissions towards this region with data from the Large High Altitude Air Shower Observatory (LHAASO). 1LHAASO J1857+0203u is detected with a significance of 11.6$σ$ above 100 TeV, indicating the presence of a PeVatron. It has an extension of $\sim 0.18^\circ$ with a power-law (PL) spectral index of $\sim$2.5 in 1-25 TeV and a point-like emission with a PL spectral index of $\sim$3.2 above 25 TeV. Using the archival CO and HI data, we identify some molecular and atomic clouds that may be associated with the TeV $γ$-ray emissions. Our modelling indicates that the TeV $γ$-ray emissions are unlikely to arise from the clouds illuminated by the protons that escaped from SNR G35.6$-$0.4. In the scenario that HII region G35.6$-$0.5 could accelerate particles to the UHE band, the observed GeV-TeV $γ$-ray emission could be well explained by a hadronic model with a PL spectral index of $\sim$2.0 and cutoff energy of $\sim$450 TeV. However, an evolved pulsar wind nebula origin cannot be ruled out.

preprint2026arXiv

An Ultrahigh-energy $γ$-ray Bubble Powered by a Super PeVatron

We report the detection of a $γ$-ray bubble spanning at least 100$\rm deg^2$ in ultra high energy (UHE) up to a few PeV in the direction of the star-forming region Cygnus X, implying the presence Super PeVatron(s) accelerating protons to at least 10 PeV. A log-parabola form with the photon index $Γ(E) = (2.71 \pm 0.02) + (0.11 \pm 0.02) \times \log_{10} (E/10 \ {\rm TeV})$ is found fitting the gamma-ray energy spectrum of the bubble well. UHE sources, `hot spots' correlated with very massive molecular clouds, and a quasi-spherical amorphous $γ$-ray emitter with a sharp central brightening are observed in the bubble. In the core of $\sim 0.5^{\circ}$, spatially associating with a region containing massive OB association (Cygnus OB2) and a microquasar (Cygnus X-3), as well as previously reported multi-TeV sources, an enhanced concentration of UHE $γ$-rays are observed with 2 photons at energies above 1 PeV. The general feature of the bubble, the morphology and the energy spectrum, are reasonably reproduced by the assumption of a particle accelerator in the core, continuously injecting protons into the ambient medium.

preprint2026arXiv

BabyVision: Visual Reasoning Beyond Language

While humans develop core visual skills long before acquiring language, contemporary Multimodal LLMs (MLLMs) still rely heavily on linguistic priors to compensate for their fragile visual understanding. We uncovered a crucial fact: state-of-the-art MLLMs consistently fail on basic visual tasks that humans, even 3-year-olds, can solve effortlessly. To systematically investigate this gap, we introduce BabyVision, a benchmark designed to assess core visual abilities independent of linguistic knowledge for MLLMs. BabyVision spans a wide range of tasks, with 388 items divided into 22 subclasses across four key categories. Empirical results and human evaluation reveal that leading MLLMs perform significantly below human baselines. Gemini3-Pro-Preview scores 49.7, lagging behind 6-year-old humans and falling well behind the average adult score of 94.1. These results show despite excelling in knowledge-heavy evaluations, current MLLMs still lack fundamental visual primitives. Progress in BabyVision represents a step toward human-level visual perception and reasoning capabilities. We also explore solving visual reasoning with generation models by proposing BabyVision-Gen and automatic evaluation toolkit. Our code and benchmark data are released at https://github.com/UniPat-AI/BabyVision for reproduction.

preprint2026arXiv

Constraining the Cosmic-ray Energy Based on Observations of Nearby Galaxy Clusters by LHAASO

Galaxy clusters act as reservoirs of high-energy cosmic rays (CRs). As CRs propagate through the intracluster medium, they generate diffuse $γ$-rays detectable by arrays such as LHAASO. These $γ$-rays result from proton-proton ($pp$) collisions of very high-energy cosmic rays (VHECRs) or inverse Compton (IC) scattering of positron-electron pairs created by $pγ$ interactions of ultra-high-energy cosmic rays (UHECRs). We analyzed diffuse $γ$-ray emission from the Coma, Perseus, and Virgo clusters using LHAASO data. Diffuse emission was modeled as a disk of radius $R_{500}$ for each cluster while accounting for point sources. No significant diffuse emission was detected, yielding 95\% confidence level (C.L.) upper limits on the $γ$-ray flux: for WCDA (1-25~TeV) and KM2A ($>25$~TeV), less than $(49.4, 13.7, 54.0)$ and $(1.34, 1.14, 0.40) \times 10^{-14}$~ph~cm$^{-2}$~s$^{-1}$ for Coma, Perseus, and Virgo, respectively. The $γ$-ray upper limits can be used to derive model-independent constraints on the integral energy of CRp above 10~TeV (corresponding to the LHAASO observational range $>1$~TeV under the $pp$ scenario) to be less than $(1.96, 0.59, 0.08) \times 10^{61}$~erg. The absence of detectable annuli/ring-like structures, indicative of cluster accretion or merging shocks, imposes further constraints on models in which the UHECRs are accelerated in the merging shocks of galaxy clusters.

preprint2026arXiv

Constraints on heavy decaying dark matter from 570 days of LHAASO observations

The Kilometer Square Array~(KM2A) of the Large High Altitude Air Shower Observatory (LHAASO) aims at surveying the northern gamma-ray sky at energies above 10 TeV with unprecedented sensitivity. Gamma-ray observations have long been one of the most powerful tools for dark matter searches, as e.g., high-energy gamma-rays could be produced by the decays of heavy dark matter particles. In this letter, we present the first dark matter analysis with LHAASO-KM2A, using the first 340~days of data from 1/2-KM2A and 230~days of data from 3/4-KM2A. Several regions of interest are used to search for a signal and account for the residual cosmic-ray background after gamma/hadron separation. We find no excess of dark matter signals, and thus place some of the strongest gamma-ray constraints on the lifetime of heavy dark matter particles with mass between 10^5 and 10^9~GeV. Our results with LHAASO are robust, and have important implications for dark matter interpretations of the diffuse astrophysical high-energy neutrino emission.

preprint2026arXiv

Deep view of Composite SNR CTA1 with LHAASO in $γ$-rays up to 300 TeV

The ultra-high-energy (UHE) gamma-ray source 1LHAASO J0007+7303u is positionally associated with the composite SNR CTA1 that is located at high Galactic Latitude $b\approx 10.5^\circ$. This provides a rare opportunity to spatially resolve the component of the pulsar wind nebula (PWN) and supernova remnant (SNR) at UHE. This paper conducted a dedicated data analysis of 1LHAASO J0007+7303u using the data collected from December 2019 to July 2023. This source is well detected with significances of 21$σ$ and 17$σ$ at 8$-$100 TeV and $>$100 TeV, respectively. The corresponding extensions are determined to be 0.23$^{\circ}\pm$0.03$^{\circ}$ and 0.17$^{\circ}\pm$0.03$^{\circ}$. The emission is proposed to originate from the relativistic electrons and positrons accelerated within the PWN of PSR J0007+7303. The energy spectrum is well described by a power-law with an exponential cutoff function $dN/dE = (42.4\pm4.1)(\frac{E}{20\rm\ TeV})^{-2.31\pm0.11}\exp(-\frac{E}{110\pm25\rm\ TeV})$ $\rm\ TeV^{-1}\ cm^{-2}\ s^{-1}$in the energy range from 8 TeV to 300 TeV, implying a steady-state parent electron spectrum $dN_e/dE_e\propto (\frac{E_e}{100\rm\ TeV})^{-3.13\pm0.16}\exp[(\frac{-E_e}{373\pm70\rm\ TeV})^2]$ at energies above $\approx 50 \rm\ TeV$. The cutoff energy of the electron spectrum is roughly equal to the expected current maximum energy of particles accelerated at the PWN terminal shock. Combining the X-ray and gamma-ray emission, the current space-averaged magnetic field can be limited to $\approx 4.5\rm\ μG$. To satisfy the multi-wavelength spectrum and the $γ$-ray extensions, the transport of relativistic particles within the PWN is likely dominated by the advection process under the free-expansion phase assumption.

preprint2026arXiv

Discovery of a new $γ$-ray source LHAASO J0341+5258 with emission up to 200TeV

We report the discovery of a new unidentified extended $γ$-ray source in the Galactic plane named LHAASO J0341+5258 with a pre-trial significance of 8.2 standard deviations above 25 TeV. The best fit position is R.A.$=55.34^{\circ}\pm0.11^{\circ}$ and Dec$=52.97^{\circ}\pm0.07^{\circ}$. The angular size of LHAASO J0341+5258 is $0.29^\circ \pm 0.06^\circ_{stat} \pm0.02^\circ_{sys}$. The flux above 25 TeV is about $20\%$ of the flux of Crab Nebula. Although a power-law fit of the spectrum from 10 TeV to 200 TeV with the photon index $α=2.98 \pm 0.19_{stat} \pm 0.02_{sys}$ is not excluded, the LHAASO data together with the flux upper limit at 10 GeV set by the Fermi LAT observation, indicate a noticeable steepening of an initially hard power-law spectrum %($α\leq 1.75$) spectrum with a cutoff at $\approx 50$ TeV. We briefly discuss the origin of UHE gamma-rays. The lack of an energetic pulsar and a young SNR inside or in the vicinity of LHAASO J0341+5258 challenge, but do not exclude both the leptonic and hadronic scenarios of gamma-ray production.

preprint2026arXiv

Discovery of the Ultra-high energy gamma-ray source LHAASO J2108+5157

We report the discovery of a UHE gamma-ray source, LHAASO J2108+5157, by analyzing the LHAASO-KM2A data of 308.33 live days. Significant excess of gamma-ray induced showers is observed in both energy bands of 25-100 TeV and $\gt$100 TeV with 9.5 sigma and 8.5 sigma, respectively. This source is not significantly favored as an extensive source with the angular extension smaller than the point-spread function of KM2A. The measured energy spectrum from 20 to 200 TeV can be approximately described by a power-law function with an index of -2.83$\pm$ 0.18stat. A harder spectrum is demanded at lower energies considering the flux upper limit set by Fermi-LAT observations. The position of the gamma-ray emission is correlated with a giant molecular cloud, which favors a hadronic origin. No obvious counterparts have been found, deeper multiwavelength observations will help to shed new light on this intriguing UHE source.

preprint2026arXiv

Energy calibration of LHAASO-KM2A using the cosmic ray Moon shadow

We present a precise measurement of the westward, rigidity-dependent shift of the Moon's shadow using three and a half years of cosmic-ray data collected by the Kilometer Square Array (KM2A) of the Large High Altitude Air Shower Observatory (LHAASO). These measurements enable us to calibrate the detector energy response in the range 20-260 TeV, with results showing excellent agreement with the response derived from Monte Carlo (MC) simulations of the KM2A detector. We also measure a best-fit parameter $ε= 0.015 \pm 0.08$, corresponding to a 95% confidence interval of [-14%, +17%] for the energy-scale estimation. This result establishes the exceptional accuracy of the KM2A-MC in simulating the detector's response within this energy range.

preprint2026arXiv

Energy-Dependent Shifts of Medium-Scale Anisotropies in Very-High-Energy Cosmic Rays Observed by LHAASO-KM2A

Small deviations from isotropy in the arrival directions of Galactic cosmic rays serve as a unique probe of the local magnetic environment. In this Letter, we report observations of medium-scale anisotropies (MSA) at energies above 10 TeV using the LHAASO-KM2A array. Our analysis identifies four regions of excess and four regions of deficit, each spanning angular scales of approximately ten degrees. Crucially, we detect significant energy-dependent shifts in the centroids of two excess regions: Region B and the newly identified Region $\mathrm{\widetilde{D}}$. We also characterize the energy evolution of the fractional relative intensity across both excess and deficit regions. These findings imply that the observed anisotropies are shaped by the specific realization of the local turbulent magnetic field within the cosmic ray scattering length. Such energy-dependent behaviors impose strict constraints on local turbulence models and cosmic ray propagation theories.

preprint2026arXiv

Evidence for particle acceleration approaching PeV energies in the W51 complex

The $γ$-ray emission from the W51 complex is widely acknowledged to be attributed to the interaction between the cosmic rays (CRs) accelerated by the shock of supernova remnant (SNR) W51C and the dense molecular clouds in the adjacent star-forming region, W51B. However, the maximum acceleration capability of W51C for CRs remains elusive. Based on observations conducted with the Large High Altitude Air Shower Observatory (LHAASO), we report a significant detection of $γ$ rays emanating from the W51 complex, with energies from 2 TeV to 200 TeV. The LHAASO measurements, for the first time, extend the $γ$-ray emission from the W51 complex beyond 100 TeV and reveal a significant spectrum bending at tens of TeV. By combining the ``$π^0$-decay bump" featured data from Fermi-LAT, the broadband $γ$-ray spectrum of the W51 region can be well-characterized by a simple pp-collision model. The observed spectral bending feature suggests an exponential cutoff at $\sim400$~TeV or a power-law break at $\sim200$~TeV in the CR proton spectrum, most likely providing the first evidence of SNRs serving as CR accelerators approaching the PeV regime. Additionally, two young star clusters within W51B could also be theoretically viable to produce the most energetic $γ$ rays observed by LHAASO. Our findings strongly support the presence of extreme CR accelerators within the W51 complex and provide new insights into the origin of Galactic CRs.

preprint2026arXiv

Exploring Lorentz Invariance Violation from Ultra-high-energy Gamma Rays Observed by LHAASO

Recently the LHAASO Collaboration published the detection of 12 ultra-high-energy gamma-ray sources above 100 TeV, with the highest energy photon reaching 1.4 PeV. The first detection of PeV gamma rays from astrophysical sources may provide a very sensitive probe of the effect of the Lorentz invariance violation (LIV), which results in decay of high-energy gamma rays in the superluminal scenario and hence a sharp cutoff of the energy spectrum. Two highest energy sources are studied in this work. No signature of the existence of LIV is found in their energy spectra, and the lower limits on the LIV energy scale are derived. Our results show that the first-order LIV energy scale should be higher than about 10^5 times the Planck scale M_{pl} and that the second-order LIV scale is >10^{-3}M_{pl}. Both limits improve by at least one order of magnitude the previous results.

preprint2026arXiv

Extended Very-High-Energy Gamma-Ray Emission Surrounding PSR J0622 + 3749 Observed by LHAASO-KM2A

We report the discovery of an extended very-high-energy (VHE) gamma-ray source around the location of the middle-aged (207.8 kyr) pulsar PSR J0622+3749 with the Large High Altitude Air Shower Observatory (LHAASO). The source is detected with a significance of $8.2σ$ for $E>25$~TeV assuming a Gaussian template. The best-fit location is (R.A., Dec.)$=(95^{\circ}\!.47\pm0^{\circ}\!.11,\,37^{\circ}\!.92 \pm0^{\circ}\!.09)$, and the extension is $0^{\circ}\!.40\pm0^{\circ}\!.07$. The energy spectrum can be described by a power-law spectrum with an index of ${-2.92 \pm 0.17_{\rm stat} \pm 0.02_{\rm sys} }$. No clear extended multi-wavelength counterpart of the LHAASO source has been found from the radio to sub-TeV bands. The LHAASO observations are consistent with the scenario that VHE electrons escaped from the pulsar, diffused in the interstellar medium, and scattered the interstellar radiation field. If interpreted as the pulsar halo scenario, the diffusion coefficient, inferred for electrons with median energies of $\sim160$~TeV, is consistent with those obtained from the extended halos around Geminga and Monogem and much smaller than that derived from cosmic ray secondaries. The LHAASO discovery of this source thus likely enriches the class of so-called pulsar halos and confirms that high-energy particles generally diffuse very slowly in the disturbed medium around pulsars.

preprint2026arXiv

LHAASO Detection of Ultra-High-Energy Gamma-Ray Emission toward the Giant Molecular Clouds

The $γ$-ray from Giant molecular clouds (GMCs) is regarded as the most ideal tool to perform in-situ measurement of cosmic ray (CR) density and spectra in our Galaxy. We report the first detection of $γ$-ray emissions in the very-high-energy (VHE) domain from the five nearby GMCs with a stacking analysis based on a 4.5-year $γ$-ray observation with the Large High Altitude Air Shower Observatory (LHAASO) experiment. The spectral energy distributions derived from the GMCs are consistent with the expected $γ$-ray flux produced via CR interacting with the ISM in the energy interval 1 - 100 $~\rm$ TeV. In addition, we investigate the presence of the CR spectral `knee' by introducing a spectral break in the $γ$-ray data. While no significant evidence for the CR knee is found, the current KM2A measurements from GMCs strongly favor a proton CR knee located above 0.9$~\rm$ PeV, which is consistent with the latest measurement of the CR spectrum by ground-based experiments.

preprint2026arXiv

Measurement of attenuation length of the muon content in extensive air showers from 0.3 to 30 PeV with LHAASO

The attenuation length of the muon content in extensive air showers provides important information regarding the generation and development of air showers. This information can be used not only to improve the description of such showers but also to test fundamental models of hadronic interactions. Using data from the LHAASO-KM2A experiment, the development of the muon content in high-energy air showers was studied. The attenuation length of muon content in the air showers was measured from experimental data in the energy range from 0.3 to 30 PeV using the constant intensity cut method. By comparing the attenuation length of the muon content with predictions from high-energy hadronic interaction models (QGSJET-II-04, SIBYLL 2.3d, and EPOS-LHC), it is evident that LHAASO results are significantly shorter than those predicted by the first two models (QGSJET-II-04 and SIBYLL 2.3d) but relatively close to those predicted by the third model (EPOS-LHC). Thus, the LHAASO data favor the EPOS-LHC model over the other two models. The three interaction models confirmed an increasing trend in the attenuation length as the cosmic-ray energy increases.

preprint2026arXiv

Measurement of Very-high-energy Diffuse Gamma-ray Emissions from the Galactic Plane with LHAASO-WCDA

The diffuse Galactic gamma-ray emission is a very important tool used to study the propagation and interaction of cosmic rays in the Milky Way. In this work, we report the measurements of the diffuse emission from the Galactic plane, covering Galactic longitudes from $15^{\circ}$ to $235^{\circ}$ and latitudes from $-5^{\circ}$ to $+5^{\circ}$, in an energy range of 1 TeV to 25 TeV, with the Water Cherenkov Detector Array (WCDA) of the Large High Altitude Air Shower Observatory (LHAASO). After masking the sky regions of known sources, the diffuse emission is detected with $24.6σ$ and $9.1σ$ significance in the inner Galactic plane and outer Galactic plane, respectively. The WCDA spectra in both regions can be well described by a power-law function, with spectral indices of $-2.67\pm0.05_{\rm stat}$ in the inner region and $-2.83\pm0.19_{\rm stat}$ in the outer region, respectively. Combined with the Square Kilometer Array (KM2A) measurements at higher energies, a clear softening of the spectrum is found in the inner region, with change of spectral indices by $\sim0.5$ at a break energy around $30$ TeV. The fluxes of the diffuse emission are higher by a factor of $1.5-2.7$ than the model prediction assuming local CR spectra and the gas column density, which are consistent with those measured by the KM2A. Along Galactic longitude, the spatial distribution of the diffuse emission shows deviation from that of the gas column density. The spectral shape of the diffuse emission are possibly variation in different longitude region. The WCDA measurements bridge the gap between the low-energy measurements by space detectors and the ultra-high-energy observations by LHAASO-KM2A and other experiments. These results suggest that improved modeling of the wide-band diffuse emission is required.

preprint2026arXiv

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

The rapid development of interactive and autonomous AI systems signals our entry into the agentic era. Training and evaluating agents on complex agentic tasks such as software engineering and computer use requires not only efficient model computation but also sophisticated infrastructure capable of coordinating vast agent-environment interactions. However, no open-source infrastructure can effectively support large-scale training and evaluation on such complex agentic tasks. To address this challenge, we present MegaFlow, a large-scale distributed orchestration system that enables efficient scheduling, resource allocation, and fine-grained task management for agent-environment workloads. MegaFlow abstracts agent training infrastructure into three independent services (Model Service, Agent Service, and Environment Service) that interact through unified interfaces, enabling independent scaling and flexible resource allocation across diverse agent-environment configurations. In our agent training deployments, MegaFlow successfully orchestrates tens of thousands of concurrent agent tasks while maintaining high system stability and achieving efficient resource utilization. By enabling such large-scale agent training, MegaFlow addresses a critical infrastructure gap in the emerging agentic AI landscape.

preprint2026arXiv

Optimized readout strategies for neutral atom quantum processors

Neutral atom quantum processors have emerged as a promising platform for scalable quantum information processing, offering high-fidelity operations and exceptional qubit scalability. A key challenge in realizing practical applications is efficiently extracting readout outcomes while maintaining high system throughput, i.e., the rate of quantum task executions. In this work, we develop a theoretical framework to quantify the trade-off between readout fidelity and atomic retention. Moreover, we introduce a metric of quantum circuit iteration rate (qCIR) and employ normalized quantum Fisher information to characterize system overall performance. Further, by carefully balancing fidelity and retention, we demonstrate a readout strategy for optimizing information acquisition efficiency. Considering the experimentally feasible parameters for 87Rb atoms, we demonstrate that qCIRs of 197.2Hz and 154.5Hz are achievable using single photon detectors and cameras, respectively. These results provide practical guidance for constructing scalable and high-throughput neutral atom quantum processors for applications in sensing, simulation, and near-term algorithm implementation.

preprint2026arXiv

RoadmapBench: Evaluating Long-Horizon Agentic Software Development Across Version Upgrades

Coding agents are increasingly deployed in real software development, where a single version iteration requires months of coordinated work across many files. However, most existing benchmarks focus predominantly on single-issue bug fixes from Python repositories, with coarse pass/fail evaluation outcomes, and thus fail to capture long-horizon, multi-target development at real engineering scale. To address this gap, we present RoadmapBench, a benchmark of 115 long-horizon coding tasks grounded in real open-source version upgrades across 17 repositories and 5 programming languages. Each task places the agent on a source-version code snapshot and provides a multi-target roadmap instruction requiring it to implement the functionality introduced in the target version, with a median modification of 3,700 lines across 51 files. We conduct a systematic evaluation on thirteen frontier models and find that even the strongest, Claude-Opus-4.7, resolves only 39.1% of tasks, while the weakest achieves merely 5.2%, in stark contrast to existing bug-fix benchmarks, suggesting that long-horizon software development remains a largely unsolved problem.

preprint2026arXiv

SaaS-Bench: Can Computer-Use Agents Leverage Real-World SaaS to Solve Professional Workflows?

Computer-Using Agents (CUAs) are rapidly extending large language models (LLMs) beyond text-based reasoning toward action execution in more complex environments, such as web browsers and graphical user interfaces (GUIs). However, existing web and GUI agent benchmarks often rely on simplified settings, isolated tasks, or short-horizon interactions, making it difficult to assess capabilities of agents in realistic professional workflows. Software-as-a-Service (SaaS) environments are a natural choice for CUA evaluation, as they host a large share of modern digital work and naturally involve dynamic system states, cross-application coordination, domain-specific knowledge, and long-horizon dependencies. To this end, we introduce SaaS-Bench, a benchmark built on 23 deployable SaaS systems across six professional domains, containing 106 tasks grounded in realistic work scenarios. These tasks require long-horizon execution, cover both text-only and multimodal settings, and are evaluated with weighted verification checkpoints that measure strict task completion and partial progress. Experiments show that representative LLM-based agents struggle on SaaS-Bench, with even the strongest model completing fewer than 4% of tasks end-to-end, exposing limitations in planning, state tracking, cross-application context maintenance, and error recovery. Code are available at https://github.com/UniPat-AI/SaaS-Bench for reproduction.

preprint2026arXiv

Step-wise Rubric Rewards for LLM Reasoning

Reinforcement Learning with Verifiable Rewards (RLVR) is widely used to improve reasoning in large language models, but rewards only final-answer correctness with no supervision over intermediate steps. Rubric-based methods such as Rubrics as Rewards (RaR) introduce finer-grained supervision by scoring rollouts against structured criteria, yet the rubric scores are still aggregated into a single scalar applied to the entire response, causing three weaknesses: loss of multi-criterion structure, uniform supervision of correct and incorrect steps, and reward hacking through unbounded self-correction. On 1,000 problems, we find 18.2% of steps in correct-answer responses are wrong yet positively rewarded, while 49.9% of steps in incorrect-answer responses are correct yet penalized. We introduce Step-wise Rubrics as Rewards (SRaR), an RLVR framework that (i) uses an LLM judge to attribute each rubric item to a specific reasoning step, (ii) normalizes per-step rubric scores across rollouts so only steps whose quality varies produce a learning signal, and (iii) combines the per-step reward with the outcome reward through a decoupled advantage estimator that keeps the outcome baseline stable. We further build a 16K-problem rubric dataset by contrastively distilling rubric items from correct and flawed reasoning paths sampled from a strong model. Across six mathematical reasoning benchmarks, SRaR improves average accuracy over RaR by 3.57 points on Qwen3-8B and 2.75 points on Qwen3-32B, raises the Faithful Reasoning Rate on AIME 2025 from 34.5% to 46.7%, and reduces self-correction looping from 48.1% to 26.5%.

preprint2026arXiv

Transient Large-Scale Anisotropy in TeV Cosmic Rays due to an Interplanetary Coronal Mass Ejection

Large- or medium-scale cosmic ray anisotropy at TeV energies has not previously been confirmed to vary with time. Transient anisotropy changes have been observed below 150 GeV, especially near the passage of an interplanetary shock and coronal mass ejection containing a magnetic flux rope ejected by a solar storm, which can trigger a geomagnetic storm with practical consequences. In such events, cosmic rays provide remote sensing of the magnetic field properties. Here we report the observation of transient large-scale anisotropy in TeV cosmic ray ions using data from the Large High Altitude Air Shower Observatory (LHAASO). We analyze hourly skymaps of the transient cosmic ray intensity excess or deficit, the gradient of which indicates the direction and magnitude of transient large-scale anisotropy across the field of view. We observe enhanced anisotropy above typical hourly fluctuations with $>$5$σ$ significance during some hours of November 4, 2021, in separate data sets for four primary cosmic ray energy ranges of median energy from $E$=0.7 to 3.1 TeV. The gradient varies with energy as $E^γ$, where $γ\approx-0.5$. At a median energy $\leq$1.0 TeV, this gradient corresponds to a dipole anisotropy of at least 1\%, or possibly a weaker anisotropy of higher order. This new type of observation opens the opportunity to study interplanetary magnetic structures using air shower arrays around the world, complementing existing in situ and remote measurements of plasma properties.

preprint2026arXiv

Variational Matrix-Learning Fourier Networks for Parametric Multiphysics Surrogates

Multiphysics simulation is critical for system-technology co-optimization (STCO) in chiplet-based design, but repeated finite-element solutions of PDE-governed problems are computationally expensive in parametric design exploration. This paper proposes a variational matrix-learning Fourier network (VMLFN) for efficient parametric multiphysics surrogate modeling. VMLFN constructs a log-space sine neural representation with randomly sampled spectral frequencies, frequency-dependent decay regulation, and embedded Dirichlet boundary conditions. With fixed hidden-layer parameters, the output-layer weights are determined by reformulating the governing PDEs into variational weak forms and enforcing the stationarity condition of the resulting energy functional. This converts physics-informed training into a linear matrix-solving problem, requiring only first-order derivatives and avoiding both high-order automatic differentiation and penalty-coefficient tuning. A heuristic frequency-scanning algorithm is further introduced to select a problem-adaptive maximum frequency that covers the dominant spectral range of the target problem. The proposed method is validated on heat conduction, solid mechanics, and Helmholtz wave propagation problems. Results from five benchmark cases demonstrate that VMLFN delivers accurate full-field predictions with substantial speedup over conventional physics-informed neural networks and repeated finite-element simulations.

preprint2026arXiv

VCG-Bench: Towards A Unified Visual-Centric Benchmark for Structured Generation and Editing

Despite the rapid advancements in Vision-Language Models (VLMs), a critical gap remains in their ability to handle structured, controllable diagrammatic tasks essential for professional workflows. Existing methods predominantly rely on pixel-based synthesis, which operates in probabilistic pixel spaces and is inherently limited in editability and fidelity. Instead, we propose a new Diagram-as-Code paradigm with symbolic logic that leverages mxGraph Extensible Markup Language (XML) for precise diagram generation and editing. We present VCG-Bench, a unified benchmark for visual-centric \texttt{mxGraph} tasks. VCG-Bench comprises: (1) a taxonomized dataset of 1,449 diverse diagrams spanning 6 domains and 15 sub-domains, (2) a paradigm definition that integrates Generation (Vision-to-Code) and Editability (Code-to-Code), (3) a Tailored Evaluation Protocol employing multi-dimensional metrics such as \texttt{mxGraph} Execution Success Rate, Style Consistency Score (SCS), etc. Experimental results highlight the challenges faced by current State-of-the-Art (SOTA) VLMs in structured fidelity and instruction compliance, reflecting their vision and reasoning capabilities.

preprint2025arXiv

Antarctic TianMu Staring Observation Project I: Overview and Implementation of the Prototype Telescope

Wide-field rapid sky surveys serve as critical observational methods for time-domain astronomical research. The Antarctic region, with several months of continuous dark nights annually, is an ideal site for time-domain astronomical observations. The Antarctic TianMu Staring Observation Project aims to deploy a fleet of small telescopes, adopting an array observation model to conduct time-domain optical observations in Antarctica, featuring wide-sky coverage, high-cadence sampling, long-period staring, and simultaneous multi-band measurements. Considering the severe challenges optical telescopes face in Antarctica, including extremely low temperatures, unattended operation, and limited power supply and network transmission, we have designed and developed the Antarctic TianMu prototype telescope based on drift-scan charge-coupled device technology. In October 2022, our prototype (with an aperture of 18 cm), named AT-Proto was transported to Zhongshan Station in Antarctica aboard China's 39th Antarctic Research Expedition. It has since operated stably and reliably in the frigid environment for over two years, demonstrating the significant advantages of this technology in polar astronomical observations. The experimental observation results of AT-Proto provide a solid foundation for the subsequent construction of a time-domain astronomy observation array in Antarctica.

preprint2025arXiv

Observation of the $γ$-ray Emission from W43 with LHAASO

In this paper, we report the detection of the very-high-energy (VHE, $ 100{\rm\ GeV} < E < 100{\rm\ TeV} $) and ultra-high-energy (UHE, $E > 100\rm\ TeV$) $γ$-ray emissions from the direction of the young star-forming region W43, observed by the Large High Altitude Air Shower Observation (LHAASO). The extended $γ$-ray source was detected with a significance of ${\sim}16\,σ$ by KM2A and ${\sim}17\,σ$ by WCDA, respectively. The angular extension of this $γ$-ray source is about 0.5 degrees, corresponding to a physical size of about 50 pc. We discuss the origin of the $γ$-ray emission and possible cosmic ray acceleration in the W43 region using multi-wavelength data. Our findings suggest that W43 is likely another young star cluster capable of accelerating cosmic rays (CRs) to at least several hundred TeV.

preprint2025arXiv

Spiking Heterogeneous Graph Attention Networks

Real-world graphs or networks are usually heterogeneous, involving multiple types of nodes and relationships. Heterogeneous graph neural networks (HGNNs) can effectively handle these diverse nodes and edges, capturing heterogeneous information within the graph, thus exhibiting outstanding performance. However, most methods of HGNNs usually involve complex structural designs, leading to problems such as high memory usage, long inference time, and extensive consumption of computing resources. These limitations pose certain challenges for the practical application of HGNNs, especially for resource-constrained devices. To mitigate this issue, we propose the Spiking Heterogeneous Graph Attention Networks (SpikingHAN), which incorporates the brain-inspired and energy-saving properties of Spiking Neural Networks (SNNs) into heterogeneous graph learning to reduce the computing cost without compromising the performance. Specifically, SpikingHAN aggregates metapath-based neighbor information using a single-layer graph convolution with shared parameters. It then employs a semantic-level attention mechanism to capture the importance of different meta-paths and performs semantic aggregation. Finally, it encodes the heterogeneous information into a spike sequence through SNNs, simulating bioinformatic processing to derive a binarized 1-bit representation of the heterogeneous graph. Comprehensive experimental results from three real-world heterogeneous graph datasets show that SpikingHAN delivers competitive node classification performance. It achieves this with fewer parameters, quicker inference, reduced memory usage, and lower energy consumption. Code is available at https://github.com/QianPeng369/SpikingHAN.

preprint2025arXiv

Study of Ultra-High-Energy Gamma-Ray Source 1LHAASO J0056+6346u and Its Possible Origins

We report a dedicated study of the newly discovered extended UHE $γ$-ray source 1LHAASO J0056+6346u. Analyzing 979 days of LHAASO-WCDA data and 1389 days of LHAASO-KM2A data, we observed a significant excess of $γ$-ray events with both WCDA and KM2A. Assuming a point power-law source with a fixed spectral index, the significance maps reveal excesses of ${\sim}12.65\,σ$, ${\sim}22.18\,σ$, and ${\sim}10.24\,σ$ in the energy ranges of 1--25 TeV, 25--100 TeV, and $> 100$ TeV, respectively. We use a 3D likelihood algorithm to derive the morphological and spectral parameters, and the source is detected with significances of $12.65\,σ$ by WCDA and $25.27\,σ$ by KM2A. The best-fit positions derived from WCDA and KM2A data are (R.A. = $13.96^\circ\pm0.09^\circ$, Decl. = $63.92^\circ\pm0.05^\circ$) and (R.A. = $14.00^\circ\pm0.05^\circ$, Decl. = $63.79^\circ\pm0.02^\circ$), respectively. The angular size ($r_{39}$) of 1LHAASO J0056+6346u is $0.34^\circ\pm0.04^\circ$ at 1--25 TeV and $0.24^\circ\pm0.02^\circ$ at $> 25$ TeV. The differential flux of this UHE $γ$-ray source can be described by an exponential cutoff power-law function: $(2.67\pm0.25) \times 10^{-15} (E/20\,\text{TeV})^{-1.97\pm0.10} e^{-E/(55.1\pm7.2)\,\text{TeV}} \,\text{TeV}^{-1}\,\text{cm}^{-2}\,\text{s}^{-1}$. To explore potential sources of $γ$-ray emission, we investigated the gas distribution around 1LHAASO J0056+6346u. 1LHAASO J0056+6346u is likely to be a TeV PWN powered by an unknown pulsar, which would naturally explain both its spatial and spectral properties. Another explanation is that this UHE $γ$-ray source might be associated with gas content illuminated by a nearby CR accelerator, possibly the SNR candidate G124.0+1.4.

preprint2025arXiv

Ultrahigh-Energy Gamma-ray Emission Associated with Black Hole-Jet Systems

Black holes (BH), one of the most intriguing objects in the universe, can manifest themselves through electromagnetic radiation initiated by the accretion flow. Some stellar-mass BHs drive relativistic jets when accreting matter from their companion stars, forming microquasars. Non-thermal emission from the radio to tera-electronvolt (TeV) gamma-ray band has been observed from microquasars, indicating the acceleration of relativistic particles. Here we report detection of four microquasars (SS 433, V4641 Sgr, GRS 1915+105, MAXI J1820+070) of spectrum extending to the ultrahigh-energy (UHE; photon energy $E>100$ TeV) band and one microquasar (Cygnus X-1) of spectrum approaching 100 TeV, using the Large High Altitude Air Shower Observatory (LHAASO). Notably, the total emission associated with SS 433 cannot be interpreted with a single leptonic component. In the UHE band, its emission is in spatial coincidence with a giant atomic cloud, which is consistent with a hadronic origin. An elongated source is discovered from V4641 Sgr with the spectrum continuing up to 800 TeV. The detection of UHE gamma rays demonstrates that accreting BHs and their environments can operate as extremely efficient accelerators of particles out of 1 peta-electronvolt (PeV), suggesting microquasars to be important contributors to Galactic cosmic rays especially around the `knee' region.

preprint2025arXiv

Volcano Architecture for Scalable Quantum Processor Units

Quantum information processing platforms based on array of matter qubits, such as neutral atoms, trapped ions, and quantum dots, face significant challenges in scalable addressing and readout as system sizes increase. Here, we propose the "Volcano" architecture that establishes a new quantum processing unit implementation method based on optical channel mapping on a arbitrarily arranged static qubit array. To support the feasibility of Volcano architecture, we show a proof-of-principle demonstration by employing a photonic chip that leverages custom-designed three-dimensional waveguide structures to transform one-dimensional beam arrays into arbitrary two-dimensional output patterns matching qubit array geometries. We demonstrate parallel and independent control of 49-channel with negligible crosstalk and high uniformity. This architecture addresses the challenges in scaling up quantum processors, including both the classical link for parallel qubit control and the quantum link for efficient photon collection, and holds the potential for interfacing with neutral atom arrays and trapped ion crystals, as well as networking of heterogeneous quantum systems.

preprint2024arXiv

Expected Transaction Value Optimization for Precise Marketing in FinTech Platforms

FinTech platforms facilitated by digital payments are watching growth rapidly, which enable the distribution of mutual funds personalized to individual investors via mobile Apps. As the important intermediation of financial products investment, these platforms distribute thousands of mutual funds obtaining impressions under guaranteed delivery (GD) strategy required by fund companies. Driven by the profit from fund purchases of users, the platform aims to maximize each transaction amount of customers by promoting mutual funds to these investors who will be interested in. Different from the conversions in traditional advertising or e-commerce recommendations, the investment amount in each purchase varies greatly even for the same financial product, which provides a significant challenge for the promotion recommendation of mutual funds. In addition to predicting the click-through rate (CTR) or the conversion rate (CVR) as in traditional recommendations, it is essential for FinTech platforms to estimate the customers' purchase amount for each delivered fund and achieve an effective allocation of impressions based on the predicted results to optimize the total expected transaction value (ETV). In this paper, we propose an ETV optimized customer allocation framework (EOCA) that aims to maximize the total ETV of recommended funds, under the constraints of GD dealt with fund companies. To the best of our knowledge, it's the first attempt to solve the GD problem for financial product promotions based on customer purchase amount prediction. We conduct extensive experiments on large scale real-world datasets and online tests based on LiCaiTong, Tencent wealth management platform, to demonstrate the effectiveness of our proposed EOCA framework.

preprint2023arXiv

CARD: Semantic Segmentation with Efficient Class-Aware Regularized Decoder

Semantic segmentation has recently achieved notable advances by exploiting "class-level" contextual information during learning. However, these approaches simply concatenate class-level information to pixel features to boost the pixel representation learning, which cannot fully utilize intra-class and inter-class contextual information. Moreover, these approaches learn soft class centers based on coarse mask prediction, which is prone to error accumulation. To better exploit class level information, we propose a universal Class-Aware Regularization (CAR) approach to optimize the intra-class variance and inter-class distance during feature learning, motivated by the fact that humans can recognize an object by itself no matter which other objects it appears with. Moreover, we design a dedicated decoder for CAR (CARD), which consists of a novel spatial token mixer and an upsampling module, to maximize its gain for existing baselines while being highly efficient in terms of computational cost. Specifically, CAR consists of three novel loss functions. The first loss function encourages more compact class representations within each class, the second directly maximizes the distance between different class centers, and the third further pushes the distance between inter-class centers and pixels. Furthermore, the class center in our approach is directly generated from ground truth instead of from the error-prone coarse prediction. CAR can be directly applied to most existing segmentation models during training, and can largely improve their accuracy at no additional inference overhead. Extensive experiments and ablation studies conducted on multiple benchmark datasets demonstrate that the proposed CAR can boost the accuracy of all baseline models by up to 2.23% mIOU with superior generalization ability. CARD outperforms SOTA approaches on multiple benchmarks with a highly efficient architecture.

preprint2022arXiv

A Survey of Adversarial Learning on Graphs

Deep learning models on graphs have achieved remarkable performance in various graph analysis tasks, e.g., node classification, link prediction, and graph clustering. However, they expose uncertainty and unreliability against the well-designed inputs, i.e., adversarial examples. Accordingly, a line of studies has emerged for both attack and defense addressed in different graph analysis tasks, leading to the arms race in graph adversarial learning. Despite the booming works, there still lacks a unified problem definition and a comprehensive review. To bridge this gap, we investigate and summarize the existing works on graph adversarial learning tasks systemically. Specifically, we survey and unify the existing works w.r.t. attack and defense in graph analysis tasks, and give appropriate definitions and taxonomies at the same time. Besides, we emphasize the importance of related evaluation metrics, investigate and summarize them comprehensively. Hopefully, our works can provide a comprehensive overview and offer insights for the relevant researchers. Latest advances in graph adversarial learning are summarized in our GitHub repository https://github.com/EdisonLeeeee/Graph-Adversarial-Learning.

preprint2022arXiv

A Survey of Trustworthy Graph Learning: Reliability, Explainability, and Privacy Protection

Deep graph learning has achieved remarkable progresses in both business and scientific areas ranging from finance and e-commerce, to drug and advanced material discovery. Despite these progresses, how to ensure various deep graph learning algorithms behave in a socially responsible manner and meet regulatory compliance requirements becomes an emerging problem, especially in risk-sensitive domains. Trustworthy graph learning (TwGL) aims to solve the above problems from a technical viewpoint. In contrast to conventional graph learning research which mainly cares about model performance, TwGL considers various reliability and safety aspects of the graph learning framework including but not limited to robustness, explainability, and privacy. In this survey, we provide a comprehensive review of recent leading approaches in the TwGL field from three dimensions, namely, reliability, explainability, and privacy protection. We give a general categorization for existing work and review typical work for each category. To give further insights for TwGL research, we provide a unified view to inspect previous works and build the connection between them. We also point out some important open problems remaining to be solved in the future developments of TwGL.

preprint2022arXiv

ATP: AMRize Then Parse! Enhancing AMR Parsing with PseudoAMRs

As Abstract Meaning Representation (AMR) implicitly involves compound semantic annotations, we hypothesize auxiliary tasks which are semantically or formally related can better enhance AMR parsing. We find that 1) Semantic role labeling (SRL) and dependency parsing (DP), would bring more performance gain than other tasks e.g. MT and summarization in the text-to-AMR transition even with much less data. 2) To make a better fit for AMR, data from auxiliary tasks should be properly "AMRized" to PseudoAMR before training. Knowledge from shallow level parsing tasks can be better transferred to AMR Parsing with structure transform. 3) Intermediate-task learning is a better paradigm to introduce auxiliary tasks to AMR parsing, compared to multitask learning. From an empirical perspective, we propose a principled method to involve auxiliary tasks to boost AMR parsing. Extensive experiments show that our method achieves new state-of-the-art performance on different benchmarks especially in topology-related scores.

preprint2022arXiv

CAR: Class-aware Regularizations for Semantic Segmentation

Recent segmentation methods, such as OCR and CPNet, utilizing "class level" information in addition to pixel features, have achieved notable success for boosting the accuracy of existing network modules. However, the extracted class-level information was simply concatenated to pixel features, without explicitly being exploited for better pixel representation learning. Moreover, these approaches learn soft class centers based on coarse mask prediction, which is prone to error accumulation. In this paper, aiming to use class level information more effectively, we propose a universal Class-Aware Regularization (CAR) approach to optimize the intra-class variance and inter-class distance during feature learning, motivated by the fact that humans can recognize an object by itself no matter which other objects it appears with. Three novel loss functions are proposed. The first loss function encourages more compact class representations within each class, the second directly maximizes the distance between different class centers, and the third further pushes the distance between inter-class centers and pixels. Furthermore, the class center in our approach is directly generated from ground truth instead of from the error-prone coarse prediction. Our method can be easily applied to most existing segmentation models during training, including OCR and CPNet, and can largely improve their accuracy at no additional inference overhead. Extensive experiments and ablation studies conducted on multiple benchmark datasets demonstrate that the proposed CAR can boost the accuracy of all baseline models by up to 2.23% mIOU with superior generalization ability. The complete code is available at https://github.com/edwardyehuang/CAR.

preprint2022arXiv

Detection of DC electric forces with zeptonewton sensitivity by single-ion phonon laser

Detecting extremely small forces helps exploring new physics quantitatively. Here we demonstrate that the phonon laser made of a single trapped $^{40}$Ca$^{+}$ ion behaves as an exquisite sensor for small force measurement. We report our successful detection of small electric forces regarding the DC trapping potential with sensitivity of 2.41$\pm$0.49 zN/$\sqrt{\rm Hz}$, with the ion only under Doppler cooling, based on the injection-locking of the oscillation phase of the phonon laser in addition to the classical squeezing applied to suppress the measurement uncertainty. We anticipate that such a single-ion sensor would reach a much better force detection sensitivity in the future once the trapping system is further improved and the fluorescence collection efficiency is further enhanced.

preprint2022arXiv

Enhanced DeepONet for Modeling Partial Differential Operators Considering Multiple Input Functions

Machine learning, especially deep learning is gaining much attention due to the breakthrough performance in various cognitive applications. Recently, neural networks (NN) have been intensively explored to model partial differential equations as NN can be viewed as universal approximators for nonlinear functions. A deep network operator (DeepONet) architecture was proposed to model the general non-linear continuous operators for partial differential equations (PDE) due to its better generalization capabilities than existing mainstream deep neural network architectures. However, existing DeepONet can only accept one input function, which limits its application. In this work, we explore the DeepONet architecture to extend it to accept two or more input functions. We propose new Enhanced DeepONet or EDeepONet high-level neural network structure, in which two input functions are represented by two branch DNN sub-networks, which are then connected with output truck network via inner product to generate the output of the whole neural network. The proposed EDeepONet structure can be easily extended to deal with multiple input functions. Our numerical results on modeling two partial differential equation examples shows that the proposed enhanced DeepONet is about 7X-17X or about one order of magnitude more accurate than the fully connected neural network and is about 2X-3X more accurate than a simple extended DeepONet for both training and test.

preprint2022arXiv

Enhancing MR Image Segmentation with Realistic Adversarial Data Augmentation

The success of neural networks on medical image segmentation tasks typically relies on large labeled datasets for model training. However, acquiring and manually labeling a large medical image set is resource-intensive, expensive, and sometimes impractical due to data sharing and privacy issues. To address this challenge, we propose AdvChain, a generic adversarial data augmentation framework, aiming at improving both the diversity and effectiveness of training data for medical image segmentation tasks. AdvChain augments data with dynamic data augmentation, generating randomly chained photo-metric and geometric transformations to resemble realistic yet challenging imaging variations to expand training data. By jointly optimizing the data augmentation model and a segmentation network during training, challenging examples are generated to enhance network generalizability for the downstream task. The proposed adversarial data augmentation does not rely on generative networks and can be used as a plug-in module in general segmentation networks. It is computationally efficient and applicable for both low-shot supervised and semi-supervised learning. We analyze and evaluate the method on two MR image segmentation tasks: cardiac segmentation and prostate segmentation with limited labeled data. Results show that the proposed approach can alleviate the need for labeled data while improving model generalization ability, indicating its practical value in medical imaging applications.

preprint2022arXiv

Exact mean-field models for spiking neural networks with adaptation

Networks of spiking neurons with adaption have been shown to be able to reproduce a wide range of neural activities, including the emergent population bursting and spike synchrony that underpin brain disorders and normal function. Exact mean-field models derived from spiking neural networks are extremely valuable, as such models can be used to determine how individual neuron and network parameters interact to produce macroscopic network behaviour. In the paper, we derive and analyze a set of exact mean-field equations for the neural network with spike frequency adaptation. Specifically, our model is a network of Izhikevich neurons, where each neuron is modeled by a two dimensional system consisting of a quadratic integrate and fire equation plus an equation which implements spike frequency adaptation. Previous work deriving a mean-field model for this type of network, relied on the assumption of sufficiently slow dynamics of the adaptation variable. However, this approximation did not succeeded in establishing an exact correspondence between the macroscopic description and the realistic neural network, especially when the adaptation time constant was not large. The challenge lies in how to achieve a closed set of mean-field equations with the inclusion of the mean-field expression of the adaptation variable. We address this challenge by using a Lorentzian ansatz combined with the moment closure approach to arrive at the mean-field system in the thermodynamic limit. The resulting macroscopic description is capable of qualitatively and quantitatively describing the collective dynamics of the neural network, including transition between tonic firing and bursting.

preprint2022arXiv

Faithful learning with sure data for lung nodule diagnosis

Recent evolution in deep learning has proven its value for CT-based lung nodule classification. Most current techniques are intrinsically black-box systems, suffering from two generalizability issues in clinical practice. First, benign-malignant discrimination is often assessed by human observers without pathologic diagnoses at the nodule level. We termed these data as "unsure data". Second, a classifier does not necessarily acquire reliable nodule features for stable learning and robust prediction with patch-level labels during learning. In this study, we construct a sure dataset with pathologically-confirmed labels and propose a collaborative learning framework to facilitate sure nodule classification by integrating unsure data knowledge through nodule segmentation and malignancy score regression. A loss function is designed to learn reliable features by introducing interpretability constraints regulated with nodule segmentation maps. Furthermore, based on model inference results that reflect the understanding from both machine and experts, we explore a new nodule analysis method for similar historical nodule retrieval and interpretable diagnosis. Detailed experimental results demonstrate that our approach is beneficial for achieving improved performance coupled with faithful model reasoning for lung cancer prediction. Extensive cross-evaluation results further illustrate the effect of unsure data for deep-learning-based methods in lung nodule classification.

preprint2022arXiv

Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation

Label smoothing and vocabulary sharing are two widely used techniques in neural machine translation models. However, we argue that simply applying both techniques can be conflicting and even leads to sub-optimal performance. When allocating smoothed probability, original label smoothing treats the source-side words that would never appear in the target language equally to the real target-side words, which could bias the translation model. To address this issue, we propose Masked Label Smoothing (MLS), a new mechanism that masks the soft label probability of source-side words to zero. Simple yet effective, MLS manages to better integrate label smoothing with vocabulary sharing. Our extensive experiments show that MLS consistently yields improvement over original label smoothing on different datasets, including bilingual and multilingual translation from both translation quality and model's calibration. Our code is released at https://github.com/PKUnlp-icler/MLS

preprint2022arXiv

Hierarchical Curriculum Learning for AMR Parsing

Abstract Meaning Representation (AMR) parsing aims to translate sentences to semantic representation with a hierarchical structure, and is recently empowered by pretrained sequence-to-sequence models. However, there exists a gap between their flat training objective (i.e., equally treats all output tokens) and the hierarchical AMR structure, which limits the model generalization. To bridge this gap, we propose a Hierarchical Curriculum Learning (HCL) framework with Structure-level (SC) and Instance-level Curricula (IC). SC switches progressively from core to detail AMR semantic elements while IC transits from structure-simple to -complex AMR instances during training. Through these two warming-up processes, HCL reduces the difficulty of learning complex structures, thus the flat model can better adapt to the AMR hierarchy. Extensive experiments on AMR2.0, AMR3.0, structure-complex and out-of-distribution situations verify the effectiveness of HCL.

preprint2022arXiv

Neighboring Backdoor Attacks on Graph Convolutional Network

Backdoor attacks have been widely studied to hide the misclassification rules in the normal models, which are only activated when the model is aware of the specific inputs (i.e., the trigger). However, despite their success in the conventional Euclidean space, there are few studies of backdoor attacks on graph structured data. In this paper, we propose a new type of backdoor which is specific to graph data, called neighboring backdoor. Considering the discreteness of graph data, how to effectively design the triggers while retaining the model accuracy on the original task is the major challenge. To address such a challenge, we set the trigger as a single node, and the backdoor is activated when the trigger node is connected to the target node. To preserve the model accuracy, the model parameters are not allowed to be modified. Thus, when the trigger node is not connected, the model performs normally. Under these settings, in this work, we focus on generating the features of the trigger node. Two types of backdoors are proposed: (1) Linear Graph Convolution Backdoor which finds an approximation solution for the feature generation (can be viewed as an integer programming problem) by looking at the linear part of GCNs. (2) Variants of existing graph attacks. We extend current gradient-based attack methods to our backdoor attack scenario. Extensive experiments on two social networks and two citation networks datasets demonstrate that all proposed backdoors can achieve an almost 100\% attack success rate while having no impact on predictive accuracy.

preprint2022arXiv

Re-thinking and Re-labeling LIDC-IDRI for Robust Pulmonary Cancer Prediction

The LIDC-IDRI database is the most popular benchmark for lung cancer prediction. However, with subjective assessment from radiologists, nodules in LIDC may have entirely different malignancy annotations from the pathological ground truth, introducing label assignment errors and subsequent supervision bias during training. The LIDC database thus requires more objective labels for learning-based cancer prediction. Based on an extra small dataset containing 180 nodules diagnosed by pathological examination, we propose to re-label LIDC data to mitigate the effect of original annotation bias verified on this robust benchmark. We demonstrate in this paper that providing new labels by similar nodule retrieval based on metric learning would be an effective re-labeling strategy. Training on these re-labeled LIDC nodules leads to improved model performance, which is enhanced when new labels of uncertain nodules are added. We further infer that re-labeling LIDC is current an expedient way for robust lung cancer prediction while building a large pathological-proven nodule database provides the long-term solution.

preprint2022arXiv

Remarkably strong magnetic response in molecules with polar groups

For more than a century, electricity and magnetism have been believed to always exhibit inextricable link due to the symmetry in electromagnetism. At the interface, polar groups that have polar charges, are indispensable to be considered, which interact directly with other polar charges/external charges/external electric fields. However, there is no report on the corresponding magnetic properties on these polar groups. Clearly, such asymmetry, that is, only the interaction between the polar groups and charges, is out of bounds. Here we show that those molecules with considerable polar groups, such as cellulose acetate (CA) and other cellulose derivatives with different polar groups, can have strong magnetic response, indicating that they are strongly paramagnetic. Density functional theory (DFT) calculation shows that the polarity greatly reduces the excitation energy from the state without net spin (singlet) to the state with net spin (triplet), making the considerable existence of magnetic moments on the polar groups. We note that the hydrophobic groups in these molecules have no magnetic moments, however, they make the molecules aggregate to amply the magnetic effect of the magnetic moments in the polar groups, so that these magnetic moments can induce the strong paramagnetism. Our observations suggest a recovery of the symmetry with inextricable link between the electricity and magnetism at the interface. The findings leave many imaginations of the role of the magnetic interaction in biological systems as well as other magnetic applications considering that many of those polar materials are biological materials, pharmaceutical materials, chemical raw materials, and even an essential hormone in agricultural production.

preprint2022arXiv

Robust Dynamic State Estimator of Integrated Energy Systems based on Natural Gas Partial Differential Equations

The reliability and precision of dynamic database are vital for the optimal operating and global control of integrated energy systems. One of the effective ways to obtain the accurate states is state estimations. A novel robust dynamic state estimation methodology for integrated natural gas and electric power systems is proposed based on Kalman filter. To take full advantage of measurement redundancies and predictions for enhancing the estimating accuracy, the dynamic state estimation model coupling gas and power systems by gas turbine units is established. The exponential smoothing technique and gas physical model are integrated in Kalman filter. Additionally, the time-varying scalar matrix is proposed to conquer bad data in Kalman filter algorithm. The proposed method is applied to an integrated gas and power systems formed by GasLib-40 and IEEE 39-bus system with five gas turbine units. The simulating results show that the method can obtain the accurate dynamic states under three different measurement error conditions, and the filtering performance are better than separate estimation methods. Additionally, the proposed method is robust when the measurements experience bad data.

preprint2022arXiv

Self-Calibrated Efficient Transformer for Lightweight Super-Resolution

Recently, deep learning has been successfully applied to the single-image super-resolution (SISR) with remarkable performance. However, most existing methods focus on building a more complex network with a large number of layers, which can entail heavy computational costs and memory storage. To address this problem, we present a lightweight Self-Calibrated Efficient Transformer (SCET) network to solve this problem. The architecture of SCET mainly consists of the self-calibrated module and efficient transformer block, where the self-calibrated module adopts the pixel attention mechanism to extract image features effectively. To further exploit the contextual information from features, we employ an efficient transformer to help the network obtain similar features over long distances and thus recover sufficient texture details. We provide comprehensive results on different settings of the overall network. Our proposed method achieves more remarkable performance than baseline methods. The source code and pre-trained models are available at https://github.com/AlexZou14/SCET.

preprint2022arXiv

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection

Recent studies in deepfake detection have yielded promising results when the training and testing face forgeries are from the same dataset. However, the problem remains challenging when one tries to generalize the detector to forgeries created by unseen methods in the training dataset. This work addresses the generalizable deepfake detection from a simple principle: a generalizable representation should be sensitive to diverse types of forgeries. Following this principle, we propose to enrich the "diversity" of forgeries by synthesizing augmented forgeries with a pool of forgery configurations and strengthen the "sensitivity" to the forgeries by enforcing the model to predict the forgery configurations. To effectively explore the large forgery augmentation space, we further propose to use the adversarial training strategy to dynamically synthesize the most challenging forgeries to the current model. Through extensive experiments, we show that the proposed strategies are surprisingly effective (see Figure 1), and they could achieve superior performance than the current state-of-the-art methods. Code is available at \url{https://github.com/liangchen527/SLADD}.

preprint2022arXiv

Spiking Graph Convolutional Networks

Graph Convolutional Networks (GCNs) achieve an impressive performance due to the remarkable representation ability in learning the graph information. However, GCNs, when implemented on a deep network, require expensive computation power, making them difficult to be deployed on battery-powered devices. In contrast, Spiking Neural Networks (SNNs), which perform a bio-fidelity inference process, offer an energy-efficient neural architecture. In this work, we propose SpikingGCN, an end-to-end framework that aims to integrate the embedding of GCNs with the biofidelity characteristics of SNNs. The original graph data are encoded into spike trains based on the incorporation of graph convolution. We further model biological information processing by utilizing a fully connected layer combined with neuron nodes. In a wide range of scenarios (e.g. citation networks, image graph classification, and recommender systems), our experimental results show that the proposed method could gain competitive performance against state-of-the-art approaches. Furthermore, we show that SpikingGCN on a neuromorphic chip can bring a clear advantage of energy efficiency into graph data analysis, which demonstrates its great potential to construct environment-friendly machine learning models.

preprint2022arXiv

The Curvature of Spectral Energy Distribution and $γ$-ray Dominance of Fermi BL Lac Objects

The extragalactic $γ$-ray sky is dominated by blazars and their study plays an important role in understanding jet physics, cosmic evolution history and origin of ultra high energy cosmic rays. In this work, we study a large sample of BL Lac objects to investigate why some sources are detected in $γ$-rays, while others not. We selected 170 BL Lac objects, with measured synchrotron spectral curvature and Doppler factor, and divided them into Fermi-LAT detected (FBLs) and non-detected (NFBLs) sources. We show that FBLs have smaller curvature than NFBLs, even after getting rid of Doppler beaming effect. The BL Lac objects PKS 0048-09 and S5 0716+714 have similar synchrotron peak frequency and luminosity but different $γ$-ray dominance and their quasi-simultaneous broadband spectral energy distributions (SEDs) can be well fitted by a log-parabolic synchrotron self-Compton (SSC) model with same jet parameters except for the curvature and source size, assuming curvature being proportional to the size of emission region. Our results imply that for a given synchrotron luminosity, the different SED curvature and Compton dominance may account for the discrepancy between FBLs and NFBLs. We discuss these results in context of stochastic particle acceleration and radiation mechanisms.

preprint2021arXiv

A High-dimensional Sparse Fourier Transform in the Continuous Setting

In this paper, we theoretically propose a new hashing scheme to establish the sparse Fourier transform in high-dimensional space. The estimation of the algorithm complexity shows that this sparse Fourier transform can overcome the curse of dimensionality. To the best of our knowledge, this is the first polynomial-time algorithm to recover the high-dimensional continuous frequencies.

preprint2021arXiv

Analytical Solution of Magnetically Dominated Astrophysical Jets and Winds: Jet Launching, Acceleration, and Collimation

We present an analytical solution of a highly magnetized jet/wind flow. The left side of the general force-free jet/wind equation (the "pulsar" equation) is separated into a rotating and a nonrotating term. The two equations with either term can be solved analytically, and the two solutions match each other very well. Therefore, we obtain a general approximate solution of a magnetically dominated jet/wind, which covers from the nonrelativistic to relativistic regimes, with the drift velocity well matching the cold plasma velocity. The acceleration of a jet includes three stages. (1) The jet flow is located within the Alfvén critical surface (i.e. the light cylinder), has a nonrelativistic speed, and is dominated by toroidal motion. (2) The jet is beyond the Alfvén critical surface where the flow is dominated by poloidal motion and becomes relativistic. The total velocity in these two stages follows the same law $vΓ=ΩR$. (3) The evolution law is replaced by $vΓ\approx1/\left(θ\sqrt{2-ν}\right)$, where $θ$ is the half-opening angle of the jet and $0\leqν\leq2$ is a free parameter determined by the magnetic field configuration. This is because the earlier efficient acceleration finally breaks the causality connection between different parts in the jet, preventing a global solution. The jet has to carry local charges and currents to support an electromagnetic balance. This approximate solution is consistent with known theoretical results and numerical simulations, and it is more convenient to directly compare with observations. This theory may be used to constrain the spin of black holes in astrophysical jets.

preprint2021arXiv

Crossed-Time Delay Neural Network for Speaker Recognition

Time Delay Neural Network (TDNN) is a well-performing structure for DNN-based speaker recognition systems. In this paper we introduce a novel structure Crossed-Time Delay Neural Network (CTDNN) to enhance the performance of current TDNN. Inspired by the multi-filters setting of convolution layer from convolution neural network, we set multiple time delay units each with different context size at the bottom layer and construct a multilayer parallel network. The proposed CTDNN gives significant improvements over original TDNN on both speaker verification and identification tasks. It outperforms in VoxCeleb1 dataset in verification experiment with a 2.6% absolute Equal Error Rate improvement. In few shots condition CTDNN reaches 90.4% identification accuracy, which doubles the identification accuracy of original TDNN. We also compare the proposed CTDNN with another new variant of TDNN, FTDNN, which shows that our model has a 36% absolute identification accuracy improvement under few shots condition and can better handle training of a larger batch in a shorter training time, which better utilize the calculation resources. The code of the new model is released at https://github.com/chenllliang/CTDNN

preprint2021arXiv

Deep Reinforcement Learning with Spatio-temporal Traffic Forecasting for Data-Driven Base Station Sleep Control

To meet the ever increasing mobile traffic demand in 5G era, base stations (BSs) have been densely deployed in radio access networks (RANs) to increase the network coverage and capacity. However, as the high density of BSs is designed to accommodate peak traffic, it would consume an unnecessarily large amount of energy if BSs are on during off-peak time. To save the energy consumption of cellular networks, an effective way is to deactivate some idle base stations that do not serve any traffic demand. In this paper, we develop a traffic-aware dynamic BS sleep control framework, named DeepBSC, which presents a novel data-driven learning approach to determine the BS active/sleep modes while meeting lower energy consumption and satisfactory Quality of Service (QoS) requirements. Specifically, the traffic demands are predicted by the proposed GS-STN model, which leverages the geographical and semantic spatial-temporal correlations of mobile traffic. With accurate mobile traffic forecasting, the BS sleep control problem is cast as a Markov Decision Process that is solved by Actor-Critic reinforcement learning methods. To reduce the variance of cost estimation in the dynamic environment, we propose a benchmark transformation method that provides robust performance indicator for policy update. To expedite the training process, we adopt a Deep Deterministic Policy Gradient (DDPG) approach, together with an explorer network, which can strengthen the exploration further. Extensive experiments with a real-world dataset corroborate that our proposed framework significantly outperforms the existing methods.

preprint2021arXiv

Dynamic State Estimation for Integrated Natural Gas and Electric Power Systems

A dynamic state estimation method of integrated natural gas and electric power systems (IGESs) in proposed. Firstly, the coupling model of gas pipeline networks and power systems by gas turbine units (GTUs) is established. Secondly, the Kalman filter based linear DSE model for the IGES is built. The gas density and mass flow rate, as well as the real and imaginary parts of bus voltages are taken as states, which are predicted by the linearized fluid dynamic equations of gases and exponential smoothing techniques. Boundary conditions of pipeline networks are used as supplementary constraints in the system model. At last, the proposed method is applied to an IGES including a 30-node pipeline network and IEEE 39-bus system coupled by two GTUs. Two indexes are used to evaluate the DSE performance under three measurement error conditions, and the results show that the DSE can obtain the accurate dynamic states in different conditions.

preprint2021arXiv

GraphGallery: A Platform for Fast Benchmarking and Easy Development of Graph Neural Networks Based Intelligent Software

Graph Neural Networks (GNNs) have recently shown to be powerful tools for representing and analyzing graph data. So far GNNs is becoming an increasingly critical role in software engineering including program analysis, type inference, and code representation. In this paper, we introduce GraphGallery, a platform for fast benchmarking and easy development of GNNs based software. GraphGallery is an easy-to-use platform that allows developers to automatically deploy GNNs even with less domain-specific knowledge. It offers a set of implementations of common GNN models based on mainstream deep learning frameworks. In addition, existing GNNs toolboxes such as PyG and DGL can be easily incorporated into the platform. Experiments demonstrate the reliability of implementations and superiority in fast coding. The official source code of GraphGallery is available at https://github.com/EdisonLeeeee/GraphGallery and a demo video can be found at https://youtu.be/mv7Zs1YeaYo.

preprint2021arXiv

Hysteresis bifurcation and application to delayed Fitzhugh-Nagumo neural systems

Hysteresis dynamics has been described in a vast number of biological experimental studies. Many such studies are phenomenological and a mathematical appreciation has not attracted enough attention. In the paper, we explore the nature of hysteresis and study it from the dynamical system point of view by using the bifurcation and perturbation theories. We firstly make a classification of hysteresis according to the system behaviours transiting between different types of attractors. Then, we focus on a mathematically amenable situation where hysteretic movements between the equilibrium point and the limit cycle are initiated by a subcritical Hopf bifurcation and a saddle-node bifurcation of limit cycles. We present a analytical framework by using the method of multiple scales to obtain the normal form up to the fifth order. Theoretical results are compared with time domain simulations and numerical continuation, showing good agreement. Although we consider the time-delayed FitzHugh-Nagumo neural system in the paper, the generalization should be clear to other systems or parameters. The general framework we present in the paper can be naturally extended to the notion of bursting activity in neuroscience where hysteresis is a dominant mechanism to generate bursting oscillations.

preprint2021arXiv

Lower Bound on the Optimal Access Bandwidth of ($K+2,K,2$)-MDS Array Code with Degraded Read Friendly

Accessing the data in the failed disk (degraded read) with low latency is crucial for an erasure-coded storage system. In this work, the maximum distance separable (MDS) array code with the property of degraded-read friendly (DRF) is discussed. For the DRF MDS array code with 2 redundant nodes and the sub-packetization level of 2, the lower bound of its access bandwidth is derived.

preprint2021arXiv

On the origin of GeV spectral break for Fermi blazars: 3C 454.3

The GeV break in spectra of the blazar 3C 454.3 is a special observation feature that has been discovered by the {\it Fermi}-LAT. The origin of the GeV break in the spectra is still under debate. In order to explore the possible source of GeV spectral break in 3C 454.3, a one-zone homogeneous leptonic jet model, as well as the {\it McFit} technique are utilized for fitting the quasi-simultaneous multi-waveband spectral energy distribution (SED) of 3C 454.3. The outside border of the broad-line region (BLR) and inner dust torus are chosen to contribute radiation in the model as external, seed photons to the external-Compton process, considering the observed $γ$-ray radiation. The combination of two components, namely the Compton-scattered BLR and dust torus radiation, assuming a broken power-law distribution of emitted particles, provides a proper fitting to the multi-waveband SED of 3C 454.3 detected 2008 Aug 3 - Sept 2 and explains the GeV spectral break. We propose that the spectral break of 3C 454.3 may originate from an inherent break in the energy distribution of the emitted particles and the Klein-Nishina effect. A comparison is performed between the energy density of the 'external' photon field for the whole BLR $U_{\rm BLR}$ achieved via model fitting and that constrained from the BLR data. The distance from the position of the $γ$-ray radiation area of 3C 454.3 to the central black hole could be constrained at $\sim 0.78$pc ($\sim 4.00 R_{\rm BLR}$, the size of the BLR).

preprint2021arXiv

Robust Kalman filter-based dynamic state estimation of natural gas pipeline networks

To obtain the accurate transient states of the big scale natural gas pipeline networks under the bad data and non-zero mean noises conditions, a robust Kalman filter-based dynamic state estimation method is proposed using the linearized gas pipeline transient flow equations in this paper. Firstly, the dynamic state estimation model is built. Since the gas pipeline transient flow equations are less than the states, the boundary conditions are used as supplementary constraints to predict the transient states. To increase the measurement redundancy, the zero mass flow rate constraints at the sink nodes are taken as virtual measurements. Secondly, to ensure the stability under bad data condition, the robust Kalman filter algorithm is proposed by introducing a time-varying scalar matrix to regulate the measurement error variances correctly according to the innovation vector at every time step. At last, the proposed method is applied to a 30-node gas pipeline networks in several kinds of measurement conditions. The simulation shows that the proposed robust dynamic state estimation can decrease the effects of bad data and achieve better estimating results.

preprint2021arXiv

Structural engineering from an inverse problems perspective

The field of structural engineering is vast, spanning areas from the design of new infrastructure to the assessment of existing infrastructure. From the onset, traditional entry-level university courses teach students to analyse structural response given data including external forces, geometry, member sizes, restraint, etc. -- characterising a forward problem (structural causalities $\to$ structural response). Shortly thereafter, junior engineers are introduced to structural design where they aim to, for example, select an appropriate structural form for members based on design criteria, which is the inverse of what they previously learned. Similar inverse realisations also hold true in structural health monitoring and a number of structural engineering sub-fields (response $\to$ structural causalities). In this light, we aim to demonstrate that many structural engineering sub-fields may be fundamentally or partially viewed as inverse problems and thus benefit via the rich and established methodologies from the inverse problems community. To this end, we conclude that the future of inverse problems in structural engineering is inexorably linked to engineering education and machine learning developments.

preprint2020arXiv

AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor x4 based on a set of prior examples of low and corresponding high resolution images. The goal is to devise a network that reduces one or several aspects such as runtime, parameter count, FLOPs, activations, and memory consumption while at least maintaining PSNR of MSRResNet. The track had 150 registered participants, and 25 teams submitted the final results. They gauge the state-of-the-art in efficient single image super-resolution.

preprint2020arXiv

DeepCP: Deep Learning Driven Cascade Prediction Based Autonomous Content Placement in Closed Social Network

Online social networks (OSNs) are emerging as the most popular mainstream platform for content cascade diffusion. In order to provide satisfactory quality of experience (QoE) for users in OSNs, much research dedicates to proactive content placement by using the propagation pattern, user's personal profiles and social relationships in open social network scenarios (e.g., Twitter and Weibo). In this paper, we take a new direction of popularity-aware content placement in a closed social network (e.g., WeChat Moment) where user's privacy is highly enhanced. We propose a novel data-driven holistic deep learning framework, namely DeepCP, for joint diffusion-aware cascade prediction and autonomous content placement without utilizing users' personal and social information. We first devise a time-window LSTM model for content popularity prediction and cascade geo-distribution estimation. Accordingly, we further propose a novel autonomous content placement mechanism CP-GAN which adopts the generative adversarial network (GAN) for agile placement decision making to reduce the content access latency and enhance users' QoE. We conduct extensive experiments using cascade diffusion traces in WeChat Moment (WM). Evaluation results corroborate that the proposed DeepCP framework can predict the content popularity with a high accuracy, generate efficient placement decision in a real-time manner, and achieve significant content access latency reduction over existing schemes.

preprint2020arXiv

Diversity of gamma-ray and Radio Variabilities of Bright Blazars and Implications for gamma-ray Emission Location

Violent multi-wavelength variabilities are observed in gamma-ray-selected blazars. We present an analysis of long-term light curves for eight bright blazars to explore the co-variation pattern in the gamma-ray and radio bands. We extract their gamma-ray light curves and spectra with data observed by the Fermi/LAT since 2008. We find diverse co-variation patterns between the gamma-ray and radio (at 43 GHz) fluxes in these sources. The gamma-ray and radio fluxes of 3C 454.3 and PKS 1633+382 are correlated without any time-lag, suggesting that they are from the same radiation region. Similar correlation is also observed in 3C 273 and PKS 1222+216, but the radio flux is lag behind the gamma-ray flux approximately ~160 days and ~290 days, respectively. This likely suggests that their gamma-ray emission regions are located at the upstream of their radio cores at 43 GHz. The gamma-ray and radio fluxes of the other four blazars are not correlated, implying that the gamma-ray and radio emission may be from different regions in their jets. The gamma-ray light curves of the eight blazars can be decomposed into some long timescale variability components and fast spike flares. We propose that they may be attributed to the central engine activity and the magnetic reconnection process or turbulence in the local emission region, respectively.

preprint2020arXiv

Interactive Path Reasoning on Graph for Conversational Recommendation

Traditional recommendation systems estimate user preference on items from past interaction history, thus suffering from the limitations of obtaining fine-grained and dynamic user preference. Conversational recommendation system (CRS) brings revolutions to those limitations by enabling the system to directly ask users about their preferred attributes on items. However, existing CRS methods do not make full use of such advantage -- they only use the attribute feedback in rather implicit ways such as updating the latent user representation. In this paper, we propose Conversational Path Reasoning (CPR), a generic framework that models conversational recommendation as an interactive path reasoning problem on a graph. It walks through the attribute vertices by following user feedback, utilizing the user preferred attributes in an explicit way. By leveraging on the graph structure, CPR is able to prune off many irrelevant candidate attributes, leading to better chance of hitting user preferred attributes. To demonstrate how CPR works, we propose a simple yet effective instantiation named SCPR (Simple CPR). We perform empirical studies on the multi-round conversational recommendation scenario, the most realistic CRS setting so far that considers multiple rounds of asking attributes and recommending items. Through extensive experiments on two datasets Yelp and LastFM, we validate the effectiveness of our SCPR, which significantly outperforms the state-of-the-art CRS methods EAR (arXiv:2002.09102) and CRM (arXiv:1806.03277). In particular, we find that the more attributes there are, the more advantages our method can achieve.

preprint2020arXiv

Modelling High-Order Social Relations for Item Recommendation

The prevalence of online social network makes it compulsory to study how social relations affect user choice. However, most existing methods leverage only first-order social relations, that is, the direct neighbors that are connected to the target user. The high-order social relations, e.g., the friends of friends, which very informative to reveal user preference, have been largely ignored. In this work, we focus on modeling the indirect influence from the high-order neighbors in social networks to improve the performance of item recommendation. Distinct from mainstream social recommenders that regularize the model learning with social relations, we instead propose to directly factor social relations in the predictive model, aiming at learning better user embeddings to improve recommendation. To address the challenge that high-order neighbors increase dramatically with the order size, we propose to recursively "propagate" embeddings along the social network, effectively injecting the influence of high-order neighbors into user representation. We conduct experiments on two real datasets of Yelp and Douban to verify our High-Order Social Recommender (HOSR) model. Empirical results show that our HOSR significantly outperforms recent graph regularization-based recommenders NSCR and IF-BPR+, and graph convolutional network-based social influence prediction model DeepInf, achieving new state-of-the-arts of the task.

preprint2020arXiv

On the Origin and Evolution of Curvature of the Spectral Energy Distribution of Fermi Bright Blazars

The origin and evolution of spectral curvature in blazar spectral energy distribution (SED) is still unclear. Since the observed SED curvature is related to an intrinsic curvature in emitting electron energy distribution (EED), we study this question by employing a log-parabolic EED with a curvature parameter and peak energy to model the quasi-simultaneous broadband SEDs of selected blazars in Fermi-LAT Bright AGN Sample (LBAS) using synchrotron and inverse Compton (IC) processes. We find that log-parabolic IC model can successfully explain the emission in all blazars in our sample. On average, FSRQs have higher magnetic field, Doppler factor, and curvature than BL Lac objects. The BL Lac objects show an anticorrelation between the curvature parameter of the EED and its peak energy, which is a signature of stochastic acceleration. FSRQs do not manifest such correlation and rather show a mild positive relationship between these parameters. This suggests that the evolution of spectral curvature in the BL Lac objects is dominated by a strong stochastic acceleration component, whereas the curvature in FSRQs evolves in a cooling dominated regime due to an additional external Compton (EC) component. The strong cooling in FSRQs not only restricts the electron peak energy but also adds extra curvature to the high energy tail of emitting EED. Since the curvature decreases from FSRQs toward high peak BL Lac objects (HBLs), opposite to peak energy, the curvature parameter can be considered a third parameter of the blazar sequence in addition to peak frequency and luminosity.

preprint2020arXiv

Photon directional profile from stimulated decay of axion clouds with arbitrary momentum distributions

We model clusters of axions with spherically symmetric spacial but arbitrary momentum distributions and study the directional profile of photos produced in their evolution through spontaneous and stimulated decay of axions via the process a --> gamma + gamma. Several specific examples are presented.

preprint2020arXiv

Real-Time Differential Epidemic Analysis and Prediction for COVID-19 Pandemic

In this paper, we propose a new real-time differential virus transmission model, which can give more accurate and robust short-term predictions of COVID-19 transmitted infectious disease with benefits of near-term trend projection. Different from the existing Susceptible-Exposed-Infected-Removed (SEIR) based virus transmission models, which fits well for pandemic modeling with sufficient historical data, the new model, which is also SEIR based, uses short history data to find the trend of the changing disease dynamics for the infected, the dead and the recovered so that it can naturally accommodate the adaptive real-time changes of disease mitigation, business activity and social behavior of populations. As the parameters of the improved SEIR models are trained by short history window data for accurate trend prediction, our differential epidemic model, essentially are window-based time-varying SEIR model. Since SEIR model still is a physics-based disease transmission model, its near-term (like one month) projection can still be very instrumental for policy makers to guide their decision for disease mitigation and business activity policy change in a real-time. This is especially useful if the pandemic lasts more than one year with different phases across the world like 1918 flu pandemic. Numerical results on the recent COVID-19 data from China, Italy and US, California and New York states have been analyzed.

preprint2020arXiv

Realistic Adversarial Data Augmentation for MR Image Segmentation

Neural network-based approaches can achieve high accuracy in various medical image segmentation tasks. However, they generally require large labelled datasets for supervised learning. Acquiring and manually labelling a large medical dataset is expensive and sometimes impractical due to data sharing and privacy issues. In this work, we propose an adversarial data augmentation method for training neural networks for medical image segmentation. Instead of generating pixel-wise adversarial attacks, our model generates plausible and realistic signal corruptions, which models the intensity inhomogeneities caused by a common type of artefacts in MR imaging: bias field. The proposed method does not rely on generative networks, and can be used as a plug-in module for general segmentation networks in both supervised and semi-supervised learning. Using cardiac MR imaging we show that such an approach can improve the generalization ability and robustness of models as well as provide significant improvements in low-data scenarios.

preprint2020arXiv

Remarkable antibacterial activity of reduced graphene oxide functionalized by copper ions

Despite long-term efforts for exploring antibacterial agents or drugs, it remains challenging how to potentiate antibacterial activity and meanwhile minimize toxicity hazards to the environment. Here, we experimentally show that the functionality of reduced graphene oxide (rGO) through copper ions displays selective antibacterial activity significantly stronger than that of rGO itself and no toxicity to mammalian cells. Remarkably, this antibacterial activity is two orders of magnitude greater than the activity of its surrounding copper ions. We demonstrate that the rGO is functionalized through the cation-$π$ interaction to massively adsorb copper ions to form a rGO-copper composite in solution and result in an extremely low concentration level of surrounding copper ions (less than ~0.5 $μM$). These copper ions on rGO are positively charged and strongly interact with negatively charged bacterial cells to selectively achieve antibacterial activity, while rGO exhibits the functionality to not only actuate rapid delivery of copper ions and massive assembly onto bacterial cells but also result in the valence shift in the copper ions from Cu$^{2+}$ into Cu$^{+}$ which greatly enhances the antibacterial activity. Notably, this functionality of rGO through cation-$π$ interaction with copper ions can similarly achieve algaecidal activity but does not exert cytotoxicity against neutrally charged mammalian cells. The remarkable selective antibacterial activity from the rGO functionality as well as the inherent broad-spectrum-antibacterial physical mechanism represents a significant step toward the development of a novel antibacterial material and reagent without environmental hazards for practical application.

preprint2020arXiv

Stochastic Time-Periodic Tonelli Lagrangian on Compact Manifold

In this paper, we study a class of time-periodic stochastic Tonelli Lagrangians on compact manifolds. Precisely, we discuss the stochastic Mane Critical Value, prove the existence of stochastic Weak KAM solutions of the related Hamilton-Jacobi equation. Furthermore, we survey the global minimizer.

preprint2020arXiv

The X-ray emission in young radio AGNs

In this work, we investigated the X-ray emission for a sample of young radio AGNs by combining their data from Chandra/XMM-Newton and at other wavebands. We find strong correlations between the X-ray luminosity $L_{\rm X}$ in 2$-$10 keV and the radio luminosities $L_{\rm R}$ at 5 GHz for VLBI radio-core, VLA radio-core and FIRST component, indicating that both pc- and kpc-scale radio emission strongly correlate with X-ray emission in these sources. We find approximately linear dependence of radio on X-ray luminosity in the sources with radiative efficient accretion flows (i.e., the Eddington ratio $R_{\rm edd} \gtrsim 10^{-3} $) with b $\sim$ 1 ($L_{\rm R}$ $\propto$ $L_{\rm X} ^{b}$) and $ξ_{\rm RX}$ $\sim$ 1 in fundamental plane using VLBI data, where the dependence is consistent with the re-analysed result on the previous study in \cite{2016ApJ...818..185F} at $R_{\rm edd} \gtrsim 10^{-3}$, however is significantly deviated from the theoretical prediction of accretion flow as the origin of X-ray emission. In contrast to radio-quiet quasars, there is no significant correlation between $Γ$ and Eddington ratio. Our results seem to indicate that the X-ray emission of high-accreting young radio AGNs may be from jet. We constructed the SEDs for 18 sources (most are in radiative efficient accretion) including 9 galaxies and 9 quasars with high-quality X-ray data, and find that the X-ray emission of most quasars is more luminous than that of normal radio-quiet quasars. This is clearly seen from the quasar composite SED, of which the X-ray emission is apparently higher than that of radio-quiet quasars, likely supporting the jet-related X-ray emission in young radio AGNs. The scenario that the X-ray emission is from self-synchrotron Compton (SSC) is discussed.

preprint2020arXiv

Topology Optimization through Differentiable Finite Element Solver

In this paper, a topology optimization framework utilizing automatic differentiation is presented as an efficient way for solving 2D density-based topology optimization problem by calculating gradients through the fully differentiable finite element solver. The optimization framework with the differentiable physics solver is proposed and tested on several classical topology optimization examples. The differentiable solver is implemented in Julia programming language and can be automatically differentiated in reverse mode to provide the pullback functions of every single operation. The entire end-to-end gradient information can be then backed up by utilizing chain rule. This framework incorporates a generator built from convolutional layers with a set of learnable parameters to propose new designs for every iteration. Since the whole process is differentiable, the parameters of the generator can be updated using any optimization algorithm given the gradient information from automatic differentiation. The proposed optimization framework is demonstrated on designing a half MBB beam and compared to the results with the ones from the efficient 88-line code. By only changing the objective function and the boundary conditions, it can run an optimization for designing a compliant mechanism, e.g. a force inverter where the output displacement is in the opposite direction of the input.

preprint2020arXiv

Variance-reduced Language Pretraining via a Mask Proposal Network

Self-supervised learning, a.k.a., pretraining, is important in natural language processing. Most of the pretraining methods first randomly mask some positions in a sentence and then train a model to recover the tokens at the masked positions. In such a way, the model can be trained without human labeling, and the massive data can be used with billion parameters. Therefore, the optimization efficiency becomes critical. In this paper, we tackle the problem from the view of gradient variance reduction. In particular, we first propose a principled gradient variance decomposition theorem, which shows that the variance of the stochastic gradient of the language pretraining can be naturally decomposed into two terms: the variance that arises from the sample of data in a batch, and the variance that arises from the sampling of the mask. The second term is the key difference between selfsupervised learning and supervised learning, which makes the pretraining slower. In order to reduce the variance of the second part, we leverage the importance sampling strategy, which aims at sampling the masks according to a proposal distribution instead of the uniform distribution. It can be shown that if the proposal distribution is proportional to the gradient norm, the variance of the sampling is reduced. To improve efficiency, we introduced a MAsk Proposal Network (MAPNet), which approximates the optimal mask proposal distribution and is trained end-to-end along with the model. According to the experimental result, our model converges much faster and achieves higher performance than the baseline BERT model.

preprint2019arXiv

A Unified Algorithmic Framework of Symmetric Gauss-Seidel Decomposition based Proximal ADMMs for Convex Composite Programming

This paper aims to present a fairly accessible generalization of several symmetric Gauss-Seidel decomposition based multi-block proximal alternating direction methods of multipliers (ADMMs) for convex composite optimization problems. The proposed method unifies and refines many constructive techniques that were separately developed for the computational efficiency of multi-block ADMM-type algorithms. Specifically, the majorized augmented Lagrangian functions, the indefinite proximal terms, the inexact symmetric Gauss-Seidel decomposition theorem, the tolerance criteria of approximately solving the subproblems, and the large dual step-lengths, are all incorporated in one algorithmic framework, which we named as sGS-imiPADMM. From the popularity of convergent variants of multi-block ADMMs in recent years, especially for high-dimensional multi-block convex composite conic programming problems, the unification presented in this paper, as well as the corresponding convergence results, may have the great potential of facilitating the implementation of many multi-block ADMMs in various problem settings.

preprint2019arXiv

Property Studies of "Loner" Flares of Gamma-Ray Blazars

We search through $γ$-ray data obtained with the Large Area Telescope (LAT) onboard the Fermi Gamma-Ray Space Telescope and find 24 blazars (or candidates) that have a single clear flare event in their 9.5 year long-term light curves. We define these events as loner flares since each flare stands out significantly above the relatively stable, low-flux light curve. We analyze the LAT data in detail for these 24 sources. The flares in ten of them are primarily due to a single sharp peak, for which we study by fitting with two different analytic functions. The time durations thus determined for the sharp peaks are in a range of 4-25 days. The $γ$-ray spectra of the 24 blazar sources can be described with a power-law or a log-parabola function. We obtain their spectral properties in the flaring and quiescent states, and find that in the flares 16 of the sources have harder emission and three have softer emission while the other five keep the same emission. We discuss a possible correlation between the differences in photon index in the quiescent and flaring states and photon indices in quiescence. In addition, the sharp peak flares seem to have a tendency of having long time durations and hard emission, possibly related to their physical origin in a blazar jet. Studies of more similar flares will help establish these possible features.

preprint2019arXiv

Robust Cubature Kalman Filter for Dynamic State Estimation of Synchronous Machines under Unknown Measurement Noise Statistics

Kalman-type filtering techniques including cubature Kalman filter (CKF) does not work well in non-Gaussian environments, especially in the presence of outliers. To solve this problem, Huber's M-estimation based robust CKF (RCKF) is proposed for synchronous machines by combining the Huber's M-estimation theory with the classical CKF, which is capable of coping with the deterioration in performance and discretization of tracking curves when measurement noise statistics deviatefrom the prior noise statistics. The proposed RCKF algorithm has good adaptability to unknown measurement noise statistics characteristics including non-Gaussian measurement noise and outliers. The simulation results on the WSCC 3-machine 9-bus system and New England 16-machine 68-bus system verify the effectiveness of the proposed method and its advantage over the classical CKF.

preprint2019arXiv

Steady-state Simulation of Semiconductor Devices using Discontinuous Galerkin Methods

Design of modern nanostructured semiconductor devices often calls for simulation tools capable of modeling arbitrarily-shaped multiscale geometries. In this work, to this end, a discontinuous Galerkin (DG) method-based framework is developed to simulate steady-state response of semiconductor devices. The proposed framework solves a system of Poisson equation (in electric potential) and drift-diffusion equations (in charge densities), which are nonlinearly coupled via the drift current and the charge distribution. This system is decoupled and linearized using the Gummel method and the resulting equations are discretized using a local DG scheme. The proposed framework is used to simulate geometrically intricate semiconductor devices with realistic models of mobility and recombination rate. Its accuracy is demonstrated by comparing the results to those obtained by the finite volume and finite element methods implemented in a commercial software package.

preprint2019arXiv

Unveiling delay-time-resolved phase noise dynamics of narrow-linewidth laser via coherent optical time domain reflectometry

Laser with high spectral purity plays a crucial role in high-precision optical metrology and coherent communication. Thanks to the rapid development of laser frequency stabilization, the laser phase noise can be remarkably compensated, allowing its ultra-narrow linewidth subject to mostly quantum limit. Nevertheless, the accurate characterization of phase noise dynamics and its intrinsic linewidth of a highly coherent laser remains ambiguous and challenging. Here, we present an approach capable of revealing delay-time-resolved phase noise dynamics of a coherent laser based on coherent optical time domain reflectometry (COTDR), in which distributed Rayleigh scattering along a delay fibre essentially allows a time-of-flight mapping of a heterodyne beating signal associated with delay-time-dependent phase information from a single laser source. Ultimately, this novel technique facilitates a precise measurement of ultra-narrow laser linewidth by exploiting its delay-time-resolved phase jitter statistics, confirmed with the analytical modelling and numerical simulations.

preprint2016arXiv

A Note on the Convergence of ADMM for Linearly Constrained Convex Optimization Problems

This note serves two purposes. Firstly, we construct a counterexample to show that the statement on the convergence of the alternating direction method of multipliers (ADMM) for solving linearly constrained convex optimization problems in a highly influential paper by Boyd et al. [Found. Trends Mach. Learn. 3(1) 1-122 (2011)] can be false if no prior condition on the existence of solutions to all the subproblems involved is assumed to hold. Secondly, we present fairly mild conditions to guarantee the existence of solutions to all the subproblems and provide a rigorous convergence analysis on the ADMM, under a more general and useful semi-proximal ADMM (sPADMM) setting considered by Fazel et al. [SIAM J. Matrix Anal. Appl. 34(3) 946-977 (2013)], with a computationally more attractive large step-length that can even exceed the practically much preferred golden ratio of $(1+\sqrt{5})/2$.

preprint2016arXiv

An Efficient Inexact Symmetric Gauss-Seidel Based Majorized ADMM for High-Dimensional Convex Composite Conic Programming

In this paper, we propose an inexact multi-block ADMM-type first-order method for solving a class of high-dimensional convex composite conic optimization problems to moderate accuracy. The design of this method combines an inexact 2-block majorized semi-proximal ADMM and the recent advances in the inexact symmetric Gauss-Seidel (sGS) technique for solving a multi-block convex composite quadratic programming whose objective contains a nonsmooth term involving only the first block-variable. One distinctive feature of our proposed method (the sGS-imsPADMM) is that it only needs one cycle of an inexact sGS method, instead of an unknown number of cycles, to solve each of the subproblems involved.With some simple and implementable error tolerance criteria, the cost for solving the subproblems can be greatly reduced, and many steps in the forward sweep of each sGS cycle can often be skipped, which further contributes to the efficiency of the proposed method. Global convergence as well as the iteration complexity in the non-ergodic sense is established.Preliminary numerical experiments on some high-dimensional linear and convex quadratic SDP problems with a large number of linear equality and inequality constraints are also provided. The results show that for the vast majority of the tested problems, the sGS-imsPADMM is 2 to 3 times faster than the directly extended multi-block ADMM with the aggressive step-length of 1.618, which is currently the benchmark among first-order methods for solving multi-block linear and quadratic SDP problems though its convergence is not guaranteed.

preprint2016arXiv

Boost Picking: A Universal Method on Converting Supervised Classification to Semi-supervised Classification

This paper proposes a universal method, Boost Picking, to train supervised classification models mainly by un-labeled data. Boost Picking only adopts two weak classifiers to estimate and correct the error. It is theoretically proved that Boost Picking could train a supervised model mainly by un-labeled data as effectively as the same model trained by 100% labeled data, only if recalls of the two weak classifiers are all greater than zero and the sum of precisions is greater than one. Based on Boost Picking, we present "Test along with Training (TawT)" to improve the generalization of supervised models. Both Boost Picking and TawT are successfully tested in varied little data sets.

preprint2016arXiv

Discovery of $γ$-ray emission from the radio-intermediate quasar III Zw 2: violent jet activity with intraday $γ$-ray variability

III Zw 2 is the prototype of radio-intermediate quasars. Although there is the evidence of possessing strong jet, significant $γ$-ray emission has not been reported before. In this work, we carry out a detailed analysis of the latest {\it Fermi}-LAT {\it Pass} 8 data. No significant $γ$-ray signal has been detected in the time-averaged 7-year {\it Fermi}-LAT data of III Zw 2, we however have identified two distinct $γ$-ray flares with isotropic luminosities of $\sim 10^{45}$ erg $\rm s^{-1}$. Multiwavelength data analysis (including also the optical photometric observations from Yunnan Observatories) are presented and the main finding is the simultaneous optical and $γ$-ray flares of III Zw 2 appearing in Nov. 2009. Violent $γ$-ray variability with doubling timescale of 2.5 hours was detected in another $γ$-ray flare in May 2010, for which the 3-hour $γ$-ray peak flux is $\sim 250$ times of the average flux in 7 years. Rather similar behaviors are observed in blazars and the blazar model can reasonably reproduce the spectral energy distribution of III Zw 2 in a wide energy range, strongly suggesting that its central engine resembles that of blazars. In view of its core which shares radio similarities with young radio sources together with the weak extended radio lobe emission, we suggest that III Zw 2 harbors a recurrent activity core and serves as a valuable target for investigating the fuelling and triggering of the activity in radio loud AGNs.

preprint2016arXiv

Feature-Area Optimization: A Novel SAR Image Registration Method

This letter proposes a synthetic aperture radar (SAR) image registration method named Feature-Area Optimization (FAO). First, the traditional area-based optimization model is reconstructed and decomposed into three key but uncertain factors: initialization, slice set and regularization. Next, structural features are extracted by scale invariant feature transform (SIFT) in dual-resolution space (SIFT-DRS), a novel SIFT-Like method dedicated to FAO. Then, the three key factors are determined based on these features. Finally, solving the factor-determined optimization model can get the registration result. A series of experiments demonstrate that the proposed method can register multi-temporal SAR images accurately and efficiently.

preprint2016arXiv

High Performance WSe2 Field-Effect Transistors via Controlled Formation of In-Plane Heterojunctions

Monolayer WSe2 is a two dimensional (2D) semiconductor with a direct bandgap, and it has been recently explored as a promising material for electronics and optoelectronics. Low field effect mobility is the main constraint preventing WSe2 from becoming one of the competing channel materials for field-effect transistors (FETs). Recent results have demonstrated that chemical treatments can modify the electrical properties of transition metal dichalcogenides (TMDCs) including MoS2 and WSe2. Here, we report that controlled heating in air significantly improves device performance of WSe2 FETs in terms of on-state currents and field-effect mobilities. Specifically, after heating at optimized conditions, chemical vapor deposition grown monolayer WSe2 FETs showed an average FET mobility of 31 cm2/Vs and on/off current ratios up to 5*108. For few-layer WSe2 FETs, after the same treatment applied, we achieved a high mobility up to 92 cm2/Vs. These values are significantly higher than FETs fabricated using as-grown WSe2 flakes without heating treatment, demonstrating the effectiveness of air heating on the performance improvements of WSe2 FETs. The underlying chemical processes involved during air heating and the formation of in-plane heterojunctions of WSe2 and WO3-x, which is believed to be the reason for the improved FET performance, were studied by spectroscopy and transmission electron microscopy. We further demonstrated that by combining air heating method developed in this work with supporting 2D materials on BN substrate, we achieved a noteworthy field effect mobility of 83 cm2/Vs for monolayer WSe2 FETs. This work is a step towards controlled modification of the properties of WSe2 and potentially other TMDCs, and may greatly improve device performance for future applications of 2D materials in electronics and optoelectronics.

preprint2016arXiv

Highly Efficient Ion Rejection by Graphene Oxide Membranes via Ion-controlling Interlayer Spacing

Because they may provide ultrathin, high-flux, and energy-efficient membranes for precise ionic and molecular sieving in aqueous solution, GO membranes (partially oxidized, stacked sheets of graphene) have shown great potential in water desalination and purification, gas and ion separation, biosensors, proton conductors, lithium-based batteries and super-capacitors. Unlike carbon nanotube (CNT) membranes, in which the nanotube pores have fixed sizes, the pores of GO membranes - the interlayer spacing between GO sheets - are of variable size. This presents a challenge for using GO membranes for filtration. Despite the great efforts to tune and fix the interlayer spacing, it remains difficult both to reduce the interlayer spacing sufficiently to exclude small ions while keeping this separation constant against the tendency of GO membranes to swell when immersed in aqueous solution, which greatly affects the applications of GO membranes. Here, we demonstrate experimentally that highly efficient and selective ion rejection by GO membranes can be readily achieved by controlling the interlayer spacing of GO membranes using cations (K+, Na+, Ca2+, Li+ and Mg2+) themselves. The interspacing can be controlled with precision as small as 1 A, and GO membranes controlled by one kind of cation can exclude other cations with a larger hydrated volume, which can only be accommodated with a larger interlayer spacing. First-principles calculations reveal that the strong noncovalent cation-pi interactions between hydrated cations in solution and aromatic ring structures in GO are the cause of this unexpected behavior. These findings open up new avenues for using GO membranes for water desalination and purification, lithium-based batteries and super-capacitors, molecular sieves for separating ions or molecules, and many other applications.

preprint2016arXiv

On the origin of the soft photons of the high synchrotron peaked blazar : PKS 1424+240

PKS 1424+240 is a distant very high energy gamma-ray BL Lac object with redshift $z=0.601$. It was found that pure synchrotron self-Compton (SSC) process normally need extreme input parameters (e.g., very low magnetic field intensity and extraordinarily large Doppler factor) to explain its multi-wavelength spectral energy distributions (SEDs). To avoid the extreme model parameters, different models have been proposed (e.g., two-zone SSC model or lepto-hadronic model). In this work, we employ the traditional one-zone leptonic model after including a weak external Compton component to re-explore the simultaneous multi-wavelength SEDs of PKS 1424+240 in both high (2009) and low (2013) states. We find that the input parameters of magnetic field and Doppler factor are roughly consistent with those of other BL Lacs if a weak external photon field from either broad line region (BLR) or the dust torus. However, the required energy density of seed photons from BLR or torus is about 3 orders of magnitude less than that constrained in luminous quasars (e.g., flat-spectrum radio quasars, FSRQs). This result suggests that the BLR/torus in BL Lacs is much weaker than that of luminous FSRQs (but not fully disappear), and the inverse-Compton of external photons from BLR/torus may still play a role even in high synchrotron peaked blazars.

preprint2015arXiv

Chemical Vapor Deposition Growth of Monolayer WSe2 with Tunable Device Characteristics and Growth Mechanism Study

Semiconducting transition metal dichalcogenides (TMDCs) have attracted a lot of attention recently, because of their interesting electronic, optical, and mechanical properties. Among large numbers of TMDCs, monolayer of tungsten diselenides (WSe2) is of particular interest since it possesses a direct band gap and tunable charge transport behaviors, which make it suitable for a variety of electronic and optoelectronic applications. Direct synthesis of large domains of monolayer WSe2 and their growth mechanism studies are important steps toward applications of WSe2. Here, we report systematical studies on ambient pressure chemical vapor deposition (CVD) growth of monolayer and few layer WSe2 flakes directly on silica substrates. The WSe2 flakes were characterized using optical microscopy, atomic force microscopy, Raman spectroscopy, and photoluminescence spectroscopy. We investigated how growth parameters, with emphases on growth temperatures and durations, affect the sizes, layer numbers, and shapes of as-grown WSe2 flakes. We also demonstrated that transport properties of CVD-grown monolayer WSe2, similar to mechanically-exfoliated samples, can be tuned into either p-type or ambipolar electrical behavior, depending on the types of metal contacts. These results deepen our understandings on the vapor phase growth mechanism of WSe2, and may benefit the uses of these CVD-grown monolayer materials in electronic and optoelectronics.

preprint2015arXiv

Discovery of gamma-ray emission from a strongly lobe-dominated quasar 3C 275.1

We systematically analyze the 6-year {\it Fermi}/LAT data of the lobe-dominated quasars (LDQs) in the complete LDQ sample from 3CRR survey and report the discovery of high-energy $γ$-ray emission from 3C 275.1. The $γ$-ray emission of 3C 207 is confirmed and significant variability of the lightcurve is identified. We do not find statistically significant $γ$-ray emission from other LDQs. 3C 275.1 is the known $γ$-ray quasar with the lowest core dominance parameter (i.e., $R=0.11$). We also show that both the northern radio hotspot and parsec jet models can reasonably reproduce the $γ$-ray data. The parsec jet model, however, is favored by the potential $γ$-ray variability at the timescale of months. We suggest that some dimmer $γ$-ray LDQs will be detected in the future and LDQs could contribute non-negligibly to the extragalactic $γ$-ray background.

preprint2015arXiv

Neutrino oscillation from the beam with Gaussian-like energy distribution

A recent neutrino experiment at Daya Bay gives superior data of the distribution of the prompt energy. In this paper, the energy distribution presented in the experiment is simulated by applying a Gaussian-like packet to the neutrino wave function received by the detector. We find that the wave packet of neutrinos is expanded during the propagation. As a result, the mixing angle $θ_{13}$ is more difficult to be measured than $θ_{12}$ and $θ_{23}$ in long baseline experiments. Some other propagation properties, such as the time evaluation of the survival probability, the neutrino oscillation and the $CP$ violation, are also studied with the employment of the coherent state method. When the Gaussian packet width increases, the amplitude of the neutrino oscillation decreases, whereas the oscillation period increases gradually.

preprint2015arXiv

Study of digital pulse shape discrimination method for n-γ separation of EJ-301 liquid scintillation detector

A digital pulse shape discrimination system based on a programmable module NI-5772 has been established and tested with EJ-301 liquid scintillation detector. The module was operated by means of running programs developed in LabVIEW with the sampling frequency up to 1.6GS/s. Standard gamma sources 22Na, 137Cs and 60Co were used to calibrate the EJ-301 liquid scintillation detector, and the gamma response function has been obtained. Digital algorithms for charge comparison method and zero-crossing method have been developed. The experimental results showed that both digital signal processing (DSP) algorithms could discriminate neutrons from gamma-rays. Moreover, the zero-crossing method shows better n-γ discrimination at 80 keVee and lower, whereas the charge comparison method gives better results at higher thresholds. In addition, the figure-of-merit (FOM) of two different dimension detectors were extracted at 9 energy thresholds, and it was found that the smaller one presented a better n-γ separation property for fission neutrons.

preprint2015arXiv

Testing General Relativity with Present and Future Astrophysical Observations

One century after its formulation, Einstein's general relativity has made remarkable predictions and turned out to be compatible with all experimental tests. Most of these tests probe the theory in the weak-field regime, and there are theoretical and experimental reasons to believe that general relativity should be modified when gravitational fields are strong and spacetime curvature is large. The best astrophysical laboratories to probe strong-field gravity are black holes and neutron stars, whether isolated or in binary systems. We review the motivations to consider extensions of general relativity. We present a (necessarily incomplete) catalog of modified theories of gravity for which strong-field predictions have been computed and contrasted to Einstein's theory, and we summarize our current understanding of the structure and dynamics of compact objects in these theories. We discuss current bounds on modified gravity from binary pulsar and cosmological observations, and we highlight the potential of future gravitational wave measurements to inform us on the behavior of gravity in the strong-field regime.

preprint2015arXiv

The Kondo Temperature of a Two-dimensional Electron Gas with Rashba Spin-orbit Coupling

We use the Hirsch-Fye quantum Monte Carlo method to study the single magnetic impurity problem in a two-dimensional electron gas with Rashba spin-orbit coupling. We calculate the spin susceptibility for various values of spin-orbit coupling, Hubbard interaction, and chemical potential. The Kondo temperatures for different parameters are estimated by fitting the universal curves of spin susceptibility. We find that the Kondo temperature is almost a linear function of Rashba spin-orbit energy when the chemical potential is close to the edge of the conduction band. When the chemical potential is far away from the band edge, the Kondo temperature is independent of the spin-orbit coupling. These results demonstrate that, for single impurity problem in this system, the most important reason to change the Kondo temperature is the divergence of density of states near the band edge, and the divergence is induced by the Rashba spin-orbit coupling.

preprint2014arXiv

Constraints on minimum electron Lorentz factor and matter content of jets for a sample of bright Fermi blazars

We fit the (quasi-)simultaneous multi-waveband spectral energy distributions (SEDs) for a sample of low-synchrotron-peaked (LSP) blazars with a one-zone leptonic model. The seed photons that predominantly come from broad line region (BLR) and infrared (IR) molecular torus are considered respectively in external Compton process. We find that the modeling with IR seed photons is systematically better than that with BLR photons based on a $χ^2$ test, which suggest that $γ$-ray emitting region most possibly stay outside the BLR. The minimum electron Lorentz factor, $γ_{\rm min}$, is constrained from the modeling for these LSP blazars with good soft X-ray data (ranges from 5 to 160 with a median value of 55), which plays a key role in jet power estimation. Assuming one-to-one ratio of proton and electron, we find that the jet power for LSP blazars is systematically higher than that of FR II radio galaxies at given 151 MHz radio luminosity, $L_{\rm 151MHz}$, even though FR IIs are regarded as same as LSP blazars in unification scheme except the jet viewing angle. The possible reason is that there are some $e^{\pm}$ pairs in the jet of these blazars. If this is the case, we find the number density of $e^{\pm}$ pairs should be several times higher than that of $e^{-}-p$ pairs by assuming the jet power is the same for LSP blazars and FR IIs at given $L_{\rm 151MHz}$.

preprint2014arXiv

Is BZB J1450+5201 the most distant $γ$-ray BL Lacertae object?

BL Lacertae (BL Lac) objects at high redshifts ($z\geq 2$) are rarely detected. Through careful analysis of the SDSS spectrum, BZB J1450+5201 is confirmed to be a high-$z$ BL Lac object with $z\geq$ 2.471 by identifying the Ly$α$ 1216 and CIV 1548/1550 absorption lines. This indicates that BZB J1450+5201 is the most distant BL Lac object discovered to date. Careful analysis of the five-year \fermi data of 2FGL J1451.0+5159 shows that its $γ$-ray emission is robust with confidence level of 6.2$σ$ at 1-3 GeV and 6.7$σ$ at 3-10 GeV, and that the confusion of bright neighbor is negligible, which can not be fixed in the analysis of the two-year data. Meanwhile, 2FGL J1451.0+5159 is confirmed to be associated with BZB J1450+5201 using the five-year data. The analysis of multiwavelength data, from radio to $γ$-ray energies, indicates BZB J1450+5201 is an intermediate synchrotron peaked (ISP) source and consistent with distributions of other ISP sources at lower redshifts in the second LAT AGN catalog. The pure SSC model seems to be disfavoured, while scattering of weak external emission plus SSC process can provide a satisfactory description of the broadband emission.

preprint2014arXiv

Long-range Order in One-dimensional Spinless Fermi Gas with Attractive Dipole-Dipole Interaction

One-dimensional spinless Fermi gas with attractive dipole-dipole interaction is investigated. Results obtained show when the interaction is weak, the excitation spectrum is linear and the superconducting correlation function decays as power law, indicating the validity of the Tomonaga-Luttinger (TL) liquid picture. However, when the interaction reaches a critical value, the excitation spectrum is nonlinear and the superconducting correlation function keeps finite for infinity separation, indicating real long-range order established and the breakdown of the TL liquid picture. We prove that the existence of long-range order is not in contradiction with the Hohenberg theorem and show that this system is related to the Kitaev toy model, therefore, it has potential applications for the future topological quantum computation.

preprint2014arXiv

Multi-wavelength variability properties of Fermi blazar S5 0716+714

S5 0716+714 is a typical BL Lacertae object. In this paper we present the analysis and results of long term simultaneous observations in the radio, near-infrared, optical, X-ray and $γ$-ray bands, together with our own photometric observations for this source. The light curves show that the variability amplitudes in $γ$-ray and optical bands are larger than those in the hard X-ray and radio bands and that the spectral energy distribution (SED) peaks move to shorter wavelengths when the source becomes brighter, which are similar to other blazars, i.e., more variable at wavelengths shorter than the SED peak frequencies. Analysis shows that the characteristic variability timescales in the 14.5 GHz, the optical, the X-ray, and the $γ$-ray bands are comparable to each other. The variations of the hard X-ray and 14.5 GHz emissions are correlated with zero-lag, so are the V band and $γ$-ray variations, which are consistent with the leptonic models. Coincidences of $γ$-ray and optical flares with a dramatic change of the optical polarization are detected. Hadronic models do not have the same nature explanation for these observations as the leptonic models. A strong optical flare correlating a $γ$-ray flare whose peak flux is lower than the average flux is detected. Leptonic model can explain this variability phenomenon through simultaneous SED modeling. Different leptonic models are distinguished by average SED modeling. The synchrotron plus synchrotron self-Compton (SSC) model is ruled out due to the extreme input parameters. Scattering of external seed photons, such as the hot dust or broad line region emission, and the SSC process are probably both needed to explain the $γ$-ray emission of S5 0716+714.

preprint2014arXiv

Screw-Dislocation-Driven Growth of Two-Dimensional Few-Layer and Pyramid-Like WSe2 by Sulfur-Assisted Chemical Vapor Deposition

Two-dimensional (2D) layered tungsten diselenides (WSe2) material has recently drawn a lot of attention due to its unique optoelectronic properties and ambipolar transport behavior. However, direct chemical vapor deposition (CVD) synthesis of 2D WSe2 is not as straightforward as other 2D materials due to the low reactivity between reactants in WSe2 synthesis. In addition, the growth mechanism of WSe2 in such CVD process remains unclear. Here we report the observation of a screw-dislocation-driven (SDD) spiral growth of 2D WSe2 flakes and pyramid-like structures using a sulfur-assisted CVD method. Few-layer and pyramid-like WSe2 flakes instead of monolayer were synthesized by introducing a small amount of sulfur as a reducer to help the selenization of WO3, which is the precursor of tungsten. Clear observations of steps, helical fringes, and herring-bone contours under atomic force microscope characterization reveal the existence of screw dislocations in the as-grown WSe2. The generation and propagation mechanisms of screw dislocations during the growth of WSe2 were discussed. Back-gated field-effect transistors were made on these 2D WSe2 materials, which show on/off current ratios of 106 and mobility up to 44 cm2/Vs.

preprint2014arXiv

Smart Streaming for Online Video Services

Bandwidth consumption is a significant concern for online video service providers. Practical video streaming systems usually use some form of HTTP streaming (progressive download) to let users download the video at a faster rate than the video bitrate. Since users may quit before viewing the complete video, however, much of the downloaded video will be "wasted". To the extent that users' departure behavior can be predicted, we develop smart streaming that can be used to improve user QoE with limited server bandwidth or save bandwidth cost with unlimited server bandwidth. Through measurement, we extract certain user behavior properties for implementing such smart streaming, and demonstrate its advantage using prototype implementation as well as simulations.

preprint2014arXiv

The curvature of spectral energy distribution of blazars

The SED of blazars show significant curvature. In this paper, we study the curvature properties for a large sample of Fermi/LAT bright blazars based on quasi-simultaneous SED. Both SEDs of synchrotron and inverse Compton (IC) components are fitted by a log-parabolic law in log_v-log_vfv diagram. The second-degree term of log-parabola measures the curvature of SED. We find a statistically significant correlation between synchrotron peak frequency and its curvature. This result is in agreement with the theoretical prediction, and confirms previous studies, which dealt with single source with various epoch observations or a small sample. If a broken power-law is employed to fit the SED, the difference between the two spectral indexes (alpha2-alpha1) can be considered as a "surrogate" of the SED curvature. We collect spectral parameters of a sample blazars from literature, and find a correlation between the synchrotron peak frequency and the spectral difference. We do not find a significant correlation between the IC peak frequency and its curvature, which may be caused by complicated seed photon field. It is also found that the synchrotron curvatures are on average larger than that of IC curvatures, and there is no correlation between these two parameters. As suggested by previous works in literature, both the log-parabolic law of SED and above correlation can be explained by statistical and/or stochastic particle accelerations. Stochastic particle acceleration predicts a different slope of the correlation from that of statistical one, and our result seems favor stochastic acceleration mechanisms and emission processes. Some of other evidences also seem to support that the electron energy distribution (and/or synchrotron SED) may be log-parabolic, which include SED modeling, particle acceleration simulation, and comparisons between some predictions and empirical relations/correlations.

preprint2014arXiv

The study of neutron spectra in water bath from Pb target irradiated by 250MeV/u protons

The spallation neutrons were produced by the irradiation of Pb with 250 MeV protons. The Pb target was surrounded by water which was used to slow down the emitted neutrons. The moderated neutrons in the water bath were measured by using the resonance detectors of Au, Mn and In with Cd cover. According to the measured activities of the foils, the neutron flux at different resonance energy were deduced and the epithermal neutron spectra were proposed. Corresponding results calculated with the Monte Carlo code MCNPX were compared with the experimental data to check the validity of the code.

preprint2014arXiv

Towards Cross-layer Reliability Analysis of Transient and Permanent Faults

Due to the increasing complexity of Multi-Processor Systems on Chip (MPSoCs), system-level design methodologies have got a lot of attention in recent years. However, the significant gap between the system-level reliability analysis and the level where the actual faults occur necessitates a cross-layer approach in which the sufficient data about the effects of faults at low levels are passed to the system level. So far, the cross-layer reliability analysis techniques focus on a specific type of faults, e.g., either permanent or transient faults. In this work, we aim at proposing a cross-layer reliability analysis which considers different fault types concurrently and connects reliability analysis techniques at different levels of abstraction using adapters.

preprint2014arXiv

Why are some BL Lacs detected by \fermi, but others not ?

By cross-correlating an archival sample of 170 BL Lacs with 2 year \fermilat AGN sample, we have compiled a sample of 100 BL Lacs with \fermi detection (FBLs), and a sample of 70 non-\fermi BL Lacs (NFBLs). We compared various parameters of FBLs with those of NFBLs, including the redshift, the low frequency radio luminosity at 408 MHz ($L_{\rm 408MHz}$), the absolute magnitude of host galaxies ($M_{\rm host}$), the polarization fraction from NVSS survey ($P_{\rm NVSS}$), the observed arcsecond scale radio core flux at 5 GHz ($F_{\rm core}$) and jet Doppler factor; all the parameters are directly \textbf{measured} or derived from available data from literatures. We found that the Doppler factor is on average larger in FBLs than in NFBLs, and the $Fermi~ γ$-ray detection rate is higher in sources with higher Doppler factor. In contrast, there are no significant differences in terms of the intrinsic parameters of redshift, $ L_{\rm 408MHz}$, $ M_{\rm host}$ and $ P_{\rm NVSS}$. FBLs seem to have a higher probability of exhibiting measurable proper motion. These results strongly indicate a higher beaming effect in FBLs compared to NFBLs. The radio core flux is found to be strongly correlated with $γ$-ray flux, which remains after excluding the common dependence of the Doppler factor. At the fixed Doppler factor, FBLs have systematically larger radio core flux than NFBLs, implying lower $γ$-ray emission in NFBLs since the radio and $γ$-ray flux are significantly correlated. Our results indicate that the Doppler factor is an important parameter of $γ$-ray detection, the non-detection of $γ$-ray emission in NFBLs is likely due to low beaming effect, and/or low intrinsic $γ$-ray flux, and the gamma-rays are likely produced co-spatially with the arcsecond-scale radio core radiation and mainly through the SSC process.

preprint2013arXiv

A complicated Duffing oscillator in the surface-electrode ion trap

The oscillation coupling and different nonlinear effects are observed in a single trapped $^{40}Ca^{+}$ ion confined in our home-built surface-electrode trap (SET). The coupling and the nonlinearity are originated from the high-order multipole potentials due to different layouts and the fabrication asymmetry of the SET. We solve a complicated Duffing equation with coupled oscillation terms by the multiple scale method, which fits the experimental values very well. Our investigation in the SET helps for exploring nonlinearity using currently available techniques and for suppressing instability of qubits in quantum information processing with trapped ions.

preprint2013arXiv

Fake View Analytics in Online Video Services

Online video-on-demand(VoD) services invariably maintain a view count for each video they serve, and it has become an important currency for various stakeholders, from viewers, to content owners, advertizers, and the online service providers themselves. There is often significant financial incentive to use a robot (or a botnet) to artificially create fake views. How can we detect the fake views? Can we detect them (and stop them) using online algorithms as they occur? What is the extent of fake views with current VoD service providers? These are the questions we study in the paper. We develop some algorithms and show that they are quite effective for this problem.

preprint2013arXiv

Fast optical cooling of nanomechanical cantilever with the dynamical Zeeman effect

We propose an efficient optical electromagnetically induced transparency (EIT) cooling scheme for a cantilever with a nitrogen-vacancy center attached in a non-uniform magnetic field using dynamical Zeeman effect. In our scheme, the Zeeman effect combined with the quantum interference effect enhances the desired cooling transition and suppresses the undesired heating transitions. As a result, the cantilever can be cooled down to nearly the vibrational ground state under realistic experimental conditions within a short time. This efficient optical EIT cooling scheme can be reduced to the typical EIT cooling scheme under special conditions.

preprint2013arXiv

Moment-generating function method used to accurately evaluate the impact of the linearized optical noise amplified by EDFAs

In a nonlinear optical fiber communication (OFC) system with signal power much stronger than noise power, the noise field in the fiber can be described by linearized noise equation (LNE). In this case, the noise impact on the system performance can be evaluated by moment-generating function (MGF) method. Many published MGF calculations were based on the LNE using continuous wave (CW) approximation, where the modulated signal needs to be artificially simplified as an unmodulated signal. Results thus obtained should be treated carefully. More reliable results can be obtained by improving the CW-based LNE with the accurate LNE proposed by Holzlohner et al in Ref. [1]. In this work we show that, for the case of linearized noise amplified by EDFAs, its MGF can be calculated by obtaining the noise propagation information directly from the accurate LNE. Our results agree well with the experimental data of multi-span DPSK systems.

preprint2013arXiv

Spin-spin interaction in the bulk of topological insulators

We apply mean-field theory and Hirsch-Fye quantum Monte Carlo method to study the spin-spin interaction in the bulk of three-dimensional topological insulators. We find that the spin-spin interaction has three different components: the longitudinal, the transverse and the transverse Dzyaloshinskii-Moriya-like terms. When the Fermi energy is located in the bulk gap of topological insulators, the spin-spin interaction decays exponentially due to Bloembergen-Rowland interaction. The longitudinal correlation is antiferromagnetic and the transverse correlations are ferromagnetic. When the chemical potential is in the conduction or valence band, the spin-spin interaction follows power law decay, and isotropic ferromagnetic interaction dominates in short separation limit.

preprint2012arXiv

A Novel Low Power UWB Cascode SiGe BiCMOS LNA with Current Reuse and Zero-Pole Cancellation

A low power cascode SiGe BiCMOS low noise amplifier (LNA) with current reuse and zero-pole cancellation is presented for ultra-wideband (UWB) application. The LNA is composed of cascode input stage and common emitter (CE) output stage with dual loop feedbacks. The novel cascode-CE current reuse topology replaces the traditional two stages topology so as to obtain low power consumption. The emitter degenerative inductor in input stage is adopted to achieve good input impedance matching and noise performance. The two poles are introduced by the emitter inductor, which will degrade the gain performance, are cancelled by the dual loop feedbacks of the resistance-inductor (RL) shunt-shunt feedback and resistance-capacitor (RC) series-series feedback in the output stage. Meanwhile, output impedance matching is also achieved. Based on TSMC 0.35μm SiGe BiCMOS process, the topology and chip layout of the proposed LNA are designed and post-simulated. The LNA achieves the noise figure of 2.3~4.1dB, gain of 18.9~20.2dB, gain flatness of \pm0.65dB, input third order intercept point (IIP3) of -7dBm at 6GHz, exhibits less than 16ps of group delay variation, good input and output impedances matching, and unconditionally stable over the whole band. The power consuming is only 18mW.

preprint2012arXiv

Characterizing Short Necklace States in Logarithmic Transmission Spectrum of Strongly Localized Systems

High transmission plateaus exist widely in the logarithmic transmission spectra of localized systems. Their physical origins are short chains of coupled-localized-states embedded inside the localized system, which are dubbed as "short necklace states". In this work, we define the essential quantities and then, based on these quantities, we investigate the short necklace states' properties statistically and quantitatively. Two different approaches are utilized and the results from them agree with each other very well. In the first approach, the typical plateau-width and the typical order of short necklace states are obtained from the correlation function of logarithmic transmission. In the second approach, we investigate statistical distributions of the peak/plateau-width measured in logarithmic transmission spectra. A novel distribution is found, which can be exactly fitted by the summation of two Gaussian distributions. These two distributions are the results of sharp peaks of localized states and the high plateaus of short necklace states. The center of the second distribution also tells us the typical plateau-width of short necklace states. With increasing the system length, the scaling property of typical plateau-width is very special since it almost does not decrease. The methods and the quantities defined in this work can be widely used on Anderson localization studies.

preprint2012arXiv

Short Necklace States, Logarithm Transmission Fluctuation and Localization Length

We investigate the widely-existing short necklace states in random systems. It is found that their peak width and relative height in lnT spectra keep almost constant when the system length increases, which is explained by the coupled-resonator theory with intrinsic parameters. This property makes them special in contribution of lnT fluctuation. Further, short necklace states can help us to deeply understand the physical meaning of localization length and the delocalized effectin localized regime.

preprint2012arXiv

Surface Spectral Function of Momentum-dependent Pairing Potentials in a Topological Insulator: Application to Cu$_x$Bi$_2$Se$_3$

We propose three possible momentum-dependent pairing potentials for candidate of topological superconductor (for example Cu$_x$Bi$_2$Se$_3$), and calculate the surface spectral function and surface density of state with these pairing potentials. We find that the first two can give the same spectral functions as the fully-gapped and node-contacted pairing potentials given in [Phys. Rev. Lett. 105, 097001], and that the third one can obtain topological non-trivial case which exists flat Andreev bound state and preserves the $C_3$ rotation symmetry. We hope our proposals and results be judged by future experiment.

preprint2012arXiv

The central engines of two unusual radio-intermediate/quiet active galactic nuclei: III Zw 2 and PG 1407+265

We use the accretion disk/corona+jet model to fit the multi-band spectral energy distributions (SEDs) of two unusual radio-intermediate/quiet quasars. It is found that the optical/UV emission of III Zw 2 is probably dominated by the emission from the accretion disk. The X-ray emission should be dominated by the radiation from the jet, while the contribution of the disk corona is negligible. The optical/UV component in the SED of PG 1407+265 can be well modeled as the emission from the accretion disk, while the IR component is attributed to the thermal radiation from the dust torus with an opening angle ~ 50\circ. If the X-ray continuum emission is dominated by the synchrotron emission of the jet, the source should be a "high peak frequency blazar", which obviously deviates the normal blazar sequence. The observed SED can also be fitted quite well by the accretion disk/corona model with the viscosity parameter ? = 0:5. The spectrum of the accretion disk/corona in PG 1407+265 satisfies the weak line quasar criterion suggested in Laor & Davis.

preprint2011arXiv

A Uniform Correlation between Synchrotron Luminosity and Doppler Factor in Gamma-ray Bursts and Blazars: hint of similar intrinsic luminosities?

We compile 23 Gamma-ray Bursts (GRBs) and 21 blazars with estimated Doppler factors, and the Doppler factors of GRBs are estimated from their Lorentz factors by assuming their jet viewing angles θ->0 degree. Using the conventional assumption that the prompt emission of GRBs is dominated by the synchrotron radiation, we calculate the synchrotron luminosity of GRBs from their total isotropic energy and burst duration. Intriguingly, we discover a uniform correlation between the synchrotron luminosity and Doppler factor, L_syn \propto D^3.1, for GRBs and blazars, which suggests that they may share some similar jet physics. One possible reason is that GRBs and blazars have, more or less, similar intrinsic synchrotron luminosities and both of them are strongly enhanced by the beaming effect. After Doppler and redshift-correction, we find that the intrinsic peak energy of the GRBs ranges from 0.1 to 3 keV with a typical value of 1 keV. We further correct the beaming effect for the observed luminosity of GRBs and find that there exists a positive correlation between the intrinsic synchrotron luminosity and peak energy for GRBs, which is similar to that of blazars. Our results suggest that both the intrinsic positive correlation and the beaming effect may be responsible for the observed tight correlation between the isotropic energy and the peak energy in GRBs (so called "Amati" relation).

preprint2011arXiv

Casimir interaction between topological insulators with finite surface band gap

Casimir interaction between topological insulators with opposite topological magnetoelectric polarizabilities and finite surface band gaps has been investigated. For large surface band gap limit$(m\rightarrow\infty)$, we can obtain results given in [Phys. Rev. Lett. \textbf{106}, 020403 (2011)]. For small surface band gap limit$(m\rightarrow0)$, Casimir interaction between topological insulators is attractive and analogy to ideal mental in short separation limit. Generally, there is a critical value $m_c$ and when the surface band gap is greater than the critical value, the Casimir force is repulsive in an intermediate separation region. We estimate the critical surface band gap $m_c \sim 1/(2a)$, where $a$ is a critical separation where Casimir force vanishes.

preprint2011arXiv

Critical surface band gap of repulsive Casimir interaction between three dimensional topological insulators at finite temperature

We generalize the calculation of Casimir interaction between topological insulators with opposite topological magnetoelectric polarizabilities and finite surface band gaps to finite Temperature cases. We find that finite temperature quantitatively depress the repulsive peak and enlarge the critical surface gap $m_c$ for repulsive Casimir force. However the universal property $m_c a \sim 1/2$ is still valid for various oscillation strength, temperature region and topological magnetoelectric polarizabilities.

preprint2011arXiv

Evolutions of helical edge states in disordered HgTe/CdTe quantum wells

We study the evolutions of the nonmagnetic disorder-induced edge states with the disorder strength in the HgTe/CdTe quantum wells. From the supercell band structures and wave-functions, it is clearly shown that the conducting helical edge states, which are responsible for the reported quantized conductance plateau, appear above a critical disorder strength after a gap-closing phase transition. These edge states are then found to decline with the increase of disorder strength in a stepwise pattern due to the finite-width effect, where the opposite edges couple with each other through the localized states in the bulk. This is in sharp contrast with the localization of the edge states themselves if magnetic disorders are doped which breaks the time-reversal symmetry. The size-independent boundary of the topological phase is obtained by scaling analysis, and an Anderson transition to an Anderson insulator at even stronger disorder is identified, in-between of which, a metallic phase is found to separate the two topologically distinct phases.

preprint2011arXiv

Implications on the blazar sequence and inverse Compton models from Fermi bright blazars

In this paper, we use the quasi-simultaneous spectra of Fermi bright blazars and Fermi detected narrow line Seyfert 1 (NLS1) to study the blazar sequence and inverse Compton (IC) models. I. The synchrotron peak luminosities (L_{s}) significantly inverse correlate with the synchrotron peak frequencies (ν_{s}), L_{s}\proptoν_{s}^{-0.44}, which is consistent with the blazar sequence. In addition to the correlation, there are some blazars showing low ν_{s} and low L_{s}. To study the relation between these low ν_{s} low L_{s} blazars and the blazar sequence, we present correlations of the parameter L_{s}ν_{s}^{1/4} with the ratio of Compton to synchrotron peak frequencies (r_{Cs}\equivν_{C}/ν_{s}) and with the ratio of Compton to synchrotron luminosities (CD\equiv L_{C}/L_{s}). The results indicate that both correlations are significant with a Pearson's probability for null correlation of p=0.0218 and p=0.0286 respectively. This does not support the idea that the low ν_{s} low L_{s} blazars are sources with less beaming. Another possibility, as suggested by Ghisellini & Tavecchio, is that these blazars have relative lower black hole masses. To test this, we collect the black hole masses of 30 blazars from archives, and find that the hole mass correlates with the parameter L_{s}ν_{s}^{0.44} (p=0.0344). Therefore, the black hole masses of low ν_{s} low L_{s} blazars are statistically small. The NLS1s are thought to have lower black hole masses. We find that the four NLS1s detected by Fermi have low ν_{s} and low L_{s}. This supports the above result. II. The ratio r_{Cs} correlates with CD significantly (p=0.00375). The external Compton (EC) model can naturally explain this correlation, while synchrotron self Compton (SSC) model can not. This agrees with the findings of many authors that the EC process dominates the gamma-ray emission of Flat Spectrum Radio Quasars.

preprint2010arXiv

A fourth-order Runge-Kutta in the interaction picture method for coupled nonlinear Schrodinger equation

A fourth-order Runge-Kutta in the interaction picture (RK4IP) method is presented for solving the coupled nonlinear Schrodinger equation (CNLSE) that governs the light propagation in optical fibers with randomly varying birefringence. The computational error of RK4IP is caused by the fourth-order Runge-Kutta algorithm, better than the split-step approximation limited by the step size. As a result, the step size of RK4IP can have the same order of magnitude as the dispersion length and/or the nonlinear length of the fiber, provided the birefringence effect is small. For communication fibers with random birefringence, the step size of RK4IP can be orders of magnitude larger than the correlation length and the beating length of the fibers, depending on the interaction between linear and nonlinear effects. Our approach can be applied to the fibers having the general form of local birefringence and treat the Kerr nonlinearity without approximation. For the systems with realistic parameters, the RK4IP results are consistent with those using Manakov-PMD approximation. However, increased interaction between the linear and nonlinear terms in CNLSE leads to increased discrepancy between RK4IP and Manakov-PMD approximation.

preprint2010arXiv

A puzzle of radio-quiet quasar PG 1407+265: are optical and X-ray emissions produced from relativistic jet?

PG 1407+265 is a radio quiet quasar but has a relativistic jet. In this report, we show some peculiar properties of its optical and X-ray emissions, which indicate their possible non-thermal origins produced from jet. We use a simple synchrotron + synchrotron self Compton (SSC) model to fit the emissions with different ratios of energy densities between magnetic field and electrons (η=UB/Ue), which predict different γ-ray luminosity. The First LAT AGN Catalog (1LAC) did not include PG 1407+265, which indicates an upper limit of γ-ray luminosity. This upper limit constrains the ratio unreasonable large (η>10^4-10^5). This inversely indicates that the optical and X-ray emissions may be not produced from the beaming jet. We discuss the physical implications of these results.

preprint2010arXiv

Distributed and Optimal Reduced Primal-Dual Algorithm for Uplink OFDM Resource Allocation

Orthogonal Frequency Division Multiplexing (OFDM) is the key component of many emerging broadband wireless access standards. The resource allocation in OFDM uplink, however, is challenging due to heterogeneity of users' Quality of Service requirements, channel conditions, and individual resource constraints. We formulate the resource allocation problem as a non-strictly convex optimization problem, which typically has multiple global optimal solutions. We then propose a reduced primal-dual algorithm, which is distributed, low in computational complexity, and probably globally convergent to a global optimal solution. The performance of the algorithm is studied through a realistic OFDM simulator. Compared with the previously proposed centralized optimal algorithm, our algorithm not only significantly reduces the message overhead but also requires less iterations to converge.

preprint2010arXiv

Minimal Conductivity of Topological Surface States with Magnetic Impurities

In this paper we use the semiclassical Boltzmann equation to investigate the transport properties of Dirac fermion on the surface of topological insulator with magnetic impurities. The results obtained show that there is also a minimal conductivity in this system as in graphene. We also argue the low temperature transport property, and find that there is no low temperature anomaly known as Kondo effect when the temperature is T>10^{-6}K.

preprint2010arXiv

Possible gamma-ray emission of radio intermediate AGN III Zw 2 and its implication on the evolution of jets in AGNs

AGNs with hard $γ$-ray emission identified so far are radio-loud. III Zw 2 is a radio intermediate AGN with a relativistic jet. We study its spectral energy distribution (SED) and find that the broad band emissions are dominated by the non-thermal emissions from the jet. We model its SED through a synchrotron + inverse Compton (IC) model. The results show that the IC component of III Zw 2 peaks at a few MeV, and the flux density drops rapidly at higher energy with photon index $Γ\approx3.3$ above 0.1 GeV. The predicted flux is slightly over the sensitivity of $Fermi$/LAT, but it is not included in the first $Fermi$/LAT AGN catalog. The reason for this may be: 1) that the IC peak is low and the spectrum is very steep above 0.1 GeV, 2) that III Zw 2 is in the low state during the period of the $Fermi$/LAT operation. We also find that III Zw 2 follows similar jet processes as those in $γ$-ray AGNs, e.g., the relation between jet power and radiation power, called the blazar sequence. We suggest that III Zw 2 may be a young source at an earlier stage of jet activity.

preprint2010arXiv

Supply-demand diagrams and a new framework for analyzing the inhomogeneous Lighthill-Whitham-Richards model

Traditionally, the Lighthill-Whitham-Richards (LWR) models for homogeneous and inhomogeneous roads have been analyzed in flux-density space with the fundamental diagram of the flux-density relation. In this paper, we present a new framework for analyzing the LWR model, especially the Riemann problem at a linear boundary in which the upstream and downstream links are homogeneous and initially carry uniform traffic. We first review the definitions of local supply and demand functions and then introduce the so-called supply-demand diagram, on which a traffic state can be represented by its supply and demand, rather than as density and flux as on a fundamental diagram. It is well-known that the solutions to the Riemann problem at each link are self-similar with a stationary state, and that the wave on the link is determined by the stationary state and the initial state. In our new framework, there can also exist an interior state next to the linear boundary on each link, which takes infinitesimal space, and admissible conditions for the upstream and downstream stationary and interior states can be derived in supply-demand space. With an entropy condition consistent with a local supply-demand method in interior states, we show that the stationary states exist and are unique within the solution framework. We also develop a graphical scheme for solving the Riemann problem, and the results are shown to be consistent with those in the literature. We further discuss asymptotic stationary states on an inhomogeneous ring road with arbitrary initial conditions and demonstrate the existence of interior states with a numerical example. The framework developed in this study is simpler than existing ones and can be extended for analyzing the traffic dynamics in general road networks.

preprint2009arXiv

X-ray Radiation Mechanisms and Beaming Effect of Hot Spots and Knots in Active Galactic Nuclear Jets

The observed radio-optical-X-ray spectral energy distributions (SEDs) of 22 hot spots and 45 knots in the jets of 35 active galactic nuclei are complied from literature and modeled with single-zone lepton models. It is found that the observed luminosities at 5 GHz (L_5GHz) and at 1 keV (L_1keV) are tightly correlated, and the two kinds of sources can be roughly separated with a division of L_1keV=L_5GHz. Our SED fits show that the mechanisms of the X-rays are diverse. Considering the sources at rest, the synchrotron-self-Compton (SSC) scattering would dominate the IC process. This model can interpret the X-rays of some hot spots with a magnetic field strength (B_ssc^delta=1) being consistent with the equipartition magnetic field (B_eq^delta=1) in one order of magnitude, but an unreasonably low B_ssc^delta=1 is required to model the X-rays for all knots. The ratio R_B=B_eq^delta=1/B_ssc^delta=1 is greater than 1 and it is tightly anti-correlated with ratio R_L= L_1keV/L_5GHz for both the knots and the hot spots. We propose that the deviation may be due to the neglect of the relativistic bulk motion for these sources. Considering this effect, we show that the IC/CMB model well explains the X-ray emission for most sources. Both B_eq' and delta are tentatively correlated with R_L. Corrected by the beaming effect, the L'_5GHz-L'_1keV relations for the two kinds of sources are even tighter than the observed ones. These facts suggest that, under the equipartition condition, the observational differences of the X-rays from the knots and hot spots may be mainly due the differences on the Doppler boosting effect and the co-moving magnetic field of the two kinds of sources. Our IC scattering models predict a prominent GeV-TeV component in the SEDs for some sources, which are detectable with H.E.S.S. and Fermi/LAT.

preprint1995arXiv

Magnetic and pair correlations of the Hubbard model with next-nearest-neighbor hopping

A combination of analytical approaches and quantum Monte Carlo simulations is used to study both magnetic and pairing correlations for a version of the Hubbard model that includes second-neighbor hopping $t^{\prime }=-0.35t$ as a model for high-temperature superconductors. Magnetic properties are analyzed using the Two-Particle Self-Consistent approach. The maximum in magnetic susceptibility as a function of doping appears both at finite $% t^{\prime }$ and at $t^{\prime }=0$ but for two totally different physical reasons. When $t^{\prime }=0$, it is induced by antiferromagnetic correlations while at $t^{\prime }=-0.35t$ it is a band structure effect amplified by interactions. Finally, pairing fluctuations are compared with $% T $-matrix results to disentangle the effects of van Hove singularity and of nesting on superconducting correlations. The addition of antiferromagnetic fluctuations increases slightly the $d$-wave superconducting correlations despite the presence of a van Hove singularity which tends to decrease them in the repulsive model. Some aspects of the phase diagram and some subtleties of finite-size scaling in Monte Carlo simulations, such as inverted finite-size dependence, are also discussed.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint

Fields this researcher appears in

Source provenance