Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
112works
0followers
41topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

112 published item(s)

preprint2026arXiv

A Boundary-Aware Non-parametric Granular-Ball Classifier Based on Minimum Description Length

Existing granular-ball classification methods are often driven by handcrafted quality measures, neighborhood rules, or heuristic splitting and stopping criteria, which may reduce the transparency of local construction decisions and hinder explicit modeling of boundary-sensitive regions. To address this issue, this paper proposes a Minimum Description Length based Granular-Ball Classifier (MDL-GBC), a boundary-aware non-parametric and interpretable granular-ball classifier. MDL-GBC formulates class-conditional granular-ball construction as a local model selection problem under the Minimum Description Length principle. For each class, samples from the target class provide positive class evidence, while samples from the remaining classes provide negative boundary evidence. For each current granular ball, three candidate explanations are compared under a unified description-length criterion: a single-ball model, a two-ball model, and a core-boundary model. The selected model determines whether the ball is retained, geometrically split, or refined into core and boundary-sensitive child balls, thereby making local construction decisions consistent with the MDL-based classification mechanism. During prediction, a class-level mixture coding rule aggregates stable granular balls of the same class and assigns the test sample by comparing class-wise coding costs. Experiments on 18 benchmark datasets show that MDL-GBC achieves competitive classification performance against classical classifiers and representative granular-ball-based methods, obtaining the best average Accuracy, Macro-F1, and average rank. These results indicate that MDL-GBC provides an effective and interpretable alternative to conventional heuristic granular-ball classification strategies.

preprint2026arXiv

A Breast Vision Pathology Foundation Model for Real-world Clinical Utility

Pathology foundation models have shown strong retrospective performance, but whether such systems can support clinically relevant use remains unclear. This challenge is particularly important in breast cancer, where pathological assessment serves as the gold standard for diagnosis and guides treatment planning, surgical decision-making and risk stratification across pre-, intra- and post-operative stages. Here we present \textbf{BRAVE}, a breast-adaptive pathology foundation model developed and evaluated using a total resource of 101,638 breast whole-slide images from 32 sources across Asia, Europe and North America. We assessed BRAVE across 34 tasks in 82 cohorts spanning pre-operative biopsy, intra-operative frozen section and post-operative resection, using an evidence chain comprising retrospective benchmarking, clinically challenging scenarios, workflow-oriented clinical impact simulations, prospective observational validation with the thresholds locked in the retrospective cohorts and crossover pathologist-AI interaction studies. Across these settings, BRAVE supported practical roles in the clinical workflow, including safe exclusion of low-risk cases from routine review, AI-assisted second-review rescue of initially missed positives and prioritization of cases for further assessment. In prospective validation across three centres, BRAVE excluded 76.9% of negative biopsy cases (NPV 0.953) and 70.1% of negative frozen-section cases (NPV 0.973), and triaged 78.8% of post-operative subtyping cases as high-confidence clear-cut cases (NPV 1.000). In reader studies, AI assistance improved balanced accuracy from 88.5% to 95.1% (OR 3.14, P<0.001), with better efficiency, confidence and inter-rater agreement. BRAVE-derived scores also independently predicted disease-free survival (adjusted HR 4.79, P<0.001) and overall survival (adjusted HR 8.14, P<0.001).

preprint2026arXiv

An Enigmatic PeVatron in an Area around HII Region G35.6$-$0.5

Identifying Galactic PeVatrons (PeV particle accelerators) from the ultra-high-energy (UHE, >100 TeV) $γ$-ray sources plays a crucial role in revealing the origin of Galactic cosmic rays. The UHE source 1LHAASO J1857+0203u is suggested to be associated with HESS J1858+020, which may be attributed to the possible PeVatron candidate supernova remnant (SNR) G35.6$-$0.4 or HII region G35.6$-$0.5. We perform detailed analysis on the very-high-energy and UHE $γ$-ray emissions towards this region with data from the Large High Altitude Air Shower Observatory (LHAASO). 1LHAASO J1857+0203u is detected with a significance of 11.6$σ$ above 100 TeV, indicating the presence of a PeVatron. It has an extension of $\sim 0.18^\circ$ with a power-law (PL) spectral index of $\sim$2.5 in 1-25 TeV and a point-like emission with a PL spectral index of $\sim$3.2 above 25 TeV. Using the archival CO and HI data, we identify some molecular and atomic clouds that may be associated with the TeV $γ$-ray emissions. Our modelling indicates that the TeV $γ$-ray emissions are unlikely to arise from the clouds illuminated by the protons that escaped from SNR G35.6$-$0.4. In the scenario that HII region G35.6$-$0.5 could accelerate particles to the UHE band, the observed GeV-TeV $γ$-ray emission could be well explained by a hadronic model with a PL spectral index of $\sim$2.0 and cutoff energy of $\sim$450 TeV. However, an evolved pulsar wind nebula origin cannot be ruled out.

preprint2026arXiv

An Ultrahigh-energy $γ$-ray Bubble Powered by a Super PeVatron

We report the detection of a $γ$-ray bubble spanning at least 100$\rm deg^2$ in ultra high energy (UHE) up to a few PeV in the direction of the star-forming region Cygnus X, implying the presence Super PeVatron(s) accelerating protons to at least 10 PeV. A log-parabola form with the photon index $Γ(E) = (2.71 \pm 0.02) + (0.11 \pm 0.02) \times \log_{10} (E/10 \ {\rm TeV})$ is found fitting the gamma-ray energy spectrum of the bubble well. UHE sources, `hot spots&#39; correlated with very massive molecular clouds, and a quasi-spherical amorphous $γ$-ray emitter with a sharp central brightening are observed in the bubble. In the core of $\sim 0.5^{\circ}$, spatially associating with a region containing massive OB association (Cygnus OB2) and a microquasar (Cygnus X-3), as well as previously reported multi-TeV sources, an enhanced concentration of UHE $γ$-rays are observed with 2 photons at energies above 1 PeV. The general feature of the bubble, the morphology and the energy spectrum, are reasonably reproduced by the assumption of a particle accelerator in the core, continuously injecting protons into the ambient medium.

preprint2026arXiv

Constraining the Cosmic-ray Energy Based on Observations of Nearby Galaxy Clusters by LHAASO

Galaxy clusters act as reservoirs of high-energy cosmic rays (CRs). As CRs propagate through the intracluster medium, they generate diffuse $γ$-rays detectable by arrays such as LHAASO. These $γ$-rays result from proton-proton ($pp$) collisions of very high-energy cosmic rays (VHECRs) or inverse Compton (IC) scattering of positron-electron pairs created by $pγ$ interactions of ultra-high-energy cosmic rays (UHECRs). We analyzed diffuse $γ$-ray emission from the Coma, Perseus, and Virgo clusters using LHAASO data. Diffuse emission was modeled as a disk of radius $R_{500}$ for each cluster while accounting for point sources. No significant diffuse emission was detected, yielding 95\% confidence level (C.L.) upper limits on the $γ$-ray flux: for WCDA (1-25~TeV) and KM2A ($>25$~TeV), less than $(49.4, 13.7, 54.0)$ and $(1.34, 1.14, 0.40) \times 10^{-14}$~ph~cm$^{-2}$~s$^{-1}$ for Coma, Perseus, and Virgo, respectively. The $γ$-ray upper limits can be used to derive model-independent constraints on the integral energy of CRp above 10~TeV (corresponding to the LHAASO observational range $>1$~TeV under the $pp$ scenario) to be less than $(1.96, 0.59, 0.08) \times 10^{61}$~erg. The absence of detectable annuli/ring-like structures, indicative of cluster accretion or merging shocks, imposes further constraints on models in which the UHECRs are accelerated in the merging shocks of galaxy clusters.

preprint2026arXiv

Constraints on heavy decaying dark matter from 570 days of LHAASO observations

The Kilometer Square Array~(KM2A) of the Large High Altitude Air Shower Observatory (LHAASO) aims at surveying the northern gamma-ray sky at energies above 10 TeV with unprecedented sensitivity. Gamma-ray observations have long been one of the most powerful tools for dark matter searches, as e.g., high-energy gamma-rays could be produced by the decays of heavy dark matter particles. In this letter, we present the first dark matter analysis with LHAASO-KM2A, using the first 340~days of data from 1/2-KM2A and 230~days of data from 3/4-KM2A. Several regions of interest are used to search for a signal and account for the residual cosmic-ray background after gamma/hadron separation. We find no excess of dark matter signals, and thus place some of the strongest gamma-ray constraints on the lifetime of heavy dark matter particles with mass between 10^5 and 10^9~GeV. Our results with LHAASO are robust, and have important implications for dark matter interpretations of the diffuse astrophysical high-energy neutrino emission.

preprint2026arXiv

CPPO: Contrastive Perception for Vision Language Policy Optimization

We introduce CPPO, a Contrastive Perception Policy Optimization method for finetuning vision-language models (VLMs). While reinforcement learning (RL) has advanced reasoning in language models, extending it to multimodal reasoning requires improving both the perception and reasoning aspects. Prior works tackle this challenge mainly with explicit perception rewards, but disentangling perception tokens from reasoning tokens is difficult, requiring extra LLMs, ground-truth data, forced separation of perception from reasoning by policy model, or applying rewards indiscriminately to all output tokens. CPPO addresses this problem by detecting perception tokens via entropy shifts in the model outputs under perturbed input images. CPPO then extends the RL objective function with a Contrastive Perception Loss (CPL) that enforces consistency under information-preserving perturbations and sensitivity under information-removing ones. Experiments show that CPPO surpasses previous perception-rewarding methods, while avoiding extra models, making training more efficient and scalable.

preprint2026arXiv

Deep view of Composite SNR CTA1 with LHAASO in $γ$-rays up to 300 TeV

The ultra-high-energy (UHE) gamma-ray source 1LHAASO J0007+7303u is positionally associated with the composite SNR CTA1 that is located at high Galactic Latitude $b\approx 10.5^\circ$. This provides a rare opportunity to spatially resolve the component of the pulsar wind nebula (PWN) and supernova remnant (SNR) at UHE. This paper conducted a dedicated data analysis of 1LHAASO J0007+7303u using the data collected from December 2019 to July 2023. This source is well detected with significances of 21$σ$ and 17$σ$ at 8$-$100 TeV and $>$100 TeV, respectively. The corresponding extensions are determined to be 0.23$^{\circ}\pm$0.03$^{\circ}$ and 0.17$^{\circ}\pm$0.03$^{\circ}$. The emission is proposed to originate from the relativistic electrons and positrons accelerated within the PWN of PSR J0007+7303. The energy spectrum is well described by a power-law with an exponential cutoff function $dN/dE = (42.4\pm4.1)(\frac{E}{20\rm\ TeV})^{-2.31\pm0.11}\exp(-\frac{E}{110\pm25\rm\ TeV})$ $\rm\ TeV^{-1}\ cm^{-2}\ s^{-1}$in the energy range from 8 TeV to 300 TeV, implying a steady-state parent electron spectrum $dN_e/dE_e\propto (\frac{E_e}{100\rm\ TeV})^{-3.13\pm0.16}\exp[(\frac{-E_e}{373\pm70\rm\ TeV})^2]$ at energies above $\approx 50 \rm\ TeV$. The cutoff energy of the electron spectrum is roughly equal to the expected current maximum energy of particles accelerated at the PWN terminal shock. Combining the X-ray and gamma-ray emission, the current space-averaged magnetic field can be limited to $\approx 4.5\rm\ μG$. To satisfy the multi-wavelength spectrum and the $γ$-ray extensions, the transport of relativistic particles within the PWN is likely dominated by the advection process under the free-expansion phase assumption.

preprint2026arXiv

Discovery of a new $γ$-ray source LHAASO J0341+5258 with emission up to 200TeV

We report the discovery of a new unidentified extended $γ$-ray source in the Galactic plane named LHAASO J0341+5258 with a pre-trial significance of 8.2 standard deviations above 25 TeV. The best fit position is R.A.$=55.34^{\circ}\pm0.11^{\circ}$ and Dec$=52.97^{\circ}\pm0.07^{\circ}$. The angular size of LHAASO J0341+5258 is $0.29^\circ \pm 0.06^\circ_{stat} \pm0.02^\circ_{sys}$. The flux above 25 TeV is about $20\%$ of the flux of Crab Nebula. Although a power-law fit of the spectrum from 10 TeV to 200 TeV with the photon index $α=2.98 \pm 0.19_{stat} \pm 0.02_{sys}$ is not excluded, the LHAASO data together with the flux upper limit at 10 GeV set by the Fermi LAT observation, indicate a noticeable steepening of an initially hard power-law spectrum %($α\leq 1.75$) spectrum with a cutoff at $\approx 50$ TeV. We briefly discuss the origin of UHE gamma-rays. The lack of an energetic pulsar and a young SNR inside or in the vicinity of LHAASO J0341+5258 challenge, but do not exclude both the leptonic and hadronic scenarios of gamma-ray production.

preprint2026arXiv

Discovery of the Ultra-high energy gamma-ray source LHAASO J2108+5157

We report the discovery of a UHE gamma-ray source, LHAASO J2108+5157, by analyzing the LHAASO-KM2A data of 308.33 live days. Significant excess of gamma-ray induced showers is observed in both energy bands of 25-100 TeV and $\gt$100 TeV with 9.5 sigma and 8.5 sigma, respectively. This source is not significantly favored as an extensive source with the angular extension smaller than the point-spread function of KM2A. The measured energy spectrum from 20 to 200 TeV can be approximately described by a power-law function with an index of -2.83$\pm$ 0.18stat. A harder spectrum is demanded at lower energies considering the flux upper limit set by Fermi-LAT observations. The position of the gamma-ray emission is correlated with a giant molecular cloud, which favors a hadronic origin. No obvious counterparts have been found, deeper multiwavelength observations will help to shed new light on this intriguing UHE source.

preprint2026arXiv

Energy calibration of LHAASO-KM2A using the cosmic ray Moon shadow

We present a precise measurement of the westward, rigidity-dependent shift of the Moon&#39;s shadow using three and a half years of cosmic-ray data collected by the Kilometer Square Array (KM2A) of the Large High Altitude Air Shower Observatory (LHAASO). These measurements enable us to calibrate the detector energy response in the range 20-260 TeV, with results showing excellent agreement with the response derived from Monte Carlo (MC) simulations of the KM2A detector. We also measure a best-fit parameter $ε= 0.015 \pm 0.08$, corresponding to a 95% confidence interval of [-14%, +17%] for the energy-scale estimation. This result establishes the exceptional accuracy of the KM2A-MC in simulating the detector&#39;s response within this energy range.

preprint2026arXiv

Energy-Dependent Shifts of Medium-Scale Anisotropies in Very-High-Energy Cosmic Rays Observed by LHAASO-KM2A

Small deviations from isotropy in the arrival directions of Galactic cosmic rays serve as a unique probe of the local magnetic environment. In this Letter, we report observations of medium-scale anisotropies (MSA) at energies above 10 TeV using the LHAASO-KM2A array. Our analysis identifies four regions of excess and four regions of deficit, each spanning angular scales of approximately ten degrees. Crucially, we detect significant energy-dependent shifts in the centroids of two excess regions: Region B and the newly identified Region $\mathrm{\widetilde{D}}$. We also characterize the energy evolution of the fractional relative intensity across both excess and deficit regions. These findings imply that the observed anisotropies are shaped by the specific realization of the local turbulent magnetic field within the cosmic ray scattering length. Such energy-dependent behaviors impose strict constraints on local turbulence models and cosmic ray propagation theories.

preprint2026arXiv

Evidence for particle acceleration approaching PeV energies in the W51 complex

The $γ$-ray emission from the W51 complex is widely acknowledged to be attributed to the interaction between the cosmic rays (CRs) accelerated by the shock of supernova remnant (SNR) W51C and the dense molecular clouds in the adjacent star-forming region, W51B. However, the maximum acceleration capability of W51C for CRs remains elusive. Based on observations conducted with the Large High Altitude Air Shower Observatory (LHAASO), we report a significant detection of $γ$ rays emanating from the W51 complex, with energies from 2 TeV to 200 TeV. The LHAASO measurements, for the first time, extend the $γ$-ray emission from the W51 complex beyond 100 TeV and reveal a significant spectrum bending at tens of TeV. By combining the ``$π^0$-decay bump&#34; featured data from Fermi-LAT, the broadband $γ$-ray spectrum of the W51 region can be well-characterized by a simple pp-collision model. The observed spectral bending feature suggests an exponential cutoff at $\sim400$~TeV or a power-law break at $\sim200$~TeV in the CR proton spectrum, most likely providing the first evidence of SNRs serving as CR accelerators approaching the PeV regime. Additionally, two young star clusters within W51B could also be theoretically viable to produce the most energetic $γ$ rays observed by LHAASO. Our findings strongly support the presence of extreme CR accelerators within the W51 complex and provide new insights into the origin of Galactic CRs.

preprint2026arXiv

Exploring Lorentz Invariance Violation from Ultra-high-energy Gamma Rays Observed by LHAASO

Recently the LHAASO Collaboration published the detection of 12 ultra-high-energy gamma-ray sources above 100 TeV, with the highest energy photon reaching 1.4 PeV. The first detection of PeV gamma rays from astrophysical sources may provide a very sensitive probe of the effect of the Lorentz invariance violation (LIV), which results in decay of high-energy gamma rays in the superluminal scenario and hence a sharp cutoff of the energy spectrum. Two highest energy sources are studied in this work. No signature of the existence of LIV is found in their energy spectra, and the lower limits on the LIV energy scale are derived. Our results show that the first-order LIV energy scale should be higher than about 10^5 times the Planck scale M_{pl} and that the second-order LIV scale is >10^{-3}M_{pl}. Both limits improve by at least one order of magnitude the previous results.

preprint2026arXiv

Extended Very-High-Energy Gamma-Ray Emission Surrounding PSR J0622 + 3749 Observed by LHAASO-KM2A

We report the discovery of an extended very-high-energy (VHE) gamma-ray source around the location of the middle-aged (207.8 kyr) pulsar PSR J0622+3749 with the Large High Altitude Air Shower Observatory (LHAASO). The source is detected with a significance of $8.2σ$ for $E>25$~TeV assuming a Gaussian template. The best-fit location is (R.A., Dec.)$=(95^{\circ}\!.47\pm0^{\circ}\!.11,\,37^{\circ}\!.92 \pm0^{\circ}\!.09)$, and the extension is $0^{\circ}\!.40\pm0^{\circ}\!.07$. The energy spectrum can be described by a power-law spectrum with an index of ${-2.92 \pm 0.17_{\rm stat} \pm 0.02_{\rm sys} }$. No clear extended multi-wavelength counterpart of the LHAASO source has been found from the radio to sub-TeV bands. The LHAASO observations are consistent with the scenario that VHE electrons escaped from the pulsar, diffused in the interstellar medium, and scattered the interstellar radiation field. If interpreted as the pulsar halo scenario, the diffusion coefficient, inferred for electrons with median energies of $\sim160$~TeV, is consistent with those obtained from the extended halos around Geminga and Monogem and much smaller than that derived from cosmic ray secondaries. The LHAASO discovery of this source thus likely enriches the class of so-called pulsar halos and confirms that high-energy particles generally diffuse very slowly in the disturbed medium around pulsars.

preprint2026arXiv

LHAASO Detection of Ultra-High-Energy Gamma-Ray Emission toward the Giant Molecular Clouds

The $γ$-ray from Giant molecular clouds (GMCs) is regarded as the most ideal tool to perform in-situ measurement of cosmic ray (CR) density and spectra in our Galaxy. We report the first detection of $γ$-ray emissions in the very-high-energy (VHE) domain from the five nearby GMCs with a stacking analysis based on a 4.5-year $γ$-ray observation with the Large High Altitude Air Shower Observatory (LHAASO) experiment. The spectral energy distributions derived from the GMCs are consistent with the expected $γ$-ray flux produced via CR interacting with the ISM in the energy interval 1 - 100 $~\rm$ TeV. In addition, we investigate the presence of the CR spectral `knee&#39; by introducing a spectral break in the $γ$-ray data. While no significant evidence for the CR knee is found, the current KM2A measurements from GMCs strongly favor a proton CR knee located above 0.9$~\rm$ PeV, which is consistent with the latest measurement of the CR spectrum by ground-based experiments.

preprint2026arXiv

MARLaaS: Multi-Tenant Asynchronous Reinforcement Learning as a Service

Reinforcement Learning from Verifiable Rewards (RLVR) has significantly improved the reasoning capabilities of large language models (LLMs), particularly in multi-turn agentic settings involving environment interaction like tool use. However, fine-tuning such models remains prohibitively expensive due to high computational requirements, limiting accessibility. We propose MARLaaS (Multi-tenant Asynchronous RL as a Service), a system for concurrent RL fine-tuning across multiple users and tasks. Our approach is based on two key ideas: (1) sharing a base model across tenants using lightweight LoRA adapters, and (2) a disaggregated asynchronous architecture that decouples rollout generation, environment interaction, and policy training into independently scheduled stages. This design enables tasks to progress through the RL pipeline at their own pace in an event-driven manner, reducing cross-task interference, idle time, and end-to-end latency. In multi-task settings (we report up to 32 concurrent tasks), MARLaaS achieves single-task state-of-the-art performance while improving accelerator utilization by up to 4.3x and reducing end-to-end training time by 85%.

preprint2026arXiv

MDL-GBG: A Non-parametric and Interpretable Granular-Ball Generation Method for Clustering

Existing granular-ball generation methods are still mainly driven by handcrafted quality measures and heuristic splitting or stopping criteria, which may weaken the transparency of local generation decisions in clustering. To address this issue, this paper proposes Minimum Description Length based Granular-Ball Generation (MDL-GBG), a non-parametric and interpretable granular-ball generation method for clustering. MDL-GBG reformulates granular-ball generation as a local model selection problem under the Minimum Description Length principle. For each granular ball, three candidate explanations are compared, namely a single-ball model, a two-ball model, and a core-ball-plus-residual model, and the model with the shortest description length is selected. In this way, ball retention, splitting, and residual peeling are unified within a common coding-theoretic framework. A residual reassignment mechanism is further introduced to re-evaluate peeled-off boundary samples after stable granular-balls are formed. Experiments on 20 UCI datasets show that the stable granular-balls generated by MDL-GBG provide an effective upstream representation for clustering. In particular, MDL-GBG+AC achieves the best average ranks in ARI, ACC, and NMI among the compared methods. These results indicate that MDL-GBG offers a principled and interpretable alternative to heuristic granular-ball generation strategies.

preprint2026arXiv

Measurement of attenuation length of the muon content in extensive air showers from 0.3 to 30 PeV with LHAASO

The attenuation length of the muon content in extensive air showers provides important information regarding the generation and development of air showers. This information can be used not only to improve the description of such showers but also to test fundamental models of hadronic interactions. Using data from the LHAASO-KM2A experiment, the development of the muon content in high-energy air showers was studied. The attenuation length of muon content in the air showers was measured from experimental data in the energy range from 0.3 to 30 PeV using the constant intensity cut method. By comparing the attenuation length of the muon content with predictions from high-energy hadronic interaction models (QGSJET-II-04, SIBYLL 2.3d, and EPOS-LHC), it is evident that LHAASO results are significantly shorter than those predicted by the first two models (QGSJET-II-04 and SIBYLL 2.3d) but relatively close to those predicted by the third model (EPOS-LHC). Thus, the LHAASO data favor the EPOS-LHC model over the other two models. The three interaction models confirmed an increasing trend in the attenuation length as the cosmic-ray energy increases.

preprint2026arXiv

Measurement of Very-high-energy Diffuse Gamma-ray Emissions from the Galactic Plane with LHAASO-WCDA

The diffuse Galactic gamma-ray emission is a very important tool used to study the propagation and interaction of cosmic rays in the Milky Way. In this work, we report the measurements of the diffuse emission from the Galactic plane, covering Galactic longitudes from $15^{\circ}$ to $235^{\circ}$ and latitudes from $-5^{\circ}$ to $+5^{\circ}$, in an energy range of 1 TeV to 25 TeV, with the Water Cherenkov Detector Array (WCDA) of the Large High Altitude Air Shower Observatory (LHAASO). After masking the sky regions of known sources, the diffuse emission is detected with $24.6σ$ and $9.1σ$ significance in the inner Galactic plane and outer Galactic plane, respectively. The WCDA spectra in both regions can be well described by a power-law function, with spectral indices of $-2.67\pm0.05_{\rm stat}$ in the inner region and $-2.83\pm0.19_{\rm stat}$ in the outer region, respectively. Combined with the Square Kilometer Array (KM2A) measurements at higher energies, a clear softening of the spectrum is found in the inner region, with change of spectral indices by $\sim0.5$ at a break energy around $30$ TeV. The fluxes of the diffuse emission are higher by a factor of $1.5-2.7$ than the model prediction assuming local CR spectra and the gas column density, which are consistent with those measured by the KM2A. Along Galactic longitude, the spatial distribution of the diffuse emission shows deviation from that of the gas column density. The spectral shape of the diffuse emission are possibly variation in different longitude region. The WCDA measurements bridge the gap between the low-energy measurements by space detectors and the ultra-high-energy observations by LHAASO-KM2A and other experiments. These results suggest that improved modeling of the wide-band diffuse emission is required.

preprint2026arXiv

MixServe: An Automatic Distributed Serving System for MoE Models with Hybrid Parallelism Based on Fused Communication Algorithm

The Mixture of Experts (MoE) models are emerging as the latest paradigm for Large Language Models (LLMs). However, due to memory constraints, MoE models with billions or even trillions of parameters can only be deployed in multi-GPU or even multi-node & multi-GPU based serving systems. Thus, communication has became a major bottleneck in distributed serving systems, especially inter-node communication. Contemporary distributed MoE models are primarily implemented using all-reduce (AR) based tensor parallelism (TP) and all-to-all (A2A) based expert parallelism (EP). However, TP generally exhibits low inter-node efficiency and is thus confined to high-speed intra-node bandwidth. In contrast, EP tends to suffer from load imbalance, especially when the parallel degree is high. In this work, we introduce MixServe, a novel automatic distributed serving system for efficient deployment of MoE models by a novel TP-EP hybrid parallelism based on fused AR-A2A communication algorithm. MixServe begins by evaluating the communication overhead associated with various parallel strategies, taking into account the model hyperparameters and the configurations of network and hardware resources, and then automatically selects the most efficient parallel strategy. Then, we propose the TP-EP hybrid parallelism based on fused AR-A2A communication algorithm that overlaps intra-node AR communication and inter-node A2A communication. Extensive experiments on DeepSeek-R1 and Qwen3 models demonstrate that MixServe achieves superior inference performance, with 1.08~3.80x acceleration in time to first token (TTFT), 1.03~1.66x acceleration in inter-token latency (ITL), and 5.2%~50.3% throughput improvement compared to existing approaches.

preprint2026arXiv

Observation of anomalous exciton polariton bands in PEPI perovskite based microcavity at room temperature

Recently anomalous energy bands with negative mass attract intensive attention where non Hermiticity plays an important role. In this work we observe anomalous exciton polariton bands in PEPI perovskite based microcavity at room temperature. We simulate the anomalous band structure using a non-Hermitian coupled oscillator model which agree with experiments very well. Our results offer to study non-Hermitian polariton wave dynamics at room temperature.

preprint2026arXiv

Open World Knowledge Aided Single-Cell Foundation Model with Robust Cross-Modal Cell-Language Pre-training

Recent advancements in single-cell multi-omics, particularly RNA-seq, have provided profound insights into cellular heterogeneity and gene regulation. While pre-trained language model (PLM) paradigm based single-cell foundation models have shown promise, they remain constrained by insufficient integration of in-depth individual profiles and neglecting the influence of noise within multi-modal data. To address both issues, we propose an Open-world Language Knowledge-Aided Robust Single-Cell Foundation Model (OKR-CELL). It is built based on a cross-modal Cell-Language pre-training framework, which comprises two key innovations: (1) leveraging Large Language Models (LLMs) based workflow with retrieval-augmented generation (RAG) enriches cell textual descriptions using open-world knowledge; (2) devising a Cross-modal Robust Alignment (CRA) objective that incorporates sample reliability assessment, curriculum learning, and coupled momentum contrastive learning to strengthen the model&#39;s resistance to noisy data. After pretraining on 32M cell-text pairs, OKR-CELL obtains cutting-edge results across 6 evaluation tasks. Beyond standard benchmarks such as cell clustering, cell-type annotation, batch-effect correction, and few-shot annotation, the model also demonstrates superior performance in broader multi-modal applications, including zero-shot cell-type annotation and bidirectional cell-text retrieval.

preprint2026arXiv

REE-TTT: Highly Adaptive Radar Echo Extrapolation Based on Test-Time Training

Precipitation nowcasting is critically important for meteorological forecasting. Deep learning-based Radar Echo Extrapolation (REE) has become a predominant nowcasting approach, yet it suffers from poor generalization due to its reliance on high-quality local training data and static model parameters, limiting its applicability across diverse regions and extreme events. To overcome this, we propose REE-TTT, a novel model that incorporates an adaptive Test-Time Training (TTT) mechanism. The core of our model lies in the newly designed Spatio-temporal Test-Time Training (ST-TTT) block, which replaces the standard linear projections in TTT layers with task-specific attention mechanisms, enabling robust adaptation to non-stationary meteorological distributions and thereby significantly enhancing the feature representation of precipitation. Experiments under cross-regional extreme precipitation scenarios demonstrate that REE-TTT substantially outperforms state-of-the-art baseline models in prediction accuracy and generalization, exhibiting remarkable adaptability to data distribution shifts.

preprint2026arXiv

Transient Large-Scale Anisotropy in TeV Cosmic Rays due to an Interplanetary Coronal Mass Ejection

Large- or medium-scale cosmic ray anisotropy at TeV energies has not previously been confirmed to vary with time. Transient anisotropy changes have been observed below 150 GeV, especially near the passage of an interplanetary shock and coronal mass ejection containing a magnetic flux rope ejected by a solar storm, which can trigger a geomagnetic storm with practical consequences. In such events, cosmic rays provide remote sensing of the magnetic field properties. Here we report the observation of transient large-scale anisotropy in TeV cosmic ray ions using data from the Large High Altitude Air Shower Observatory (LHAASO). We analyze hourly skymaps of the transient cosmic ray intensity excess or deficit, the gradient of which indicates the direction and magnitude of transient large-scale anisotropy across the field of view. We observe enhanced anisotropy above typical hourly fluctuations with $>$5$σ$ significance during some hours of November 4, 2021, in separate data sets for four primary cosmic ray energy ranges of median energy from $E$=0.7 to 3.1 TeV. The gradient varies with energy as $E^γ$, where $γ\approx-0.5$. At a median energy $\leq$1.0 TeV, this gradient corresponds to a dipole anisotropy of at least 1\%, or possibly a weaker anisotropy of higher order. This new type of observation opens the opportunity to study interplanetary magnetic structures using air shower arrays around the world, complementing existing in situ and remote measurements of plasma properties.

preprint2025arXiv

CrossTrafficLLM: A Human-Centric Framework for Interpretable Traffic Intelligence via Large Language Model

While accurate traffic forecasting is vital for Intelligent Transportation Systems (ITS), effectively communicating predicted conditions via natural language for human-centric decision support remains a challenge and is often handled separately. To address this, we propose CrossTrafficLLM, a novel GenAI-driven framework that simultaneously predicts future spatiotemporal traffic states and generates corresponding natural language descriptions, specifically targeting conditional abnormal event summaries. We tackle the core challenge of aligning quantitative traffic data with qualitative textual semantics by leveraging Large Language Models (LLMs) within a unified architecture. This design allows generative textual context to improve prediction accuracy while ensuring generated reports are directly informed by the forecast. Technically, a text-guided adaptive graph convolutional network is employed to effectively merge high-level semantic information with the traffic network structure. Evaluated on the BJTT dataset, CrossTrafficLLM demonstrably surpasses state-of-the-art methods in both traffic forecasting performance and text generation quality. By unifying prediction and description generation, CrossTrafficLLM delivers a more interpretable, and actionable approach to generative traffic intelligence, offering significant advantages for modern ITS applications.

preprint2025arXiv

Observation of the $γ$-ray Emission from W43 with LHAASO

In this paper, we report the detection of the very-high-energy (VHE, $ 100{\rm\ GeV} < E < 100{\rm\ TeV} $) and ultra-high-energy (UHE, $E > 100\rm\ TeV$) $γ$-ray emissions from the direction of the young star-forming region W43, observed by the Large High Altitude Air Shower Observation (LHAASO). The extended $γ$-ray source was detected with a significance of ${\sim}16\,σ$ by KM2A and ${\sim}17\,σ$ by WCDA, respectively. The angular extension of this $γ$-ray source is about 0.5 degrees, corresponding to a physical size of about 50 pc. We discuss the origin of the $γ$-ray emission and possible cosmic ray acceleration in the W43 region using multi-wavelength data. Our findings suggest that W43 is likely another young star cluster capable of accelerating cosmic rays (CRs) to at least several hundred TeV.

preprint2025arXiv

Study of Ultra-High-Energy Gamma-Ray Source 1LHAASO J0056+6346u and Its Possible Origins

We report a dedicated study of the newly discovered extended UHE $γ$-ray source 1LHAASO J0056+6346u. Analyzing 979 days of LHAASO-WCDA data and 1389 days of LHAASO-KM2A data, we observed a significant excess of $γ$-ray events with both WCDA and KM2A. Assuming a point power-law source with a fixed spectral index, the significance maps reveal excesses of ${\sim}12.65\,σ$, ${\sim}22.18\,σ$, and ${\sim}10.24\,σ$ in the energy ranges of 1--25 TeV, 25--100 TeV, and $> 100$ TeV, respectively. We use a 3D likelihood algorithm to derive the morphological and spectral parameters, and the source is detected with significances of $12.65\,σ$ by WCDA and $25.27\,σ$ by KM2A. The best-fit positions derived from WCDA and KM2A data are (R.A. = $13.96^\circ\pm0.09^\circ$, Decl. = $63.92^\circ\pm0.05^\circ$) and (R.A. = $14.00^\circ\pm0.05^\circ$, Decl. = $63.79^\circ\pm0.02^\circ$), respectively. The angular size ($r_{39}$) of 1LHAASO J0056+6346u is $0.34^\circ\pm0.04^\circ$ at 1--25 TeV and $0.24^\circ\pm0.02^\circ$ at $> 25$ TeV. The differential flux of this UHE $γ$-ray source can be described by an exponential cutoff power-law function: $(2.67\pm0.25) \times 10^{-15} (E/20\,\text{TeV})^{-1.97\pm0.10} e^{-E/(55.1\pm7.2)\,\text{TeV}} \,\text{TeV}^{-1}\,\text{cm}^{-2}\,\text{s}^{-1}$. To explore potential sources of $γ$-ray emission, we investigated the gas distribution around 1LHAASO J0056+6346u. 1LHAASO J0056+6346u is likely to be a TeV PWN powered by an unknown pulsar, which would naturally explain both its spatial and spectral properties. Another explanation is that this UHE $γ$-ray source might be associated with gas content illuminated by a nearby CR accelerator, possibly the SNR candidate G124.0+1.4.

preprint2025arXiv

Ultrahigh-Energy Gamma-ray Emission Associated with Black Hole-Jet Systems

Black holes (BH), one of the most intriguing objects in the universe, can manifest themselves through electromagnetic radiation initiated by the accretion flow. Some stellar-mass BHs drive relativistic jets when accreting matter from their companion stars, forming microquasars. Non-thermal emission from the radio to tera-electronvolt (TeV) gamma-ray band has been observed from microquasars, indicating the acceleration of relativistic particles. Here we report detection of four microquasars (SS 433, V4641 Sgr, GRS 1915+105, MAXI J1820+070) of spectrum extending to the ultrahigh-energy (UHE; photon energy $E>100$ TeV) band and one microquasar (Cygnus X-1) of spectrum approaching 100 TeV, using the Large High Altitude Air Shower Observatory (LHAASO). Notably, the total emission associated with SS 433 cannot be interpreted with a single leptonic component. In the UHE band, its emission is in spatial coincidence with a giant atomic cloud, which is consistent with a hadronic origin. An elongated source is discovered from V4641 Sgr with the spectrum continuing up to 800 TeV. The detection of UHE gamma rays demonstrates that accreting BHs and their environments can operate as extremely efficient accelerators of particles out of 1 peta-electronvolt (PeV), suggesting microquasars to be important contributors to Galactic cosmic rays especially around the `knee&#39; region.

preprint2023arXiv

Gas-phase molecules in protoplanetary nebulae with the 21 μm emission feature

It has been more than 30 years since the enigmatic 21 μm emission feature was first discovered in protoplanetary nebulae (PPNs). Although dozens of different dust carrier candidates have been proposed, there is as yet no widely accepted one. We present the results of molecular observations toward 21μm objects using the 10m Submillimeter Telescope of Arizona Radio Observatory at the 1.3 mm band and the 13.7 m telescope of Purple Mountain Observatory at the 3mm band, aiming to investigate whether the gas-phase environments of these unusual sources have some peculiarities compared to normal PPNs. We detect 31 emission lines belonging to seven different molecular species, most of which are the first detection in 21 μm PPNs. The observations provide clues on the identification of the 21 μm feature. We report a correlation study between the fractional abundance of gas-phase molecules and the strengths of the 21 μm emission. Our study shows that given the small sample size, the 21 μm feature has weak or no correlations with the gas-phase molecules. Future radio observations of high spatial and spectral resolution toward a large sample are desirable to elucidate the 21 μm emission phenomena.

preprint2023arXiv

Generalizable Black-Box Adversarial Attack with Meta Learning

In the scenario of black-box adversarial attack, the target model&#39;s parameters are unknown, and the attacker aims to find a successful adversarial perturbation based on query feedback under a query budget. Due to the limited feedback information, existing query-based black-box attack methods often require many queries for attacking each benign example. To reduce query cost, we propose to utilize the feedback information across historical attacks, dubbed example-level adversarial transferability. Specifically, by treating the attack on each benign example as one task, we develop a meta-learning framework by training a meta-generator to produce perturbations conditioned on benign examples. When attacking a new benign example, the meta generator can be quickly fine-tuned based on the feedback information of the new task as well as a few historical attacks to produce effective perturbations. Moreover, since the meta-train procedure consumes many queries to learn a generalizable generator, we utilize model-level adversarial transferability to train the meta-generator on a white-box surrogate model, then transfer it to help the attack against the target model. The proposed framework with the two types of adversarial transferability can be naturally combined with any off-the-shelf query-based attack methods to boost their performance, which is verified by extensive experiments.

preprint2022arXiv

A characteristic-spectral-mixed scheme for six-dimensional Wigner-Coulomb dynamics

Numerical resolution for 6-D Wigner dynamics under the Coulomb potential faces with the combined challenges of high dimensionality, nonlocality, oscillation and singularity. In particular, the extremely huge memory storage of 6-D grids hinders the usage of all existing deterministic numerical scheme, which is well-known as the curse of dimensionality. To surmount these difficulties, we propose a massively parallel solver, termed the CHAracteristic-Spectral-Mixed (CHASM) scheme, by fully exploiting two distinct features of the Wigner equation: Locality of spatial advection and nonlocality of quantum interaction. Our scheme utilizes the local cubic B-spline basis to interpolate the local spatial advection. The key is to use a perfectly matched boundary condition to give a closure of spline coefficients, so that distributed pieces can recover the global one as accurately as possible owing to the rapid decay of wavelet basis in the dual space, and communication costs are significantly reduced. To resolve the nonlocal pseudodifferential operator with weakly singular symbol, CHASM further adopts the truncated kernel method to attain a highly efficient approximation. Several typical experiments including the quantum harmonic oscillator and Hydrogen 1s state demonstrate the accuracy and efficiency of CHASM. The non-equilibrium electron-proton couplings are also clearly displayed and reveal the uncertainty principle and quantum tunneling in phase space. Finally, the scalability of CHASM up to 16000 cores is presented.

preprint2022arXiv

A deep learning-based remaining useful life prediction approach for bearings

In industrial applications, nearly half the failures of motors are caused by the degradation of rolling element bearings (REBs). Therefore, accurately estimating the remaining useful life (RUL) for REBs are of crucial importance to ensure the reliability and safety of mechanical systems. To tackle this challenge, model-based approaches are often limited by the complexity of mathematical modeling. Conventional data-driven approaches, on the other hand, require massive efforts to extract the degradation features and construct health index. In this paper, a novel online data-driven framework is proposed to exploit the adoption of deep convolutional neural networks (CNN) in predicting the RUL of bearings. More concretely, the raw vibrations of training bearings are first processed using the Hilbert-Huang transform (HHT) and a novel nonlinear degradation indicator is constructed as the label for learning. The CNN is then employed to identify the hidden pattern between the extracted degradation indicator and the vibration of training bearings, which makes it possible to estimate the degradation of the test bearings automatically. Finally, testing bearings&#39; RULs are predicted by using a $ε$-support vector regression model. The superior performance of the proposed RUL estimation framework, compared with the state-of-the-art approaches, is demonstrated through the experimental results. The generality of the proposed CNN model is also validated by transferring to bearings undergoing different operating conditions.

preprint2022arXiv

A multi view multi stage and multi window framework for pulmonary artery segmentation from CT scans

This is the technical report of the 9th place in the final result of PARSE2022 Challenge. We solve the segmentation problem of the pulmonary artery by using a two-stage method based on a 3D CNN network. The coarse model is used to locate the ROI, and the fine model is used to refine the segmentation result. In addition, in order to improve the segmentation performance, we adopt multi-view and multi-window level method, at the same time we employ a fine-tune strategy to mitigate the impact of inconsistent labeling.

preprint2022arXiv

A phase transition driven by subtle distortion without broken symmetry on spin, charge and lattice in Layered LnCu4-δP2(Ln=Eu, Sr)

In the scenario of Landau phase transition theory in condensed matter physics, any thermal dynamic phase transition must be subject to some kind of broken symmetries, that are relative to its spin, charge, orbital and lattice. Here we report a rare phase transition at Tp ~120 K or 140 K in layered materials LnCu4-δP2 (Ln=Eu, Sr) driven by a subtle structural-distortion without any broken symmetry on charge, spin and lattice. The variations of the lattice parameters, (ΔLc/Lc) ~ 0.013% or 0.062%, verified by thermal expansion, is much less than that for a typical crystalline phase transition (~0.5-1%), but the significant anomaly in heat capacity provides clear evidence of its intrinsic nature of thermodynamic transition.

preprint2022arXiv

A Real-time Critical-scenario-generation Framework for Testing Autonomous Driving System

In order to find the most likely failure scenarios which may occur under certain given operation domain, critical-scenario-based test is supposed as an effective and widely used method, which gives suggestions for designers to improve the developing algorithm. However, for the state of art, critical-scenario generation approaches commonly utilize random-search or reinforcement learning methods to generate series of scenarios for a specific algorithm, which takes amounts of computing resource for testing a developing target that is always changing, and inapplicable for testing a real-time system. In this paper, we proposed a real-time critical-scenario-generation (RTCSG) framework to address the above challenges. In our framework, an aggressive-driving algorithm is proposed in controlling the virtual agent vehicles, a specially designed cost function is presented to guide scenarios to evolve towards critical conditions, and a self-adaptive coefficient iteration is designed that enable the approach to operate successfully in different conditions. With our proposed method, the critical-scenarios can be directly generated for the target under test which is a black-box system, and the real-time critical-scenario test can be brought into reality. The simulation results show that our approach is able to obtain more critical scenarios in most conditions than current methods, with a higher stability of success. For a real-time testing, our approach improves the efficiency around 16 times.

preprint2022arXiv

Achieving Model Fairness in Vertical Federated Learning

Vertical federated learning (VFL) has attracted greater and greater interest since it enables multiple parties possessing non-overlapping features to strengthen their machine learning models without disclosing their private data and model parameters. Similar to other machine learning algorithms, VFL faces demands and challenges of fairness, i.e., the learned model may be unfairly discriminatory over some groups with sensitive attributes. To tackle this problem, we propose a fair VFL framework in this work. First, we systematically formulate the problem of training fair models in VFL, where the learning task is modelled as a constrained optimization problem. To solve it in a federated and privacy-preserving manner, we consider the equivalent dual form of the problem and develop an asynchronous gradient coordinate-descent ascent algorithm, where some active data parties perform multiple parallelized local updates per communication round to effectively reduce the number of communication rounds. The messages that the server sends to passive parties are deliberately designed such that the information necessary for local updates is released without intruding on the privacy of data and sensitive attributes. We rigorously study the convergence of the algorithm when applied to general nonconvex-concave min-max problems. We prove that the algorithm finds a $δ$-stationary point of the dual objective in $\mathcal{O}(δ^{-4})$ communication rounds under mild conditions. Finally, the extensive experiments on three benchmark datasets demonstrate the superior performance of our method in training fair models.

preprint2022arXiv

ATOMS: ALMA Three-millimeter Observations of Massive Star-forming regions -- VII. A catalogue of SiO clumps from ACA observations

To understand the nature of SiO emission, we conducted ACA observations of the SiO (2-1) lines toward 146 massive star-forming regions, as part of the ALMA Three-millimeter Observations of Massive Star-forming regions (ATOMS) survey. We detected SiO emission in 128 (87.7$\%$) sources and identified 171 SiO clumps, 105 of which are spatially separated from 3 mm continuum emission. A large amount of the SiO line profiles (60$\%$) are non-Gaussian. The velocity dispersion of the SiO lines ranges from 0.3 to 5.43 km s$^{-1}$. In 63 sources the SiO clumps are associated with H$_\rm{II}$ regions characterized by H40$α$ emission. We find that 68$\%$ (116) of the SiO clumps are associated with strong outflows. The median velocity dispersion of the SiO line for outflow sources and non-outflow sources is 1.91 km s$^{-1}$ and 0.99 km s$^{-1}$, respectively. These results indicate that outflow activities could be connected to strongly shocked gas. The velocity dispersion and [SiO]/[H$^{13}$CO$^+$] intensity ratio do not show any correlation with the dust temperature and particle number density of clumps. We find a positive correlation between the SiO line luminosity and the bolometric luminosity, implying stronger shock activities are associated with more luminous proto-clusters. The SiO clumps in associations with H$_\rm{II}$ regions were found to show a steeper feature in $L_\rm{sio}$/$L_\rm{bol}$. The SiO line luminosity and the fraction of shocked gas have no apparent evidence of correlation with the evolutionary stages traced by luminosity to mass ratio ($L_\rm{bol}/M$).

preprint2022arXiv

ATOMS: ALMA Three-millimeter Observations of Massive Star-forming regions -- VIII. A search for hot cores by using C$_2$H$_5$CN, CH$_3$OCHO and CH$_3$OH lines

Hot cores characterized by rich lines of complex organic molecules are considered as ideal sites for investigating the physical and chemical environments of massive star formation. We present a search for hot cores by using typical nitrogen- and oxygen-bearing complex organic molecules (C$_2$H$_5$CN, CH$_3$OCHO and CH$_3$OH), based on ALMA Three-millimeter Observations of Massive Star-forming regions (ATOMS). The angular resolutions and line sensitivities of the ALMA observations are better than 2 arcsec and 10 mJy/beam, respectively. A total of 60 hot cores are identified with 45 being newly detected, in which the complex organic molecules have high gas temperatures ($>$ 100 K) and small source sizes ($<$ 0.1 pc). So far this is the largest sample of hot cores observed with similar angular resolution and spectral coverage. The observations have also shown nitrogen and oxygen differentiation in both line emission and gas distribution in 29 hot cores. Column densities of CH$_3$OH and CH$_3$OCHO increase as rotation temperatures rise. The column density of CH$_3$OCHO correlates tightly with that of CH$_3$OH. The pathways for production of different species are discussed. Based on the spatial position difference between hot cores and UC~H{\sc ii} regions, we conclude that 24 hot cores are externally heated while the other hot cores are internally heated. The observations presented here will potentially help establish a hot core template for studying massive star formation and astrochemistry.

preprint2022arXiv

ATOMS: ALMA Three-millimeter Observations of Massive Star-forming regions -- XI. From inflow to infall in hub-filament systems

We investigate the presence of hub-filament systems in a large sample of 146 active proto-clusters, using H$^{13}$CO$^{+}$ J=1-0 molecular line data obtained from the ATOMS survey. We find that filaments are ubiquitous in proto-clusters, and hub-filament systems are very common from dense core scales ($\sim$0.1 pc) to clump/cloud scales ($\sim$1-10 pc). The proportion of proto-clusters containing hub-filament systems decreases with increasing dust temperature ($T_d$) and luminosity-to-mass ratios ($L/M$) of clumps, indicating that stellar feedback from H{\sc ii} regions gradually destroys the hub-filament systems as proto-clusters evolve. Clear velocity gradients are seen along the longest filaments with a mean velocity gradient of 8.71 km s$^{-1}$pc$^{-1}$ and a median velocity gradient of 5.54 km s$^{-1}$pc$^{-1}$. We find that velocity gradients are small for filament lengths larger than $\sim$1~pc, probably hinting at the existence of inertial inflows, although we cannot determine whether the latter are driven by large-scale turbulence or large-scale gravitational contraction. In contrast, velocity gradients below $\sim$1~pc dramatically increase as filament lengths decrease, indicating that the gravity of the hubs or cores starts to dominate gas infall at small scales. We suggest that self-similar hub-filament systems and filamentary accretion at all scales may play a key role in high-mass star formation.

preprint2022arXiv

ATOMS: ALMA Three-millimeter Observations of Massive Star-forming regions -- XII: Fragmentation and multi-scale gas kinematics in protoclusters G12.42+0.50 and G19.88-0.53

We present new continuum and molecular line data from the ALMA Three-millimeter Observations of Massive Star-forming regions (ATOMS) survey for the two protoclusters, G12.42+0.50 and G19.88-0.53. The 3 mm continuum maps reveal seven cores in each of the two globally contracting protoclusters. These cores satisfy the radius-mass relation and the surface mass density criteria for high-mass star formation. Similar to their natal clumps, the virial analysis of the cores suggests that they are undergoing gravitational collapse ($\rm α_{vir} << 2$). The clump to core scale fragmentation is investigated and the derived core masses and separations are found to be consistent with thermal Jeans fragmentation. We detect large-scale filamentary structures with velocity gradients and multiple outflows in both regions. Dendrogram analysis of the H$^{13}$CO$^{+}$ map identifies several branch and leaf structures with sizes $\sim$ 0.1 and 0.03 pc, respectively. The supersonic gas motion displayed by the branch structures is in agreement with the Larson power-law indicating that the gas kinematics at this spatial scale is driven by turbulence. The transition to transonic/subsonic gas motion is seen to occur at spatial scales of $\sim$0.1 pc indicating the dissipation of turbulence. In agreement with this, the leaf structures reveal gas motions that deviate from the slope of Larson&#39;s law. From the large-scale converging filaments to the collapsing cores, the gas dynamics in G12.42+0.50 and G19.88-0.53 show scale-dependent dominance of turbulence and gravity and the combination of these two driving mechanisms needs to be invoked to explain massive star formation in the protoclusters.

preprint2022arXiv

ATOMS: ALMA Three-millimeter Observations of Massive Star-forming regions-IX. A pilot study towards IRDC G034.43+00.24 on multi-scale structures and gas kinematics

We present a comprehensive study of the gas kinematics associated with density structures at different spatial scales in the filamentary infrared dark cloud, G034.43+00.24 (G34). This study makes use of the H13CO+ (1-0) molecular line data from the ALMA Three-millimeter Observations of Massive Star-forming regions (ATOMS) survey, which has spatial and velocity resolution of 0.04 pc and 0.2 km/s, respectively. Several tens of dendrogram structures have been extracted in the position-position-velocity space of H13CO+, which include 21 small-scale leaves and 20 larger-scale branches. Overall, their gas motions are supersonic but they exhibit the interesting behavior where leaves tend to be less dynamically supersonic than the branches. For the larger-scale, branch structures, the observed velocity-size relation (i.e., velocity variation/dispersion versus size) are seen to follow the Larson scaling exponent while the smaller-scale, leaf structures show a systematic deviation and display a steeper slope. We argue that the origin of the observed kinematics of the branch structures is likely to be a combination of turbulence and gravity-driven ordered gas flows. In comparison, gravity-driven chaotic gas motion is likely at the level of small-scale leaf structures. The results presented in our previous paper and this current follow-up study suggest that the main driving mechanism for mass accretion/inflow observed in G34 varies at different spatial scales. We therefore conclude that a scale-dependent combined effect of turbulence and gravity is essential to explain the star-formation processes in G34.

preprint2022arXiv

AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data

Semi-supervised learning (SSL) has seen great strides when labeled data is scarce but unlabeled data is abundant. Critically, most recent work assume that such unlabeled data is drawn from the same distribution as the labeled data. In this work, we show that state-of-the-art SSL algorithms suffer a degradation in performance in the presence of unlabeled auxiliary data that does not necessarily possess the same class distribution as the labeled set. We term this problem as Auxiliary-SSL and propose AuxMix, an algorithm that leverages self-supervised learning tasks to learn generic features in order to mask auxiliary data that are not semantically similar to the labeled set. We also propose to regularize learning by maximizing the predicted entropy for dissimilar auxiliary samples. We show an improvement of 5% over existing baselines on a ResNet-50 model when trained on CIFAR10 dataset with 4k labeled samples and all unlabeled data is drawn from the Tiny-ImageNet dataset. We report competitive results on several datasets and conduct ablation studies.

preprint2022arXiv

Boosting Fast Adversarial Training with Learnable Adversarial Initialization

Adversarial training (AT) has been demonstrated to be effective in improving model robustness by leveraging adversarial examples for training. However, most AT methods are in face of expensive time and computational cost for calculating gradients at multiple steps in generating adversarial examples. To boost training efficiency, fast gradient sign method (FGSM) is adopted in fast AT methods by calculating gradient only once. Unfortunately, the robustness is far from satisfactory. One reason may arise from the initialization fashion. Existing fast AT generally uses a random sample-agnostic initialization, which facilitates the efficiency yet hinders a further robustness improvement. Up to now, the initialization in fast AT is still not extensively explored. In this paper, we boost fast AT with a sample-dependent adversarial initialization, i.e., an output from a generative network conditioned on a benign image and its gradient information from the target network. As the generative network and the target network are optimized jointly in the training phase, the former can adaptively generate an effective initialization with respect to the latter, which motivates gradually improved robustness. Experimental evaluations on four benchmark databases demonstrate the superiority of our proposed method over state-of-the-art fast AT methods, as well as comparable robustness to advanced multi-step AT methods. The code is released at https://github.com//jiaxiaojunQAQ//FGSM-SDI.

preprint2022arXiv

Cosine Model Watermarking Against Ensemble Distillation

Many model watermarking methods have been developed to prevent valuable deployed commercial models from being stealthily stolen by model distillations. However, watermarks produced by most existing model watermarking methods can be easily evaded by ensemble distillation, because averaging the outputs of multiple ensembled models can significantly reduce or even erase the watermarks. In this paper, we focus on tackling the challenging task of defending against ensemble distillation. We propose a novel watermarking technique named CosWM to achieve outstanding model watermarking performance against ensemble distillation. CosWM is not only elegant in design, but also comes with desirable theoretical guarantees. Our extensive experiments on public data sets demonstrate the excellent performance of CosWM and its advantages over the state-of-the-art baselines.

preprint2022arXiv

Deep Reinforcement Learning for Exact Combinatorial Optimization: Learning to Branch

Branch-and-bound is a systematic enumerative method for combinatorial optimization, where the performance highly relies on the variable selection strategy. State-of-the-art handcrafted heuristic strategies suffer from relatively slow inference time for each selection, while the current machine learning methods require a significant amount of labeled data. We propose a new approach for solving the data labeling and inference latency issues in combinatorial optimization based on the use of the reinforcement learning (RL) paradigm. We use imitation learning to bootstrap an RL agent and then use Proximal Policy Optimization (PPO) to further explore global optimal actions. Then, a value network is used to run Monte-Carlo tree search (MCTS) to enhance the policy network. We evaluate the performance of our method on four different categories of combinatorial optimization problems and show that our approach performs strongly compared to the state-of-the-art machine learning and heuristics based methods.

preprint2022arXiv

E-LANG: Energy-Based Joint Inferencing of Super and Swift Language Models

Building huge and highly capable language models has been a trend in the past years. Despite their great performance, they incur high computational cost. A common solution is to apply model compression or choose light-weight architectures, which often need a separate fixed-size model for each desirable computational budget, and may lose performance in case of heavy compression. This paper proposes an effective dynamic inference approach, called E-LANG, which distributes the inference between large accurate Super-models and light-weight Swift models. To this end, a decision making module routes the inputs to Super or Swift models based on the energy characteristics of the representations in the latent space. This method is easily adoptable and architecture agnostic. As such, it can be applied to black-box pre-trained models without a need for architectural manipulations, reassembling of modules, or re-training. Unlike existing methods that are only applicable to encoder-only backbones and classification tasks, our method also works for encoder-decoder structures and sequence-to-sequence tasks such as translation. The E-LANG performance is verified through a set of experiments with T5 and BERT backbones on GLUE, SuperGLUE, and WMT. In particular, we outperform T5-11B with an average computations speed-up of 3.3$\times$ on GLUE and 2.9$\times$ on SuperGLUE. We also achieve BERT-based SOTA on GLUE with 3.2$\times$ less computations. Code and demo are available in the supplementary materials.

preprint2022arXiv

EBHI:A New Enteroscope Biopsy Histopathological H&E Image Dataset for Image Classification Evaluation

Background and purpose: Colorectal cancer has become the third most common cancer worldwide, accounting for approximately 10% of cancer patients. Early detection of the disease is important for the treatment of colorectal cancer patients. Histopathological examination is the gold standard for screening colorectal cancer. However, the current lack of histopathological image datasets of colorectal cancer, especially enteroscope biopsies, hinders the accurate evaluation of computer-aided diagnosis techniques. Methods: A new publicly available Enteroscope Biopsy Histopathological H&E Image Dataset (EBHI) is published in this paper. To demonstrate the effectiveness of the EBHI dataset, we have utilized several machine learning, convolutional neural networks and novel transformer-based classifiers for experimentation and evaluation, using an image with a magnification of 200x. Results: Experimental results show that the deep learning method performs well on the EBHI dataset. Traditional machine learning methods achieve maximum accuracy of 76.02% and deep learning method achieves a maximum accuracy of 95.37%. Conclusion: To the best of our knowledge, EBHI is the first publicly available colorectal histopathology enteroscope biopsy dataset with four magnifications and five types of images of tumor differentiation stages, totaling 5532 images. We believe that EBHI could attract researchers to explore new classification algorithms for the automated diagnosis of colorectal cancer, which could help physicians and patients in clinical settings.

preprint2022arXiv

Effect of surface H$_2$ on molecular hydrogen formation on interstellar grains

We investigate how the existence of hydrogen molecules on grain surfaces may affect H$_2$ formation efficiency in diffuse and translucent clouds. Hydrogen molecules are able to reduce the desorption energy of H atoms on grain surfaces in models. The detailed microscopic Monte Carlo method is used to perform model simulations. We found that the impact of the existence of H$_2$ on H$_2$ formation efficiency strongly depends on the diffusion barriers of H$_2$ on grain surfaces. Diffuse cloud models that do not consider surface H$_2$ predict that H atom recombination efficiency is above 0.5 over a grain temperature (T) range 10 K and 14 K. The adopted H$_2$ diffusion barriers in diffuse cloud models that consider surface H$_2$ are 80$\%$ H$_2$ desorption energies so that H$_2$ can be trapped in stronger binding sites. Depending on model parameters, these diffuse cloud models predict that the recombination efficiency is between nearly 0 and 0.5 at 10 K $\leq$ T $\leq$ 14 K. Translucent cloud model results show that H$_2$ formation efficiency is not affected by the existence of surface H$_2$ if the adopted average H$_2$ diffusion barrier on grain surfaces is low (194 K) so that H$_2$ can diffuse rapidly on grain surfaces. However, the recombination efficiency can drop to below 0.002 at T $\geq$ 10 K if higher average H$_2$ diffusion barrier is used (255 K) in translucent cloud models.

preprint2022arXiv

Estimating Visual Information From Audio Through Manifold Learning

We propose a new framework for extracting visual information about a scene only using audio signals. Audio-based methods can overcome some of the limitations of vision-based methods i.e., they do not require &#34;line-of-sight&#34;, are robust to occlusions and changes in illumination, and can function as a backup in case vision/lidar sensors fail. Therefore, audio-based methods can be useful even for applications in which only visual information is of interest Our framework is based on Manifold Learning and consists of two steps. First, we train a Vector-Quantized Variational Auto-Encoder to learn the data manifold of the particular visual modality we are interested in. Second, we train an Audio Transformation network to map multi-channel audio signals to the latent representation of the corresponding visual sample. We show that our method is able to produce meaningful images from audio using a publicly available audio/visual dataset. In particular, we consider the prediction of the following visual modalities from audio: depth and semantic segmentation. We hope the findings of our work can facilitate further research in visual information extraction from audio. Code is available at: https://github.com/ubc-vision/audio_manifold.

preprint2022arXiv

Extending Momentum Contrast with Cross Similarity Consistency Regularization

Contrastive self-supervised representation learning methods maximize the similarity between the positive pairs, and at the same time tend to minimize the similarity between the negative pairs. However, in general the interplay between the negative pairs is ignored as they do not put in place special mechanisms to treat negative pairs differently according to their specific differences and similarities. In this paper, we present Extended Momentum Contrast (XMoCo), a self-supervised representation learning method founded upon the legacy of the momentum-encoder unit proposed in the MoCo family configurations. To this end, we introduce a cross consistency regularization loss, with which we extend the transformation consistency to dissimilar images (negative pairs). Under the cross consistency regularization rule, we argue that semantic representations associated with any pair of images (positive or negative) should preserve their cross-similarity under pretext transformations. Moreover, we further regularize the training loss by enforcing a uniform distribution of similarity over the negative pairs across a batch. The proposed regularization can easily be added to existing self-supervised learning algorithms in a plug-and-play fashion. Empirically, we report a competitive performance on the standard Imagenet-1K linear head classification benchmark. In addition, by transferring the learned representations to common downstream tasks, we show that using XMoCo with the prevalently utilized augmentations can lead to improvements in the performance of such tasks. We hope the findings of this paper serve as a motivation for researchers to take into consideration the important interplay among the negative examples in self-supervised learning.

preprint2022arXiv

Fast Adversarial Training with Adaptive Step Size

While adversarial training and its variants have shown to be the most effective algorithms to defend against adversarial attacks, their extremely slow training process makes it hard to scale to large datasets like ImageNet. The key idea of recent works to accelerate adversarial training is to substitute multi-step attacks (e.g., PGD) with single-step attacks (e.g., FGSM). However, these single-step methods suffer from catastrophic overfitting, where the accuracy against PGD attack suddenly drops to nearly 0% during training, destroying the robustness of the networks. In this work, we study the phenomenon from the perspective of training instances. We show that catastrophic overfitting is instance-dependent and fitting instances with larger gradient norm is more likely to cause catastrophic overfitting. Based on our findings, we propose a simple but effective method, Adversarial Training with Adaptive Step size (ATAS). ATAS learns an instancewise adaptive step size that is inversely proportional to its gradient norm. The theoretical analysis shows that ATAS converges faster than the commonly adopted non-adaptive counterparts. Empirically, ATAS consistently mitigates catastrophic overfitting and achieves higher robust accuracy on CIFAR10, CIFAR100 and ImageNet when evaluated on various adversarial budgets.

preprint2022arXiv

FAST search for circumstellar atomic hydrogen--I: the young planetary nebula IC 4997

Using the Five-hundred-meter Aperture Spherical radio Telescope (FAST) in Guizhou, China, we detect the 21cm neutral atomic hydrogen absorption in the young planetary nebula IC 4997. The absorption arises from a shell also associated with Na I D lines. The H I shell has a mass of $1.46\times10^{-2}$ M$_\odot$ and a dynamic age of 990yr. The column density of H I is estimated to be $7.1\times10^{20}$ cm$^{-2}$, which can be well explained in terms of a photodissociation region around the ionized nebula, limited by self shielding of H$_2$. We find that the atomic-to-ionized hydrogen ratio is 0.6, suggesting that H I substantially contributes to overall nebular mass.

preprint2022arXiv

FENeRF: Face Editing in Neural Radiance Fields

Previous portrait image generation methods roughly fall into two categories: 2D GANs and 3D-aware GANs. 2D GANs can generate high fidelity portraits but with low view consistency. 3D-aware GAN methods can maintain view consistency but their generated images are not locally editable. To overcome these limitations, we propose FENeRF, a 3D-aware generator that can produce view-consistent and locally-editable portrait images. Our method uses two decoupled latent codes to generate corresponding facial semantics and texture in a spatial aligned 3D volume with shared geometry. Benefiting from such underlying 3D representation, FENeRF can jointly render the boundary-aligned image and semantic mask and use the semantic mask to edit the 3D volume via GAN inversion. We further show such 3D representation can be learned from widely available monocular image and semantic mask pairs. Moreover, we reveal that joint learning semantics and texture helps to generate finer geometry. Our experiments demonstrate that FENeRF outperforms state-of-the-art methods in various face editing tasks.

preprint2022arXiv

Global bifurcation structure and geometric properties for steady periodic water waves with vorticity

This paper studies the classical water wave problem with vorticity described by the Euler equations with a free surface under the influence of gravity over a flat bottom. Based on fundamental work \cite{ConstantinStrauss}, we first obtain two continuous bifurcation curves which meet the laminar flow only one time by using modified analytic bifurcation theorem. They are symmetric waves whose profiles are monotone between each crest and trough. Furthermore, we find that there is at least one inflection point on the wave profile between successive crests and troughs and the free surface is strictly concave at any crest and strictly convex at any trough. In addition, for favorable vorticity, we prove that the vertical displacement of water waves decreases with depth.

preprint2022arXiv

How and what to learn:The modes of machine learning

Despite their great success, neural networks still remain as black-boxes due to the lack of interpretability. Here we propose a new analyzing method, namely the weight pathway analysis (WPA), to make them transparent. We consider weights in pathways that link neurons longitudinally from input neurons to output neurons, or simply weight pathways, as the basic units for understanding a neural network, and decompose a neural network into a series of subnetworks of such weight pathways. A visualization scheme of the subnetworks is presented that gives longitudinal perspectives of the network like radiographs, making the internal structures of the network visible. Impacts of parameter adjustments or structural changes to the network can be visualized via such radiographs. Characteristic maps are established for subnetworks to characterize the enhancement or suppression of the influence of input samples on each output neuron. Using WPA, we discover that neural network store and utilize information in a holographic way, that is, subnetworks encode all training samples in a coherent structure and thus only by investigating the weight pathways can one explore samples stored in the network. Furthermore, with WPA, we reveal fundamental learning modes of a neural network: the linear learning mode and the nonlinear learning mode. The former extracts linearly separable features while the latter extracts linearly inseparable features. The hidden-layer neurons self-organize into different classes for establishing learning modes and for reaching the training goal. The finding of learning modes provides us the theoretical ground for understanding some of the fundamental problems of machine learning, such as the dynamics of learning process, the role of linear and nonlinear neurons, as well as the role of network width and depth.

preprint2022arXiv

Instability dynamics of nonlinear normal modes in the Fermi-Pasta-Ulam-Tsingou chains

Nonlinear normal modes are periodic orbits that survive in nonlinear chains, whose instability plays a crucial role in the dynamics of many-body Hamiltonian systems toward thermalization. Here we focus on how the stability of nonlinear modes depends on the perturbation strength and the system size to observe whether they have the same behavior in different models. To this end, as illustrating examples, the instability dynamics of the ${N}/{2}$ mode in both the Fermi-Pasta-Ulam-Tsingou (FPUT) -$α$ and -$β$ chains under fixed boundary conditions are studied systematically. Applying the Floquet theory, we show that for both models the stability time $T$ as a function of the perturbation strength $λ$ follows the same behavior; i.e., $T\propto(λ-λ_c)^{-\frac{1}{2}}$, where $λ_c$ is the instability threshold. The dependence of $λ_c$ on $N$ is also obtained. The results of $T$ and $λ_c$ agree well with those obtained by the direct molecular dynamics simulations. Finally, the effect of instability dynamics on the thermalization properties of a system is briefly discussed.

preprint2022arXiv

Jointly Learning Knowledge Embedding and Neighborhood Consensus with Relational Knowledge Distillation for Entity Alignment

Entity alignment aims at integrating heterogeneous knowledge from different knowledge graphs. Recent studies employ embedding-based methods by first learning the representation of Knowledge Graphs and then performing entity alignment via measuring the similarity between entity embeddings. However, they failed to make good use of the relation semantic information due to the trade-off problem caused by the different objectives of learning knowledge embedding and neighborhood consensus. To address this problem, we propose Relational Knowledge Distillation for Entity Alignment (RKDEA), a Graph Convolutional Network (GCN) based model equipped with knowledge distillation for entity alignment. We adopt GCN-based models to learn the representation of entities by considering the graph structure and incorporating the relation semantic information into GCN via knowledge distillation. Then, we introduce a novel adaptive mechanism to transfer relational knowledge so as to jointly learn entity embedding and neighborhood consensus. Experimental results on several benchmarking datasets demonstrate the effectiveness of our proposed model.

preprint2022arXiv

Knowledge-Injected Federated Learning

Federated learning is an emerging technique for training models from decentralized data sets. In many applications, data owners participating in the federated learning system hold not only the data but also a set of domain knowledge. Such knowledge includes human know-how and craftsmanship that can be extremely helpful to the federated learning task. In this work, we propose a federated learning framework that allows the injection of participants&#39; domain knowledge, where the key idea is to refine the global model with knowledge locally. The scenario we consider is motivated by a real industry-level application, and we demonstrate the effectiveness of our approach to this application.

preprint2022arXiv

LAS-AT: Adversarial Training with Learnable Attack Strategy

Adversarial training (AT) is always formulated as a minimax problem, of which the performance depends on the inner optimization that involves the generation of adversarial examples (AEs). Most previous methods adopt Projected Gradient Decent (PGD) with manually specifying attack parameters for AE generation. A combination of the attack parameters can be referred to as an attack strategy. Several works have revealed that using a fixed attack strategy to generate AEs during the whole training phase limits the model robustness and propose to exploit different attack strategies at different training stages to improve robustness. But those multi-stage hand-crafted attack strategies need much domain expertise, and the robustness improvement is limited. In this paper, we propose a novel framework for adversarial training by introducing the concept of &#34;learnable attack strategy&#34;, dubbed LAS-AT, which learns to automatically produce attack strategies to improve the model robustness. Our framework is composed of a target network that uses AEs for training to improve robustness and a strategy network that produces attack strategies to control the AE generation. Experimental evaluations on three benchmark databases demonstrate the superiority of the proposed method. The code is released at https://github.com/jiaxiaojunQAQ/LAS-AT.

preprint2022arXiv

Membership Privacy Protection for Image Translation Models via Adversarial Knowledge Distillation

Image-to-image translation models are shown to be vulnerable to the Membership Inference Attack (MIA), in which the adversary&#39;s goal is to identify whether a sample is used to train the model or not. With daily increasing applications based on image-to-image translation models, it is crucial to protect the privacy of these models against MIAs. We propose adversarial knowledge distillation (AKD) as a defense method against MIAs for image-to-image translation models. The proposed method protects the privacy of the training samples by improving the generalizability of the model. We conduct experiments on the image-to-image translation models and show that AKD achieves the state-of-the-art utility-privacy tradeoff by reducing the attack performance up to 38.9% compared with the regular training model at the cost of a slight drop in the quality of the generated output images. The experimental results also indicate that the models trained by AKD generalize better than the regular training models. Furthermore, compared with existing defense methods, the results show that at the same privacy protection level, image translation models trained by AKD generate outputs with higher quality; while at the same quality of outputs, AKD enhances the privacy protection over 30%.

preprint2022arXiv

Molecules in the carbon-rich protoplanetary nebula CRL 2688

We present observations of the carbon-rich protoplanetary nebula (PPN) CRL 2688 made with the Institut de Radioastronomie Millimetrique (IRAM) 30 m telescope in the 3mm and 2mm bands. In total, 196 transition lines belonging to 38 molecular species and isotopologues are detected, among which, to our best knowledge, 153 transition lines and 13 species are the first report for this object. Additionally, in order to contribute to future research, we have collected observational data on the molecular lines of CRL 2688 from the literature and compiled them into a single unified catalog. We find that the molecular abundance of CRL 2688 cannot be explained by the standard model of a circumstellar envelope. The implications of metal-bearing molecules on circumstellar chemistry are discussed.

preprint2022arXiv

Multi-Camera View Based Proactive BS Selection and Beam Switching for V2X

Due to the short wavelength and large attenuation of millimeter-wave (mmWave), mmWave BSs are densely distributed and require beamforming with high directivity. When the user moves out of the coverage of the current BS or is severely blocked, the mmWave BS must be switched to ensure the communication quality. In this paper, we proposed a multi-camera view based proactive BS selection and beam switching that can predict the optimal BS of the user in the future frame and switch the corresponding beam pair. Specifically, we extract the features of multi-camera view images and a small part of channel state information (CSI) in historical frames, and dynamically adjust the weight of each modality feature. Then we design a multi-task learning module to guide the network to better understand the main task, thereby enhancing the accuracy and the robustness of BS selection and beam switching. Using the outputs of all tasks, a prior knowledge based fine tuning network is designed to further increase the BS switching accuracy. After the optimal BS is obtained, a beam pair switching network is proposed to directly predict the optimal beam pair of the corresponding BS. Simulation results in an outdoor intersection environment show the superior performance of our proposed solution under several metrics such as predicting accuracy, achievable rate, harmonic mean of precision and recall.

preprint2022arXiv

One-loop diagrams with quadratic propagators from the worldsheet

It is well known that forward limits of tree-level amplitudes (and those trivalent diagrams they consist of) produce one-loop amplitudes and trivalent diagrams with propagators linear in the loop momentum. They naturally arise from one-loop worldsheet formulae, and an important open problem is how to recombine them into usual one-loop diagrams with quadratic propagators. In this paper, we study a new collection of worldsheet functions: generalized one-loop Parke-Taylor factors with tensor numerators, which are conjectured to serve as a basis for one-loop worldsheet functions with this nice property. We present all-multiplicity, closed-form expressions for combinations of one-loop trivalent diagrams with quadratic propagators and tensor numerators to arbitrary rank (including possible tadpole contributions), produced by any pair of Parke-Taylor factors. We also briefly comment on reducing worldsheet functions onto such a basis, and applications to one-loop amplitudes in physical theories.

preprint2022arXiv

PackCache: An Online Cost-driven Data Caching Algorithm in the Cloud

In this paper, we study a data caching problem in the cloud environment, where multiple frequently co-utilised data items could be packed as a single item being transferred to serve a sequence of data requests dynamically with reduced cost. To this end, we propose an online algorithm with respect to a homogeneous cost model, called PackCache, that can leverage the FP-Tree technique to mine those frequently co-utilised data items for packing whereby the incoming requests could be cost-effectively served online by exploiting the concept of anticipatory caching. We show the algorithm is 2αcompetitive, reaching the lower bound of the competitive ratio for any deterministic online algorithm on the studied caching problem, and also time and space efficient to serve the requests. Finally, we evaluate the performance of the algorithm via experimental studies to show its actual cost-effectiveness and scalability.

preprint2022arXiv

PDRs4All: A JWST Early Release Science Program on radiative feedback from massive stars

Massive stars disrupt their natal molecular cloud material through radiative and mechanical feedback processes. These processes have profound effects on the evolution of interstellar matter in our Galaxy and throughout the Universe, from the era of vigorous star formation at redshifts of 1-3 to the present day. The dominant feedback processes can be probed by observations of the Photo-Dissociation Regions (PDRs) where the far-ultraviolet photons of massive stars create warm regions of gas and dust in the neutral atomic and molecular gas. PDR emission provides a unique tool to study in detail the physical and chemical processes that are relevant for most of the mass in inter- and circumstellar media including diffuse clouds, proto-planetary disks and molecular cloud surfaces, globules, planetary nebulae, and star-forming regions. PDR emission dominates the infrared (IR) spectra of star-forming galaxies. Most of the Galactic and extragalactic observations obtained with the James Webb Space Telescope (JWST) will therefore arise in PDR emission. In this paper we present an Early Release Science program using the MIRI, NIRSpec, and NIRCam instruments dedicated to the observations of an emblematic and nearby PDR: the Orion Bar. These early JWST observations will provide template datasets designed to identify key PDR characteristics in JWST observations. These data will serve to benchmark PDR models and extend them into the JWST era. We also present the Science-Enabling products that we will provide to the community. These template datasets and Science-Enabling products will guide the preparation of future proposals on star-forming regions in our Galaxy and beyond and will facilitate data analysis and interpretation of forthcoming JWST observations.

preprint2022arXiv

Performance evaluations on the parallel CHAracteristic-Spectral-Mixed (CHASM) scheme

Performance evaluations on the deterministic algorithms for 6-D problems are rarely found in literatures except some recent advances in the Vlasov and Boltzmann community [Dimarco et al. (2018), Kormann et al. (2019)], due to the extremely high complexity. Thus a detailed comparison among various techniques shall be useful to the researchers in the related fields. We try to make a thorough evaluation on a parallel CHAracteristic-Spectral-Mixed (CHASM) scheme to support its usage. CHASM utilizes the cubic B-spline expansion in the spatial space and spectral expansion in the momentum space, which many potentially overcome the computational burden in solving classical and quantum kinetic equations in 6-D phase space. Our purpose is three-pronged. First, we would like show that by imposing some effective Hermite boundary conditions, the local cubic spline can approximate to the global one as accurately as possible. Second, we will illustrate the necessity of adopting the truncated kernel method in calculating the pseudodifferential operator with a singular symbol, since the widely used pseudo-spectral method [Ringhofer (1990)] might fail to properly tackle the singularity. Finally, we make a comparison among non-splitting Lawson schemes and Strang operator splitting. Our numerical results demonstrate the advantage of the one-stage Lawson predictor-corrector scheme over multi-stage ones as well as the splitting scheme in both accuracy and stability.

preprint2022arXiv

Prior-Guided Adversarial Initialization for Fast Adversarial Training

Fast adversarial training (FAT) effectively improves the efficiency of standard adversarial training (SAT). However, initial FAT encounters catastrophic overfitting, i.e.,the robust accuracy against adversarial attacks suddenly and dramatically decreases. Though several FAT variants spare no effort to prevent overfitting, they sacrifice much calculation cost. In this paper, we explore the difference between the training processes of SAT and FAT and observe that the attack success rate of adversarial examples (AEs) of FAT gets worse gradually in the late training stage, resulting in overfitting. The AEs are generated by the fast gradient sign method (FGSM) with a zero or random initialization. Based on the observation, we propose a prior-guided FGSM initialization method to avoid overfitting after investigating several initialization strategies, improving the quality of the AEs during the whole training process. The initialization is formed by leveraging historically generated AEs without additional calculation cost. We further provide a theoretical analysis for the proposed initialization method. We also propose a simple yet effective regularizer based on the prior-guided initialization,i.e., the currently generated perturbation should not deviate too much from the prior-guided initialization. The regularizer adopts both historical and current adversarial perturbations to guide the model learning. Evaluations on four datasets demonstrate that the proposed method can prevent catastrophic overfitting and outperform state-of-the-art FAT methods. The code is released at https://github.com/jiaxiaojunQAQ/FGSM-PGI.

preprint2022arXiv

Revealing Unfair Models by Mining Interpretable Evidence

The popularity of machine learning has increased the risk of unfair models getting deployed in high-stake applications, such as justice system, drug/vaccination design, and medical diagnosis. Although there are effective methods to train fair models from scratch, how to automatically reveal and explain the unfairness of a trained model remains a challenging task. Revealing unfairness of machine learning models in interpretable fashion is a critical step towards fair and trustworthy AI. In this paper, we systematically tackle the novel task of revealing unfair models by mining interpretable evidence (RUMIE). The key idea is to find solid evidence in the form of a group of data instances discriminated most by the model. To make the evidence interpretable, we also find a set of human-understandable key attributes and decision rules that characterize the discriminated data instances and distinguish them from the other non-discriminated data. As demonstrated by extensive experiments on many real-world data sets, our method finds highly interpretable and solid evidence to effectively reveal the unfairness of trained models. Moreover, it is much more scalable than all of the baseline methods.

preprint2022arXiv

Robust Counterfactual Explanations on Graph Neural Networks

Massive deployment of Graph Neural Networks (GNNs) in high-stake applications generates a strong demand for explanations that are robust to noise and align well with human intuition. Most existing methods generate explanations by identifying a subgraph of an input graph that has a strong correlation with the prediction. These explanations are not robust to noise because independently optimizing the correlation for a single input can easily overfit noise. Moreover, they do not align well with human intuition because removing an identified subgraph from an input graph does not necessarily change the prediction result. In this paper, we propose a novel method to generate robust counterfactual explanations on GNNs by explicitly modelling the common decision logic of GNNs on similar input graphs. Our explanations are naturally robust to noise because they are produced from the common decision boundaries of a GNN that govern the predictions of many similar input graphs. The explanations also align well with human intuition because removing the set of edges identified by an explanation from the input graph changes the prediction significantly. Exhaustive experiments on many public datasets demonstrate the superior performance of our method.

preprint2022arXiv

Room Temperature Gate Tunable Non Reciprocal Charge Transport in Lattice Matched InSb/CdTe Heterostructures

The manipulation of symmetry provides an effective way to tailor the physical orders in solid-state systems. With the breaking of both the inversion and time-reversal symmetries, non-reciprocal magneto-transport may emerge in assorted non-magnetic systems to enrich spintronic physics. Here, we report the observation of the uni-directional magneto-resistance (UMR) in the lattice-matched InSb/CdTe film up to room temperature. Benefiting from the strong built-in electric field of $0.13 \mathrm{~V} \cdot \mathrm{nm}^{-1}$ in the hetero-junction region, the resulting Rashba-type spin-orbit coupling and quantum confinement warrant stable angular-dependent second-order charge current with the non-reciprocal coefficient 1-2 orders of magnitude larger than most non-centrosymmetric materials at 298 K. More importantly, this heterostructure configuration enables highly-efficient gate tuning of the rectification response in which the enhancement of the UMR amplitude by 40% is realized. Our results advocate the narrow-gap semiconductor-based hybrid system with the robust two-dimensional interfacial spin texture as a suitable platform for the pursuit of controllable chiral spin-orbit devices and applications.

preprint2022arXiv

Self-Attention for Incomplete Utterance Rewriting

Incomplete utterance rewriting (IUR) has recently become an essential task in NLP, aiming to complement the incomplete utterance with sufficient context information for comprehension. In this paper, we propose a novel method by directly extracting the coreference and omission relationship from the self-attention weight matrix of the transformer instead of word embeddings and edit the original text accordingly to generate the complete utterance. Benefiting from the rich information in the self-attention weight matrix, our method achieved competitive results on public IUR datasets.

preprint2022arXiv

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection

Recent studies in deepfake detection have yielded promising results when the training and testing face forgeries are from the same dataset. However, the problem remains challenging when one tries to generalize the detector to forgeries created by unseen methods in the training dataset. This work addresses the generalizable deepfake detection from a simple principle: a generalizable representation should be sensitive to diverse types of forgeries. Following this principle, we propose to enrich the &#34;diversity&#34; of forgeries by synthesizing augmented forgeries with a pool of forgery configurations and strengthen the &#34;sensitivity&#34; to the forgeries by enforcing the model to predict the forgery configurations. To effectively explore the large forgery augmentation space, we further propose to use the adversarial training strategy to dynamically synthesize the most challenging forgeries to the current model. Through extensive experiments, we show that the proposed strategies are surprisingly effective (see Figure 1), and they could achieve superior performance than the current state-of-the-art methods. Code is available at \url{https://github.com/liangchen527/SLADD}.

preprint2022arXiv

Spatial Cross-Attention Improves Self-Supervised Visual Representation Learning

Unsupervised representation learning methods like SwAV are proved to be effective in learning visual semantics of a target dataset. The main idea behind these methods is that different views of a same image represent the same semantics. In this paper, we further introduce an add-on module to facilitate the injection of the knowledge accounting for spatial cross correlations among the samples. This in turn results in distilling intra-class information including feature level locations and cross similarities between same-class instances. The proposed add-on can be added to existing methods such as the SwAV. We can later remove the add-on module for inference without any modification of the learned weights. Through an extensive set of empirical evaluations, we verify that our method yields an improved performance in detecting the class activation maps, top-1 classification accuracy, and down-stream tasks such as object detection, with different configuration settings.

preprint2022arXiv

Spatially decomposed $γ$-ray features surrounding SNR Kes 79 & PSR J1853+0056

There have been substantial improvements on Fermi Large Area Telescope (LAT) data and analysis tools since the last analysis on the mid-aged supernova remnant (SNR) Kes 79 (Auchettl et al. 2014). Recent multi-wavelength studies confirmed its interaction with molecular clouds. About $0.36\degr$ north from Kes 79, a powerful pulsar -- PSR J1853+0056 also deserves our attention. In this work, we analyse the 11.5-year Fermi-LAT data to investigate the $γ$-ray feature in/around this complex region. Our result shows a more significant detection ($\sim$34.8$σ$ in 0.1--50 GeV) for this region. With $\ge$5 GeV data, we detect two extended sources -- Src-N (the brighter one; radius $\approx0.31\degr$) concentrated at the north of the SNR while enclosing PSR J1853+0056, and Src-S (radius $\approx0.58\degr$) concentrated at the south of the SNR. Their spectra have distinct peak energies ($\sim$1.0 GeV for Src-N and $\lesssim$0.5 GeV for Src-S), suggesting different origins for them. In our hadronic model that includes the leaked cosmic-rays (CRs) from the shock-cloud collision, even with extreme values of parameters, SNR Kes 79 can by no means provide enough CRs reaching clouds at Src-N to explain the local GeV spectrum. We propose that the Src-N emission could be predominantly reproduced by a putative pulsar wind nebula (PWN) powered by PSR J1853+0056. On the other hand, our same hadronic model can reproduce a majority of the GeV emission at Src-S with typical values of parameters, while the three known pulsars inside Src-S release a total power that is too low to account for half of its $γ$-ray emission.

preprint2022arXiv

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN

One-shot talking face generation aims at synthesizing a high-quality talking face video from an arbitrary portrait image, driven by a video or an audio segment. One challenging quality factor is the resolution of the output video: higher resolution conveys more details. In this work, we investigate the latent feature space of a pre-trained StyleGAN and discover some excellent spatial transformation properties. Upon the observation, we explore the possibility of using a pre-trained StyleGAN to break through the resolution limit of training datasets. We propose a novel unified framework based on a pre-trained StyleGAN that enables a set of powerful functionalities, i.e., high-resolution video generation, disentangled control by driving video or audio, and flexible face editing. Our framework elevates the resolution of the synthesized talking face to 1024*1024 for the first time, even though the training dataset has a lower resolution. We design a video-based motion generation module and an audio-based one, which can be plugged into the framework either individually or jointly to drive the video generation. The predicted motion is used to transform the latent features of StyleGAN for visual animation. To compensate for the transformation distortion, we propose a calibration network as well as a domain loss to refine the features. Moreover, our framework allows two types of facial editing, i.e., global editing via GAN inversion and intuitive editing based on 3D morphable models. Comprehensive experiments show superior video quality, flexible controllability, and editability over state-of-the-art methods.

preprint2022arXiv

The anti-Fermi-Pasta-Ulam-Tsingou problem in one-dimensional diatomic lattices

We study the thermalization dynamics of one-dimensional diatomic lattices (which represents the simplest system possessing multi-branch phonons), exemplified by the famous Fermi-Pasta-Ulam-Tsingou (FPUT)-$β$ and the Toda models. Here we focus on how the system relaxes to the equilibrium state when part of highest-frequency optical modes are initially excited, which is called the anti-FPUT problem comparing with the original FPUT problem (low frequency excitations of the monatomic lattice). It is shown numerically that the final thermalization time $T_{\rm eq}$ of the diatomic FPUT-$β$ chain depends on whether its acoustic modes are thermalized, whereas the $T_{\rm eq}$ of the diatomic Toda chain depends on the optical ones; in addition, the metastable state of both models have different energy distributions and lifetimes. Despite these differences, in the near-integrable region, the $T_{\rm eq}$ of both models still follows the same scaling law, i.e., $T_{\rm eq}$ is inversely proportional to the square of the perturbation strength. Finally, comparisons of the thermalization behavior between different models under various initial conditions are briefly summarized.

preprint2022arXiv

The Properties and Evolutions of Starspots on Three Detached Eclipsing Binaries in the LAMOST-Kepler survey

The spotted detached eclipsing binary (DEB) offers insights into starspots on the binary. Three spotted DEBs, KIC 8097825, KIC 6859813, and KIC 5527172, which were observed by the Kepler photometry and LAMOST spectroscopy, are studied in this work. The physical parameters of binaries are determined by binary modeling. The sizes, lifetimes, and single/double-dip ratio (SDR) of starspots are derived by starspot analysis. KIC 8097825 has large starspots. KIC 6859813 has a spot rotation period shorter than its orbital period but the system should be synchronized inferred from timescale estimation. The difference may be the result of the surface differential rotation. The KIC 5527172 has a long spot lifetime and an M dwarf component with an inflation radius. The primaries of these binaries and the secondary of KIC 8097825 have spots. Adding spotted DEBs of literature, we compare the starspots on binaries with those on the single stars. The spot sizes of starspots on 65% binaries are smaller than the median of those on single stars. The lifetimes of starspots on binaries are consistent with those on single stars when the rotation periods are larger than 3 days. SDRs for half of the binaries are consistent with those of single star systems, while another half are smaller. The relative lifetime positively correlates with the RMS and SDR but negatively correlates with the rotation period. These relations are similar to those of spots on the single star systems. Binaries with luminosity ratios close to the unit tend to have more double dips.

preprint2022arXiv

The symmetry for two class of steady stratified periodic water waves

In this paper, we mainly consider two class of travelling stratified periodic water waves, one with negative (or without) surface tension and the other with constant Bernoulli&#39;s function and stagnation points. We first establish the symmetry result for stratified water waves with negative (or without) surface tension, but without stagnation by using the modified maximum principle. Furthermore, the symmetry property of stratified water waves with constant Bernoulli&#39;s function and stagnation points is also obtained provided the monotonic property is known.

preprint2022arXiv

Towards Real-World Video Deblurring by Exploring Blur Formation Process

This paper aims at exploring how to synthesize close-to-real blurs that existing video deblurring models trained on them can generalize well to real-world blurry videos. In recent years, deep learning-based approaches have achieved promising success on video deblurring task. However, the models trained on existing synthetic datasets still suffer from generalization problems over real-world blurry scenarios with undesired artifacts. The factors accounting for the failure remain unknown. Therefore, we revisit the classical blur synthesis pipeline and figure out the possible reasons, including shooting parameters, blur formation space, and image signal processor~(ISP). To analyze the effects of these potential factors, we first collect an ultra-high frame-rate (940 FPS) RAW video dataset as the data basis to synthesize various kinds of blurs. Then we propose a novel realistic blur synthesis pipeline termed as RAW-Blur by leveraging blur formation cues. Through numerous experiments, we demonstrate that synthesizing blurs in the RAW space and adopting the same ISP as the real-world testing data can effectively eliminate the negative effects of synthetic data. Furthermore, the shooting parameters of the synthesized blurry video, e.g., exposure time and frame-rate play significant roles in improving the performance of deblurring models. Impressively, the models trained on the blurry data synthesized by the proposed RAW-Blur pipeline can obtain more than 5dB PSNR gain against those trained on the existing synthetic blur datasets. We believe the novel realistic synthesis pipeline and the corresponding RAW video dataset can help the community to easily construct customized blur datasets to improve real-world video deblurring performance largely, instead of laboriously collecting real data pairs.

preprint2022arXiv

VDTR: Video Deblurring with Transformer

Video deblurring is still an unsolved problem due to the challenging spatio-temporal modeling process. While existing convolutional neural network-based methods show a limited capacity for effective spatial and temporal modeling for video deblurring. This paper presents VDTR, an effective Transformer-based model that makes the first attempt to adapt Transformer for video deblurring. VDTR exploits the superior long-range and relation modeling capabilities of Transformer for both spatial and temporal modeling. However, it is challenging to design an appropriate Transformer-based model for video deblurring due to the complicated non-uniform blurs, misalignment across multiple frames and the high computational costs for high-resolution spatial modeling. To address these problems, VDTR advocates performing attention within non-overlapping windows and exploiting the hierarchical structure for long-range dependencies modeling. For frame-level spatial modeling, we propose an encoder-decoder Transformer that utilizes multi-scale features for deblurring. For multi-frame temporal modeling, we adapt Transformer to fuse multiple spatial features efficiently. Compared with CNN-based methods, the proposed method achieves highly competitive results on both synthetic and real-world video deblurring benchmarks, including DVD, GOPRO, REDS and BSD. We hope such a Transformer-based architecture can serve as a powerful alternative baseline for video deblurring and other video restoration tasks. The source code will be available at \url{https://github.com/ljzycmd/VDTR}.

preprint2021arXiv

A Comprehensive Review of Computer-aided Whole-slide Image Analysis: from Datasets to Feature Extraction, Segmentation, Classification, and Detection Approaches

With the development of computer-aided diagnosis (CAD) and image scanning technology, Whole-slide Image (WSI) scanners are widely used in the field of pathological diagnosis. Therefore, WSI analysis has become the key to modern digital pathology. Since 2004, WSI has been used more and more in CAD. Since machine vision methods are usually based on semi-automatic or fully automatic computers, they are highly efficient and labor-saving. The combination of WSI and CAD technologies for segmentation, classification, and detection helps histopathologists obtain more stable and quantitative analysis results, save labor costs and improve diagnosis objectivity. This paper reviews the methods of WSI analysis based on machine learning. Firstly, the development status of WSI and CAD methods are introduced. Secondly, we discuss publicly available WSI datasets and evaluation metrics for segmentation, classification, and detection tasks. Then, the latest development of machine learning in WSI segmentation, classification, and detection are reviewed continuously. Finally, the existing methods are studied, the applicabilities of the analysis methods are analyzed, and the application prospects of the analysis methods in this field are forecasted.

preprint2021arXiv

CaEGCN: Cross-Attention Fusion based Enhanced Graph Convolutional Network for Clustering

With the powerful learning ability of deep convolutional networks, deep clustering methods can extract the most discriminative information from individual data and produce more satisfactory clustering results. However, existing deep clustering methods usually ignore the relationship between the data. Fortunately, the graph convolutional network can handle such relationship, opening up a new research direction for deep clustering. In this paper, we propose a cross-attention based deep clustering framework, named Cross-Attention Fusion based Enhanced Graph Convolutional Network (CaEGCN), which contains four main modules: the cross-attention fusion module which innovatively concatenates the Content Auto-encoder module (CAE) relating to the individual data and Graph Convolutional Auto-encoder module (GAE) relating to the relationship between the data in a layer-by-layer manner, and the self-supervised model that highlights the discriminative information for clustering tasks. While the cross-attention fusion module fuses two kinds of heterogeneous representation, the CAE module supplements the content information for the GAE module, which avoids the over-smoothing problem of GCN. In the GAE module, two novel loss functions are proposed that reconstruct the content and relationship between the data, respectively. Finally, the self-supervised module constrains the distributions of the middle layer representations of CAE and GAE to be consistent. Experimental results on different types of datasets prove the superiority and robustness of the proposed CaEGCN.

preprint2021arXiv

Effect of pressure on thermalization of one-dimensional nonlinear chains

Pressure plays a vital role in changing the transport properties of matter. To understand this phenomenon at a microscopic level, we here focus on a more fundamental problem, i.e., how pressure affects the thermalization properties of solids. As illustrating examples, we study the thermalization behavior of the monatomic chain and the mass-disordered chain of Fermi-Pasta-Ulam-Tsingou-$β$ under different strains in the thermodynamic limit. It is found that the pressure-induced change in nonintegrability results in qualitatively different thermalization processes for the two kinds of chains. However, for both cases, the thermalization time follows the same law -- it is inversely proportional to the square of the nonintegrability strength. This result suggests that pressure can significantly change the integrability of a system, which provides a new perspective for understanding the pressure-dependent thermal transport behavior.

preprint2021arXiv

Large Magnetoresistance and Weak Antilocalization in V1-delta Sb2 Single Crystal

The binary pnictide semimetals have attracted considerable attention due to their fantastic physical properties that include topological effects, negative magnetoresistance, Weyl fermions and large non-saturation magnetoresistance. In this paper, we have successfully grown the high-quality V1-deltaSb2 single crystals by Sb flux method and investigated their electronic transport properties. A large positive magnetoresistance that reaches 477% under a magnetic field of 12 T at T = 1.8 K was observed. Notably, the magnetoresistance showed a cusp-like feature at the low magnetic fields and such feature weakened gradually as the temperature increased, which indicated the presence of weak antilocalization effect (WAL). The angle-dependent magnetoconductance and the ultra-large prefactor alpha extracted from the Hikami-Larkin-Nagaoka equation revealed that the WAL effect is a 3D bulk effect originated from the three-dimensional bulk spin-orbital coupling.

preprint2021arXiv

Overlap-Minimization Scheduling Strategy for Data Transmission in VANET

The vehicular ad-hoc network (VANET) based on dedicated short-range communication (DSRC) is a distributed communication system, in which all the nodes share the wireless channel with carrier sense multiple access/collision avoid (CSMA/CA) protocol. However, the competition and backoff mechanisms of CSMA/CA often bring additional delays and data packet collisions, which may hardly meet the QoS requirements in terms of delay and packets delivery ratio (PDR). Moreover, because of the distribution nature of security information in broadcast mode, the sender cannot know whether the receivers have received the information successfully. Similarly, this problem also exists in no-acknowledge (non-ACK) transmissions of VANET. Therefore, the probability of packet collisions should be considered in broadcast or non-ACK working modes. This paper presents a connection-level scheduling algorithm overlaid on CSMA/CA to schedule the start sending time of each transmission. By converting the object of reducing collision probability to minimizing the overlap of transmission durations of connections, the probability of backoff-activation can be greatly decreased. Then the delay and the probability of packet collisions can also be decreased. Numerical simulations have been conducted in our unified platform containing SUMO, Veins and Omnet++. The result shows that the proposed algorithm can effectively improve the PDR and reduce the packets collision in VANET.

preprint2021arXiv

QoS-aware Link Scheduling Strategy for Data Transmission in SDVN

The vehicular ad-hoc network (VANET) based on dedicated short-range communication (DSRC) is a distributed communication system, in which all the nodes share the wireless channel with carrier sense multiple access/collision avoid (CSMA/CA) protocol. However, the backoff mechanism of CSMA/CA in the channel contention might cause uncertain transmission delay and impede a certain quality of service (QoS) of applications. Moreover, there still exists a possibility of parlous data-packets collision, especially for broadcast or non-acknowledgement (NACK) transmissions. The original contributions of this paper are summarized as follows: (1) Model the packets collision probability of broadcast or NACK transmission in VANET with the combination theory and investigate the potential influence of miss my packets (MMP) problem. (2) Based on the software define vehicular network (SDVN) framework and QoS requirement, a novel link-level scheduling strategy, which determines the start-sending time for each connection, is proposed to maximize packets delivery ratio (PDR). Alternatively, maximizing PDR has been converted to the overlap minimization among transmission durations. (3) Meanwhile, an innovative transmission scheduling greedy search (TSGS) algorithm is originally proposed to mitigate computational complexity. Extensive simulations have been done in a unified platform Veins combining SUMO and OMNET++. And numerous results show that the proposed algorithm can effectively improve the PDR by at least 15%, enhance the collision-avoidance performance by almost 40%, and reduce the MMP ratio by about 3% compared with the random transmitting, meanwhile meet the QoS requirement.

preprint2021arXiv

Searching for an additional high-energy component in Fermi-LAT GRB afterglows

The VHE component from at least two GRBs, i.e., GRB180720B and GRB190114C, has been detected in the afterglow phase. We systematically analyzed 199 GRBs detected by Fermi-LAT during 2008-2019. If an additional high-energy component exists in the afterglows of Fermi-LAT GRBs, the best-fit spectral model could be a broken power-law (BPL) model with an upturn above a break energy. We compare the afterglow spectra using PL and BPL representations. Out of the 30 GRBs with >10GeV photons that arrived after T90, 25 GRBs are tentatively or significantly detected at 0.1-200 GeV after 2*T90. The spectrum of GRB131231A shows an upturn above a break of 1.6+-0.8~GeV, supporting the BPL model. For GRB131231A, we performed a modeling of its X-ray and gamma-ray spectra, and found that the SSC model can explain the upturn with acceptable parameter values. In the cases of GRBs 190114C, 171210A, 150902A, 130907A, 130427A, and 090902B, the improvement of the BPL fit compared to the PL fit is tentative or marginal. There is no conclusive evidence that an additional higher energy component commonly exists in Fermi-LAT GRB afterglows, except for a group of Fermi-LAT GRBs mentioned above. Such an additional high-energy component may be explained by the synchrotron self-Compton mechanism. Current and future VHE observations will provide important constraints on the issue.

preprint2021arXiv

Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits

To explore the vulnerability of deep neural networks (DNNs), many attack paradigms have been well studied, such as the poisoning-based backdoor attack in the training stage and the adversarial attack in the inference stage. In this paper, we study a novel attack paradigm, which modifies model parameters in the deployment stage for malicious purposes. Specifically, our goal is to misclassify a specific sample into a target class without any sample modification, while not significantly reduce the prediction accuracy of other samples to ensure the stealthiness. To this end, we formulate this problem as a binary integer programming (BIP), since the parameters are stored as binary bits ($i.e.$, 0 and 1) in the memory. By utilizing the latest technique in integer programming, we equivalently reformulate this BIP problem as a continuous optimization problem, which can be effectively and efficiently solved using the alternating direction method of multipliers (ADMM) method. Consequently, the flipped critical bits can be easily determined through optimization, rather than using a heuristic strategy. Extensive experiments demonstrate the superiority of our method in attacking DNNs.

preprint2021arXiv

The Stellar &#34;Snake&#34; I: Whole Structure and Properties

To complement our previous discovery of the young snake-like structure in the solar neighborhood and reveal the structure&#39;s full extent, we build two samples of stars within the Snake and its surrounding territory from {\tt Gaia EDR3}. With the friends-of-friends algorithm, we identify 2694 and 9615 Snake member candidates from the two samples. Thirteen open clusters are embedded in these member candidates. By combining the spectroscopic data from multiple surveys, we investigate the comprehensive properties of the candidates and find that they \thj{are very likely to} belong to one sizable structure, since most of the components are well bridged in their spatial distributions, and follow a single stellar population with an age of $30-40$\,Myr and solar metallicity. This sizable structure is best explained as hierarchically primordial, and probably formed from a filamentary giant molecular cloud with unique formation history in localized regions. To analyze the dynamics of the Snake, we divide the structure into five groups according to their tangential velocities; we find that the groups are expanding at a coherent rate ($κ_X\sim3.0\,\times10^{-2}\,\rm km\,s^{-1}\,pc^{-1}$) along the length of the structure ($X$-direction). \thj{The corresponding expansion age ($τ\sim33$\,Myr) is highly consistent with the age of the Snake}. With over ten thousand member stars, the Snake is an ideal laboratory to study nearby coeval stellar formation, stellar physics, and environmental evolution over a large spatial extent.

preprint2020arXiv

A unified structure preserving scheme for a multi-species model with a gradient flow structure and nonlocal interactions via singular kernels

In this paper, we consider a nonlinear and nonlocal parabolic model for multi-species ionic fluids and introduce a semi-implicit finite volume scheme, which is second order accurate in space, first order in time and satisfies the following properties: positivity preserving, mass conservation and energy dissipation. Besides, our scheme involves a fast algorithm on the convolution terms with singular but integrable kernels, which otherwise impedes the accuracy and efficiency of the whole scheme. Error estimates on the fast convolution algorithm are shown next. Numerous numerical tests are provided to demonstrate the properties, such as unconditional stability, order of convergence, energy dissipation and the complexity of the fast convolution algorithm. Furthermore, extensive numerical experiments are carried out to explore the modeling effects in specific examples, such as, the steric repulsion, the concentration of ions at the boundary and the blowup phenomenon of the Keller-Segel equations.

preprint2020arXiv

Are fulleranes responsible for the 21 micron feature?

Recent detections of C$_{60}$, C$_{70}$, and C$_{60}^+$ in space induced extensive studies of fullerene derivatives in circumstellar environments. As the promising fullerene sources, protoplanetary nebulae (PPNe) show a number of unidentified bands in their infrared spectra, among which a small sample exhibits an enigmatic feature at $\sim21$\,$μ$m. Hydrogenation of fullerenes can produce fulleranes emitting new infrared bands. In this paper, we investigate the possibility of fulleranes (C$_{60}$H$_m$) as the carrier of the 21\,$μ$m feature in terms of theoretical vibrational spectra of fulleranes. The evidences favoring and disfavoring the fullerane hypothesis are presented. We made an initial guess for the hydrogen coverage of C$_{60}$H$_m$ that may contribute to the 21\,$μ$m feature.

preprint2020arXiv

ATOMS: ALMA Three-millimeter Observations of Massive Star-forming regions -- I. Survey description and a first look at G9.62+0.19

The &#34;ATOMS,&#34; standing for {\it ALMA Three-millimeter Observations of Massive Star-forming regions}, survey has observed 146 active star forming regions with ALMA Band 3, aiming to systematically investigate the spatial distribution of various dense gas tracers in a large sample of Galactic massive clumps, to study the roles of stellar feedback in star formation, and to characterize filamentary structures inside massive clumps. In this work, the observations, data analysis, and example science of the &#34;ATOMS&#34; survey are presented, using a case study for the G9.62+0.19 complex. Toward this source, some transitions, commonly assumed to trace dense gas, including CS $J = 2-1$, HCO$^+$ $J = 1-0$ and HCN $J = 1-0$, are found to show extended gas emission in low density regions within the clump; less than 25\% of their emission is from dense cores. SO, CH$_3$OH, H$^{13}$CN and HC$_3$N show similar morphologies in their spatial distributions and reveal well the dense cores. Widespread narrow SiO emission is present (over $\sim$1 pc), which may be caused by slow shocks from large--scale colliding flows or H{\sc ii} regions. Stellar feedback from an expanding H{\sc ii} region has greatly reshaped the natal clump, significantly changed the spatial distribution of gas, and may also account for the sequential high-mass star formation in the G9.62+0.19 complex. The ATOMS survey data can be jointly analyzed with other survey data, e.g., &#34;MALT90&#34;, &#34;Orion B&#34;, &#34;EMPIRE&#34;, &#34;ALMA\_IMF&#34;, and &#34;ALMAGAL&#34;, to deepen our understandings of &#34;dense gas&#34; star formation scaling relations and massive proto-cluster formation.

preprint2020arXiv

ATOMS: ALMA Three-millimeter Observations of Massive Star-forming regions -- II. Compact objects in ACA observations and star formation scaling relations

We report studies of the relationships between the total bolometric luminosity ($L_{\rm bol}$ or $L_{\rm TIR}$) and the molecular line luminosities of $J=1-0$ transitions of H$^{13}$CN, H$^{13}$CO$^+$, HCN, and HCO$^+$ with data obtained from ACA observations in the &#34;ATOMS&#34; survey of 146 active Galactic star forming regions. The correlations between $L_{\rm bol}$ and molecular line luminosities $L&#39;_{\rm mol}$ of the four transitions all appear to be approximately linear. Line emission of isotopologues shows as large scatters in $L_{\rm bol}$-$L&#39;_{\rm mol}$ relations as their main line emission. The log($L_{\rm bol}$/$L&#39;_{\rm mol}$) for different molecular line tracers have similar distributions. The $L_{\rm bol}$-to-$L&#39;_{\rm mol}$ ratios do not change with galactocentric distances ($R_{\rm GC}$) and clump masses ($M_{\rm clump}$). The molecular line luminosity ratios (HCN-to-HCO$^+$, H$^{13}$CN-to-H$^{13}$CO$^+$, HCN-to-H$^{13}$CN and HCO$^+$-to-H$^{13}$CO$^+$) all appear constant against $L_{\rm bol}$, dust temperature ($T_{\rm d}$), $M_{\rm clump}$ and $R_{\rm GC}$. Our studies suggest that both the main lines and isotopologue lines are good tracers of the total masses of dense gas in Galactic molecular clumps. The large optical depths of main lines do not affect the interpretation of the slopes in star formation relations. We find that the mean star formation efficiency (SFE) of massive Galactic clumps in the &#34;ATOMS&#34; survey is reasonably consistent with other measures of the SFE for dense gas, even those using very different tracers or examining very different spatial scales.

preprint2020arXiv

Can the Kappa-distributed electron energies account for the intensity ratios of O II lines in photoionized gaseous nebulae?

A vexing puzzle in the study of planetary nebulae and \ion{H}{2} regions is that the plasma diagnostic results based on collisionally excited lines systematically differ from those based on recombination lines. A fairly speculative interpretation is the presence of nonthermal electrons with the so-called $κ$ energy distributions, yet there is little observational evidence to verify or disprove this hypothesis. In this paper, we examine the influence of $κ$-distributed electrons on the emissivities of \ion{O}{2} recombination lines using an approximate method, where the rate coefficients for a $κ$ distribution are computed by summing Maxwellian-Boltzmann rate coefficients with appropriate weights. The results show that if invoking $κ$-distributed electrons, the temperatures derived from the [\ion{O}{3}] $(\lambda4959+\lambda5007)/\lambda4363$ ratios could coincide with those estimated from the \ion{O}{2} $\lambda4649/\lambda4089$ ratios. However, the estimated temperatures and $κ$ values are not in agreement with those obtained through comparing the [\ion{O}{3}] $(\lambda4959+\lambda5007)/\lambda4363$ ratios and the hydrogen recombination spectra, suggesting that the electron energy is unlikely to follow the $κ$-distributions over a global scale of the nebular regions. Nevertheless, based on this observation alone, we cannot definitely rule out the presence of $κ$-distributed electrons in some microstructures within nebulae.

preprint2020arXiv

Compositions of pseudo-symmetric integrators with complex coefficients for the numerical integration of differential equations

In this paper, we are concerned with the construction and analysis of a new class of methods obtained as double jump compositions with complex coefficients and projection on the real axis. It is shown in particular that the new integrators are symmetric and symplectic up to high orders if one uses a symmetric and symplectic basic method. In terms of efficiency, the aforementioned technique requires fewer stages than standard compositions of the same orders and is thus expected to lead to faster methods.

preprint2020arXiv

Controllable Descendant Face Synthesis

Kinship face synthesis is an interesting topic raised to answer questions like &#34;what will your future children look like?&#34;. Published approaches to this topic are limited. Most of the existing methods train models for one-versus-one kin relation, which only consider one parent face and one child face by directly using an auto-encoder without any explicit control over the resemblance of the synthesized face to the parent face. In this paper, we propose a novel method for controllable descendant face synthesis, which models two-versus-one kin relation between two parent faces and one child face. Our model consists of an inheritance module and an attribute enhancement module, where the former is designed for accurate control over the resemblance between the synthesized face and parent faces, and the latter is designed for control over age and gender. As there is no large scale database with father-mother-child kinship annotation, we propose an effective strategy to train the model without using the ground truth descendant faces. No carefully designed image pairs are required for learning except only age and gender labels of training faces. We conduct comprehensive experimental evaluations on three public benchmark databases, which demonstrates encouraging results.

preprint2020arXiv

COVID-19 infection and recovery in various countries: Modeling the dynamics and evaluating the non-pharmaceutical mitigation scenarios

The coronavirus disease 2019 (COVID-19) pandemic radically impacts our lives, while the transmission/infection and recovery dynamics of COVID-19 remain obscure. A time-dependent Susceptible, Exposed, Infectious, and Recovered (SEIR) model was proposed and applied to fit and then predict the time series of COVID-19 evolution observed in the last three months (till 3/22/2020) in various provinces and metropolises in China. The model results revealed the space dependent transmission/infection rate and the significant spatiotemporal variation in the recovery rate, likely due to the continuous improvement of screening techniques and public hospital systems, as well as full city lockdowns in China. The validated SEIR model was then applied to predict COVID-19 evolution in United States, Italy, Japan, and South Korea which have responded differently to monitoring and mitigating COVID-19 so far, although these predictions contain high uncertainty due to the intrinsic change of the maximum infected population and the infection/recovery rates within the different countries. In addition, a stochastic model based on the random walk particle tracking scheme, analogous to a mixing-limited bimolecular reaction model, was developed to evaluate non-pharmaceutical strategies to mitigate COVID-19 spread. Preliminary tests using the stochastic model showed that self-quarantine may not be as efficient as strict social distancing in slowing COVID-19 spread, if not all of the infected people can be promptly diagnosed and quarantined.

preprint2020arXiv

Discovery of Extended Structures around Two Evolved Planetary Nebulae M 2-55 and Abell 2

We report a multi-wavelength study of two evolved planetary nebulae (PNs) M 2-55 and Abell 2. Deep optical narrow-band images ([O III], H?, and [N II]) of M 2-55 reveal two pairs of bipolar lobes and a new faint arc-like structure. This arc-shaped filament around M 2-55 appears a well-defined boundary from southwest to southeast, strongly suggesting that this nebula is in interaction with its surrounding interstellar medium. From the imaging data of Wide-field Infrared Survey Explorer (WISE) all-sky survey, we discovered extensive mid-infrared halos around these PNs, which are approximately twice larger than their main nebulae seen in the visible. We also present a mid-resolution optical spectrum of M 2-55, which shows that it is a high-excitation evolved PN with a low electron density of 250 cm^-3. Furthermore, we investigate the properties of these nebulae from their spectral energy distributions (SEDs) by means of archival data.

preprint2020arXiv

Generations of high efficiency, high purity, and broadband Laguerre-Gaussian modes from a Janus optical parametric oscillator

Laguerre-Gaussian (LG) modes, carrying orbital angular momentum of light, are critical for important applications such as high-capacity optical communications, super-resolution imaging, and multi-dimensional quantum entanglement. Advanced developments in these applications strongly demand reliable and tunable LG mode laser sources, which, however, do not yet exist. Here, we experimentally demonstrate highly-efficient, highly-pure, broadly-tunable, and topological-charge-controllable LG modes from a Janus optical parametric oscillator (OPO). Janus OPO featuring two-face cavity mode is designed to guarantee an efficient evolution from a Gaussian-shaped fundamental pumping mode to a desired LG parametric mode. The output LG mode has a tunable wavelength between 1.5 um and 1.6 um with a conversion efficiency above 15%, a topological charge switchable from -4 to 4, and a mode purity as high as 97%, which provides a high-performance solid-state light source for high-end demands in multi-dimensional multiplexing/demultiplexing, control of spin-orbital coupling between light and atoms, and so on.

preprint2020arXiv

Hydrogenated Fullerenes (Fulleranes) in Space

Since the first laboratory synthesis of C$_{60}$ in 1985, fullerene-related species have been proposed to interpret various astronomical features. After more than 25 years&#39; efforts, several circumstellar and interstellar features have been convincingly assigned to C$_{60}$, C$_{70}$, and C$_{60}^+$. These successes resulted from the recent advancements in observational, experimental, as well as computational techniques, and re-stimulated interest in searching for fullerene derivatives in space. As one of the most important fullerene derivatives, hydrogenated fullerene (fullerane) is likely to exist in circumstellar and interstellar conditions. This review gives an overview of the chemical properties and spectral signals of fulleranes focusing on those relevant to astronomy. We summarize previous proposals of fulleranes as the carrier of astronomical features at UV, optical, infrared, and radio wavelengths, and discuss the arguments favoring or disfavoring the presence of fulleranes in astronomical environments. Although no unambiguous detection of fulleranes in space has yet been reported, there are plausible evidences for supporting the formation of certain fullerane isomers.

preprint2020arXiv

Is an upturn commonly seen in Fermi-LAT GRB afterglow spectra?

We analyzed 199 GRBs detected by Fermi-LAT during the years 2008-2019. We found 67 photons at energies >=10 GeV, which come from 34 GRBs. Out of these 34 GRBs, Fermi-LAT detects significant (TS>=4) afterglow 0.1-200 GeV photons from 25 GRBs. We present time-integrated 0.1-200 GeV spectra of these 25 GRBs. The spectra of a significant fraction (9/25) of these GRBs revealed a harder component above an energy break of 0.3-2 GeV. While shock synchrotron may account for the photons at the lower energy end, high energy photons above the break is naturally explained by synchrotron self-Compton (SSC) emission. We perform broadband model fit to the X-ray-LAT emission of GRB 131231A. Comparing the afterglow spectra of these 25 GRBs with other Fermi-LAT detected GRBs, we found that the power-law index distribution is similar for the two populations. This may indicate that the additional high-energy component may also exist in Fermi-LAT GRBs in general.

preprint2020arXiv

Large Magnetoresistance in Topological Insulator Candidate TaSe3

Large unsaturated magnetoresistance (XMR) with magnitude about 1000% is observed in topological insulator candidate TaSe3 from our high field (up to 38 T) measurements. Two oscillation modes, associated with one hole pocket and two electron pockets in the bulk, respectively, are detected from our Shubnikov-de Hass (SdH) measurements, consistent with our first-principles calculations. With the detailed Hall measurements performed, our two-band model analysis exhibits an imperfect density ratio n_h/n_e closing 0.9 at T< 20 K , which suggests that the carrier compensations account for the XMR in TaSe3.

preprint2020arXiv

Molecular gas in 21 and 30 micron sources: the 2\,mm and 1.3\,mm spectra of IRAS\,21318+5631 and 22272+5435

The carriers of the 21 and 30\,$μ$m emission features in infrared spectra of circumstellar envelopes are a long-standing enigma. In this paper, we present the results of molecular line observations toward two circumstellar envelopes exhibiting the 21 and/or 30\,$μ$m features, IRAS\,21318+5631 and 22272+5435, aiming at investigating whether they have unusual gas-phase chemistry and searching for possible gas-phase precursor of the carriers of the two dust features. The spectra cover several discrete frequency ranges of 130--164\,GHz and 216.5--273\,GHz, resulting in a detection of 13 molecular species and isotopologues in each object. Rotation-diagram analysis is carried out to determine the molecular abundances, column densities, and excitation temperatures. We did not discover any molecular species that is unexpected in a normal C-rich star. Nevertheless, there exists subtle difference between their molecular abundances. IRAS\,22272+5435 shows stronger SiC$_2$ and HC$_3$N lines and weaker SiS lines than IRAS\,21318+5631, presumably suggesting that this 21\,$μ$m source is more carbon rich and has experienced a more efficient dust formation. We discuss the potential implications of the results for the carriers of the 21\,$μ$m and 30\,$μ$m features.

preprint2020arXiv

On Positive Geometry and Scattering Forms for Matter Particles

We initiate the study of positive geometry and scattering forms for tree-level amplitudes with matter particles in the (anti-)fundamental representation of the color/flavor group. As a toy example, we study the bi-color scalar theory, which supplements the bi-adjoint theory with scalars in the (anti-)fundamental representations of both groups. Using a recursive construction we obtain a class of unbounded polytopes called open associahedra (or associahedra with certain facets at infinity) whose canonical form computes amplitudes in bi-color theory, for arbitrary number of legs and flavor assignments. In addition, we discuss the duality between color factors and wedge products, or &#34;color is kinematics&#34;, for amplitudes with matter particles as well.

preprint2020arXiv

Optical image decomposition and noise filtering based on Laguerre-Gaussian modes

We propose and experimentally demonstrate an efficient image decomposition in the Laguerre-Gaussian (LG) domain. By developing an advanced computing method, the sampling points are much fewer than those in the existing methods, which can significantly improve the calculation efficiency. The beam waist, azimuthal and radial truncation orders of the LG modes are optimized depending on the image information to be restored. In the experiment, we decompose an image by using about 30000 LG modes and realize a high-fidelity reconstruction. Furthermore, we show image noise reduction through LG domain filtering. Our results open a door for LG-mode based image processing.

preprint2020arXiv

Radiative thermal switch via asymmetric black phosphorus gratings

Active control of heat transfer at the nanoscale has great potentials in thermal logic and energy conversion devices. In the present work, we theoretically propose a radiative thermal switch (RTS) composed of a pair of asymmetric black phosphorus (BP) gratings, with BP nanoribbons periodically patterned in different directions. The simply mechanical rotation between the gratings enables substantial modulation of near-field radiative heat transfer, especially when combined with the use of non-identical parameters, i.e., filling factors and electron densities of BP. Among all the cases including asymmetric BP gratings, symmetric BP gratings, and BP films, we find that the asymmetric BP gratings possess the most excellent switching performance. The optimized switching factors can be as high as 90% with the vacuum separation d=50 nm and higher than 70% even in the far-field regime. The high-performance switching is basically attributed to the rotatable-tunable matching degree between the surface characteristics of the two asymmetric gratings. Moreover, due to the twisting principle, the RTS can work at any temperature, which has great advantage over the phase change materials-based RTS. The proposed switching scheme has great significance for the applications in thermal management and thermal circuits.

preprint2020arXiv

Singular Solutions in Soft Limits

A generalization of the scattering equations on $X(2,n)$, the configuration space of $n$ points on $\mathbb{CP}^1$, to higher dimensional projective spaces was recently introduced by Early, Guevara, Mizera, and one of the authors. One of the new features in $X(k,n)$ with $k>2$ is the presence of both regular and singular solutions in a soft limit. In this work we study soft limits in $X(3,7)$, $X(4,7)$, $X(3,8)$ and $X(5,8)$, find all singular solutions, and show their geometrical configurations. More explicitly, for $X(3,7)$ and $X(4,7)$ we find $180$ and $120$ singular solutions which when added to the known number of regular solutions both give rise to $1\, 272$ solutions as it is expected since $X(3,7)\sim X(4,7)$. Likewise, for $X(3,8)$ and $X(5,8)$ we find $59\, 640$ and $58\, 800$ singular solutions which when added to the regular solutions both give rise to $188\, 112$ solutions. We also propose a classification of all configurations that can support singular solutions for general $X(k,n)$ and comment on their contribution to soft expansions of generalized biadjoint amplitudes.

preprint2020arXiv

The spectra of evolved stars at 20--25\,GHz: tracing circumstellar chemistry during the asymptotic giant branch to planetary nebula transition

We report an unbiased radio line survey towards the circumstellar envelopes of evolved stars at the frequency range from 20 to 25 GHz, aiming to obtain a more complete unbiased picture of the chemical evolution in the final stages of stellar evolution. The observation sample includes the asymptotic giant branch (AGB) star IRC+10216, the proto-planetary nebulae (PPNs) CRL\,2688 and CRL\,618, and the young planetary nebula (PN) NGC\,7027, representing an evolutionary sequence spanning about 10,000 years. Rotational transitions from cyanopolyyne chains and inversion lines from ammonia are detected in the AGB star and PPNs, while the PN displays several recombination lines. The different spectral behaviors of these evolved stars clearly reflect the evolution of circumstellar chemistry during the AGB-PPN-PN transitions.

preprint2020arXiv

Three-dimensional topological semimetal phase in layered TaNiTe5 probed by de Haas-van Alphen effect

Layered three-dimensional (3D) topological semimetals have attracted intensively attention due to the exotic phenomena and abundantly tunable properties. Here we report the experimental evidence for the 3D topological semimetal phase in layered material TaNiTe5 single crystals through quantum oscillations. Strong quantum oscillations have been observed with diamagnetism background in TaNiTe5. By analyzing the de Haas-van Alphen oscillations, multi-periodic oscillations were extracted, in content with magnetotransport measurements. Moreover, nontrivial &#34;π&#34; Berry phase with 3D Fermi surface is identified, indicating the topologically nontrivial feature in TaNiTe5. Additionally, we demonstrated the thin-layer of TaNiTe5 crystals is highly feasible by the mechanical exfoliation, which offers a platform to explore exotic properties in low dimensional topological semimetal and paves the way for potential applications in nanodevices.

preprint2020arXiv

Too Much Information Kills Information: A Clustering Perspective

Clustering is one of the most fundamental tools in the artificial intelligence area, particularly in the pattern recognition and learning theory. In this paper, we propose a simple, but novel approach for variance-based k-clustering tasks, included in which is the widely known k-means clustering. The proposed approach picks a sampling subset from the given dataset and makes decisions based on the data information in the subset only. With certain assumptions, the resulting clustering is provably good to estimate the optimum of the variance-based objective with high probability. Extensive experiments on synthetic datasets and real-world datasets show that to obtain competitive results compared with k-means method (Llyod 1982) and k-means++ method (Arthur and Vassilvitskii 2007), we only need 7% information of the dataset. If we have up to 15% information of the dataset, then our algorithm outperforms both the k-means method and k-means++ method in at least 80% of the clustering tasks, in terms of the quality of clustering. Also, an extended algorithm based on the same idea guarantees a balanced k-clustering result.

preprint2020arXiv

Universal programmable on-chip metasurface building blocks for arbitrary high-order mode manipulation

On-chip mode-division multiplexing (MDM) has been emerging as a promising technology to further enhance the link capacity and bandwidth of data communications with multiple mode channels. Both mode converters and mode exchangers are indispensable fundamental components for flexible mode operations. While several configurations have been developed previously, it is still very challenging to efficiently manipulate arbitrary high-order modes in a versatile way to reduce the R&D and prototyping costs. Here we initiate a breakthrough with a simple yet universal generic mode operator building block concept utilizing metasurface structures. The programmable arbitrary high-order mode operators can realize mode conversion and exchange simultaneously with competitive and uniform performance, high stability and compact footprints, offering a quintessential step change for on-chip multimode optical interconnections.