Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
14works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

14 published item(s)

preprint2026arXiv

Decoupled interband pairing in a bilayer iron-based superconductor evidenced by ultrahigh-resolution ARPES

We present direct experimental evidence of a weakly coupled multiband superconducting state in the bilayer iron-based superconductor ACa$_2$Fe$_4$As$_4$F$_2$ (A = K, Cs) via ultrahigh-resolution angle-resolved photoemission spectroscopy (ARPES). Remarkably, the K-containing compound exhibits two distinct transition temperatures, corresponding to two separate sets of bilayer-split bands, as evidenced by temperature-dependent superconducting gap and spectral weight near the Fermi energy, while its Cs counterpart displays conventional single transition behavior. These experimental observations are well described by the weakly coupled two-band model of Eilenberger theory, which identifies suppressed interband pairing interactions between the bilayer-split bands as the key mechanism. By exploring quantum phenomena in the weak-coupling limit within a multiband system, our findings pave the way for engineering exotic superconductivity via band-selective pairing control.

preprint2022arXiv

Accelerating Frank-Wolfe Algorithm using Low-Dimensional and Adaptive Data Structures

In this paper, we study the problem of speeding up a type of optimization algorithms called Frank-Wolfe, a conditional gradient method. We develop and employ two novel inner product search data structures, improving the prior fastest algorithm in [Shrivastava, Song and Xu, NeurIPS 2021]. * The first data structure uses low-dimensional random projection to reduce the problem to a lower dimension, then uses efficient inner product data structure. It has preprocessing time $\tilde O(nd^{ω-1}+dn^{1+o(1)})$ and per iteration cost $\tilde O(d+n^ρ)$ for small constant $ρ$. * The second data structure leverages the recent development in adaptive inner product search data structure that can output estimations to all inner products. It has preprocessing time $\tilde O(nd)$ and per iteration cost $\tilde O(d+n)$. The first algorithm improves the state-of-the-art (with preprocessing time $\tilde O(d^2n^{1+o(1)})$ and per iteration cost $\tilde O(dn^ρ)$) in all cases, while the second one provides an even faster preprocessing time and is suitable when the number of iterations is small.

preprint2022arXiv

Anomalous contribution to the nematic electronic states from the structural transition in FeSe revealed by time- and angle-resolved photoemission spectroscopy

High-resolution time- and angle-resolved photoemission measurements were made on FeSe superconductors. With ultrafast photoexcitation, two critical excitation fluences that correspond to two ultrafast electronic phase transitions were found only in the $d_{yz}$-orbit-derived band near the Brillouin-zone center within our time and energy resolution. Upon comparison to the detailed temperature dependent measurements, we conclude that there are two equilibrium electronic phase transitions (at approximately 90 and 120 K) above the superconducting transition temperature, and an anomalous contribution on the scale of 10 meV to the nematic states from the structural transition is experimentally determined. Our observations strongly suggest that the electronic phase transition at 120 K must be taken into account in the energy band development of FeSe, and, furthermore, the contribution of the structural transition plays an important role in the nematic phase of iron-based high-temperature superconductors.

preprint2022arXiv

Cluster on Wheels

This paper presents a very compact 16-node cluster that is the core of a future robot for collecting and storing massive amounts of sensor data for research on Simultaneous Localization and Mapping (SLAM). To the best of our knowledge, this is the first time that such a cluster is used in robotics. We first present the requirements and different options for computing of such a robot and then show the hardware and software of our solution in detail. The cluster consists of 16 nodes of AMD Ryzen 7 5700U CPUs with a total of 128 cores. As a system that is to be used on a Clearpath Husky robot, it is very small in size, can be operated from battery power and has all required power and networking components integrated. Stress tests on the completed cluster show that it performs well.

preprint2022arXiv

Deep Neural Networks with ReLU-Sine-Exponential Activations Break Curse of Dimensionality in Approximation on Hölder Class

In this paper, we construct neural networks with ReLU, sine and $2^x$ as activation functions. For general continuous $f$ defined on $[0,1]^d$ with continuity modulus $ω_f(\cdot)$, we construct ReLU-sine-$2^x$ networks that enjoy an approximation rate $\mathcal{O}(ω_f(\sqrt{d})\cdot2^{-M}+ω_{f}\left(\frac{\sqrt{d}}{N}\right))$, where $M,N\in \mathbb{N}^{+}$ denote the hyperparameters related to widths of the networks. As a consequence, we can construct ReLU-sine-$2^x$ network with the depth $5$ and width $\max\left\{\left\lceil2d^{3/2}\left(\frac{3μ}ε\right)^{1/α}\right\rceil,2\left\lceil\log_2\frac{3μd^{α/2}}{2ε}\right\rceil+2\right\}$ that approximates $f\in \mathcal{H}_μ^α([0,1]^d)$ within a given tolerance $ε>0$ measured in $L^p$ norm $p\in[1,\infty)$, where $\mathcal{H}_μ^α([0,1]^d)$ denotes the Hölder continuous function class defined on $[0,1]^d$ with order $α\in (0,1]$ and constant $μ> 0$. Therefore, the ReLU-sine-$2^x$ networks overcome the curse of dimensionality on $\mathcal{H}_μ^α([0,1]^d)$. In addition to its supper expressive power, functions implemented by ReLU-sine-$2^x$ networks are (generalized) differentiable, enabling us to apply SGD to train.

preprint2022arXiv

Multi-Entanglement Routing Design over Quantum Networks

Quantum networks are considered as a promising future platform for quantum information exchange and quantum applications, which have capabilities far beyond the traditional communication networks. Remote quantum entanglement is an essential component of a quantum network. How to efficiently design a multi-routing entanglement protocol is a fundamental yet challenging problem. In this paper, we study a quantum entanglement routing problem to simultaneously maximize the number of quantum-user pairs and their expected throughput. Our approach is to formulate the problem as two sequential integer programming steps. We propose efficient entanglement routing algorithms for the two integer programming steps and analyze their time complexity and performance bounds. Results of evaluation highlight that our approach outperforms existing solutions in both served quantum-user pairs numbers and the network expected throughput.

preprint2022arXiv

Unusual band splitting and superconducting gap evolution with sulfur substitution in FeSe

High-resolution angle-resolved photoemission measurements were taken on FeSe$_{1-x}$S$_x$ (x=0, 0.04, and 0.08) superconductors. With an ultrahigh energy resolution of 0.4 meV, unusual two hole bands near the Brillouin-zone center, which was possibly a result of additional symmetry breaking, were identified in all the sulfur-substituted samples. In addition, in both of the hole bands highly anisotropic superconducting gaps with resolution limited nodes were evidenced. We find that the larger superconducting gap on the outer hole band is reduced linearly to the nematic transition temperature while the gap on the inner hole is nearly S-substitution independent. Our observations strongly suggest that the superconducting gap increases with enhanced nematicity although the superconducting transition temperature is not only governed by the pairing strength, demonstrating strong constraints on theories in the FeSe family.

preprint2021arXiv

CFLMEC: Cooperative Federated Learning for Mobile Edge Computing

We investigate a cooperative federated learning framework among devices for mobile edge computing, named CFLMEC, where devices co-exist in a shared spectrum with interference. Keeping in view the time-average network throughput of cooperative federated learning framework and spectrum scarcity, we focus on maximize the admission data to the edge server or the near devices, which fills the gap of communication resource allocation for devices with federated learning. In CFLMEC, devices can transmit local models to the corresponding devices or the edge server in a relay race manner, and we use a decomposition approach to solve the resource optimization problem by considering maximum data rate on sub-channel, channel reuse and wireless resource allocation in which establishes a primal-dual learning framework and batch gradient decent to learn the dynamic network with outdated information and predict the sub-channel condition. With aim at maximizing throughput of devices, we propose communication resource allocation algorithms with and without sufficient sub-channels for strong reliance on edge servers (SRs) in cellular link, and interference aware communication resource allocation algorithm for less reliance on edge servers (LRs) in D2D link. Extensive simulation results demonstrate the CFLMEC can achieve the highest throughput of local devices comparing with existing works, meanwhile limiting the number of the sub-channels.

preprint2020arXiv

A Parallel Optimal Task Allocation Mechanism for Large-Scale Mobile Edge Computing

We consider the problem of intelligent and efficient task allocation mechanism in large-scale mobile edge computing (MEC), which can reduce delay and energy consumption in a parallel and distributed optimization. In this paper, we study the joint optimization model to consider cooperative task management mechanism among mobile terminals (MT), macro cell base station (MBS), and multiple small cell base station (SBS) for large-scale MEC applications. We propose a parallel multi-block Alternating Direction Method of Multipliers (ADMM) based method to model both requirements of low delay and low energy consumption in the MEC system which formulates the task allocation under those requirements as a nonlinear 0-1 integer programming problem. To solve the optimization problem, we develop an efficient combination of conjugate gradient, Newton and linear search techniques based algorithm with Logarithmic Smoothing (for global variables updating) and the Cyclic Block coordinate Gradient Projection (CBGP, for local variables updating) methods, which can guarantee convergence and reduce computational complexity with a good scalability. Numerical results demonstrate the effectiveness of the proposed mechanism and it can effectively reduce delay and energy consumption for a large-scale MEC system.

preprint2020arXiv

A time- and angle-resolved photoemission spectroscopy with probe photon energy up to 6.7 eV

We present the development of a time- and angle-resolved photoemission spectroscopy based on a Yb-based femtosecond laser and a hemispherical electron analyzer. The energy of the pump photon is tunable between 1.4 and 1.9 eV, and the pulse duration is around 30 fs. We use a KBe$_2$BO$_3$F$_2$ non-linear optical crystal to generate probe pulses, of which the photon energy is up to 6.7 eV, and obtain an overall time resolution of 1 ps and energy resolution of 18 meV. In addition, $β$-BaB$_2$O$_4$ crystals are used to generate alternative probe pulses at 6.05 eV, giving an overall time resolution of 130 fs and energy resolution of 19 meV. We illustrate the performance of the system with representative data on several samples (Bi$_2$Se$_3$, YbCd$_2$Sb$_2$, FeSe).

preprint2020arXiv

CL-ADMM: A Cooperative Learning Based Optimization Framework for Resource Management in MEC

We consider the problem of intelligent and efficient resource management framework in mobile edge computing (MEC), which can reduce delay and energy consumption, featuring distributed optimization and efficient congestion avoidance mechanism. In this paper, we present a Cooperative Learning framework for resource management in MEC from an Alternating Direction Method of Multipliers (ADMM) perspective, called CL-ADMM framework. First, in order to caching task efficiently in a group, a novel task popularity estimating scheme is proposed, which is based on semi-Markov process model, then a greedy task cooperative caching mechanism has been established, which can effectively reduce delay and energy consumption. Secondly, for addressing group congestion, a dynamic task migration scheme based on cooperative improved Q-learning is proposed, which can effectively reduce delay and alleviate congestion. Thirdly, for minimizing delay and energy consumption for resources allocation in a group, we formulate it as an optimization problem with a large number of variables, and then exploit a novel ADMM based scheme to address this problem, which can reduce the complexity of problem with a new set of auxiliary variables, these sub-problems are all convex problems, and can be solved by using a primal-dual approach, guaranteeing its convergences. Then we prove that the convergence by using Lyapunov theory. Numerical results demonstrate the effectiveness of the CL-ADMM and it can effectively reduce delay and energy consumption for MEC.

preprint2020arXiv

Laboratory study of the formation of fullerene (from smaller to larger, C$_{44}$ to C$_{70}$)/anthracene cluster cations in the gas phase

The formation and evolution mechanism of fullerenes in the planetary nebula or in the interstellar medium are still not understood. Here we present the study on the cluster formation and the relative reactivity of fullerene cations (from smaller to larger, C$_{44}$ to C$_{70}$) with anthracene molecule (C$_{14}$H$_{10}$). The experiment is performed in the apparatus that combines a quadrupole ion trap with a time-of-flight mass spectrometer. By using a 355 nm laser beam to irradiate the trapped fullerenes cations (C$_{60}$$^+$ or C$_{70}$$^+$), smaller fullerene cations C$_{(60-2n)}$$^+$, n=1-8 or C$_{(70-2m)}$$^+$, m=1-11 are generated, respectively. Then reacting with anthracene molecules, series of fullerene/anthracene cluster cations are newly formed (e.g., (C$_{14}$H$_{10}$)C$_{(60-2n)}$$^+$, n=1-8 and (C$_{14}$H$_{10}$)C$_{(70-2m)}$$^+$, m=1-11), and slight difference of the reactivity within the smaller fullerene cations are observed. Nevertheless, smaller fullerenes show obviously higher reactivity when comparing to fullerene C$_{60}$$^+$ and C$_{70}$$^+$. A successive loss of C$_2$ fragments mechanism is suggested to account for the formation of smaller fullerene cations, which then undergo addition reaction with anthracene molecules to form the fullerene-anthracene cluster cations. It is found that the higher laser energy and longer irradiation time are key factors that affect the formation of smaller fullerene cations. This may indicate that in the strong radiation field environment (such as photon-dominated regions) in space, fullerenes are expected to follow the top-down evolution route, and then form small grain dust (e.g., clusters) through collision reaction with co-existing molecules, here, smaller PAHs.

preprint2020arXiv

Non-Coulomb strong electron-hole binding in $Ta_2NiSe_5$ revealed by time- and angle-resolved photoemission spectroscopy

We reveal an ultrafast purely electronic phase transition in \TNS, which is a plausible excitonic insulator, after excited by an ultrafast infrared laser pulse. Specifically, the order parameter of the strong electron-hole binding shrinks with enhancing the pump pulse, and above a critical pump fluence, a photo-excited semimetallic state is experimentally identified with the absence of ultrafast structural transition. In addition, the bare valence and conduction bands and also the effective exciton binding energy in \TNS~are determined. These findings and detailed analysis suggest a bare nonequilibrium semimetallic phase in $Ta_2NiSe_5$ and the strong electron-hole binding cannot be exclusively driven by Coulomb interaction.

preprint2020arXiv

Towards Efficient Scheduling of Federated Mobile Devices under Computational and Statistical Heterogeneity

Originated from distributed learning, federated learning enables privacy-preserved collaboration on a new abstracted level by sharing the model parameters only. While the current research mainly focuses on optimizing learning algorithms and minimizing communication overhead left by distributed learning, there is still a considerable gap when it comes to the real implementation on mobile devices. In this paper, we start with an empirical experiment to demonstrate computation heterogeneity is a more pronounced bottleneck than communication on the current generation of battery-powered mobile devices, and the existing methods are haunted by mobile stragglers. Further, non-identically distributed data across the mobile users makes the selection of participants critical to the accuracy and convergence. To tackle the computational and statistical heterogeneity, we utilize data as a tuning knob and propose two efficient polynomial-time algorithms to schedule different workloads on various mobile devices, when data is identically or non-identically distributed. For identically distributed data, we combine partitioning and linear bottleneck assignment to achieve near-optimal training time without accuracy loss. For non-identically distributed data, we convert it into an average cost minimization problem and propose a greedy algorithm to find a reasonable balance between computation time and accuracy. We also establish an offline profiler to quantify the runtime behavior of different devices, which serves as the input to the scheduling algorithms. We conduct extensive experiments on a mobile testbed with two datasets and up to 20 devices. Compared with the common benchmarks, the proposed algorithms achieve 2-100x speedup epoch-wise, 2-7% accuracy gain and boost the convergence rate by more than 100% on CIFAR10.