Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
31works
0followers
21topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

31 published item(s)

preprint2026arXiv

DecisionLLM: Large Language Models for Long Sequence Decision Exploration

Long-sequence decision-making, which is usually addressed through reinforcement learning (RL), is a critical component for optimizing strategic operations in dynamic environments, such as real-time bidding in computational advertising. The Decision Transformer (DT) introduced a powerful paradigm by framing RL as an autoregressive sequence modeling problem. Concurrently, Large Language Models (LLMs) have demonstrated remarkable success in complex reasoning and planning tasks. This inspires us whether LLMs, which share the same Transformer foundation, but operate at a much larger scale, can unlock new levels of performance in long-horizon sequential decision-making problem. This work investigates the application of LLMs to offline decision making tasks. A fundamental challenge in this domain is the LLMs' inherent inability to interpret continuous values, as they lack a native understanding of numerical magnitude and order when values are represented as text strings. To address this, we propose treating trajectories as a distinct modality. By learning to align trajectory data with natural language task descriptions, our model can autoregressively predict future decisions within a cohesive framework we term DecisionLLM. We establish a set of scaling laws governing this paradigm, demonstrating that performance hinges on three factors: model scale, data volume, and data quality. In offline experimental benchmarks and bidding scenarios, DecisionLLM achieves strong performance. Specifically, DecisionLLM-3B outperforms the traditional Decision Transformer (DT) by 69.4 on Maze2D umaze-v1 and by 0.085 on AuctionNet. It extends the AIGB paradigm and points to promising directions for future exploration in online bidding.

preprint2026arXiv

Stephanie2: Thinking, Waiting, and Making Decisions Like Humans in Step-by-Step AI Social Chat

Instant-messaging human social chat typically progresses through a sequence of short messages. Existing step-by-step AI chatting systems typically split a one-shot generation into multiple messages and send them sequentially, but they lack an active waiting mechanism and exhibit unnatural message pacing. In order to address these issues, we propose Stephanie2, a novel next-generation step-wise decision-making dialogue agent. With active waiting and message-pace adaptation, Stephanie2 explicitly decides at each step whether to send or wait, and models latency as the sum of thinking time and typing time to achieve more natural pacing. We further introduce a time-window-based dual-agent dialogue system to generate pseudo dialogue histories for human and automatic evaluations. Experiments show that Stephanie2 clearly outperforms Stephanie1 on metrics such as naturalness and engagement, and achieves a higher pass rate on human evaluation with the role identification Turing test.

preprint2026arXiv

TAP-ViTs: Task-Adaptive Pruning for On-Device Deployment of Vision Transformers

Vision Transformers (ViTs) have demonstrated strong performance across a wide range of vision tasks, yet their substantial computational and memory demands hinder efficient deployment on resource-constrained mobile and edge devices. Pruning has emerged as a promising direction for reducing ViT complexity. However, existing approaches either (i) produce a single pruned model shared across all devices, ignoring device heterogeneity, or (ii) rely on fine-tuning with device-local data, which is often infeasible due to limited on-device resources and strict privacy constraints. As a result, current methods fall short of enabling task-customized ViT pruning in privacy-preserving mobile computing settings. This paper introduces TAP-ViTs, a novel task-adaptive pruning framework that generates device-specific pruned ViT models without requiring access to any raw local data. Specifically, to infer device-level task characteristics under privacy constraints, we propose a Gaussian Mixture Model (GMM)-based metric dataset construction mechanism. Each device fits a lightweight GMM to approximate its private data distribution and uploads only the GMM parameters. Using these parameters, the cloud selects distribution-consistent samples from public data to construct a task-representative metric dataset for each device. Based on this proxy dataset, we further develop a dual-granularity importance evaluation-based pruning strategy that jointly measures composite neuron importance and adaptive layer importance, enabling fine-grained, task-aware pruning tailored to each device's computational budget. Extensive experiments across multiple ViT backbones and datasets demonstrate that TAP-ViTs consistently outperforms state-of-the-art pruning methods under comparable compression ratios.

preprint2026arXiv

Total Gluon Helicity Contribution to the Proton Spin from Lattice QCD

We report a state-of-the-art lattice QCD calculation of the total gluon helicity contribution to the proton spin, $ΔG$. The calculation is done on ensembles with three different lattice spacings $a=\{0.08, 0.09, 0.11\}$ fm. By employing distillation and momentum smearing for proton external states, we extract the bare matrix elements of the topological current $K^μ$ using 5-HYP smeared Coulomb gauge fixing configurations. Furthermore, we apply a non-perturbative $\mathrm{RI/MOM}$ renormalization scheme augmented by the Cluster Decomposition Error Reduction (CDER) technique to determine the renormalization constants of $K^μ$. The results obtained from different components $K^{t,i}$ (with $i$ being the direction of proton momentum or polarization) are consistent with Lorentz covariance within uncertainties. After extrapolating to the continuum limit, $ΔG$ is found to be $ΔG = 0.231(17)^{\mathrm{sta.}}(44)^{\mathrm{sym.}}$ at the $\overline{\mathrm{MS}}$ scale $μ^2=10\ \mathrm{GeV}^2$, which constitutes approximately $46(9)\%$ of the proton spin.

preprint2024arXiv

BiSinger: Bilingual Singing Voice Synthesis

Although Singing Voice Synthesis (SVS) has made great strides with Text-to-Speech (TTS) techniques, multilingual singing voice modeling remains relatively unexplored. This paper presents BiSinger, a bilingual pop SVS system for English and Chinese Mandarin. Current systems require separate models per language and cannot accurately represent both Chinese and English, hindering code-switch SVS. To address this gap, we design a shared representation between Chinese and English singing voices, achieved by using the CMU dictionary with mapping rules. We fuse monolingual singing datasets with open-source singing voice conversion techniques to generate bilingual singing voices while also exploring the potential use of bilingual speech data. Experiments affirm that our language-independent representation and incorporation of related datasets enable a single model with enhanced performance in English and code-switch SVS while maintaining Chinese song performance. Audio samples are available at https://bisinger-svs.github.io.

preprint2024arXiv

Quark masses and low energy constants in the continuum from the tadpole improved clover ensembles

We present the light-flavor quark masses and low energy constants using the 2+1 flavor full-QCD ensembles with stout smeared clover fermion action and Symanzik gauge actions. Both the fermion and gauge actions are tadpole improved self-consistently. The simulations are performed on 11 ensembles at 3 lattice spacings $a\in[0.05,0.11]$ fm, 4 spatial sizes $L\in[2.5, 5.1]$ fm, 7 pion masses $m_π\in[135,350]$ MeV, and several values of the strange quark mass. The quark mass is defined through the partially conserved axial current (PCAC) relation and renormalized to $\overline{\mathrm{MS}}$ 2 GeV through the intermediate regularization independent momentum subtraction (RI/MOM) scheme. The systematic uncertainty of using the symmetric momentum subtraction (SMOM) scheme is also included. Eventually, we predict $m_u=2.45(22)(20)$ MeV, $m_d=4.74(11)(09)$ MeV, and $m_s=98.8(2.9)(4.7)$ MeV with the systematic uncertainties from lattice spacing determination, continuum extrapolation and renormalization constant included. We also obtain the chiral condensate $Σ^{1/3}=268.6(3.6)(0.7)$ MeV and the pion decay constant $F=86.6(7)(1.4) $ MeV in the $N_f=2$ chiral limit, and the next-to-leading order low energy constants $\ell_3=2.43(54)(05)$ and $\ell_4=4.322(75)(96)$.

preprint2023arXiv

Event-Based Fusion for Motion Deblurring with Cross-modal Attention

Traditional frame-based cameras inevitably suffer from motion blur due to long exposure times. As a kind of bio-inspired camera, the event camera records the intensity changes in an asynchronous way with high temporal resolution, providing valid image degradation information within the exposure time. In this paper, we rethink the eventbased image deblurring problem and unfold it into an end-to-end two-stage image restoration network. To effectively fuse event and image features, we design an event-image cross-modal attention module applied at multiple levels of our network, which allows to focus on relevant features from the event branch and filter out noise. We also introduce a novel symmetric cumulative event representation specifically for image deblurring as well as an event mask gated connection between the two stages of our network which helps avoid information loss. At the dataset level, to foster event-based motion deblurring and to facilitate evaluation on challenging real-world images, we introduce the Real Event Blur (REBlur) dataset, captured with an event camera in an illumination controlled optical laboratory. Our Event Fusion Network (EFNet) sets the new state of the art in motion deblurring, surpassing both the prior best-performing image-based method and all event-based methods with public implementations on the GoPro dataset (by up to 2.47dB) and on our REBlur dataset, even in extreme blurry conditions. The code and our REBlur dataset will be made publicly available.

preprint2022arXiv

A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs

Multi-tenant machine learning services have become emerging data-intensive workloads in data centers with heavy usage of GPU resources. Due to the large scale, many tuning parameters and heavy resource usage, it is usually impractical to evaluate and benchmark those machine learning services on real clusters. In this demonstration, we present AnalySIM, a cluster simulator that allows efficient design explorations for multi-tenant machine learning services. Specifically, by trace-driven cluster workload simulation, AnalySIM can easily test and analyze various scheduling policies in a number of performance metrics such as GPU resource utilization. AnalySIM simulates the cluster computational resource based on both physical topology and logical partition. The tool has been used in SenseTime to understand the impact of different scheduling policies with the trace from a real production cluster of over 1000 GPUs. We find that preemption and migration are able to significantly reduce average job completion time and mitigate the resource fragmentation problem.

preprint2022arXiv

Deep Learning Workload Scheduling in GPU Datacenters: Taxonomy, Challenges and Vision

Deep learning (DL) shows its prosperity in a wide variety of fields. The development of a DL model is a time-consuming and resource-intensive procedure. Hence, dedicated GPU accelerators have been collectively constructed into a GPU datacenter. An efficient scheduler design for such GPU datacenter is crucially important to reduce the operational cost and improve resource utilization. However, traditional approaches designed for big data or high performance computing workloads can not support DL workloads to fully utilize the GPU resources. Recently, substantial schedulers are proposed to tailor for DL workloads in GPU datacenters. This paper surveys existing research efforts for both training and inference workloads. We primarily present how existing schedulers facilitate the respective workloads from the scheduling objectives and resource consumption features. Finally, we prospect several promising future research directions. More detailed summary with the surveyed paper and code links can be found at our project website: https://github.com/S-Lab-System-Group/Awesome-DL-Scheduling-Papers

preprint2022arXiv

Distribution Amplitudes of $K^*$ and $ϕ$ at Physical Pion Mass from Lattice QCD

We present the first lattice QCD calculation of the distribution amplitudes of longitudinally and transversely polarized vector mesons $K^*$ and $ϕ$ using large momentum effective theory. We use the clover fermion action on three ensembles with 2+1+1 flavors of highly improved staggered quarks (HISQ) action, generated by MILC collaboration, at physical pion mass and \{0.06, 0.09, 0.12\} fm lattice spacings, and choose three different hadron momenta $P_z=\{1.29, 1.72, 2.15\}$ GeV. The resulting lattice matrix elements are nonperturbatively renormalized in a hybrid scheme proposed recently. An extrapolation to the continuum and infinite momentum limit is carried out. We find that while the longitudinal distribution amplitudes tend to be close to the asymptotic form, the transverse ones deviate rather significantly from the asymptotic form. Our final results provide crucial {\it ab initio} theory inputs for analyzing pertinent exclusive processes.

preprint2022arXiv

First Lattice QCD determination of semileptonic decays of charmed-strange baryons $Ξ_c$

While the standard model is the most successfully theory to describe all interactions and constituents in elementary particle physics, it has been constantly examined for over four decades. Weak decays of charm quarks can measure the coupling strength of quarks in different families and serve as an ideal probe for CP violation. As the lowest charm-strange baryons with three different flavors, $Ξ_c$ baryons (made of $csu$ or $csd$) have been extensively studied in experiments at the large hadron collider and in electron-positron collision. However the lack of reliable knowledge in theory becomes the unavoidable obstacle in the way. In this work, we use the state-of-the-art Lattice QCD techniques, and generate 2+1 clover fermion ensembles with two lattice spacings, $a=(0.108{\rm fm},0.080{\rm fm})$. We then present the first {\it ab-initio} lattice QCD determination of form factors governing $Ξ_{c}\to Ξ\ell^+ν_{\ell}$, analogous with the notable $β$-decay of nuclei. Our theoretical results for decay widths are consistent with and about two times more precise than the latest measurements by ALICE and Belle collaborations. Together with experimental measurements, we independently determine the quark-mixing matrix element $|V_{cs}|$, which is found in good agreement with other determinations.

preprint2022arXiv

Generative Adversarial Exploration for Reinforcement Learning

Exploration is crucial for training the optimal reinforcement learning (RL) policy, where the key is to discriminate whether a state visiting is novel. Most previous work focuses on designing heuristic rules or distance metrics to check whether a state is novel without considering such a discrimination process that can be learned. In this paper, we propose a novel method called generative adversarial exploration (GAEX) to encourage exploration in RL via introducing an intrinsic reward output from a generative adversarial network, where the generator provides fake samples of states that help discriminator identify those less frequently visited states. Thus the agent is encouraged to visit those states which the discriminator is less confident to judge as visited. GAEX is easy to implement and of high training efficiency. In our experiments, we apply GAEX into DQN and the DQN-GAEX algorithm achieves convincing performance on challenging exploration problems, including the game Venture, Montezuma's Revenge and Super Mario Bros, without further fine-tuning on complicate learning algorithms. To our knowledge, this is the first work to employ GAN in RL exploration problems.

preprint2022arXiv

Nonperturbative Determination of Collins-Soper Kernel from Quasi Transverse-Momentum Dependent Wave Functions

In the framework of large-momentum effective theory at one-loop matching accuracy, we perform a lattice calculation of the Collins-Soper kernel which governs the rapidity evolution of transverse-momentum-dependent (TMD) distributions. We first obtain the quasi TMD wave functions at three different meson momenta on a lattice with valence clover quarks on a dynamical HISQ sea and lattice spacing $a=0.12$~fm from MILC, and renormalize the pertinent linear divergences using Wilson loops. Through one-loop matching to the light-cone wave functions, we determine the Collins-Soper kernel with transverse separation up to 0.6~fm. We study the systematic uncertainties from operator mixing and scale dependence, as well as the impact from higher power corrections. Our results potentially allow for a determination of the soft function and other transverse-momentum dependent quantities at one-loop accuracy.

preprint2022arXiv

Probing the $Zb\bar{b}$ anomalous couplings via exclusive $Z$ boson decay

We propose to utilize the exclusive $Z$-boson rare decays $Z\to Υ(ns)+γ$ to constrain the $Zb\bar{b}$ couplings at the HL-LHC and 100 TeV proton-proton collider. We demonstrate that the event yield of the proposed processes is sensitive to the axial-vector component of the $Zb\bar{b}$ coupling and can provide complementary information to the jet-charge weighted single-spin asymmetry measurement at the EIC and the $gg\to Zh$ production rate measurement at the LHC. By applying the NRQCD factorization formalism, we calculate the partial decay width of $Z\to Υ(ns)+γ$ to the NLO accuracy in strong interaction, which is found to agree with those obtained from the light-cone distribution amplitude approach. We show that the HL-LHC can break the degeneracy of the $Zb\bar{b}$ couplings, as implied by the precision electroweak data at LEP and SLC, if the signal efficiency can be improved by a factor of 1.7 from the present ATLAS analysis at the 13 TeV LHC with an integrated luminosity of $36.1~{\rm fb}^{-1}$.

preprint2022arXiv

TrajGen: Generating Realistic and Diverse Trajectories with Reactive and Feasible Agent Behaviors for Autonomous Driving

Realistic and diverse simulation scenarios with reactive and feasible agent behaviors can be used for validation and verification of self-driving system performance without relying on expensive and time-consuming real-world testing. Existing simulators rely on heuristic-based behavior models for background vehicles, which cannot capture the complex interactive behaviors in real-world scenarios. To bridge the gap between simulation and the real world, we propose TrajGen, a two-stage trajectory generation framework, which can capture more realistic behaviors directly from human demonstration. In particular, TrajGen consists of the multi-modal trajectory prediction stage and the reinforcement learning based trajectory modification stage. In the first stage, we propose a novel auxiliary RouteLoss for the trajectory prediction model to generate multi-modal diverse trajectories in the drivable area. In the second stage, reinforcement learning is used to track the predicted trajectories while avoiding collisions, which can improve the feasibility of generated trajectories. In addition, we develop a data-driven simulator I-Sim that can be used to train reinforcement learning models in parallel based on naturalistic driving data. The vehicle model in I-Sim can guarantee that the generated trajectories by TrajGen satisfy vehicle kinematic constraints. Finally, we give comprehensive metrics to evaluate generated trajectories for simulation scenarios, which shows that TrajGen outperforms either trajectory prediction or inverse reinforcement learning in terms of fidelity, reactivity, feasibility, and diversity.

preprint2022arXiv

Wireless Image Transmission Using Deep Source Channel Coding With Attention Modules

Recent research on joint source channel coding (JSCC) for wireless communications has achieved great success owing to the employment of deep learning (DL). However, the existing work on DL based JSCC usually trains the designed network to operate under a specific signal-to-noise ratio (SNR) regime, without taking into account that the SNR level during the deployment stage may differ from that during the training stage. A number of networks are required to cover the scenario with a broad range of SNRs, which is computational inefficiency (in the training stage) and requires large storage. To overcome these drawbacks our paper proposes a novel method called Attention DL based JSCC (ADJSCC) that can successfully operate with different SNR levels during transmission. This design is inspired by the resource assignment strategy in traditional JSCC, which dynamically adjusts the compression ratio in source coding and the channel coding rate according to the channel SNR. This is achieved by resorting to attention mechanisms because these are able to allocate computing resources to more critical tasks. Instead of applying the resource allocation strategy in traditional JSCC, the ADJSCC uses the channel-wise soft attention to scaling features according to SNR conditions. We compare the ADJSCC method with the state-of-the-art DL based JSCC method through extensive experiments to demonstrate its adaptability, robustness and versatility. Compared with the existing methods, the proposed method takes less storage and is more robust in the presence of channel mismatch.

preprint2021arXiv

Efficient Channel Estimation for RIS-Aided MIMO Communications with Unitary Approximate Message Passing

Reconfigurable intelligent surface (RIS) is very promising for wireless networks to achieve high energy efficiency, extended coverage, improved capacity, massive connectivity, etc. To unleash the full potentials of RIS-aided communications, acquiring accurate channel state information is crucial, which however is very challenging. For RIS-aided multiple-input and multiple-output (MIMO) communications, the existing channel estimation methods have computational complexity growing rapidly with the number of RIS units $N$ (e.g., in the order of $N^2$ or $N^3$) and/or have special requirements on the matrices involved (e.g., the matrices need to be sparse for algorithm convergence to achieve satisfactory performance), which hinder their applications. In this work, instead of using the conventional signal model in the literature, we derive a new signal model obtained through proper vectorization and reduction operations. Then, leveraging the unitary approximate message passing (UAMP), we develop a more efficient channel estimator that has complexity linear with $N$ and does not have special requirements on the relevant matrices, thanks to the robustness of UAMP. These facilitate the applications of the proposed algorithm to a general RIS-aided MIMO system with a larger $N$. Moreover, extensive numerical results show that the proposed estimator delivers much better performance and/or requires significantly less number of training symbols, thereby leading to notable reductions in both training overhead and latency.

preprint2021arXiv

Near Threshold Heavy Quarkonium Photoproduction at Large Momentum Transfer

Perturbative QCD is applied to investigate the near threshold heavy quarkonium photoproduction at large momentum transfer. We take into account the contributions from the leading three-quark Fock states of the nucleon. The dominant contribution comes from the three-quark Fock state with one unit quark orbital angular momentum (OAM) whereas that from zero quark OAM is suppressed at the threshold. From our analysis, we also show that there is no direct connection between the near threshold heavy quarkonium photoproduction and the gluonic gravitational form factors of the nucleon. Based on the comparison between our result and recent GlueX data of $J/ψ$ photoproduction, we make predictions for $ψ'$ and $Υ$ (1S,2S) states which can be tested in future experiments.

preprint2020arXiv

Graph-guided Architecture Search for Real-time Semantic Segmentation

Designing a lightweight semantic segmentation network often requires researchers to find a trade-off between performance and speed, which is always empirical due to the limited interpretability of neural networks. In order to release researchers from these tedious mechanical trials, we propose a Graph-guided Architecture Search (GAS) pipeline to automatically search real-time semantic segmentation networks. Unlike previous works that use a simplified search space and stack a repeatable cell to form a network, we introduce a novel search mechanism with new search space where a lightweight model can be effectively explored through the cell-level diversity and latencyoriented constraint. Specifically, to produce the cell-level diversity, the cell-sharing constraint is eliminated through the cell-independent manner. Then a graph convolution network (GCN) is seamlessly integrated as a communication mechanism between cells. Finally, a latency-oriented constraint is endowed into the search process to balance the speed and performance. Extensive experiments on Cityscapes and CamVid datasets demonstrate that GAS achieves the new state-of-the-art trade-off between accuracy and speed. In particular, on Cityscapes dataset, GAS achieves the new best performance of 73.5% mIoU with speed of 108.4 FPS on Titan Xp.

preprint2020arXiv

Identification of splicing edges in tampered image based on Dichromatic Reflection Model

Imaging is a sophisticated process combining a plenty of photovoltaic conversions, which lead to some spectral signatures beyond visual perception in the final images. Any manipulation against an original image will destroy these signatures and inevitably leave some traces in the final forgery. Therefore we present a novel optic-physical method to discriminate splicing edges from natural edges in a tampered image. First, we transform the forensic image from RGB into color space of S and o1o2. Then on the assumption of Dichromatic Reflection Model, edges in the image are discovered by composite gradient and classified into different types based on their different photometric properties. Finally, splicing edges are reserved against natural ones by a simple logical algorithm. Experiment results show the efficacy of the proposed method.

preprint2020arXiv

Lattice QCD package GWU-code and QUDA with HIP

The open source HIP platform for GPU computing provides an uniform framework to support both the NVIDIA and AMD GPUs, and also the possibility to porting the CUDA code to the HIP- compatible one. We present the porting progress on the Overlap fermion inverter (GWU-code) and also the general Lattice QCD inverter package - QUDA. The manual of using QUDA on HIP and also the tips of porting general CUDA code into the HIP framework are also provided.

preprint2020arXiv

Measuring the Utilization of Public Open Spaces by Deep Learning: a Benchmark Study at the Detroit Riverfront

Physical activities and social interactions are essential activities that ensure a healthy lifestyle. Public open spaces (POS), such as parks, plazas and greenways, are key environments that encourage those activities. To evaluate a POS, there is a need to study how humans use the facilities within it. However, traditional approaches to studying use of POS are manual and therefore time and labor intensive. They also may only provide qualitative insights. It is appealing to make use of surveillance cameras and to extract user-related information through computer vision. This paper proposes a proof-of-concept deep learning computer vision framework for measuring human activities quantitatively in POS and demonstrates a case study of the proposed framework using the Detroit Riverfront Conservancy (DRFC) surveillance camera network. A custom image dataset is presented to train the framework; the dataset includes 7826 fully annotated images collected from 18 cameras across the DRFC park space under various illumination conditions. Dataset analysis is also provided as well as a baseline model for one-step user localization and activity recognition. The mAP results are 77.5\% for {\it pedestrian} detection and 81.6\% for {\it cyclist} detection. Behavioral maps are autonomously generated by the framework to locate different POS users and the average error for behavioral localization is within 10 cm.

preprint2020arXiv

Micromobility in Smart Cities: A Closer Look at Shared Dockless E-Scooters via Big Social Data

The micromobility is shaping first- and last-mile travels in urban areas. Recently, shared dockless electric scooters (e-scooters) have emerged as a daily alternative to driving for short-distance commuters in large cities due to the affordability, easy accessibility via an app, and zero emissions. Meanwhile, e-scooters come with challenges in city management, such as traffic rules, public safety, parking regulations, and liability issues. In this paper, we collected and investigated 5.8 million scooter-tagged tweets and 144,197 images, generated by 2.7 million users from October 2018 to March 2020, to take a closer look at shared e-scooters via crowdsourcing data analytics. We profiled e-scooter usages from spatial-temporal perspectives, explored different business roles (i.e., riders, gig workers, and ridesharing companies), examined operation patterns (e.g., injury types, and parking behaviors), and conducted sentiment analysis. To our best knowledge, this paper is the first large-scale systematic study on shared e-scooters using big social data.

preprint2020arXiv

Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation

To satisfy the stringent requirements on computational resources in the field of real-time semantic segmentation, most approaches focus on the hand-crafted design of light-weight segmentation networks. Recently, Neural Architecture Search (NAS) has been used to search for the optimal building blocks of networks automatically, but the network depth, downsampling strategy, and feature aggregation way are still set in advance by trial and error. In this paper, we propose a joint search framework, called AutoRTNet, to automate the design of these strategies. Specifically, we propose hyper-cells to jointly decide the network depth and downsampling strategy, and an aggregation cell to achieve automatic multi-scale feature aggregation. Experimental results show that AutoRTNet achieves 73.9% mIoU on the Cityscapes test set and 110.0 FPS on an NVIDIA TitanXP GPU card with 768x1536 input images.

preprint2020arXiv

Trace anomaly and dynamical quark mass

We investigated the origin of the RI&#39;/MOM quark mass under the Landau gauge at the non-perturbative scale, using the chiral fermion with different quark masses and lattice spacings. Our result confirms that such a mass is non-vanishing based on the linear extrapolation to the chiral and continuum limit, and shows that such a mass comes from the spontaneous chiral symmetry breaking induced by the near zero modes with the eigenvalue $λ<{\cal O}(5m_q)$, and is proportional to the quark matrix element of the trace anomaly at least down to $\sim $1.3 GeV.

preprint2019arXiv

Unpolarized isovector quark distribution function from Lattice QCD: A systematic analysis of renormalization and matching

We present a detailed Lattice QCD study of the unpolarized isovector quark Parton Distribution Function (PDF) using large-momentum effective theory framework. We choose a quasi-PDF defined by a spatial correlator which is free from mixing with other operators of the same dimension. In the lattice simulation, we use a Gaussian-momentum-smeared source at $M_π=356$ MeV and $P_z \in \{1.8,2.3\}$ GeV. To control the systematics associated with the excited states, we explore {five different source-sink separations}. The nonperturbative renormalization is conducted in a regularization-independent momentum subtraction scheme, and the matching between the renormalized quasi-PDF and $\bar{\rm MS}$ PDF is calculated based on perturbative QCD up to one-loop order. Systematic errors due to renormalization and perturbative matching are also analyzed in detail. Our results for lightcone PDF are in reasonable agreement with the latest phenomenological analysis.