Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
59works
0followers
49topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

59 published item(s)

preprint2026arXiv

Enhancing Blind Video Quality Assessment with Rich Quality-aware Features

Blind video quality assessment (BVQA) is a highly challenging task due to the intrinsic complexity of video content and visual distortions, especially given the high popularity of social media videos, which originate from a wide range of sources, and are often processed by various compression and enhancement algorithms. While recent BVQA and blind image quality assessment (BIQA) studies have made remarkable progress, their models typically perform well on the datasets they were trained on but generalize poorly to unseen videos, making them less effective for accurately evaluating the perceptual quality of diverse social media videos. In this paper, we propose Rich Quality-aware features enabled Video Quality Assessment (RQ-VQA), a simple yet effective method to enhance BVQA by leveraging rich quality-aware features extracted from off-the-shelf BIQA and BVQA models. Our approach exploits the expertise of existing quality assessment models within their trained domains to improve generalization. Specifically, we design a multi-source feature framework that integrates:(1) Learnable spatial features} from a base model fine-tuned on the target VQA dataset to capture domain-specific quality cues; (2) Temporal motion features from the fast pathway of SlowFast pre-trained on action recognition datasets to model motion-related distortions; (3) Spatial quality-aware features from BIQA models trained on diverse IQA datasets to enhance frame-level distortion representation; and (4) Spatiotemporal quality-aware features from a BVQA model trained on large-scale VQA datasets to jointly encode spatial structure and temporal dynamics. These features are concatenated and fed into a multi-layer perceptron (MLP) to regress them into quality scores. Experimental results demonstrate that our model achieves state-of-the-art performance on three public social media VQA datasets.

preprint2025arXiv

ACE-RL: Adaptive Constraint-Enhanced Reward for Long-form Generation Reinforcement Learning

Long-form generation has become a critical and challenging application for Large Language Models (LLMs). Existing studies are limited by their reliance on scarce, high-quality long-form response data and their focus on coarse-grained, general-purpose metrics (e.g., coherence and helpfulness), overlooking the nuanced, scenario-specific requirements of real-world tasks. To address these limitations, we propose a framework utilizing Adaptive Constraint-Enhanced reward for long-form generation Reinforcement Learning (ACE-RL). ACE-RL first decomposes each instruction into a set of fine-grained, adaptive constraint criteria spanning key dimensions of long-form generation tasks. Subsequently, we design a reward mechanism to quantify the response quality based on their satisfaction over corresponding constraints, converting subjective quality evaluation into constraint verification. Finally, we leverage reinforcement learning to optimize LLMs using these fine-grained signals. Experimental results show that ACE-RL significantly outperforms existing SFT and RL baselines by 18.63% and 7.61% on WritingBench, and our top-performing model even surpasses proprietary systems like GPT-4o by 8.76%, providing a more effective training paradigm in long-form generation scenarios.

preprint2025arXiv

Causal LLM Routing: End-to-End Regret Minimization from Observational Data

LLM routing aims to select the most appropriate model for each query, balancing competing performance metrics such as accuracy and cost across a pool of language models. Prior approaches typically adopt a decoupled strategy, where the metrics are first predicted and the model is then selected based on these estimates. This setup is prone to compounding errors and often relies on full-feedback data, where each query is evaluated by all candidate models, which is costly to obtain and maintain in practice. In contrast, we learn from observational data, which records only the outcome of the model actually deployed. We propose a causal end-to-end framework that learns routing policies by minimizing decision-making regret from observational data. To enable efficient optimization, we introduce two theoretically grounded surrogate objectives: a classification-based upper bound, and a softmax-weighted regret approximation shown to recover the optimal policy at convergence. We further extend our framework to handle heterogeneous cost preferences via an interval-conditioned architecture. Experiments on public benchmarks show that our method outperforms existing baselines, achieving state-of-the-art performance across different embedding models.

preprint2024arXiv

Quark masses and low energy constants in the continuum from the tadpole improved clover ensembles

We present the light-flavor quark masses and low energy constants using the 2+1 flavor full-QCD ensembles with stout smeared clover fermion action and Symanzik gauge actions. Both the fermion and gauge actions are tadpole improved self-consistently. The simulations are performed on 11 ensembles at 3 lattice spacings $a\in[0.05,0.11]$ fm, 4 spatial sizes $L\in[2.5, 5.1]$ fm, 7 pion masses $m_π\in[135,350]$ MeV, and several values of the strange quark mass. The quark mass is defined through the partially conserved axial current (PCAC) relation and renormalized to $\overline{\mathrm{MS}}$ 2 GeV through the intermediate regularization independent momentum subtraction (RI/MOM) scheme. The systematic uncertainty of using the symmetric momentum subtraction (SMOM) scheme is also included. Eventually, we predict $m_u=2.45(22)(20)$ MeV, $m_d=4.74(11)(09)$ MeV, and $m_s=98.8(2.9)(4.7)$ MeV with the systematic uncertainties from lattice spacing determination, continuum extrapolation and renormalization constant included. We also obtain the chiral condensate $Σ^{1/3}=268.6(3.6)(0.7)$ MeV and the pion decay constant $F=86.6(7)(1.4) $ MeV in the $N_f=2$ chiral limit, and the next-to-leading order low energy constants $\ell_3=2.43(54)(05)$ and $\ell_4=4.322(75)(96)$.

preprint2022arXiv

$T_{cc}^{+}(3875)$ relevant $DD^*$ scattering from $N_f=2$ lattice QCD

The $S$-wave $DD^*$ scattering in the isospin $I=0,1$ channels is studied in $N_f=2$ lattice QCD at $m_π\approx 350$ MeV. It is observed that the $DD^*$ interaction is repulsive in the $I=1$ channel when the $DD^*$ energy is near the $DD^*$ threshold. In contrast, the $DD^*$ interaction in the $I=0$ channel is definitely attractive in a wide range of the $DD^*$ energy. This is consistent with the isospin assignment $I=0$ for $T_{cc}^+(3875)$. By analyzing the components of the $DD^*$ correlation functions, it turns out that the quark diagram responsible for the different properties of $I=0,1$ $DD^*$ interactions can be understood as the charged $ρ$ meson exchange effect. This observation provides direct information on the internal dynamics of $T_{cc}^+(3875)$.

preprint2022arXiv

A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency Maps

To support the application scenarios where high-resolution (HR) images are urgently needed, various single image super-resolution (SISR) algorithms are developed. However, SISR is an ill-posed inverse problem, which may bring artifacts like texture shift, blur, etc. to the reconstructed images, thus it is necessary to evaluate the quality of super-resolution images (SRIs). Note that most existing image quality assessment (IQA) methods were developed for synthetically distorted images, which may not work for SRIs since their distortions are more diverse and complicated. Therefore, in this paper, we propose a no-reference deep-learning image quality assessment method based on frequency maps because the artifacts caused by SISR algorithms are quite sensitive to frequency information. Specifically, we first obtain the high-frequency map (HM) and low-frequency map (LM) of SRI by using Sobel operator and piecewise smooth image approximation. Then, a two-stream network is employed to extract the quality-aware features of both frequency maps. Finally, the features are regressed into a single quality value using fully connected layers. The experimental results show that our method outperforms all compared IQA models on the selected three super-resolution quality assessment (SRQA) databases.

preprint2022arXiv

An Opposite Gaussian Product Inequality

The long-standing Gaussian product inequality (GPI) conjecture states that $E [\prod_{j=1}^{n}|X_j|^{α_j}]\geq\prod_{j=1}^{n}E[|X_j|^{α_j}]$ for any centered Gaussian random vector $(X_1,\dots,X_n)$ and any non-negative real numbers $α_j$, $j=1,\ldots,{n}$. In this note, we prove a novel &#34;opposite GPI&#34; for centered bivariate Gaussian random variables when $-1<α_1<0$ and $α_2>0$: $E[|X_1|^{α_1}|X_2|^{α_2}]\le E[|X_1|^{α_1}]E[|X_2|^{α_2}]$. This completes the picture of bivariate Gaussian product relations.

preprint2022arXiv

Annihilation diagram contribution to charmonium masses

In this work, we generate gauge configurations with $N_f=2$ dynamical charm quarks on anisotropic lattices. The mass shift of $1S$ and $1P$ charmonia owing to the charm quark annihilation effect can be investigated directly in a manner of unitary theory. The distillation method is adopted to treat the charm quark annihilation diagrams at a very precise level. For $1S$ charmonia, the charm quark annihilation effect almost does not change the $J/ψ$ mass, but lifts the $η_c$ mass by approximately 3-4 MeV. For $1P$ charmonia, this effect results in positive mass shifts of approximately 1 MeV for $χ_{c1}$ and $h_c$, but decreases the $χ_{c2}$ mass by approximately 3 MeV. We have not obtain a reliable result for the mass shift of $χ_{c0}$. In addition, it is observed that the spin averaged mass of the spin-triplet $1P$ charmonia is in a good agreement with the $h_c$, as expected by the non-relativistic quark model and measured by experiments.

preprint2022arXiv

Autonomous Smart Grid Fault Detection

Smart grid plays a crucial role for the smart society and the upcoming carbon neutral society. Achieving autonomous smart grid fault detection is critical for smart grid system state awareness, maintenance and operation. This paper focuses on fault monitoring in smart grid and discusses the inherent technical challenges and solutions. In particular, we first present the basic principles of smart grid fault detection. Then, we explain the new requirements for autonomous smart grid fault detection, the technical challenges and their possible solutions. A case study is introduced, as a preliminary study for autonomous smart grid fault detection. In addition, we highlight relevant directions for future research.

preprint2022arXiv

Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency

The intelligent video surveillance system (IVSS) can automatically analyze the content of the surveillance image (SI) and reduce the burden of the manual labour. However, the SIs may suffer quality degradations in the procedure of acquisition, compression, and transmission, which makes IVSS hard to understand the content of SIs. In this paper, we first conduct an example experiment (i.e. the face detection task) to demonstrate that the quality of the SIs has a crucial impact on the performance of the IVSS, and then propose a saliency-based deep neural network for the blind quality assessment of the SIs, which helps IVSS to filter the low-quality SIs and improve the detection and recognition performance. Specifically, we first compute the saliency map of the SI to select the most salient local region since the salient regions usually contain rich semantic information for machine vision and thus have a great impact on the overall quality of the SIs. Next, the convolutional neural network (CNN) is adopted to extract quality-aware features for the whole image and local region, which are then mapped into the global and local quality scores through the fully connected (FC) network respectively. Finally, the overall quality score is computed as the weighted sum of the global and local quality scores. Experimental results on the SI quality database (SIQD) show that the proposed method outperforms all compared state-of-the-art BIQA methods.

preprint2022arXiv

CHANG-ES XXIX: The Sub-kpc Nuclear Bubble of NGC 4438

AGN bubbles could play an important role in accelerating high-energy CRs and galactic feedback. Only in nearby galaxies could we have high enough angular resolution in multi-wavelengths to study the sub-kpc environment of the AGN, where the bubbles are produced and strongly interact with the surrounding ISM. In this paper, we present the latest Chandra observations of the Virgo cluster galaxy NGC 4438, which hosts multi-scale bubbles detected in various bands. The galaxy also has low current star formation activity, so these bubbles are evidently produced by the AGN rather than a starburst. We present spatially resolved spectral analysis of the Chandra data of the $\sim3^{\prime\prime}\times5^{\prime\prime}$ ($\sim200{\rm~pc}\times350\rm~pc$) nuclear bubble of NGC 4438. The power law tail in the X-ray spectra can be most naturally explained as synchrotron emission from high-energy CR leptons. The hot gas temperature increases, while the overall contribution of the non-thermal X-ray emission decreases with the vertical distance from the galactic plane. We calculate the synchrotron cooling timescale of the CR leptons responsible for the non-thermal hard X-ray emission to be only a few tens to a few hundreds of years. The thermal pressure of the hot gas is about three times the magnetic pressure, but the current data cannot rule out the possibility that they are still in pressure balance. The spatially resolved spectroscopy presented in this paper may have important constraints on how the AGN accelerates CRs and drives outflows. We also discover a transient X-ray source only $\sim5^{\prime\prime}$ from the nucleus of NGC 4438. The source was not detected in 2002 and 2008, but became quite X-ray bright in March 2020, with an average 0.5-7 keV luminosity of $\sim10^{39}\rm~ergs~s^{-1}$.

preprint2022arXiv

CHANG-ES. XXIV. First Detection of A Radio Nuclear Ring and Potential LLAGN in NGC 5792

We report the discoveries of a nuclear ring of diameter 10$\arcsec$ ($\sim$1.5 kpc) and a potential low luminosity active galactic nucleus (LLAGN) in the radio continuum emission map of the edge-on barred spiral galaxy NGC~5792. These discoveries are based on the Continuum Halos in Nearby Galaxies - an Expanded Very Large Array (VLA) Survey, as well as subsequent VLA observations of sub-arcsecond resolution. Using a mixture of H$α$ and 24 $μ$m calibration, we disentangle the thermal and non-thermal radio emission of the nuclear region, and derive a star formation rate (SFR) of $\sim 0.4~M_{\sun}$ yr$^{-1}$. We find that the nuclear ring is dominated by non-thermal synchrotron emission. The synchrotron-based SFR is about three times of the mixture-based SFR. This result indicates that the nuclear ring underwent more intense star-forming activity in the past, and now its star formation is in the low state. The sub-arcsecond VLA images resolve six individual knots on the nuclear ring. The equipartition magnetic field strength $B_{\rm eq}$ of the knots varies from 77 to 88 $μ$G. The radio ring surrounds a point-like faint radio core of $S_{\rm 6GHz}=(16\pm4)$ $μ$Jy with polarized lobes at the center of NGC~5792, which suggests an LLAGN with an Eddington ratio $\sim10^{-5}$. This radio nuclear ring is reminiscent of the Central Molecular Zone (CMZ) of the Galaxy. Both of them consist of a nuclear ring and LLAGN.

preprint2022arXiv

Constrained Prescriptive Trees via Column Generation

With the abundance of available data, many enterprises seek to implement data-driven prescriptive analytics to help them make informed decisions. These prescriptive policies need to satisfy operational constraints, and proactively eliminate rule conflicts, both of which are ubiquitous in practice. It is also desirable for them to be simple and interpretable, so they can be easily verified and implemented. Existing approaches from the literature center around constructing variants of prescriptive decision trees to generate interpretable policies. However, none of the existing methods are able to handle constraints. In this paper, we propose a scalable method that solves the constrained prescriptive policy generation problem. We introduce a novel path-based mixed-integer program (MIP) formulation which identifies a (near) optimal policy efficiently via column generation. The policy generated can be represented as a multiway-split tree which is more interpretable and informative than a binary-split tree due to its shorter rules. We demonstrate the efficacy of our method with extensive experiments on both synthetic and real datasets.

preprint2022arXiv

Cyber-Physical Vulnerability Assessment of P2P Energy Exchanges in Active Distribution Networks

Owing to the decreasing costs of distributed energy resources (DERs) as well as decarbonization policies, power systems are undergoing a modernization process. The large deployment of DERs together with internet of things (IoT) devices provide a platform for peer-to-peer (P2P) energy trading in active distribution networks. However, P2P energy trading with IoT devices have driven the grid more vulnerable to cyber-physical threats. To this end, in this paper, a resilience-oriented P2P energy exchange model is developed considering three phase unbalanced distribution systems. In addition, various scenarios for vulnerability assessment of P2P energy exchanges considering adverse prosumers and consumers, who provide false information regarding the price and quantity with the goal of maximum financial benefit and system operation disruption, are considered. Techno-economic survivability analysis against these attacks are investigated on a IEEE 13-node unbalanced distribution test system. Simulation results demonstrate that adverse peers can affect the physical operation of grid, maximize their benefits, and cause financial loss of other agents.

preprint2022arXiv

Data-driven discovery of quasi-disordered mechanical metamaterials failed progressively

Natural cellular materials, such as honeycombs, woods, foams, trabecular bones, plant parenchyma, and sponges, may benefit from the disorderliness within their internal microstructures to achieve damage tolerant behaviours. Inspired by this, we have created quasi-disordered truss metamaterials (QTMs) via introducing spatial coordinate perturbations or strut thickness variations to the perfect, periodic truss lattices. Numerical studies have suggested that the QTMs can exhibit either ductile, damage tolerant behaviours or sudden, catastrophic failure mode, depending on the distribution of the introduced disorderliness. A data-driven approach has been developed, combining deep-learning and global optimization algorithms, to tune the distribution of the disorderliness to achieve the damage tolerant QTM designs. A case study on the QTMs created from a periodic Face Centred Cubic (FCC) lattice has demonstrated that the optimised QTMs can achieve up to 100% increase in ductility at the expense of less than 5% stiffness and less than 10% tensile strength. Our results suggest a novel design pathway for architected materials to improve damage tolerance.

preprint2022arXiv

Deep Neural Network for Blind Visual Quality Assessment of 4K Content

The 4K content can deliver a more immersive visual experience to consumers due to the huge improvement of spatial resolution. However, existing blind image quality assessment (BIQA) methods are not suitable for the original and upscaled 4K contents due to the expanded resolution and specific distortions. In this paper, we propose a deep learning-based BIQA model for 4K content, which on one hand can recognize true and pseudo 4K content and on the other hand can evaluate their perceptual visual quality. Considering the characteristic that high spatial resolution can represent more abundant high-frequency information, we first propose a Grey-level Co-occurrence Matrix (GLCM) based texture complexity measure to select three representative image patches from a 4K image, which can reduce the computational complexity and is proven to be very effective for the overall quality prediction through experiments. Then we extract different kinds of visual features from the intermediate layers of the convolutional neural network (CNN) and integrate them into the quality-aware feature representation. Finally, two multilayer perception (MLP) networks are utilized to map the quality-aware features into the class probability and the quality score for each patch respectively. The overall quality index is obtained through the average pooling of patch results. The proposed model is trained through the multi-task learning manner and we introduce an uncertainty principle to balance the losses of the classification and regression tasks. The experimental results show that the proposed model outperforms all compared BIQA metrics on four 4K content quality assessment databases.

preprint2022arXiv

Enabling Relative Localization for Nanodrone Swarm Platooning

Nanodrone swarm is formulated by multiple light-weight and low-cost nanodrones to perform the tasks in very challenging environments. Therefore, it is essential to estimate the relative position of nanodrones in the swarm for accurate and safe platooning in inclement indoor environment. However, the vision and infrared sensors are constrained by the line-of-sight perception, and instrumenting extra motion sensors on drone&#39;s body is constrained by the nanodrone&#39;s form factor and energy-efficiency. This paper presents the design, implementation and evaluation of RFDrone, a system that can sense the relative position of nanodrone in the swarm using wireless signals, which can naturally identify each individual nanodrone. To do so, each light-weight nanodrone is attached with a RF sticker (i.e., called RFID tag), which will be localized by the external RFID reader in the inclement indoor environment. Instead of accurately localizing each RFID-tagged nanodrone, we propose to estimate the relative position of all the RFID-tagged nanodrones in the swarm based on the spatial-temporal phase profiling. We implement an end-to-end physical prototype of RFDrone. Our experimental results show that RFDrone can accurately estimate the relative position of nanodrones in the swarm with average relative localization accuracy of around 0.95 across x, y and z axis, and average accuracy of around 0.93 for nanodrone swarm&#39;s geometry estimation.

preprint2022arXiv

End-to-End Jet Classification of Boosted Top Quarks with the CMS Open Data

We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector representation of the high-energy collision event. In this study, we use low-level detector information from the simulated CMS Open Data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves an AUC score of 0.975$\pm$0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier AUC score increases to 0.9824$\pm$0.0013, serving as the first performance benchmark for these CMS Open Data samples. We additionally provide a timing performance comparison of different processor unit architectures for training the network.

preprint2022arXiv

First Lattice QCD determination of semileptonic decays of charmed-strange baryons $Ξ_c$

While the standard model is the most successfully theory to describe all interactions and constituents in elementary particle physics, it has been constantly examined for over four decades. Weak decays of charm quarks can measure the coupling strength of quarks in different families and serve as an ideal probe for CP violation. As the lowest charm-strange baryons with three different flavors, $Ξ_c$ baryons (made of $csu$ or $csd$) have been extensively studied in experiments at the large hadron collider and in electron-positron collision. However the lack of reliable knowledge in theory becomes the unavoidable obstacle in the way. In this work, we use the state-of-the-art Lattice QCD techniques, and generate 2+1 clover fermion ensembles with two lattice spacings, $a=(0.108{\rm fm},0.080{\rm fm})$. We then present the first {\it ab-initio} lattice QCD determination of form factors governing $Ξ_{c}\to Ξ\ell^+ν_{\ell}$, analogous with the notable $β$-decay of nuclei. Our theoretical results for decay widths are consistent with and about two times more precise than the latest measurements by ALICE and Belle collaborations. Together with experimental measurements, we independently determine the quark-mixing matrix element $|V_{cs}|$, which is found in good agreement with other determinations.

preprint2022arXiv

Hydrogen and Battery Storage Technologies for Low Cost Energy Decarbonization in Distribution Networks

Deep energy decarbonization cannot be achieved without high penetration of renewables. At higher renewable energy penetrations, the variability and intermittent nature of solar photovoltaic (PV) electricity can cause ramping issues with existing fossil fuel generation, requiring longer term energy storage to increase the reliability of grid operation. A proton exchange membrane electrolyzer can produce H2and serves as a utility controllable load. The produced H2 can then be stored and converted back into electricity, or mixed with natural gas, or used as transportation fuel, or chemical feedstock. This paper considers the perspective of the distribution system operator that operates the distributed energy resources on a standard IEEE 33-node distribution network considering the technical and physical constraints with the goal of minimizing total investment and operation cost. Different case studies, at very high PV penetrations are considered to show the challenges and path to net-zero emission energy production using H2 energy. Sensitivity of utility PV costs and electrolyzer capital costs on producing H2 at $1/kg are presented showing that the distribution network could produce 100% renewable electricity and H2 could be produced with the same price by 2050 with conservative cost estimates and by 2030 with accelerated cost declines.

preprint2022arXiv

Moment ratio inequality of bivariate Gaussian distribution and three-dimensional Gaussian product inequality

We prove the three-dimensional Gaussian product inequality (GPI) $E[X_1^{2}X_2^{2m_2}X_3^{2m_3}]\ge E[X_1^{2}]E[X_2^{2m_2}]E[X_3^{2m_3}]$ for any centered Gaussian random vector $(X_1,X_2,X_3)$ and $m_2,m_3\in\mathbb{N}$. We discover a novel inequality for the moment ratio $\frac{|E[ X_2^{2m_2+1}X_3^{2m_3+1}]|}{E[ X_2^{2m_2}X_3^{2m_3}]}$, which implies the 3D-GPI. The interplay between computing and hard analysis plays a crucial role in the proofs.

preprint2022arXiv

Multitask Balanced and Recalibrated Network for Medical Code Prediction

Human coders assign standardized medical codes to clinical documents generated during patients&#39; hospitalization, which is error-prone and labor-intensive. Automated medical coding approaches have been developed using machine learning methods such as deep neural networks. Nevertheless, automated medical coding is still challenging because of the imbalanced class problem, complex code association, and noise in lengthy documents. To solve these issues, we propose a novel neural network called Multitask Balanced and Recalibrated Neural Network. Significantly, the multitask learning scheme shares the relationship knowledge between different code branches to capture the code association. A recalibrated aggregation module is developed by cascading convolutional blocks to extract high-level semantic features that mitigate the impact of noise in documents. Also, the cascaded structure of the recalibrated module can benefit the learning from lengthy notes. To solve the class imbalanced problem, we deploy the focal loss to redistribute the attention of low and high-frequency medical codes. Experimental results show that our proposed model outperforms competitive baselines on a real-world clinical dataset MIMIC-III.

preprint2022arXiv

No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models

To improve the viewer&#39;s Quality of Experience (QoE) and optimize computer graphics applications, 3D model quality assessment (3D-QA) has become an important task in the multimedia area. Point cloud and mesh are the two most widely used digital representation formats of 3D models, the visual quality of which is quite sensitive to lossy operations like simplification and compression. Therefore, many related studies such as point cloud quality assessment (PCQA) and mesh quality assessment (MQA) have been carried out to measure the visual quality degradations of 3D models. However, a large part of previous studies utilize full-reference (FR) metrics, which indicates they can not predict the quality level with the absence of the reference 3D model. Furthermore, few 3D-QA metrics consider color information, which significantly restricts their effectiveness and scope of application. In this paper, we propose a no-reference (NR) quality assessment metric for colored 3D models represented by both point cloud and mesh. First, we project the 3D models from 3D space into quality-related geometry and color feature domains. Then, the 3D natural scene statistics (3D-NSS) and entropy are utilized to extract quality-aware features. Finally, machine learning is employed to regress the quality-aware features into visual quality scores. Our method is validated on the colored point cloud quality assessment database (SJTU-PCQA), the Waterloo point cloud assessment database (WPC), and the colored mesh quality assessment database (CMDM). The experimental results show that the proposed method outperforms most compared NR 3D-QA metrics with competitive computational resources and greatly reduces the performance gap with the state-of-the-art FR 3D-QA metrics. The code of the proposed model is publicly available now to facilitate further research.

preprint2022arXiv

Nuclear mass table in deformed relativistic Hartree-Bogoliubov theory in continuum: I. even-even nuclei

Ground-state properties of even-even nuclei with $8\le Z\le120$ from the proton drip line to the neutron drip line have been investigated using the deformed relativistic Hartree-Bogoliubov theory in continuum (DRHBc) with the density functional PC-PK1. With the effects of deformation and continuum included simultaneously, 2583 even-even nuclei are predicted to be bound. The calculated binding energies, two-nucleon separation energies, root-mean-square (rms) radii of neutron, proton, matter, and charge distributions, quadrupole deformations, and neutron and proton Fermi surfaces are tabulated and compared with available experimental data. The rms deviation from the 637 mass data is 1.518 MeV, providing one of the best microscopic descriptions for nuclear masses. The drip lines obtained from DRHBc calculations are compared with other calculations, including the spherical relativistic continuum Hartree-Bogoliubov (RCHB) and triaxial relativistic Hartree-Bogoliubov (TRHB) calculations with PC-PK1. The deformation and continuum effects on the limits of the nuclear landscape are discussed. Possible peninsulas consisting of bound nuclei beyond the two-neutron drip line are predicted. The systematics of the two-nucleon separation energies, two-nucleon gaps, rms radii, quadrupole deformations, potential energy curves, neutron densities, neutron mean-field potentials, and pairing energies in the DRHBc calculations are also discussed. In addition, the $α$ decay energies extracted are in good agreement with available data.

preprint2022arXiv

Perceptual Quality Assessment for Fine-Grained Compressed Images

Recent years have witnessed the rapid development of image storage and transmission systems, in which image compression plays an important role. Generally speaking, image compression algorithms are developed to ensure good visual quality at limited bit rates. However, due to the different compression optimization methods, the compressed images may have different levels of quality, which needs to be evaluated quantificationally. Nowadays, the mainstream full-reference (FR) metrics are effective to predict the quality of compressed images at coarse-grained levels (the bit rates differences of compressed images are obvious), however, they may perform poorly for fine-grained compressed images whose bit rates differences are quite subtle. Therefore, to better improve the Quality of Experience (QoE) and provide useful guidance for compression algorithms, we propose a full-reference image quality assessment (FR-IQA) method for compressed images of fine-grained levels. Specifically, the reference images and compressed images are first converted to $YCbCr$ color space. The gradient features are extracted from regions that are sensitive to compression artifacts. Then we employ the Log-Gabor transformation to further analyze the texture difference. Finally, the obtained features are fused into a quality score. The proposed method is validated on the fine-grained compression image quality assessment (FGIQA) database, which is especially constructed for assessing the quality of compressed images with close bit rates. The experimental results show that our metric outperforms mainstream FR-IQA metrics on the FGIQA database. We also test our method on other commonly used compression IQA databases and the results show that our method obtains competitive performance on the coarse-grained compression IQA databases as well.

preprint2022arXiv

Periodic measures for a class of SPDEs with regime-switching

We use the variational approach to investigate periodic measures for a class of SPDEs with regime-switching. The hybrid system is driven by degenerate Lévy noise. We use the Lyapunov function method to study the existence of periodic measures and show the uniqueness of periodic measures by establishing the strong Feller property and irreducibility of the associated time-inhomogeneous semigroup. The main results are applied to stochastic porous media equations with regime-switching.

preprint2022arXiv

Smooth Solutions to Asymptotic Plateau Type Problem in Hyperbolic Space

We investigate on the existence of smooth complete hypersurface with prescribed Weingarten curvature and asymptotic boundary at infinity in hyperbolic space under the assumption that there exists an asymptotic subsolution. We give an affirmative answer for the case $k = n$ when the asymptotic boundary $Γ$ bounds a uniformly convex domain, and for $k < n$ when $Γ$ bounds a disk, utilizing Pogorelov type interior second order estimate. Our result complements our previous work \cite{Sui2019, Sui-Sun}, and generalizes the asymptotic Plateau type problem to non-constant prescribed curvature case.

preprint2022arXiv

Some New Gaussian Product Inequalities

The Gaussian product inequality is a long-standing conjecture. In this paper, we investigate the three-dimensional inequality $E[X_1^{2}X_2^{2m_2}X_3^{2m_3}]\ge E[X_1^{2}]E[X_2^{2m_2}]E[X_3^{2m_3}]$ for any centered Gaussian random vector $(X_1,X_2,X_3)$ and $m_2,m_3\in\mathbb{N}$. First, we show that this inequality is implied by a combinatorial inequality. The combinatorial inequality can be verified directly for small values of $m_2$ and arbitrary $m_3$. Hence the corresponding cases of the three-dimensional inequality are proved. Second, we show that the three-dimensional inequality is equivalent to an improved Cauchy-Schwarz inequality. This observation leads us to derive some novel moment inequalities for bivariate Gaussian random variables.

preprint2022arXiv

Spatial Aware Multi-Task Learning Based Speech Separation

During the Covid, online meetings have become an indispensable part of our lives. This trend is likely to continue due to their convenience and broad reach. However, background noise from other family members, roommates, office-mates not only degrades the voice quality but also raises serious privacy issues. In this paper, we develop a novel system, called Spatial Aware Multi-task learning-based Separation (SAMS), to extract audio signals from the target user during teleconferencing. Our solution consists of three novel components: (i) generating fine-grained location embeddings from the user&#39;s voice and inaudible tracking sound, which contains the user&#39;s position and rich multipath information, (ii) developing a source separation neural network using multi-task learning to jointly optimize source separation and location, and (iii) significantly speeding up inference to provide a real-time guarantee. Our testbed experiments demonstrate the effectiveness of our approach

preprint2022arXiv

Subjective Quality Assessment for Images Generated by Computer Graphics

With the development of rendering techniques, computer graphics generated images (CGIs) have been widely used in practical application scenarios such as architecture design, video games, simulators, movies, etc. Different from natural scene images (NSIs), the distortions of CGIs are usually caused by poor rending settings and limited computation resources. What&#39;s more, some CGIs may also suffer from compression distortions in transmission systems like cloud gaming and stream media. However, limited work has been put forward to tackle the problem of computer graphics generated images&#39; quality assessment (CG-IQA). Therefore, in this paper, we establish a large-scale subjective CG-IQA database to deal with the challenge of CG-IQA tasks. We collect 25,454 in-the-wild CGIs through previous databases and personal collection. After data cleaning, we carefully select 1,200 CGIs to conduct the subjective experiment. Several popular no-reference image quality assessment (NR-IQA) methods are tested on our database. The experimental results show that the handcrafted-based methods achieve low correlation with subjective judgment and deep learning based methods obtain relatively better performance, which demonstrates that the current NR-IQA models are not suitable for CG-IQA tasks and more effective models are urgently needed.

preprint2022arXiv

The Glueball content of $η_c$

We carry out the first lattice QCD derivation of the mixing energy and the mixing angle of the pseudoscalar charmonium and glueball on two gauge ensembles with $N_f=2$ degenerate dynamical charm quarks. The mixing energy is determined to be $49(6)$ MeV on the near physical charm ensemble, which seems insensitive to charm quark mass. By the assumption that $X(2370)$ is predominantly a pseudoscalar glueball, the mixing angle is determined to be approximately $4.6(6)^\circ$, which results in a $+3.9(9)$ MeV mass shift of the ground state pseudoscalar charmonium. In the mean time, the mixing can raise the total width of the pseudoscalar charmonium by 7.2(8) MeV, which explains to some extent the relative large total width of the $η_c$ meson. As a result, the branching fraction of $η_c\to γγ$ can be understood in this $c\bar{c}$-glueball mixing framework. On the other hand, the possible discrepancy of the theoretical predictions and the experimental results of the partial width of $J/ψ\toγη_c$ cannot be alleviated by the $c\bar{c}$-glueball mixing picture yet, which demands future precise experimental measurements of this partial width.

preprint2022arXiv

Towards Learning in Grey Spatiotemporal Systems: A Prophet to Non-consecutive Spatiotemporal Dynamics

Spatiotemporal forecasting is an imperative topic in data science due to its diverse and critical applications in smart cities. Existing works mostly perform consecutive predictions of following steps with observations completely and continuously obtained, where nearest observations can be exploited as key knowledge for instantaneous status estimation. However, the practical issues of early activity planning and sensor failures elicit a brand-new task, i.e., non-consecutive forecasting. In this paper, we define spatiotemporal learning systems with missing observation as Grey Spatiotemporal Systems (G2S) and propose a Factor-Decoupled learning framework for G2S (FDG2S), where the core idea is to hierarchically decouple multi-level factors and enable both flexible aggregations and disentangled uncertainty estimations. Firstly, to compensate for missing observations, a generic semantic-neighboring sequence sampling is devised, which selects representative sequences to capture both periodical regularity and instantaneous variations. Secondly, we turn the predictions of non-consecutive statuses into inferring statuses under expected combined exogenous factors. In particular, a factor-decoupled aggregation scheme is proposed to decouple factor-induced predictive intensity and region-wise proximity by two energy functions of conditional random field. To infer region-wise proximity under flexible factor-wise combinations and enable dynamic neighborhood aggregations, we further disentangle compounded influences of exogenous factors on region-wise proximity and learn to aggregate them. Given the inherent incompleteness and critical applications of G2S, a DisEntangled Uncertainty Quantification is put forward, to identify two types of uncertainty for reliability guarantees and model interpretations.

preprint2022arXiv

Video-based Human-Object Interaction Detection from Tubelet Tokens

We present a novel vision Transformer, named TUTOR, which is able to learn tubelet tokens, served as highly-abstracted spatiotemporal representations, for video-based human-object interaction (V-HOI) detection. The tubelet tokens structurize videos by agglomerating and linking semantically-related patch tokens along spatial and temporal domains, which enjoy two benefits: 1) Compactness: each tubelet token is learned by a selective attention mechanism to reduce redundant spatial dependencies from others; 2) Expressiveness: each tubelet token is enabled to align with a semantic instance, i.e., an object or a human, across frames, thanks to agglomeration and linking. The effectiveness and efficiency of TUTOR are verified by extensive experiments. Results shows our method outperforms existing works by large margins, with a relative mAP gain of $16.14\%$ on VidHOI and a 2 points gain on CAD-120 as well as a $4 \times$ speedup.

preprint2022arXiv

Visual-Assisted Sound Source Depth Estimation in the Wild

Depth estimation enables a wide variety of 3D applications, such as robotics, autonomous driving, and virtual reality. Despite significant work in this area, it remains open how to enable accurate, low-cost, high-resolution, and large-range depth estimation. Inspired by the flash-to-bang phenomenon (i.e. hearing the thunder after seeing the lightning), this paper develops FBDepth, the first audio-visual depth estimation framework. It takes the difference between the time-of-flight (ToF) of the light and the sound to infer the sound source depth. FBDepth is the first to incorporate video and audio with both semantic features and spatial hints for range estimation. It first aligns correspondence between the video track and audio track to locate the target object and target sound in a coarse granularity. Based on the observation of moving objects&#39; trajectories, FBDepth proposes to estimate the intersection of optical flow before and after the sound production to locate video events in time. FBDepth feeds the estimated timestamp of the video event and the audio clip for the final depth estimation. We use a mobile phone to collect 3000+ video clips with 20 different objects at up to $50m$. FBDepth decreases the Absolute Relative error (AbsRel) by 55\% compared to RGB-based methods.

preprint2022arXiv

Weak Random Periodic Solutions of Random Dynamical Systems

We first introduce the concept of weak random periodic solutions of random dynamical systems. Then, we discuss the existence of such periodic solutions. Further, we introduce the definition of weak random periodic measures and study their relationship with weak random periodic solutions. In particular, we establish the existence of invariant measures of random dynamical systems by virtue of their weak random periodic solutions. Finally, we use concrete examples to illustrate the weak random periodic phenomena of dynamical systems induced by random and stochastic differential equations.

preprint2021arXiv

Construction of Explicit Symplectic Integrators in General Relativity. I. Schwarzschild Black Holes

Symplectic integrators that preserve the geometric structure of Hamiltonian flows and do not exhibit secular growth in energy errors are suitable for the long-term integration of N-body Hamiltonian systems in the solar system. However, the construction of explicit symplectic integrators is frequently difficult in general relativity because all variables are inseparable. Moreover, even if two analytically integrable splitting parts exist in a relativistic Hamiltonian, all analytical solutions are not explicit functions of proper time. Naturally, implicit symplectic integrators, such as the midpoint rule, are applicable to this case. In general, these integrators are numerically more expensive to solve than same-order explicit symplectic algorithms. To address this issue, we split the Hamiltonian of Schwarzschild spacetime geometry into four integrable parts with analytical solutions as explicit functions of proper time. In this manner, second- and fourth-order explicit symplectic integrators can be easily made available. The new algorithms are also useful for modeling the chaotic motion of charged particles around a black hole with an external magnetic field. They demonstrate excellent long-term performance in maintaining bounded Hamiltonian errors and saving computational cost when appropriate proper time steps are adopted.

preprint2021arXiv

Construction of explicit symplectic integrators in general relativity. II. Reissner-Nordstrom black holes

In a previous paper, second- and fourth-order explicit symplectic integrators were designed for a Hamiltonian of the Schwarzschild black hole. Following this work, we continue to trace the possibility of the construction of explicit symplectic integrators for a Hamiltonian of charged particles moving around a Reissner-Nordstrom black hole with an external magnetic field. Such explicit symplectic methods are still available when the Hamiltonian is separated into five independently integrable parts with analytical solutions as explicit functions of proper time. Numerical tests show that the proposed algorithms share the desirable properties in their long-term stability, precision and efficiency for appropriate choices of step sizes. For the applicability of one of the new algorithms, the effects of the black hole&#39;s charge, the Coulomb part of the electromagnetic potential and the magnetic parameter on the dynamical behavior are surveyed. Under some circumstances, the extent of chaos gets strong with an increase of the magnetic parameter from a global phase-space structure. No the variation of the black hole&#39;s charge but the variation of the Coulomb part is considerably sensitive to affect the regular and chaotic dynamics of particles&#39; orbits. A positive Coulomb part is easier to induce chaos than a negative one.

preprint2021arXiv

Laser Cooling of Germanium Semiconductor Nanocrystals

Laser cooling of matter through anti-Stokes photoluminescence, where the emitted frequency of light exceeds that of the impinging laser by virtue of absorption of thermal vibrational energy, has been successfully realized in condensed media, and in particular with rare earth doped systems achieving sub-100K solid state optical refrigeration. Studies suggest that laser cooling in semiconductors has the potential of achieving temperatures down to ~10K and that its direct integration can usher unique high-performance nanostructured semiconductor devices. While laser cooling of nanostructured II-VI semiconductors has been reported recently, laser cooling of indirect bandgap semiconductors such as group IV silicon and germanium remains a major challenge. Here we report on the anomalous observation of dominant anti-Stokes photoluminescence in germanium nanocrystals. We attribute this result to the confluence of ultra-high purity nanocrystal germanium, generation of high density of electron-hole plasma, the inherent degeneracy of longitudinal and transverse optical phonons in non-polar indirect bandgap semiconductors, and commensurate spatial confinement effects. At high laser intensities, laser cooling with lattice temperature as low as ~50K is inferred.

preprint2021arXiv

Sum-Rate Maximization in Distributed Intelligent Reflecting Surfaces-Aided mmWave Communications

In this paper, we focus on the sum-rate optimization in a multi-user millimeter-wave (mmWave) system with distributed intelligent reflecting surfaces (D-IRSs), where a base station (BS) communicates with users via multiple IRSs. The BS transmit beamforming, IRS switch vector, and phase shifts of the IRS are jointly optimized to maximize the sum-rate under minimum user rate, unit-modulus, and transmit power constraints. To solve the resulting non-convex optimization problem, we develop an efficient alternating optimization (AO) algorithm. Specifically, the non-convex problem is converted into three subproblems, which are solved alternatively. The solution to transmit beamforming at the BS and the phase shifts at the IRS are derived by using the successive convex approximation (SCA)-based algorithm, and a greedy algorithm is proposed to design the IRS switch vector. The complexity of the proposed AO algorithm is analyzed theoretically. Numerical results show that the D-IRSs-aided scheme can significantly improve the sum-rate and energy efficiency performance.

preprint2021arXiv

TeethTap: Recognizing Discrete Teeth Gestures Using Motion and Acoustic Sensing on an Earpiece

Teeth gestures become an alternative input modality for different situations and accessibility purposes. In this paper, we present TeethTap, a novel eyes-free and hands-free input technique, which can recognize up to 13 discrete teeth tapping gestures. TeethTap adopts a wearable 3D printed earpiece with an IMU sensor and a contact microphone behind both ears, which works in tandem to detect jaw movement and sound data, respectively. TeethTap uses a support vector machine to classify gestures from noise by fusing acoustic and motion data, and implements K-Nearest-Neighbor (KNN) with a Dynamic Time Warping (DTW) distance measurement using motion data for gesture classification. A user study with 11 participants demonstrated that TeethTap could recognize 13 gestures with a real-time classification accuracy of 90.9% in a laboratory environment. We further uncovered the accuracy differences on different teeth gestures when having sensors on single vs. both sides. Moreover, we explored the activation gesture under real-world environments, including eating, speaking, walking and jumping. Based on our findings, we further discussed potential applications and practical challenges of integrating TeethTap into future devices.

preprint2020arXiv

A covariance-enhanced approach to multi-tissue joint eQTL mapping with application to transcriptome-wide association studies

Transcriptome-wide association studies based on genetically predicted gene expression have the potential to identify novel regions associated with various complex traits. It has been shown that incorporating expression quantitative trait loci (eQTLs) corresponding to multiple tissue types can improve power for association studies involving complex etiology. In this article, we propose a new multivariate response linear regression model and method for predicting gene expression in multiple tissues simultaneously. Unlike existing methods for multi-tissue joint eQTL mapping, our approach incorporates tissue-tissue expression correlation, which allows us to more efficiently handle missing expression measurements and more accurately predict gene expression using a weighted summation of eQTL genotypes. We show through simulation studies that our approach performs better than the existing methods in many scenarios. We use our method to estimate eQTL weights for 29 tissues collected by GTEx, and show that our approach significantly improves expression prediction accuracy compared to competitors. Using our eQTL weights, we perform a multi-tissue-based S-MultiXcan transcriptome-wide association study and show that our method leads to more discoveries in novel regions and more discoveries overall than the existing methods. Estimated eQTL weights are available for download online at github.com/ajmolstad/MTeQTLResults.

preprint2020arXiv

A Data-Driven Network Model for the Emerging COVID-19 Epidemics in Wuhan, Toronto and Italy

The ongoing Coronavirus Disease 2019 (COVID-19) pandemic threatens the health of humans and causes great economic losses. Predictive modelling and forecasting the epidemic trends are essential for developing countermeasures to mitigate this pandemic. We develop a network model, where each node represents an individual and the edges represent contacts between individuals where the infection can spread. The individuals are classified based on the number of contacts they have each day (their node degrees) and their infection status. The transmission network model was respectively fitted to the reported data for the COVID-19 epidemic in Wuhan (China), Toronto (Canada), and the Italian Republic using a Markov Chain Monte Carlo (MCMC) optimization algorithm. Our model fits all three regions well with narrow confidence intervals and could be adapted to simulate other megacities or regions. The model projections on the role of containment strategies can help inform public health authorities to plan control measures.

preprint2020arXiv

C*-algebras of a Cantor system with finitely many minimal subsets: structures, K-theories, and the index map

We study homeomorphisms of a Cantor set with $k$ ($k < +\infty$) minimal invariant closed (but not open) subsets; we also study crossed product C*-algebras associated to these Cantor systems and their certain orbit-cut sub-C*-algebras. In the case that $k\geq 2$, the crossed product C*-algebra is stably finite, has stable rank 2, and has real rank zero if in addition $(X, σ)$ is aperiodic. The image of the index map is connected to certain directed graphs arising from the Bratteli-Vershik-Kakutani model of the Cantor system. Using this, it is shown that the ideal of the Bratteli diagram (of the Bratteli-Vershik-Kakutani model) must have at least $k$ vertices at each level, and the image of the index map must consist infinitesimals.

preprint2020arXiv

Deformed relativistic Hartree-Bogoliubov theory in continuum with point coupling functional: examples of even-even Nd isotopes

The aim of this work is to develop the deformed relativistic Hartree-Bogoliubov theory in continuum (DRHBc) theory based on the point-coupling density functionals and extend it to provide a unified description for all even-even nuclei in the nuclear chart by overcoming all possible challenges. The nuclear superfluidity is considered via Bogoliubov transformation. Densities and potentials are expanded in terms of Legendre polynomials to include the axial deformation degrees of freedom. Sophisticated relativistic Hartree-Bogoliubov equations in coordinate space are solved in the DiracWoods-Saxon basis to consider the continuum effects. Numerical checks are performed from light nuclei to heavy nuclei. The techniques to construct the DRHBc mass table for even-even nuclei are explored. The DRHBc theory is extended to study heavier nuclei beyond magnesium isotopes. Taking Nd isotopes as examples, the experimental binding energies, two-neutron separation energies, quadrupole deformations, and charge radii are reproduced rather well. The deformation and continuum play essential roles in the description of nuclear masses and prediction of drip-line nuclei. By examining the single-particle levels in the canonical basis and their contributions to the total density, the thickness of the neutron skin, the particles number in continuum, and the Coulomb barrier, the exotic structures including the neutron skin and the proton radioactivity are predicted.

preprint2020arXiv

Fatigue-aware Bandits for Dependent Click Models

As recommender systems send a massive amount of content to keep users engaged, users may experience fatigue which is contributed by 1) an overexposure to irrelevant content, 2) boredom from seeing too many similar recommendations. To address this problem, we consider an online learning setting where a platform learns a policy to recommend content that takes user fatigue into account. We propose an extension of the Dependent Click Model (DCM) to describe users&#39; behavior. We stipulate that for each piece of content, its attractiveness to a user depends on its intrinsic relevance and a discount factor which measures how many similar contents have been shown. Users view the recommended content sequentially and click on the ones that they find attractive. Users may leave the platform at any time, and the probability of exiting is higher when they do not like the content. Based on user&#39;s feedback, the platform learns the relevance of the underlying content as well as the discounting effect due to content fatigue. We refer to this learning task as &#34;fatigue-aware DCM Bandit&#34; problem. We consider two learning scenarios depending on whether the discounting effect is known. For each scenario, we propose a learning algorithm which simultaneously explores and exploits, and characterize its regret bound.

preprint2020arXiv

Improved Fault-Tolerant Quantum Simulation of Condensed-Phase Correlated Electrons via Trotterization

Recent work has deployed linear combinations of unitaries techniques to reduce the cost of fault-tolerant quantum simulations of correlated electron models. Here, we show that one can sometimes improve upon those results with optimized implementations of Trotter-Suzuki-based product formulas. We show that low-order Trotter methods perform surprisingly well when used with phase estimation to compute relative precision quantities (e.g. energies per unit cell), as is often the goal for condensed-phase systems. In this context, simulations of the Hubbard and plane-wave electronic structure models with $N < 10^5$ fermionic modes can be performed with roughly $O(1)$ and $O(N^2)$ T complexities. We perform numerics revealing tradeoffs between the error and gate complexity of a Trotter step; e.g., we show that split-operator techniques have less Trotter error than popular alternatives. By compiling to surface code fault-tolerant gates and assuming error rates of one part per thousand, we show that one can error-correct quantum simulations of interesting, classically intractable instances with a few hundred thousand physical qubits.

preprint2020arXiv

Joint Communication and Computational Resource Allocation for QoE-driven Point Cloud Video Streaming

Point cloud video is the most popular representation of hologram, which is the medium to precedent natural content in VR/AR/MR and is expected to be the next generation video. Point cloud video system provides users immersive viewing experience with six degrees of freedom and has wide applications in many fields such as online education, entertainment. To further enhance these applications, point cloud video streaming is in critical demand. The inherent challenges lie in the large size by the necessity of recording the three-dimensional coordinates besides color information, and the associated high computation complexity of encoding. To this end, this paper proposes a communication and computation resource allocation scheme for QoE-driven point cloud video streaming. In particular, we maximize system resource utilization by selecting different quantities, transmission forms and quality level tiles to maximize the quality of experience. Extensive simulations are conducted and the simulation results show the superior performance over the existing schemes

preprint2020arXiv

Learning to Zoom-in via Learning to Zoom-out: Real-world Super-resolution by Generating and Adapting Degradation

Most learning-based super-resolution (SR) methods aim to recover high-resolution (HR) image from a given low-resolution (LR) image via learning on LR-HR image pairs. The SR methods learned on synthetic data do not perform well in real-world, due to the domain gap between the artificially synthesized and real LR images. Some efforts are thus taken to capture real-world image pairs. The captured LR-HR image pairs usually suffer from unavoidable misalignment, which hampers the performance of end-to-end learning, however. Here, focusing on the real-world SR, we ask a different question: since misalignment is unavoidable, can we propose a method that does not need LR-HR image pairing and alignment at all and utilize real images as they are? Hence we propose a framework to learn SR from an arbitrary set of unpaired LR and HR images and see how far a step can go in such a realistic and &#34;unsupervised&#34; setting. To do so, we firstly train a degradation generation network to generate realistic LR images and, more importantly, to capture their distribution (i.e., learning to zoom out). Instead of assuming the domain gap has been eliminated, we minimize the discrepancy between the generated data and real data while learning a degradation adaptive SR network (i.e., learning to zoom in). The proposed unpaired method achieves state-of-the-art SR results on real-world images, even in the datasets that favor the paired-learning methods more.

preprint2020arXiv

PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation

In this work, we present a novel data-driven method for robust 6DoF object pose estimation from a single RGBD image. Unlike previous methods that directly regressing pose parameters, we tackle this challenging task with a keypoint-based approach. Specifically, we propose a deep Hough voting network to detect 3D keypoints of objects and then estimate the 6D pose parameters within a least-squares fitting manner. Our method is a natural extension of 2D-keypoint approaches that successfully work on RGB based 6DoF estimation. It allows us to fully utilize the geometric constraint of rigid objects with the extra depth information and is easy for a network to learn and optimize. Extensive experiments were conducted to demonstrate the effectiveness of 3D-keypoint detection in the 6D pose estimation task. Experimental results also show our method outperforms the state-of-the-art methods by large margins on several benchmarks. Code and video are available at https://github.com/ethnhe/PVN3D.git.

preprint2020arXiv

QoE-driven Coupled Uplink and Downlink Rate Adaptation for 360-degree Video Live Streaming

360-degree video provides an immersive 360-degree viewing experience and has been widely used in many areas. The 360-degree video live streaming systems involve capturing, compression, uplink (camera to video server) and downlink (video server to user) transmissions. However, few studies have jointly investigated such complex systems, especially the rate adaptation for the coupled uplink and downlink in the 360-degree video streaming under limited bandwidth constraints. In this letter, we propose a quality of experience (QoE)-driven 360-degree video live streaming system, in which a video server performs rate adaptation based on the uplink and downlink bandwidths and information concerning each user&#39;s real-time field-of-view (FOV). We formulate it as a nonlinear integer programming problem and propose an algorithm, which combines the Karush-Kuhn-Tucker (KKT) condition and branch and bound method, to solve it. The numerical results show that the proposed optimization model can improve users&#39; QoE significantly in comparison with other baseline schemes.

preprint2020arXiv

Scalable Distributed Non-Convex ADMM-based Active Distribution System Service Restoration

Distributed restoration can harness distributed energy resources (DER) to enhance the resilience of active distribution networks. However, the large number of decision variables, especially the binary decision variables of reconfiguration, bring challenges on developing effective distributed distribution service restoration (DDSR) strategies. This paper proposes a scalable distributed optimization method based on the alternating direction method of multipliers (ADMM) for non-convex mixed-integer optimization problems and applies to develop the DDSR framework. The non-convex ADMM method consists of relax-drive-polish phases, 1) relaxing binary variables and applying the convex ADMM as a warm start; 2) driving the solutions toward Boolean values through a proximal operator; 3) fixing the obtained binary variables to polish continuous variables for a high-quality solution. Then, an autonomous clustering strategy together with consensus ADMM is developed to realize the distributed cluster-based framework of restoration. The nonconvex ADMM-based DDSR can determine DER scheduling and switch status for reconfiguration and load pickup in a distributed manner, energizing the out-of-service area from local faults or total blackouts in large-scale distribution networks. The effectiveness and scalability of the proposed DDSR framework are demonstrated through testing on the IEEE 123-node and IEEE 8500-node test feeders.

preprint2020arXiv

Secrecy Rate Maximization for Intelligent Reflecting Surface Aided SWIPT Systems

Simultaneous wireless information and power transfer (SWIPT) and intelligent reflecting surface (IRS) are two promising techniques for providing enhanced wireless communication capability and sustainable energy supply to energy-constrained wireless devices. Moreover, the combination of the IRS and the SWIPT can create the &#34;one plus one greater than two&#34; effect. However, due to the broadcast nature of wireless media, the IRS-aided SWIPT systems are vulnerable to eavesdropping. In this paper, we study the security issue of the IRS-aided SWIPT systems. The objective is to maximize the secrecy rate by jointly designing the transmit beamforming and artificial noise (AN) covariance matrix at a base station (BS) and reflective beamforming at an IRS, under transmit power constraint at the BS and energy harvesting (EH) constraints at multiple energy receivers. To tackle the formulated non-convex problem, we first employ an alternating optimization (AO) algorithm to decouple the coupling variables. Then, reflective beamforming, transmit beamforming and AN covariance matrix can be optimized by using a penalty-based algorithm and semidefinite relaxation (SDR) method, respectively. Simulation results demonstrate the effectiveness of the proposed scheme over baseline schemes.

preprint2020arXiv

The 4-D Gaussian Random Vector Maximum Conjecture and the 3-D Simplex Mean Width Conjecture

We prove the four-dimensional Gaussian random vector maximum conjecture. This conjecture asserts that among all centered Gaussian random vectors $X=(X_1,X_2,X_3,X_4)$ with $E[X_i^2]=1$, $1\le i\le 4$, the expectation $E[\max(X_1,X_2,X_3,X_4)]$ is maximal if and only if all off-diagonal elements of the covariance matrix equal $-\frac{1}{3}$. As a direct consequence, we resolve the three-dimensional simplex mean width conjecture. This latter conjecture is a long-standing open problem in convex geometry, which asserts that among all simplices inscribed into the three-dimensional unit Euclidean ball the regular simplex has the maximal mean width.

preprint2020arXiv

Towards High Throughput Wireless Network with Directional Antenna

In indoor areas such as homes and offices, high throughput communication for multiple devices is quickly becoming a necessity. Even though an access point (AP) mounted with an omni-directional antenna can cover a whole room, it cannot provide connections with high throughput throughout the room. Therefore, we propose, \emph DiRF, a directional antenna based wireless home network designed to achieve high throughput in indoor areas with densely deployed directional APs. \emph DiRF consists of a position based AP selection algorithm which can decrease latency accumulation caused by frequent AP switching, and a downlink packet scheduler which can reduce the downlink packet retransmissions during AP switching. We implement and evaluate \emph DiRF with six commercial APs, each connects with a single directional antenna. Our experiments show that \emph DiRF achieves a $3.16\times$ TCP throughput improvement, compared to the conventional scheme that only uses one AP mounted with one omni-directional antenna.

preprint2019arXiv

Complexity growth rate, grand potential and partition function

We examine the complexity/volume conjecture and further investigate the possible connections between complexity and partition function. The complexity/volume 2.0 states that the complexity growth rate $\mathcal{\dot{C}}\sim PV$. In the standard statistics, there is a fundamental relation among $PV$, the grand potential $Ω$ and the partition function $\mathcal{Z}$. By using this relation, we are able to construct an ansatz between complexity and partition function. The complexity/partition function relation is then utilized to study the complexity of the thermofield double state of extended SYK models for various conditions. The relation between complexity growth rate and black hole phase transition is also discussed.

preprint2016arXiv

Long distance co-propagation of quantum key distribution and terabit classical optical data channels

Quantum key distribution (QKD) generates symmetric keys between two remote parties, and guarantees the keys not accessible to any third party. Wavelength division multiplexing (WDM) between QKD and classical optical communications by sharing the existing fibre optics infrastructure is highly desired in order to reduce the cost of QKD applications. However, quantum signals are extremely weak and thus easily affected by the spontaneous Raman scattering effect from intensive classical light. Here, by means of wavelength selecting and spectral and temporal filtering, we realize the multiplexing and long distance co-propagation of QKD and Terabit classical coherent optical communication system up to 80km. The data capacity is two orders of magnitude larger than the previous results. Our demonstration verifies the feasibility of QKD and classical communication to share the resources of backbone fibre links, and thus taking the utility of QKD a great step forward.

preprint2015arXiv

Further study on Hunt&#39;s hypothesis (H) for Levy processes

Getoor&#39;s conjecture that essentially all Levy processes satisfy (H) is a long-standing open problem in potential theory. In the beginning of the paper, we summarize the main results obtained so far for the problem. Then, we present two new necessary and sufficient conditions for the validity of (H). Furthermore, we give applications of these new criteria. First, we give explicit constructions of Levy processes satisfying (H) in a context where previously known results could not be applied. Second, we show that a large class of pure jump subordinators can be decomposed into the summation of two independent subordinators such that both of them satisfy (H).