Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
23works
0followers
19topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

23 published item(s)

preprint2026arXiv

Towards Autonomous Business Intelligence via Data-to-Insight Discovery Agent

Transforming fragmented enterprise data into actionable insights remains a significant challenge for LLMs, constrained by complex database schemas, limitations in dynamic SQL generation, and the need for deep multi-dimensional analysis.In this paper, we propose AIDA(Autonomous Insight Discovery Agent), the first end-to-end framework designed for autonomous exploration in complex business environments. We establish a highly flexible instant retail environment encompassing 200+ metrics and 100+ dimensions, and integrates a proprietary Domain-Specific Language (DSL) that bridges semantic reasoning with precise SQL execution. Our reinforcement learning system subsequently formulates business analysis as a Pareto Principle-guided cumulative reasoning process. Experimental results demonstrate that AIDA significantly outperforms workflow-based agents, and extensive evaluations further reveal that AIDA achieves superior environmental perception and more in-depth analysis from diverse perspectives. Our work ultimately establishes the transformative potential of autonomous intelligence for industrial-scale business intelligence systems.

preprint2024arXiv

Derived Hall algebras of root categories

For a finitary hereditary abelian category $\mathcal{A}$, we define a derived Hall algebra of its root category by counting the triangles and using the octahedral axiom, which is proved to be isomorphic to the Drinfeld double of Hall algebra of $\mathcal{A}$. When applied to finite-dimensional nilpotent representations of the Jordan quiver or coherent sheaves over elliptic curves, these algebras provide categorical realizations of the ring of Laurent symmetric functions and also double affine Hecke algebras.

preprint2024arXiv

RustNeRF: Robust Neural Radiance Field with Low-Quality Images

Recent work on Neural Radiance Fields (NeRF) exploits multi-view 3D consistency, achieving impressive results in 3D scene modeling and high-fidelity novel-view synthesis. However, there are limitations. First, existing methods assume enough high-quality images are available for training the NeRF model, ignoring real-world image degradation. Second, previous methods struggle with ambiguity in the training set due to unmodeled inconsistencies among different views. In this work, we present RustNeRF for real-world high-quality NeRF. To improve NeRF's robustness under real-world inputs, we train a 3D-aware preprocessing network that incorporates real-world degradation modeling. We propose a novel implicit multi-view guidance to address information loss during image degradation and restoration. Extensive experiments demonstrate RustNeRF's advantages over existing approaches under real-world degradation. The code will be released.

preprint2022arXiv

Adaptive Patch Exiting for Scalable Single Image Super-Resolution

Since the future of computing is heterogeneous, scalability is a crucial problem for single image super-resolution. Recent works try to train one network, which can be deployed on platforms with different capacities. However, they rely on the pixel-wise sparse convolution, which is not hardware-friendly and achieves limited practical speedup. As image can be divided into patches, which have various restoration difficulties, we present a scalable method based on Adaptive Patch Exiting (APE) to achieve more practical speedup. Specifically, we propose to train a regressor to predict the incremental capacity of each layer for the patch. Once the incremental capacity is below the threshold, the patch can exit at the specific layer. Our method can easily adjust the trade-off between performance and efficiency by changing the threshold of incremental capacity. Furthermore, we propose a novel strategy to enable the network training of our method. We conduct extensive experiments across various backbones, datasets and scaling factors to demonstrate the advantages of our method. Code is available at https://github.com/littlepure2333/APE

preprint2022arXiv

Efficient Meta-Tuning for Content-aware Neural Video Delivery

Recently, Deep Neural Networks (DNNs) are utilized to reduce the bandwidth and improve the quality of Internet video delivery. Existing methods train corresponding content-aware super-resolution (SR) model for each video chunk on the server, and stream low-resolution (LR) video chunks along with SR models to the client. Although they achieve promising results, the huge computational cost of network training limits their practical applications. In this paper, we present a method named Efficient Meta-Tuning (EMT) to reduce the computational cost. Instead of training from scratch, EMT adapts a meta-learned model to the first chunk of the input video. As for the following chunks, it fine-tunes the partial parameters selected by gradient masking of previous adapted model. In order to achieve further speedup for EMT, we propose a novel sampling strategy to extract the most challenging patches from video frames. The proposed strategy is highly efficient and brings negligible additional cost. Our method significantly reduces the computational cost and achieves even better performance, paving the way for applying neural video delivery techniques to practical applications. We conduct extensive experiments based on various efficient SR architectures, including ESPCN, SRCNN, FSRCNN and EDSR-1, demonstrating the generalization ability of our work. The code is released at \url{https://github.com/Neural-video-delivery/EMT-Pytorch-ECCV2022}.

preprint2022arXiv

NFANet: A Novel Method for Weakly Supervised Water Extraction from High-Resolution Remote Sensing Imagery

The use of deep learning for water extraction requires precise pixel-level labels. However, it is very difficult to label high-resolution remote sensing images at the pixel level. Therefore, we study how to utilize point labels to extract water bodies and propose a novel method called the neighbor feature aggregation network (NFANet). Compared with pixellevel labels, point labels are much easier to obtain, but they will lose much information. In this paper, we take advantage of the similarity between the adjacent pixels of a local water-body, and propose a neighbor sampler to resample remote sensing images. Then, the sampled images are sent to the network for feature aggregation. In addition, we use an improved recursive training algorithm to further improve the extraction accuracy, making the water boundary more natural. Furthermore, our method utilizes neighboring features instead of global or local features to learn more representative features. The experimental results show that the proposed NFANet method not only outperforms other studied weakly supervised approaches, but also obtains similar results as the state-of-the-art ones.

preprint2022arXiv

Non-Hermiticity stabilized Majorana zero modes in semiconductor-superconductor nanowires

Coupled Majorana zero modes with nonzero energies are generally detrimental to the non-Abelian statistics due to the additional dynamic phase. Nevertheless, we show that a well-connected lead can introduce a local non-Hermitian dissipation term to shift the energies of the both coupled Majorana modes to zero, and surprisingly turn the coupled Majorana mode far from the lead into a dark Majorana mode with exponentially small dissipation. This dark Majorana mode can conquer the drawback of the partially overlapped Majorana zero modes and possess the properties of true Majorana zero mode such as the perfect fractional Josephson effect and the non-Abelian statistics.

preprint2022arXiv

NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at a fixed bit-rate. Besides, the quality enhancement of Tracks 1 and 3 targets at improving the fidelity (PSNR), and Track 2 targets at enhancing the perceptual quality. The three tracks totally attract 482 registrations. In the test phase, 12 teams, 8 teams and 11 teams submitted the final results of Tracks 1, 2 and 3, respectively. The proposed methods and solutions gauge the state-of-the-art of video quality enhancement. The homepage of the challenge: https://github.com/RenYang-home/NTIRE21_VEnh

preprint2022arXiv

Structure-aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation

Morphable models are essential for the statistical modeling of 3D faces. Previous works on morphable models mostly focus on large-scale facial geometry but ignore facial details. This paper augments morphable models in representing facial details by learning a Structure-aware Editable Morphable Model (SEMM). SEMM introduces a detail structure representation based on the distance field of wrinkle lines, jointly modeled with detail displacements to establish better correspondences and enable intuitive manipulation of wrinkle structure. Besides, SEMM introduces two transformation modules to translate expression blendshape weights and age values into changes in latent space, allowing effective semantic detail editing while maintaining identity. Extensive experiments demonstrate that the proposed model compactly represents facial details, outperforms previous methods in expression animation qualitatively and quantitatively, and achieves effective age editing and wrinkle line editing of facial details. Code and model are available at https://github.com/gerwang/facial-detail-manipulation.

preprint2022arXiv

Uncertainty Guided Depth Fusion for Spike Camera

Depth estimation is essential for various important real-world applications such as autonomous driving. However, it suffers from severe performance degradation in high-velocity scenario since traditional cameras can only capture blurred images. To deal with this problem, the spike camera is designed to capture the pixel-wise luminance intensity at high frame rate. However, depth estimation with spike camera remains very challenging using traditional monocular or stereo depth estimation algorithms, which are based on the photometric consistency. In this paper, we propose a novel Uncertainty-Guided Depth Fusion (UGDF) framework to fuse the predictions of monocular and stereo depth estimation networks for spike camera. Our framework is motivated by the fact that stereo spike depth estimation achieves better results at close range while monocular spike depth estimation obtains better results at long range. Therefore, we introduce a dual-task depth estimation architecture with a joint training strategy and estimate the distributed uncertainty to fuse the monocular and stereo results. In order to demonstrate the advantage of spike depth estimation over traditional camera depth estimation, we contribute a spike-depth dataset named CitySpike20K, which contains 20K paired samples, for spike depth estimation. UGDF achieves state-of-the-art results on CitySpike20K, surpassing all monocular or stereo spike depth estimation baselines. We conduct extensive experiments to evaluate the effectiveness and generalization of our method on CitySpike20K. To the best of our knowledge, our framework is the first dual-task fusion framework for spike camera depth estimation. Code and dataset will be released.

preprint2021arXiv

Deep Likelihood Network for Image Restoration with Multiple Degradation Levels

Convolutional neural networks have been proven effective in a variety of image restoration tasks. Most state-of-the-art solutions, however, are trained using images with a single particular degradation level, and their performance deteriorates drastically when applied to other degradation settings. In this paper, we propose deep likelihood network (DL-Net), aiming at generalizing off-the-shelf image restoration networks to succeed over a spectrum of degradation levels. We slightly modify an off-the-shelf network by appending a simple recursive module, which is derived from a fidelity term, for disentangling the computation for multiple degradation levels. Extensive experimental results on image inpainting, interpolation, and super-resolution show the effectiveness of our DL-Net.

preprint2021arXiv

Hall algebras and quantum symmetric pairs I: foundations

A quantum symmetric pair consists of a quantum group $\mathbf U$ and its coideal subalgebra ${\mathbf U}^{\imath}_{\boldsymbolς}$ with parameters $\boldsymbolς$ (called an $\imath$quantum group). We initiate a Hall algebra approach for the categorification of $\imath$quantum groups. A universal $\imath$quantum group $\widetilde{\mathbf U}^{\imath}$ is introduced and ${\mathbf U}^{\imath}_{\boldsymbolς}$ is recovered by a central reduction of $\widetilde{\mathbf U}^{\imath}$. The semi-derived Ringel-Hall algebras of the first author and Peng, which are closely related to semi-derived Hall algebras of Gorsky and motivated by Bridgeland's work, are extended to the setting of 1-Gorenstein algebras, as shown in Appendix A by the first author. A new class of 1-Gorenstein algebras (called $\imath$quiver algebras) arising from acyclic quivers with involutions is introduced. The semi-derived Ringel-Hall algebras for the Dynkin $\imath$quiver algebras are shown to be isomorphic to the universal quasi-split $\imath$quantum groups of finite type. Monomial bases and PBW bases for these Hall algebras and $\imath$quantum groups are constructed.

preprint2021arXiv

Hall algebras and quantum symmetric pairs III: Quiver varieties

The $\imath$quiver algebras were introduced recently by the authors to provide a Hall algebra realization of universal $\imath$quantum groups, which is a generalization of Bridgeland's Hall algebra construction for (Drinfeld doubles of) quantum groups; here an $\imath$quantum group and a corresponding Drinfeld-Jimbo quantum group form a quantum symmetric pair. In this paper, the Dynkin $\imath$quiver algebras are shown to arise as new examples of singular Nakajima-Keller-Scherotzke categories. Then we provide a geometric construction of the universal $\imath$quantum groups and their ``dual canonical bases" with positivity, via the quantum Grothendieck rings of Nakajima-Keller-Scherotzke quiver varieties, generalizing Qin's geometric realization of quantum groups of type ADE.

preprint2021arXiv

Realistic flat-band model based on degenerate $p$-orbitals in two-dimensional ionic materials

Though several theoretical models have been proposed to design electronic flat-bands, the definite experimental realization in two-dimensional atomic crystal is still lacking. Here we propose a novel and realistic flat-band model based on threefold degenerate $p$-orbitals in two-dimensional ionic materials. Our theoretical analysis and first-principles calculations show that the proposed flat-band can be realized in 1T layered materials of alkali-metal chalogenides and metal-carbon group compounds. Some of the former are theoretically predicted to be stable as layered materials (e.g., K$_2$S), and some of the latter have been experimentally fabricated in previous works (e.g., Gd$_2$CCl$_2$). More interestingly, the flat-band is partially filled in the heterostructure of a K$_2$S monolayer and graphene layers. The spin polarized nearly flat-band can be realized in the ferromagnetic state of a Gd$_2$CCl$_2$ monolayer, which has been fabricated in experiments. Our theoretical model together with the material predictions provide a realistic platform for the study of flat-bands and related exotic quantum phases.

preprint2021arXiv

Semi-derived Ringel-Hall algebras and Drinfeld double

Let $\mathcal{A}$ be an arbitrary hereditary abelian category that may not have enough projective objects. For example, $\mathcal{A}$ can be the category of finite-dimensional representations of a quiver or the category of coherent sheaves on a smooth projective curve or on a weighted projective line. Inspired by the works of Bridgeland and Gorsky, we define the semi-derived Ringel-Hall algebra of $\mathcal{A}$, denoted by $\mathcal{S}\mathcal{D}\mathcal{H}_{\mathbb{Z}/2}(\mathcal{A})$, to be the localization of a quotient algebra of the Ringel-Hall algebra of the category of $\mathbb{Z}/2$-graded complexes over $\mathcal{A}$. We obtain the following three main results. The semi-derived Ringel-Hall algebra has a natural basis. A twisted version of the semi-derived Ringel-Hall algebra of $\mathcal{A}$ is isomorphic to the Drinfeld double of the twisted extended Ringel-Hall algebra $\mathcal{H}_{tw}^e(\mathcal{A})$ of $\mathcal{A}$. If $\mathcal{A}$ has a tilting object $T$, then its semi-derived Ringel-Hall algebra is isomorphic to the $\mathbb{Z}/2$-graded semi-derived Hall algebra $\mathcal{S}\mathcal{D}\mathcal{H}_{\mathbb{Z}/2}(\mathrm{add} T)$ of the exact category $\mathrm{add} T$ defined by Gorsky, and so is isomorphic to Bridgeland's Hall algebra of $\mod (\mathrm{End}(T)^{op})$.

preprint2020arXiv

A chip-scale oscillation-mode optomechanical inertial sensor near the thermodynamical limits

High-precision inertial sensing and gravity sensing are key in navigation, oil exploration, and earthquake prediction. In contrast to prior accelerometers using piezoelectric or electronic capacitance readout techniques, optical readout provides narrow-linewidth high-sensitivity laser detection along with low-noise resonant optomechanical transduction near the thermodynamical limits. Here an optomechanical inertial sensor with 8.2micro-g/Hz^1/2 velocity random walk (VRW) at acquisition rate of 100 Hz and 50.9 micro-g bias instability is demonstrated, suitable for consumer and industrial grade applications, e.g., inertial navigation, inclination sensing, platform stabilization, and/or wearable device motion detection. Driven into optomechanical sustained-oscillation, the slot photonic crystal cavity provides radio-frequency readout of the optically-driven transduction with enhanced 625 microg/Hz sensitivity. Measuring the optomechanically-stiffened oscillation shift, instead of the optical transmission shift, provides a 220x VRW enhancement over pre-oscillation mode detection due to the strong optomechanical transduction. Supported by theory, this inertial sensor operates 2.56x above the thermodynamical limit at small integration times, with 43-dB dynamic range, in a solid-state room-temperature readout architecture.

preprint2020arXiv

Neural Video Coding using Multiscale Motion Compensation and Spatiotemporal Context Model

Over the past two decades, traditional block-based video coding has made remarkable progress and spawned a series of well-known standards such as MPEG-4, H.264/AVC and H.265/HEVC. On the other hand, deep neural networks (DNNs) have shown their powerful capacity for visual content understanding, feature extraction and compact representation. Some previous works have explored the learnt video coding algorithms in an end-to-end manner, which show the great potential compared with traditional methods. In this paper, we propose an end-to-end deep neural video coding framework (NVC), which uses variational autoencoders (VAEs) with joint spatial and temporal prior aggregation (PA) to exploit the correlations in intra-frame pixels, inter-frame motions and inter-frame compensation residuals, respectively. Novel features of NVC include: 1) To estimate and compensate motion over a large range of magnitudes, we propose an unsupervised multiscale motion compensation network (MS-MCN) together with a pyramid decoder in the VAE for coding motion features that generates multiscale flow fields, 2) we design a novel adaptive spatiotemporal context model for efficient entropy coding for motion information, 3) we adopt nonlocal attention modules (NLAM) at the bottlenecks of the VAEs for implicit adaptive feature extraction and activation, leveraging its high transformation capacity and unequal weighting with joint global and local information, and 4) we introduce multi-module optimization and a multi-frame training strategy to minimize the temporal error propagation among P-frames. NVC is evaluated for the low-delay causal settings and compared with H.265/HEVC, H.264/AVC and the other learnt video compression methods following the common test conditions, demonstrating consistent gains across all popular test sequences for both PSNR and MS-SSIM distortion metrics.

preprint2019arXiv

Dielectric Metasurfaces for Complete and Independent Control of Optical Amplitude and Phase

Metasurfaces are optically thin metamaterials that promise complete control of the wavefront of light but are primarily used to control only the phase of light. Here, we present an approach, simple in concept and in practice, that uses meta-atoms with a varying degree of form birefringence and rotation angles to create high-efficiency dielectric metasurfaces that control both the optical amplitude and phase at one or two frequencies. This opens up applications in computer-generated holography, allowing faithful reproduction of both the phase and amplitude of a target holographic scene without the iterative algorithms required in phase-only holography. We demonstrate all-dielectric metasurface holograms with independent and complete control of the amplitude and phase at up to two optical frequencies simultaneously to generate two- and three-dimensional holographic objects. We show that phase-amplitude metasurfaces enable a few features not attainable in phase-only holography; these include creating artifact-free two-dimensional holographic images, encoding phase and amplitude profiles separately at the object plane, encoding intensity profiles at the metasurface and object planes separately, and controlling the surface textures of three-dimensional holographic objects.

preprint2016arXiv

Higgs amplitude mode in massless Dirac fermion systems

The Higgs amplitude mode in superconductors is the condensed matter analogy of Higgs bosons in particle physics. We investigate the time evolution of Higgs amplitude mode in massless Dirac systems, induced by a weak quench of an attractive interaction. We find that the Higgs amplitude mode in the half-filling honeycomb lattice has a logarithmic decaying behaviour, qualitatively different from the $1/\sqrt{t}$ decay in the normal superconductors. Our study is also extended to the doped cases in honeycomb lattice. As for the 3D Dirac semimetal at half filling, we obtain an undamped oscillation of the amplitude mode. Our finding is not only an important supplement to the previous theoretical studies on normal fermion systems, but also provide an experimental signature to characterize the superconductivity in 2D or 3D Dirac systems.