Source author record

Jun-Jie Huang

Jun-Jie Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision eess.IV Machine Learning physics.flu-dyn math.CO eess.SP math.OC physics.comp-ph

Catalog footprint

What is connected

15works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Combined Dictionary Unfolding Network with Gradient-Adaptive Fidelity for Transferable Multi-Source Fusion

Deep Unfolding Network-based methods have emerged as effective solutions for multi-source image fusion by combining model-driven iterative optimization with data-driven deep learning. However, most existing deep unfolding image fusion methods are derived from alternating minimization, which updates the features of different modalities separately. This design introduces considerable computational and memory overhead, limiting deployment on resource-constrained edge devices. To address this issue, we propose CDNet, a lightweight Combined Dictionary Unfolding Network for multi-source image fusion. Rather than introducing a new sparse coding prior or empirically compressing an existing fusion network, CDNet translates the unique-common decomposition prior of coupled dictionary learning into a structurally constrained joint unfolding architecture. The resulting CDBlock follows a block-sparse interaction topology and performs a model-derived joint update of common and modality-specific representations, thereby streamlining feature learning and improving efficiency.In addition, we design a compact High- and Low-frequency Image Fidelity loss for unsupervised training without ground-truth images. We evaluate CDNet on four tasks, including multi-exposure image fusion, infrared and visible image fusion, medical image fusion, and infrared and visible image fusion for semantic segmentation. Experimental results show that CDNet achieves competitive or superior fusion performance with high efficiency. For infrared and visible image fusion, CDNet outperforms competing methods on four of six metrics on the TNO dataset and five of six metrics on the RoadScene dataset. In particular, it surpasses the second-best method by 1.23 dB and 1.59 dB in PSNR on TNO and RoadScene, respectively.

preprint2022arXiv

DURRNet: Deep Unfolded Single Image Reflection Removal Network

Single image reflection removal problem aims to divide a reflection-contaminated image into a transmission image and a reflection image. It is a canonical blind source separation problem and is highly ill-posed. In this paper, we present a novel deep architecture called deep unfolded single image reflection removal network (DURRNet) which makes an attempt to combine the best features from model-based and learning-based paradigms and therefore leads to a more interpretable deep architecture. Specifically, we first propose a model-based optimization with transform-based exclusion prior and then design an iterative algorithm with simple closed-form solutions for solving each sub-problems. With the deep unrolling technique, we build the DURRNet with ProxNets to model natural image priors and ProxInvNets which are constructed with invertible networks to impose the exclusion prior. Comprehensive experimental results on commonly used datasets demonstrate that the proposed DURRNet achieves state-of-the-art results both visually and quantitatively.

preprint2022arXiv

Meta-learning based Alternating Minimization Algorithm for Non-convex Optimization

In this paper, we propose a novel solution for non-convex problems of multiple variables, especially for those typically solved by an alternating minimization (AM) strategy that splits the original optimization problem into a set of sub-problems corresponding to each variable, and then iteratively optimize each sub-problem using a fixed updating rule. However, due to the intrinsic non-convexity of the original optimization problem, the optimization can usually be trapped into spurious local minimum even when each sub-problem can be optimally solved at each iteration. Meanwhile, learning-based approaches, such as deep unfolding algorithms, are highly limited by the lack of labelled data and restricted explainability. To tackle these issues, we propose a meta-learning based alternating minimization (MLAM) method, which aims to minimize a partial of the global losses over iterations instead of carrying minimization on each sub-problem, and it tends to learn an adaptive strategy to replace the handcrafted counterpart resulting in advance on superior performance. Meanwhile, the proposed MLAM still maintains the original algorithmic principle, which contributes to a better interpretability. We evaluate the proposed method on two representative problems, namely, bi-linear inverse problem: matrix completion, and non-linear problem: Gaussian mixture models. The experimental results validate that our proposed approach outperforms AM-based methods in standard settings, and is able to achieve effective optimization in challenging cases while other comparing methods would typically fail.

preprint2022arXiv

Mixed X-Ray Image Separation for Artworks with Concealed Designs

In this paper, we focus on X-ray images of paintings with concealed sub-surface designs (e.g., deriving from reuse of the painting support or revision of a composition by the artist), which include contributions from both the surface painting and the concealed features. In particular, we propose a self-supervised deep learning-based image separation approach that can be applied to the X-ray images from such paintings to separate them into two hypothetical X-ray images. One of these reconstructed images is related to the X-ray image of the concealed painting, while the second one contains only information related to the X-ray of the visible painting. The proposed separation network consists of two components: the analysis and the synthesis sub-networks. The analysis sub-network is based on learned coupled iterative shrinkage thresholding algorithms (LCISTA) designed using algorithm unrolling techniques, and the synthesis sub-network consists of several linear mappings. The learning algorithm operates in a totally self-supervised fashion without requiring a sample set that contains both the mixed X-ray images and the separated ones. The proposed method is demonstrated on a real painting with concealed content, Doña Isabel de Porcel by Francisco de Goya, to show its effectiveness.

preprint2022arXiv

Two-geodesic transitive graphs of order $p^n$ with $n\leq3$

A vertex triple $(u,v,w)$ of a graph is called a $2$-geodesic if $v$ is adjacent to both $u$ and $w$ and $u$ is not adjacent to $w$. A graph is said to be $2$-geodesic transitive if its automorphism group is transitive on the set of $2$-geodesics. In this paper, a complete classification of $2$-geodesic transitive graphs of order $p^n$ is given for each prime $p$ and $n\leq 3$. It turns out that all such graphs consist of three small graphs: the complete bipartite graph $K_{4,4}$ of order $8$, the Schläfli graph of order $27$ and its complement, and fourteen infinite families: the cycles $C_p, C_{p^2}$ and $C_{p^3}$, the complete graphs $K_p, K_{p^2}$ and $K_{p^3}$, the complete multipartite graphs $K_{p[p]}$, $K_{p[p^2]}$ and $K_{p^2[p]}$, the Hamming graph $H(2,p)$ and its complement, the Hamming graph $H(3,p)$, and two infinite families of normal Cayley graphs on extraspecial group of order $p^3$ and exponent $p$.

preprint2021arXiv

Two-distance transitive normal Cayley graphs

In this paper, we construct an infinite family of normal Cayley graphs, which are $2$-distance-transitive but neither distance-transitive nor $2$-arc-transitive. This answers a question raised by Chen, Jin and Li in 2019 and corrects a claim in a literature given by Pan, Huang and Liu in 2015.

preprint2021arXiv

Video Summarization through Reinforcement Learning with a 3D Spatio-Temporal U-Net

Intelligent video summarization algorithms allow to quickly convey the most relevant information in videos through the identification of the most essential and explanatory content while removing redundant video frames. In this paper, we introduce the 3DST-UNet-RL framework for video summarization. A 3D spatio-temporal U-Net is used to efficiently encode spatio-temporal information of the input videos for downstream reinforcement learning (RL). An RL agent learns from spatio-temporal latent scores and predicts actions for keeping or rejecting a video frame in a video summary. We investigate if real/inflated 3D spatio-temporal CNN features are better suited to learn representations from videos than commonly used 2D image features. Our framework can operate in both, a fully unsupervised mode and a supervised training mode. We analyse the impact of prescribed summary lengths and show experimental evidence for the effectiveness of 3DST-UNet-RL on two commonly used general video summarization benchmarks. We also applied our method on a medical video summarization task. The proposed video summarization method has the potential to save storage costs of ultrasound screening videos as well as to increase efficiency when browsing patient video data during retrospective analysis or audit without loosing essential information

preprint2021arXiv

WINNet: Wavelet-inspired Invertible Network for Image Denoising

Image denoising aims to restore a clean image from an observed noisy image. The model-based image denoising approaches can achieve good generalization ability over different noise levels and are with high interpretability. Learning-based approaches are able to achieve better results, but usually with weaker generalization ability and interpretability. In this paper, we propose a wavelet-inspired invertible network (WINNet) to combine the merits of the wavelet-based approaches and learningbased approaches. The proposed WINNet consists of K-scale of lifting inspired invertible neural networks (LINNs) and sparsity-driven denoising networks together with a noise estimation network. The network architecture of LINNs is inspired by the lifting scheme in wavelets. LINNs are used to learn a non-linear redundant transform with perfect reconstruction property to facilitate noise removal. The denoising network implements a sparse coding process for denoising. The noise estimation network estimates the noise level from the input image which will be used to adaptively adjust the soft-thresholds in LINNs. The forward transform of LINNs produce a redundant multi-scale representation for denoising. The denoised image is reconstructed using the inverse transform of LINNs with the denoised detail channels and the original coarse channel. The simulation results show that the proposed WINNet method is highly interpretable and has strong generalization ability to unseen noise levels. It also achieves competitive results in the non-blind/blind image denoising and in image deblurring.

preprint2020arXiv

Learning Deep Analysis Dictionaries -- Part II: Convolutional Dictionaries

In this paper, we introduce a Deep Convolutional Analysis Dictionary Model (DeepCAM) by learning convolutional dictionaries instead of unstructured dictionaries as in the case of deep analysis dictionary model introduced in the companion paper. Convolutional dictionaries are more suitable for processing high-dimensional signals like for example images and have only a small number of free parameters. By exploiting the properties of a convolutional dictionary, we present an efficient convolutional analysis dictionary learning approach. A L-layer DeepCAM consists of L layers of convolutional analysis dictionary and element-wise soft-thresholding pairs and a single layer of convolutional synthesis dictionary. Similar to DeepAM, each convolutional analysis dictionary is composed of a convolutional Information Preserving Analysis Dictionary (IPAD) and a convolutional Clustering Analysis Dictionary (CAD). The IPAD and the CAD are learned using variations of the proposed learning algorithm. We demonstrate that DeepCAM is an effective multilayer convolutional model and, on single image super-resolution, achieves performance comparable with other methods while also showing good generalization capabilities.

preprint2020arXiv

Learning Deep Analysis Dictionaries for Image Super-Resolution

Inspired by the recent success of deep neural networks and the recent efforts to develop multi-layer dictionary models, we propose a Deep Analysis dictionary Model (DeepAM) which is optimized to address a specific regression task known as single image super-resolution. Contrary to other multi-layer dictionary models, our architecture contains L layers of analysis dictionary and soft-thresholding operators to gradually extract high-level features and a layer of synthesis dictionary which is designed to optimize the regression task at hand. In our approach, each analysis dictionary is partitioned into two sub-dictionaries: an Information Preserving Analysis Dictionary (IPAD) and a Clustering Analysis Dictionary (CAD). The IPAD together with the corresponding soft-thresholds is designed to pass the key information from the previous layer to the next layer, while the CAD together with the corresponding soft-thresholding operator is designed to produce a sparse feature representation of its input data that facilitates discrimination of key features. DeepAM uses both supervised and unsupervised setup. Simulation results show that the proposed deep analysis dictionary model achieves better performance compared to a deep neural network that has the same structure and is optimized using back-propagation when training datasets are small.

preprint2020arXiv

Reconstruction of FRI Signals using Deep Neural Network Approaches

Finite Rate of Innovation (FRI) theory considers sampling and reconstruction of classes of non-bandlimited continuous signals that have a small number of free parameters, such as a stream of Diracs. The task of reconstructing FRI signals from discrete samples is often transformed into a spectral estimation problem and solved using Prony's method and matrix pencil method which involve estimating signal subspaces. They achieve an optimal performance given by the Cramér-Rao bound yet break down at a certain peak signal-to-noise ratio (PSNR). This is probably due to the so-called subspace swap event. In this paper, we aim to alleviate the subspace swap problem and investigate alternative approaches including directly estimating FRI parameters using deep neural networks and utilising deep neural networks as denoisers to reduce the noise in the samples. Simulations show significant improvements on the breakdown PSNR over existing FRI methods, which still outperform learning-based approaches in medium to high PSNR regimes.

preprint2019arXiv

Gated Multi-layer Convolutional Feature Extraction Network for Robust Pedestrian Detection

Pedestrian detection methods have been significantly improved with the development of deep convolutional neural networks. Nevertheless, robustly detecting pedestrians with a large variant on sizes and with occlusions remains a challenging problem. In this paper, we propose a gated multi-layer convolutional feature extraction method which can adaptively generate discriminative features for candidate pedestrian regions. The proposed gated feature extraction framework consists of squeeze units, gate units and a concatenation layer which perform feature dimension squeezing, feature elements manipulation and convolutional features combination from multiple CNN layers, respectively. We proposed two different gate models which can manipulate the regional feature maps in a channel-wise selection manner and a spatial-wise selection manner, respectively. Experiments on the challenging CityPersons dataset demonstrate the effectiveness of the proposed method, especially on detecting those small-size and occluded pedestrians.

preprint2016arXiv

An Alternative Method to Implement Contact Angle Boundary Condition on Immersed Surfaces for Phase-Field Simulations

In this paper, we propose an alternative approach to implement the contact angle boundary condition on immersed surfaces for phase-field simulations of two-phase flows using the Cahn-Hilliard equation on a Cartesian mesh. This simple and effective method was inspired by previous works on the geometric formulation of the wetting boundary condition. In two dimensions, by making full use of the hyperbolic tangent profile of the order parameter, we were able to obtain its unknown value at a ghost point from the information at only one point in the fluid. This is in contrast with previous approaches using interpolations involving several points. The special feature allows this method to be easily implemented on immersed surfaces (including curved ones) that cut through the grid lines. It is verified through the study of two examples: (1) the shape of a drop on a circular cylinder with different contact angles; (2) the spreading of a drop on an embedded inclined wall with a given contact angle.

preprint2013arXiv

Numerical Study of Drop Motion on a Surface with Wettability Gradient and Contact Angle Hysteresis

In this work, the motion of a 2-D drop on a surface with given wettability gradient is studied numerically by a hybrid lattice-Boltzmann finite-difference method using the multiple-relaxation-time collision model. We incorporate the geometric wetting boundary condition that allows accurate implementation of a contact angle hysteresis model. The method is first validated through three benchmark tests, including the layered Poiseuille flow with a viscosity contrast, the motion of a liquid column in a channel with specified wettability gradient and the force balance for a static drop attached to a surface with hysteresis subject to a body force. Then, simulations of a drop on a wall with given wettability gradient are performed under different conditions. The effects of the Reynolds number, the viscosity ratio, the wettability gradient, as well as the contact angle hysteresis on the drop motion are investigated in detail. It is found that the capillary number of the drop in steady state is significantly affected by the viscosity ratio, the magnitudes of the wettability gradient and the contact angle hysteresis, whereas it only shows very weak dependence on the Reynolds number.

preprint2013arXiv

Wetting Boundary Conditions in Phase-Field-Based Simulation of Binary Fluids: Some Comparative Studies and New Development

We studied several wetting boundary conditions (WBCs) in the simulation of binary fluids based on phase-field theory. Five WBCs, three belonging to the surface energy (SE) formulation using the linear, cubic and sine functions (denoted as LinSE, CubSE and SinSE), the fourth using a geometric formulation (Geom), and the fifth using a characteristic interpolation (CI), were compared with each other through the study of several problems: (1) the static contact angle of a drop; (2) a Poiseuille flow-driven liquid column; (3) a wettability gradient (WG)-driven liquid column; (4) drop dewetting. It was found that while all WBCs can predict the static contact angle fairly accurately, they may affect the simulation outcomes of dynamic problems differently, depending on the driving mechanism. For the flow-driven problem, to use different WBCs had almost no effect on the flow characteristics over a large scale. But for other capillarity-driven problems, the WBC had some noticeable effects. For the WG-driven liquid column, Geom gave the most consistent prediction between the drop velocity and dynamic contact angles, and LinSE delivered the poorest prediction in this aspect. Except for Geom, the dynamic contact angle differed from the prescribed (static) one when other WBCs were used. For drop dewetting, Geom led to the most violent drop motion whereas CubSE caused the weakest motion; the initial contact line velocity was also found to be dependent on the WBC. For several problems, CubSE and SinSE gave almost the same results, and those by Geom and CI were close as well, possibly due to similar consideration in their design. Besides various comparisons, a new implementation that may be used for all WBCs was proposed to mimic the wall energy relaxation and control the degree of slip. This new procedure made it possible to allow the simulations to match experimental measurements well.

Jun-Jie Huang

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

Combined Dictionary Unfolding Network with Gradient-Adaptive Fidelity for Transferable Multi-Source Fusion

DURRNet: Deep Unfolded Single Image Reflection Removal Network

Meta-learning based Alternating Minimization Algorithm for Non-convex Optimization

Mixed X-Ray Image Separation for Artworks with Concealed Designs

Two-geodesic transitive graphs of order $p^n$ with $n\leq3$

Two-distance transitive normal Cayley graphs

Video Summarization through Reinforcement Learning with a 3D Spatio-Temporal U-Net

WINNet: Wavelet-inspired Invertible Network for Image Denoising

Learning Deep Analysis Dictionaries -- Part II: Convolutional Dictionaries

Learning Deep Analysis Dictionaries for Image Super-Resolution

Reconstruction of FRI Signals using Deep Neural Network Approaches

Gated Multi-layer Convolutional Feature Extraction Network for Robust Pedestrian Detection

An Alternative Method to Implement Contact Angle Boundary Condition on Immersed Surfaces for Phase-Field Simulations

Numerical Study of Drop Motion on a Surface with Wettability Gradient and Contact Angle Hysteresis

Wetting Boundary Conditions in Phase-Field-Based Simulation of Binary Fluids: Some Comparative Studies and New Development