Source author record

Qiang Chen

Qiang Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

31works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

LEGATO: Good Identity Unlearning Is Continuous

Machine unlearning has become a crucial role in enabling generative models trained on large datasets to remove sensitive, private, or copyright-protected data. However, existing machine unlearning methods face three challenges in learning to forget identity of generative models: 1) inefficient, where identity erasure requires fine-tuning all the model's parameters; 2) limited controllability, where forgetting intensity cannot be controlled and explainability is lacking; 3) catastrophic collapse, where the model's retention capability undergoes drastic degradation as forgetting progresses. Forgetting has typically been handled through discrete and unstable updates, often requiring full-model fine-tuning and leading to catastrophic collapse. In this work, we argue that identity forgetting should be modeled as a continuous trajectory, and introduce LEGATO - Learn to ForgEt Identity in GenerAtive Models via Trajectory-consistent Neural Ordinary Differential Equations. LEGATO augments pre-trained generators with fine-tunable lightweight Neural ODE adapters, enabling smooth, controllable forgetting while keeping the original model weights frozen. This formulation allows forgetting intensity to be precisely modulated via ODE step size, offering interpretability and robustness. To further ensure stability, we introduce trajectory consistency constraints that explicitly prevent catastrophic collapse during unlearning. Extensive experiments across in-domain and out-of-domain identity unlearning benchmarks show that LEGATO achieves state-of-the-art forgetting performance, avoids catastrophic collapse and reduces fine-tuned parameters.

preprint2026arXiv

Programmable ultra-broadband photonic chaos platform enabled by microwave-chaos-driven electro-optic frequency combs

Optical chaos holds great promise for secure communication, LiDAR, and reinforcement learning. However, its scalability has long been constrained by an intrinsic trade-off between bandwidth and the number of parallel chaotic channels. Here, we introduce a programmable "chaos-on-comb" architecture that overcomes this limitation using standard electro-optic components. By heterodyning a delayed-feedback chaotic laser with a continuous-wave reference, a broadband chaotic microwave signal is generated to simultaneously drive a cascaded electro-optic comb, imprinting chaotic dynamics across all comb lines and merging them into an ultra-broadband chaotic continuum. Then, incorporating spectrum slicing enables flexible extraction of parallel chaotic channels with preserved statistical independence and per-channel programmability. As a result, we demonstrate a single-channel ultra-broadband optical chaos with an effective bandwidth of 543.8 GHz, and a broadband terahertz noise source with an excess noise ratio of 52.99 \pm 2.85 dB to validate its flatness. Furthermore, we employ the uncorrelated parallel chaos for ultrafast photonic decision-making in a 256-armed bandit problem, achieving a favourable power-law scaling exponent of 0.86. Our work paves the way toward programmable, reconfigurable, and application-ready photonic chaos systems.

preprint2026arXiv

Test-time generative augmentation for medical image segmentation

Medical image segmentation is critical for clinical diagnosis, treatment planning, and monitoring, yet segmentation models often struggle with uncertainties stemming from occlusions, ambiguous boundaries, and variations in imaging devices. Traditional test-time augmentation (TTA) techniques typically rely on predefined geometric and photometric transformations, limiting their adaptability and effectiveness in complex medical scenarios. In this study, we introduced Test-Time Generative Augmentation (TTGA), a novel augmentation strategy specifically tailored for medical image segmentation at inference time. Different from conventional augmentation strategies that suffer from excessive randomness or limited flexibility, TTGA leverages a domain-fine-tuned generative model to produce contextually relevant and diverse augmentations tailored to the characteristics of each test image. Built upon diffusion model inversion, a masked null-text inversion method is proposed to enable region-specific augmentations during sampling. Furthermore, a dual denoising pathway is designed to balance precise identity preservation with controlled variability. We demonstrate the efficacy of our TTGA through extensive experiments across three distinct segmentation tasks spanning nine datasets. Our results consistently demonstrate that TTGA not only improves segmentation accuracy (with DSC gains ranging from 0.1% to 2.3% over the baseline) but also offers pixel-wise error estimation (with DSC gains ranging from 1.1% to 29.0% over the baseline). The source code and demonstration are available at: https://github.com/maxiao0234/TTGA.

preprint2026arXiv

Tools as Continuous Flow for Evolving Agentic Reasoning

Large Language Models (LLMs) have demonstrated remarkable capabilities in orchestrating tools for reasoning tasks. However, existing methods rely on a step-wise paradigm that lacks a global perspective, which causes error accumulation over long horizons and restricts generalization to unseen tools. To overcome these limitations, we propose Tools as Continuous Flow for Evolving Agentic Reasoning (FlowAgent), which reconceptualizes tool chaining as continuous trajectory generation within a semantic space. To systematically evaluate this paradigm, we introduce the first plan-level closed-loop benchmark dedicated to plan-level agentic reasoning in dynamic real-world environments. Specifically, the proposed FlowAgent leverages conditional flow matching to generate continuous latent trajectories, providing a global planning perspective to ensure coherent and robust tool execution. Theoretically, we establish formal bounds on utility convergence and prove that our continuous formulation fundamentally guarantees robust generalization and error attenuation. Empirical evaluations show that FlowAgent achieves superior robustness and adaptability in long-horizon reasoning tasks.

preprint2024arXiv

MS-DETR: Efficient DETR Training with Mixed Supervision

DETR accomplishes end-to-end object detection through iteratively generating multiple object candidates based on image features and promoting one candidate for each ground-truth object. The traditional training procedure using one-to-one supervision in the original DETR lacks direct supervision for the object detection candidates. We aim at improving the DETR training efficiency by explicitly supervising the candidate generation procedure through mixing one-to-one supervision and one-to-many supervision. Our approach, namely MS-DETR, is simple, and places one-to-many supervision to the object queries of the primary decoder that is used for inference. In comparison to existing DETR variants with one-to-many supervision, such as Group DETR and Hybrid DETR, our approach does not need additional decoder branches or object queries. The object queries of the primary decoder in our approach directly benefit from one-to-many supervision and thus are superior in object candidate prediction. Experimental results show that our approach outperforms related DETR variants, such as DN-DETR, Hybrid DETR, and Group DETR, and the combination with related DETR variants further improves the performance.

preprint2023arXiv

Spinon continuum in the Heisenberg quantum chain compound Sr$_2$V$_3$O$_9$

Magnetic excitations in the spin chain candidate Sr$_2$V$_3$O$_9$ have been investigated by inelastic neutron scattering on a single crystal sample. A spinon continuum with a bandwidth of $\sim22$ meV is observed along the chain formed by alternating magnetic V$^{4+}$ and nonmagnetic V$^{5+}$ ions. Incipient magnetic Bragg peaks due to weak ferromagnetic interchain couplings emerge when approaching the magnetic transition at $T_N\sim 5.3$ K while the excitations remain gapless within the instrumental resolution. Comparisons to the Bethe ansatz, density matrix renormalization group (DMRG) calculations, and effective field theories confirm Sr$_2$V$_3$O$_9$ as a host of weakly coupled $S = 1/2$ chains dominated by antiferromagnetic intrachain interactions of $\sim7.1$(1) meV.

preprint2022arXiv

HIH: Towards More Accurate Face Alignment via Heatmap in Heatmap

Heatmap-based regression overcomes the lack of spatial and contextual information of direct coordinate regression, and has revolutionized the task of face alignment. Yet it suffers from quantization errors caused by neglecting subpixel coordinates in image resizing and network downsampling. In this paper, we first quantitatively analyze the quantization error on benchmarks, which accounts for more than 1/3 of the whole prediction errors for state-of-the-art methods. To tackle this problem, we propose a novel Heatmap In Heatmap(HIH) representation and a coordinate soft-classification (CSC) method, which are seamlessly integrated into the classic hourglass network. The HIH representation utilizes nested heatmaps to jointly represent the coordinate label: one heatmap called integer heatmap stands for the integer coordinate, and the other heatmap named decimal heatmap represents the subpixel coordinate. The range of a decimal heatmap makes up one pixel in the corresponding integer heatmap. Besides, we transfer the offset regression problem to an interval classification task, and CSC regards the confidence of the pixel as the probability of the interval. Meanwhile, CSC applying the distribution loss leverage the soft labels generated from the Gaussian distribution function to guide the offset heatmap training, which makes it easier to learn the distribution of coordinate offsets. Extensive experiments on challenging benchmark datasets demonstrate that our HIH can achieve state-of-the-art results. In particular, our HIH reaches 4.08 NME (Normalized Mean Error) on WFLW, and 3.21 on COFW, which exceeds previous methods by a significant margin.

preprint2022arXiv

Image Magnification Network for Vessel Segmentation in OCTA Images

Optical coherence tomography angiography (OCTA) is a novel non-invasive imaging modality that allows micron-level resolution to visualize the retinal microvasculature. The retinal vessel segmentation in OCTA images is still an open problem, and especially the thin and dense structure of the capillary plexus is an important challenge of this problem. In this work, we propose a novel image magnification network (IMN) for vessel segmentation in OCTA images. Contrary to the U-Net structure with a down-sampling encoder and up-sampling decoder, the proposed IMN adopts the design of up-sampling encoding and then down-sampling decoding. This design is to capture more low-level image details to reduce the omission of small structures. The experimental results on three open OCTA datasets show that the proposed IMN with an average dice score of 90.2% achieves the best performance in vessel segmentation of OCTA images. Besides, we also demonstrate the superior performance of IMN in cross-field image vessel segmentation and vessel skeleton extraction.

preprint2022arXiv

Improving Transferability for Domain Adaptive Detection Transformers

DETR-style detectors stand out amongst in-domain scenarios, but their properties in domain shift settings are under-explored. This paper aims to build a simple but effective baseline with a DETR-style detector on domain shift settings based on two findings. For one, mitigating the domain shift on the backbone and the decoder output features excels in getting favorable results. For another, advanced domain alignment methods in both parts further enhance the performance. Thus, we propose the Object-Aware Alignment (OAA) module and the Optimal Transport based Alignment (OTA) module to achieve comprehensive domain alignment on the outputs of the backbone and the detector. The OAA module aligns the foreground regions identified by pseudo-labels in the backbone outputs, leading to domain-invariant based features. The OTA module utilizes sliced Wasserstein distance to maximize the retention of location information while minimizing the domain gap in the decoder outputs. We implement the findings and the alignment modules into our adaptation method, and it benchmarks the DETR-style detector on the domain shift settings. Experiments on various domain adaptive scenarios validate the effectiveness of our method.

preprint2022arXiv

Label Adversarial Learning for Skeleton-level to Pixel-level Adjustable Vessel Segmentation

You can have your cake and eat it too. Microvessel segmentation in optical coherence tomography angiography (OCTA) images remains challenging. Skeleton-level segmentation shows clear topology but without diameter information, while pixel-level segmentation shows a clear caliber but low topology. To close this gap, we propose a novel label adversarial learning (LAL) for skeleton-level to pixel-level adjustable vessel segmentation. LAL mainly consists of two designs: a label adversarial loss and an embeddable adjustment layer. The label adversarial loss establishes an adversarial relationship between the two label supervisions, while the adjustment layer adjusts the network parameters to match the different adversarial weights. Such a design can efficiently capture the variation between the two supervisions, making the segmentation continuous and tunable. This continuous process allows us to recommend high-quality vessel segmentation with clear caliber and topology. Experimental results show that our results outperform manual annotations of current public datasets and conventional filtering effects. Furthermore, such a continuous process can also be used to generate an uncertainty map representing weak vessel boundaries and noise.

preprint2022arXiv

MixFormer: Mixing Features across Windows and Dimensions

While local-window self-attention performs notably in vision tasks, it suffers from limited receptive field and weak modeling capability issues. This is mainly because it performs self-attention within non-overlapped windows and shares weights on the channel dimension. We propose MixFormer to find a solution. First, we combine local-window self-attention with depth-wise convolution in a parallel design, modeling cross-window connections to enlarge the receptive fields. Second, we propose bi-directional interactions across branches to provide complementary clues in the channel and spatial dimensions. These two designs are integrated to achieve efficient feature mixing among windows and dimensions. Our MixFormer provides competitive results on image classification with EfficientNet and shows better results than RegNet and Swin Transformer. Performance in downstream tasks outperforms its alternatives by significant margins with less computational costs in 5 dense prediction tasks on MS COCO, ADE20k, and LVIS. Code is available at \url{https://github.com/PaddlePaddle/PaddleClas}.

preprint2022arXiv

The consistent behavior of negative Poissons ratio with interlayer interactions

Negative Poissons ratio (NPR) is of great interest due to the novel applications in lots of fields. Films are the most commonly used form in practical applications, which involves multiple layers. However, the effect of interlayer interactions on the NPR is still unclear. In this study, based on first principles calculations, we systematically investigate the effect of interlayer interactions on the NPR by comparably studying single-layer graphene, few-layer graphene, h-BN, and graphene-BN heterostructure. It is found that they almost have the same geometry-strain response. Consequently, the NPR in bilayer graphene, triple-layer graphene, and graphene-BN heterostructure are consistent with that in single-layer graphene and h-BN. The fundamental mechanism lies in that the response to strain of the orbital coupling are consistent under the effect of interlayer interactions. The deep understanding of the NPR with the effect of interlayer interactions as achieved in this study is beneficial for the future design and development of micro-/nanoscale electromechanical devices with novel functions based on nanostructures.

preprint2021arXiv

G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for Biomarker Identification and Disease Classification

We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers. Our model consists of an encoder, a decoder and a classifier. The encoder learns a non-linear subspace shared between the input data modalities. The classifier and the decoder act as regularizers to ensure that the low-dimensional encoding captures predictive differences between patients and controls. We use a learnable dropout layer to extract interpretable biomarkers from the data, and our unique training strategy can easily accommodate missing data modalities across subjects. We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data. Using 10-fold cross validation, we demonstrate that our model achieves better classification accuracy than baseline methods, and that this performance generalizes to a second dataset collected at a different site. In an exploratory analysis we further show that the biomarkers identified by our model are closely associated with the well-documented deficits in schizophrenia.

preprint2021arXiv

Gauge invariant canonical symplectic algorithms for real-time lattice strong-field quantum electrodynamics

A class of high-order canonical symplectic structure-preserving geometric algorithms are developed for high-quality simulations of the quantized Dirac-Maxwell theory based strong-field quantum electrodynamics (SFQED) and relativistic quantum plasmas (RQP) phenomena. The Lagrangian density of an interacting bispinor-gauge fields theory is constructed in a conjugate real fields form. The canonical symplectic form and canonical equations of this field theory are obtained by the general Hamilton's principle on cotangent bundle. Based on discrete exterior calculus, the gauge field components are discreted to form a cochain complex, and the bispinor components are naturally discreted on a staggered dual lattice as combinations of differential forms. With pull-back and push-forward gauge covariant derivatives, the discrete action is gauge invariant. A well-defined discrete canonical Poisson bracket generates a semi-discrete lattice canonical field theory (LCFT), which admits the canonical symplectic form, unitary property, gauge symmetry and discrete Poincaré subgroup. The Hamiltonian splitting method, Cayley transformation and symmetric composition technique are introduced to construct a class of high-order numerical schemes. These schemes involve two degenerate fermion flavors and are locally unconditional stable, which also preserve the geometric structures. Equipped with statistically quantization-equivalent ensemble models of the Dirac vacuum and non-trivial plasma backgrounds, the schemes are expected to have excellent performance in secular simulations of relativistic quantum effects. The algorithms are verified in detail by numerical energy spectra. Real-time LCFT simulations are successfully implemented for the nonlinear Schwinger mechanism induced $e$-$e^+$ pairs creation and vacuum Kerr effect, which open a new door toward high-quality simulations in SFQED and RQP.

preprint2020arXiv

Compact Global Descriptor for Neural Networks

Long-range dependencies modeling, widely used in capturing spatiotemporal correlation, has shown to be effective in CNN dominated computer vision tasks. Yet neither stacks of convolutional operations to enlarge receptive fields nor recent nonlocal modules is computationally efficient. In this paper, we present a generic family of lightweight global descriptors for modeling the interactions between positions across different dimensions (e.g., channels, frames). This descriptor enables subsequent convolutions to access the informative global features with negligible computational complexity and parameters. Benchmark experiments show that the proposed method can complete state-of-the-art long-range mechanisms with a significant reduction in extra computing cost. Code available at https://github.com/HolmesShuan/Compact-Global-Descriptor.

preprint2020arXiv

Fixed-Time Cooperative Tracking Control for Double-Integrator Multi-Agent Systems: A Time-Based Generator Approach

In this paper, both the fixed-time distributed consensus tracking and the fixed-time distributed average tracking problems for double-integrator-type multi-agent systems with bounded input disturbances are studied, respectively. Firstly, a new practical robust fixed-time sliding mode control method based on the time-based generator is proposed. Secondly, a fixed-time distributed consensus tracking observer for double-integrator-type multi-agent systems is designed to estimate the state disagreements between the leader and the followers under undirected and directed communication, respectively. Thirdly, a fixed-time distributed average tracking observer for double-integrator-type multi-agent systems is designed to measure the average value of reference signals under undirected communication. Note that both the observers for the distributed consensus tracking and the distributed average tracking are devised based on time-based generators and can be extended to that of high-order multi-agent systems trivially. Furthermore, by combing the fixed-time sliding mode control with the fixed-time observers, the fixed-time controllers are designed to solve the distributed consensus tracking and the distributed average tracking problems. Finally, a few numerical simulations are shown to verify the results.

preprint2020arXiv

Simulation of Skin Stretching around the Forehead Wrinkles in Rhytidectomy

Objective: Skin stretching around the forehead wrinkles is an important method in rhytidectomy. Proper parameters are required to evaluate the surgical effect. In this paper, a simulation method was proposed to obtain the parameters. Methods: Three-dimensional point cloud data with a resolution of 50 μm were employed. First, a smooth supporting contour under the wrinkled forehead was generated via b-spline interpolation and extrapolation to constrain the deformation of the wrinkled zone. Then, based on the vector formed intrinsic finite element (VFIFE) algorithm, the simulation was implemented in Matlab for the deformation of wrinkled forehead skin in the stretching process. Finally, the stress distribution and the residual wrinkles of forehead skin were employed to evaluate the surgical effect. Results: Although the residual wrinkles are similar when forehead wrinkles are finitely stretched, their stress distribution changes greatly. This indicates that the stress distribution in the skin is effective to evaluate the surgical effect, and the forehead wrinkles are easily to be overstretched, which may lead to potential skin injuries. Conclusion: The simulation method can predict stress distribution and residual wrinkles after forehead wrinkle stretching surgery, which can be potentially used to control the surgical process and further reduce risks of skin injury.

preprint2020arXiv

SpatialFlow: Bridging All Tasks for Panoptic Segmentation

Object location is fundamental to panoptic segmentation as it is related to all things and stuff in the image scene. Knowing the locations of objects in the image provides clues for segmenting and helps the network better understand the scene. How to integrate object location in both thing and stuff segmentation is a crucial problem. In this paper, we propose spatial information flows to achieve this objective. The flows can bridge all sub-tasks in panoptic segmentation by delivering the object's spatial context from the box regression task to others. More importantly, we design four parallel sub-networks to get a preferable adaptation of object spatial information in sub-tasks. Upon the sub-networks and the flows, we present a location-aware and unified framework for panoptic segmentation, denoted as SpatialFlow. We perform a detailed ablation study on each component and conduct extensive experiments to prove the effectiveness of SpatialFlow. Furthermore, we achieve state-of-the-art results, which are $47.9$ PQ and $62.5$ PQ respectively on MS-COCO and Cityscapes panoptic benchmarks. Code will be available at https://github.com/chensnathan/SpatialFlow.

preprint2016arXiv

A Fast Factorization-based Approach to Robust PCA

Robust principal component analysis (RPCA) has been widely used for recovering low-rank matrices in many data mining and machine learning problems. It separates a data matrix into a low-rank part and a sparse part. The convex approach has been well studied in the literature. However, state-of-the-art algorithms for the convex approach usually have relatively high complexity due to the need of solving (partial) singular value decompositions of large matrices. A non-convex approach, AltProj, has also been proposed with lighter complexity and better scalability. Given the true rank $r$ of the underlying low rank matrix, AltProj has a complexity of $O(r^2dn)$, where $d\times n$ is the size of data matrix. In this paper, we propose a novel factorization-based model of RPCA, which has a complexity of $O(kdn)$, where $k$ is an upper bound of the true rank. Our method does not need the precise value of the true rank. From extensive experiments, we observe that AltProj can work only when $r$ is precisely known in advance; however, when the needed rank parameter $r$ is specified to a value different from the true rank, AltProj cannot fully separate the two parts while our method succeeds. Even when both work, our method is about 4 times faster than AltProj. Our method can be used as a light-weight, scalable tool for RPCA in the absence of the precise value of the true rank.

preprint2016arXiv

Criticality-Enhanced Magnetocaloric Effect in Quantum Spin Chain Material Copper Nitrate

Low-dimensional quantum magnets, due to the existence of abundant exotic quantum phases therein and experimental feasibilities in laboratories, continues intriguing people in condensed matter physics. In this work, a comprehensive study of Cu(NO$_3$)$_2$ $\cdot$ 2.5H$_2$O (copper nitrate hemipentahydrate, CN), a spin chain material, is performed with multi-technique approach including thermal tensor network (TTN) simulations, first-principles calculations, as well as magnetization measurements in experiments. Employing a cutting-edge TTN method developed in the present work, we determine the couplings $J=5.13$ K, $α=0.23(1)$ and Landé factors $g_{\parallel}=2.31$, $g_{\perp}=2.14$ in an alternating Heisenberg antiferromagnetic chain model, with which one can fit strikingly well the magnetothermodynamic properties. Part of the fitted experimental data are measured on the single-crystal CN specimens synthesized by us. Based on first-principles calculations, we reveal explicitly the spin chain scenario in CN by displaying the calculated electron density distributions, from which the distinct superexchange paths are visualized. On top of that, we investigated the magnetocaloric effect (MCE) in CN by calculating its isentropes and magnetic Grüeisen parameter (GP). Prominent quantum-criticality-enhanced MCE was uncovered, the TTN simulations are in good agreements with measured isentropic lines in the sub-Kelvin region. We propose that CN is potentially a very promising quantum critical coolant, due to the remarkably enhanced MCE near both critical fields of moderate strengths as 2.87 and 4.08 T, respectively.

preprint2015arXiv

A review of plasma liquid interactions for nanomaterial synthesis

In this review, we have summarized the recent advances and present conditions of the nanomaterials synthesis from the plasma-liquid interactions. A theoretical analysis for the nanomaterials synthesis process is presented by analyzing the experimental data. Besides the theoretical analysis, the practical applications in several nanomaterials syntheses of the the plasma-liquid interactions are also presented.

preprint2015arXiv

Cross-domain Image Retrieval with a Dual Attribute-aware Ranking Network

We address the problem of cross-domain image retrieval, considering the following practical application: given a user photo depicting a clothing image, our goal is to retrieve the same or attribute-similar clothing items from online shopping stores. This is a challenging problem due to the large discrepancy between online shopping images, usually taken in ideal lighting/pose/background conditions, and user photos captured in uncontrolled conditions. To address this problem, we propose a Dual Attribute-aware Ranking Network (DARN) for retrieval feature learning. More specifically, DARN consists of two sub-networks, one for each domain, whose retrieval feature representations are driven by semantic attribute learning. We show that this attribute-guided learning is a key factor for retrieval accuracy improvement. In addition, to further align with the nature of the retrieval problem, we impose a triplet visual similarity constraint for learning to rank across the two sub-networks. Another contribution of our work is a large-scale dataset which makes the network learning feasible. We exploit customer review websites to crawl a large set of online shopping images and corresponding offline user photos with fine-grained clothing attributes, i.e., around 450,000 online shopping images and about 90,000 exact offline counterpart images of those online ones. All these images are collected from real-world consumer websites reflecting the diversity of the data modality, which makes this dataset unique and rare in the academic community. We extensively evaluate the retrieval performance of networks in different configurations. The top-20 retrieval accuracy is doubled when using the proposed DARN other than the current popular solution using pre-trained CNN features only (0.570 vs. 0.268).

preprint2015arXiv

LogDet Rank Minimization with Application to Subspace Clustering

Low-rank matrix is desired in many machine learning and computer vision problems. Most of the recent studies use the nuclear norm as a convex surrogate of the rank operator. However, all singular values are simply added together by the nuclear norm, and thus the rank may not be well approximated in practical problems. In this paper, we propose to use a log-determinant (LogDet) function as a smooth and closer, though non-convex, approximation to rank for obtaining a low-rank representation in subspace clustering. Augmented Lagrange multipliers strategy is applied to iteratively optimize the LogDet-based non-convex objective function on potentially large-scale data. By making use of the angular information of principal directions of the resultant low-rank representation, an affinity graph matrix is constructed for spectral clustering. Experimental results on motion segmentation and face clustering data demonstrate that the proposed method often outperforms state-of-the-art subspace clustering algorithms.

preprint2014arXiv

Green tea induced gold nanostar synthesis mediated by Ag(I) ions

We report a synthesis of tea components conjugated gold nanostars (AuNSs) with strong near infrared absorption by reducing an aqueous solution of chloroauric acid trihydrate via green tea in association with Ag(I) ions. Green tea acts as a reducing agent by providing electrons for the gold (III) reduction as well as a stabilizing agent by conjugating some of its components on the surfaces of AuNSs. Moreover, the Ag(I) ions play an important role in mediating the branched growth of the resultant AuNSs by inducing anisotropic growth on the surfaces of initially formed spherical gold nanoparticles.

preprint2014arXiv

Network In Network

We propose a novel deep network structure called "Network In Network" (NIN) to enhance model discriminability for local patches within the receptive field. The conventional convolutional layer uses linear filters followed by a nonlinear activation function to scan the input. Instead, we build micro neural networks with more complex structures to abstract the data within the receptive field. We instantiate the micro neural network with a multilayer perceptron, which is a potent function approximator. The feature maps are obtained by sliding the micro networks over the input in a similar manner as CNN; they are then fed into the next layer. Deep NIN can be implemented by stacking mutiple of the above described structure. With enhanced local modeling via the micro network, we are able to utilize global average pooling over feature maps in the classification layer, which is easier to interpret and less prone to overfitting than traditional fully connected layers. We demonstrated the state-of-the-art classification performances with NIN on CIFAR-10 and CIFAR-100, and reasonable performances on SVHN and MNIST datasets.

preprint2013arXiv

Preliminary Study on the RF tuning of CSNS DTL

In the R&D of the CSNS Drift Tube Linac (DTL), the first unit tank with 28 drift tubes has been developed. The axial accelerating field is ramped from 2.2MV/m to 3.1MV/m in this tank. The required field flatness is less than 2 % with the standard deviation of 1 % for the beam dynamics. And the field stability should be less than 1% for machine stable operation. After the successful alignment, the RF tuning was carried out focusing on the field profile measurement. Four slug tuners and 11 post couplers were applied in this procedure. The ramped filed and required stability had been achieved by fine adjustment of the slug tuners and post couplers. In this paper, the preliminary tuning results are presented and discussed.

preprint2013arXiv

Surface Engineering of Synthetic Nanopores by Atomic Layer Deposition and Their Applications

In the past decade, nanopores have been developed extensively for various potential applications, and their performance greatly depends on the surface properties of the nanopores. Atomic layer deposition (ALD) is a new technology for depositing thin films, which has been rapidly developed from a niche technology to an established method. ALD films can cover the surface in confined regions even in nano scale conformally, thus it is proved to be a powerful tool to modify the surface of the synthetic nanopores and also to fabricate complex nanopores. This review gives a brief introduction on nanopore synthesis and ALD fundamental knowledge, then focuses on the various aspects of synthetic nanopores processing by ALD and their applications, including single-molecule sensing, nanofluidic devices, nanostructure fabrication and other applications.

preprint2012arXiv

On rigidity of gradient Kähler-Ricci solitons with harmonic Bochner tensor

In this paper, we prove that complete gradient steady Kähler-Ricci solitons with harmonic Bochner tensor are necessarily Kähler-Ricci flat, i.e., Calabi-Yau, and that complete gradient shrinking (or expanding) Kähler-Ricci solitons with harmonic Bochner tensor must be isometric to a quotient of $N^k\times \mathbb{C}^{n-k}$, where $N$ is a Kähler-Einstein manifold with positive (or negative) scalar curvature.

preprint2011arXiv

On Bach flat warped product Einstein manifolds

In this paper we show that a compact warped product Einstein manifold with vanishing Bach tensor of dimension $n \geq 4$ is a finite quotient of a warped product with $(n-1)$-dimensional Einstein fiber. The fiber has constant curvature if $n=4$.

preprint2010arXiv

On Locally Conformally Flat Gradient Steady Ricci Solitons

In this paper, we classify n-dimensional (n>2) complete noncompact locally conformally flat gradient steady solitons. In particular, we prove that a complete noncompact non-flat conformally flat gradient steady Ricci soliton is, up to scaling, the Bryant soliton.

preprint2010arXiv

Selective Image Super-Resolution

In this paper we propose a vision system that performs image Super Resolution (SR) with selectivity. Conventional SR techniques, either by multi-image fusion or example-based construction, have failed to capitalize on the intrinsic structural and semantic context in the image, and performed "blind" resolution recovery to the entire image area. By comparison, we advocate example-based selective SR whereby selectivity is exemplified in three aspects: region selectivity (SR only at object regions), source selectivity (object SR with trained object dictionaries), and refinement selectivity (object boundaries refinement using matting). The proposed system takes over-segmented low-resolution images as inputs, assimilates recent learning techniques of sparse coding (SC) and grouped multi-task lasso (GMTL), and leads eventually to a framework for joint figure-ground separation and interest object SR. The efficiency of our framework is manifested in our experiments with subsets of the VOC2009 and MSRC datasets. We also demonstrate several interesting vision applications that can build on our system.

Qiang Chen

What is connected

Connect this record

See the researcher in context

Building this map preview

31 published item(s)

LEGATO: Good Identity Unlearning Is Continuous

Programmable ultra-broadband photonic chaos platform enabled by microwave-chaos-driven electro-optic frequency combs

Test-time generative augmentation for medical image segmentation

Tools as Continuous Flow for Evolving Agentic Reasoning

MS-DETR: Efficient DETR Training with Mixed Supervision

Spinon continuum in the Heisenberg quantum chain compound Sr$_2$V$_3$O$_9$

HIH: Towards More Accurate Face Alignment via Heatmap in Heatmap

Image Magnification Network for Vessel Segmentation in OCTA Images

Improving Transferability for Domain Adaptive Detection Transformers

Label Adversarial Learning for Skeleton-level to Pixel-level Adjustable Vessel Segmentation

MixFormer: Mixing Features across Windows and Dimensions

The consistent behavior of negative Poissons ratio with interlayer interactions

G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for Biomarker Identification and Disease Classification

Gauge invariant canonical symplectic algorithms for real-time lattice strong-field quantum electrodynamics

Compact Global Descriptor for Neural Networks

Fixed-Time Cooperative Tracking Control for Double-Integrator Multi-Agent Systems: A Time-Based Generator Approach

Simulation of Skin Stretching around the Forehead Wrinkles in Rhytidectomy

SpatialFlow: Bridging All Tasks for Panoptic Segmentation

A Fast Factorization-based Approach to Robust PCA

Criticality-Enhanced Magnetocaloric Effect in Quantum Spin Chain Material Copper Nitrate

A review of plasma liquid interactions for nanomaterial synthesis

Cross-domain Image Retrieval with a Dual Attribute-aware Ranking Network

LogDet Rank Minimization with Application to Subspace Clustering

Green tea induced gold nanostar synthesis mediated by Ag(I) ions

Network In Network

Preliminary Study on the RF tuning of CSNS DTL

Surface Engineering of Synthetic Nanopores by Atomic Layer Deposition and Their Applications

On rigidity of gradient Kähler-Ricci solitons with harmonic Bochner tensor

On Bach flat warped product Einstein manifolds

On Locally Conformally Flat Gradient Steady Ricci Solitons

Selective Image Super-Resolution