Source author record

Xiaojun Chen

Xiaojun Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Catalog footprint

What is connected

37 works

31 topics

4 close collaborators

preprint 2026 arXiv

Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures

Existing ViT backdoor attacks based on backbone-overwriting full-tuning are computationally expensive and inflict performance degradation. This has forced adversaries towards the Visual Parameter-Efficient Fine-Tuning (PEFT) paradigm, dominated by adapter-based (e.g., LoRA) and prompt-based (e.g., VPT) approaches. While adapter security has seen initial study, the risks of the burgeoning prompt-based ecosystem remain critically unexplored. We fill this critical gap, exposing how the evolution of VPT towards dynamic and context-aware architectures can facilitate a far more dangerous and emergent threat. This vulnerability arises even though these dynamic modules unlock superior benign performance. We propose VIPER, an attack framework built on a lightweight, dynamic Visual Prompt Generator (VPG) that demonstrates this vulnerability. Critically, this dynamic architecture enables Functional Fusion: an emergent phenomenon where malicious logic and benign task utility are tightly fused into the same sparse, high-magnitude parameter core. This fusion creates a formidable "hostage" dilemma, as pruning the attack necessarily destroys the benign performance. Comprehensive evaluations show VIPER effectively addresses the attacker's trilemma: VIPER not only achieves state-of-the-art performance on clean data, but also maintains near-100% ASR even under 90% VPG-module pruning (where LoRA attacks collapse), while adding only an imperceptible 0.06 ms (1.16%) of inference latency. VIPER's results, driven by Functional Fusion, expose a new, paradigm-level risk in dynamic prompt architectures.

preprint 2026 arXiv

PointSLAM++: Robust Dense Neural Gaussian Point Cloud-based SLAM

Real-time 3D reconstruction is crucial for robotics and augmented reality, yet current simultaneous localization and mapping (SLAM) approaches often struggle to maintain structural consistency and robust pose estimation in the presence of depth noise. This work introduces PointSLAM++, a novel RGB-D SLAM system that leverages a hierarchically constrained neural Gaussian representation to preserve structural relationships while generating Gaussian primitives for scene mapping. It also employs progressive pose optimization to mitigate depth sensor noise, significantly enhancing localization accuracy. Furthermore, it utilizes a dynamic neural representation graph that adjusts the distribution of Gaussian nodes based on local geometric complexity, enabling the map to adapt to intricate scene details in real time. This combination yields high-precision 3D mapping and photorealistic scene rendering. Experimental results show PointSLAM++ outperforms existing 3DGS-based SLAM methods in reconstruction accuracy and rendering quality, demonstrating its advantages for large-scale AR and robotics.

preprint 2022 arXiv

An Inexact Augmented Lagrangian Algorithm for Training Leaky ReLU Neural Network with Group Sparsity

The leaky ReLU network with a group sparse regularization term has been widely used in recent years. However, training such a network yields a nonsmooth nonconvex optimization problem, and there is a lack of approaches to compute a stationary point deterministically. In this paper, we first resolve the multi-layer composite term in the original optimization problem by introducing auxiliary variables and additional constraints. We show the new model has a nonempty and bounded solution set and its feasible set satisfies the Mangasarian-Fromovitz constraint qualification. Moreover, we show the relationship between the new model and the original problem. Remarkably, we propose an inexact augmented Lagrangian algorithm for solving the new model and show the convergence of the algorithm to a KKT point. Numerical experiments demonstrate that our algorithm is more efficient for training sparse leaky ReLU neural networks than some well-known algorithms.

preprint 2022 arXiv

An Optimal Control Problem with Terminal Stochastic Linear Complementarity Constraints

In this paper, we investigate an optimal control problem with terminal stochastic linear complementarity constraints (SLCC), and its discrete approximation using the relaxation, the sample average approximation (SAA) and the implicit Euler time-stepping scheme. We show the existence of feasible solutions and optimal solutions to the optimal control problem and its discrete approximation under the conditions that the expectation of the stochastic matrix in the SLCC is a Z-matrix or an adequate matrix. Moreover, we prove that the solution sequence generated by the discrete approximation converges to a solution of the original optimal control problem with probability 1 as $ε\downarrow 0$, $ν\to \infty $ and $h\downarrow 0$, where $ε$ is the relaxation parameter, $ν$ is the sample size and $h$ is the mesh size. We also provide asymptotics of the SAA optimal value and error bounds of the time-stepping method. A numerical example is used to illustrate the existence of optimal solutions, the discretization scheme and error estimation.

preprint 2022 arXiv

CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries

The lack of creative ability in abstractive methods is a particular problem in automatic text summarization. The summaries generated by models are mostly extracted from the source articles. One of the main causes of this problem is the lack of datasets with abstractiveness, especially for Chinese. In order to solve this problem, we paraphrase the reference summaries in CLTS, the Chinese Long Text Summarization dataset, correct errors of factual inconsistencies, and propose the first Chinese Long Text Summarization dataset with a high level of abstractiveness, CLTS+, which contains more than 180K article-summary pairs and is available online. Additionally, we introduce an intrinsic metric based on co-occurrence words to evaluate the dataset we constructed. We analyze the extraction strategies used in CLTS+ summaries against other datasets to quantify the abstractiveness and difficulty of our new data and train several baselines on CLTS+ to verify its utility for improving the creative ability of models.

preprint 2022 arXiv

Deep Unsupervised Hashing with Latent Semantic Components

Deep unsupervised hashing has been appreciated in the regime of image retrieval. However, most prior art fails to detect the semantic components and their relationships behind the images, which makes it lack discriminative power. To remedy this defect, we propose a novel Deep Semantic Components Hashing (DSCH), which builds on the common-sense prior that an image normally contains a bunch of semantic components with homology and co-occurrence relationships. Based on this prior, DSCH regards the semantic components as latent variables under the Expectation-Maximization framework and designs a two-step iterative algorithm with the objective of maximum likelihood of training data. Firstly, DSCH constructs a semantic component structure by uncovering the fine-grained semantic components of images with a Gaussian Mixture Model (GMM), where an image is represented as a mixture of multiple components, and the semantic co-occurrence is exploited. Besides, coarse-grained semantic components are discovered by considering the homology relationships between fine-grained components, and the hierarchical organization is then constructed. Secondly, DSCH makes the images close to their semantic component centers at both fine-grained and coarse-grained levels, and also makes images that share similar semantic components close to each other. Extensive experiments on three benchmark datasets demonstrate that the proposed hierarchical semantic components indeed facilitate the hashing model to achieve superior performance.

preprint 2022 arXiv

Group sparse optimization for inpainting of random fields on the sphere

We propose a group sparse optimization model for inpainting of a square-integrable isotropic random field on the unit sphere, where the field is represented by spherical harmonics with random complex coefficients. In the proposed optimization model, the variable is an infinite-dimensional complex vector and the objective function is a real-valued function defined by a hybrid of the $\ell_2$ norm and non-Lipschitz $\ell_p$ $(0<p<1)$ norm that preserves the rotational invariance property and group structure of the random complex coefficients. We show that the infinite-dimensional optimization problem is equivalent to a convexly-constrained finite-dimensional optimization problem. Moreover, we propose a smoothing penalty algorithm to solve the finite-dimensional problem via unconstrained optimization problems. We provide an approximation error bound of the inpainted random field defined by a scaled KKT point of the constrained optimization problem in the square-integrable space on the sphere with probability measure. Finally, we conduct numerical experiments on band-limited random fields on the sphere and images from Earth topography data to show the promising performance of the smoothing penalty algorithm for inpainting of random fields on the sphere.

preprint 2022 arXiv

Identifying Electrocardiogram Abnormalities Using a Handcrafted-Rule-Enhanced Neural Network

A large number of people suffer from life-threatening cardiac abnormalities, and electrocardiogram (ECG) analysis is beneficial to determining whether an individual is at risk of such abnormalities. Automatic ECG classification methods, especially the deep learning based ones, have been proposed to detect cardiac abnormalities using ECG records, showing good potential to improve clinical diagnosis and help early prevention of cardiovascular diseases. However, the predictions of the known neural networks still do not satisfactorily meet the needs of clinicians, and this phenomenon suggests that some information used in clinical diagnosis may not be well captured and utilized by these methods. In this paper, we introduce some rules into convolutional neural networks, which help present clinical knowledge to deep learning based ECG analysis, in order to improve automated ECG diagnosis performance. Specifically, we propose a Handcrafted-Rule-enhanced Neural Network (called HRNN) for ECG classification with standard 12-lead ECG input, which consists of a rule inference module and a deep learning module. Experiments on two large-scale public ECG datasets show that our new approach considerably outperforms existing state-of-the-art methods. Further, our proposed approach not only can improve the diagnosis performance, but also can assist in detecting mislabelled ECG samples. Our codes are available at https://github.com/alwaysbyx/ecg_processing.

preprint 2022 arXiv

Linearly-constrained nonsmooth optimization for training autoencoders

A regularized minimization model with $l_1$-norm penalty (RP) is introduced for training the autoencoders that belong to a class of two-layer neural networks. We show that the RP can act as an exact penalty model which shares the same global minimizers, local minimizers, and d(irectional)-stationary points with the original regularized model under mild conditions. We construct a bounded box region that contains at least one global minimizer of the RP, and propose a linearly constrained regularized minimization model with $l_1$-norm penalty (LRP) for training autoencoders. A smoothing proximal gradient algorithm is designed to solve the LRP. Convergence of the algorithm to a generalized d-stationary point of the RP and LRP is delivered. Comprehensive numerical experiments convincingly illustrate the efficiency as well as the robustness of the proposed algorithm.

preprint 2022 arXiv

Optimality conditions for nonsmooth nonconvex-nonconcave min-max problems and generative adversarial networks

This paper considers a class of nonsmooth nonconvex-nonconcave min-max problems in machine learning and games. We first provide sufficient conditions for the existence of global minimax points and local minimax points. Next, we establish the first-order and second-order optimality conditions for local minimax points by using directional derivatives. These conditions reduce to smooth min-max problems with Fréchet derivatives. We apply our theoretical results to generative adversarial networks (GANs) in which two neural networks contest with each other in a game. Examples are used to illustrate applications of the new theory for training GANs.

preprint 2022 arXiv

PrUE: Distilling Knowledge from Sparse Teacher Networks

Although deep neural networks have enjoyed remarkable success across a wide variety of tasks, their ever-increasing size also imposes significant overhead on deployment. To compress these models, knowledge distillation was proposed to transfer knowledge from a cumbersome (teacher) network into a lightweight (student) network. However, guidance from a teacher does not always improve the generalization of students, especially when the size gap between student and teacher is large. Previous works argued that it was due to the high certainty of the teacher, resulting in harder labels that were difficult to fit. To soften these labels, we present a pruning method termed Prediction Uncertainty Enlargement (PrUE) to simplify the teacher. Specifically, our method aims to decrease the teacher's certainty about data, thereby generating soft predictions for students. We empirically investigate the effectiveness of the proposed method with experiments on CIFAR-10/100, Tiny-ImageNet, and ImageNet. Results indicate that student networks trained with sparse teachers achieve better performance. Besides, our method allows researchers to distill knowledge from deeper networks to improve students further. Our code is made public at: https://github.com/wangshaopu/prue.

preprint 2022 arXiv

Twisted bi-symplectic structure on Koszul twisted Calabi-Yau algebras

For a Koszul Artin-Schelter regular algebra (also called twisted Calabi-Yau algebra), we show that it has a "twisted" bi-symplectic structure, which may be viewed as a noncommutative and twisted analogue of the shifted symplectic structure introduced by Pantev, Toën, Vaquié and Vezzosi. This structure gives a quasi-isomorphism between the tangent complex and the twisted cotangent complex of the algebra, and may be viewed as a DG enhancement of Van den Bergh's noncommutative Poincaré duality; it also induces a twisted symplectic structure on its derived representation schemes.

preprint 2021 arXiv

On the Łojasiewicz Exponent of the Quadratic Sphere Constrained Optimization Problem

In this paper, we prove that the global version of the Łojasiewicz gradient inequality holds for the quadratic sphere constrained optimization problem with exponent $θ=\frac{3}{4}$. An example from Ting Kei Pong shows that $θ=\frac{3}{4}$ is tight. This is the first Łojasiewicz gradient inequality established for the sphere constrained optimization problem with a linear term.

preprint 2021 arXiv

Pure Characteristics Demand Models and Distributionally Robust Mathematical Programs with Stochastic Complementarity Constraints

We formulate pure characteristics demand models under uncertainties of probability distributions as distributionally robust mathematical programs with stochastic complementarity constraints (DRMP-SCC). For any fixed first-stage variable and a random realization, the second-stage problem of DRMP-SCC is a monotone linear complementarity problem (LCP). To deal with uncertainties of probability distributions of the involved random variables in the stochastic LCP, we use the distributionally robust approach. Moreover, we propose an approximation problem with regularization and discretization to solve DRMP-SCC, which is a two-stage nonconvex-nonconcave minimax optimization problem. We prove the convergence of the approximation problem to DRMP-SCC regarding the optimal solution sets, optimal values and stationary points as the regularization parameter goes to zero and the sample size goes to infinity. Finally, preliminary numerical results for investigating distributional robustness of pure characteristics demand models are reported to illustrate the effectiveness and efficiency of our approaches.

preprint 2020 arXiv

A weakly supervised registration-based framework for prostate segmentation via the combination of statistical shape model and CNN

Precise determination of the target is an essential procedure in prostate interventions, such as prostate biopsy, lesion detection and targeted therapy. However, prostate delineation may be tough in some cases due to tissue ambiguity or a partially missing anatomical boundary. To address this problem, we proposed a weakly supervised registration-based framework for precise prostate segmentation, by combining a convolutional neural network (CNN) with a statistical shape model (SSM). To obtain the prostate region, an inception-based neural network (SSM-Net) was first exploited to predict the model transform, shape control parameters and a fine-tuning vector, for the generation of the prostate boundary. According to the inferred boundary, a normalized distance map was calculated. Then, a residual U-net (ResU-Net) was employed to predict a probability label map from the input images. Finally, the average of the distance map and the probability map was regarded as the prostate segmentation. After that, two public datasets, PROMISE12 and NCI-ISBI 2013, were utilized for the model computation and for the network training and testing. The validation results demonstrate that the segmentation framework using an SSM with 9500 nodes achieved the best performance, with a dice of 0.904 and an average surface distance of 1.88 mm. In addition, we verified the impact of model elasticity augmentation and the fine-tuning item on the network segmentation capability. As a result, both factors have improved the delineation accuracy, with dice increased by 10% and 7% respectively. In conclusion, via the combination of two weakly supervised neural networks, our segmentation method might be an effective and robust approach for prostate segmentation.

preprint 2020 arXiv

An exact penalty approach for optimization with nonnegative orthogonality constraints

Optimization with nonnegative orthogonality constraints has wide applications in machine learning and data sciences. It is NP-hard due to some combinatorial properties of the constraints. We first propose an equivalent optimization formulation with nonnegative and multiple spherical constraints and an additional single nonlinear constraint. Various constraint qualifications and the first- and second-order optimality conditions of the equivalent formulation are discussed. By establishing a local error bound of the feasible set, we design a class of (smooth) exact penalty models via keeping the nonnegative and multiple spherical constraints. The penalty models are exact if the penalty parameter is sufficiently large, rather than going to infinity. A practical penalty algorithm with postprocessing is then developed. It uses a second-order method to approximately solve a series of subproblems with nonnegative and multiple spherical constraints. We study the asymptotic convergence of the penalty algorithm and establish that any limit point is a weakly stationary point of the original problem and becomes a stationary point under some additional mild conditions. Extensive numerical results on the projection problem, orthogonal nonnegative matrix factorization problems and the K-indicators model show the effectiveness of our proposed approach.

preprint 2020 arXiv

Calabi-Yau algebras and the shifted noncommutative symplectic structure

In this paper we show that for a Koszul Calabi-Yau algebra, there is a shifted bi-symplectic structure in the sense of Crawley-Boevey-Etingof-Ginzburg, on the cobar construction of its co-unitalized Koszul dual coalgebra, and hence its DG representation schemes, in the sense of Berest-Khachatryan-Ramadoss, have a shifted symplectic structure in the sense of Pantev-Toën-Vaquié-Vezzosi.

preprint 2020 arXiv

Equilibrium Oil Market Share under the COVID-19 Pandemic

Equilibrium models for energy markets under uncertain demand and supply have attracted considerable attention. This paper focuses on modelling crude oil market share under the COVID-19 pandemic using two-stage stochastic equilibrium. We describe the uncertainties in the demand and supply by random variables and provide two types of production decisions (here-and-now and wait-and-see). The here-and-now decision in the first stage does not depend on the outcome of random events to be revealed in the future and the wait-and-see decision in the second stage is allowed to depend on the random events in the future and adjust the feasibility of the here-and-now decision in rare unexpected scenarios such as those observed during the COVID-19 pandemic. We develop a fast algorithm to find a solution of the two-stage stochastic equilibrium. We show the robustness of the two-stage stochastic equilibrium model for forecasting the oil market share using the real market data from January 2019 to May 2020.

preprint 2020 arXiv

Incorporating Uncertain Segmentation Information into Chinese NER for Social Media Text

Chinese word segmentation is necessary to provide word-level information for Chinese named entity recognition (NER) systems. However, segmentation error propagation is a challenge for Chinese NER while processing colloquial data like social media text. In this paper, we propose a model (UIcwsNN) that specializes in identifying entities from Chinese social media text, especially by leveraging ambiguous information of word segmentation. Such uncertain information contains all the potential segmentation states of a sentence, which provides a channel for the model to infer deep word-level characteristics. We propose a trilogy (i.e., candidate position embedding -> position selective attention -> adaptive word convolution) to encode uncertain word segmentation information and acquire appropriate word-level representation. Experimental results on the social media corpus show that our model effectively alleviates the cascading of segmentation errors, and achieves a significant performance improvement of more than 2% over previous state-of-the-art methods.

preprint 2019 arXiv

Gravity algebra structure on the negative cyclic homology of Calabi-Yau algebras

In this paper, we study the gravity algebra structure on the negative cyclic homology or the cyclic cohomology of several classes of algebras. These algebras include: Calabi-Yau algebras, symmetric Frobenius algebras, unimodular Poisson algebras, and unimodular Frobenius Poisson algebras. The relationships among these gravity algebras are also discussed under some additional conditions.

preprint 2016 arXiv

A semi-automatic computer-aided method for surgical template design

This paper presents a generalized integrated framework of semi-automatic surgical template design. Several algorithms were implemented, including mesh segmentation, offset surface generation, collision detection and ruled surface generation, and a dedicated software tool named TemDesigner was developed. With a simple user interface, a customized template can be semi-automatically designed according to the preoperative plan. Firstly, mesh segmentation with signed scalar of vertex is utilized to partition the inner surface from the input surface mesh based on the indicated point loop. Then, the offset surface of the inner surface is obtained through contouring the distance field of the inner surface, and segmented to generate the outer surface. A ruled surface is employed to connect the inner and outer surfaces. Finally, drilling tubes are generated according to the preoperative plan through collision detection and merging. The method has been applied to the template design for various kinds of surgeries, including oral implantology, cervical pedicle screw insertion, iliosacral screw insertion and osteotomy, demonstrating the efficiency, functionality and generality of our method.

preprint 2016 arXiv

Alternating Direction Method of Multipliers for A Class of Nonconvex and Nonsmooth Problems with Applications to Background/Foreground Extraction

In this paper, we study a general optimization model, which covers a large class of existing models for many applications in imaging sciences. To solve the resulting possibly nonconvex, nonsmooth and non-Lipschitz optimization problem, we adapt the alternating direction method of multipliers (ADMM) with a general dual step-size to solve a reformulation that contains three blocks of variables, and analyze its convergence. We show that for any dual step-size less than the golden ratio, there exists a computable threshold such that if the penalty parameter is chosen above such a threshold and the sequence thus generated by our ADMM is bounded, then the cluster point of the sequence gives a stationary point of the nonconvex optimization problem. We achieve this via a potential function specifically constructed for our ADMM. Moreover, we establish the global convergence of the whole sequence if, in addition, this special potential function is a Kurdyka-Łojasiewicz function. Furthermore, we present a simple strategy for initializing the algorithm to guarantee boundedness of the sequence. Finally, we perform numerical experiments comparing our ADMM with the proximal alternating linearized minimization (PALM) proposed in [5] on the background/foreground extraction problem with real data. The numerical results show that our ADMM with a nontrivial dual step-size is efficient.

preprint 2016 arXiv

Automorphisms and Ideals of Noncommutative Deformations of $\mathbb{C}^2/\mathbb{Z}_2$

Let $O_τ(Γ)$ be a family of algebras \textit{quantizing} the coordinate ring of $\mathbb{C}^2 / Γ$, where $Γ$ is a finite subgroup of $\mathrm{SL}_2(\mathbb{C})$, and let $G_Γ$ be the automorphism group of $O_τ$. We study the natural action of $G_Γ$ on the space of right ideals of $O_τ$ (equivalently, finitely generated rank $1$ projective $O_τ$-modules). It is known that the latter can be identified with a disjoint union of algebraic (quiver) varieties, and this identification is $G_Γ$-equivariant. In the present paper, when $Γ\cong \mathbb{Z}_2$, we show that the $G_Γ$-action on each quiver variety is transitive. We also show that the natural embedding of $G_Γ$ into $\mathrm{Pic}(O_τ)$, the Picard group of $O_τ$, is an isomorphism. These results are used to prove that there are countably many non-isomorphic algebras Morita equivalent to $O_τ$, and explicit presentations of these algebras are given. Since algebras $O_τ(\mathbb{Z}_2)$ are isomorphic to primitive factors of $U(sl_2)$, we obtain a complete description of algebras Morita equivalent to primitive factors. A structure of the group $G_Γ$, where $Γ$ is an arbitrary cyclic group, is also investigated. Our results generalize earlier results obtained for the (first) Weyl algebra $A_1$.

preprint 2016 arXiv

Generalized Conjugate Gradient Methods for $\ell_1$ Regularized Convex Quadratic Programming with Finite Convergence

The conjugate gradient (CG) method is an efficient iterative method for solving large-scale strongly convex quadratic programming (QP). In this paper we propose some generalized CG (GCG) methods for solving the $\ell_1$-regularized (possibly not strongly) convex QP that terminate at an optimal solution in a finite number of iterations. At each iteration, our methods first identify a face of an orthant and then either perform an exact line search along the direction of the negative projected minimum-norm subgradient of the objective function or execute a CG subroutine that conducts a sequence of CG iterations until a CG iterate crosses the boundary of this face or an approximate minimizer over this face or a subface is found. We determine which type of step should be taken by comparing the magnitude of some components of the minimum-norm subgradient of the objective function to that of its remaining components. Our analysis of finite convergence of these methods makes use of an error bound result and some key properties of the aforementioned exact line search and the CG subroutine. We also show that the proposed methods are capable of finding an approximate solution of the problem by allowing some inexactness in the execution of the CG subroutine. The overall arithmetic operation cost of our GCG methods for finding an $ε$-optimal solution depends on $ε$ in $O(\log(1/ε))$, which is superior to the accelerated proximal gradient method [2,23] that depends on $ε$ in $O(1/\sqrt{ε})$. In addition, our GCG methods can be extended straightforwardly to solve box-constrained convex QP with finite convergence. Numerical results demonstrate that our methods are very favorable for solving ill-conditioned problems.

preprint 2016 arXiv

Linear Convergence of Proximal Gradient Algorithm with Extrapolation for a Class of Nonconvex Nonsmooth Minimization Problems

In this paper, we study the proximal gradient algorithm with extrapolation for minimizing the sum of a Lipschitz differentiable function and a proper closed convex function. Under the error bound condition used in [19] for analyzing the convergence of the proximal gradient algorithm, we show that there exists a threshold such that if the extrapolation coefficients are chosen below this threshold, then the sequence generated converges $R$-linearly to a stationary point of the problem. Moreover, the corresponding sequence of objective values is also $R$-linearly convergent. In addition, the threshold reduces to $1$ for convex problems and, as a consequence, we obtain the $R$-linear convergence of the sequence generated by FISTA with fixed restart. Finally, we present some numerical experiments to illustrate our results.
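As a rough illustration of the iteration this abstract describes, the sketch below runs a proximal gradient step with a fixed extrapolation coefficient on a toy separable problem. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `soft_threshold` and `prox_grad_extrapolated`, the choice of an $\ell_1$ term as the convex function, the convex quadratic as the smooth function, and the coefficient `beta=0.3`.

```python
def soft_threshold(v, t):
    """Proximal map of t*|.| at the scalar v (hypothetical helper)."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def prox_grad_extrapolated(a, c, lam, beta=0.3, iters=200):
    """Minimize sum_i 0.5*a[i]*(x[i]-c[i])**2 + lam*|x[i]| (toy problem).

    Each iteration extrapolates, takes a gradient step on the smooth part
    with step size 1/L, then applies the proximal map of the l1 term.
    beta is a fixed extrapolation coefficient chosen for illustration only.
    """
    L = max(a)  # Lipschitz constant of the gradient of the smooth part
    x_prev = [0.0] * len(a)
    x = [0.0] * len(a)
    for _ in range(iters):
        # Extrapolated point y = x + beta * (x - x_prev)
        y = [xi + beta * (xi - xpi) for xi, xpi in zip(x, x_prev)]
        # Gradient of the smooth part evaluated at y
        grad = [ai * (yi - ci) for ai, yi, ci in zip(a, y, c)]
        # Proximal step on the nonsmooth l1 part
        x_new = [soft_threshold(yi - gi / L, lam / L) for yi, gi in zip(y, grad)]
        x_prev, x = x, x_new
    return x


if __name__ == "__main__":
    print(prox_grad_extrapolated([1.0, 4.0], [2.0, -0.1], 0.5))
```

For this toy data the minimizer is available in closed form, since each coordinate solves a one-dimensional problem with solution `soft_threshold(c[i], lam / a[i])`, which gives a convenient check on the iterates. The sketch does not reproduce the paper's threshold on the extrapolation coefficients or its restart scheme.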

preprint 2016 arXiv

Penalty methods for a class of non-Lipschitz optimization problems

We consider a class of constrained optimization problems with a possibly nonconvex non-Lipschitz objective and a convex feasible set being the intersection of a polyhedron and a possibly degenerate ellipsoid. Such problems have a wide range of applications in data science, where the objective is used for inducing sparsity in the solutions while the constraint set models the noise tolerance and incorporates other prior information for data fitting. To solve this class of constrained optimization problems, a common approach is the penalty method. However, there is little theory on exact penalization for problems with nonconvex and non-Lipschitz objective functions. In this paper, we study the existence of exact penalty parameters regarding local minimizers, stationary points and $ε$-minimizers under suitable assumptions. Moreover, we discuss a penalty method whose subproblems are solved via a nonmonotone proximal gradient method with a suitable update scheme for the penalty parameters, and prove the convergence of the algorithm to a KKT point of the constrained problem. Preliminary numerical results demonstrate the efficiency of the penalty method for finding sparse solutions of underdetermined linear systems.

preprint 2016 arXiv

US-Cut: Interactive Algorithm for rapid Detection and Segmentation of Liver Tumors in Ultrasound Acquisitions

Ultrasound (US) is the most commonly used liver imaging modality worldwide. It plays an important role in follow-up of cancer patients with liver metastases. We present an interactive segmentation approach for liver tumors in US acquisitions. Due to the low image quality and the low contrast between the tumors and the surrounding tissue in US images, the segmentation is very challenging. Thus, the clinical practice still relies on manual measurement and outlining of the tumors in the US images. We target this problem by applying an interactive segmentation algorithm to the US data, allowing the user to get real-time feedback of the segmentation results. The algorithm has been developed and tested hand-in-hand by physicians and computer scientists to make sure a future practical usage in a clinical setting is feasible. To cover typical acquisitions from the clinical routine, the approach has been evaluated with dozens of datasets where the tumors are hyperechoic (brighter), hypoechoic (darker) or isoechoic (similar) in comparison to the surrounding liver tissue. Due to the interactive real-time behavior of the approach, it was possible even in difficult cases to find satisfying segmentations of the tumors within seconds and without parameter settings, and the average tumor deviation was only 1.4 mm compared with manual measurements. However, the long-term goal is to ease the volumetric acquisition of liver tumors in order to evaluate treatment response. An additional aim is the registration of intraoperative US images via the interactive segmentations to the patient's pre-interventional CT acquisitions.

preprint 2015 arXiv

A Double Poisson Algebra Structure on Fukaya Categories

Let $M$ be an exact symplectic manifold with $c_1(M)=0$. Denote by $\mathrm{Fuk}(M)$ the Fukaya category of $M$. We show that the dual space of the bar construction of $\mathrm{Fuk}(M)$ has a differential graded noncommutative Poisson structure. As a corollary we get a Lie algebra structure on the cyclic cohomology $\mathrm{HC}^\bullet(\mathrm{Fuk}(M))$, which is analogous to the ones discovered by Kontsevich in noncommutative symplectic geometry and by Chas and Sullivan in string topology.

preprint 2015 arXiv

Batalin-Vilkovisky algebras and the noncommutative Poincare duality of Koszul Calabi-Yau algebras

Let $A$ be a Koszul Calabi-Yau algebra. We show that there exists an isomorphism of Batalin-Vilkovisky algebras between the Hochschild cohomology ring of $A$ and that of its Koszul dual algebra $A^!$. This confirms (a generalization of) a conjecture of R.~Rouquier.

preprint 2015 arXiv

Interactive Volumetry Of Liver Ablation Zones

Percutaneous radiofrequency ablation (RFA) is a minimally invasive technique that destroys cancer cells by heat. The heat results from focusing energy in the radiofrequency spectrum through a needle. Among other things, this can enable the treatment of patients who are not eligible for open surgery. However, the possibility of recurrent liver cancer due to incomplete ablation of the tumor makes post-interventional monitoring via regular follow-up scans mandatory. These scans have to be carefully inspected for any conspicuous findings. Within this study, the RF ablation zones from twelve post-interventional CT acquisitions have been segmented semi-automatically to support the visual inspection. An interactive, graph-based contouring approach, which prefers spherically shaped regions, has been applied. For the quantitative and qualitative analysis of the algorithm's results, manual slice-by-slice segmentations produced by clinical experts have been used as the gold standard (and have also been compared with each other). As the evaluation metric for the statistical validation, the Dice Similarity Coefficient (DSC) has been calculated. The results show that the proposed tool provides lesion segmentation with sufficient accuracy much faster than manual segmentation. The visual feedback and interactivity make the proposed tool well suited to the clinical workflow.
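
As a rough illustration of the evaluation metric named in this abstract, the Dice Similarity Coefficient of two segmentations is twice the size of their overlap divided by the sum of their sizes. The sketch below assumes each segmentation is given as a set of voxel coordinates. The function name `dice_coefficient`, the empty-set convention and the toy masks are hypothetical and are not taken from the paper.

```python
def dice_coefficient(seg_a, seg_b):
    """Return DSC = 2|A ∩ B| / (|A| + |B|) for two sets of voxel coordinates."""
    a, b = set(seg_a), set(seg_b)
    if not a and not b:
        # Convention assumed here: two empty segmentations agree perfectly.
        return 1.0
    return 2.0 * len(a & b) / (len(a) + len(b))


# Toy example with made-up voxels: 3 voxels each, 2 of them shared.
manual = {(0, 0, 0), (0, 0, 1), (0, 1, 0)}
tool = {(0, 0, 1), (0, 1, 0), (1, 1, 1)}
score = dice_coefficient(manual, tool)  # 2 * 2 / (3 + 3)
```

A value of 1 means identical segmentations and 0 means no overlap, which is why the metric suits a comparison against expert-drawn contours.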

preprint 2015 arXiv

Spherical $t_ε$-Designs for Approximations on the Sphere

A spherical $t$-design is a set of points on the sphere that are nodes of a positive equal weight quadrature rule having algebraic accuracy $t$ for all spherical polynomials with degrees $\le t$. Spherical $t$-designs have many distinguished properties in approximations on the sphere and receive remarkable attention. Although the existence of a spherical $t$-design is known for any $t\ge 0$, a spherical design is only known in a set of interval enclosures on the sphere \cite{chen2011computational} for $t\le 100$. It is unknown how to choose a set of points from the set of interval enclosures to obtain a spherical $t$-design. In this paper we investigate a new concept of point sets on the sphere named spherical $t_ε$-design ($0<ε<1$), which are nodes of a positive weight quadrature rule with algebraic accuracy $t$. The sum of the weights is equal to the area of the sphere and the mean value of the weights is equal to the weight of the quadrature rule defined by the spherical $t$-design. A spherical $t_ε$-design is a spherical $t$-design when $ε=0,$ and a spherical $t$-design is a spherical $t_ε$-design for any $0<ε<1$. We show that any point set chosen from the set of interval enclosures \cite{chen2011computational} is a spherical $t_ε$-design. We then study the worst-case errors of quadrature rules using spherical $t_ε$-designs in a Sobolev space, and investigate a model of polynomial approximation with the $l_1$-regularization using spherical $t_ε$-designs. Numerical results illustrate good performance of spherical $t_ε$-designs for numerical integration and function approximation on the sphere.
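
To make the equal-weight quadrature property in this abstract concrete, the sketch below checks it for a small classical example. It assumes the octahedron vertices as the point set (a known spherical 3-design) and only handles degrees up to 3, where the exact sphere averages of monomials are 1, 1/3 or 0. The helper names are hypothetical, and the weighted $t_ε$ variant studied in the paper is not implemented here.

```python
from itertools import product

# Octahedron vertices on the unit sphere (a classical spherical 3-design).
OCTAHEDRON = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def sphere_mean_monomial(a, b, c):
    """Exact mean of x^a y^b z^c over the unit sphere, for a + b + c <= 3."""
    if a + b + c > 3:
        raise ValueError("this sketch only covers total degree <= 3")
    if a % 2 or b % 2 or c % 2:
        return 0.0  # odd exponents integrate to zero by symmetry
    # Remaining cases: the constant 1, or a single squared coordinate.
    return 1.0 if a + b + c == 0 else 1.0 / 3.0


def is_equal_weight_design(points, t, tol=1e-12):
    """Check that the equal-weight average matches the sphere mean up to degree t."""
    n = len(points)
    for a, b, c in product(range(t + 1), repeat=3):
        if a + b + c > t:
            continue
        avg = sum(x**a * y**b * z**c for x, y, z in points) / n
        if abs(avg - sphere_mean_monomial(a, b, c)) > tol:
            return False
    return True


ok = is_equal_weight_design(OCTAHEDRON, 3)
```

Checking monomials is enough because they span all polynomials of the given degree. A pair of antipodal points passes the test for degree 1 but fails for degree 2, since it averages $x^2$ to 0 instead of 1/3.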

preprint 2014 arXiv

Development of an open source software module for enhanced visualization during MR-guided interstitial gynecologic brachytherapy

In 2010, gynecologic malignancies were the 4th leading cause of death in U.S. women, and for patients with extensive primary or recurrent disease, treatment with interstitial brachytherapy may be an option. However, brachytherapy requires precise insertion of hollow catheters with introducers into the tumor in order to eradicate the cancer. In this study, a software solution to assist interstitial gynecologic brachytherapy has been investigated, and the software has been realized as a separate module for (3D) Slicer, a free open source software platform for (translational) biomedical research. The developed research module allows on-time processing of intra-operative magnetic resonance imaging (iMRI) data over a direct DICOM connection to an MR scanner. This is followed by a multi-stage registration of CAD models of the medical brachytherapy devices (template, obturator) to the patient's MR images, enabling the virtual placement of interstitial needles to assist the physician during the intervention.

preprint 2012 arXiv

Cyclic Homology of Fukaya Categories and the Linearized Contact Homology

Let $M$ be an exact symplectic manifold with contact type boundary such that $c_1(M)=0$. In this paper we show that the cyclic cohomology of the Fukaya category of $M$ has the structure of an involutive Lie bialgebra. Inspired by a work of Cieliebak-Latschev we show that there is a Lie bialgebra homomorphism from the linearized contact homology of $M$ to the cyclic cohomology of the Fukaya category. Our study is also motivated by string topology and 2-dimensional topological conformal field theory.

preprint 2012 arXiv

Noncommutative Poisson structures, derived representation schemes and Calabi-Yau algebras

Recently, William Crawley-Boevey proposed the definition of a Poisson structure on a noncommutative algebra $A$ based on the Kontsevich principle. His idea was to find the {\it weakest} possible structure on $A$ that induces standard (commutative) Poisson structures on all representation spaces $ \Rep_V(A) $. It turns out that such a weak Poisson structure on $A$ is a Lie algebra bracket on the 0-th cyclic homology $ \HC_0(A) $ satisfying some extra conditions; it was thus called an {\it $ H_0$-Poisson structure}. This paper studies a higher homological extension of this construction. In our more general setting, we show that noncommutative Poisson structures in the above sense behave nicely with respect to homotopy (in the sense that homotopy equivalent NC Poisson structures on $A$ induce (via the derived representation functor) homotopy equivalent Poisson algebra structures on the derived representation schemes $\DRep_V(A) $). For an ordinary algebra $A$, a noncommutative Poisson structure on a semifree (more generally, cofibrant) resolution of $A$ yields a graded (super) Lie algebra structure on the full cyclic homology $ \HC_\bullet(A) $ extending Crawley-Boevey's $H_0$-Poisson structure on $ \HC_0(A) $. We call such structures {\it derived Poisson structures} on $A$. We also show that derived Poisson structures do arise in nature: the cobar construction $Ω(C)$ of a $(-n)$-cyclic coassociative DG coalgebra (in particular, of the linear dual of a finite dimensional $n$-cyclic DG algebra) $C$ carries a $(2-n)$-double Poisson bracket in the sense of Van den Bergh. This in turn induces a corresponding noncommutative $(2-n)$-Poisson structure on $Ω(C)$. When (the semifree) DG algebra $Ω(C)$ resolves an honest algebra $A$, $A$ acquires a derived $(2-n)$-Poisson structure.

preprint 2011 arXiv

Complexity of Unconstrained L_2-L_p Minimization

We consider the unconstrained $L_2$-$L_p$ minimization: find a minimizer of $\|Ax-b\|^2_2+λ\|x\|^p_p$ for given $A \in R^{m\times n}$, $b\in R^m$ and parameters $λ>0$, $p\in [0,1)$. This problem has been studied extensively in variable selection and sparse least squares fitting for high dimensional data. Theoretical results show that the minimizers of the $L_2$-$L_p$ problem have various attractive features due to the concavity and non-Lipschitzian property of the regularization function $\|\cdot\|^p_p$. In this paper, we show that the $L_q$-$L_p$ minimization problem is strongly NP-hard for any $p\in [0,1)$ and $q\ge 1$, including its smoothed version. On the other hand, we show that, by choosing parameters $(p,λ)$ carefully, a minimizer, global or local, will have certain desired sparsity. We believe that these results provide new theoretical insights to the studies and applications of the concave regularized optimization problems.
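
For readers who want to see the objective from this abstract written out, the sketch below evaluates $\|Ax-b\|^2_2+λ\|x\|^p_p$ in plain Python. It assumes $A$ is a list of rows and reads the $p=0$ case as a count of nonzero entries. The function name `l2_lp_objective` and the toy data are hypothetical. The code only evaluates the objective and does not attempt the minimization, which the paper shows to be strongly NP-hard.

```python
def l2_lp_objective(A, b, x, lam, p):
    """Evaluate ||Ax - b||_2^2 + lam * ||x||_p^p for lam > 0 and 0 <= p < 1."""
    if lam <= 0 or not (0 <= p < 1):
        raise ValueError("expected lam > 0 and p in [0, 1)")
    # Residual r = Ax - b, computed row by row.
    residual = [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) - b_i
                for row, b_i in zip(A, b)]
    fit = sum(r * r for r in residual)
    # Zeros are skipped, so for p = 0 the sum counts the nonzero entries of x.
    penalty = sum(abs(x_j) ** p for x_j in x if x_j != 0)
    return fit + lam * penalty


# Toy data (made up): a 2x2 identity system with a sparse exact solution.
A = [[1.0, 0.0], [0.0, 1.0]]
b = [1.0, 0.0]
sparse_value = l2_lp_objective(A, b, [1.0, 0.0], lam=0.5, p=0.5)  # 0 + 0.5 * 1
zero_value = l2_lp_objective(A, b, [0.0, 0.0], lam=0.5, p=0.5)    # 1 + 0
```

In this toy case the sparse exact solution scores lower than the zero vector because $λ$ is small relative to the data-fit term. A larger $λ$ would reverse the ordering, which is the trade-off the parameter controls.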

preprint 2010 arXiv

Lie bialgebras and the cyclic homology of $A_\infty$ structures in topology

$A_\infty$ categories are a mathematical structure that appears in topological field theory, string topology, and symplectic topology. This paper studies the cyclic homology of a Calabi-Yau $A_\infty$ category, and shows that it is naturally an equivariant topological conformal field theory, and in particular, contains an involutive Lie bialgebra. Applications of the theory to string topology and the Fukaya category are given; in particular, it is shown that there is a Lie bialgebra homomorphism from the cyclic cohomology of the Fukaya category of a symplectic manifold with contact type boundary to the linearized contact homology of the boundary.

preprint 2009 arXiv

Quantization of the Lie bialgebra of string topology

Let M be a smooth, simply-connected, closed oriented manifold, and LM the free loop space of M. Using a Poincare duality model for M, we show that the reduced equivariant homology of LM has the structure of a Lie bialgebra, and we construct a Hopf algebra which quantizes the Lie bialgebra.

37 published item(s)

Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures

PointSLAM++: Robust Dense Neural Gaussian Point Cloud-based SLAM

An Inexact Augmented Lagrangian Algorithm for Training Leaky ReLU Neural Network with Group Sparsity

An Optimal Control Problem with Terminal Stochastic Linear Complementarity Constraints

CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries

Deep Unsupervised Hashing with Latent Semantic Components

Group sparse optimization for inpainting of random fields on the sphere

Identifying Electrocardiogram Abnormalities Using a Handcrafted-Rule-Enhanced Neural Network

Linearly-constrained nonsmooth optimization for training autoencoders

Optimality conditions for nonsmooth nonconvex-nonconcave min-max problems and generative adversarial networks

PrUE: Distilling Knowledge from Sparse Teacher Networks

Twisted bi-symplectic structure on Koszul twisted Calabi-Yau algebras

On the Łojasiewicz Exponent of the Quadratic Sphere Constrained Optimization Problem

Pure Characteristics Demand Models and Distributionally Robust Mathematical Programs with Stochastic Complementarity Constraints

A weakly supervised registration-based framework for prostate segmentation via the combination of statistical shape model and CNN

An exact penalty approach for optimization with nonnegative orthogonality constraints

Calabi-Yau algebras and the shifted noncommutative symplectic structure

Equilibrium Oil Market Share under the COVID-19 Pandemic

Incorporating Uncertain Segmentation Information into Chinese NER for Social Media Text

Gravity algebra structure on the negative cyclic homology of Calabi-Yau algebras

A semi-automatic computer-aided method for surgical template design

Alternating Direction Method of Multipliers for A Class of Nonconvex and Nonsmooth Problems with Applications to Background/Foreground Extraction

Automorphisms and Ideals of Noncommutative Deformations of $\mathbb{C}^2/\mathbb{Z}_2$

Generalized Conjugate Gradient Methods for $\ell_1$ Regularized Convex Quadratic Programming with Finite Convergence

Linear Convergence of Proximal Gradient Algorithm with Extrapolation for a Class of Nonconvex Nonsmooth Minimization Problems

Penalty methods for a class of non-Lipschitz optimization problems

US-Cut: Interactive Algorithm for rapid Detection and Segmentation of Liver Tumors in Ultrasound Acquisitions

A Double Poisson Algebra Structure on Fukaya Categories

Batalin-Vilkovisky algebras and the noncommutative Poincare duality of Koszul Calabi-Yau algebras

Interactive Volumetry Of Liver Ablation Zones

Spherical $t_ε$-Designs for Approximations on the Sphere

Development of an open source software module for enhanced visualization during MR-guided interstitial gynecologic brachytherapy

Cyclic Homology of Fukaya Categories and the Linearized Contact Homology

Noncommutative Poisson structures, derived representation schemes and Calabi-Yau algebras

Complexity of Unconstrained L_2-L_p Minimization

Lie bialgebras and the cyclic homology of $A_\infty$ structures in topology

Quantization of the Lie bialgebra of string topology