Researcher profile

Zhimin Zhang

Zhimin Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
21works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

21 published item(s)

preprint2024arXiv

InvariantOODG: Learning Invariant Features of Point Clouds for Out-of-Distribution Generalization

The convenience of 3D sensors has led to an increase in the use of 3D point clouds in various applications. However, the differences in acquisition devices or scenarios lead to divergence in the data distribution of point clouds, which requires good generalization of point cloud representation learning methods. While most previous methods rely on domain adaptation, which involves fine-tuning pre-trained models on target domain data, this may not always be feasible in real-world scenarios where target domain data may be unavailable. To address this issue, we propose InvariantOODG, which learns invariability between point clouds with different distributions using a two-branch network to extract local-to-global features from original and augmented point clouds. Specifically, to enhance local feature learning of point clouds, we define a set of learnable anchor points that locate the most useful local regions and two types of transformations to augment the input point clouds. The experimental results demonstrate the effectiveness of the proposed model on 3D domain generalization benchmarks.

preprint2022arXiv

A new family of nonconforming elements with $\pmb{H}(\mathrm{curl})$-continuity for the three-dimensional quad-curl problem

We propose and analyze a new family of nonconforming finite elements for the three-dimensional quad-curl problem. The proposed finite element spaces are subspaces of $\pmb{H}(\mathrm{curl})$, but not of $\pmb{H}(\mathrm{grad}~\mathrm{curl})$, which are different from the existing nonconforming ones. The well-posedness of the discrete problem is proved and optimal error estimates in discrete $\pmb{H}(\mathrm{grad}~\mathrm{curl})$ norm, $\pmb{H}(\mathrm{curl})$ norm and $\pmb{L}^2$ norm are derived. Numerical experiments are provided to illustrate the good performance of the method and confirm our theoretical predictions.

preprint2022arXiv

An hp-version interior penalty discontinuous Galerkin method for the quad-curl eigenvalue problem

An hp-version interior penalty discontinuous Galerkin (IPDG) method under nonconforming meshes is proposed to solve the quad-curl eigenvalue problem. We prove well-posedness of the numerical scheme for the quad-curl equation and then derive an error estimate in a mesh-dependent norm, which is optimal with respect to h but has different p-version error bounds under conforming and nonconforming tetrahedron meshes. The hp-version discrete compactness of the DG space is established for the convergence proof. The performance of the method is demonstrated by numerical experiments using conforming/nonconforming meshes and h-version/p-version refinement. The optimal h-version convergence rate and the exponential p-version convergence rate are observed.

preprint2022arXiv

Improving Stack Overflow question title generation with copying enhanced CodeBERT model and bi-modal information

Context: Stack Overflow is very helpful for software developers who are seeking answers to programming problems. Previous studies have shown that a growing number of questions are of low quality and thus obtain less attention from potential answerers. Gao et al. proposed an LSTM-based model (i.e., BiLSTM-CC) to automatically generate question titles from the code snippets to improve the question quality. However, only using the code snippets in the question body cannot provide sufficient information for title generation, and LSTMs cannot capture the long-range dependencies between tokens. Objective: This paper proposes CCBERT, a deep learning based novel model to enhance the performance of question title generation by making full use of the bi-modal information of the entire question body. Method: CCBERT follows the encoder-decoder paradigm and uses CodeBERT to encode the question body into hidden representations, a stacked Transformer decoder to generate predicted tokens, and an additional copy attention layer to refine the output distribution. Both the encoder and decoder perform the multi-head self-attention operation to better capture the long-range dependencies. This paper builds a dataset containing around 200,000 high-quality questions filtered from the data officially published by Stack Overflow to verify the effectiveness of the CCBERT model. Results: CCBERT outperforms all the baseline models on the dataset. Experiments on both code-only and low-resource datasets show the superiority of CCBERT with less performance degradation. The human evaluation also shows the excellent performance of CCBERT concerning both readability and correlation criteria.

preprint2022arXiv

Nonconforming finite element approximations and the analysis of Nitsche's method for a singularly perturbed quad-curl problem in three dimensions

We introduce and analyze a robust nonconforming finite element method for a three dimensional singularly perturbed quad-curl model problem. For the solution of the model problem, we derive proper a priori bounds, based on which, we prove that the proposed finite element method is robust with respect to the singular perturbation parameter $\varepsilon$ and the numerical solution is uniformly convergent with order $h^{1/2}$. In addition, we investigate the effect of treating the second boundary condition weakly by Nitsche&#39;s method. We show that such a treatment leads to sharper error estimates than imposing the boundary condition strongly when the parameter $\varepsilon< h$. Finally, numerical experiments are provided to illustrate the good performance of the method and confirm our theoretical predictions.

preprint2022arXiv

Unsupervised Manga Character Re-identification via Face-body and Spatial-temporal Associated Clustering

In the past few years, there has been a dramatic growth in e-manga (electronic Japanese-style comics). Faced with the booming demand for manga research and the large amount of unlabeled manga data, we raised a new task, called unsupervised manga character re-identification. However, the artistic expression and stylistic limitations of manga pose many challenges to the re-identification problem. Inspired by the idea that some content-related features may help clustering, we propose a Face-body and Spatial-temporal Associated Clustering method (FSAC). In the face-body combination module, a face-body graph is constructed to solve problems such as exaggeration and deformation in artistic creation by using the integrity of the image. In the spatial-temporal relationship correction module, we analyze the appearance features of characters and design a temporal-spatial-related triplet loss to fine-tune the clustering. Extensive experiments on a manga book dataset with 109 volumes validate the superiority of our method in unsupervised manga character re-identification.

preprint2022arXiv

When Six Degrees of Separation Meets Online Social Networks: How Low Can the Degree Be?

The proposal of Six Degrees of Separation makes people realize that the world is not as big as we imagined. Even if the world&#39;s population now exceeds 7 billion, two strangers can still get in touch through a limited intermediary. When online social networks have taken the world by storm, can people connect through shorter distances? This issue is worth thinking about. This paper describes Six Degrees of Separation and the limitations of this theory in practical applications. Combined with online social networks, the paper analyzes the development and change of the degrees of connection between people. Finally, the paper considers the actual coverage of online social networks and other issues, and rationalizes the possible degrees of the future Six Degrees of Separation.

preprint2021arXiv

Unconditionally optimal convergence of an energy-conserving and linearly implicit scheme for nonlinear wave equations

In this paper, we present and analyze an energy-conserving and linearly implicit scheme for solving the nonlinear wave equations. Optimal error estimates in time and superconvergent error estimates in space are established without time-step dependent on the spatial mesh size. The key is to estimate directly the solution bounds in the $H^2$-norm for both the nonlinear wave equation and the corresponding fully discrete scheme, while the previous investigations rely on the temporal-spatial error splitting approach. Numerical examples are presented to confirm energy-conserving properties, unconditional convergence, and optimal error estimates, respectively, of the proposed fully discrete schemes.

preprint2020arXiv

A $C^1$ Petrov-Galerkin method and Gauss collocation method for 1D general elliptic problems and superconvergence

In this paper, we present and study $C^1$ Petrov-Galerkin and Gauss collocation methods with arbitrary polynomial degree $k$ ($\ge 3$) for one-dimensional elliptic equations. We prove that, the solution and its derivative approximations converge with rate $2k-2$ at all grid points; and the solution approximation is superconvergent at all interior roots of a special Jacobi polynomial of degree $k+1$ in each element, the first-order derivative approximation is superconvergent at all interior $k-2$ Lobatto points, and the second-order derivative approximation is superconvergent at $k-1$ Gauss points, with an order of $k+2$, $k+1$, and $k$, respectively. As a by-product, we prove that both the Petrov-Galerkin solution and the Gauss collocation solution are superconvergent towards a particular Jacobi projection of the exact solution in $H^2$, $H^1$, and $L^2$ norms. All theoretical findings are confirmed by numerical experiments.

preprint2020arXiv

A new finite element approach for the Dirichlet eigenvalue problem

In this paper, we propose a new finite element approach, which is different than the classic Babuska-Osborn theory, to approximate Dirichlet eigenvalues. The Dirichlet eigenvalue problem is formulated as the eigenvalue problem of a holomorphic Fredholm operator function of index zero. Using conforming finite elements, the convergence is proved using the abstract approximation theory for holomorphic operator functions. The spectral indicator method is employed to compute the eigenvalues. A numerical example is presented to validate the theory.

preprint2020arXiv

A priori and a posteriori error estimates for the quad-curl eigenvalue problem

In this paper, we propose a new family of H(curl^2)-conforming elements for the quad-curl eigenvalue problem in 2D. The accuracy of this family is one order higher than that in [32]. We prove a priori and a posteriori error estimates. The a priori estimate of the eigenvalue with a convergence order 2(s-1) is obtained if the eigenvector u\in H^{s+1}(Ω). For the a posteriori estimate, by analyzing the associated source problem, we obtain lower and upper bounds for the eigenvector in an energy norm and an upper bound for the eigenvalues. Numerical examples are presented for validation.

preprint2020arXiv

Characterizing and Understanding GCNs on GPU

Graph convolutional neural networks (GCNs) have achieved state-of-the-art performance on graph-structured data analysis. Like traditional neural networks, training and inference of GCNs are accelerated with GPUs. Therefore, characterizing and understanding the execution pattern of GCNs on GPU is important for both software and hardware optimization. Unfortunately, to the best of our knowledge, there is no detailed characterization effort of GCN workloads on GPU. In this paper, we characterize GCN workloads at inference stage and explore GCN models on NVIDIA V100 GPU. Given the characterization and exploration, we propose several useful guidelines for both software optimization and hardware optimization for the efficient execution of GCNs on GPU.

preprint2020arXiv

Finite Element Calculation of Photonic Band Structures for Frequency Dependent Materials

We consider the calculation of the band structure of frequency dependent photonic crystals. The associated eigenvalue problem is nonlinear and it is challenging to develop effective convergent numerical methods. In this paper, the band structure problem is formulated as the eigenvalue problem of a holomorphic Fredholm operator function of index zero. Lagrange finite elements are used to discretize the operator function. Then the convergence of the eigenvalues is proved using the abstract approximation theory for holomorphic operator functions. A spectral indicator method is developed to practically compute the eigenvalues. Numerical examples are presented to validate the theory and show the effectiveness of the proposed method.

preprint2020arXiv

Finite element methods based on two families of second-order numerical formulas for the fractional Cable model with smooth solutions

We apply two families of novel fractional $θ$-methods, the FBT-$θ$ and FBN-$θ$ methods developed by the authors in previous work, to the fractional Cable model, in which the time direction is approximated by the fractional $θ$-methods, and the space direction is approximated by the finite element method. Some positivity properties of the coefficients for both of these methods are derived, which are crucial for the proof of the stability estimates. We analyse the stability of the scheme and derive an optimal convergence result with $O(τ^2+h^{r+1})$ for smooth solutions, where $τ$ is the time mesh size and $h$ is the spatial mesh size. Some numerical experiments with smooth and nonsmooth solutions are conducted to confirm our theoretical analysis. To overcome the singularity at initial value, the starting part is added to restore the second-order convergence rate in time.

preprint2020arXiv

HyGCN: A GCN Accelerator with Hybrid Architecture

In this work, we first characterize the hybrid execution patterns of GCNs on Intel Xeon CPU. Guided by the characterization, we design a GCN accelerator, HyGCN, using a hybrid architecture to efficiently perform GCNs. Specifically, first, we build a new programming model to exploit the fine-grained parallelism for our hardware design. Second, we propose a hardware design with two efficient processing engines to alleviate the irregularity of Aggregation phase and leverage the regularity of Combination phase. Besides, these engines can exploit various parallelism and reuse highly reusable data efficiently. Third, we optimize the overall system via inter-engine pipeline for inter-phase fusion and priority-based off-chip memory access coordination to improve off-chip bandwidth utilization. Compared to the state-of-the-art software framework running on Intel Xeon CPU and NVIDIA V100 GPU, our work achieves on average 1509$\times$ speedup with 2500$\times$ energy reduction and average 6.5$\times$ speedup with 10$\times$ energy reduction, respectively.

preprint2020arXiv

Three families of grad-div-conforming finite elements

Several smooth finite element de Rham complexes are constructed in three-dimensional space, which yield three families of grad-div conforming finite elements. The simplest element has only 8 degrees of freedom (DOFs) for a tetrahedron and 14 DOFs for a cuboid. These elements naturally lead to conforming approximations to quad-div problems. Numerical experiments for each family validate the correctness and efficiency of the elements for solving the quad-div problem.

preprint2019arXiv

Analysis of adaptive BDF2 scheme for diffusion equations

The variable two-step backward differentiation formula (BDF2) is revisited via a new theoretical framework using the positive semi-definiteness of BDF2 convolution kernels and a class of orthogonal convolution kernels. We prove that, if the adjacent time-step ratios $r_k:=τ_k/τ_{k-1}\le(3+\sqrt{17})/2\approx3.561$, the adaptive BDF2 time-stepping scheme for linear reaction-diffusion equations is unconditionally stable and (maybe, first-order) convergent in the $L^2$ norm. The second-order temporal convergence can be recovered if almost all of time-step ratios $r_k\le 1+\sqrt{2}$ or some high-order starting scheme is used. Specially, for linear dissipative diffusion problems, the stable BDF2 method preserves both the energy dissipation law (in the $H^1$ seminorm) and the $L^2$ norm monotonicity at the discrete levels. An example is included to support our analysis.

preprint2019arXiv

Stimulated emission depletion microscopy with array detection and photon reassignment

We propose a novel stimulated emission depletion (STED) microscopy based on array detection and photon reassignment. By replacing the single-point detector in traditional STED with a detector array and utilizing the photon reassignment method to recombine the images acquired by each detector, the final photon reassignment STED (prSTED) image could be obtained. We analyze the principle and imaging characteristics of prSTED, and the results indicate that, compared with traditional STED, prSTED can improve the signal-to-noise ratio (SNR) of the image by increasing the obtained photon flux while maintaining the original spatial resolution of STED. In addition, the SNR and resolution of prSTED are strongly correlated with the intensity of depletion beam. Corresponding theoretical and experimental analysis about this feature are also conducted. In general, considering the enhanced signal strength, imaging speed and compatibility with some other imaging techniques, we believe prSTED would be a helpful promotion in biomedical imaging.

preprint2018arXiv

Sharp $H^1$-norm error estimates of two time-stepping schemes for reaction-subdiffusion problems

Due to the intrinsically initial singularity of solution and the discrete convolution form in numerical Caputo derivatives, the traditional $H^1$-norm analysis (corresponding to the case for a classical diffusion equation) to the time approximations of a fractional subdiffusion problem always leads to suboptimal error estimates (a loss of time accuracy). To recover the theoretical accuracy in time, we propose an improved discrete Grönwall inequality and apply it to the well-known L1 formula and a fractional Crank-Nicolson scheme. With the help of a time-space error-splitting technique and the global consistency analysis, sharp $H^1$-norm error estimates of the two nonuniform approaches are established for a reaction-subdiffusion problems. Numerical experiments are included to confirm the sharpness of our analysis.

preprint2012arXiv

Flux-conserving finite element methods

We analyze the flux conservation property of the finite element method. It is shown that the finite element solution does approximate the flux locally in the optimal order, i.e., the same order as that of the nodal interpolation operator. We propose two methods, post-processing the finite element solutions locally. The new solutions, remaining as optimal-order solutions, are flux-conserving elementwise. In one of our methods, the processed solution also satisfies the original finite element equations. While the high-order finite volume schemes are still under construction, our methods produce finite-volume-like finite element solution of any order. In particular, our methods avoid solving non-symmetric finite volume equations. Numerical tests in 2D and 3D verify our findings.

preprint2012arXiv

Superconvergence Points of Spectral Interpolation

In this work, we study superconvergence properties for some high-order orthogonal polynomial interpolations.The results are two-folds: When interpolating function values, we identify those points where the first and second derivatives of the interpolant converge faster;When interpolating the first derivative,we locate those points where the function value of the interpolant superconverges. For the earlier case, we use various Chebyshev polynomials; and for the later case,we also include the counterpart Legendre polynomials.