Source author record

Ze Chen

Ze Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Computer Vision, Biological Physics, Computation and Language, Information Retrieval, math.NA, nucl-ex, Numerical Analysis, physics.ins-det, physics.med-ph

Catalog footprint

What is connected

8 works

9 topics

4 close collaborators

preprint, 2023, arXiv

OPD@NL4Opt: An ensemble approach for the NER task of the optimization problem

In this paper, we present an ensemble approach for subtask 1 (the NER task) of the NL4Opt competition. For this task, we first fine-tune pretrained language models on the competition dataset. We then adopt differential learning rates and adversarial training strategies to improve model generalization and robustness. Finally, we use a model ensemble for the final prediction, which achieves a micro-averaged F1 score of 93.3% and wins second prize in the NER task.
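The abstract reports a micro-averaged F1 score but does not spell out how it is computed. The following is a minimal sketch, assuming entity-level scoring in which each entity is a hypothetical `(start, end, label)` span and a prediction counts as correct only on an exact match. The function name, the span format and the labels in the example are illustrative assumptions, not taken from the competition tooling.

```python
def micro_f1(gold, pred):
    """Micro-averaged F1 over sentences.

    gold, pred: lists (one entry per sentence) of sets of
    (start, end, label) spans. The span format is an assumption.
    """
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        tp += len(g & p)   # spans predicted exactly right
        fp += len(p - g)   # predicted spans that are not in the gold set
        fn += len(g - p)   # gold spans that were missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Illustrative data only: two sentences with made-up spans and labels.
gold = [{(0, 1, "VAR"), (3, 4, "LIMIT")}, {(2, 2, "PARAM")}]
pred = [{(0, 1, "VAR")}, {(2, 2, "PARAM"), (5, 6, "VAR")}]
score = micro_f1(gold, pred)
```

Micro-averaging pools the true-positive, false-positive and false-negative counts over all sentences and labels before computing precision and recall, so frequent entity types weigh more than rare ones.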

preprint, 2022, arXiv

A Semantic Alignment System for Multilingual Query-Product Retrieval

This paper describes our winning solution (team name: www) to the Amazon ESCI Challenge of KDD CUP 2022, which achieves an NDCG score of 0.9043 and takes first place on task 1, the query-product ranking track. Participants are provided with a real-world, large-scale multilingual shopping queries data set containing query-product pairs in English, Japanese and Spanish. The competition proposes three tasks: ranking the results list (task 1), classifying query-product pairs into Exact, Substitute, Complement, or Irrelevant (ESCI) categories (task 2), and identifying substitute products for a given query (task 3). We focus on task 1 and propose a semantic alignment system for multilingual query-product retrieval. Pre-trained multilingual language models (LMs) are adopted to obtain semantic representations of queries and products. Our models are first trained with cross-entropy loss to classify query-product pairs into the four ESCI categories, and we then take a weighted sum of the four class probabilities as the ranking score. To further boost the model, we also apply careful data preprocessing, data augmentation by translation, special handling of English texts with English LMs, adversarial training with AWP and FGM, self-distillation, pseudo labeling, label smoothing and ensembling. Our solution outperforms the others on both the public and private leaderboards.
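The step that turns four class probabilities into a single ranking score can be sketched as follows. This is only an illustration: the abstract does not give the weights, so the values in `ILLUSTRATIVE_WEIGHTS` are assumptions, and the function names and product identifiers are hypothetical.

```python
# Assumed weights for (Exact, Substitute, Complement, Irrelevant).
# These are placeholders for illustration, not the authors' values.
ILLUSTRATIVE_WEIGHTS = (1.0, 0.1, 0.01, 0.0)


def ranking_score(probs, weights=ILLUSTRATIVE_WEIGHTS):
    """Weighted sum of the four ESCI class probabilities."""
    return sum(w * p for w, p in zip(weights, probs))


def rank_products(candidates, weights=ILLUSTRATIVE_WEIGHTS):
    """Return product ids sorted from highest to lowest score.

    candidates: dict mapping product id -> 4-tuple of class probabilities.
    """
    return sorted(
        candidates,
        key=lambda pid: ranking_score(candidates[pid], weights),
        reverse=True,
    )


# Made-up classifier outputs for three products under one query.
candidates = {
    "B": (0.10, 0.60, 0.20, 0.10),
    "C": (0.05, 0.05, 0.10, 0.80),
    "A": (0.70, 0.20, 0.05, 0.05),
}
order = rank_products(candidates)
```

Collapsing the distribution into one scalar lets a model trained for classification be reused for ranking without a separate ranking loss. The choice of weights controls how much a likely Substitute or Complement contributes relative to a likely Exact match.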

preprint, 2022, arXiv

Dynamic Supervisor for Cross-dataset Object Detection

The application of cross-dataset training in object detection tasks is complicated because the inconsistency in the category range across datasets transforms fully supervised learning into semi-supervised learning. To address this problem, recent studies focus on the generation of high-quality missing annotations. In this study, we first point out that it is not enough to generate high-quality annotations using a single model, which only looks once for annotations. Through detailed experimental analyses, we further conclude that hard-label training is conducive to generating high-recall annotations, while soft-label training tends to obtain high-precision annotations. Inspired by these observations, we propose a dynamic supervisor framework that updates the annotations multiple times through repeatedly updated submodels trained using hard and soft labels. In the final generated annotations, both recall and precision improve significantly through the integration of hard-label training with soft-label training. Extensive experiments conducted on various dataset combination settings support our analyses and demonstrate the superior performance of the proposed dynamic supervisor.

preprint, 2022, arXiv

Spatial Likelihood Voting with Self-Knowledge Distillation for Weakly Supervised Object Detection

Weakly supervised object detection (WSOD), which is an effective way to train an object detection model using only image-level annotations, has attracted considerable attention from researchers. However, most of the existing methods, which are based on multiple instance learning (MIL), tend to localize instances to the discriminative parts of salient objects instead of the entire content of all objects. In this paper, we propose a WSOD framework called the Spatial Likelihood Voting with Self-knowledge Distillation Network (SLV-SD Net). In this framework, we introduce a spatial likelihood voting (SLV) module to converge region proposal localization without bounding box annotations. Specifically, in every iteration during training, all the region proposals in a given image act as voters voting for the likelihood of each category in the spatial dimensions. After dilating the alignment on the area with large likelihood values, the voting results are regularized as bounding boxes, which are then used for the final classification and localization. Based on SLV, we further propose a self-knowledge distillation (SD) module to refine the feature representations of the given image. The likelihood maps generated by the SLV module are used to supervise the feature learning of the backbone network, encouraging the network to attend to wider and more diverse areas of the image. Extensive experiments on the PASCAL VOC 2007/2012 and MS-COCO datasets demonstrate the excellent performance of SLV-SD Net. In addition, SLV-SD Net produces new state-of-the-art results on these benchmarks.

preprint, 2020, arXiv

Multidimensional Phase Recovery and Interpolative Decomposition Butterfly Factorization

This paper focuses on the fast evaluation of the matvec $g=Kf$ for $K\in \mathbb{C}^{N\times N}$, which is the discretization of a multidimensional oscillatory integral transform $g(x) = \int K(x,\xi) f(\xi)\,d\xi$ with a kernel function $K(x,\xi)=e^{2\pi i\Phi(x,\xi)}$, where $\Phi(x,\xi)$ is a piecewise smooth phase function with $x$ and $\xi$ in $\mathbb{R}^d$ for $d=2$ or $3$. A new framework is introduced to compute $Kf$ with $O(N\log N)$ time and memory complexity in the case that only indirect access to the phase function $\Phi$ is available. This framework consists of two main steps: 1) an $O(N\log N)$ algorithm for recovering the multidimensional phase function $\Phi$ from indirect access is proposed; 2) a multidimensional interpolative decomposition butterfly factorization (MIDBF) is designed to evaluate the matvec $Kf$ with an $O(N\log N)$ complexity once $\Phi$ is available. Numerical results are provided to demonstrate the effectiveness of the proposed framework.
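For orientation, the quantity being accelerated can be written as a direct summation. The sketch below is the naive $O(N^2)$ baseline, not the MIDBF algorithm of the paper. It assumes direct access to a toy phase function $\Phi(x,\xi)=x\cdot\xi$ in $d=2$, and all function names and grids are hypothetical choices for illustration.

```python
import cmath
from itertools import product


def direct_matvec(phase, xs, xis, f):
    """Evaluate g[i] = sum_j exp(2*pi*i*phase(xs[i], xis[j])) * f[j].

    Costs O(N^2) kernel evaluations; this is the cost a butterfly
    factorization is meant to avoid.
    """
    return [
        sum(cmath.exp(2j * cmath.pi * phase(x, xi)) * fj
            for xi, fj in zip(xis, f))
        for x in xs
    ]


def dot_phase(x, xi):
    """Toy phase Phi(x, xi) = x . xi (an assumption for this sketch)."""
    return sum(a * b for a, b in zip(x, xi))


n = 4
# N = n^2 = 16 sample points in [0, 1)^2 and 16 integer frequencies.
xs = [(i / n, j / n) for i, j in product(range(n), repeat=2)]
xis = [(float(i), float(j)) for i, j in product(range(n), repeat=2)]

# A unit impulse at frequency (0, 0): the phase vanishes, so g is all ones.
f = [1.0] + [0.0] * (len(xis) - 1)
g = direct_matvec(dot_phase, xs, xis, f)
```

Each entry of `g` touches every entry of `f`, which is where the quadratic cost comes from. A fast method has to exploit structure in the kernel to reduce this to $O(N\log N)$.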

preprint, 2020, arXiv

SLV: Spatial Likelihood Voting for Weakly Supervised Object Detection

Building on the multiple instance learning (MIL) framework, many works have advanced weakly supervised object detection (WSOD). However, most MIL-based methods tend to localize instances to their discriminative parts instead of the whole content. In this paper, we propose a spatial likelihood voting (SLV) module to converge the proposal localizing process without any bounding box annotations. Specifically, in every iteration during training, all region proposals in a given image act as voters, voting for the likelihood of each category in the spatial dimensions. After dilating alignment on the area with large likelihood values, the voting results are regularized as bounding boxes, which are then used for the final classification and localization. Based on SLV, we further propose an end-to-end training framework for multi-task learning. The classification and localization tasks promote each other, which further improves the detection performance. Extensive experiments on the PASCAL VOC 2007 and 2012 datasets demonstrate the superior performance of SLV.
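The voting idea can be illustrated with a toy sketch for a single category. It is a simplified reading of the abstract and not the authors' implementation: each proposal adds its class score to every pixel it covers, the map is normalized by its peak, and the high-likelihood region is wrapped in one bounding box. The normalization, the threshold value and the function names are assumptions, and the dilation step mentioned in the abstract is omitted.

```python
def vote_map(proposals, height, width):
    """Accumulate proposal scores into a per-pixel likelihood map.

    proposals: list of (x0, y0, x1, y1, score) with exclusive x1, y1.
    """
    grid = [[0.0] * width for _ in range(height)]
    for x0, y0, x1, y1, score in proposals:
        for y in range(y0, y1):
            for x in range(x0, x1):
                grid[y][x] += score  # this proposal votes for each covered pixel
    peak = max(max(row) for row in grid)
    if peak > 0:
        grid = [[v / peak for v in row] for row in grid]  # scale to [0, 1]
    return grid


def box_from_map(grid, threshold=0.6):
    """Tightest box around pixels whose likelihood reaches the threshold."""
    cells = [(x, y) for y, row in enumerate(grid)
             for x, v in enumerate(row) if v >= threshold]
    if not cells:
        return None
    xs = [c[0] for c in cells]
    ys = [c[1] for c in cells]
    return (min(xs), min(ys), max(xs) + 1, max(ys) + 1)


# Made-up proposals on an 8x8 image: two confident, overlapping boxes
# and one low-scoring box.
proposals = [(1, 1, 5, 5, 0.9), (2, 2, 6, 6, 0.8), (0, 0, 3, 3, 0.1)]
likelihood = vote_map(proposals, height=8, width=8)
box = box_from_map(likelihood)
```

In this toy case the resulting box covers the region where the two confident proposals agree, which shows how aggregated votes can produce a localization target without any box annotation.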

preprint, 2013, arXiv

A simulation study of a dual-plate in-room PET system for dose verification in carbon ion therapy

Carbon ion therapy can overcome the limitations of conventional radiotherapy because it deposits most of its energy at a selected depth, known as the Bragg peak, which results in increased biological effectiveness. During carbon ion therapy, many positron emitters such as $^{11}$C, $^{15}$O and $^{10}$C are generated in irradiated tissues by nuclear reactions. Immediately after patient irradiation, PET scanners can be used to measure the spatial distribution of positron emitters, which can be used to track the carbon beam in the tissue. In this study, we designed and evaluated a dual-plate in-room PET scanner to monitor patient dose in carbon ion therapy, based on the GATE simulation platform. The dual-plate PET is designed to avoid interference with the carbon beam line and with patient positioning. Its performance was compared with that of four-head and full-ring PET scanners. The dual-plate, four-head and full-ring PET scanners consisted of 30, 60 and 60 detector modules, respectively, with a 36 cm distance between directly opposite detector modules for dose deposition measurements. Each detector module consisted of a 24$\times$24 array of 2$\times$2$\times$18 mm$^{3}$ LYSO pixels coupled to a Hamamatsu H8500 PMT. To estimate the production yield of positron emitters, a 10$\times$15$\times$15 cm$^{3}$ cuboid PMMA phantom was irradiated with 172, 200 and 250 AMeV $^{12}$C beams. 3D images of the activity distribution for the three types of scanner were produced by an iterative reconstruction algorithm. By comparing the longitudinal profiles of positron emitters, measured along the carbon beam path, we concluded that a dual-plate PET scanner is feasible for monitoring the dose distribution in carbon ion therapy.

preprint, 2013, arXiv

Optimum performance investigation of LYSO crystal pixels: A comparison between GATE simulation and experimental data

Monte Carlo simulation plays an important role in the study of time-of-flight (TOF) positron emission tomography (PET) prototypes, as it can incorporate accurate physical modeling of the scintillation detection process, from scintillation light generation, through the transport of scintillation photons in the crystal(s), to the conversion of these photons into electronic signals. The Geant4-based simulation software GATE provides a user-friendly simulation platform containing the properties needed. In this work, we developed a dedicated module in the GATE simulation tool. Using this module, we simulated the light yield, energy resolution and time resolution of LYSO pixels with the same cross-section ($4\times4$ mm$^{2}$) and different lengths (5 mm, 10 mm, 15 mm, 20 mm and 25 mm), coupled to a PMT. Experiments were performed to validate the GATE simulation results. The results indicate that the best time resolution (484.0$\pm$67.5 ps) and energy resolution (13.3$\pm$0.4%) are obtained with a pixel length of 5 mm. The module can also be applied to other cases that require precise simulation of optical photon propagation in scintillators.