Source author record

Hongkai Zhang

Hongkai Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision physics.chem-ph physics.ins-det

Catalog footprint

What is connected

5works

3topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

1st Place Solutions of Waymo Open Dataset Challenge 2020 -- 2D Object Detection Track

In this technical report, we present our solutions of Waymo Open Dataset (WOD) Challenge 2020 - 2D Object Track. We adopt FPN as our basic framework. Cascade RCNN, stacked PAFPN Neck and Double-Head are used for performance improvements. In order to handle the small object detection problem in WOD, we use very large image scales for both training and testing. Using our methods, our team RW-TSDet achieved the 1st place in the 2D Object Detection Track.

preprint2020arXiv

Appearance-Preserving 3D Convolution for Video-based Person Re-identification

Due to the imperfect person detection results and posture changes, temporal appearance misalignment is unavoidable in video-based person re-identification (ReID). In this case, 3D convolution may destroy the appearance representation of person video clips, thus it is harmful to ReID. To address this problem, we propose AppearancePreserving 3D Convolution (AP3D), which is composed of two components: an Appearance-Preserving Module (APM) and a 3D convolution kernel. With APM aligning the adjacent feature maps in pixel level, the following 3D convolution can model temporal information on the premise of maintaining the appearance representation quality. It is easy to combine AP3D with existing 3D ConvNets by simply replacing the original 3D convolution kernels with AP3Ds. Extensive experiments demonstrate the effectiveness of AP3D for video-based ReID and the results on three widely used datasets surpass the state-of-the-arts. Code is available at: https://github.com/guxinqian/AP3D.

preprint2020arXiv

Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training

Although two-stage object detectors have continuously advanced the state-of-the-art performance in recent years, the training process itself is far from crystal. In this work, we first point out the inconsistency problem between the fixed network settings and the dynamic training procedure, which greatly affects the performance. For example, the fixed label assignment strategy and regression loss function cannot fit the distribution change of proposals and thus are harmful to training high quality detectors. Consequently, we propose Dynamic R-CNN to adjust the label assignment criteria (IoU threshold) and the shape of regression loss function (parameters of SmoothL1 Loss) automatically based on the statistics of proposals during training. This dynamic design makes better use of the training samples and pushes the detector to fit more high quality samples. Specifically, our method improves upon ResNet-50-FPN baseline with 1.9% AP and 5.5% AP$_{90}$ on the MS COCO dataset with no extra overhead. Codes and models are available at https://github.com/hkzhang95/DynamicRCNN.

preprint2015arXiv

Performance assessment of CsI(Tl) screens on various substrates for X-ray imaging

Thallium-doped cesium iodide (CsI(Tl)) screens are widely used in X-ray imaging devices because of the columnar structure of CsI(Tl) layer, but few reports focus on the optical role of the substrate in the screen system. In this paper, four substrates including fused silica (SiO2), silver-film coated SiO2, graphite (C) and fiber optic plate (FOP) are used to fabricate CsI(Tl) screens by thermal evaporation. Their imaging performance is evaluated by relative light output (RLO), modulation transfer function (MTF), normalized noise power spectrum (NNPS) and noise equivalent quanta (NEQ). The results reveal that although CsI(Tl) film on graphite plate yields images with the lowest light output, it presents relatively higher spatial resolution and better signal-to-noise characteristics. However, films on SiO2 plate obtain low MTF but high NNPS curves, whether or not coated with silver film. Furthermore, scintillation screens on FOP have bright images with low NNPS and high NEQ, but have the lowest MTF. By controlling the substrate optical features, CsI(Tl) films can be tailed to suit a given application.

preprint1999arXiv

Optimization of Quantum Monte Carlo Wave Functions Using Analytical Energy Derivatives

An algorithm is proposed to optimize quantum Monte Carlo (QMC) wave functions based on New ton's method and analytical computation of the first and second derivatives of the variati onal energy. This direct application of the variational principle yields significantly low er energy than variance minimization methods when applied to the same trial wave function. Quadratic convergence to the local minimum of the variational parameters is achieved. A g eneral theorem is presented, which substantially simplifies the analytic expressions of de rivatives in the case of wave function optimization. To demonstrate the method, the ground state energies of the first-row elements are calculated.

Hongkai Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

1st Place Solutions of Waymo Open Dataset Challenge 2020 -- 2D Object Detection Track

Appearance-Preserving 3D Convolution for Video-based Person Re-identification

Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training

Performance assessment of CsI(Tl) screens on various substrates for X-ray imaging

Optimization of Quantum Monte Carlo Wave Functions Using Analytical Energy Derivatives