Source author record

Haochen Wang

Haochen Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.CO astro-ph.IM Computer Vision Artificial Intelligence astro-ph.HE Computation and Language gr-qc

Catalog footprint

What is connected

10works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MMFormalizer: Multimodal Autoformalization in the Wild

Autoformalization, which translates natural language mathematics into formal statements to enable machine reasoning, faces fundamental challenges in the wild due to the multimodal nature of the physical world, where physics requires inferring hidden constraints (e.g., mass or energy) from visual elements. To address this, we propose MMFormalizer, which extends autoformalization beyond text by integrating adaptive grounding with entities from real-world mathematical and physical domains. MMFormalizer recursively constructs formal propositions from perceptually grounded primitives through recursive grounding and axiom composition, with adaptive recursive termination ensuring that every abstraction is supported by visual evidence and anchored in dimensional or axiomatic grounding. We evaluate MMFormalizer on a new benchmark, PhyX-AF, comprising 115 curated samples from MathVerse, PhyX, Synthetic Geometry, and Analytic Geometry, covering diverse multimodal autoformalization tasks. Results show that frontier models such as GPT-5 and Gemini-3-Pro achieve the highest compile and semantic accuracy, with GPT-5 excelling in physical reasoning, while geometry remains the most challenging domain. Overall, MMFormalizer provides a scalable framework for unified multimodal autoformalization, bridging perception and formal reasoning. To the best of our knowledge, this is the first multimodal autoformalization method capable of handling classical mechanics (derived from the Hamiltonian), as well as relativity, quantum mechanics, and thermodynamics. More details are available on our project page: MMFormalizer.github.io

preprint2026arXiv

The Second CHIME/FRB Catalog of Fast Radio Bursts

We present a catalog of 4539 fast radio bursts (FRBs) observed with the Canadian Hydrogen Intensity Mapping Experiment (CHIME) telescope between 25 July 2018 and 15 September 2023. These bursts originate from 3641 unique sources, including 981 bursts from 83 known repeating sources. For each FRB, the catalog provides a $O(10')$ estimate of sky location along with corresponding measurements of cumulative exposure time and survey sensitivity over the observing period. It includes a total-intensity dynamic spectrum between 400 and 800 MHz at 0.983 ms resolution. From this spectrum, we constrain a model of the burst morphology and measure key parameters such as arrival time, intrinsic temporal width, dispersion measure, scattering time, and flux density. This second catalog includes all FRBs from the first catalog, with every event reprocessed using a uniform and improved analysis framework. We show that previously published inferences remain valid under the updated measurements. We assess consistency of the detection rate across observational parameters, present initial distributions of burst properties, and outline ongoing and future studies that will use this catalog to investigate the nature of FRBs and their utility as astrophysical and cosmological probes.

preprint2026arXiv

The Squeezed Bispectrum from CHIME HI Emission and Planck CMB Lensing: Current Sensitivity and Forecasts

Line intensity mapping using atomic hydrogen (HI) has the potential to efficiently map large volumes of the universe if the signal can be successfully separated from overwhelmingly bright radio foreground emission. This motivates cross-correlations, to ascertain the cosmological nature of measured HI fluctuations, and to study their connections with galaxies and the underlying matter density field. However, these same foregrounds render the cross-correlation with projected fields such as the lensing of the cosmic microwave background (CMB) difficult. Indeed, the correlated Fourier modes vary slowly along the line of sight, and are thus most contaminated by the smooth-spectrum radio continuum foregrounds. In this paper, we implement a method that avoids this issue by attempting to measure the non-linear gravitational coupling of the small-scale 21cm power from the Canadian Hydrogen Intensity Mapping Experiment (CHIME) with large-scale Planck CMB lensing. This measurement is a position-dependent power spectrum, i.e. a squeezed integrated bispectrum. Using 94 nights of CHIME data between $1.0 < z < 1.3$ and aggressive foreground filtering, we find that the expected signal is five times smaller than the current noise. We forecast that incorporating the additional nights of CHIME data already collected would enable a signal-to-noise ratio of 3, without any further improvements in filtering for foreground cleaning.

preprint2022arXiv

An Overview of CHIME, the Canadian Hydrogen Intensity Mapping Experiment

The Canadian Hydrogen Intensity Mapping Experiment (CHIME) is a drift scan radio telescope operating across the 400-800 MHz band. CHIME is located at the Dominion Radio Astrophysical Observatory near Penticton, BC Canada. The instrument is designed to map neutral hydrogen over the redshift range 0.8 to 2.5 to constrain the expansion history of the Universe. This goal drives the design features of the instrument. CHIME consists of four parallel cylindrical reflectors, oriented north-south, each 100 m $\times$ 20 m and outfitted with a 256 element dual-polarization linear feed array. CHIME observes a two degree wide stripe covering the entire meridian at any given moment, observing 3/4 of the sky every day due to Earth rotation. An FX correlator utilizes FPGAs and GPUs to digitize and correlate the signals, with different correlation products generated for cosmological, fast radio burst, pulsar, VLBI, and 21 cm absorber backends. For the cosmology backend, the $N_\mathrm{feed}^2$ correlation matrix is formed for 1024 frequency channels across the band every 31 ms. A data receiver system applies calibration and flagging and, for our primary cosmological data product, stacks redundant baselines and integrates for 10 s. We present an overview of the instrument, its performance metrics based on the first three years of science data, and we describe the current progress in characterizing CHIME's primary beam response. We also present maps of the sky derived from CHIME data; we are using versions of these maps for a cosmological stacking analysis as well as for investigation of Galactic foregrounds.

preprint2022arXiv

Decoupled IoU Regression for Object Detection

Non-maximum suppression (NMS) is widely used in object detection pipelines for removing duplicated bounding boxes. The inconsistency between the confidence for NMS and the real localization confidence seriously affects detection performance. Prior works propose to predict Intersection-over-Union (IoU) between bounding boxes and corresponding ground-truths to improve NMS, while accurately predicting IoU is still a challenging problem. We argue that the complex definition of IoU and feature misalignment make it difficult to predict IoU accurately. In this paper, we propose a novel Decoupled IoU Regression (DIR) model to handle these problems. The proposed DIR decouples the traditional localization confidence metric IoU into two new metrics, Purity and Integrity. Purity reflects the proportion of the object area in the detected bounding box, and Integrity refers to the completeness of the detected object area. Separately predicting Purity and Integrity can divide the complex mapping between the bounding box and its IoU into two clearer mappings and model them independently. In addition, a simple but effective feature realignment approach is also introduced to make the IoU regressor work in a hindsight manner, which can make the target mapping more stable. The proposed DIR can be conveniently integrated with existing two-stage detectors and significantly improve their performance. Through a simple implementation of DIR with HTC, we obtain 51.3% AP on MS COCO benchmark, which outperforms previous methods and achieves state-of-the-art.

preprint2022arXiv

NFormer: Robust Person Re-identification with Neighbor Transformer

Person re-identification aims to retrieve persons in highly varying settings across different cameras and scenarios, in which robust and discriminative representation learning is crucial. Most research considers learning representations from single images, ignoring any potential interactions between them. However, due to the high intra-identity variations, ignoring such interactions typically leads to outlier features. To tackle this issue, we propose a Neighbor Transformer Network, or NFormer, which explicitly models interactions across all input images, thus suppressing outlier features and leading to more robust representations overall. As modelling interactions between enormous amount of images is a massive task with lots of distractors, NFormer introduces two novel modules, the Landmark Agent Attention, and the Reciprocal Neighbor Softmax. Specifically, the Landmark Agent Attention efficiently models the relation map between images by a low-rank factorization with a few landmarks in feature space. Moreover, the Reciprocal Neighbor Softmax achieves sparse attention to relevant -- rather than all -- neighbors only, which alleviates interference of irrelevant representations and further relieves the computational burden. In experiments on four large-scale datasets, NFormer achieves a new state-of-the-art. The code is released at \url{https://github.com/haochenheheda/NFormer}.

preprint2022arXiv

Removing systematics-induced 21-cm foreground residuals by cross-correlating filtered data

Observations of the redshifted 21-cm signal emitted by neutral hydrogen represent a promising probe of large-scale structure in the universe. However, cosmological 21-cm signal is challenging to observe due to astrophysical foregrounds which are several orders of magnitude brighter. Traditional linear foreground removal methods can optimally remove foregrounds for a known telescope response but are sensitive to telescope systematic errors such as antenna gain and delay errors, leaving foreground contamination in the recovered signal. Non-linear methods such as principal component analysis, on the other hand, have been used successfully for foreground removal, but they lead to signal loss that is difficult to characterize and requires careful analysis. In this paper, we present a systematics-robust foreground removal technique which combines both linear and non-linear methods. We first obtain signal and foreground estimates using a linear filter. Under the assumption that the signal estimate is contaminated by foreground residuals induced by parameterizable systematic effects, we infer the systematics-induced contamination by cross-correlating the initial signal and foreground estimates. Correcting for the inferred error, we are able to subtract foreground contamination from the linearly filtered signal up to the first order in the amplitude of the telescope systematics. In simulations of an interferometric 21-cm survey, our algorithm removes foreground leakage induced by complex gain errors by one to two orders of magnitude in the power spectrum. Our technique thus eases the requirements on telescope characterization for modern and next-generation 21-cm cosmology experiments.

preprint2022arXiv

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

The crux of semi-supervised semantic segmentation is to assign adequate pseudo-labels to the pixels of unlabeled images. A common practice is to select the highly confident predictions as the pseudo ground-truth, but it leads to a problem that most pixels may be left unused due to their unreliability. We argue that every pixel matters to the model training, even its prediction is ambiguous. Intuitively, an unreliable prediction may get confused among the top classes (i.e., those with the highest probabilities), however, it should be confident about the pixel not belonging to the remaining classes. Hence, such a pixel can be convincingly treated as a negative sample to those most unlikely categories. Based on this insight, we develop an effective pipeline to make sufficient use of unlabeled data. Concretely, we separate reliable and unreliable pixels via the entropy of predictions, push each unreliable pixel to a category-wise queue that consists of negative samples, and manage to train the model with all candidate pixels. Considering the training evolution, where the prediction becomes more and more accurate, we adaptively adjust the threshold for the reliable-unreliable partition. Experimental results on various benchmarks and training settings demonstrate the superiority of our approach over the state-of-the-art alternatives.

preprint2022arXiv

Using the Sun to Measure the Primary Beam Response of the Canadian Hydrogen Intensity Mapping Experiment

We present a beam pattern measurement of the Canadian Hydrogen Intensity Mapping Experiment (CHIME) made using the Sun as a calibration source. As CHIME is a pure drift scan instrument, we rely on the seasonal North-South motion of the Sun to probe the beam at different elevations. This semiannual range in elevation, combined with the radio brightness of the Sun, enables a beam measurement which spans ~7,200 square degrees on the sky without the need to move the telescope. We take advantage of observations made near solar minimum to minimize the impact of solar variability, which is observed to be <10% in intensity over the observation period. The resulting data set is highly complementary to other CHIME beam measurements -- both in terms of angular coverage and systematics -- and plays an important role in the ongoing program to characterize the CHIME primary beam.

preprint2020arXiv

Pixel Consensus Voting for Panoptic Segmentation

The core of our approach, Pixel Consensus Voting, is a framework for instance segmentation based on the Generalized Hough transform. Pixels cast discretized, probabilistic votes for the likely regions that contain instance centroids. At the detected peaks that emerge in the voting heatmap, backprojection is applied to collect pixels and produce instance masks. Unlike a sliding window detector that densely enumerates object proposals, our method detects instances as a result of the consensus among pixel-wise votes. We implement vote aggregation and backprojection using native operators of a convolutional neural network. The discretization of centroid voting reduces the training of instance segmentation to pixel labeling, analogous and complementary to FCN-style semantic segmentation, leading to an efficient and unified architecture that jointly models things and stuff. We demonstrate the effectiveness of our pipeline on COCO and Cityscapes Panoptic Segmentation and obtain competitive results. Code will be open-sourced.

Haochen Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

10 published item(s)

MMFormalizer: Multimodal Autoformalization in the Wild

The Second CHIME/FRB Catalog of Fast Radio Bursts

The Squeezed Bispectrum from CHIME HI Emission and Planck CMB Lensing: Current Sensitivity and Forecasts

An Overview of CHIME, the Canadian Hydrogen Intensity Mapping Experiment

Decoupled IoU Regression for Object Detection

NFormer: Robust Person Re-identification with Neighbor Transformer

Removing systematics-induced 21-cm foreground residuals by cross-correlating filtered data

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

Using the Sun to Measure the Primary Beam Response of the Canadian Hydrogen Intensity Mapping Experiment

Pixel Consensus Voting for Panoptic Segmentation