Researcher profile

Jiahui Huang

Jiahui Huang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

TimeGMM: Single-Pass Probabilistic Forecasting via Adaptive Gaussian Mixture Models with Reversible Normalization

Probabilistic time series forecasting is crucial for quantifying future uncertainty, with significant applications in fields such as energy and finance. However, existing methods often rely on computationally expensive sampling or restrictive parametric assumptions to characterize future distributions, which limits predictive performance and introduces distributional mismatch. To address these challenges, this paper presents TimeGMM, a novel probabilistic forecasting framework based on Gaussian Mixture Models (GMM) that captures complex future distributions in a single forward pass. A key component is GMM-adapted Reversible Instance Normalization (GRIN), a novel module designed to dynamically adapt to temporal-probabilistic distribution shifts. The framework integrates a dedicated Temporal Encoder (TE-Module) with a Conditional Temporal-Probabilistic Decoder (CTPD-Module) to jointly capture temporal dependencies and mixture distribution parameters. Extensive experiments demonstrate that TimeGMM consistently outperforms state-of-the-art methods, achieving maximum improvements of 22.48\% in CRPS and 21.23\% in NMAE.

preprint2022arXiv

Dynamic 3D Scene Analysis by Point Cloud Accumulation

Multi-beam LiDAR sensors, as used on autonomous vehicles and mobile robots, acquire sequences of 3D range scans ("frames"). Each frame covers the scene sparsely, due to limited angular scanning resolution and occlusion. The sparsity restricts the performance of downstream processes like semantic segmentation or surface reconstruction. Luckily, when the sensor moves, frames are captured from a sequence of different viewpoints. This provides complementary information and, when accumulated in a common scene coordinate frame, yields a denser sampling and a more complete coverage of the underlying 3D scene. However, often the scanned scenes contain moving objects. Points on those objects are not correctly aligned by just undoing the scanner's ego-motion. In the present paper, we explore multi-frame point cloud accumulation as a mid-level representation of 3D scan sequences, and develop a method that exploits inductive biases of outdoor street scenes, including their geometric layout and object-level rigidity. Compared to state-of-the-art scene flow estimators, our proposed approach aims to align all 3D points in a common reference frame correctly accumulating the points on the individual objects. Our approach greatly reduces the alignment errors on several benchmark datasets. Moreover, the accumulated point clouds benefit high-level tasks like surface reconstruction.

preprint2022arXiv

Multiway Non-rigid Point Cloud Registration via Learned Functional Map Synchronization

We present SyNoRiM, a novel way to jointly register multiple non-rigid shapes by synchronizing the maps relating learned functions defined on the point clouds. Even though the ability to process non-rigid shapes is critical in various applications ranging from computer animation to 3D digitization, the literature still lacks a robust and flexible framework to match and align a collection of real, noisy scans observed under occlusions. Given a set of such point clouds, our method first computes the pairwise correspondences parameterized via functional maps. We simultaneously learn potentially non-orthogonal basis functions to effectively regularize the deformations, while handling the occlusions in an elegant way. To maximally benefit from the multi-way information provided by the inferred pairwise deformation fields, we synchronize the pairwise functional maps into a cycle-consistent whole thanks to our novel and principled optimization formulation. We demonstrate via extensive experiments that our method achieves a state-of-the-art performance in registration accuracy, while being flexible and efficient as we handle both non-rigid and multi-body cases in a unified framework and avoid the costly optimization over point-wise permutations by the use of basis function maps.

preprint2022arXiv

VLT MUSE observations of the bubble nebula around NGC 1313 X-2 and evidence for additional photoionization

The bubble nebula surrounding NGC 1313 X-2 is believed to be powered by high velocity winds from the central ultraluminous X-ray source (ULX) as a result of supercritical accretion. With the Multi-Unit Spectroscopic Explorer (MUSE) observation of the nebula, we find enhanced OIII emission at locations spatially coincident with clusters of stars and the central X-ray source, suggesting that photoionization in addition to shock-ionization plays an important role in powering the nebula. The X-ray luminosity of the ULX and the number of massive stars in the nebula region can account for the required ionizing luminosity derived with MAPPINGS V, which also confirms that pure shocks cannot explain the observed emission line ratios.

preprint2021arXiv

A significant detection of X-ray Polarization in Sco X-1 with PolarLight and constraints on the corona geometry

We report the detection of X-ray polarization in the neutron star low mass X-ray binary Scorpius (Sco) X-1 with PolarLight. The result is energy dependent, with a non-detection in 3-4 keV but a 4$σ$ detection in 4-8 keV; it is also flux dependent in the 4-8 keV band, with a non-detection when the source displays low fluxes but a 5$σ$ detection during high fluxes, in which case we obtain a polarization fraction of $0.043 \pm 0.008$ and a polarization angle of $52.6^\circ \pm 5.4^\circ$. This confirms a previous marginal detection with OSO-8 in the 1970s, and marks Sco X-1 the second astrophysical source with a significant polarization measurement in the keV band. The measured polarization angle is in line with the jet orientation of the source on the sky plane ($54^\circ$), which is supposedly the symmetric axis of the system. Combining previous spectral analysis, our measurements suggest that an optically thin corona is located in the transition layer under the highest accretion rates, and disfavor the extended accretion disk corona model.

preprint2021arXiv

Subdivision-Based Mesh Convolution Networks

Convolutional neural networks (CNNs) have made great breakthroughs in 2D computer vision. However, their irregular structure makes it hard to harness the potential of CNNs directly on meshes. A subdivision surface provides a hierarchical multi-resolution structure, in which each face in a closed 2-manifold triangle mesh is exactly adjacent to three faces. Motivated by these two observations, this paper presents SubdivNet, an innovative and versatile CNN framework for 3D triangle meshes with Loop subdivision sequence connectivity. Making an analogy between mesh faces and pixels in a 2D image allows us to present a mesh convolution operator to aggregate local features from nearby faces. By exploiting face neighborhoods, this convolution can support standard 2D convolutional network concepts, e.g. variable kernel size, stride, and dilation. Based on the multi-resolution hierarchy, we make use of pooling layers which uniformly merge four faces into one and an upsampling method which splits one face into four. Thereby, many popular 2D CNN architectures can be easily adapted to process 3D meshes. Meshes with arbitrary connectivity can be remeshed to have Loop subdivision sequence connectivity via self-parameterization, making SubdivNet a general approach. Extensive evaluation and various applications demonstrate SubdivNet's effectiveness and efficiency.

preprint2020arXiv

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings

We present ClusterVO, a stereo Visual Odometry which simultaneously clusters and estimates the motion of both ego and surrounding rigid clusters/objects. Unlike previous solutions relying on batch input or imposing priors on scene structure or dynamic object models, ClusterVO is online, general and thus can be used in various scenarios including indoor scene understanding and autonomous driving. At the core of our system lies a multi-level probabilistic association mechanism and a heterogeneous Conditional Random Field (CRF) clustering approach combining semantic, spatial and motion information to jointly infer cluster segmentations online for every frame. The poses of camera and dynamic objects are instantly solved through a sliding-window optimization. Our system is evaluated on Oxford Multimotion and KITTI dataset both quantitatively and qualitatively, reaching comparable results to state-of-the-art solutions on both odometry and dynamic trajectory recovery.

preprint2020arXiv

Duality Diagram Similarity: a generic framework for initialization selection in task transfer learning

In this paper, we tackle an open research question in transfer learning, which is selecting a model initialization to achieve high performance on a new task, given several pre-trained models. We propose a new highly efficient and accurate approach based on duality diagram similarity (DDS) between deep neural networks (DNNs). DDS is a generic framework to represent and compare data of different feature dimensions. We validate our approach on the Taskonomy dataset by measuring the correspondence between actual transfer learning performance rankings on 17 taskonomy tasks and predicted rankings. Computing DDS based ranking for $17\times17$ transfers requires less than 2 minutes and shows a high correlation ($0.86$) with actual transfer learning rankings, outperforming state-of-the-art methods by a large margin ($10\%$) on the Taskonomy benchmark. We also demonstrate the robustness of our model selection approach to a new task, namely Pascal VOC semantic segmentation. Additionally, we show that our method can be applied to select the best layer locations within a DNN for transfer learning on 2D, 3D and semantic tasks on NYUv2 and Pascal VOC datasets.

preprint2020arXiv

In-orbit Operation and Performance of the CubeSat Soft X-ray Polarimeter PolarLight

PolarLight is a compact soft X-ray polarimeter onboard a CubeSat, which was launched into a low-Earth orbit on October 29, 2018. In March 2019, PolarLight started full operation, and since then, regular observations with the Crab nebula, Sco X-1, and background regions have been conducted. Here we report the operation, calibration, and performance of PolarLight in the orbit. Based on these, we discuss how one can run a low-cost, shared CubeSat for space astronomy, and how CubeSats can play a role in modern space astronomy for technical demonstration, science observations, and student training.

preprint2020arXiv

Shallow2Deep: Indoor Scene Modeling by Single Image Understanding

Dense indoor scene modeling from 2D images has been bottlenecked due to the absence of depth information and cluttered occlusions. We present an automatic indoor scene modeling approach using deep features from neural networks. Given a single RGB image, our method simultaneously recovers semantic contents, 3D geometry and object relationship by reasoning indoor environment context. Particularly, we design a shallow-to-deep architecture on the basis of convolutional networks for semantic scene understanding and modeling. It involves multi-level convolutional networks to parse indoor semantics/geometry into non-relational and relational knowledge. Non-relational knowledge extracted from shallow-end networks (e.g. room layout, object geometry) is fed forward into deeper levels to parse relational semantics (e.g. support relationship). A Relation Network is proposed to infer the support relationship between objects. All the structured semantics and geometry above are assembled to guide a global optimization for 3D scene modeling. Qualitative and quantitative analysis demonstrates the feasibility of our method in understanding and modeling semantics-enriched indoor scenes by evaluating the performance of reconstruction accuracy, computation performance and scene complexity.

preprint2020arXiv

What fraction of an $S_n$-orbit can lie on a hyperplane?

Consider the $S_n$-action on $\mathbb{R}^n$ given by permuting coordinates. This paper addresses the following problem: compute $\max_{v,H} |H\cap S_nv|$ as $H\subset\mathbb{R}^n$ ranges over all hyperplanes through the origin and $v\in\mathbb{R}^n$ ranges over all vectors with distinct coordinates that are not contained in the hyperplane $\sum x_i=0$. We conjecture that for $n\geq3$, the answer is $(n-1)!$ for odd $n$, and $n(n-2)!$ for even $n$. We prove that if $p$ is the largest prime with $p\leq n$, then $\max_{v,H} |H\cap S_nv|\leq \frac{n!}{p}$. In particular, this proves the conjecture when $n$ or $n-1$ is prime.