Source author record

Shu Liu

Shu Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision astro-ph.GA math.OC Information Theory Machine Learning math.IT math.NA Numerical Analysis astro-ph.CO astro-ph.SR Computation and Language math.DS math.PR Artificial Intelligence eess.AS Sound

Catalog footprint

What is connected

34works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Natural Primal-Dual Hybrid Gradient Method for Adversarial Neural Network Training on Solving Partial Differential Equations

We propose a scalable preconditioned primal-dual hybrid gradient algorithm for solving partial differential equations (PDEs). We multiply the PDE with a dual test function to obtain an inf-sup problem whose loss functional involves lower-order differential operators. The Primal-Dual Hybrid Gradient (PDHG) algorithm is then leveraged for this saddle point problem. By introducing suitable precondition operators to the proximal steps in the PDHG algorithm, we obtain an alternative natural gradient ascent-descent optimization scheme for updating the neural network parameters. We apply the Krylov subspace method (MINRES) to evaluate the natural gradients efficiently. Such treatment readily handles the inversion of precondition matrices via matrix-vector multiplication. An \textit{a posteriori} convergence analysis is established for the time-continuous version of the proposed algorithm for general linear PDEs. By incorporating appropriate boundary loss terms, we further obtain a refined \textit{a priori} convergence result for elliptic equations in divergence form. The algorithm is tested on various types of PDEs with dimensions ranging from $1$ to $50$, including linear and nonlinear elliptic equations, reaction-diffusion equations, and Monge-Ampère equations stemming from the $L^2$ optimal transport problems. We compare the performance of the proposed method with several commonly used deep learning algorithms such as physics-informed neural networks (PINNs), the DeepRitz method and weak adversarial networks (WANs) using either the Adam or the L-BFGS optimizer. The numerical results suggest that the proposed method performs efficiently and robustly and converges more stably with higher accuracy.

preprint2026arXiv

Judge, Then Drive: A Critic-Centric Vision Language Action Framework for Autonomous Driving

Recent advances in vision language action (VLA) models have shown remarkable potential for autonomous driving by directly mapping multimodal inputs to control signals. However, previous VLA-based methods have not explicitly exploited the critic capability of VLAs to refine driving decisions, even though such capability has been well demonstrated in other LLM-based domains, thereby limiting their performance in complex closed-loop scenarios. In this work, we present a theoretically inspired two-stage framework, CriticVLA, which extends the role of VLAs from acting to judging. CriticVLA first generates a rough trajectory and then refines it through multimodal evaluation and single-step optimization guided by a VLA-based critic, yielding higher-quality driving behaviors. To support this process, we construct a large-scale synthetic dataset of 12.9 million annotated trajectories covering diverse driving scenarios, which enhances the critic's reasoning and refinement abilities. Extensive closed-loop experiments on the Bench2Drive benchmark show that CriticVLA significantly surpasses state-of-the-art baselines, achieving a 73.33% total success rate and delivering about 30% improvement in challenging scenarios.

preprint2024arXiv

MOODv2: Masked Image Modeling for Out-of-Distribution Detection

The crux of effective out-of-distribution (OOD) detection lies in acquiring a robust in-distribution (ID) representation, distinct from OOD samples. While previous methods predominantly leaned on recognition-based techniques for this purpose, they often resulted in shortcut learning, lacking comprehensive representations. In our study, we conducted a comprehensive analysis, exploring distinct pretraining tasks and employing various OOD score functions. The results highlight that the feature representations pre-trained through reconstruction yield a notable enhancement and narrow the performance gap among various score functions. This suggests that even simple score functions can rival complex ones when leveraging reconstruction-based pretext tasks. Reconstruction-based pretext tasks adapt well to various score functions. As such, it holds promising potential for further expansion. Our OOD detection framework, MOODv2, employs the masked image modeling pretext task. Without bells and whistles, MOODv2 impressively enhances 14.30% AUROC to 95.68% on ImageNet and achieves 99.98% on CIFAR-10.

preprint2022arXiv

A new construction of nonlinear codes via algebraic function fields

In coding theory, constructing codes with good parameters is one of the most important and fundamental problems. Though a great many of good codes have been produced, most of them are defined over alphabets of sizes equal to prime powers. In this paper, we provide a new explicit construction of $(q+1)$-ary nonlinear codes via algebraic function fields, where $q$ is a prime power. Our codes are constructed by evaluations of rational functions at all rational places of the algebraic function field. Compared with algebraic geometry codes, the main difference is that we allow rational functions to be evaluated at pole places. After evaluating rational functions from a union of Riemann-Roch spaces, we obtain a family of nonlinear codes over the alphabet $\mathbb{F}_{q}\cup \{\infty\}$. It turns out that our codes have better parameters than those obtained from MDS codes or good algebraic geometry codes via code alphabet extension and restriction.

preprint2022arXiv

DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation

Unsupervised domain adaptation in semantic segmentation has been raised to alleviate the reliance on expensive pixel-wise annotations. It leverages a labeled source domain dataset as well as unlabeled target domain images to learn a segmentation network. In this paper, we observe two main issues of the existing domain-invariant learning framework. (1) Being distracted by the feature distribution alignment, the network cannot focus on the segmentation task. (2) Fitting source domain data well would compromise the target domain performance. To address these issues, we propose DecoupleNet that alleviates source domain overfitting and enables the final model to focus more on the segmentation task. Furthermore, we put forward Self-Discrimination (SD) and introduce an auxiliary classifier to learn more discriminative target domain features with pseudo labels. Finally, we propose Online Enhanced Self-Training (OEST) to contextually enhance the quality of pseudo labels in an online manner. Experiments show our method outperforms existing state-of-the-art methods, and extensive ablation studies verify the effectiveness of each component. Code is available at https://github.com/dvlab-research/DecoupleNet.

preprint2022arXiv

Deuterated ammonia in Galactic massive star-forming regions

We present sensitive observations of NH2D at 110.153599 GHz toward 50 Galactic massive star-forming regions with IRAM 30-m telescope. The NH2D transition is detected toward 36 objects, yielding a detection rate of 72%. Column densities of NH2D, HC3N and C18O for each source are derived by assuming local thermal equilibrium conditions with a fixed excitation temperature. The deuterium ratio of NH$_3$, defined as the abundance ratio of NH2D to NH3, for 19 sources is also obtained with the information of NH3 from the literature. The range of deuterium fractionation bends to be large in the late-stage star-forming regions in this work, with the value from 0.043 to 0.0006. The highest deuterium ratio of NH3 is 0.043 in G081.75+00.78 (DR21). We also find that the deuterium ratio of NH3 increases with the Galactocentric distances and decreases with the line width.

preprint2022arXiv

DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors

Camera-based 3D object detectors are welcome due to their wider deployment and lower price than LiDAR sensors. We first revisit the prior stereo detector DSGN for its stereo volume construction ways for representing both 3D geometry and semantics. We polish the stereo modeling and propose the advanced version, DSGN++, aiming to enhance effective information flow throughout the 2D-to-3D pipeline in three main aspects. First, to effectively lift the 2D information to stereo volume, we propose depth-wise plane sweeping (DPS) that allows denser connections and extracts depth-guided features. Second, for grasping differently spaced features, we present a novel stereo volume -- Dual-view Stereo Volume (DSV) that integrates front-view and top-view features and reconstructs sub-voxel depth in the camera frustum. Third, as the foreground region becomes less dominant in 3D space, we propose a multi-modal data editing strategy -- Stereo-LiDAR Copy-Paste, which ensures cross-modal alignment and improves data efficiency. Without bells and whistles, extensive experiments in various modality setups on the popular KITTI benchmark show that our method consistently outperforms other camera-based 3D detectors for all categories. Code is available at https://github.com/chenyilun95/DSGN2.

preprint2022arXiv

EfficientNeRF: Efficient Neural Radiance Fields

Neural Radiance Fields (NeRF) has been wildly applied to various tasks for its high-quality representation of 3D scenes. It takes long per-scene training time and per-image testing time. In this paper, we present EfficientNeRF as an efficient NeRF-based method to represent 3D scene and synthesize novel-view images. Although several ways exist to accelerate the training or testing process, it is still difficult to much reduce time for both phases simultaneously. We analyze the density and weight distribution of the sampled points then propose valid and pivotal sampling at the coarse and fine stage, respectively, to significantly improve sampling efficiency. In addition, we design a novel data structure to cache the whole scene during testing to accelerate the rendering speed. Overall, our method can reduce over 88\% of training time, reach rendering speed of over 200 FPS, while still achieving competitive accuracy. Experiments prove that our method promotes the practicality of NeRF in the real world and enables many applications.

preprint2022arXiv

Generalized Few-shot Semantic Segmentation

Training semantic segmentation models requires a large amount of finely annotated data, making it hard to quickly adapt to novel classes not satisfying this condition. Few-Shot Segmentation (FS-Seg) tackles this problem with many constraints. In this paper, we introduce a new benchmark, called Generalized Few-Shot Semantic Segmentation (GFS-Seg), to analyze the generalization ability of simultaneously segmenting the novel categories with very few examples and the base categories with sufficient examples. It is the first study showing that previous representative state-of-the-art FS-Seg methods fall short in GFS-Seg and the performance discrepancy mainly comes from the constrained setting of FS-Seg. To make GFS-Seg tractable, we set up a GFS-Seg baseline that achieves decent performance without structural change on the original model. Then, since context is essential for semantic segmentation, we propose the Context-Aware Prototype Learning (CAPL) that significantly improves performance by 1) leveraging the co-occurrence prior knowledge from support samples, and 2) dynamically enriching contextual information to the classifier, conditioned on the content of each query image. Both two contributions are experimentally shown to have substantial practical merit. Extensive experiments on Pascal-VOC and COCO manifest the effectiveness of CAPL, and CAPL generalizes well to FS-Seg by achieving competitive performance. Code is available at https://github.com/dvlab-research/GFS-Seg.

preprint2022arXiv

Good locally repairable codes via propagation rules

In classical coding theory, it is common to construct new codes via propagation rules. There are various propagation rules to construct classical block codes. However, propagation rules have not been extensively explored for constructions of locally repairable codes. In this paper, we introduce a few propagation rules to construct good locally repairable codes. To our surprise, these simple propagation rules produce a few interesting results. Firstly, by concatenating a locally repairable code as an inner code with a classical block code as an outer code, we obtain quite a few dimension-optimal binary locally repairable codes. Secondly, from this concatenation, we explicitly build a family of locally repairable codes that exceeds the Zyablov-type bound. Thirdly, by a lengthening propagation rule that adds some rows and columns from a parity-check matrix of a given linear code, we are able to produce a family of dimension-optimal binary locally repairable codes from the extended Hamming codes, and to convert a classical maximum distance separable (MDS) code into a Singleton-optimal locally repairable code. Furthermore, via the lengthening propagation rule, we greatly simplify the construction of a family of locally repairable codes in \cite[Theorem 5]{MX20} that breaks the asymptotic Gilbert-Varshamov bound. In addition, we make use of three other propagation rules to produce more dimension-optimal binary locally repairable codes. Finally, one of phenomena that we observe in this paper is that some trivial propagation rules in classical block codes do not hold anymore for locally repairable codes.

preprint2022arXiv

Improving Data Driven Inverse Text Normalization using Data Augmentation

Inverse text normalization (ITN) is used to convert the spoken form output of an automatic speech recognition (ASR) system to a written form. Traditional handcrafted ITN rules can be complex to transcribe and maintain. Meanwhile neural modeling approaches require quality large-scale spoken-written pair examples in the same or similar domain as the ASR system (in-domain data), to train. Both these approaches require costly and complex annotations. In this paper, we present a data augmentation technique that effectively generates rich spoken-written numeric pairs from out-of-domain textual data with minimal human annotation. We empirically demonstrate that ITN model trained using our data augmentation technique consistently outperform ITN model trained using only in-domain data across all numeric surfaces like cardinal, currency, and fraction, by an overall accuracy of 14.44%.

preprint2022arXiv

Neural Parametric Fokker-Planck Equations

In this paper, we develop and analyze numerical methods for high dimensional Fokker-Planck equations by leveraging generative models from deep learning. Our starting point is a formulation of the Fokker-Planck equation as a system of ordinary differential equations (ODEs) on finite-dimensional parameter space with the parameters inherited from generative models such as normalizing flows. We call such ODEs neural parametric Fokker-Planck equations. The fact that the Fokker-Planck equation can be viewed as the $L^2$-Wasserstein gradient flow of Kullback-Leibler (KL) divergence allows us to derive the ODEs as the constrained $L^2$-Wasserstein gradient flow of KL divergence on the set of probability densities generated by neural networks. For numerical computation, we design a variational semi-implicit scheme for the time discretization of the proposed ODE. Such an algorithm is sampling-based, which can readily handle the Fokker-Planck equations in higher dimensional spaces. Moreover, we also establish bounds for the asymptotic convergence analysis of the neural parametric Fokker-Planck equation as well as the error analysis for both the continuous and discrete versions. Several numerical examples are provided to illustrate the performance of the proposed algorithms and analysis.

preprint2022arXiv

Optimal control for stochastic nonlinear Schrodinger equation on graph

We study the optimal control formulation for stochastic nonlinear Schrodinger equation (SNLSE) on a finite graph. By viewing the SNLSE as a stochastic Wasserstein Hamiltonian flow on density manifold, we show the global existence of a unique strong solution for SNLSE with a linear drift control or a linear diffusion control on graph. Furthermore, we provide the gradient formula, the existence of the optimal control and a description on the optimal condition via the forward and backward stochastic differential equations.

preprint2022arXiv

Properties of dense molecular gas along the major axis of M 82

Dense gas is important for galaxy evolution and star formation. Optically-thin dense-gas tracers, such as isotopologues of HCN, HCO+, etc., are very helpful to diagnose excitation conditions of dense molecular gas. However, previous studies of optically-thin dense-gas tracers were mostly focusing on average properties of galaxies as a whole, due to limited sensitivity and angular resolution. M82, a nearby prototype starburst galaxy, offers a unique case for spatially-resolved studies with single-dish telescopes. With the IRAM 30-m telescope, we observed the J = 1 - 0 transition of H13CN, HC15N, H13CO+, HN13C, H15NC, and SiO J = 2 - 1, HC3N J= 10 - 9, H2CO J = 2 - 1 toward five positions along the major axis of M82. The intensity ratios of I(HCN)/I(H13CN) and I(HCO+)/I(H13CO+) show a significant spatial variation along the major axis, with lower values in the central region than those on the disk, indicating higher optical depths in the central region. The optical depths of HCO+ lines are found to be systematically higher than those of HCN lines at all positions. Futhermore, we find that the 14N/15N ratios have an increasing gradient from the center to the outer disk.

preprint2022arXiv

ResLT: Residual Learning for Long-tailed Recognition

Deep learning algorithms face great challenges with long-tailed data distribution which, however, is quite a common case in real-world scenarios. Previous methods tackle the problem from either the aspect of input space (re-sampling classes with different frequencies) or loss space (re-weighting classes with different weights), suffering from heavy over-fitting to tail classes or hard optimization during training. To alleviate these issues, we propose a more fundamental perspective for long-tailed recognition, i.e., from the aspect of parameter space, and aims to preserve specific capacity for classes with low frequencies. From this perspective, the trivial solution utilizes different branches for the head, medium, and tail classes respectively, and then sums their outputs as the final results is not feasible. Instead, we design the effective residual fusion mechanism -- with one main branch optimized to recognize images from all classes, another two residual branches are gradually fused and optimized to enhance images from medium+tail classes and tail classes respectively. Then the branches are aggregated into final results by additive shortcuts. We test our method on several benchmarks, i.e., long-tailed version of CIFAR-10, CIFAR-100, Places, ImageNet, and iNaturalist 2018. Experimental results manifest the effectiveness of our method. Our code is available at https://github.com/jiequancui/ResLT.

preprint2022arXiv

SEA: Bridging the Gap Between One- and Two-stage Detector Distillation via SEmantic-aware Alignment

We revisit the one- and two-stage detector distillation tasks and present a simple and efficient semantic-aware framework to fill the gap between them. We address the pixel-level imbalance problem by designing the category anchor to produce a representative pattern for each category and regularize the topological distance between pixels and category anchors to further tighten their semantic bonds. We name our method SEA (SEmantic-aware Alignment) distillation given the nature of abstracting dense fine-grained information by semantic reliance to well facilitate distillation efficacy. SEA is well adapted to either detection pipeline and achieves new state-of-the-art results on the challenging COCO object detection task on both one- and two-stage detectors. Its superior performance on instance segmentation further manifests the generalization ability. Both 2x-distilled RetinaNet and FCOS with ResNet50-FPN outperform their corresponding 3x ResNet101-FPN teacher, arriving 40.64 and 43.06 AP, respectively. Code will be made publicly available.

preprint2022arXiv

Spatial distribution of HOCN around Sagittarius B2

HOCN and HNCO abundance ratio in molecular gas can tell us the information of their formation mechanism. We performed high-sensitivity mapping observations of HOCN, HNCO, and HNC$^{18}$O lines around Sagittarius B2 (Sgr B2) with IRAM 30m telescope at 3-mm wavelength. HNCO 4$_{04}$-3$_{03}$ and HOCN 4$_{04}$-3$_{03}$ are used to obtain the abundance ratio of HNCO to HOCN. The ratio of HNCO 4$_{04}$-3$_{03}$ to HNC$^{18}$O 4$_{04}$-3$_{03}$ is used to calculate the optical depth of HNCO 4$_{04}$-3$_{03}$. The abundance ratio of HOCN and HNCO is observed to range from 0.4% to 0.7% toward most positions, which agrees well with the gas-grain model. However, the relative abundance of HOCN is observed to be enhanced toward the direction of Sgr B2 (S), with HOCN to HNCO abundance ratio of $\sim$ 0.9%. The reason for that still needs further investigation.Based on the intensity ratio of HNCO and HNC$^{18}$O lines, we updated the isotopic ratio of $^{16}$O/$^{18}$O to be 296 $\pm$ 54 in Sgr B2.

preprint2022arXiv

Stratified Transformer for 3D Point Cloud Segmentation

3D point cloud segmentation has made tremendous progress in recent years. Most current methods focus on aggregating local features, but fail to directly model long-range dependencies. In this paper, we propose Stratified Transformer that is able to capture long-range contexts and demonstrates strong generalization ability and high performance. Specifically, we first put forward a novel key sampling strategy. For each query point, we sample nearby points densely and distant points sparsely as its keys in a stratified way, which enables the model to enlarge the effective receptive field and enjoy long-range contexts at a low computational cost. Also, to combat the challenges posed by irregular point arrangements, we propose first-layer point embedding to aggregate local information, which facilitates convergence and boosts performance. Besides, we adopt contextual relative position encoding to adaptively capture position information. Finally, a memory-efficient implementation is introduced to overcome the issue of varying point numbers in each window. Extensive experiments demonstrate the effectiveness and superiority of our method on S3DIS, ScanNetv2 and ShapeNetPart datasets. Code is available at https://github.com/dvlab-research/Stratified-Transformer.

preprint2022arXiv

Upper bounds on maximum lengths of Singleton-optimal locally repairable codes

A locally repairable code is called Singleton-optimal if it achieves the Singleton-type bound. Such codes are of great theoretic interest in the study of locally repairable codes. In the recent years there has been a great amount of work on this topic. One of the main problems in this topic is to determine the largest length of a q-ary Singleton-optimal locally repairable code for given locality and minimum distance. Unlike classical MDS codes, the maximum length of Singleton? Optimal locally repairable codes are very sensitive to minimum distance and locality. Thus, it is more challenging and complicated to investigate the maximum length of Singleton-optimal locally repairable codes. In literature, there has been already some research on this problem. However, most of work is concerned with some specific parameter regime such as small minimum distance and locality, and rely on the constraint that (r + 1)|n and recovery sets are disjoint, where r is locality and n is the code length. In this paper we study the problem for large range of parameters including the case where minimum distance is proportional to length. In addition, we also derive some upper bounds on the maximum length of Singleton-optimal locally repairable codes with small minimum distance by removing this constraint. It turns out that even without the constraint we still get better upper bounds for codes with small locality and distance compared with known results. Furthermore, based on our upper bounds for codes with small distance and locality and some propagation rule that we propose in this paper, we are able to derive some upper bounds for codes with relatively large distance and locality assuming that (r + 1)|n and recovery sets are disjoint.

preprint2022arXiv

Wasserstein Hamiltonian flow with common noise on graph

We study the Wasserstein Hamiltonian flow with a common noise on the density manifold of a finite graph. Under the framework of stochastic variational principle, we first develop the formulation of stochastic Wasserstein Hamiltonian flow and show the local existence of a unique solution. We also establish a sufficient condition for the global existence of the solution. Consequently, we obtain the global well-posedness for the nonlinear Schrödinger equations with common noise on graph. In addition, using Wong-Zakai approximation of common noise, we prove the existence of the minimizer for an optimal control problem with common noise. We show that its minimizer satisfies the stochastic Wasserstein Hamiltonian flow on graph as well.

preprint2022arXiv

Widespread subsonic turbulence in Ophiuchus North 1

Supersonic motions are common in molecular clouds. (Sub)sonic turbulence is usually detected toward dense cores and filaments. However, it remains unknown whether (sub)sonic motions at larger scales ($\gtrsim$1~pc) can be present in different environments or not. Located at a distance of about 110 pc, Ophiuchus North 1 (Oph N1) is one of the nearest molecular clouds that allows in-depth investigation of its turbulence properties by large-scale mapping observations of single-dish telescopes. We carried out the $^{12}$CO ($J=1-0$) and C$^{18}$O ($J=1-0$) imaging observations toward Oph N1 with the Purple Mountain Observatory 13.7 m telescope. The observations have an angular resolution of $\sim$55\arcsec (i.e., 0.03~pc). Most of the whole C$^{18}$O emitting regions have Mach numbers of $\lesssim$1, demonstrating the large-scale (sub)sonic turbulence across Oph N1. Based on the polarization measurements, we estimate the magnetic field strength of the plane-of-sky component to be $\gtrsim$9~$μ$G. We infer that Oph N1 is globally sub-Alfv{é}nic, and is supported against gravity mainly by the magnetic field. The steep velocity structure function can be caused by the expansion of the Sh~2-27 H{\scriptsize II} region or the dissipative range of incompressible turbulence. Our observations reveal a surprising case of clouds characterised by widespread subsonic turbulence and steep size-linewidth relationship. This cloud is magnetized where ion-neutral friction should play an important role.

preprint2021arXiv

Are Type Ia Supernovae in Restframe $H$ Brighter in More Massive Galaxies?

We analyze 143 Type Ia supernovae (SNeIa) observed in $H$ band (1.6-1.8 $μ$m) and find SNeIa are intrinsically brighter in $H$-band with increasing host galaxy stellar mass. We find SNeIa in galaxies more massive than $10^{10.43} M_{\odot}$ are $0.13 \pm 0.04$ mag brighter in $H$ than SNeIa in less massive galaxies. The same set of SNeIa observed at optical wavelengths, after width-color-luminosity corrections, exhibit a $0.10 \pm 0.03$ mag offset in the Hubble residuals. We observe an outlier population ($|ΔH_{\rm max}| > 0.5$ mag) in the $H$ band and show that removing the outlier population moves the mass threshold to $10^{10.65} M_{\odot}$ and reduces the step in $H$ band to $0.08 \pm 0.04$ mag, but the equivalent optical mass step is increased to $0.13 \pm 0.04$ mag. We conclude the outliers do not drive the brightness--host-mass correlation. Less massive galaxies preferentially host more higher-stretch SNeIa, which are intrinsically brighter and bluer. It is only after correction for width-luminosity and color-luminosity relationships that SNeIa have brighter optical Hubble residuals in more massive galaxies. Thus finding SNeIa are intrinsically brighter in $H$ in more massive galaxies is an opposite correlation to the intrinsic (pre-width-luminosity correction) optical brightness. If dust and the treatment of intrinsic color variation were the main driver of the host galaxy mass correlation, we would not expect a correlation of brighter $H$-band SNeIa in more massive galaxies.

preprint2021arXiv

The Dependence of the Type Ia Supernova Host Bias on Observation or Fitting Technique

More luminous Type Ia supernovae (SNe Ia) prefer less massive hosts and regions of higher star formation. This correlation is inverted during width-color-luminosity light curve standardization resulting in step-like biases of distance measurements with respect to host properties. Using the PISCO supernova host sample and SDSS, GALEX, and 2MASS photometry, we compare host stellar mass and specific star formation rate (sSFR) from different observation methods, including local vs. global, and fitting techniques to measure their impact on the host step biases. Mass step measurements for all our mass samples are consistent within a 1$σ$ significance from -0.03$\pm$0.02 mag to -0.04$\pm$0.02 mag. Including or excluding UV information had no effect on measured mass step size or location. Specific SFR (sSFR) step sizes are more significant than mass step measurements and varied from $0.05\pm0.03$ mag (H$α$) and $0.06\pm0.02$ mag (UV) for a 51 host sample. The sSFR step location is influenced by mass sample used to normalize star formation and by sSFR tracer choice. The step size is reduced to 0.04$\pm$0.03 mag when using all available 73 hosts with H$α$ measurements. This 73 PISCO host subsample overall lacked a clear step signal, but here we are searching for whether different choices of mass or sSFR estimation can create a step signal. We find no evidence that different observation or fitting techniques choice can create a distance measurement step in either mass or sSFR.

preprint2021arXiv

What is a stochastic Hamiltonian process on finite graph? An optimal transport answer

We present a definition of stochastic Hamiltonian process on finite graph via its corresponding density dynamics in Wasserstein manifold. We demonstrate the existence of stochastic Hamiltonian process in many classical discrete problems, such as the optimal transport problem, Schrödinger equation and Schrödinger bridge problem (SBP). The stationary and periodic properties of Hamiltonian processes are also investigated in the framework of SBP.

preprint2020arXiv

3DSSD: Point-based 3D Single Stage Object Detector

Currently, there have been many kinds of voxel-based 3D single stage detectors, while point-based single stage methods are still underexplored. In this paper, we first present a lightweight and effective point-based 3D single stage object detector, named 3DSSD, achieving a good balance between accuracy and efficiency. In this paradigm, all upsampling layers and refinement stage, which are indispensable in all existing point-based methods, are abandoned to reduce the large computation cost. We novelly propose a fusion sampling strategy in downsampling process to make detection on less representative points feasible. A delicate box prediction network including a candidate generation layer, an anchor-free regression head with a 3D center-ness assignment strategy is designed to meet with our demand of accuracy and speed. Our paradigm is an elegant single stage anchor-free framework, showing great superiority to other existing methods. We evaluate 3DSSD on widely used KITTI dataset and more challenging nuScenes dataset. Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well, with inference speed more than 25 FPS, 2x faster than former state-of-the-art point-based methods.

preprint2020arXiv

CO observations toward HI-rich Ultra Diffuse Galaxies

We present CO observations toward a sample of six HI-rich Ultra-diffuse galaxies (UDGs) as well as one UDG (VLSB-A) in the Virgo Cluster with the IRAM 30-m telescope. CO 1-0 is marginally detected at 4sigma level in AGC122966, as the first detection of CO emission in UDGs. We estimate upper limits of molecular mass in other galaxies from the non-detection of CO lines. These upper limits and the marginal CO detection in AGC122966 indicate low mass ratios between molecular and atomic gas masses. With the star formation efficiency derived from the molecular gas, we suggest that the inefficiency of star formation in such HI-rich UDGs is likely caused by the low efficiency in converting molecules from atomic gas, instead of low efficiency in forming stars from molecular gas.

preprint2020arXiv

Dive Deeper Into Box for Object Detection

Anchor free methods have defined the new frontier in state-of-the-art object detection researches where accurate bounding box estimation is the key to the success of these methods. However, even the bounding box has the highest confidence score, it is still far from perfect at localization. To this end, we propose a box reorganization method(DDBNet), which can dive deeper into the box for more accurate localization. At the first step, drifted boxes are filtered out because the contents in these boxes are inconsistent with target semantics. Next, the selected boxes are broken into boundaries, and the well-aligned boundaries are searched and grouped into a sort of optimal boxes toward tightening instances more precisely. Experimental results show that our method is effective which leads to state-of-the-art performance for object detection.

preprint2020arXiv

DSGN: Deep Stereo Geometry Network for 3D Object Detection

Most state-of-the-art 3D object detectors heavily rely on LiDAR sensors because there is a large performance gap between image-based and LiDAR-based methods. It is caused by the way to form representation for the prediction in 3D scenarios. Our method, called Deep Stereo Geometry Network (DSGN), significantly reduces this gap by detecting 3D objects on a differentiable volumetric representation -- 3D geometric volume, which effectively encodes 3D geometric structure for 3D regular space. With this representation, we learn depth information and semantic cues simultaneously. For the first time, we provide a simple and effective one-stage stereo-based 3D detection pipeline that jointly estimates the depth and detects 3D objects in an end-to-end learning manner. Our approach outperforms previous stereo-based 3D detectors (about 10 higher in terms of AP) and even achieves comparable performance with several LiDAR-based methods on the KITTI 3D object detection leaderboard. Our code is publicly available at https://github.com/chenyilun95/DSGN.

preprint2020arXiv

Filament Intersections and Cold Dense Cores in Orion A North

We studied the filament structures and dense cores in OMC-2,3 region in Orion A North molecular cloud using the high-resolution N2H+ (1-0) spectral cube observed with the Atacama Large Millimeter/Submillimeter Array (ALMA). The filament network over a total length of 2 pc is found to contain 170 intersections and 128 candidate dense cores. The dense cores are all displaced from the infrared point sources (possible young stars), and the major fraction of cores (103) are located around the intersections. Towards the intersections, there is also an increasing trend for the total column density Ntot as well as the the power-law index of the column-density Probability Distribution Function (N-PDF), suggesting that the intersections would in general have more significant gas assembly than the other part of the filament paths. The virial analysis shows that the dense cores mostly have virial mass ratio of alpha_vir=M_vir/M_gas<1.0, suggesting them to be bounded by the self gravity. In the mean time, only about 23 percent of the cores have critical mass ratio of alpha_crit=M_crit/M_gas<1.0, suggesting them to be unstable against core collapse. Combining these results, it shows that the major fraction of the cold starless and possible prestellar cores in OMC-2,3 are being assembled around the intersections, and currently in a gravitationally bound state. But more extensive core collapse and star formation may still require continuous core-mass growth or other perturbatio

preprint2020arXiv

Jointly Modeling Aspect and Sentiment with Dynamic Heterogeneous Graph Neural Networks

Target-Based Sentiment Analysis aims to detect the opinion aspects (aspect extraction) and the sentiment polarities (sentiment detection) towards them. Both the previous pipeline and integrated methods fail to precisely model the innate connection between these two objectives. In this paper, we propose a novel dynamic heterogeneous graph to jointly model the two objectives in an explicit way. Both the ordinary words and sentiment labels are treated as nodes in the heterogeneous graph, so that the aspect words can interact with the sentiment information. The graph is initialized with multiple types of dependencies, and dynamically modified during real-time prediction. Experiments on the benchmark datasets show that our model outperforms the state-of-the-art models. Further analysis demonstrates that our model obtains significant performance gain on the challenging instances under multiple-opinion aspects and no-opinion aspect situations.

preprint2020arXiv

Parametric Fokker-Planck equation

We derive the Fokker-Planck equation on the parametric space. It is the Wasserstein gradient flow of relative entropy on the statistical manifold. We pull back the PDE to a finite dimensional ODE on parameter space. Some analytical example and numerical examples are presented.

preprint2020arXiv

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

Instance segmentation is an important task for scene understanding. Compared to the fully-developed 2D, 3D instance segmentation for point clouds have much room to improve. In this paper, we present PointGroup, a new end-to-end bottom-up architecture, specifically focused on better grouping the points by exploring the void space between objects. We design a two-branch network to extract point features and predict semantic labels and offsets, for shifting each point towards its respective instance centroid. A clustering component is followed to utilize both the original and offset-shifted point coordinate sets, taking advantage of their complementary strength. Further, we formulate the ScoreNet to evaluate the candidate instances, followed by the Non-Maximum Suppression (NMS) to remove duplicates. We conduct extensive experiments on two challenging datasets, ScanNet v2 and S3DIS, on which our method achieves the highest performance, 63.6% and 64.0%, compared to 54.9% and 54.4% achieved by former best solutions in terms of mAP with IoU threshold 0.5.

preprint2019arXiv

Pilot HI Survey of Planck Galactic Cold Clumps with FAST

We present a pilot HI survey of 17 Planck Galactic Cold Clumps (PGCCs) with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). HI Narrow Self-Absorption (HINSA) is an effective method to detect cold HI being mixed with molecular hydrogen H$_2$ and improves our understanding of the atomic to molecular transition in the interstellar medium. HINSA was found in 58\% PGCCs that we observed. The column density of HINSA was found to have an intermediate correlation with that of $^{13}$CO, following $\rm log( N(HINSA)) = (0.52\pm 0.26) log(N_{^{13}CO}) + (10 \pm 4.1) $. HI abundance relative to total hydrogen [HI]/[H] has an average value of $4.4\times 10^{-3}$, which is about 2.8 times of the average value of previous HINSA surveys toward molecular clouds. For clouds with total column density N$\rm_H >5 \times 10^{20}$ cm$^{-2}$, an inverse correlation between HINSA abundance and total hydrogen column density is found, confirming the depletion of cold HI gas during molecular gas formation in more massive clouds. Nonthermal line width of $^{13}$CO is about 0-0.5 km s$^{-1}$ larger than that of HINSA. One possible explanation of narrower nonthermal width of HINSA is that HINSA region is smaller than that of $^{13}$CO. Based on an analytic model of H$_2$ formation and H$_2$ dissociation by cosmic ray, we found the cloud ages to be within 10$^{6.7}$-10$^{7.0}$ yr for five sources.

preprint2016arXiv

Label distribution based facial attractiveness computation by deep residual learning

Two challenges lie in the facial attractiveness computation research: the lack of true attractiveness labels (scores), and the lack of an accurate face representation. In order to address the first challenge, this paper recasts facial attractiveness computation as a label distribution learning (LDL) problem rather than a traditional single-label supervised learning task. In this way, the negative influence of the label incomplete problem can be reduced. Inspired by the recent promising work in face recognition using deep neural networks to learn effective features, the second challenge is expected to be solved from a deep learning point of view. A very deep residual network is utilized to enable automatic learning of hierarchical aesthetics representation. Integrating these two ideas, an end-to-end deep learning framework is established. Our approach achieves the best results on a standard benchmark SCUT-FBP dataset compared with other state-of-the-art work.

Shu Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

34 published item(s)

A Natural Primal-Dual Hybrid Gradient Method for Adversarial Neural Network Training on Solving Partial Differential Equations

Judge, Then Drive: A Critic-Centric Vision Language Action Framework for Autonomous Driving

MOODv2: Masked Image Modeling for Out-of-Distribution Detection

A new construction of nonlinear codes via algebraic function fields

DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation

Deuterated ammonia in Galactic massive star-forming regions

DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors

EfficientNeRF: Efficient Neural Radiance Fields

Generalized Few-shot Semantic Segmentation

Good locally repairable codes via propagation rules

Improving Data Driven Inverse Text Normalization using Data Augmentation

Neural Parametric Fokker-Planck Equations

Optimal control for stochastic nonlinear Schrodinger equation on graph

Properties of dense molecular gas along the major axis of M 82

ResLT: Residual Learning for Long-tailed Recognition

SEA: Bridging the Gap Between One- and Two-stage Detector Distillation via SEmantic-aware Alignment

Spatial distribution of HOCN around Sagittarius B2

Stratified Transformer for 3D Point Cloud Segmentation

Upper bounds on maximum lengths of Singleton-optimal locally repairable codes

Wasserstein Hamiltonian flow with common noise on graph

Widespread subsonic turbulence in Ophiuchus North 1

Are Type Ia Supernovae in Restframe $H$ Brighter in More Massive Galaxies?

The Dependence of the Type Ia Supernova Host Bias on Observation or Fitting Technique

What is a stochastic Hamiltonian process on finite graph? An optimal transport answer

3DSSD: Point-based 3D Single Stage Object Detector

CO observations toward HI-rich Ultra Diffuse Galaxies

Dive Deeper Into Box for Object Detection

DSGN: Deep Stereo Geometry Network for 3D Object Detection

Filament Intersections and Cold Dense Cores in Orion A North

Jointly Modeling Aspect and Sentiment with Dynamic Heterogeneous Graph Neural Networks

Parametric Fokker-Planck equation

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

Pilot HI Survey of Planck Galactic Cold Clumps with FAST

Label distribution based facial attractiveness computation by deep residual learning