Source author record

Jia Sun

Jia Sun appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Artificial Intelligence astro-ph.SR cond-mat.mtrl-sci eess.IV Machine Learning math.NA Numerical Analysis physics.chem-ph physics.med-ph

Catalog footprint

What is connected

6works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CaC: Advancing Video Reward Models via Hierarchical Spatiotemporal Concentrating

In this paper, we propose Concentrate and Concentrate (CaC), a coarse-to-fine anomaly reward model based on Vision-Language Models. During inference, it first conducts a global temporal scan to anchor anomalous time windows, then performs fine-grained spatial grounding within the localized interval, and finally derives robust judgments via structured spatiotemporal Chain-of-Thought reasoning. To equip the model with these capabilities, we construct the first large-scale generated video anomaly dataset with per-frame bounding-box annotations, temporal anomaly windows, and fine-grained attribution labels. Building on this dataset, we design a three-stage progressive training paradigm. The model initially learns spatial and temporal anchoring through single- and multi-frame supervised fine-tuning, and then is optimized by a reinforcement learning strategy based on two-turn Group Relative Policy Optimization (GRPO). Beyond conventional accuracy rewards, we introduce Temporal and Spatial IoU rewards to supervise the intermediate localization process, effectively guiding the model toward more grounded and interpretable spatiotemporal reasoning. Extensive experiments demonstrate that CaC can stably concentrate on subtle anomalies, achieving a 25.7% accuracy improvement on fine-grained anomaly benchmarks and, when used as a reward signal, CaC reduces generated-video anomalies by 11.7% while improving overall video quality.

preprint2022arXiv

CENN: Conservative energy method based on neural networks with subdomains for solving variational problems involving heterogeneous and complex geometries

We propose a conservative energy method based on neural networks with subdomains for solving variational problems (CENN), where the admissible function satisfying the essential boundary condition without boundary penalty is constructed by the radial basis function (RBF), particular solution neural network, and general neural network. The loss term is the potential energy, optimized based on the principle of minimum potential energy. The loss term at the interfaces has the lower order derivative compared to the strong form PINN with subdomains. The advantage of the proposed method is higher efficiency, more accurate, and less hyperparameters than the strong form PINN with subdomains. Another advantage of the proposed method is that it can apply to complex geometries based on the special construction of the admissible function. To analyze its performance, the proposed method CENN is used to model representative PDEs, the examples include strong discontinuity, singularity, complex boundary, non-linear, and heterogeneous problems. Furthermore, it outperforms other methods when dealing with heterogeneous problems.

preprint2022arXiv

Computer-aided Recognition and Assessment of a Porous Bioelastomer on Ultrasound Images for Regenerative Medicine Applications

Biodegradable elastic scaffolds have attracted more and more attention in the field of soft tissue repair and tissue engineering. These scaffolds made of porous bioelastomers support tissue ingrowth along with their own degradation. It is necessary to develop a computer-aided analyzing method based on ultrasound images to identify the degradation performance of the scaffold, not only to obviate the need to do destructive testing, but also to monitor the scaffold's degradation and tissue ingrowth over time. It is difficult using a single traditional image processing algorithm to extract continuous and accurate contour of a porous bioelastomer. This paper proposes a joint algorithm for the bioelastomer's contour detection and a texture feature extraction method for monitoring the degradation behavior of the bioelastomer. Mean-shift clustering method is used to obtain the bioelastomer's and native tissue's clustering feature information. Then the OTSU image binarization method automatically selects the optimal threshold value to convert the grayscale ultrasound image into a binary image. The Canny edge detector is used to extract the complete bioelastomer's contour. The first-order and second-order statistical features of texture are extracted. The proposed joint algorithm not only achieves the ideal extraction of the bioelastomer's contours in ultrasound images, but also gives valuable feedback of the degradation behavior of the bioelastomer at the implant site based on the changes of texture characteristics and contour area. The preliminary results of this study suggest that the proposed computer-aided image processing techniques have values and potentials in the non-invasive analysis of tissue scaffolds in vivo based on ultrasound images and may help tissue engineers evaluate the tissue scaffold's degradation and cellular ingrowth progress and improve the scaffold designs.

preprint2022arXiv

MetaFormer: A Unified Meta Framework for Fine-Grained Recognition

Fine-Grained Visual Classification(FGVC) is the task that requires recognizing the objects belonging to multiple subordinate categories of a super-category. Recent state-of-the-art methods usually design sophisticated learning pipelines to tackle this task. However, visual information alone is often not sufficient to accurately differentiate between fine-grained visual categories. Nowadays, the meta-information (e.g., spatio-temporal prior, attribute, and text description) usually appears along with the images. This inspires us to ask the question: Is it possible to use a unified and simple framework to utilize various meta-information to assist in fine-grained identification? To answer this problem, we explore a unified and strong meta-framework(MetaFormer) for fine-grained visual classification. In practice, MetaFormer provides a simple yet effective approach to address the joint learning of vision and various meta-information. Moreover, MetaFormer also provides a strong baseline for FGVC without bells and whistles. Extensive experiments demonstrate that MetaFormer can effectively use various meta-information to improve the performance of fine-grained recognition. In a fair comparison, MetaFormer can outperform the current SotA approaches with only vision information on the iNaturalist2017 and iNaturalist2018 datasets. Adding meta-information, MetaFormer can exceed the current SotA approaches by 5.9% and 5.3%, respectively. Moreover, MetaFormer can achieve 92.3% and 92.7% on CUB-200-2011 and NABirds, which significantly outperforms the SotA approaches. The source code and pre-trained models are released athttps://github.com/dqshuai/MetaFormer.

preprint2015arXiv

A deep near-infrared survey toward the Aquila molecular cloud - I. Molecular hydrogen outflows

We have performed an unbiased deep near-infrared survey toward the Aquila molecular cloud with a sky coverage of ~1 deg2. We identified 45 molecular hydrogen emission-line objects(MHOs), of which only 11 were previously known. Using the Spitzer archival data we also identified 802 young stellar objects (YSOs) in this region. Based on the morphology and the location of MHOs and YSO candidates, we associate 43 MHOs with 40 YSO candidates. The distribution of jet length shows an exponential decrease in the number of outflows with increasing length and the molecular hydrogen outflows seem to be oriented randomly. Moreover, there is no obvious correlation between jet lengths, jet opening angles, or jet H2 1-0 S(1) luminosities and spectral indices of the possible driving sources in this region. We also suggest that molecular hydrogen outflows in the Aquila molecular cloud are rather weak sources of turbulence, unlikely to generate the observed velocity dispersion in the region of survey.

preprint2012arXiv

Gated Conductance of Thin Indium Tin Oxide - The Simplest Transistor

Transistors are the fundamental building block of modern electronic devices. So far, all transistors are based on various types of semiconductor junctions. The most common bipolar-junction transistors and metal-oxide-semiconductor field-effect transistors contain p-n junctions to control the current, depending on applied biases across the junctions. Thin-film transistors need metal-semiconductor junctions for injecting and extracting electrons from their channels. Here, by coating a heavily-doped thin indium-tin-oxide (ITO) film through a shadow mask onto a biopolymer chitosan/ITO/glass substrate, we can have a high-performance junctionless transparent organic-inorganic hybrid thin film transistor. This could be the simplest transistor in the world, to our knowledge, not only in its structure, but also its fabrication process. In addition, the device performance is found to be greatly enhanced using a reinforced chitosan/SiO2 hybrid bilayer dielectric stack. Our results clearly show that this architecture can lead to a new class of low-cost transistors.

Jia Sun

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

CaC: Advancing Video Reward Models via Hierarchical Spatiotemporal Concentrating

CENN: Conservative energy method based on neural networks with subdomains for solving variational problems involving heterogeneous and complex geometries

Computer-aided Recognition and Assessment of a Porous Bioelastomer on Ultrasound Images for Regenerative Medicine Applications

MetaFormer: A Unified Meta Framework for Fine-Grained Recognition

A deep near-infrared survey toward the Aquila molecular cloud - I. Molecular hydrogen outflows

Gated Conductance of Thin Indium Tin Oxide - The Simplest Transistor