Source author record

Hongbo Sun

Hongbo Sun appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning physics.optics Artificial Intelligence Computer Vision cond-mat.mes-hall eess.SY Systems and Control

Catalog footprint

What is connected

6works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Segment-Aligned Policy Optimization for Multi-Modal Reasoning

Existing reinforcement learning approaches for Large Language Models typically perform policy optimization at the granularity of individual tokens or entire response sequences. However, such formulations often misalign with the natural step-wise structure of reasoning processes, leading to suboptimal credit assignment and unstable training in multi-modal reasoning tasks. To bridge this gap, we propose Segment-Aligned Policy Optimization (SAPO), a novel reinforcement learning paradigm that treats coherent reasoning steps, rather than tokens or full sequences as fundamental units of policy update. SAPO introduces a step-wise Markov decision process abstraction over reasoning segments, accompanied by segment-level value estimation, advantage computation, and importance sampling mechanisms that are semantically aligned with reasoning boundaries. Experiments on representative reasoning benchmarks demonstrate that SAPO consistently outperforms token-level and sequence-level policy optimization methods, achieving significant accuracy improvements while exhibiting better training stability and value estimation consistency. Our work underscores the importance of aligning reinforcement learning updates with the intrinsic structure of reasoning, paving the way for more efficient and semantically grounded policy optimization in complex reasoning tasks. Codes and models will be released to ensure full reproducibility.

preprint2022arXiv

SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization

Fine-grained visual categorization (FGVC) aims at recognizing objects from similar subordinate categories, which is challenging and practical for human's accurate automatic recognition needs. Most FGVC approaches focus on the attention mechanism research for discriminative regions mining while neglecting their interdependencies and composed holistic object structure, which are essential for model's discriminative information localization and understanding ability. To address the above limitations, we propose the Structure Information Modeling Transformer (SIM-Trans) to incorporate object structure information into transformer for enhancing discriminative representation learning to contain both the appearance information and structure information. Specifically, we encode the image into a sequence of patch tokens and build a strong vision transformer framework with two well-designed modules: (i) the structure information learning (SIL) module is proposed to mine the spatial context relation of significant patches within the object extent with the help of the transformer's self-attention weights, which is further injected into the model for importing structure information; (ii) the multi-level feature boosting (MFB) module is introduced to exploit the complementary of multi-level features and contrastive learning among classes to enhance feature robustness for accurate recognition. The proposed two modules are light-weighted and can be plugged into any transformer network and trained end-to-end easily, which only depends on the attention weights that come with the vision transformer itself. Extensive experiments and analyses demonstrate that the proposed SIM-Trans achieves state-of-the-art performance on fine-grained visual categorization benchmarks. The code is available at https://github.com/PKU-ICST-MIPL/SIM-Trans_ACMMM2022.

preprint2020arXiv

A Holistic Framework for Parameter Coordination of Interconnected Microgrids against Disasters

This paper proposes a holistic framework for parameter coordination of a power electronic-interfaced microgrid interconnection against natural disasters. The paper identifies a transient stability issue in a microgrid interconnection. Based on recent advances in control theory, we design a framework that can systematically coordinate system parameters, such that post-disaster equilibrium points of microgrid interconnections are asymptotically stable. The core of the framework is a stability assessment algorithm using sum of squares programming. The efficacy of the proposed framework is tested in a four-microgrid interconnection. The proposed framework has potential to extend to microgrid interconnections with a wide range of hierarchical control schemes.

preprint2020arXiv

Vortical Reflection and Spiraling Fermi Arcs with Weyl Metamaterials

Scatterings and transport in Weyl semimetals have caught growing attention in condensed matter physics, with observables including chiral zero modes and the associated magnetoresistance and chiral magnetic effects. Measurement of electrical conductance is usually performed in these studies, which, however, cannot resolve the momentum of electrons, preventing direct observation of the phase singularities in scattering matrix associated with Weyl point. Here we experimentally demonstrate a helical phase distribution in the angle (momentum) resolved scattering matrix of electromagnetic waves in a photonic Weyl metamaterial. It further leads to spiraling Fermi arcs in an air gap sandwiched between a Weyl metamaterial and a metal plate. Benefiting from the alignment-free feature of angular vortical reflection, our findings establish a new platform in manipulating optical angular momenta with photonic Weyl systems.

preprint2018arXiv

Learning Dynamical Demand Response Model in Real-Time Pricing Program

Price responsiveness is a major feature of end use customers (EUCs) that participate in demand response (DR) programs, and has been conventionally modeled with static demand functions, which take the electricity price as the input and the aggregate energy consumption as the output. This, however, neglects the inherent temporal correlation of the EUC behaviors, and may result in large errors when predicting the actual responses of EUCs in real-time pricing (RTP) programs. In this paper, we propose a dynamical DR model so as to capture the temporal behavior of the EUCs. The states in the proposed dynamical DR model can be explicitly chosen, in which case the model can be represented by a linear function or a multi-layer feedforward neural network, or implicitly chosen, in which case the model can be represented by a recurrent neural network or a long short-term memory unit network. In both cases, the dynamical DR model can be learned from historical price and energy consumption data. Numerical simulation illustrated how the states are chosen and also showed the proposed dynamical DR model significantly outperforms the static ones.

preprint2016arXiv

Pancharatnam-Berry phase induced spin-selective transmission in herringbone dielectric metamaterials

Manipulating the polarisation of light is crucial for sensing and imaging applications. One such aspect in particular is selective transmission of one circular polarisation (spin) when light is transmitted through a medium or a device. However, most present methods of achieving this have relatively low efficiency and selectivity, whilst high selectivity examples rely on lossy and complex three-dimensional helical or multilayer structures. Here, we propose a dielectric metamaterial approach for achieving spin-selective transmission of electromagnetic waves, utilizing spin-controlled constructive or destructive interference between two Pancharatnam-Berry (PB) phases in conjunction with propagative dynamic phase. The dielectric metamaterial, consisting of monolithic silicon herringbone structures, exhibits a broadband operation in the terahertz regime whilst obtaining a spin-selective efficiency upwards of 60%. Such a device is robust and is not easily degraded by errors in fabrication.