Researcher profile

Qing Liao

Qing Liao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Enhancing the Code Reasoning Capabilities of LLMs via Consistency-based Reinforcement Learning

Code reasoning refers to the task of predicting the output of a program given its source code and specific inputs. It can measure the reasoning capability of large language models (LLMs) and also benefit downstream tasks such as code generation and mathematical reasoning. Existing work has verified the effectiveness of reinforcement learning on the task. However, these methods design rewards solely based on final outputs or coarse-grained signals, and neglect the inherent consistency of the stepwise reasoning process in the task. Therefore, these methods often result in sparse reward or reward hacking, which limits the full play of enhanced learning capabilities. To alleviate these issues, we propose CodeThinker, a consistency-driven reinforcement learning framework for code reasoning. Specifically, CodeThinker has three key components: (1) a stepwise reasoning-aware model training module, which utilizes a consistency tracing paradigm as a template to synthesize training data that captures the stepwise reasoning process; (2) a dynamic beam sampling strategy, which aims to improve the quality of sampled outputs under a fixed sampling budget; and (3) a consistency reward mechanism that can effectively alleviate reward hacking. Experiments on three popular benchmarks show that CodeThinker achieves state-of-the-art performance across multiple LLMs. For instance, it outperforms the strongest baseline by 4.3% in accuracy when deployed on Qwen2.5-Coder-7B-Instruct. We also validate the effectiveness of CodeThinker on downstream tasks. Results show that, without additional training, CodeThinker obtains average accuracy gains of 5.33 and 3.11 percentage points on mathematical reasoning and code reasoning tasks covering 17 programming languages, respectively.

preprint2024arXiv

Optical spin Hall effect pattern switching in polariton condensates in organic single-crystal microbelts

Topological polaritons, combining the robustness of the topological protected edge states to defects and disorder with the strong nonlinear properties of polariton bosons, represent an excellent platform to investigate novel photonic topological phases. In this work, we demonstrated the optical spin Hall effect (OSHE) and its symmetry switching in the exciton-polariton regime of pure DPAVBi crystals. Benefiting from the photonic Rashba-Dresselhaus spin-orbit coupling in organic crystals, we observed the separation of left- and right-circularly-polarized polariton emission in two-dimensional momentum space and real space, a signature of the OSHE. Above the lasing threshold, the OSHE pattern changes due to transverse quantization in the microbelt. This device without superlattice structure has great potential applications in topological polaritonics, such as information transmission, photonic integrated chips and quantum information.

preprint2022arXiv

Circularly polarized electroluminescence from a single-crystal organic microcavity light-emitting diode based on photonic spin-orbit interactions

Circularly polarized (CP) electroluminescence from organic light-emitting diodes (OLEDs) has aroused considerable attention for their potential in future display and photonic technologies. The development of CP-OLEDs relies largely on chiral-emitters, which not only remain rare owing to difficulties in design and synthesis but also limit the performance of electroluminescence. When the polarization (pseudospin) degrees of freedom of a photon interact with its orbital angular momentum, photonic spin-orbit interaction (SOI) emerges such as Rashba-Dresselhaus (RD) effect. Here, we demonstrate a chiral-emitter-free microcavity CP-OLED with a high dissymmetry factor (gEL) and high luminance by embedding a thin two-dimensional organic single crystal (2D-OSC) between two silver layers which serve as two metallic mirrors forming a microcavity and meanwhile also as two electrodes in an OLED architecture. In the presence of the RD effect, the SOIs in the birefringent 2D-OSC microcavity result in a controllable spin-splitting with CP dispersions. Thanks to the high emission efficiency and high carrier mobility of the OSC, chiral-emitter-free CP-OLEDs have been demonstrated exhibiting a high gEL of 1.1 and a maximum luminance of about 60000 cd/m2, which places our device among the best performing CP-OLEDs. This strategy opens a new avenue for practical applications towards on-chip microcavity CP-OLEDs.

preprint2022arXiv

On the Equity of Nuclear Norm Maximization in Unsupervised Domain Adaptation

Nuclear norm maximization has shown the power to enhance the transferability of unsupervised domain adaptation model (UDA) in an empirical scheme. In this paper, we identify a new property termed equity, which indicates the balance degree of predicted classes, to demystify the efficacy of nuclear norm maximization for UDA theoretically. With this in mind, we offer a new discriminability-and-equity maximization paradigm built on squares loss, such that predictions are equalized explicitly. To verify its feasibility and flexibility, two new losses termed Class Weighted Squares Maximization (CWSM) and Normalized Squares Maximization (NSM), are proposed to maximize both predictive discriminability and equity, from the class level and the sample level, respectively. Importantly, we theoretically relate these two novel losses (i.e., CWSM and NSM) to the equity maximization under mild conditions, and empirically suggest the importance of the predictive equity in UDA. Moreover, it is very efficient to realize the equity constraints in both losses. Experiments of cross-domain image classification on three popular benchmark datasets show that both CWSM and NSM contribute to outperforming the corresponding counterparts.

preprint2021arXiv

Realization of exciton-mediated optical spin-orbit interaction in organic microcrystalline resonators

The ability to control the spin-orbit interaction of light in optical microresonators is of fundamental importance for future photonics. Organic microcrystals, due to their giant optical anisotropy, play a crucial role in spin-optics and topological photonics. Here we realize controllable and wavelength-dependent Rashba-Dresselhaus spin-orbit interaction, attributed to the anisotropic excitonic response in an optical microcavity filled with an organic microcrystalline. We also investigate the transition of the spin-orbit interaction from dominant photonic type caused by the splitting of the transverse-electric and transverse-magnetic modes to spin-orbit interaction of the Rashba-Dresselhaus type. The interplay of the two allows us to engineer the spin-orbit interaction of light in organic microcavities, which besides its fundamental interest promises applications in spin-controlled on-chip integrated nanophotonic elements, towards exploiting non-magnetic and low-cost spin-photonic devices.