Researcher profile

Yujia Wang

Yujia Wang contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
9 works
0 followers
13 topics
4 close collaborators

Published work

9 published items

preprint · 2026 · arXiv

Rewarding Creativity: A Human-Aligned Generative Reward Model for Reinforcement Learning in Storytelling

While Large Language Models (LLMs) can generate fluent text, producing high-quality creative stories remains challenging. Reinforcement Learning (RL) offers a promising solution but faces two critical obstacles: designing reliable reward signals for subjective storytelling quality and mitigating training instability. This paper introduces the Reinforcement Learning for Creative Storytelling (RLCS) framework to systematically address both challenges. First, we develop a Generative Reward Model (GenRM) that provides multi-dimensional analysis and explicit reasoning about story preferences, trained through supervised fine-tuning on demonstrations with reasoning chains distilled from strong teacher models, followed by GRPO-based refinement on expanded preference data. Second, we introduce an entropy-based reward shaping strategy that dynamically prioritizes learning on confident errors and uncertain correct predictions, preventing overfitting on already-mastered patterns. Experiments demonstrate that GenRM achieves 68% alignment with human creativity judgments, and RLCS significantly outperforms strong baselines including Gemini-2.5-Pro in overall story quality. This work provides a practical pipeline for applying RL to creative domains, effectively navigating the dual challenges of reward modeling and training stability.

preprint · 2022 · arXiv

Communication-Compressed Adaptive Gradient Method for Distributed Nonconvex Optimization

Due to the explosion in the size of training datasets, distributed learning has received growing interest in recent years. One of the major bottlenecks is the large communication cost between the central server and the local workers. While error feedback compression has proven successful in reducing communication costs with stochastic gradient descent (SGD), there have been far fewer attempts to build communication-efficient, provably convergent versions of adaptive gradient methods, which are widely used in training large-scale machine learning models. In this paper, we propose a new communication-compressed AMSGrad for distributed nonconvex optimization problems, which is provably efficient. Our proposed distributed learning framework features an effective gradient compression strategy and a worker-side model update design. We prove that the proposed communication-efficient distributed adaptive gradient method converges to a first-order stationary point with the same iteration complexity as uncompressed vanilla AMSGrad in the stochastic nonconvex optimization setting. Experiments on various benchmarks back up our theory.
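
For readers unfamiliar with error feedback compression, the following is a minimal sketch of the generic idea, assuming a top-k sparsifier: each worker sends only the largest entries of its gradient and carries the unsent remainder into the next round. The names `top_k` and `ErrorFeedbackCompressor` are hypothetical, and this is not the AMSGrad-based method proposed in the paper, whose compression strategy and worker-side update are not described in the abstract.

```python
def top_k(vec, k):
    """Keep the k largest-magnitude entries of vec and zero out the rest."""
    order = sorted(range(len(vec)), key=lambda i: abs(vec[i]), reverse=True)
    keep = set(order[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(vec)]


class ErrorFeedbackCompressor:
    """Generic error feedback: compress (gradient + residual), store what was dropped."""

    def __init__(self, dim, k):
        self.k = k
        self.residual = [0.0] * dim  # compression error carried over from earlier rounds

    def compress(self, grad):
        # Add back the error left over from the previous round.
        corrected = [g + e for g, e in zip(grad, self.residual)]
        # Only this sparse vector would be communicated.
        sent = top_k(corrected, self.k)
        # Remember what was not sent so it is not lost permanently.
        self.residual = [c - s for c, s in zip(corrected, sent)]
        return sent


compressor = ErrorFeedbackCompressor(dim=3, k=1)
first = compressor.compress([1.0, 0.5, -0.2])
second = compressor.compress([0.1, 0.5, 0.0])
print(first, second)
```

In the second round the middle coordinate is transmitted because its residual from the first round has accumulated, which is the point of the feedback term.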

preprint · 2021 · arXiv

D2A U-Net: Automatic Segmentation of COVID-19 Lesions from CT Slices with Dilated Convolution and Dual Attention Mechanism

Coronavirus Disease 2019 (COVID-19) has caused heavy casualties and become one of the most urgent public health events worldwide. Computed tomography (CT) is a significant screening tool for COVID-19 infection, and automated segmentation of lung infection in COVID-19 CT images will greatly assist the diagnosis and care of patients. However, accurate and automatic segmentation of COVID-19 lung infections remains challenging. In this paper we propose a dilated dual attention U-Net (D2A U-Net) for COVID-19 lesion segmentation in CT slices, based on dilated convolution and a novel dual attention mechanism, to address this challenge. We introduce a dilated convolution module in the model decoder to achieve a large receptive field, which refines the decoding process and contributes to segmentation accuracy. We also present a dual attention mechanism composed of two attention modules, which are inserted into the skip connections and the model decoder respectively. The dual attention mechanism is used to refine feature maps and reduce the semantic gap between different levels of the model. The proposed method has been evaluated on an open-source dataset and outperforms cutting-edge methods in semantic segmentation. Our proposed D2A U-Net with a pretrained encoder achieves a Dice score of 0.7298 and a recall score of 0.7071. We also build a simplified D2A U-Net without a pretrained encoder to provide a fair comparison with other models trained from scratch, which still outperforms popular U-Net family models with a Dice score of 0.7047 and a recall score of 0.6626. Our experimental results show that introducing dilated convolution and the dual attention mechanism significantly reduces the number of false positives, which improves sensitivity to COVID-19 lesions and subsequently brings a significant increase in the Dice score.
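
As a reference for the two metrics reported above, here is a minimal sketch of how a Dice score and a recall score are commonly computed for binary segmentation masks. The function name `dice_and_recall`, the flat 0/1 mask representation, and the convention of returning 1.0 for empty masks are assumptions for illustration; the paper's own evaluation code is not shown in the abstract.

```python
def dice_and_recall(pred, target):
    """Compute (Dice, recall) for two equal-length sequences of 0/1 labels."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    tp = sum(1 for p, t in zip(pred, target) if p == 1 and t == 1)  # true positives
    fp = sum(1 for p, t in zip(pred, target) if p == 1 and t == 0)  # false positives
    fn = sum(1 for p, t in zip(pred, target) if p == 0 and t == 1)  # false negatives
    # Dice = 2*TP / (2*TP + FP + FN); false positives only appear in the denominator.
    dice_den = 2 * tp + fp + fn
    dice = 2 * tp / dice_den if dice_den else 1.0
    # Recall = TP / (TP + FN).
    recall_den = tp + fn
    recall = tp / recall_den if recall_den else 1.0
    return dice, recall


dice, recall = dice_and_recall([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])
print(dice, recall)
```

Because false positives enter only the Dice denominator, removing them raises the Dice score while leaving recall as defined here unchanged.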

preprint · 2020 · arXiv

A Novel Method to Design Controller Parameters by Using Uniform Design Algorithm

Parameter selection is one of the most important parts of nearly all control strategies. Traditionally, controller parameters are chosen by trial and error, which is tedious and time consuming. Moreover, this method depends heavily on the experience of researchers, which makes it hard to popularize. In this light, this paper proposes a novel parameter search approach based on the uniform design (UD) algorithm, by which satisfactory controller parameters under a performance index can be selected. To this end, two simulation examples are conducted to verify the effectiveness of the proposed scheme. Simulation results show that this novel approach, compared with other intelligent tuning algorithms, excels in efficiency and time saving.

preprint · 2020 · arXiv

Dynamics of A Single Population Model with Memory Effect and Spatial Heterogeneity

In this paper, a single population model with memory effect and environmental heterogeneity, equipped with the Neumann boundary condition, is considered. The global existence of a spatially nonhomogeneous steady state is proved by the method of upper and lower solutions, and this steady state is asymptotically stable for relatively small memorized diffusion. However, once the memorized diffusion rate exceeds a critical value, spatially inhomogeneous periodic solutions can be generated through Hopf bifurcation if the integral of the intrinsic growth rate over the domain is negative. Such a phenomenon never happens if only memorized diffusion or only spatial heterogeneity is present, and therefore must be induced by their joint effects. This indicates that memorized diffusion brings about spatiotemporal patterns in an overall hostile environment. When the integral of the intrinsic growth rate over the domain is positive, it turns out that the steady state is still asymptotically stable. Finally, the possible dynamics of the model are also discussed for the case where the boundary condition is replaced by a Dirichlet condition.

preprint · 2020 · arXiv

Emergent electric field control of phase transformation in oxide superlattices

Electric fields can transform materials with respect to their structure and properties, enabling various applications ranging from batteries to spintronics. Recently, electrolytic gating, which can generate large electric fields and voltage-driven ion transfer, has been identified as a powerful means to achieve electric-field-controlled phase transformations. The class of transition metal oxides (TMOs) provides many potential candidates that present a strong response under electrolytic gating. However, very few show a reversible structural transformation at room temperature. Here, we report the realization of a digitally synthesized TMO that shows a reversible, electric-field-controlled transformation between distinct crystalline phases at room temperature. In superlattices composed of alternating one-unit-cell layers of SrIrO3 and La0.2Sr0.8MnO3, we find a reversible phase transformation with a 7% lattice change and dramatic modulation in chemical, electronic, magnetic and optical properties, mediated by the reversible transfer of oxygen and hydrogen ions. Strikingly, this phase transformation is absent in the constituent oxides, solid solutions and larger-period superlattices. Our findings open up a new class of materials for voltage-controlled functionality.

preprint · 2020 · arXiv

Robust ferromagnetism in highly strained SrCoO3 thin films

Epitaxial strain provides important pathways to control the magnetic and electronic states in transition metal oxides. However, large strain is usually accompanied by a strong reduction of the oxygen vacancy formation energy, which hinders the direct manipulation of their intrinsic properties. Here, using a post-deposition ozone annealing method, we obtained a series of oxygen-stoichiometric SrCoO3 thin films with tensile strain up to 3.0%. We observed a robust ferromagnetic ground state in all strained thin films, while, interestingly, increasing tensile strain triggers a distinct metal-to-insulator transition. The persistent ferromagnetic state across the electrical transition therefore suggests that the magnetic state is directly correlated with the localized electrons rather than the itinerant ones, which calls for further investigation of the intrinsic mechanism of this magnetic compound beyond the double-exchange mechanism.

preprint · 2018 · arXiv

Electric-field Control of Magnetism with Emergent Topological Hall Effect in SrRuO3 through Proton Evolution

Ionic substitution forms an essential pathway to manipulate the carrier density and crystalline symmetry of materials via ion-lattice-electron coupling, leading to a rich spectrum of electronic states in strongly correlated systems. Using the ferromagnetic metal SrRuO3 as a model system, we demonstrate efficient and reversible control of both carrier density and crystalline symmetry through protonation induced by ionic liquid gating. The insertion of protons electron-dopes SrRuO3, leading to an exotic ferromagnetic-to-paramagnetic phase transition as the proton concentration increases. Intriguingly, we observe an emergent topological Hall effect at the boundary of the phase transition as a consequence of the newly established Dzyaloshinskii-Moriya interaction, owing to the breaking of inversion symmetry in protonated SrRuO3 with a proton compositional gradient along the film depth. We envision that electric-field-controlled protonation opens a novel strategy to design material functionalities.

preprint · 2017 · arXiv

High density array of epitaxial BiFeO3 nanodots with robust and reversibly switchable topological domain states

The exotic topological domains in ferroelectrics and multiferroics have attracted extensive interest in recent years due to their novel functionalities and potential applications in nanoelectronic devices. One of the key challenges for such applications is the realization of robust yet reversibly switchable nanoscale topological domain states with high density, wherein spontaneous topological structures can be individually addressed and controlled. This has been accomplished in our work using high-density arrays of epitaxial BiFeO3 (BFO) nanodots with lateral size as small as ~60 nm. We demonstrate various types of spontaneous topological domain structures, including center-convergent domains, center-divergent domains, and double-center domains, which are stable over a sufficiently long time yet can be manipulated and reversibly switched by an electric field. The formation mechanisms of these topological domain states, assisted by the accumulation of compensating charges on the surface, have also been revealed. These results demonstrate that these reversibly switchable topological domain arrays are promising for applications in high-density nanoferroelectric devices such as nonvolatile memories.