Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Beyond Direct Generation: A Decomposed Approach to Well-Crafted Screenwriting with LLMs

The screenplay serves as the foundation for television production, defining narrative structure, character development, and dialogue. While Large Language Models (LLMs) show great potential in creative writing, direct end-to-end generation approaches often fail to produce well-crafted screenplays. We argue this failure stems from forcing a single model to simultaneously master two disparate capabilities: creative narrative construction and rigid format adherence. The resulting outputs may mimic superficial style but lack the deep structural integrity and storytelling substance required for professional use. To enable LLMs to generate high-quality screenplays, we introduce Dual-Stage Refinement (DSR), a decomposed framework that decouples creative narrative generation from format conversion. The first stage transforms a brief outline into rich, novel-style prose. The second stage refines this narrative into a professionally formatted screenplay. This separation enables the model to specialize in one distinct capability at each stage. A key challenge in implementing DSR is the scarcity of paired outline-to-novel training data. We address this through hybrid data synthesis: reverse synthesis deconstructs existing screenplays into structured inputs, while forward synthesis leverages these inputs to generate high-quality narrative texts as training targets. Blind evaluations by professional screenwriters show that DSR achieves a 75% win rate against strong baselines like Gemini-2.5-Pro and reaches 82.7% of human-level performance. Our work demonstrates that decomposed generation architecture with tailored data synthesis effectively specializes LLMs in complex creative domains.

preprint2026arXiv

CAX-Agent: A Lightweight Agent Harness for Reliable APDL Automation

Large language models deployed for MAPDL finite-element simulation face practical reliability challenges: without structured execution control, tool encapsulation, and fault recovery, outputs may be inconsistent and task failures are common. The Agent Harness paradigm addresses this by inserting domain-specific orchestration middleware that manages tool lifecycles, workflow state, and recovery escalation. This paper presents the architecture of CAX-Agent, a lightweight agent harness purpose-built for MAPDL automation, and empirically evaluates one of its core components -- the recovery policy.CAX-Agent organizes execution into three layers -- LLM service, agent harness, and solver backend -- with a recovery ladder that escalates from deterministic rule patching through model-driven regeneration to context enrichment and human intervention. We evaluate three recovery strategies (no_recovery, rule_only, and model_only) on 50 standard structural benchmarks with three repeated runs per strategy (450 case-runs total). Two independent human raters score task completion under blind conditions; inter-rater agreement is strong (quadratic weighted Cohen's kappa = 0.84, 96 percent of score pairs within one point). Model_only achieves the best completion rate (0.9267), task score (3.59/4), total score (9.16/10), and zero-intervention rate (0.84), outperforming rule_only (0.7733, 3.17/4, 7.03/10, 0.00) and no_recovery (0.6933, 2.74/4, 5.60/10, 0.00) with large effect sizes (Cliff's delta = 0.81-0.87). The benchmark uses deliberately simple geometries to isolate recovery-policy effects; we discuss the scope of these findings and directions for broader validation.

preprint2026arXiv

Rewarding Creativity: A Human-Aligned Generative Reward Model for Reinforcement Learning in Storytelling

While Large Language Models (LLMs) can generate fluent text, producing high-quality creative stories remains challenging. Reinforcement Learning (RL) offers a promising solution but faces two critical obstacles: designing reliable reward signals for subjective storytelling quality and mitigating training instability. This paper introduces the Reinforcement Learning for Creative Storytelling (RLCS) framework to systematically address both challenges. First, we develop a Generative Reward Model (GenRM) that provides multi-dimensional analysis and explicit reasoning about story preferences, trained through supervised fine-tuning on demonstrations with reasoning chains distilled from strong teacher models, followed by GRPO-based refinement on expanded preference data. Second, we introduce an entropy-based reward shaping strategy that dynamically prioritizes learning on confident errors and uncertain correct predictions, preventing overfitting on already-mastered patterns. Experiments demonstrate that GenRM achieves 68\% alignment with human creativity judgments, and RLCS significantly outperforms strong baselines including Gemini-2.5-Pro in overall story quality. This work provides a practical pipeline for applying RL to creative domains, effectively navigating the dual challenges of reward modeling and training stability.

preprint2022arXiv

A Survey of ADMM Variants for Distributed Optimization: Problems, Algorithms and Features

By coordinating terminal smart devices or microprocessors to engage in cooperative computation to achieve systemlevel targets, distributed optimization is incrementally favored by both engineering and computer science. The well-known alternating direction method of multipliers (ADMM) has turned out to be one of the most popular tools for distributed optimization due to many advantages, such as modular structure, superior convergence, easy implementation and high flexibility. In the past decade, ADMM has experienced widespread developments. The developments manifest in both handling more general problems and enabling more effective implementation. Specifically, the method has been generalized to broad classes of problems (i.e.,multi-block, coupled objective, nonconvex, etc.). Besides, it has been extensively reinforced for more effective implementation, such as improved convergence rate, easier subproblems, higher computation efficiency, flexible communication, compatible with inaccurate information, robust to communication delays, etc. These developments lead to a plentiful of ADMM variants to be celebrated by broad areas ranging from smart grids, smart buildings, wireless communications, machine learning and beyond. However, there lacks a survey to document those developments and discern the results. To achieve such a goal, this paper provides a comprehensive survey on ADMM variants. Particularly, we discern the five major classes of problems that have been mostly concerned and discuss the related ADMM variants in terms of main ideas, main assumptions, convergence behaviors and main features. In addition, we figure out several important future research directions to be addressed. This survey is expected to work as a tutorial for both developing distributed optimization in broad areas and identifying existing theoretical research gaps.

preprint2021arXiv

The TESS Faint Star Search: 1,617 TOIs from the TESS Primary Mission

We present the detection of 1,617 new transiting planet candidates, identified in the Transiting Exoplanet Survey Satellite (TESS) full-frame images (FFIs) observed during the Primary Mission (Sectors 1 - 26). These candidates were initially detected by the Quick-Look Pipeline (QLP), which extracts FFI lightcurves for and searches all stars brighter than TESS magnitude T = 13.5 mag in each sector. However, QLP heavily relies on manual inspection for the identification of planet candidates, limiting vetting efforts to planet-hosting stars brighter than T = 10.5 mag and leaving millions of potential transit signals un-vetted. We describe an independent vetting pipeline applied to QLP transit search results, incorporating both automated vetting tests and manual inspection to identify promising planet candidates around these fainter stars. The new candidates discovered by this ongoing project will allow TESS to significantly improve the statistical power of demographics studies of giant, close-in exoplanets.

preprint2020arXiv

A Giant Planet Candidate Transiting a White Dwarf

Astronomers have discovered thousands of planets outside the solar system, most of which orbit stars that will eventually evolve into red giants and then into white dwarfs. During the red giant phase, any close-orbiting planets will be engulfed by the star, but more distant planets can survive this phase and remain in orbit around the white dwarf. Some white dwarfs show evidence for rocky material floating in their atmospheres, in warm debris disks, or orbiting very closely, which has been interpreted as the debris of rocky planets that were scattered inward and tidally disrupted. Recently, the discovery of a gaseous debris disk with a composition similar to ice giant planets demonstrated that massive planets might also find their way into tight orbits around white dwarfs, but it is unclear whether the planets can survive the journey. So far, the detection of intact planets in close orbits around white dwarfs has remained elusive. Here, we report the discovery of a giant planet candidate transiting the white dwarf WD 1856+534 (TIC 267574918) every 1.4 days. The planet candidate is roughly the same size as Jupiter and is no more than 14 times as massive (with 95% confidence). Other cases of white dwarfs with close brown dwarf or stellar companions are explained as the consequence of common-envelope evolution, wherein the original orbit is enveloped during the red-giant phase and shrinks due to friction. In this case, though, the low mass and relatively long orbital period of the planet candidate make common-envelope evolution less likely. Instead, the WD 1856+534 system seems to demonstrate that giant planets can be scattered into tight orbits without being tidally disrupted, and motivates searches for smaller transiting planets around white dwarfs.

preprint2020arXiv

A Node Embedding Framework for Integration of Similarity-based Drug Combination Prediction

Motivation: Drug combination is a sensible strategy for disease treatment by improving the efficacy and reducing concomitant side effects. Due to the large number of possible combinations among candidate compounds, exhaustive screening is prohibitive. Currently, a plenty of studies have focused on predicting potential drug combinations. However, these methods are not entirely satisfactory in performance and scalability. Results: In this paper, we proposed a Network Embedding framework in Multiplex Networks (NEMN) to predict synthetic drug combinations. Based on a multiplex drug similarity network, we offered alternative methods to integrate useful information from different aspects and to decide quantitative importance of each network. To explain the feasibility of NEMN, we applied our framework to the data of drug-drug interactions, on which it showed better performance in terms of AUPR and ROC. For Drug combination prediction, we found seven novel drug combinations which have been validated by external sources among the top-ranked predictions of our model.

preprint2020arXiv

Identification and Validation of the SNV Biomarkers Based on Multi-Dimensional Patterns

Background: Single nucleotide variants (SNVs) are detected as different distributions of DNA samples of distinct types of cancer patients. Even though, it is an exacting task to select the appropriate method to identify cancer to the greatest extent of SNVs. Results: In this paper, we proposed a biomarker concept based on SNV patterns in different feature dimensions. Raw dataset (2761 samples) consisting of twelve different cancers was obtained from TCGA (The Cancer Genome Atlas). After preliminary screening of 562,321 DNA mutation sites in the samples, the mutation sites were extracted and characterized by cancer types in six different SNV feature dimensions. In this study, we found that the extracted features showed similar distribution in the cluster center of the disease type of the samples. After the initial processing of the raw data, the sample was more focused on the subtype distribution of the cancer or the cancer at the SNV level. We used k-nearest neighbors (KNN) to classify the extracted features and Leave-One-Out cross verified them. The accuracy of classifying is stable at around 97% and reached 97.43% at the highest. During the validation phase, we found validated oncogenes in the loci of the features with the highest importance among nine cancers. Conclusions: In summary, the samples showed consistent patterns according to the cancer in which it belongs. It is feasible to classify the cancer of the sample by the distribution of different dimensions of the SNVs and has a high accuracy. And has potential implications for the discovery of cancer-causing genes.

preprint2020arXiv

Multi-Agent Deep Reinforcement Learning for HVAC Control in Commercial Buildings

In commercial buildings, about 40%-50% of the total electricity consumption is attributed to Heating, Ventilation, and Air Conditioning (HVAC) systems, which places an economic burden on building operators. In this paper, we intend to minimize the energy cost of an HVAC system in a multi-zone commercial building under dynamic pricing with the consideration of random zone occupancy, thermal comfort, and indoor air quality comfort. Due to the existence of unknown thermal dynamics models, parameter uncertainties (e.g., outdoor temperature, electricity price, and number of occupants), spatially and temporally coupled constraints associated with indoor temperature and CO2 concentration, a large discrete solution space, and a non-convex and non-separable objective function, it is very challenging to achieve the above aim. To this end, the above energy cost minimization problem is reformulated as a Markov game. Then, an HVAC control algorithm is proposed to solve the Markov game based on multi-agent deep reinforcement learning with attention mechanism. The proposed algorithm does not require any prior knowledge of uncertain parameters and can operate without knowing building thermal dynamics models. Simulation results based on real-world traces show the effectiveness, robustness and scalability of the proposed algorithm.

preprint2020arXiv

TOI 694 b and TIC 220568520 b: Two Low-Mass Companions Near the Hydrogen Burning Mass Limit Orbiting Sun-like Stars

We report the discovery of TOI 694 b and TIC 220568520 b, two low-mass stellar companions in eccentric orbits around metal-rich Sun-like stars, first detected by the Transiting Exoplanet Survey Satellite (TESS). TOI 694 b has an orbital period of 48.05131$\pm$0.00019 days and eccentricity of 0.51946$\pm$0.00081, and we derive a mass of 89.0$\pm$5.3 $M_J$ (0.0849$\pm$0.0051 $M_\odot$) and radius of 1.111$\pm$0.017 $R_J$ (0.1142$\pm$0.0017 $R_\odot$). TIC 220568520 b has an orbital period of 18.55769$\pm$0.00039 days and eccentricity of 0.0964$\pm$0.0032, and we derive a mass of 107.2$\pm$5.2 $M_J$ (0.1023$\pm$0.0050 $M_\odot$) and radius of 1.248$\pm$0.018 $R_J$ (0.1282$\pm$0.0019 $R_\odot$). Both binary companions lie close to and above the Hydrogen burning mass threshold that separates brown dwarfs and the lowest mass stars, with TOI 694 b being 2-$σ$ above the canonical mass threshold of 0.075 $M_\odot$. The relatively long periods of the systems mean that the magnetic fields of the low-mass companions are not expected to inhibit convection and inflate the radius, which according to one leading theory is common in similar objects residing in short-period tidally-synchronized binary systems. Indeed we do not find radius inflation for these two objects when compared to theoretical isochrones. These two new objects add to the short but growing list of low-mass stars with well-measured masses and radii, and highlight the potential of the TESS mission for detecting such rare objects orbiting bright stars.