Researcher profile

Bin Feng

Bin Feng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular Chemical Operations

While large language models (LLMs) with Chain-of-Thought (CoT) reasoning excel in mathematics and coding, their potential for systematic reasoning in chemistry, a domain demanding rigorous structural analysis for real-world tasks like drug design and reaction engineering, remains untapped. Current benchmarks focus on simple knowledge retrieval, neglecting step-by-step reasoning required for complex tasks such as molecular optimization and reaction prediction. To address this, we introduce ChemCoTBench, a reasoning framework that bridges molecular structure understanding with arithmetic-inspired operations, including addition, deletion, and substitution, to formalize chemical problem-solving into transparent, step-by-step workflows. By treating molecular transformations as modular "chemical operations", the framework enables slow-thinking reasoning, mirroring the logic of mathematical proofs while grounding solutions in real-world chemical constraints. We evaluate models on two high-impact tasks: Molecular Property Optimization and Chemical Reaction Prediction. These tasks mirror real-world challenges while providing structured evaluability. By providing annotated datasets, a reasoning taxonomy, and baseline evaluations, ChemCoTBench bridges the gap between abstract reasoning methods and practical chemical discovery, establishing a foundation for advancing LLMs as tools for AI-driven scientific innovation.

preprint2026arXiv

SciCustom: A Framework for Custom Evaluation of Scientific Capabilities in Large Language Models

Large language models (LLMs) are increasingly applied to scientific research, yet existing evaluations often fail to reflect the fine-grained capabilities required in practice. Most benchmarks are manually curated or domain-generic, limiting scalability and alignment with real scientific use cases. In this paper, we propose a new framework named SciCustom to address the problem. It enables the custom construction of benchmarks from large-scale scientific data to evaluate application-specific scientific capabilities in LLMs. SciCustom first organizes scientific knowledge into ontology-grounded knowledge units with controlled granularity and trains a tagger to map large-scale data instances into this knowledge space. Given a custom requirement, relevant knowledge units are identified via voting-based multi-model consensus. These units enable relevance-aware benchmark retrieval via binary search, followed by proxy subset selection and data-grounded benchmark generation for efficient evaluation. Experiments in chemistry and healthcare demonstrate that SciCustom reveals fine-grained differences in LLM scientific capabilities that standard benchmarks overlook, while requiring neither expert annotation nor synthetic question generation. This work provides a scalable and application-aware foundation for benchmarking scientific capabilities in LLMs. The source code is available at https://github.com/yjwtheonly/SciCustom.

preprint2020arXiv

Deep multi-metric learning for text-independent speaker verification

Text-independent speaker verification is an important artificial intelligence problem that has a wide spectrum of applications, such as criminal investigation, payment certification, and interest-based customer services. The purpose of text-independent speaker verification is to determine whether two given uncontrolled utterances originate from the same speaker or not. Extracting speech features for each speaker using deep neural networks is a promising direction to explore and a straightforward solution is to train the discriminative feature extraction network by using a metric learning loss function. However, a single loss function often has certain limitations. Thus, we use deep multi-metric learning to address the problem and introduce three different losses for this problem, i.e., triplet loss, n-pair loss and angular loss. The three loss functions work in a cooperative way to train a feature extraction network equipped with Residual connections and squeeze-and-excitation attention. We conduct experiments on the large-scale \texttt{VoxCeleb2} dataset, which contains over a million utterances from over $6,000$ speakers, and the proposed deep neural network obtains an equal error rate of $3.48\%$, which is a very competitive result. Codes for both training and testing and pretrained models are available at \url{https://github.com/GreatJiweix/DmmlTiSV}, which is the first publicly available code repository for large-scale text-independent speaker verification with performance on par with the state-of-the-art systems.

preprint2020arXiv

Maximum Entropy Regularization and Chinese Text Recognition

Chinese text recognition is more challenging than Latin text due to the large amount of fine-grained Chinese characters and the great imbalance over classes, which causes a serious overfitting problem. We propose to apply Maximum Entropy Regularization to regularize the training process, which is to simply add a negative entropy term to the canonical cross-entropy loss without any additional parameters and modification of a model. We theoretically give the convergence probability distribution and analyze how the regularization influence the learning process. Experiments on Chinese character recognition, Chinese text line recognition and fine-grained image classification achieve consistent improvement, proving that the regularization is beneficial to generalization and robustness of a recognition model.

preprint2019arXiv

High electrical conducting deep-ultraviolet-transparent oxide semiconductor La-doped SrSnO3 exceeding ~3000 S cm-1

La-doped SrSnO3 (LSSO) is known as one of deep-ultraviolet (DUV)-transparent conducting oxides with an energy bandgap of ~4.6 eV. Since LSSO can be grown heteroepitaxially on more wide bandgap substrates such as MgO (Eg ~7.8 eV), LSSO is considered to be a good candidate as a DUV-transparent electrode. However, the electrical conductivity of LSSO films are below 1000 S cm^-1, most likely due to the low solubility of La ion in the LSSO lattice. Here we report that high electrically conducting (>3000 S cm^-1) LSSO thin films with an energy bandgap of ~4.6 eV can be fabricated by pulsed laser deposition on MgO substrate followed by a simple annealing in vacuum. From the X-ray diffraction and the scanning transmission electron microscopy analyses, we found that lateral grain growth occurred during the annealing, which improved the activation rate of La ion, leading to a significant improvement of carrier concentration (3.26 x 10^20 cm^-3) and Hall mobility (55.8 cm^2 V^-1 s^-1). The present DUV-transparent oxide semiconductor would be useful as a transparent electrode for developing optoelectronic devices, which transmit and/or emit DUV-light.

preprint2018arXiv

Buffer layer-less fabrication of high-mobility transparent oxide semiconductor, La-doped BaSnO3

Transparent oxide semiconductors (TOSs) showing both high visible transparency and high electron mobility have attracted great attention towards the realization of advanced optoelectronic devices. La-doped BaSnO3 (LBSO) is one of the most promising TOSs because its single crystal exhibits a high electron mobility. However, in the LBSO films, it is very hard to obtain high mobility due to the threading dislocations, which are originated from the lattice mismatch between the film and the substrate. Therefore, many researchers have tried to improve the mobility by inserting a buffer layer. While the buffer layers increased the electron mobilities, this approach leaves much to be desired since it involves a two-step film fabrication process and the enhanced mobility values are still significantly lower than single crystal values. We show herein that the electron mobility of LBSO films can be improved without inserting any buffer layers if the films are grown under highly oxidative ozone (O3) atmospheres. The O3 environments relaxed the LBSO lattice and reduced the formation of Sn2+ states, which are known to suppress the electron mobility in LBSO. The resultant O3-LBSO films showed improved mobility values up to 115 cm2 V-1 s-1, which is among the highest in LBSO films on SrTiO3 substrates and comparable to LBSO films with buffer layers.