Source author record

Dan Song

Dan Song appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computer Vision hep-ph nucl-th Computation and Language eess.IV Information Theory Machine Learning math.IT Multimedia

Catalog footprint

What is connected

8works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

EditEmoTalk: Controllable Speech-Driven 3D Facial Animation with Continuous Expression Editing

Speech-driven 3D facial animation aims to generate realistic and expressive facial motions directly from audio. While recent methods achieve high-quality lip synchronization, they often rely on discrete emotion categories, limiting continuous and fine-grained emotional control. We present EditEmoTalk, a controllable speech-driven 3D facial animation framework with continuous emotion editing. The key idea is a boundary-aware semantic embedding that learns the normal directions of inter-emotion decision boundaries, enabling a continuous expression manifold for smooth emotion manipulation. Moreover, we introduce an emotional consistency loss that enforces semantic alignment between the generated motion dynamics and the target emotion embedding through a mapping network, ensuring faithful emotional expression. Extensive experiments demonstrate that EditEmoTalk achieves superior controllability, expressiveness, and generalization while maintaining accurate lip synchronization. Code and pretrained models will be released.

preprint2026arXiv

Empirical Comparison of Encoder-Based Language Models and Feature-Based Supervised Machine Learning Approaches to Automated Scoring of Long Essays

Long context may impose challenges for encoder-only language models in text processing, specifically for automated scoring of essays. This study trained several commonly used encoder-based language models for automated scoring of long essays. The performance of these trained models was evaluated and compared with the ensemble models built upon the base language models with a token limit of 512?. The experimented models include BERT-based models (BERT, RoBERTa, DistilBERT, and DeBERTa), ensemble models integrating embeddings from multiple encoder models, and ensemble models of feature-based supervised machine learning models, including Gradient-Boosted Decision Trees, eXtreme Gradient Boosting, and Light Gradient Boosting Machine. We trained, validated, and tested each model on a dataset of 17,307 essays, with an 80%/10%/10% split, and evaluated model performance using Quadratic Weighted Kappa. This study revealed that an ensemble-of-embeddings model that combines multiple pre-trained language model representations with gradient-boosting classifier as the ensemble model significantly outperforms individual language models at scoring long essays.

preprint2023arXiv

Possible $Λ_c\barΛ_c$ molecular states and their productions in nulceon-antinulceon collision

In this work, a study of possible molecular states from the $Λ_c\barΛ_c$ interaction and their productions in nucleon-antinucleon collision is performed in a quasipotential Bethe-Salpeter equation approach. Two bound states with quantum numbers $J^{PC}=0^{-+}$ and $1^{--}$ are produced with almost the same binding energy from the $Λ_c\barΛ_c$ interaction which is described by the light meson exchanges. However, the result does not support the assignment of experimentally observed $Y(4630)$ as a $Λ_c\barΛ_c$ molecular state because it is hard to obtain a peak near experimental mass of the $Y(4630)$ which is far above the $Λ_c\barΛ_c$ threshold. The possibility to search these states in nucleon-antinucleon collision is studied by including couplings to $N\bar{N}$ and $D^{(*)}\bar{D}^{(*)}$ channels. The peaks can be found obviously near the $Λ_c\barΛ_c$ threshold in the $D^*\bar{D}^*$ channel at an order of amplitude of 10 $μ$b. Too small width of state with $0^{-+}$ may lead to the difficulty to be observed in experiment. Based on the results in the current work, search for the $Λ_c\barΛ_c$ molecular state with $1^{--}$ is suggested in process $N\bar{N}\to D^*\bar{D}^*$, which is accessible at $\rm \bar{P}ANDA$.

preprint2023arXiv

Possible molecular states from interactions of charmed baryons

In this work, we perform a systematic study of possible molecular states composed of two charmed baryons including hidden-charm systems $Λ_c\barΛ_c$, $Σ_c^{(*)}\barΣ_c^{(*)}$, and $Λ_c\barΣ_c^{(*)}$, and corresponding double-charm systems $Λ_cΛ_c$, $Σ_c^{(*)}Σ_c^{(*)}$, and $Λ_cΣ_c^{(*)}$. With the help of the heavy quark chiral effective Lagrangians, the interactions are described with $π$, $ρ$, $η$, $ω$, $ϕ$, and $σ$ exchanges. The potential kernels are constructed, and inserted into the quasipotential Bethe-Salpeter equation. The bound states from the interactions considered is studied by searching for the poles of the scattering amplitude. The results suggest that strong attractions exist in both hidden-charm and double-charm systems considered in the current work, and bound states can be produced in most of the systems. More experiment studies about these molecular states are suggested though the nucleon-nucleon collison at LHC and nucleon-antinucleon collison at $\rm \bar{P}ANDA$.

preprint2022arXiv

CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography Images

3D tooth segmentation is a prerequisite for computer-aided dental diagnosis and treatment. However, segmenting all tooth regions manually is subjective and time-consuming. Recently, deep learning-based segmentation methods produce convincing results and reduce manual annotation efforts, but it requires a large quantity of ground truth for training. To our knowledge, there are few tooth data available for the 3D segmentation study. In this paper, we establish a fully annotated cone beam computed tomography dataset CTooth with tooth gold standard. This dataset contains 22 volumes (7363 slices) with fine tooth labels annotated by experienced radiographic interpreters. To ensure a relative even data sampling distribution, data variance is included in the CTooth including missing teeth and dental restoration. Several state-of-the-art segmentation methods are evaluated on this dataset. Afterwards, we further summarise and apply a series of 3D attention-based Unet variants for segmenting tooth volumes. This work provides a new benchmark for the tooth volume segmentation task. Experimental evidence proves that attention modules of the 3D UNet structure boost responses in tooth areas and inhibit the influence of background and noise. The best performance is achieved by 3D Unet with SKNet attention module, of 88.04 \% Dice and 78.71 \% IOU, respectively. The attention-based Unet framework outperforms other state-of-the-art methods on the CTooth dataset. The codebase and dataset are released.

preprint2022arXiv

CTooth+: A Large-scale Dental Cone Beam Computed Tomography Dataset and Benchmark for Tooth Volume Segmentation

Accurate tooth volume segmentation is a prerequisite for computer-aided dental analysis. Deep learning-based tooth segmentation methods have achieved satisfying performances but require a large quantity of tooth data with ground truth. The dental data publicly available is limited meaning the existing methods can not be reproduced, evaluated and applied in clinical practice. In this paper, we establish a 3D dental CBCT dataset CTooth+, with 22 fully annotated volumes and 146 unlabeled volumes. We further evaluate several state-of-the-art tooth volume segmentation strategies based on fully-supervised learning, semi-supervised learning and active learning, and define the performance principles. This work provides a new benchmark for the tooth volume segmentation task, and the experiment can serve as the baseline for future AI-based dental imaging research and clinical application development.

preprint2022arXiv

Efficiently Computable Converses for Finite-Blocklength Communication

This paper presents a method for computing a finite-blocklength converse for the rate of fixed-length codes with feedback used on discrete memoryless channels (DMCs). The new converse is expressed in terms of a stochastic control problem whose solution can be efficiently computed using dynamic programming and Fourier methods. For channels such as the binary symmetric channel (BSC) and binary erasure channel (BEC), the accuracy of the proposed converse is similar to that of existing special-purpose converse bounds, but the new converse technique can be applied to arbitrary DMCs. We provide example applications of the new converse technique to the binary asymmetric channel (BAC) and the quantized amplitude-constrained AWGN channel.

preprint2014arXiv

Automated Generation of Geometric Theorems from Images of Diagrams

We propose an approach to generate geometric theorems from electronic images of diagrams automatically. The approach makes use of techniques of Hough transform to recognize geometric objects and their labels and of numeric verification to mine basic geometric relations. Candidate propositions are generated from the retrieved information by using six strategies and geometric theorems are obtained from the candidates via algebraic computation. Experiments with a preliminary implementation illustrate the effectiveness and efficiency of the proposed approach for generating nontrivial theorems from images of diagrams. This work demonstrates the feasibility of automated discovery of profound geometric knowledge from simple image data and has potential applications in geometric knowledge management and education.

Dan Song

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

EditEmoTalk: Controllable Speech-Driven 3D Facial Animation with Continuous Expression Editing

Empirical Comparison of Encoder-Based Language Models and Feature-Based Supervised Machine Learning Approaches to Automated Scoring of Long Essays

Possible $Λ_c\barΛ_c$ molecular states and their productions in nulceon-antinulceon collision

Possible molecular states from interactions of charmed baryons

CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography Images

CTooth+: A Large-scale Dental Cone Beam Computed Tomography Dataset and Benchmark for Tooth Volume Segmentation

Efficiently Computable Converses for Finite-Blocklength Communication

Automated Generation of Geometric Theorems from Images of Diagrams