Source author record

Tung Nguyen

Tung Nguyen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Human-Computer Interaction Biomolecules Computation and Language Computer Vision cond-mat.soft eess.SY Information Retrieval math.CO math.DS math.FA math.OC Neural and Evolutionary Computing Software Engineering Systems and Control

Catalog footprint

What is connected

9works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SRA: Span Representation Alignment for Large Language Model Distillation

Cross-Tokenizer Knowledge Distillation (CTKD) enables knowledge transfer between a large language model and a smaller student, even when they employ different tokenizers. While existing approaches mainly focus on token-level alignment strategies, which are often brittle and sensitive to discrepancies between tokenizers, we argue that the method of aggregating tokens into more robust representations before distillation is of equal importance. In this paper, we introduce \textbf{SRA} (\textbf{S}pan \textbf{R}epresentation \textbf{A}lignment for Large Language Model Distillation), a novel framework that reframes CTKD through the physical lens of Multi-Particle Dynamical Systems. SRA shifts the fundamental unit of alignment from tokens to robust, tokenizer-agnostic spans. We model each span as a cluster of particles and represent its state by its Center of Mass (CoM) - an attention-weighted average that captures rich semantic information. We leverage the concept of span centers of mass with attention-derived weighting to prioritize the most salient spans. In addition, we employ a geometric regularizer to preserve the structural integrity of the representation space and introduce aligned span logit distillation to enhance knowledge transfer across models. In challenging cross-architecture distillation experiments, SRA consistently and significantly outperforms state-of-the-art CTKD baselines, validating our physically-grounded approach.

preprint2023arXiv

Induced paths in graphs without anticomplete cycles

Let us say a graph is $s\mathcal{O}$-free, where $s\ge 1$ is an integer, if there do not exist $s$ cycles of the graph that are pairwise vertex-disjoint and have no edges joining them. The structure of such graphs, even when $s=2$, is not well understood. For instance, until now we did not know how to test whether a graph is $2\mathcal{O}$-free in polynomial time; and there was an open conjecture, due to Ngoc Khang Le, that $2\mathcal{O}$-free graphs have only a polynomial number of induced paths. In this paper we prove Le's conjecture; indeed, we will show that for all $s\ge 1$, there exists $c>0$ such that every $s\mathcal{O}$-free graph $G$ has at most $|G|^c$ induced paths. This provides a poly-time algorithm to test if a graph is $s\mathcal{O}$-free, for all fixed $s$. The proof has three parts. First, there is a short and beautiful proof, due to Le, that reduces the question to proving the same thing for graphs with no cycles of length four. Second, there is a recent result of Bonamy, Bonnet, Déprés, Esperet, Geniet, Hilaire, Thomassé and Wesolek, that in every $s\mathcal{O}$-free graph $G$ with no cycle of length four, there is a set of vertices that intersects every cycle, with size logarithmic in $|G|$. And third, there is an argument that uses the result of Bonamy et al. to deduce the theorem. The last is the main content of this paper.

preprint2023arXiv

Machine Learning Approach to Polymerization Reaction Engineering: Determining Monomers Reactivity Ratios

Here, we demonstrate how machine learning enables the prediction of comonomers reactivity ratios based on the molecular structure of monomers. We combined multi-task learning, multi-inputs, and Graph Attention Network to build a model capable of predicting reactivity ratios based on the monomers chemical structures.

preprint2022arXiv

A Simple and Scalable Tensor Completion Algorithm via Latent Invariant Constraint for Recommendation System

In this paper we provide a latent-variable formulation and solution to the recommender system (RS) problem in terms of a fundamental property that any reasonable solution should be expected to satisfy. Specifically, we examine a novel tensor completion method to efficiently and accurately learn parameters of a model for the unobservable personal preferences that underly user ratings. By regularizing the tensor decomposition with a single latent invariant, we achieve three properties for a reliable recommender system: (1) uniqueness of the tensor completion result with minimal assumptions, (2) unit consistency that is independent of arbitrary preferences of users, and (3) a consensus ordering guarantee that provides consistent ranking between observed and unobserved rating scores. Our algorithm leads to a simple and elegant recommendation framework that has linear computational complexity and with no hyperparameter tuning. We provide empirical results demonstrating that the approach significantly outperforms current state-of-the-art methods.

preprint2020arXiv

Predictive Coding for Locally-Linear Control

High-dimensional observations and unknown dynamics are major challenges when applying optimal control to many real-world decision making tasks. The Learning Controllable Embedding (LCE) framework addresses these challenges by embedding the observations into a lower dimensional latent space, estimating the latent dynamics, and then performing control directly in the latent space. To ensure the learned latent dynamics are predictive of next-observations, all existing LCE approaches decode back into the observation space and explicitly perform next-observation prediction---a challenging high-dimensional task that furthermore introduces a large number of nuisance parameters (i.e., the decoder) which are discarded during control. In this paper, we propose a novel information-theoretic LCE approach and show theoretically that explicit next-observation prediction can be replaced with predictive coding. We then use predictive coding to develop a decoder-free LCE model whose latent dynamics are amenable to locally-linear control. Extensive experiments on benchmark tasks show that our model reliably learns a controllable latent space that leads to superior performance when compared with state-of-the-art LCE baselines.

preprint2016arXiv

Coexistence and Extinction in Time-Periodic Volterra-Lotka Type Systems with Nonlocal Dispersal

This paper deals with coexistence and extinction of time periodic Volterra-Lotka type competing systems with nonlocal dispersal. Such issues have already been studied for time independent systems with nonlocal dispersal and time periodic systems with random dispersal, but have not been studied yet for time periodic systems with nonlocal dispersal. In this paper, the relations between the coefficients representing Malthusian growths, self regulations and competitions of the two species have been obtained which ensure coexistence and extinction for the time periodic Volterra-Lotka type system with nonlocal dispersal. The underlying environment of the Volterra-Lotka type system under consideration has either hostile surroundings, or non-flux boundary, or is spatially periodic.

preprint2016arXiv

Image Colorization Using a Deep Convolutional Neural Network

In this paper, we present a novel approach that uses deep learning techniques for colorizing grayscale images. By utilizing a pre-trained convolutional neural network, which is originally designed for image classification, we are able to separate content and style of different images and recombine them into a single image. We then propose a method that can add colors to a grayscale image by combining its content with style of a color image having semantic similarity with the grayscale one. As an application, to our knowledge the first of its kind, we use the proposed method to colorize images of ukiyo-e a genre of Japanese painting?and obtain interesting results, showing the potential of this method in the growing field of computer assisted art.

preprint2016arXiv

Toward Mining Visual Log of Software

In this paper, we define visual log of a software system as data capturing the interactions between its users and its graphic user interface (GUI), such as screen-shots and screen recordings. We vision that mining such visual log could be useful for bug reproducing and debugging, automated GUI testing, user interface designing, question answering of common usages in software support, etc. Toward that vision, we propose a core framework for mining visual log of software. This framework focuses on detecting GUI elements and changes in visual log, removing users' private data, recognizing user interactions with GUI elements, and learning GUI usage patterns. We also performed a small study on the characteristics of GUI elements in mobile apps. The findings from this study suggested several heuristics to design techniques for recognizing GUI elements and interactions.

preprint2015arXiv

A Note on the Daubechies Approach in the Construction of Spline Type Orthogonal Scaling Functions

We use Lorentz polynomials to present the solutions explicitly of equations (6.1.7) of [I. Daubechies, Ten lectures on wavelets, CBMS-NSF Regional Conference Series in Applied Mathematics, 61. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 1992] and (4.9) of [I. Daubechies, Orthonormal bases of compactly supported wavelets. Comm. Pure Appl. Math. 41 (1988), no. 7, 909--996] sot that we give an efficient way to prove Daubechies' results on the existence of spline type orthogonal scaling functions and to evaluate Daubechies scaling functions.

Tung Nguyen

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

SRA: Span Representation Alignment for Large Language Model Distillation

Induced paths in graphs without anticomplete cycles

Machine Learning Approach to Polymerization Reaction Engineering: Determining Monomers Reactivity Ratios

A Simple and Scalable Tensor Completion Algorithm via Latent Invariant Constraint for Recommendation System

Predictive Coding for Locally-Linear Control

Coexistence and Extinction in Time-Periodic Volterra-Lotka Type Systems with Nonlocal Dispersal

Image Colorization Using a Deep Convolutional Neural Network

Toward Mining Visual Log of Software

A Note on the Daubechies Approach in the Construction of Spline Type Orthogonal Scaling Functions