Source author record

Khanh Nguyen

Khanh Nguyen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Computation and Language Computer Vision eess.IV Human-Computer Interaction Artificial Intelligence Biomolecules cond-mat.soft Information Retrieval math.AP math.FA math.MG Neurons and Cognition Robotics

Catalog footprint

What is connected

12works

14topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Language Models are Bounded Pragmatic Speakers: Understanding RLHF from a Bayesian Cognitive Modeling Perspective

How do language models "think"? This paper formulates a probabilistic cognitive model called the bounded pragmatic speaker, which can characterize the operation of different variations of language models. Specifically, we demonstrate that large language models fine-tuned with reinforcement learning from human feedback (Ouyang et al., 2022) embody a model of thought that conceptually resembles a fast-and-slow model (Kahneman, 2011), which psychologists have attributed to humans. We discuss the limitations of reinforcement learning from human feedback as a fast-and-slow model of thought and propose avenues for expanding this framework. In essence, our research highlights the value of adopting a cognitive probabilistic modeling approach to gain insights into the comprehension, evaluation, and advancement of language models.

preprint2022arXiv

A Framework for Learning to Request Rich and Contextually Useful Information from Humans

When deployed, AI agents will encounter problems that are beyond their autonomous problem-solving capabilities. Leveraging human assistance can help agents overcome their inherent limitations and robustly cope with unfamiliar situations. We present a general interactive framework that enables an agent to request and interpret rich, contextually useful information from an assistant that has knowledge about the task and the environment. We demonstrate the practicality of our framework on a simulated human-assisted navigation problem. Aided with an assistance-requesting policy learned by our method, a navigation agent achieves up to a 7x improvement in success rate on tasks that take place in previously unseen environments, compared to fully autonomous behavior. We show that the agent can take advantage of different types of information depending on the context, and analyze the benefits and challenges of learning the assistance-requesting policy when the assistant can recursively decompose tasks into subtasks.

preprint2022arXiv

AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching

This paper tackles the challenge of forensic medical image matching (FMIM) using deep neural networks (DNNs). FMIM is a particular case of content-based image retrieval (CBIR). The main challenge in FMIM compared to the general case of CBIR, is that the subject to whom a query image belongs may be affected by aging and progressive degenerative disorders, making it difficult to match data on a subject level. CBIR with DNNs is generally solved by minimizing a ranking loss, such as Triplet loss (TL), computed on image representations extracted by a DNN from the original data. TL, in particular, operates on triplets: anchor, positive (similar to anchor) and negative (dissimilar to anchor). Although TL has been shown to perform well in many CBIR tasks, it still has limitations, which we identify and analyze in this work. In this paper, we introduce (i) the AdaTriplet loss -- an extension of TL whose gradients adapt to different difficulty levels of negative samples, and (ii) the AutoMargin method -- a technique to adjust hyperparameters of margin-based losses such as TL and our proposed loss dynamically. Our results are evaluated on two large-scale benchmarks for FMIM based on the Osteoarthritis Initiative and Chest X-ray-14 datasets. The codes allowing replication of this study have been made publicly available at \url{https://github.com/Oulu-IMEDS/AdaTriplet}.

preprint2022arXiv

On Limits at Infinity of Weighted Sobolev Functions

We study necessary and sufficient conditions for a Muckenhoupt weight $w \in L^1_{\mathrm{loc}}(\mathbb R^d)$ that yield almost sure existence of radial, and vertical, limits at infinity for Sobolev functions $u \in W^{1,p}_{\mathrm{loc}}(\mathbb R^d,w)$ with a $p$-integrable gradient $|\nabla u|\in L^p(\mathbb R^d,w)$. The question is shown to subtly depend on the sense in which the limit is taken. First, we fully characterize the existence of radial limits. Second, we give essentially sharp sufficient conditions for the existence of vertical limits. In the specific setting of product and radial weights, we give if and only if statements. These generalize and give new proofs for results of Fefferman and Uspenski\uı.

preprint2022arXiv

Pressure and temperature dependence of fluorescence anisotropy of Green Fluorescent Protein

We have studied the effect of high hydrostatic pressure and temperature on the steady state fluorescence anisotropy of Green Fluorescent Protein (GFP). We find that the fluorescence anisotropy of GFP at a constant temperature decreases with increasing pressure. At atmospheric pressure, anisotropy decreases with increasing temperature but exhibits a maximum with temperature for pressure larger than 20 MPa. The temperature corresponding to the maximum of anisotropy increases with increasing pressure. By taking into account of the rotational correlation time changes of GFP with the pressure-temperature dependent viscosity of the solvent, we argue that viscosity increase with pressure is not a major contributing factor to the decrease in anisotropy with pressure. Our results suggest that the decrease of fluorescence anisotropy with pressure may result from changes in H-bonding environment around the chromophore.

preprint2020arXiv

Active Imitation Learning from Multiple Non-Deterministic Teachers: Formulation, Challenges, and Algorithms

We formulate the problem of learning to imitate multiple, non-deterministic teachers with minimal interaction cost. Rather than learning a specific policy as in standard imitation learning, the goal in this problem is to learn a distribution over a policy space. We first present a general framework that efficiently models and estimates such a distribution by learning continuous representations of the teacher policies. Next, we develop Active Performance-Based Imitation Learning (APIL), an active learning algorithm for reducing the learner-teacher interaction cost in this framework. By making query decisions based on predictions of future progress, our algorithm avoids the pitfalls of traditional uncertainty-based approaches in the face of teacher behavioral uncertainty. Results on both toy and photo-realistic navigation tasks show that APIL significantly reduces the numbers of interactions with teachers without compromising on performance. Moreover, it is robust to various degrees of teacher behavioral uncertainty.

preprint2020arXiv

Global Voices: Crossing Borders in Automatic News Summarization

We construct Global Voices, a multilingual dataset for evaluating cross-lingual summarization methods. We extract social-network descriptions of Global Voices news articles to cheaply collect evaluation data for into-English and from-English summarization in 15 languages. Especially, for the into-English summarization task, we crowd-source a high-quality evaluation dataset based on guidelines that emphasize accuracy, coverage, and understandability. To ensure the quality of this dataset, we collect human ratings to filter out bad summaries, and conduct a survey on humans, which shows that the remaining summaries are preferred over the social-network summaries. We study the effect of translation quality in cross-lingual summarization, comparing a translate-then-summarize approach with several baselines. Our results highlight the limitations of the ROUGE metric that are overlooked in monolingual summarization. Our dataset is available for download at https://forms.gle/gpkJDT6RJWHM1Ztz9 .

preprint2020arXiv

On the network orientational affinity assumption in polymers and the micro-macro connection through the chain stretch

We question the network affinity assumption in modeling chain orientations under polymer deformations, and the use of the stretch measure projected from the right Cauchy-Green deformation tensor (or non-affine micro-stretches derived from that measure) as a basic state variable for the micro-macro transition. These ingredients are standard, taken from the statistical theory of polymers, and used in most micromechanical polymer network and soft tissue models. The affinity assumption imposes a constraint in the network which results in an anisotropic distribution of the orientation of the chains and, hence, in an additional configurational entropy that should be included. This additional entropy would result in an additional stress tensor. But an arguably more natural alternative, in line with the typical assumption for the chain behavior itself and with the disregard of these forces, is to consider that the network may fluctuate unconstrained to adapt to macroscopic deformations. This way, the isotropic statistical distribution of the orientation of the chains is maintained unconstrained during deformation and no additional stress is imposed. Then, we show that this free-fluctuating network assumption is equivalent to consider the stretch projected from the stretch tensor (instead of the right Cauchy-Green deformation tensor) as the state variable for the deformation of the network chains. We show very important differences in predictions using both assumed behaviors, and demonstrate that with the free-fluctuating network assumption, we can obtain accurate predictions for all tests in polymers using just one test curve to calibrate the model. With the same macro-micro-macro approach employing the network affinity assumption, we are capable of capturing accurately only the test used for calibration of the model, but not the overall polymer behavior.

preprint2020arXiv

Pre-processing Image using Brightening, CLAHE and RETINEX

This paper focuses on finding the most optimal pre-processing methods considering three common algorithms for image enhancement: Brightening, CLAHE and Retinex. For the purpose of image training in general, these methods will be combined to find out the most optimal method for image enhancement. We have carried out the research on the different permutation of three methods: Brightening, CLAHE and Retinex. The evaluation is based on Canny Edge detection applied to all processed images. Then the sharpness of objects will be justified by true positive pixels number in comparison between images. After using different number combinations pre-processing functions on images, CLAHE proves to be the most effective in edges improvement, Brightening does not show much effect on the edges enhancement, and the Retinex even reduces the sharpness of images and shows little contribution on images enhancement.

preprint2016arXiv

Imitation Learning with Recurrent Neural Networks

We present a novel view that unifies two frameworks that aim to solve sequential prediction problems: learning to search (L2S) and recurrent neural networks (RNN). We point out equivalences between elements of the two frameworks. By complementing what is missing from one framework comparing to the other, we introduce a more advanced imitation learning framework that, on one hand, augments L2S s notion of search space and, on the other hand, enhances RNNs training procedure to be more robust to compounding errors arising from training on highly correlated examples.

preprint2015arXiv

Posterior calibration and exploratory analysis for natural language processing models

Many models in natural language processing define probabilistic distributions over linguistic structures. We argue that (1) the quality of a model' s posterior distribution can and should be directly evaluated, as to whether probabilities correspond to empirical frequencies, and (2) NLP uncertainty can be projected not only to pipeline components, but also to exploratory data analysis, telling a user when to trust and not trust the NLP analysis. We present a method to analyze calibration, and apply it to compare the miscalibration of several commonly used models. We also contribute a coreference sampling algorithm that can create confidence intervals for a political event extraction task.

preprint2015arXiv

Sensory feedback in a bump attractor model of path integration

The mammalian spatial navigation system makes use of several different sensory information channels. This information is then converted into a neural code that represents the animal's current position in space by engaging place cell, grid cell, and head direction cell networks. In particular, sensory landmark (allothetic) cues can be utilized in concert with an animal's knowledge of its own velocity (idiothetic) cues to generate a more accurate representation of position than (idiothetic) path integration provides on its own (Battaglia et al, 2004). We develop a computational model that merges path integration with information from external sensory cues that provide a reliable representation of spatial position along an annular track. Starting with a continuous bump attractor model, we allow for the possibility of synaptic spatial heterogeneity that would break the translation symmetry of space. We use asymptotic analysis to reduce the bump attractor model to a single scalar equation whose potential represents the impact of heterogeneity. Such heterogeneity causes errors to build up when the network performs path integration, but these errors can be corrected by an external control signal representing the effects of sensory cues. We demonstrate that there is an optimal strength and decay rate of the control signal when cues are placed both periodically and randomly. A similar analysis is performed when errors in path integration arise from dynamic noise fluctuations. Again, there is an optimal strength and decay of discrete control that minimizes the path integration error.

Khanh Nguyen

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

Language Models are Bounded Pragmatic Speakers: Understanding RLHF from a Bayesian Cognitive Modeling Perspective

A Framework for Learning to Request Rich and Contextually Useful Information from Humans

AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching

On Limits at Infinity of Weighted Sobolev Functions

Pressure and temperature dependence of fluorescence anisotropy of Green Fluorescent Protein

Active Imitation Learning from Multiple Non-Deterministic Teachers: Formulation, Challenges, and Algorithms

Global Voices: Crossing Borders in Automatic News Summarization

On the network orientational affinity assumption in polymers and the micro-macro connection through the chain stretch

Pre-processing Image using Brightening, CLAHE and RETINEX

Imitation Learning with Recurrent Neural Networks

Posterior calibration and exploratory analysis for natural language processing models

Sensory feedback in a bump attractor model of path integration