Researcher profile

Khanh Nguyen

Khanh Nguyen contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2024arXiv

Language Models are Bounded Pragmatic Speakers: Understanding RLHF from a Bayesian Cognitive Modeling Perspective

How do language models "think"? This paper formulates a probabilistic cognitive model called the bounded pragmatic speaker, which can characterize the operation of different variations of language models. Specifically, we demonstrate that large language models fine-tuned with reinforcement learning from human feedback (Ouyang et al., 2022) embody a model of thought that conceptually resembles a fast-and-slow model (Kahneman, 2011), which psychologists have attributed to humans. We discuss the limitations of reinforcement learning from human feedback as a fast-and-slow model of thought and propose avenues for expanding this framework. In essence, our research highlights the value of adopting a cognitive probabilistic modeling approach to gain insights into the comprehension, evaluation, and advancement of language models.

preprint2022arXiv

A Framework for Learning to Request Rich and Contextually Useful Information from Humans

When deployed, AI agents will encounter problems that are beyond their autonomous problem-solving capabilities. Leveraging human assistance can help agents overcome their inherent limitations and robustly cope with unfamiliar situations. We present a general interactive framework that enables an agent to request and interpret rich, contextually useful information from an assistant that has knowledge about the task and the environment. We demonstrate the practicality of our framework on a simulated human-assisted navigation problem. Aided with an assistance-requesting policy learned by our method, a navigation agent achieves up to a 7x improvement in success rate on tasks that take place in previously unseen environments, compared to fully autonomous behavior. We show that the agent can take advantage of different types of information depending on the context, and analyze the benefits and challenges of learning the assistance-requesting policy when the assistant can recursively decompose tasks into subtasks.

preprint2022arXiv

AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching

This paper tackles the challenge of forensic medical image matching (FMIM) using deep neural networks (DNNs). FMIM is a particular case of content-based image retrieval (CBIR). The main challenge in FMIM compared to the general case of CBIR, is that the subject to whom a query image belongs may be affected by aging and progressive degenerative disorders, making it difficult to match data on a subject level. CBIR with DNNs is generally solved by minimizing a ranking loss, such as Triplet loss (TL), computed on image representations extracted by a DNN from the original data. TL, in particular, operates on triplets: anchor, positive (similar to anchor) and negative (dissimilar to anchor). Although TL has been shown to perform well in many CBIR tasks, it still has limitations, which we identify and analyze in this work. In this paper, we introduce (i) the AdaTriplet loss -- an extension of TL whose gradients adapt to different difficulty levels of negative samples, and (ii) the AutoMargin method -- a technique to adjust hyperparameters of margin-based losses such as TL and our proposed loss dynamically. Our results are evaluated on two large-scale benchmarks for FMIM based on the Osteoarthritis Initiative and Chest X-ray-14 datasets. The codes allowing replication of this study have been made publicly available at \url{https://github.com/Oulu-IMEDS/AdaTriplet}.

preprint2022arXiv

On Limits at Infinity of Weighted Sobolev Functions

We study necessary and sufficient conditions for a Muckenhoupt weight $w \in L^1_{\mathrm{loc}}(\mathbb R^d)$ that yield almost sure existence of radial, and vertical, limits at infinity for Sobolev functions $u \in W^{1,p}_{\mathrm{loc}}(\mathbb R^d,w)$ with a $p$-integrable gradient $|\nabla u|\in L^p(\mathbb R^d,w)$. The question is shown to subtly depend on the sense in which the limit is taken. First, we fully characterize the existence of radial limits. Second, we give essentially sharp sufficient conditions for the existence of vertical limits. In the specific setting of product and radial weights, we give if and only if statements. These generalize and give new proofs for results of Fefferman and Uspenski\uı.

preprint2022arXiv

Pressure and temperature dependence of fluorescence anisotropy of Green Fluorescent Protein

We have studied the effect of high hydrostatic pressure and temperature on the steady state fluorescence anisotropy of Green Fluorescent Protein (GFP). We find that the fluorescence anisotropy of GFP at a constant temperature decreases with increasing pressure. At atmospheric pressure, anisotropy decreases with increasing temperature but exhibits a maximum with temperature for pressure larger than 20 MPa. The temperature corresponding to the maximum of anisotropy increases with increasing pressure. By taking into account of the rotational correlation time changes of GFP with the pressure-temperature dependent viscosity of the solvent, we argue that viscosity increase with pressure is not a major contributing factor to the decrease in anisotropy with pressure. Our results suggest that the decrease of fluorescence anisotropy with pressure may result from changes in H-bonding environment around the chromophore.

preprint2020arXiv

Active Imitation Learning from Multiple Non-Deterministic Teachers: Formulation, Challenges, and Algorithms

We formulate the problem of learning to imitate multiple, non-deterministic teachers with minimal interaction cost. Rather than learning a specific policy as in standard imitation learning, the goal in this problem is to learn a distribution over a policy space. We first present a general framework that efficiently models and estimates such a distribution by learning continuous representations of the teacher policies. Next, we develop Active Performance-Based Imitation Learning (APIL), an active learning algorithm for reducing the learner-teacher interaction cost in this framework. By making query decisions based on predictions of future progress, our algorithm avoids the pitfalls of traditional uncertainty-based approaches in the face of teacher behavioral uncertainty. Results on both toy and photo-realistic navigation tasks show that APIL significantly reduces the numbers of interactions with teachers without compromising on performance. Moreover, it is robust to various degrees of teacher behavioral uncertainty.

preprint2020arXiv

Global Voices: Crossing Borders in Automatic News Summarization

We construct Global Voices, a multilingual dataset for evaluating cross-lingual summarization methods. We extract social-network descriptions of Global Voices news articles to cheaply collect evaluation data for into-English and from-English summarization in 15 languages. Especially, for the into-English summarization task, we crowd-source a high-quality evaluation dataset based on guidelines that emphasize accuracy, coverage, and understandability. To ensure the quality of this dataset, we collect human ratings to filter out bad summaries, and conduct a survey on humans, which shows that the remaining summaries are preferred over the social-network summaries. We study the effect of translation quality in cross-lingual summarization, comparing a translate-then-summarize approach with several baselines. Our results highlight the limitations of the ROUGE metric that are overlooked in monolingual summarization. Our dataset is available for download at https://forms.gle/gpkJDT6RJWHM1Ztz9 .

preprint2020arXiv

On the network orientational affinity assumption in polymers and the micro-macro connection through the chain stretch

We question the network affinity assumption in modeling chain orientations under polymer deformations, and the use of the stretch measure projected from the right Cauchy-Green deformation tensor (or non-affine micro-stretches derived from that measure) as a basic state variable for the micro-macro transition. These ingredients are standard, taken from the statistical theory of polymers, and used in most micromechanical polymer network and soft tissue models. The affinity assumption imposes a constraint in the network which results in an anisotropic distribution of the orientation of the chains and, hence, in an additional configurational entropy that should be included. This additional entropy would result in an additional stress tensor. But an arguably more natural alternative, in line with the typical assumption for the chain behavior itself and with the disregard of these forces, is to consider that the network may fluctuate unconstrained to adapt to macroscopic deformations. This way, the isotropic statistical distribution of the orientation of the chains is maintained unconstrained during deformation and no additional stress is imposed. Then, we show that this free-fluctuating network assumption is equivalent to consider the stretch projected from the stretch tensor (instead of the right Cauchy-Green deformation tensor) as the state variable for the deformation of the network chains. We show very important differences in predictions using both assumed behaviors, and demonstrate that with the free-fluctuating network assumption, we can obtain accurate predictions for all tests in polymers using just one test curve to calibrate the model. With the same macro-micro-macro approach employing the network affinity assumption, we are capable of capturing accurately only the test used for calibration of the model, but not the overall polymer behavior.

preprint2020arXiv

Pre-processing Image using Brightening, CLAHE and RETINEX

This paper focuses on finding the most optimal pre-processing methods considering three common algorithms for image enhancement: Brightening, CLAHE and Retinex. For the purpose of image training in general, these methods will be combined to find out the most optimal method for image enhancement. We have carried out the research on the different permutation of three methods: Brightening, CLAHE and Retinex. The evaluation is based on Canny Edge detection applied to all processed images. Then the sharpness of objects will be justified by true positive pixels number in comparison between images. After using different number combinations pre-processing functions on images, CLAHE proves to be the most effective in edges improvement, Brightening does not show much effect on the edges enhancement, and the Retinex even reduces the sharpness of images and shows little contribution on images enhancement.