Source author record

Cuong Nguyen

Cuong Nguyen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Machine Learning, Computer Vision, Artificial Intelligence, cond-mat.soft, physics.geo-ph, physics.optics, Social and Information Networks

Catalog footprint

What is connected

10 works

7 topics

4 close collaborators

preprint · 2026 · arXiv

People-Centred Medical Image Analysis

Recent advances in data-centric medical AI have produced highly accurate diagnostic systems, but the emphasis on data curation and performance metrics has not translated into widespread clinical adoption. We conjecture that this limited uptake stems from insufficient attention dedicated to the optimisation of fair performance across diverse patient populations and to workflow integration: performance biases can create regulatory barriers, and poorly integrated automation can disrupt clinical routines, degrade the quality of human-AI collaboration, and reduce clinicians' willingness to adopt AI tools. Prior work on workflow integration (e.g., Learning to Defer (L2D) and Learning to Complement (L2C)) and AI fairness has typically examined these challenges in isolation, overlooking their natural interdependence and the practical constraints of clinical environments, such as restricted clinician availability. We propose People-Centred Medical Image Analysis (PecMan), a human-AI framework that jointly optimises fairness, diagnostic accuracy, and workflow effectiveness through a dynamic gating mechanism that assigns cases to AI, clinicians, or both under clinician workload constraints. We also introduce the Fairness and Human-Centred AI (FairHAI) benchmark for evaluating trade-offs between accuracy, fairness, and clinician workload. Experiments using this benchmark show that PecMan consistently outperforms existing methods, paving the way for more trustworthy and clinically viable AI systems. Code will be available upon paper acceptance.
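The idea of routing cases under a clinician workload constraint can be illustrated with a minimal sketch. This is not PecMan's learned gating mechanism, which the abstract does not specify. It swaps in a much simpler rule: defer the least-confident cases to a clinician until a fixed budget is used up. The function name `assign_cases`, the `clinician_budget` parameter, and the confidence values are all hypothetical, and the "both" route and any fairness term are omitted.

```python
from typing import List


def assign_cases(ai_confidence: List[float], clinician_budget: int) -> List[str]:
    """Route the least-confident cases to a clinician, up to a fixed budget.

    Hypothetical stand-in for a learned gate: confidence ranking only.
    """
    if clinician_budget < 0:
        raise ValueError("clinician_budget must be non-negative")
    # Rank case indices from least to most confident.
    ranked = sorted(range(len(ai_confidence)), key=lambda i: ai_confidence[i])
    # Only the first `clinician_budget` cases may be deferred.
    deferred = set(ranked[:clinician_budget])
    return ["clinician" if i in deferred else "ai" for i in range(len(ai_confidence))]


# Four cases, with clinician time for at most two of them (made-up values).
routes = assign_cases([0.95, 0.52, 0.80, 0.61], clinician_budget=2)
print(routes)
```

With these values the two least-confident cases (indices 1 and 3) go to the clinician and the rest stay with the AI.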

preprint · 2024 · arXiv

Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation

Deep learning faces a formidable challenge when handling noisy labels, as models tend to overfit samples affected by label noise. This challenge is further compounded by the presence of instance-dependent noise (IDN), a realistic form of label noise arising from ambiguous sample information. To address IDN, Label Noise Learning (LNL) incorporates a sample selection stage to differentiate clean and noisy-label samples. This stage uses an arbitrary criterion and a pre-defined curriculum that initially selects most samples as noisy and gradually decreases this selection rate during training. Such a curriculum is sub-optimal since it does not consider the actual label noise rate in the training set. This paper addresses this issue with a new noise-rate estimation method that is easily integrated with most state-of-the-art (SOTA) LNL methods to produce a more effective curriculum. Synthetic and real-world benchmark results demonstrate that integrating our approach with SOTA LNL methods improves accuracy in most cases.
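To make the role of a noise-rate estimate concrete, here is a minimal sketch of how such an estimate could set the size of the selected "clean" subset. It assumes a small-loss selection criterion, which is a common choice but is not stated in the abstract, and it takes the estimated noise rate as a given input rather than computing it with the paper's graphical model. The name `select_clean` and the loss values are hypothetical.

```python
from typing import List


def select_clean(losses: List[float], noise_rate: float) -> List[int]:
    """Return indices of the samples treated as clean.

    Assumption: the (1 - noise_rate) fraction of samples with the smallest
    per-sample loss is kept; the rest are treated as noisy-label samples.
    """
    if not 0.0 <= noise_rate <= 1.0:
        raise ValueError("noise_rate must lie in [0, 1]")
    n_keep = round((1.0 - noise_rate) * len(losses))
    # Rank sample indices by ascending loss and keep the first n_keep.
    ranked = sorted(range(len(losses)), key=lambda i: losses[i])
    return sorted(ranked[:n_keep])


# Five samples with made-up losses and an assumed estimated noise rate of 40%.
clean = select_clean([0.1, 2.3, 0.4, 1.9, 0.2], noise_rate=0.4)
print(clean)
```

A pre-defined curriculum would instead fix the kept fraction by schedule, regardless of how noisy the training set actually is.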

preprint · 2024 · arXiv

PASS: Peer-Agreement based Sample Selection for training with Noisy Labels

The prevalence of noisy-label samples poses a significant challenge in deep learning, inducing overfitting effects. This has, therefore, motivated the emergence of learning with noisy-label (LNL) techniques that focus on separating noisy- and clean-label samples to apply different learning strategies to each group of samples. Current methodologies often rely on the small-loss hypothesis or feature-based selection to separate noisy- and clean-label samples, yet our empirical observations reveal their limitations, especially for labels with instance-dependent noise (IDN). An important characteristic of IDN is the difficulty of distinguishing the clean-label samples that lie near the decision boundary (i.e., the hard samples) from the noisy-label samples. We, therefore, propose a new noisy-label detection method, termed Peer-Agreement based Sample Selection (PASS), to address this problem. Utilising a trio of classifiers, PASS employs consensus-driven peer-based agreement of two models to select the samples to train the remaining model. PASS is easily integrated into existing LNL models, enabling the improvement of the detection accuracy of noisy- and clean-label samples, which increases the classification accuracy across various LNL benchmarks.
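The peer-agreement idea (two classifiers jointly choose the samples used to train the third) can be sketched as follows. The abstract does not say how agreement is measured, so this sketch assumes cosine similarity between the two peers' predicted class probabilities and a fixed threshold. The helper names, the threshold, and the probability values are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def peer_select(probs_a: List[Sequence[float]],
                probs_b: List[Sequence[float]],
                threshold: float) -> List[int]:
    """Indices of samples on which the two peer models agree strongly enough."""
    return [i for i, (p, q) in enumerate(zip(probs_a, probs_b))
            if cosine(p, q) >= threshold]


def rotate_roles(n_models: int = 3) -> List[Tuple[int, Tuple[int, ...]]]:
    """Pair each model index with the indices of the peers that select for it."""
    return [(k, tuple(m for m in range(n_models) if m != k))
            for k in range(n_models)]


# Made-up predicted probabilities from two peers for three samples.
peer_a = [[0.9, 0.1], [0.6, 0.4], [0.2, 0.8]]
peer_b = [[0.8, 0.2], [0.3, 0.7], [0.1, 0.9]]
selected = peer_select(peer_a, peer_b, threshold=0.95)
print(selected)        # samples passed to the remaining (third) model
print(rotate_roles())  # each model takes a turn as the one being trained
```

In this toy example the peers disagree on the middle sample, so it is held out from the third model's training set for this round.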

preprint · 2023 · arXiv

Geo-mechanical aspects for breakage detachment of rock fines by Darcy's flow

Suspension-colloidal-nano transport in porous media encompasses the detachment of detrital fines against electrostatic attraction and authigenic fines by breakage, from the rock surface. While much is currently known about the underlying mechanisms governing detachment of detrital particles, including detachment criteria at the pore scale and its upscaling for the core scale, a critical gap exists due to the absence of this knowledge for authigenic fines. Integrating 3D Timoshenko's beam theory of elastic cylinder deformation with a CFD-based model for viscous flow around the attached particle and with strength failure criteria for the particle-rock bond, we developed a novel theory for fines detachment by breakage at the pore scale. The breakage criterion derived includes analytical expressions for tensile and shear stress maxima along with two geometric diagrams which allow determining the breaking stress. This leads to an explicit formula for the breakage flow velocity. Its upscaling yields a mathematical model for fines detachment by breakage, expressed in the form of the maximum retained concentration of attached fines versus flow velocity, the maximum retention function (MRF) for breakage. We performed corefloods with piecewise constant increasing flow rates, measuring breakthrough concentration and pressure drop across the core. The behaviour of the measured data is consistent with two-population colloidal transport, attributed to detrital and authigenic fines migration. Indeed, the laboratory data closely match the analytical model for two-population colloidal transport, which validates the proposed mathematical model for fines detachment by breakage.

preprint · 2023 · arXiv

Task Weighting in Meta-learning with Trajectory Optimisation

Developing meta-learning algorithms that are un-biased toward a subset of training tasks often requires hand-designed criteria to weight tasks, potentially resulting in sub-optimal solutions. In this paper, we introduce a new principled and fully-automated task-weighting algorithm for meta-learning methods. By considering the weights of tasks within the same mini-batch as an action, and the meta-parameter of interest as the system state, we cast the task-weighting meta-learning problem as a trajectory optimisation problem and employ the iterative linear quadratic regulator to determine the optimal action or weights of tasks. We theoretically show that the proposed algorithm converges to an $\epsilon_{0}$-stationary point, and empirically demonstrate that the proposed approach out-performs common hand-engineered weighting methods in two few-shot learning benchmarks.

preprint · 2022 · arXiv

Instance-Dependent Noisy Label Learning via Graphical Modelling

Noisy labels are unavoidable yet troublesome in the ecosystem of deep learning because models can easily overfit them. There are many types of label noise, such as symmetric, asymmetric and instance-dependent noise (IDN), with IDN being the only type that depends on image information. Such dependence on image information makes IDN a critical type of label noise to study, given that labelling mistakes are caused in large part by insufficient or ambiguous information about the visual classes present in images. Aiming to provide an effective technique to address IDN, we present a new graphical modelling approach called InstanceGM that combines discriminative and generative models. The main contributions of InstanceGM are: i) the use of the continuous Bernoulli distribution to train the generative model, offering significant training advantages, and ii) the exploration of a state-of-the-art noisy-label discriminative classifier to generate clean labels from instance-dependent noisy-label samples. InstanceGM is competitive with current noisy-label learning approaches, particularly in IDN benchmarks using synthetic and real-world datasets, where our method shows better accuracy than the competitors in most experiments.

preprint · 2022 · arXiv

Why So Inflammatory? Explainability in Automatic Detection of Inflammatory Social Media Users

Hate speech and misinformation, spread over social networking services (SNS) such as Facebook and Twitter, have inflamed ethnic and political violence in countries across the globe. We argue that there is limited research on this problem within the context of the Global South and present an approach for tackling it. Prior works have shown how machine learning models built with user-level interaction features can effectively identify users who spread inflammatory content. While this technique is beneficial in low-resource language settings where linguistic resources such as ground truth data and processing capabilities are lacking, it is still unclear how these interaction features contribute to model performance. In this work, we investigate and show significant differences in interaction features between users who spread inflammatory content and others who do not, applying explainability tools to understand our trained model. We find that features with higher interaction significance (such as account age and activity count) show higher explanatory power than features with lower interaction significance (such as name length and whether the user has a location in their bio). Our work extends research directions that aim to understand the nature of inflammatory content in low-resource, high-risk contexts as the growth of social media use in the Global South outstrips moderation efforts.

preprint · 2021 · arXiv

Similarity of Classification Tasks

Recent advances in meta-learning have led to remarkable performances on several few-shot learning benchmarks. However, such success often ignores the similarity between training and testing tasks, resulting in a potentially biased evaluation. We, therefore, propose a generative approach based on a variant of Latent Dirichlet Allocation to analyse task similarity to optimise and better understand the performance of meta-learning. We demonstrate that the proposed method can provide an insightful evaluation for meta-learning algorithms on two few-shot classification benchmarks that matches common intuition: the more similar the tasks, the higher the performance. Based on this similarity measure, we propose a task-selection strategy for meta-learning and show that it can produce more accurate classification results than methods that randomly select training tasks.

preprint · 2019 · arXiv

Uncertainty in Model-Agnostic Meta-Learning using Variational Inference

We introduce a new, rigorously-formulated Bayesian meta-learning algorithm that learns a probability distribution of model parameter prior for few-shot learning. The proposed algorithm employs a gradient-based variational inference to infer the posterior of model parameters for a new task. Our algorithm can be applied to any model architecture and can be implemented in various machine learning paradigms, including regression and classification. We show that the models trained with our proposed meta-learning algorithm are well calibrated and accurate, with state-of-the-art calibration and classification results on two few-shot classification benchmarks (Omniglot and Mini-ImageNet), and competitive results in a multi-modal task-distribution regression.

preprint · 2016 · arXiv

Bridging the Gap between Crosslinking Chemistry and Directed Assembly of Metasurfaces Using Electrohydrodynamic Flow

Advances in understanding chemical and physical driving forces in self-assembly allow the fabrication of unique nanoarchitectures with subwavelength building blocks as the basis for plasmonic and metamaterial devices. Chemical crosslinking of colloidal nanospheres has produced among the smallest gap spacings, necessary to obtain regions of greatly enhanced electric field, hotspots, which are critical to tailor light-matter interactions. However, obtaining a uniform electromagnetic response of dense nanoantennas over a large area for use in devices remains challenging. In this work, electrohydrodynamic (EHD) flow and chemical crosslinking are combined to form dense, yet discrete, Au nanosphere clusters (oligomers) on a working electrode. EHD provides a long range driving force to bring nanospheres together and anhydride crosslinking yields 0.9 nm gap spacings. Using selective chemistry, nanospheres are simultaneously crosslinked onto a block copolymer template, producing oligomers with a narrower wavelength bandwidth and higher hotspot intensity than monolayer structures produced without a template. We investigate nanoantenna response via full wave simulations, ultraviolet-visible spectroscopy, and surface enhanced Raman scattering (SERS). Nanoantennas exhibit uniform hotspot intensity and gap spacing. Simulations show field enhancements of 600, correlating well with measured average SERS enhancement of 1.4 × 10^9. Nanoantenna substrates produce a SERS signal with a relative standard deviation of 10% measured over a 1 mm² area, crucial for nano-optical devices such as optical sensors, among other applications. Understanding long range (EHD flow) and short range (chemical crosslinking) driving forces provides the control for assembling colloidal nanoparticles in architectures for large area plasmonic and metasurface device fabrication.