Source author record

Huy Tran

Huy Tran appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

math.CV; Computer Vision; math.PR; Machine Learning; Artificial Intelligence; Distributed, Parallel, and Cluster Computing; eess.SY; Logic in Computer Science; math.GT; Robotics; Social and Information Networks; Software Engineering; Systems and Control

Catalog footprint

What is connected

11 works

13 topics

4 close collaborators


preprint, 2026, arXiv

ASAP: Amortized Doubly-Stochastic Attention via Sliced Dual Projection

Doubly-stochastic attention has emerged as a transport-based alternative to row-softmax attention, with recent Transformer variants using it to reduce attention sinks and rank collapse while improving performance. In this family, the standard approach is Sinkhorn scaling, which trains more efficiently but still repeats matrix scaling in every inference forward pass. Sliced-transport attention removes the online iteration, but its soft sorting approximation materializes dense tensors for each slice, requiring substantially more training resources than Sinkhorn attention. We introduce ASAP: Amortized Doubly-Stochastic Attention via Sliced Dual Projection, a train-then-compile method that trains the doubly-stochastic layer with Sinkhorn, then replaces the iterative scaling loop at inference with a fixed sliced-dual operator. It learns a lightweight parametric map from exact one-dimensional Kantorovich potentials to the Sinkhorn query-side dual, then reconstructs the attention plan with a two-sided entropic c-transform. Across language and vision benchmarks, ASAP keeps the cheaper training setup and remains highly competitive with recent baselines. In the main frozen-layer benchmark, ASAP is 5.3× faster than the trained Sinkhorn teacher while matching its accuracy; in downstream replacements, ASAP recovers most of the teacher performance without any retraining.
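The Sinkhorn scaling loop that the abstract refers to can be illustrated roughly as follows. This is a minimal sketch of plain Sinkhorn normalization on a square score matrix, not the ASAP method and not the authors' implementation; the function name `sinkhorn_plan` and the parameter `n_iters` are hypothetical names chosen for illustration.

```python
import math


def sinkhorn_plan(scores, n_iters=50):
    """Turn a square matrix of attention scores into an approximately
    doubly-stochastic matrix by alternating column and row normalization.

    Hypothetical helper for illustration only; assumes a square input.
    """
    n = len(scores)
    # Subtract the global maximum before exponentiating, for numerical stability.
    top = max(max(row) for row in scores)
    plan = [[math.exp(s - top) for s in row] for row in scores]
    for _ in range(n_iters):
        # Scale every column so that it sums to one.
        col_sums = [sum(plan[i][j] for i in range(n)) for j in range(n)]
        plan = [[plan[i][j] / col_sums[j] for j in range(n)] for i in range(n)]
        # Scale every row so that it sums to one.
        row_sums = [sum(row) for row in plan]
        plan = [[v / s for v in row] for row, s in zip(plan, row_sums)]
    return plan
```

Because the loop runs once per forward pass, its cost grows with `n_iters`; that repeated matrix scaling is the inference-time work the abstract says ASAP replaces with a fixed operator.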

preprint, 2022, arXiv

Coarse-to-Fine Reasoning for Visual Question Answering

Bridging the semantic gap between image and question is an important step toward improving the accuracy of the Visual Question Answering (VQA) task. However, most existing VQA methods focus on attention mechanisms or visual relations for reasoning about the answer, while the features at different semantic levels are not fully utilized. In this paper, we present a new reasoning framework to fill the gap between visual features and semantic clues in the VQA task. Our method first extracts the features and predicates from the image and question. We then propose a new reasoning framework that jointly learns these features and predicates in a coarse-to-fine manner. Extensive experimental results on three large-scale VQA datasets show that our proposed approach achieves superior accuracy compared with other state-of-the-art methods. Furthermore, our reasoning framework also provides an explainable way to understand the decision of the deep neural network when predicting the answer.

preprint, 2022, arXiv

Fine-Grained Visual Classification using Self Assessment Classifier

Extracting discriminative features plays a crucial role in the fine-grained visual classification task. Most of the existing methods focus on developing attention or augmentation mechanisms to achieve this goal. However, addressing the ambiguity in the top-k prediction classes is not fully investigated. In this paper, we introduce a Self Assessment Classifier, which simultaneously leverages the representation of the image and top-k prediction classes to reassess the classification results. Our method is inspired by continual learning with coarse-grained and fine-grained classifiers to increase the discrimination of features in the backbone and produce attention maps of informative areas on the image. In practice, our method works as an auxiliary branch and can be easily integrated into different architectures. We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets. Furthermore, our method also consistently improves the accuracy of different existing fine-grained classifiers with a unified setup.

preprint, 2022, arXiv

WayFAST: Navigation with Predictive Traversability in the Field

We present a self-supervised approach for learning to predict traversable paths for wheeled mobile robots that require good traction to navigate. Our algorithm, termed WayFAST (Waypoint Free Autonomous Systems for Traversability), uses RGB and depth data, along with navigation experience, to autonomously generate traversable paths in outdoor unstructured environments. Our key inspiration is that traction can be estimated for rolling robots using kinodynamic models. Using traction estimates provided by an online receding horizon estimator, we are able to train a traversability prediction neural network in a self-supervised manner, without requiring heuristics utilized by previous methods. We demonstrate the effectiveness of WayFAST through extensive field testing in varying environments, ranging from sandy dry beaches to forest canopies and snow covered grass fields. Our results clearly demonstrate that WayFAST can learn to avoid geometric obstacles as well as untraversable terrain, such as snow, which would be difficult to avoid with sensors that provide only geometric data, such as LiDAR. Furthermore, we show that our training pipeline based on online traction estimates is more data-efficient than other heuristic-based methods.

preprint, 2020, arXiv

A support theorem for SLE curves

For all $κ > 0$, we show that the support of SLE$_κ$ curves is the closure in the sup-norm of the set of Loewner curves driven by nice (e.g. smooth) functions. It follows that the support is the closure of the set of simple curves starting at $0$.

preprint, 2020, arXiv

The continuum self-similar tree

We introduce the continuum self-similar tree (CSST) and characterize it topologically. We apply this to answer a question of Curien about the topology of the continuum random tree (CRT). We also give a topological characterization of other trees with branch points of finite or infinite valences.

preprint, 2016, arXiv

On the regularity of SLE trace

We revisit regularity of SLE trace, for all $κ \neq 8$, and establish Besov regularity under the usual half-space capacity parametrization. With an embedding theorem of Garsia–Rodemich–Rumsey type, we obtain finite moments (and hence almost surely) optimal variation regularity with index $\min(1 + κ/8, 2)$, improving on previous works of Werness, and also (optimal) Hölder regularity à la Johansson Viklund and Lawler.

preprint, 2014, arXiv

Automated Mapping of UML Activity Diagrams to Formal Specifications for Supporting Containment Checking

Business analysts and domain experts often sketch the behaviors of a software system using high-level models that are technology- and platform-independent. Developers then refine and enrich these high-level models with technical details. As a consequence, the refined models can deviate from the original models over time, especially when the two kinds of models evolve independently. In this context, we focus on behavior models; that is, we aim to ensure that the refined, low-level behavior models conform to the corresponding high-level behavior models. Based on existing formal verification techniques, we propose containment checking as a means to assess whether the system's behaviors described by the low-level models satisfy what has been specified in the high-level counterparts. One of the major obstacles is the burden of creating formal specifications of the behavior models and of the consistency constraints, which is a tedious and error-prone task when done manually. The approach presented in this paper aims to alleviate these challenges by treating the behavior models as verification inputs and devising automated mappings of behavior models onto formal properties and descriptions that can be used directly by model checkers. We discuss various challenges in our approach and show its applicability in illustrative scenarios.

preprint, 2014, arXiv

Regularity of Loewner Curves

The Loewner equation encrypts a growing simple curve in the plane into a real-valued driving function. We show that if the driving function $λ$ is in $C^β$ with $β > 2$ (or real analytic) then the Loewner curve is in $C^{β + \frac{1}{2}}$ (respectively analytic). This is a converse to a result by Earle and Epstein and extends a result of Wong.

preprint, 2013, arXiv

Convergence of an algorithm simulating Loewner curves

The development of Schramm–Loewner evolution (SLE) as the scaling limits of discrete models from statistical physics makes direct simulation of SLE an important task. The most common method, suggested by Marshall and Rohde [MR05], is to sample Brownian motion at discrete times, interpolate appropriately in between, and solve the Loewner equation explicitly with this approximation. This algorithm always produces piecewise smooth, non-self-intersecting curves, whereas SLE$_κ$ has been proven to be simple for $κ \in [0,4]$, self-touching for $κ \in (4,8)$ and space-filling for $κ \geq 8$. In this paper we show that this sequence of curves converges to SLE$_κ$ for all $κ \neq 8$ by giving a condition on deterministic driving functions that ensures the sup-norm convergence of simulated curves when we use this algorithm.
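The sample-and-solve idea described in this abstract can be sketched as below. This is a simplified variant under stated assumptions, not the Marshall and Rohde scheme itself: it holds the driving function constant on each time step (vertical-slit maps) instead of interpolating, and it assumes the chordal Loewner equation in the normalization $\partial_t g_t(z) = 2/(g_t(z) - λ(t))$. The names `upper_sqrt`, `loewner_trace` and `brownian_driving` are hypothetical.

```python
import cmath
import math
import random


def upper_sqrt(z):
    """Square root with the branch whose value has non-negative imaginary part."""
    r = cmath.sqrt(z)
    return -r if r.imag < 0 else r


def loewner_trace(driving, dt):
    """Approximate curve points gamma(dt), gamma(2*dt), ... for a sampled driving function.

    Assumption: driving[k] is used as a constant value on the k-th step of
    length dt, so each step is an explicit vertical-slit map
        g(z) = lam + sqrt((z - lam)**2 + 4*dt),
    and the tip at step n is the composition of the inverse maps applied to
    the last driving value. The cost is quadratic in the number of steps.
    """
    trace = []
    for n in range(1, len(driving) + 1):
        w = complex(driving[n - 1], 0.0)
        # Pull the point back through the inverse slit maps, newest step first.
        for k in range(n - 1, -1, -1):
            d = w - driving[k]
            w = driving[k] + upper_sqrt(d * d - 4.0 * dt)
        trace.append(w)
    return trace


def brownian_driving(kappa, n_steps, dt, seed=0):
    """Sample sqrt(kappa) * B_t at times dt, 2*dt, ... as a Gaussian random walk."""
    rng = random.Random(seed)
    values, b = [], 0.0
    for _ in range(n_steps):
        b += rng.gauss(0.0, math.sqrt(kappa * dt))
        values.append(b)
    return values
```

As a sanity check, a driving function that is identically zero should give the vertical ray $γ(t) = 2i\sqrt{t}$, and calling `loewner_trace(brownian_driving(2.0, 200, 0.005), 0.005)` gives a rough discrete approximation of a curve driven by $\sqrt{2}\,B_t$.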

preprint, 2011, arXiv

SocialCloud: Using Social Networks for Building Distributed Computing Services

In this paper we investigate a new computing paradigm, called SocialCloud, in which computing nodes are governed by social ties derived from a bootstrapping trust-possessing social graph. We investigate how this paradigm differs from existing computing paradigms, such as grid computing and conventional cloud computing. We show that incentives to adopt this paradigm are intuitive and natural, and that the security and trust guarantees it provides are solid. We propose metrics for measuring the utility and advantage of this computing paradigm, and, using real-world social graphs and structures of social traces, we investigate the potential of this paradigm for ordinary users. We study several design options and trade-offs, such as scheduling algorithms, centralization, and straggler handling, and show how they affect the utility of the paradigm. Interestingly, we conclude that graphs known in the literature for high trust properties do not serve distributed trusted computing algorithms, such as Sybil defenses, because of their weak algorithmic properties, yet such graphs are good candidates for our paradigm because of their self-load-balancing features.
