Source author record

Jun Yuan

Jun Yuan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Catalog footprint

What is connected

20 works

20 topics

4 close collaborators

preprint, 2026, arXiv

Effective Knowledge Transfer for Multi-Task Recommendation Models

The conversion rate (CVR) is a crucial metric for evaluating the effectiveness of platforms, as it quantifies the alignment of content with audience preferences. However, the limited nature of customers' conversion actions presents a significant challenge for training ranking models effectively. In this paper, we propose an Effective Knowledge Transfer method for Multi-task Recommendation Models (EKTM). This method enables the ranking model to learn from diverse user behaviors, thereby enhancing performance through the transfer of knowledge across distinct yet related tasks. Each specific CVR task can directly benefit from the insights provided by other tasks. To achieve this, we first introduce a router module that integrates and disseminates knowledge across tasks. Subsequently, each CVR task is equipped with a transmitter module that facilitates the transformation of knowledge from the router. Additionally, we propose an enhanced module to ensure that the transferred knowledge benefits the original task learning. Extensive experiments on several benchmark datasets demonstrate that our proposed method outperforms existing state-of-the-art approaches. Online A/B testing on a commercial platform has validated the effectiveness of the EKTM algorithm in large-scale industrial settings, resulting in a 3.93% uplift in effective Cost Per Mille (eCPM). The algorithm has since been fully deployed across two of the platform's main-traffic scenarios.

preprint, 2023, arXiv

Gamma and Vega Hedging Using Deep Distributional Reinforcement Learning

We show how D4PG can be used in conjunction with quantile regression to develop a hedging strategy for a trader responsible for derivatives that arrive stochastically and depend on a single underlying asset. We assume that the trader makes the portfolio delta neutral at the end of each day by taking a position in the underlying asset. We focus on how trades in the options can be used to manage gamma and vega. The option trades are subject to transaction costs. We consider three different objective functions. We reach conclusions on how the optimal hedging strategy depends on the trader's objective function, the level of transaction costs, and the maturity of the options used for hedging. We also investigate the robustness of the hedging strategy to the process assumed for the underlying asset.
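As a rough illustration of the delta-neutral step described in this abstract, the sketch below computes Greeks for a small book of European calls and the position in the underlying that offsets the book's delta. The Black-Scholes model, the function names and the sample numbers are assumptions made for illustration and are not taken from the paper; the D4PG agent, the quantile regression and the transaction costs are not modelled here.

```python
from math import erf, exp, log, pi, sqrt


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))


def norm_pdf(x):
    """Standard normal probability density function."""
    return exp(-0.5 * x * x) / sqrt(2.0 * pi)


def call_greeks(spot, strike, rate, vol, maturity):
    """Black-Scholes delta, gamma and vega of one European call (assumed model)."""
    d1 = (log(spot / strike) + (rate + 0.5 * vol ** 2) * maturity) / (vol * sqrt(maturity))
    delta = norm_cdf(d1)
    gamma = norm_pdf(d1) / (spot * vol * sqrt(maturity))
    vega = spot * norm_pdf(d1) * sqrt(maturity)
    return delta, gamma, vega


def delta_neutral_hedge(positions, spot, rate, vol):
    """Return (underlying position, residual gamma, residual vega).

    positions is a list of (quantity, strike, maturity) tuples for calls;
    a negative quantity means the trader is short the option.
    """
    total_delta = total_gamma = total_vega = 0.0
    for quantity, strike, maturity in positions:
        delta, gamma, vega = call_greeks(spot, strike, rate, vol, maturity)
        total_delta += quantity * delta
        total_gamma += quantity * gamma
        total_vega += quantity * vega
    # One unit of the underlying has delta 1 and zero gamma and vega,
    # so holding -total_delta units cancels delta and nothing else.
    return -total_delta, total_gamma, total_vega


# Hypothetical book: short 10 six-month calls and short 5 one-year calls.
book = [(-10, 100.0, 0.5), (-5, 105.0, 1.0)]
shares, residual_gamma, residual_vega = delta_neutral_hedge(book, spot=100.0, rate=0.0, vol=0.2)
print(shares, residual_gamma, residual_vega)
```

Because the underlying carries no gamma or vega, the residual exposures in this sketch can only be changed by trading options, which is the decision the abstract says the hedging strategy addresses.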

preprint, 2023, arXiv

JEPOO: Highly Accurate Joint Estimation of Pitch, Onset and Offset for Music Information Retrieval

Melody extraction is a core task in music information retrieval, and the estimation of pitch, onset and offset are key sub-tasks in melody extraction. Existing methods have limited accuracy, and work for only one type of data, either single-pitch or multi-pitch. In this paper, we propose a highly accurate method for joint estimation of pitch, onset and offset, named JEPOO. We address the challenges of joint learning optimization and handling both single-pitch and multi-pitch data through novel model design and a new optimization technique named Pareto modulated loss with loss weight regularization. This is the first method that can accurately handle both single-pitch and multi-pitch music data, and even a mix of them. A comprehensive experimental study on a wide range of real datasets shows that JEPOO outperforms state-of-the-art methods by up to 10.6%, 8.3% and 10.3% for the prediction of Pitch, Onset and Offset, respectively, and JEPOO is robust for various types of data and instruments. The ablation study shows the effectiveness of each component of JEPOO.

preprint, 2023, arXiv

Super-chirality of paraxial higher order Poincare modes

We demonstrate that higher order Poincare modes of order m are super-chiral, displaying enhancement factors proportional to $m$ and $m^2$ in their helicity/chirality. With m having arbitrarily large integer values, such modes, in principle, possess unlimited super-chirality. These findings pave the way to applications, including the strong enhancements of optical interactions with chiral matter. The work indicates considerable flexibility in controlling the helicity of any higher order paraxial twisted light mode and it incorporates a very wide range of physical scenarios.

preprint, 2022, arXiv

Evaluation of Sampling Methods for Scatterplots

Given a scatterplot with tens of thousands of points or even more, a natural question is which sampling method should be used to create a small but "good" scatterplot for a better abstraction. We present the results of a user study that investigates the influence of different sampling strategies on multi-class scatterplots. The main goal of this study is to understand the capability of sampling methods in preserving the density, outliers, and overall shape of a scatterplot. To this end, we comprehensively review the literature and select seven typical sampling strategies as well as eight representative datasets. We then design four experiments to understand the performance of different strategies in maintaining: 1) region density; 2) class density; 3) outliers; and 4) overall shape in the sampling results. The results show that: 1) random sampling is preferred for preserving region density; 2) blue noise sampling and random sampling have comparable performance with the three multi-class sampling strategies in preserving class density; 3) outlier biased density based sampling, recursive subdivision based sampling, and blue noise sampling perform the best in keeping outliers; and 4) blue noise sampling outperforms the others in maintaining the overall shape of a scatterplot.

preprint, 2022, arXiv

iSEA: An Interactive Pipeline for Semantic Error Analysis of NLP Models

Error analysis in NLP models is essential to successful model development and deployment. One common approach for diagnosing errors is to identify subpopulations in the dataset where the model produces the most errors. However, existing approaches typically define subpopulations based on pre-defined features, which requires users to form hypotheses of errors in advance. To complement these approaches, we propose iSEA, an Interactive Pipeline for Semantic Error Analysis in NLP Models, which automatically discovers semantically-grounded subpopulations with high error rates in the context of a human-in-the-loop interactive system. iSEA enables model developers to learn more about their model errors through discovered subpopulations, validate the sources of errors through interactive analysis on the discovered subpopulations, and test hypotheses about model errors by defining custom subpopulations. The tool supports semantic descriptions of error-prone subpopulations at the token and concept level, as well as pre-defined higher-level features. Through use cases and expert interviews, we demonstrate how iSEA can assist error understanding and analysis.

preprint, 2022, arXiv

Visual Analysis of Neural Architecture Spaces for Summarizing Design Principles

Recent advances in artificial intelligence largely benefit from better neural network architectures. These architectures are a product of a costly process of trial-and-error. To ease this process, we develop ArchExplorer, a visual analysis method for understanding a neural architecture space and summarizing design principles. The key idea behind our method is to make the architecture space explainable by exploiting structural distances between architectures. We formulate the pairwise distance calculation as solving an all-pairs shortest path problem. To improve efficiency, we decompose this problem into a set of single-source shortest path problems. The time complexity is reduced from O(kn^2N) to O(knN). Architectures are hierarchically clustered according to the distances between them. A circle-packing-based architecture visualization has been developed to convey both the global relationships between clusters and local neighborhoods of the architectures in each cluster. Two case studies and a post-analysis are presented to demonstrate the effectiveness of ArchExplorer in summarizing design principles and selecting better-performing architectures.
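The decomposition mentioned in this abstract, solving an all-pairs shortest path problem as a set of single-source problems, can be sketched generically as follows. This is a minimal illustration using Dijkstra's algorithm on a hypothetical toy graph; the graph, the function names and the choice of Dijkstra are assumptions, and the sketch does not reproduce ArchExplorer's distance computation or its stated complexity bounds.

```python
import heapq


def single_source_shortest_paths(graph, source):
    """Dijkstra from one source.

    graph maps each node to a dict of {neighbor: non-negative edge weight}.
    Returns a dict of shortest distances to every reachable node.
    """
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, node = heapq.heappop(heap)
        if d > dist.get(node, float("inf")):
            continue  # stale heap entry, a shorter path was already found
        for neighbor, weight in graph.get(node, {}).items():
            candidate = d + weight
            if candidate < dist.get(neighbor, float("inf")):
                dist[neighbor] = candidate
                heapq.heappush(heap, (candidate, neighbor))
    return dist


def all_pairs_shortest_paths(graph):
    """Solve the all-pairs problem as one single-source problem per node."""
    return {source: single_source_shortest_paths(graph, source) for source in graph}


# Hypothetical toy graph standing in for a space of architectures.
toy_graph = {
    "a": {"b": 1.0, "c": 4.0},
    "b": {"a": 1.0, "c": 2.0},
    "c": {"a": 4.0, "b": 2.0},
}
distances = all_pairs_shortest_paths(toy_graph)
print(distances["a"]["c"])
```

Each single-source run is independent of the others, so in a setting like this the per-source computations could also be cached or run in parallel.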

preprint, 2022, arXiv

Visual Exploration of Machine Learning Model Behavior with Hierarchical Surrogate Rule Sets

One of the potential solutions for model interpretation is to train a surrogate model: a more transparent model that approximates the behavior of the model to be explained. Typically, classification rules or decision trees are used due to the intelligibility of their logic-based expressions. However, decision trees can grow too deep and rule sets can become too large to approximate a complex model. Unlike paths on a decision tree that must share ancestor nodes (conditions), rules are more flexible. However, the unstructured visual representation of rules makes it hard to make inferences across rules. To address these issues, we present a workflow that includes novel algorithmic and interactive solutions. First, we present Hierarchical Surrogate Rules (HSR), an algorithm that generates hierarchical rules based on user-defined parameters. We also contribute SuRE, a visual analytics (VA) system that integrates HSR and interactive surrogate rule visualizations. Particularly, we present a novel feature-aligned tree to overcome the shortcomings of existing rule visualizations. We evaluate the algorithm in terms of parameter sensitivity, time performance, and comparison with surrogate decision trees and find that it scales reasonably well and outperforms decision trees in many respects. We also evaluate the visualization and the VA system by a usability study with 24 volunteers and an observational study with 7 domain experts. Our investigation shows that the participants can use feature-aligned trees to perform non-trivial tasks with very high accuracy. We also discuss many interesting observations that can be useful for future research on designing effective rule-based VA systems.

preprint, 2021, arXiv

Visualizing Rule Sets: Exploration and Validation of a Design Space

Rule sets are often used in Machine Learning (ML) as a way to communicate the model logic in settings where transparency and intelligibility are necessary. Rule sets are typically presented as a text-based list of logical statements (rules). Surprisingly, to date there has been limited work on exploring visual alternatives for presenting rules. In this paper, we explore the idea of designing alternative representations of rules, focusing on a number of visual factors we believe have a positive impact on rule readability and understanding. The paper presents an initial design space for visualizing rule sets and a user study exploring their impact. The results show that some design factors have a strong impact on how efficiently readers can process the rules while having minimal impact on accuracy. This work can help practitioners employ more effective solutions when using rules as a communication strategy to understand ML models.

preprint, 2020, arXiv

A Survey of Visual Analytics Techniques for Machine Learning

Visual analytics for machine learning has recently evolved as one of the most exciting areas in the field of visualization. To better identify which research topics are promising and to learn how to apply relevant techniques in visual analytics, we systematically review 259 papers published in the last ten years together with representative works before 2010. We build a taxonomy, which includes three first-level categories: techniques before model building, techniques during model building, and techniques after model building. Each category is further characterized by representative analysis tasks, and each task is exemplified by a set of recent influential works. We also discuss and highlight research challenges and promising potential future research opportunities useful for visual analytics researchers.

preprint, 2020, arXiv

Chirality-enabled optical dipole potential energy for two-level atoms

We consider the optical dipole potential energy, which arises from the interaction of a two-level atom with a circularly polarized Laguerre-Gaussian laser beam of small waist. The beam is characterized by the existence of a longitudinal electric field component which is responsible for the appearance of a chiral term in the optical dipole potential energy. This term reverses sign if either the winding number or the wave polarization of the beam reverses sign. We propose a scheme of a bi-chromatic vortex interaction with the two-level atom in which the resulting optical dipole potential is fully chiral.

preprint, 2020, arXiv

OoDAnalyzer: Interactive Analysis of Out-of-Distribution Samples

One major cause of performance degradation in predictive models is that the test samples are not well covered by the training data. Such poorly represented samples are called out-of-distribution (OoD) samples. In this paper, we propose OoDAnalyzer, a visual analysis approach for interactively identifying OoD samples and explaining them in context. Our approach integrates an ensemble OoD detection method and a grid-based visualization. The detection method is improved from deep ensembles by combining more features with algorithms in the same family. To better analyze and understand the OoD samples in context, we have developed a novel kNN-based grid layout algorithm motivated by Hall's theorem. The algorithm approximates the optimal layout and has $O(kN^2)$ time complexity, faster than the grid layout algorithm with overall best performance but $O(N^3)$ time complexity. Quantitative evaluation and case studies were performed on several datasets to demonstrate the effectiveness and usefulness of OoDAnalyzer.

preprint, 2020, arXiv

ViCE: Visual Counterfactual Explanations for Machine Learning Models

The continued improvements in the predictive accuracy of machine learning models have allowed for their widespread practical application. Yet, many decisions made with seemingly accurate models still require verification by domain experts. In addition, end-users of a model also want to understand the reasons behind specific decisions. Thus, the need for interpretability is increasingly paramount. In this paper we present an interactive visual analytics tool, ViCE, that generates counterfactual explanations to contextualize and evaluate model decisions. Each sample is assessed to identify the minimal set of changes needed to flip the model's output. These explanations aim to provide end-users with personalized actionable insights with which to understand, and possibly contest or improve, automated decisions. The results are effectively displayed in a visual interface where counterfactual explanations are highlighted and interactive methods are provided for users to explore the data and model. The functionality of the tool is demonstrated by its application to a home equity line of credit dataset.
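The idea of finding a minimal set of changes that flips a model's output can be illustrated with a brute-force search over feature subsets of increasing size. The sketch below is an assumption-laden toy: the linear classifier, its weights, the candidate values and the function names are all hypothetical, and the exhaustive search is a stand-in rather than the method used by ViCE.

```python
from itertools import combinations, product


def predict(weights, bias, x):
    """Toy linear classifier: returns 1 if the score is non-negative, else 0."""
    score = bias + sum(w * v for w, v in zip(weights, x))
    return 1 if score >= 0 else 0


def minimal_counterfactual(weights, bias, x, candidate_values):
    """Find a smallest set of feature changes that flips the prediction.

    candidate_values maps a feature index to the values it may take.
    Subsets are tried in order of increasing size, so the first flip found
    changes as few features as possible. Returns {index: new value} or None.
    """
    original = predict(weights, bias, x)
    n = len(x)
    for size in range(1, n + 1):
        for idxs in combinations(range(n), size):
            for values in product(*(candidate_values[i] for i in idxs)):
                trial = list(x)
                for i, v in zip(idxs, values):
                    trial[i] = v
                if predict(weights, bias, trial) != original:
                    return dict(zip(idxs, values))
    return None


# Hypothetical model and sample with three binary features.
weights = [1.0, -1.0, 0.5]
bias = -1.0
sample = [0, 1, 0]
binary_values = {0: [0, 1], 1: [0, 1], 2: [0, 1]}
changes = minimal_counterfactual(weights, bias, sample, binary_values)
print(changes)
```

Exhaustive search like this grows exponentially with the number of features, so it is only practical for small examples; it is shown here to make the notion of a minimal change set concrete.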

preprint, 2016, arXiv

Manipulated Object Proposal: A Discriminative Object Extraction and Feature Fusion Framework for First-Person Daily Activity Recognition

Detecting and recognizing objects interacting with humans lie in the center of first-person (egocentric) daily activity recognition. However, due to noisy camera motion and frequent changes in viewpoint and scale, most of the previous egocentric action recognition methods fail to capture and model highly discriminative object features. In this work, we propose a novel pipeline for first-person daily activity recognition, aiming at more discriminative object feature representation and object-motion feature fusion. Our object feature extraction and representation pipeline is inspired by the recent success of object hypotheses and deep convolutional neural network based detection frameworks. Our key contribution is a simple yet effective manipulated object proposal generation scheme. This scheme leverages motion cues such as motion boundary and motion magnitude (in contrast, camera motion is usually considered as "noise" for most previous methods) to generate a more compact and discriminative set of object proposals, which are more closely related to the objects which are being manipulated. Then, we learn more discriminative object detectors from these manipulated object proposals based on region-based convolutional neural network (R-CNN). Meanwhile, we develop a network based feature fusion scheme which better combines object and motion features. We show in experiments that the proposed framework significantly outperforms the state-of-the-art recognition performance on a challenging first-person daily activity benchmark.

preprint, 2015, arXiv

Multi-qubit quantum phase gates based on surface plasmons of a nanosphere

The Dicke subradiance and superradiance resulting from the interaction between surface plasmons of a nanosphere and an ensemble of quantum emitters have been investigated using a Green function approach. Based on such an investigation, we propose a scheme for a deterministic multi-qubit quantum phase gate. As an example, two-qubit, three-qubit, and four-qubit quantum phase gates have been designed and analyzed in detail. Phenomena due to the losses in the metal are discussed. Potential applications of these phenomena to quantum information processing are anticipated.

preprint, 2015, arXiv

Three-dimensional magnetic cloak working from DC to 250 kHz

Invisibility cloaking is one of the major outcomes of metamaterial research, but its practical potential, in particular at high frequencies (e.g., microwave to visible light), is fatally challenged by the complex material properties such devices usually demand. On the other hand, it is advantageous and also technologically instrumental to design cloaking devices for applications at low frequencies, where electromagnetic components are favorably uncoupled. In this work, we substantially extend the bilayer approach to create a three-dimensional magnetic cloak able to work in both static and dynamic fields. Under the quasi-static approximation, we demonstrate a perfect magnetic cloaking device with a large frequency band from zero to 250 kHz. The practical potential of our device is experimentally verified by using a commercial metal detector, which may lead to a real cloaking application where the dynamic magnetic field can be manipulated in desired ways.

preprint, 2014, arXiv

Dimensional cross-over of the bandgap transition in quasi-two-dimensional MoS2

The anisotropy of the electronic transition is an important physical property, not only determining a material's optical properties but also revealing the underlying character of the electronic states involved. Here we used momentum-resolved electron energy-loss spectroscopy to study the evolution of the anisotropy of the electronic transition involving the low-energy valence electrons in free-standing MoS2 systems as the layer thickness was reduced to a monolayer. We used orientation and spectral-density analysis to show that the indirect-to-direct band-gap transition is accompanied by a three- to two-dimensional anisotropy cross-over. The result provides a logical explanation for the large sensitivity of the indirect transition to changes in thickness compared with that of the direct transition. By tracking the energy of the indirect transition, we also revealed the asymmetric response of the valence band and conduction band to the quantum confinement effect. Our results have implications for future optoelectronic applications of atomically thin MoS2.

preprint, 2014, arXiv

Half-CNN: A General Framework for Whole-Image Regression

The Convolutional Neural Network (CNN) has achieved great success in image classification. The classification model can also be utilized at image or patch level for many other applications, such as object detection and segmentation. In this paper, we propose a whole-image CNN regression model, obtained by removing the fully connected layer and training the network with continuous feature maps. This is a generic regression framework that fits many applications. We demonstrate this method through two tasks: simultaneous face detection & segmentation, and scene saliency prediction. The result is comparable with other models in the respective fields, using only a small-scale network. Since the regression model is trained on corresponding image / feature map pairs, there are no requirements on uniform input size, as opposed to the classification model. Our framework avoids classifier design, a process that may introduce too much manual intervention in model development. Yet it is closely related to the classification network and offers some in-depth insight into CNN structures.

preprint, 2014, arXiv

Ultra-Fast Fluorescence Imaging in Vivo with Conjugated Polymer Fluorophores in the Second Near-Infrared Window

In vivo fluorescence imaging in the second near-infrared window (1.0-1.7 microns) can afford deep tissue penetration and high spatial resolution, owing to the reduced scattering of long-wavelength photons. Here, we synthesize a series of low-bandgap donor/acceptor copolymers with tunable emission wavelengths of 1050-1350 nm in this window. Non-covalent functionalization with phospholipid-polyethylene glycol results in water-soluble and biocompatible polymeric nanoparticles, allowing for live cell molecular imaging at > 1000 nm with polymer fluorophores for the first time. Importantly, the high quantum yield of the polymer allows for in vivo, deep-tissue and ultrafast imaging of mouse arterial blood flow with an unprecedented frame rate of > 25 frames per second. The high time resolution results in spatially and time resolved imaging of the blood flow pattern in cardiogram waveform over a single cardiac cycle (~ 200 ms) of a mouse, which has not been observed with fluorescence imaging in this window before.

preprint, 2012, arXiv

Quantised orbital angular momentum transfer and magnetic dichroism in the interaction of electron vortices with matter

Following the very recent experimental realisation of electron vortices, we consider their interaction with matter, in particular the transfer of orbital angular momentum in the context of electron energy loss spectroscopy, and the recently observed dichroism in thin film magnetised iron samples. We show here that orbital angular momentum exchange does indeed occur between electron vortices and the internal electronic-type motion, as well as center of mass motion of atoms in the electric dipole approximation. This contrasts with the case of optical vortices where such transfer only occurs in transitions involving multipoles higher than the dipole. The physical basis of the observed dichroism is explained.
