Source author record

Yilin Yang

Yilin Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language Machine Learning Artificial Intelligence Computer Vision Human-Computer Interaction math.NA Numerical Analysis Social and Information Networks

Catalog footprint

What is connected

5works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Improving Multilingual Translation by Representation and Gradient Regularization

Multilingual Neural Machine Translation (NMT) enables one model to serve all translation directions, including ones that are unseen during training, i.e. zero-shot translation. Despite being theoretically attractive, current models often produce low quality translations -- commonly failing to even produce outputs in the right target language. In this work, we observe that off-target translation is dominant even in strong multilingual systems, trained on massive multilingual corpora. To address this issue, we propose a joint approach to regularize NMT models at both representation-level and gradient-level. At the representation level, we leverage an auxiliary target language prediction task to regularize decoder outputs to retain information about the target language. At the gradient level, we leverage a small amount of direct data (in thousands of sentence pairs) to regularize model gradients. Our results demonstrate that our approach is highly effective in both reducing off-target translation occurrences and improving zero-shot translation performance by +5.59 and +10.38 BLEU on WMT and OPUS datasets respectively. Moreover, experiments show that our method also works well when the small amount of direct data is not available.

preprint2022arXiv

Sentiment Analysis of Covid-related Reddits

This paper focuses on Sentiment Analysis of Covid-19 related messages from the r/Canada and r/Unitedkingdom subreddits of Reddit. We apply manual annotation and three Machine Learning algorithms to analyze sentiments conveyed in those messages. We use VADER and TextBlob to label messages for Machine Learning experiments. Our results show that removal of shortest and longest messages improves VADER and TextBlob agreement on positive sentiments and F-score of sentiment classification by all the three algorithms

preprint2020arXiv

EchoLock: Towards Low Effort Mobile User Identification

User identification plays a pivotal role in how we interact with our mobile devices. Many existing authentication approaches require active input from the user or specialized sensing hardware, and studies on mobile device usage show significant interest in less inconvenient procedures. In this paper, we propose EchoLock, a low effort identification scheme that validates the user by sensing hand geometry via commodity microphones and speakers. These acoustic signals produce distinct structure-borne sound reflections when contacting the user's hand, which can be used to differentiate between different people based on how they hold their mobile devices. We process these reflections to derive unique acoustic features in both the time and frequency domain, which can effectively represent physiological and behavioral traits, such as hand contours, finger sizes, holding strength, and gesture. Furthermore, learning-based algorithms are developed to robustly identify the user under various environments and conditions. We conduct extensive experiments with 20 participants using different hardware setups in key use case scenarios and study various attack models to demonstrate the performance of our proposed system. Our results show that EchoLock is capable of verifying users with over 90% accuracy, without requiring any active input from the user.

preprint2020arXiv

Numerical Methods for Biomembranes: conforming subdivision methods versus non-conforming PL methods

The Canham-Helfrich-Evans models of biomembranes consist of a family of geometric constrained variational problems. In this article, we compare two classes of numerical methods for these variational problems based on piecewise linear (PL) and subdivision surfaces (SS). Since SS methods are based on spline approximation and can be viewed as higher order versions of PL methods, one may expect that the only difference between the two methods is in the accuracy order. In this paper, we prove that a numerical method based on minimizing any one of the `PL Willmore energies' proposed in the literature would fail to converge to a solution of the continuous problem, whereas a method based on minimization of the bona fide Willmore energy, well-defined for SS but not PL surfaces, succeeds. Motivated by this analysis, we propose also a regularization method for the PL method based on techniques from conformal geometry. We address a number of implementation issues crucial for the efficiency of our solver. A software package called Wmincon accompanies this article, provides parallel implementations of all the relevant geometric functionals. When combined with a standard constrained optimization solver, the geometric variational problems can then be solved numerically. To this end, we realize that some of the available optimization algorithms/solvers are capable of preserving symmetry, while others manage to break symmetry; we explore the consequences of this observation.

preprint2020arXiv

Towards Frequency-Based Explanation for Robust CNN

Current explanation techniques towards a transparent Convolutional Neural Network (CNN) mainly focuses on building connections between the human-understandable input features with models' prediction, overlooking an alternative representation of the input, the frequency components decomposition. In this work, we present an analysis of the connection between the distribution of frequency components in the input dataset and the reasoning process the model learns from the data. We further provide quantification analysis about the contribution of different frequency components toward the model's prediction. We show that the vulnerability of the model against tiny distortions is a result of the model is relying on the high-frequency features, the target features of the adversarial (black and white-box) attackers, to make the prediction. We further show that if the model develops stronger association between the low-frequency component with true labels, the model is more robust, which is the explanation of why adversarially trained models are more robust against tiny distortions.

Yilin Yang

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

Improving Multilingual Translation by Representation and Gradient Regularization

Sentiment Analysis of Covid-related Reddits

EchoLock: Towards Low Effort Mobile User Identification

Numerical Methods for Biomembranes: conforming subdivision methods versus non-conforming PL methods

Towards Frequency-Based Explanation for Robust CNN