Researcher profile

Zhan Shi

Zhan Shi contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
10 works
0 followers
12 topics
4 close collaborators

Published work

10 published item(s)

preprint | 2022 | arXiv

Driver Side and Traffic Based Evaluation Model for On-Street Parking Solutions

Parking has been a painful problem for urban drivers. The parking pain worsens as more people move to cities in the context of global urbanization. Thus, there is demand for a solution that mitigates drivers' parking headaches. Many solutions have tried to resolve the parking issue by predicting parking occupancy. They focused on theoretical accuracy but lacked a standardized model to evaluate these proposals in practice. This paper develops a Driver Side and Traffic Based Evaluation Model (DSTBM), which provides a general evaluation scheme for different parking solutions. Two common parking detection methods, fixed sensing and mobile sensing, are analyzed using DSTBM. The results indicate that, first, DSTBM examines different solutions from the driver's perspective and has no conflicts with other evaluation schemes; second, DSTBM confirms that fixed sensing performs better than mobile sensing in terms of prediction accuracy.

preprint | 2022 | arXiv

Graph-based Active Learning for Semi-supervised Classification of SAR Data

We present a novel method for classification of Synthetic Aperture Radar (SAR) data by combining ideas from graph-based learning and neural network methods within an active learning framework. Graph-based methods in machine learning are based on a similarity graph constructed from the data. When the data consists of raw images composed of scenes, extraneous information can make the classification task more difficult. In recent years, neural network methods have been shown to provide a promising framework for extracting patterns from SAR images. These methods, however, require ample training data to avoid overfitting. At the same time, such training data are often unavailable for applications of interest, such as automatic target recognition (ATR) and SAR data. We use a Convolutional Neural Network Variational Autoencoder (CNNVAE) to embed SAR data into a feature space, and then construct a similarity graph from the embedded data and apply graph-based semi-supervised learning techniques. The CNNVAE feature embedding and graph construction require no labeled data, which reduces overfitting and improves the generalization performance of graph learning at low label rates. Furthermore, the method easily incorporates a human-in-the-loop for active learning in the data-labeling process. We present promising results and compare them to other standard machine learning methods on the Moving and Stationary Target Acquisition and Recognition (MSTAR) dataset for ATR with small amounts of labeled data.

preprint | 2020 | arXiv

Generalised Lipschitz Regularisation Equals Distributional Robustness

The problem of adversarial examples has highlighted the need for a theory of regularisation that is general enough to apply to exotic function classes, such as universal approximators. In response, we give a very general equality result regarding the relationship between distributional robustness and regularisation, as defined with a transportation cost uncertainty set. The theory allows us to (tightly) certify the robustness properties of a Lipschitz-regularised model with very mild assumptions. As a theoretical application we show a new result explicating the connection between adversarial learning and distributional robustness. We then give new results for how to achieve Lipschitz regularisation of kernel classifiers, which are demonstrated experimentally.

preprint | 2020 | arXiv

Improving Image Captioning with Better Use of Captions

Image captioning is a multimodal problem that has drawn extensive attention in both the natural language processing and computer vision communities. In this paper, we present a novel image captioning architecture to better explore semantics available in captions and leverage that to enhance both image representation and caption generation. Our models first construct caption-guided visual relationship graphs that introduce beneficial inductive bias using weakly supervised multi-instance learning. The representation is then enhanced with neighbouring and contextual nodes with their textual and visual features. During generation, the model further incorporates visual relationships using multi-task learning for jointly predicting word and object/predicate tag sequences. We perform extensive experiments on the MSCOCO dataset, showing that the proposed framework significantly outperforms the baselines, resulting in state-of-the-art performance under a wide range of evaluation metrics.

preprint | 2020 | arXiv

Learning Execution through Neural Code Fusion

As the performance of computer systems stagnates due to the end of Moore's Law, there is a need for new models that can understand and optimize the execution of general purpose code. While there is a growing body of work on using Graph Neural Networks (GNNs) to learn representations of source code, these representations do not understand how code dynamically executes. In this work, we propose a new approach to use GNNs to learn fused representations of general source code and its execution. Our approach defines a multi-task GNN over low-level representations of source code and program state (i.e., assembly code and dynamic memory states), converting complex source code constructs and complex data structures into a simpler, more uniform format. We show that this leads to improved performance over similar methods that do not use execution and it opens the door to applying GNN models to new tasks that would not be feasible from static code alone. As an illustration of this, we apply the new model to challenging dynamic tasks (branch prediction and prefetching) from the SPEC CPU benchmark suite, outperforming the state-of-the-art by 26% and 45% respectively. Moreover, we use the learned fused graph embeddings to demonstrate transfer learning with high performance on an indirectly related task (algorithm classification).

preprint | 2020 | arXiv

Results and conjectures on a toy model of depinning

We review recent results and conjectures for a simplified version of the depinning problem in the presence of disorder, which was introduced by Derrida and Retaux in 2014. For this toy model, the depinning transition has been predicted to be of the Berezinskii--Kosterlitz--Thouless type. Here we discuss under which integrability conditions this prediction can be proved and how it is modified otherwise.

preprint | 2020 | arXiv

The critical behaviors and the scaling functions of a coalescence equation

We show that a coalescence equation exhibits a variety of critical behaviors, depending on the initial condition. This equation was introduced a few years ago to understand a toy model studied by Derrida and Retaux to mimic the depinning transition in the presence of disorder. It was shown recently that this toy model exhibits the same critical behaviors as the equation studied in the present work. Here we find several families of exact solutions of this coalescence equation, in particular a family of scaling functions which are closely related to the different possible critical behaviors. These scaling functions lead to new conjectures, in particular on the shapes of the critical trees, that we have checked numerically.

preprint | 2020 | arXiv

The Derrida--Retaux conjecture on recursive models

We are interested in the nearly supercritical regime in a family of max-type recursive models studied by Collet, Eckmann, Glaser and Martin and by Derrida and Retaux, and prove that under a suitable integrability assumption on the initial distribution, the free energy vanishes at the transition with an essential singularity with exponent $\tfrac12$. This gives a weaker answer to a conjecture of Derrida and Retaux. Other behaviours are obtained when the integrability condition is not satisfied.
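
For readers unfamiliar with the max-type recursion behind this abstract, here is a minimal sketch. It assumes the standard form of the Derrida--Retaux recursion, X_{n+1} = max(X_n + X'_n - 1, 0) with X'_n an independent copy of X_n, and restricts to integer-valued initial laws so that the distribution can be evolved exactly. The function names (`step`, `mean`, `free_energy_estimate`) are hypothetical and do not come from the paper. The free energy is the limit of E[X_n] / 2^n; the code only computes that ratio at a finite n, so it is a rough numerical proxy, not a reproduction of the paper's results.

```python
# Illustrative sketch only: exact evolution of an integer-valued law under the
# max-type recursion X_{n+1} = max(X_n + X'_n - 1, 0), X'_n an independent copy.
# All function names are hypothetical; they are not taken from the listed papers.

def step(pmf):
    """Map the law of X_n (pmf[k] = P(X_n = k)) to the law of X_{n+1}."""
    size = len(pmf)
    # Law of the sum of two independent copies (discrete convolution).
    conv = [0.0] * (2 * size - 1)
    for i, a in enumerate(pmf):
        for j, b in enumerate(pmf):
            conv[i + j] += a * b
    # Subtract 1 and take the positive part: sums 0 and 1 both map to 0.
    out = [0.0] * max(len(conv) - 1, 1)
    out[0] = conv[0] + (conv[1] if len(conv) > 1 else 0.0)
    for s in range(2, len(conv)):
        out[s - 1] = conv[s]
    return out


def mean(pmf):
    """Expectation of a law on the nonnegative integers."""
    return sum(k * p for k, p in enumerate(pmf))


def free_energy_estimate(pmf, n_steps):
    """Finite-n proxy E[X_n] / 2**n for the free energy."""
    for _ in range(n_steps):
        pmf = step(pmf)
    return mean(pmf) / 2 ** n_steps
```

As a usage example, `free_energy_estimate([1 - p, 0.0, p], 10)` evaluates the proxy for an initial law that puts mass `p` on 2 and mass `1 - p` on 0. The support roughly doubles at each step, so the exact evolution is only practical for a modest number of iterations.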

preprint | 2020 | arXiv

The stable Derrida--Retaux system at criticality

The Derrida--Retaux recursive system was investigated by Derrida and Retaux (2014) as a hierarchical renormalization model in statistical physics. A prediction of Derrida and Retaux (2014) on the free energy has recently been rigorously proved (Chen, Dagard, Derrida, Hu, Lifshits and Shi (2019+)), confirming the Berezinskii--Kosterlitz--Thouless-type phase transition in the system. Interestingly, it has been established in Chen, Dagard, Derrida, Hu, Lifshits and Shi (2019+) that the prediction is valid only under a certain integrability assumption on the initial distribution, and a new type of universality result has been shown when this integrability assumption is not satisfied. We present a unified approach for systems satisfying a certain domination condition, and give an upper bound for derivatives of all orders of the moment generating function. When the integrability assumption is not satisfied, our result allows us to identify the large-time order of magnitude of the product of the moment generating functions at criticality, confirming and completing a previous result in Collet, Eckmann, Glaser and Martin (1984).