Researcher profile

Tong Ye

Tong Ye contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 17 - Unverified | Verification L1 | Unclaimed author
4 works
0 followers
7 topics
4 close collaborators

Published work

4 published items

Preprint | 2022 | arXiv

Uncertainty Calibration for Deep Audio Classifiers

Although deep neural networks (DNNs) have achieved tremendous success in audio classification tasks, their uncertainty calibration is still under-explored. A well-calibrated model should be accurate when it is certain about its prediction and indicate high uncertainty when it is likely to be inaccurate. In this work, we investigate uncertainty calibration for deep audio classifiers. In particular, we empirically study the performance of popular calibration methods on audio classification datasets: (i) Monte Carlo Dropout, (ii) ensemble, (iii) focal loss, and (iv) spectral-normalized Gaussian process (SNGP). To this end, we evaluate (i)-(iv) on the tasks of environmental sound and music genre classification. Results indicate that uncalibrated deep audio classifiers may be over-confident, and that SNGP performs best and is very efficient on the two datasets of this paper.
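
The abstract's notion of calibration (confidence should match accuracy) can be made concrete with a calibration metric. The sketch below computes expected calibration error (ECE) over equal-width confidence bins. ECE is a common choice but is an assumption here: the abstract does not say which metric the paper uses, and the function name, bin count, and toy inputs are illustrative only.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Weighted average gap between accuracy and mean confidence per bin.

    confidences: predicted probability of the chosen class, each in [0, 1].
    correct:     1/True if the prediction was right, 0/False otherwise.
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    total = len(confidences)
    if total == 0:
        raise ValueError("at least one prediction is required")

    # Per bin: [number of samples, sum of confidences, number correct].
    bins = [[0, 0.0, 0] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # A confidence of exactly 1.0 falls into the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx][0] += 1
        bins[idx][1] += conf
        bins[idx][2] += int(ok)

    ece = 0.0
    for count, conf_sum, n_correct in bins:
        if count == 0:
            continue  # empty bins contribute nothing
        accuracy = n_correct / count
        mean_conf = conf_sum / count
        ece += (count / total) * abs(accuracy - mean_conf)
    return ece


# Toy over-confident classifier: 90% confident but right only half the time.
overconfident = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [1, 1, 0, 0])
```

A value near 0 indicates that confidence tracks accuracy; the over-confident toy case above yields a gap of roughly 0.4.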

Preprint | 2022 | arXiv

VU-BERT: A Unified framework for Visual Dialog

The visual dialog task attempts to train an agent to answer multi-turn questions given an image, which requires a deep understanding of the interactions between the image and the dialog history. Existing research tends to employ modality-specific modules to model these interactions, which can be cumbersome to use. To fill this gap, we propose a unified framework for image-text joint embedding, named VU-BERT, and, for the first time in visual dialog tasks, apply patch projection to obtain vision embeddings, which simplifies the model. The model is trained on two tasks: masked language modeling and next utterance retrieval. These tasks help in learning visual concepts, utterance dependencies, and the relationships between the two modalities. Finally, VU-BERT achieves competitive performance (an NDCG score of 0.7287) on the VisDial v1.0 dataset.
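
Patch projection, as generally understood, splits an image into non-overlapping patches, flattens each patch, and maps it to an embedding with a single linear layer. The sketch below shows that idea in plain Python on a single-channel toy image. It is not VU-BERT's implementation: the function names, the 4x4 image, and the hand-written weight matrix are all assumptions made for illustration.

```python
def extract_patches(image, patch_size):
    """Split a 2-D grid (list of rows) into flattened, non-overlapping patches."""
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")
    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten the patch row by row.
            patches.append([
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ])
    return patches


def project(patches, weights):
    """Multiply each flattened patch (length P*P) by a (P*P x D) weight matrix."""
    dim = len(weights[0])
    return [
        [sum(value * w_row[j] for value, w_row in zip(patch, weights))
         for j in range(dim)]
        for patch in patches
    ]


# Toy 4x4 image with pixel values 0..15, split into four 2x2 patches.
toy_image = [[4 * r + c for c in range(4)] for r in range(4)]
# Hypothetical 4x2 projection matrix (a trained model would learn this).
toy_weights = [[1, 0], [0, 1], [1, 0], [0, 1]]
toy_embeddings = project(extract_patches(toy_image, 2), toy_weights)
```

Each of the four patches becomes one 2-dimensional vector, which in a joint image-text model would be placed in the same input sequence as the text token embeddings.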

Preprint | 2020 | arXiv

Designing and Analysis of A Wi-Fi Data Offloading Strategy Catering for the Preference of Mobile Users

In recent years, offloading mobile traffic through Wi-Fi has emerged as a potential solution to lower the communication cost for mobile users. Users hope to reduce the cost while keeping the delay within an acceptable range through Wi-Fi offloading. Moreover, different users have different sensitivities to cost and delay. How to make a proper cost-delay tradeoff according to the user's preference is the key issue in the design of an offloading strategy. To address this issue, we propose a preference-oriented offloading strategy for current commercial terminals, which can transmit traffic over only one channel at a time. We model the strategy as a three-state M/MMSP/1 queueing system, whose service process is a Markov modulated service process (MMSP), and obtain the structured solutions by establishing a hybrid embedded Markov chain. Our analysis shows that, given the user's preference, there exists an optimal deadline that maximizes the utility, which is defined as a linear combination of the cost and the delay. We also provide a method to select the optimal deadline. Our simulation demonstrates that the strategy with the optimal deadline achieves good performance.
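
The deadline selection described above can be illustrated with a small grid search. This is a toy sketch, not the paper's method: the utility is assumed to be the negated weighted sum of cost and delay (so that maximizing utility lowers both), and the cost and delay curves are hypothetical stand-ins for the quantities the paper derives from its queueing model.

```python
def utility(cost, delay, cost_weight):
    """Assumed form: negated weighted sum, with cost_weight in [0, 1]."""
    return -(cost_weight * cost + (1.0 - cost_weight) * delay)


def best_deadline(deadlines, cost_fn, delay_fn, cost_weight):
    """Return the candidate deadline with the highest utility."""
    return max(
        deadlines,
        key=lambda d: utility(cost_fn(d), delay_fn(d), cost_weight),
    )


# Hypothetical curves: waiting longer for Wi-Fi lowers cost but adds delay.
def toy_cost(deadline):
    return 1.0 / (1.0 + deadline)


def toy_delay(deadline):
    return 0.1 * deadline


candidates = [0.5 * k for k in range(11)]  # 0.0, 0.5, ..., 5.0

balanced = best_deadline(candidates, toy_cost, toy_delay, cost_weight=0.5)
delay_sensitive = best_deadline(candidates, toy_cost, toy_delay, cost_weight=0.1)
cost_sensitive = best_deadline(candidates, toy_cost, toy_delay, cost_weight=0.9)
```

With these toy curves, a delay-sensitive user ends up with the shortest deadline on the grid and a cost-sensitive user with the longest, which mirrors the preference-dependent tradeoff the abstract describes.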

Preprint | 2020 | arXiv

Modular WSS-based OXCs for Large-Scale Optical Networks

The explosive growth of broadband applications calls for large-scale optical cross-connects (OXCs). However, the classical wavelength selective switch (WSS) based OXC is not scalable in terms of the size of the WSSs employed and the cabling complexity. To solve this problem, we propose a three-phase approach to construct a modular WSS-based OXC. In phase 1, we factorize the interconnection network between the input stage and the output stage of the traditional OXC into a set of small-size interconnection networks. In phase 2, we decompose each WSS into a two-stage cascaded structure of small-size WSSs. In phase 3, we combine the small-size interconnection networks with the small-size WSSs to form a set of small-size OXC modules. Finally, we obtain a modular OXC, which is a network of small-size OXCs. Like the classical OXC, the modular OXC is nonblocking at each wavelength and possesses a self-routing property. Our analysis shows that the modular OXC has small cabling complexity and acceptable physical-layer performance.
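
The two-stage decomposition in phase 2 can be pictured with a generic port-addressing sketch: a 1 x (a*b) selector is replaced by one 1 x a switch feeding a switches of size 1 x b, and an output index is routed by splitting it into two digits. This illustrates cascaded selection in general and is not the paper's construction; the function names and the sizes a = 3, b = 4 are assumptions.

```python
def two_stage_route(output_port, second_stage_size):
    """Split an output index into (first-stage port, second-stage port)."""
    if output_port < 0:
        raise ValueError("output_port must be non-negative")
    return divmod(output_port, second_stage_size)


def cascade_outputs(first_stage_size, second_stage_size):
    """List the output indices reachable through the two-stage cascade."""
    return [
        first * second_stage_size + second
        for first in range(first_stage_size)
        for second in range(second_stage_size)
    ]


def switch_count(first_stage_size):
    """One first-stage switch plus one second-stage switch per first-stage port."""
    return 1 + first_stage_size


# A 1x12 selector built from one 1x3 switch and three 1x4 switches.
route_to_port_7 = two_stage_route(7, second_stage_size=4)
reachable = cascade_outputs(first_stage_size=3, second_stage_size=4)
```

Because the two digits of the output index directly select the port at each stage, no global routing table is needed, which is the flavor of self-routing the abstract refers to.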