Researcher profile

Chenyang Yang

Chenyang Yang contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
10 works
0 followers
9 topics
4 close collaborators


Published work

10 published items

preprint, 2022, arXiv

Data Leakage in Notebooks: Static Detection and Better Processes

Data science pipelines to train and evaluate models with machine learning may contain bugs just like any other code. Leakage between training and test data can lead to overestimating the model's accuracy during offline evaluations, possibly leading to deployment of low-quality models in production. Such leakage can happen easily by mistake or by following poor practices, but may be tedious and challenging to detect manually. We develop a static analysis approach to detect common forms of data leakage in data science code. Our evaluation shows that our analysis accurately detects data leakage and that such leakage is pervasive among over 100,000 analyzed public notebooks. We discuss how our static analysis approach can help both practitioners and educators, and how leakage prevention can be designed into the development process.
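As a rough illustration of how train/test leakage can slip in, the sketch below standardizes a feature column twice: once with statistics computed over all rows (leaky) and once with statistics from the training rows only. The data, the split, and the helper `standardize` are hypothetical, and preprocessing leakage is assumed here as one commonly discussed form; the snippet does not reproduce the paper's static analysis.

```python
from statistics import mean, pstdev

def standardize(values, mu, sigma):
    """Scale values using the supplied mean and standard deviation."""
    return [(v - mu) / sigma for v in values]

# Toy feature column (hypothetical); the last two rows act as the held-out test set.
feature = [1.0, 2.0, 3.0, 4.0, 50.0, 60.0]
train, test = feature[:4], feature[4:]

# Leaky: the statistics are computed over training AND test rows.
leaky_mu, leaky_sigma = mean(feature), pstdev(feature)
leaky_train = standardize(train, leaky_mu, leaky_sigma)

# Leak-free: the statistics come from the training rows only and are reused on the test rows.
clean_mu, clean_sigma = mean(train), pstdev(train)
clean_train = standardize(train, clean_mu, clean_sigma)
clean_test = standardize(test, clean_mu, clean_sigma)

print("mean used by leaky version:", leaky_mu)
print("mean used by leak-free version:", clean_mu)
```

In the leaky version the test rows shift the scaling applied to the training data, so information about the held-out set reaches the model before evaluation.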

preprint, 2022, arXiv

Privacy Leakage in Proactive VR Streaming: Modeling and Tradeoff

Proactive tile-based virtual reality (VR) video streaming employs the viewpoint of a user to predict the tiles to be requested, then renders and delivers the predicted tiles before playback. Recently, it has been found that the identity and preference of the user can be inferred from the viewpoint trace uploaded for proactive streaming, which indicates that viewpoint leakage incurs privacy leakage. In this paper, we strive to answer the following questions regarding viewpoint leakage during proactive VR video streaming. When is the viewpoint leaked? Can privacy-preserving approaches (e.g., federated or individual training, using predictors with no need for training, or predicting locally) avoid viewpoint leakage? We find that if the prediction error or the quality of experience (QoE) metric is uploaded for adaptive streaming, the real viewpoint can be inferred even with the privacy-preserving approaches. Then, we define the viewpoint leakage probability to characterize the accuracy of the inferred viewpoint, and derive this probability for uploading the prediction error and for uploading the QoE metric, respectively. We find that the viewpoint leakage probability can be reduced by sacrificing QoE or increasing resources. Simulation with the state-of-the-art predictor over a real dataset shows that such a tradeoff does not exist only in rare cases.

preprint, 2021, arXiv

A Tutorial on Ultra-Reliable and Low-Latency Communications in 6G: Integrating Domain Knowledge into Deep Learning

As one of the key communication scenarios in the 5th and also the 6th generation (6G) of mobile communication networks, ultra-reliable and low-latency communications (URLLC) will be central for the development of various emerging mission-critical applications. State-of-the-art mobile communication systems do not fulfill the end-to-end delay and overall reliability requirements of URLLC. In particular, a holistic framework that takes into account latency, reliability, availability, scalability, and decision making under uncertainty is lacking. Driven by recent breakthroughs in deep neural networks, deep learning algorithms have been considered as promising ways of developing enabling technologies for URLLC in future 6G networks. This tutorial illustrates how domain knowledge (models, analytical tools, and optimization frameworks) of communications and networking can be integrated into different kinds of deep learning algorithms for URLLC. We first provide some background of URLLC and review promising network architectures and deep learning frameworks for 6G. To better illustrate how to improve learning algorithms with domain knowledge, we revisit model-based analytical tools and cross-layer optimization frameworks for URLLC. Following that, we examine the potential of applying supervised/unsupervised deep learning and deep reinforcement learning in URLLC and summarize related open problems. Finally, we provide simulation and experimental results to validate the effectiveness of different learning algorithms and discuss future directions.

preprint, 2021, arXiv

Duration-Squeezing-Aware Communication and Computing for Proactive VR

Proactive tile-based virtual reality video streaming computes and delivers the predicted tiles to be requested before playback. All existing works overlook the important fact that computing and communication (CC) tasks for a segment may squeeze the time for the tasks for the next segment, which leaves less and less time available for later segments. In this paper, we jointly optimize the durations for CC tasks to maximize the completion rate of CC tasks under the task duration-squeezing-aware constraint. To ensure that later segments retain enough time for their tasks, the CC tasks for a segment are not allowed to squeeze the time for computing and delivering the subsequent segment. We find the closed-form optimal solution, from which we identify three regions: a minimum-resource-limited region, an unconditional resource-tradeoff region, and a conditional resource-tradeoff region, which are determined by the total time for proactive CC tasks and the playback duration of a segment. Owing to the duration-squeezing-prohibited constraints, increasing the configured resources may not always improve the completion rate of CC tasks. Numerical results validate the impact of the duration-squeezing-prohibited constraints and illustrate the three regions.

preprint, 2021, arXiv

Federated Learning Based Proactive Handover in Millimeter-wave Vehicular Networks

Proactive handover can avoid frequent handovers and reduce handover delay, which plays an important role in maintaining the quality of service (QoS) for mobile users in millimeter-wave vehicular networks. To reduce the communication cost of training the learning model for proactive handover, we propose a federated learning (FL) framework. The proposed FL framework can accommodate the limited storage capacity of each user, increase the number of users who participate in the FL, and adapt to the dynamic mobility pattern. Simulation results validate the effectiveness of the proposed FL framework. Compared to reactive handover schemes, the proposed handover scheme can reduce unnecessary handovers and improve the QoS of users simultaneously.
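For readers unfamiliar with federated learning, the sketch below shows the server-side aggregation step in its simplest form, a sample-size-weighted average of client parameters (often called FedAvg). The abstract does not state which aggregation rule the proposed framework uses, so FedAvg is an assumption here, and `federated_average`, the client vectors, and the sample counts are all hypothetical.

```python
def federated_average(client_weights, client_sizes):
    """Sample-size-weighted average of per-client parameter vectors (FedAvg-style)."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]

# Hypothetical local models from three users and their local sample counts.
clients = [[1.0, 0.0], [3.0, 2.0], [5.0, 4.0]]
sizes = [10, 30, 60]

# Only parameters travel to the server; the raw training samples stay with each user.
global_weights = federated_average(clients, sizes)
print(global_weights)
```

Because only model parameters are exchanged, the communication cost depends on the model size rather than on the amount of data each user holds.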

preprint, 2020, arXiv

Constructing Deep Neural Networks with a Priori Knowledge of Wireless Tasks

Deep neural networks (DNNs) have been employed for designing wireless systems in many aspects, such as transceiver design, resource optimization, and information prediction. Existing works use either the fully-connected DNN or DNNs with particular architectures developed in other domains. While generating labels for supervised learning and gathering training samples are time-consuming or cost-prohibitive, how to develop DNNs with wireless priors for reducing training complexity remains open. In this paper, we show that two kinds of permutation invariant properties that widely exist in wireless tasks can be harnessed to reduce the number of model parameters and hence the sample and computational complexity for training. We find a special architecture of DNNs whose input-output relationships satisfy the properties, called permutation invariant DNN (PINN), and augment the data with the properties. By learning the impact of the scale of a wireless system, the size of the constructed PINNs can flexibly adapt to the input data dimension. We take predictive resource allocation and interference coordination as examples to show how the PINNs can be employed for learning the optimal policy with unsupervised and supervised learning. Simulation results demonstrate a dramatic gain of the proposed PINNs in terms of reducing training complexity.
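To make the parameter-saving idea concrete, the sketch below builds a single linear layer with shared weights and checks that permuting its inputs permutes its outputs in the same way. Weight sharing of this kind is one common way to obtain such a property; whether it matches the architecture in the paper is an assumption, and `shared_weight_layer`, `permute`, and the values of `a` and `b` are hypothetical.

```python
def shared_weight_layer(x, a, b):
    """Compute y_i = a * x_i + b * sum_{j != i} x_j with only two parameters."""
    total = sum(x)
    return [a * xi + b * (total - xi) for xi in x]

def permute(values, order):
    """Reorder values so that position k holds values[order[k]]."""
    return [values[k] for k in order]

x = [0.5, -1.0, 2.0, 3.5]   # e.g. one input per user (hypothetical values)
order = [2, 0, 3, 1]        # an arbitrary reordering of the users
a, b = 0.7, -0.2            # the two shared weights

out_then_permute = permute(shared_weight_layer(x, a, b), order)
permute_then_out = shared_weight_layer(permute(x, order), a, b)

# The same two parameters also serve a larger input without any change.
larger_output = shared_weight_layer(x + [1.0, 4.0], a, b)
print(out_then_permute, permute_then_out, len(larger_output))
```

A fully-connected layer on four inputs would need sixteen weights, while the shared version needs two regardless of the input length, which is the kind of reduction the abstract describes.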

preprint, 2020, arXiv

Deep Learning for Ultra-Reliable and Low-Latency Communications in 6G Networks

In the future 6th generation networks, ultra-reliable and low-latency communications (URLLC) will lay the foundation for emerging mission-critical applications that have stringent requirements on end-to-end delay and reliability. Existing works on URLLC are mainly based on theoretical models and assumptions. The model-based solutions provide useful insights, but cannot be directly implemented in practice. In this article, we first summarize how to apply data-driven supervised deep learning and deep reinforcement learning in URLLC, and discuss some open problems of these methods. To address these open problems, we develop a multi-level architecture that enables device intelligence, edge intelligence, and cloud intelligence for URLLC. The basic idea is to merge theoretical models and real-world data in analyzing the latency and reliability and training deep neural networks (DNNs). Deep transfer learning is adopted in the architecture to fine-tune the pre-trained DNNs in non-stationary networks. Further considering that the computing capacity at each user and each mobile edge computing server is limited, federated learning is applied to improve the learning efficiency. Finally, we provide some experimental and simulation results and discuss some future directions.

preprint, 2020, arXiv

Optimizing Wireless Systems Using Unsupervised and Reinforced-Unsupervised Deep Learning

Resource allocation and transceivers in wireless networks are usually designed by solving optimization problems subject to specific constraints, which can be formulated as variable or functional optimization. If the objective and constraint functions of a variable optimization problem can be derived, standard numerical algorithms can be applied for finding the optimal solution, but these incur high computational cost when the dimension of the variable is high. To reduce the on-line computational complexity, learning the optimal solution as a function of the environment's status by deep neural networks (DNNs) is an effective approach. DNNs can be trained under the supervision of optimal solutions, but this is not applicable to scenarios without models or to functional optimization where the optimal solutions are hard to obtain. If the objective and constraint functions are unavailable, reinforcement learning can be applied to find the solution of a functional optimization problem, but it is not tailored to optimization problems in wireless networks. In this article, we introduce unsupervised and reinforced-unsupervised learning frameworks for solving both variable and functional optimization problems without the supervision of the optimal solutions. When the mathematical model of the environment is completely known, whether or not the distribution of the environment's status is known, we can invoke unsupervised learning algorithms. When the mathematical model of the environment is incomplete, we introduce reinforced-unsupervised learning algorithms that learn the model by interacting with the environment. Our simulation results confirm the applicability of these learning frameworks by taking a user association problem as an example.

preprint, 2020, arXiv

Proactive Optimization with Machine Learning: Femto-caching with Future Content Popularity

Optimizing resource allocation with predicted information has shown promising gain in boosting network performance and improving user experience. Earlier research efforts focused on optimizing proactive policies under the assumption of knowing the future information. Recently, various techniques have been proposed to predict the required information, and the prediction results were then treated as the true value in the optimization, i.e., "first-predict-then-optimize". In this paper, we introduce a proactive optimization framework for anticipatory resource allocation, where the future information is implicitly predicted under the same objective as the policy optimization in a single step. An optimization problem is formulated to integrate the implicit prediction and the policy optimization, based on the conditional distribution of the future information given the historical observations. To solve such a problem, we transform it equivalently to a problem depending on the joint distribution of future and historical information. Then, we resort to unsupervised learning with neural networks to learn the proactive policy as a function of the past observations via stochastic optimization. We take proactive caching and bandwidth allocation at base stations as a concrete example, where the objective function is the conditional expectation of successful offloading probability taken over the future popularity given the historically observed popularity. We use simulation to validate the proposed framework and compare it with the "first-predict-then-optimize" strategy and a heuristic "end-to-end" optimization strategy with supervised learning.

preprint, 2020, arXiv

Unsupervised Deep Learning for Optimizing Wireless Systems with Instantaneous and Statistic Constraints

Deep neural networks (DNNs) have been introduced for designing wireless policies by approximating the mappings from environmental parameters to solutions of optimization problems. Considering that labeled training samples are hard to obtain, unsupervised deep learning has recently been proposed to solve functional optimization problems with statistical constraints. However, most existing problems in wireless communications are variable optimizations, and many problems have instantaneous constraints. In this paper, we establish a unified framework of using unsupervised deep learning to solve both kinds of problems with both instantaneous and statistical constraints. For a constrained variable optimization, we first convert it into an equivalent functional optimization problem with instantaneous constraints. Then, to ensure the instantaneous constraints in the functional optimization problems, we use a DNN to approximate the Lagrange multiplier functions, trained together with a DNN that approximates the policy. We take two resource allocation problems in ultra-reliable and low-latency communications as examples to illustrate how to guarantee the complex and stringent quality-of-service (QoS) constraints with the framework. Simulation results show that unsupervised learning outperforms supervised learning in terms of QoS violation probability and approximation accuracy of the optimal policy, and can converge rapidly with pre-training.
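The idea of training a policy together with Lagrange multipliers can be sketched on a toy problem. The example below minimizes x**2 subject to x >= 1 by alternating a gradient descent step on the variable with a projected gradient ascent step on the multiplier. It is a deliberately simplified stand-in: a scalar variable and a scalar multiplier replace the two DNNs of the abstract, and the problem, `primal_dual`, the step size, and the iteration count are all assumptions chosen for illustration.

```python
def primal_dual(steps=2000, lr=0.05):
    """Minimize x**2 subject to x >= 1 with primal descent and projected dual ascent."""
    x, lam = 0.0, 0.0
    for _ in range(steps):
        # Lagrangian: L(x, lam) = x**2 + lam * (1 - x)
        x -= lr * (2.0 * x - lam)             # descent step on the primal variable
        lam = max(0.0, lam + lr * (1.0 - x))  # ascent step on the multiplier, kept non-negative
    return x, lam

x_opt, lam_opt = primal_dual()
print(x_opt, lam_opt)
```

No labeled optimal solutions are needed: the iterates are driven only by the gradients of the Lagrangian, and for this toy problem they approach the constrained optimum x = 1 with multiplier 2.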