Source author record

Yuan Luo

Yuan Luo appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

41works

26topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Mechanism Design for Federated Learning with Non-Monotonic Network Effects

Mechanism design is pivotal to federated learning (FL) for maximizing social welfare by coordinating self-interested clients. Existing mechanisms, however, often overlook the network effects of client participation and the diverse model performance requirements (i.e., generalization error) across applications, leading to suboptimal incentives and social welfare, or even inapplicability in real deployments. To address this gap, we explore incentive mechanism design for FL with network effects and application-specific requirements of model performance. We develop a theoretical model to quantify the impact of network effects on heterogeneous client participation, revealing the non-monotonic nature of such effects. Based on these insights, we propose a Model Trading and Sharing (MoTS) framework, which enables clients to obtain FL models through either participation or purchase. To further address clients' strategic behaviors, we design a Social Welfare maximization with Application-aware and Network effects (SWAN) mechanism, exploiting model customer payments for incentivization. Experimental results on a hardware prototype demonstrate that our SWAN mechanism outperforms existing FL mechanisms, improving social welfare by up to $352.42\%$ and reducing extra incentive costs by $93.07\%$.

preprint2023arXiv

Accelerating Iterated Persistent Homology Computations with Warm Starts

Persistent homology is a topological feature used in a variety of applications such as generating features for data analysis and penalizing optimization problems. We develop an approach to accelerate persistent homology computations performed on many similar filtered topological spaces which is based on updating associated matrix factorizations. Our approach improves the update scheme of Cohen-Steiner, Edelsbrunner, and Morozov for permutations by additionally handling addition and deletion of cells in a filtered topological space and by processing changes in a single batch. We show that the complexity of our scheme scales with the number of elementary changes to the filtration which as a result is often less expensive than the full persistent homology computation. Finally, we perform computational experiments demonstrating practical speedups in several situations including feature generation and optimization guided by persistent homology.

preprint2022arXiv

AKI-BERT: a Pre-trained Clinical Language Model for Early Prediction of Acute Kidney Injury

Acute kidney injury (AKI) is a common clinical syndrome characterized by a sudden episode of kidney failure or kidney damage within a few hours or a few days. Accurate early prediction of AKI for patients in ICU who are more likely than others to have AKI can enable timely interventions, and reduce the complications of AKI. Much of the clinical information relevant to AKI is captured in clinical notes that are largely unstructured text and requires advanced natural language processing (NLP) for useful information extraction. On the other hand, pre-trained contextual language models such as Bidirectional Encoder Representations from Transformers (BERT) have improved performances for many NLP tasks in general domain recently. However, few have explored BERT on disease-specific medical domain tasks such as AKI early prediction. In this paper, we try to apply BERT to specific diseases and present an AKI domain-specific pre-trained language model based on BERT (AKI-BERT) that could be used to mine the clinical notes for early prediction of AKI. AKI-BERT is a BERT model pre-trained on the clinical notes of patients having risks for AKI. Our experiments on Medical Information Mart for Intensive Care III (MIMIC-III) dataset demonstrate that AKI-BERT can yield performance improvements for early AKI prediction, thus expanding the utility of the BERT model from general clinical domain to disease-specific domain.

preprint2022arXiv

Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences

Transformers-based models, such as BERT, have dramatically improved the performance for various natural language processing tasks. The clinical knowledge enriched model, namely ClinicalBERT, also achieved state-of-the-art results when performed on clinical named entity recognition and natural language inference tasks. One of the core limitations of these transformers is the substantial memory consumption due to their full self-attention mechanism. To overcome this, long sequence transformer models, e.g. Longformer and BigBird, were proposed with the idea of sparse attention mechanism to reduce the memory usage from quadratic to the sequence length to a linear scale. These models extended the maximum input sequence length from 512 to 4096, which enhanced the ability of modeling long-term dependency and consequently achieved optimal results in a variety of tasks. Inspired by the success of these long sequence transformer models, we introduce two domain enriched language models, namely Clinical-Longformer and Clinical-BigBird, which are pre-trained from large-scale clinical corpora. We evaluate both pre-trained models using 10 baseline tasks including named entity recognition, question answering, and document classification tasks. The results demonstrate that Clinical-Longformer and Clinical-BigBird consistently and significantly outperform ClinicalBERT as well as other short-sequence transformers in all downstream tasks. We have made our source code available at [https://github.com/luoyuanlab/Clinical-Longformer] the pre-trained models available for public download at: [https://huggingface.co/yikuan8/Clinical-Longformer].

preprint2022arXiv

Deep Learning Reveals Patterns of Diverse and Changing Sentiments Towards COVID-19 Vaccines Based on 11 Million Tweets

Over 12 billion doses of COVID-19 vaccines have been administered at the time of writing. However, public perceptions of vaccines have been complex. We analyzed COVID-19 vaccine-related tweets to understand the evolving perceptions of COVID-19 vaccines. We finetuned a deep learning classifier using a state-of-the-art model, XLNet, to detect each tweet's sentiment automatically. We employed validated methods to extract the users' race or ethnicity, gender, age, and geographical locations from user profiles. Incorporating multiple data sources, we assessed the sentiment patterns among subpopulations and juxtaposed them against vaccine uptake data to unravel their interactive patterns. 11,211,672 COVID-19 vaccine-related tweets corresponding to 2,203,681 users over two years were analyzed. The finetuned model for sentiment classification yielded an accuracy of 0.92 on testing set. Users from various demographic groups demonstrated distinct patterns in sentiments towards COVID-19 vaccines. User sentiments became more positive over time, upon which we observed subsequent upswing in the population-level vaccine uptake. Surrounding dates where positive sentiments crest, we detected encouraging news or events regarding vaccine development and distribution. Positive sentiments in pregnancy-related tweets demonstrated a delayed pattern compared with trends in general population, with postponed vaccine uptake trends. Distinctive patterns across subpopulations suggest the need of tailored strategies. Global news and events profoundly involved in shaping users' thoughts on social media. Populations with additional concerns, such as pregnancy, demonstrated more substantial hesitancy since lack of timely recommendations. Feature analysis revealed hesitancies of various subpopulations stemmed from clinical trial logics, risks and complications, and urgency of scientific evidence.

preprint2022arXiv

Distribution Preserving Graph Representation Learning

Graph neural network (GNN) is effective to model graphs for distributed representations of nodes and an entire graph. Recently, research on the expressive power of GNN attracted growing attention. A highly-expressive GNN has the ability to generate discriminative graph representations. However, in the end-to-end training process for a certain graph learning task, a highly-expressive GNN risks generating graph representations overfitting the training data for the target task, while losing information important for the model generalization. In this paper, we propose Distribution Preserving GNN (DP-GNN) - a GNN framework that can improve the generalizability of expressive GNN models by preserving several kinds of distribution information in graph representations and node representations. Besides the generalizability, by applying an expressive GNN backbone, DP-GNN can also have high expressive power. We evaluate the proposed DP-GNN framework on multiple benchmark datasets for graph classification tasks. The experimental results demonstrate that our model achieves state-of-the-art performances.

preprint2022arXiv

ImageGCN: Multi-Relational Image Graph Convolutional Networks for Disease Identification with Chest X-rays

Image representation is a fundamental task in computer vision. However, most of the existing approaches for image representation ignore the relations between images and consider each input image independently. Intuitively, relations between images can help to understand the images and maintain model consistency over related images, leading to better explainability. In this paper, we consider modeling the image-level relations to generate more informative image representations, and propose ImageGCN, an end-to-end graph convolutional network framework for inductive multi-relational image modeling. We apply ImageGCN to chest X-ray images where rich relational information is available for disease identification. Unlike previous image representation models, ImageGCN learns the representation of an image using both its original pixel features and its relationship with other images. Besides learning informative representations for images, ImageGCN can also be used for object detection in a weakly supervised manner. The experimental results on 3 open-source x-ray datasets, ChestX-ray14, CheXpert and MIMIC-CXR demonstrate that ImageGCN can outperform respective baselines in both disease identification and localization tasks and can achieve comparable and often better results than the state-of-the-art methods.

preprint2022arXiv

Machine Learning Applications in Lung Cancer Diagnosis, Treatment and Prognosis

The recent development of imaging and sequencing technologies enables systematic advances in the clinical study of lung cancer. Meanwhile, the human mind is limited in effectively handling and fully utilizing the accumulation of such enormous amounts of data. Machine learning-based approaches play a critical role in integrating and analyzing these large and complex datasets, which have extensively characterized lung cancer through the use of different perspectives from these accrued data. In this article, we provide an overview of machine learning-based approaches that strengthen the varying aspects of lung cancer diagnosis and therapy, including early detection, auxiliary diagnosis, prognosis prediction and immunotherapy practice. Moreover, we highlight the challenges and opportunities for future applications of machine learning in lung cancer.

preprint2022arXiv

MedGCN: Medication recommendation and lab test imputation via graph convolutional networks

Laboratory testing and medication prescription are two of the most important routines in daily clinical practice. Developing an artificial intelligence system that can automatically make lab test imputations and medication recommendations can save costs on potentially redundant lab tests and inform physicians of a more effective prescription. We present an intelligent medical system (named MedGCN) that can automatically recommend the patients' medications based on their incomplete lab tests, and can even accurately estimate the lab values that have not been taken. In our system, we integrate the complex relations between multiple types of medical entities with their inherent features in a heterogeneous graph. Then we model the graph to learn a distributed representation for each entity in the graph based on graph convolutional networks (GCN). By the propagation of graph convolutional networks, the entity representations can incorporate multiple types of medical information that can benefit multiple medical tasks. Moreover, we introduce a cross regularization strategy to reduce overfitting for multi-task training by the interaction between the multiple tasks. In this study, we construct a graph to associate 4 types of medical entities, i.e., patients, encounters, lab tests, and medications, and applied a graph neural network to learn node embeddings for medication recommendation and lab test imputation. we validate our MedGCN model on two real-world datasets: NMEDW and MIMIC-III. The experimental results on both datasets demonstrate that our model can outperform the state-of-the-art in both tasks. We believe that our innovative system can provide a promising and reliable way to assist physicians to make medication prescriptions and to save costs on potentially redundant lab tests.

preprint2022arXiv

Multimodal Machine Learning in Precision Health

As machine learning and artificial intelligence are more frequently being leveraged to tackle problems in the health sector, there has been increased interest in utilizing them in clinical decision-support. This has historically been the case in single modal data such as electronic health record data. Attempts to improve prediction and resemble the multimodal nature of clinical expert decision-making this has been met in the computational field of machine learning by a fusion of disparate data. This review was conducted to summarize this field and identify topics ripe for future research. We conducted this review in accordance with the PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) extension for Scoping Reviews to characterize multi-modal data fusion in health. We used a combination of content analysis and literature searches to establish search strings and databases of PubMed, Google Scholar, and IEEEXplore from 2011 to 2021. A final set of 125 articles were included in the analysis. The most common health areas utilizing multi-modal methods were neurology and oncology. However, there exist a wide breadth of current applications. The most common form of information fusion was early fusion. Notably, there was an improvement in predictive performance performing heterogeneous data fusion. Lacking from the papers were clear clinical deployment strategies and pursuit of FDA-approved tools. These findings provide a map of the current literature on multimodal data fusion as applied to health diagnosis/prognosis problems. Multi-modal machine learning, while more robust in its estimations over unimodal methods, has drawbacks in its scalability and the time-consuming nature of information concatenation.

preprint2022arXiv

Open-Set Recognition of Breast Cancer Treatments

Open-set recognition generalizes a classification task by classifying test samples as one of the known classes from training or "unknown." As novel cancer drug cocktails with improved treatment are continually discovered, predicting cancer treatments can naturally be formulated in terms of an open-set recognition problem. Drawbacks, due to modeling unknown samples during training, arise from straightforward implementations of prior work in healthcare open-set learning. Accordingly, we reframe the problem methodology and apply a recent existing Gaussian mixture variational autoencoder model, which achieves state-of-the-art results for image datasets, to breast cancer patient data. Not only do we obtain more accurate and robust classification results, with a 24.5% average F1 increase compared to a recent method, but we also reexamine open-set recognition in terms of deployability to a clinical setting.

preprint2022arXiv

Topology-Preserving Dimensionality Reduction via Interleaving Optimization

Dimensionality reduction techniques are powerful tools for data preprocessing and visualization which typically come with few guarantees concerning the topological correctness of an embedding. The interleaving distance between the persistent homology of Vietoris-Rips filtrations can be used to identify a scale at which topological features such as clusters or holes in an embedding and original data set are in correspondence. We show how optimization seeking to minimize the interleaving distance can be incorporated into dimensionality reduction algorithms, and explicitly demonstrate its use in finding an optimal linear projection. We demonstrate the utility of this framework to data visualization.

preprint2021arXiv

Deep Learning-Based Anomaly Detection in Cyber-Physical Systems: Progress and Opportunities

Anomaly detection is crucial to ensure the security of cyber-physical systems (CPS). However, due to the increasing complexity of CPSs and more sophisticated attacks, conventional anomaly detection methods, which face the growing volume of data and need domain-specific knowledge, cannot be directly applied to address these challenges. To this end, deep learning-based anomaly detection (DLAD) methods have been proposed. In this paper, we review state-of-the-art DLAD methods in CPSs. We propose a taxonomy in terms of the type of anomalies, strategies, implementation, and evaluation metrics to understand the essential properties of current methods. Further, we utilize this taxonomy to identify and highlight new characteristics and designs in each CPS domain. Also, we discuss the limitations and open problems of these methods. Moreover, to give users insights into choosing proper DLAD methods in practice, we experimentally explore the characteristics of typical neural models, the workflow of DLAD methods, and the running performance of DL models. Finally, we discuss the deficiencies of DL approaches, our findings, and possible directions to improve DLAD methods and motivate future research.

preprint2020arXiv

A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports

Joint image-text embedding extracted from medical images and associated contextual reports is the bedrock for most biomedical vision-and-language (V+L) tasks, including medical visual question answering, clinical image-text retrieval, clinical report auto-generation. In this study, we adopt four pre-trained V+L models: LXMERT, VisualBERT, UNIER and PixelBERT to learn multimodal representation from MIMIC-CXR radiographs and associated reports. The extrinsic evaluation on OpenI dataset shows that in comparison to the pioneering CNN-RNN model, the joint embedding learned by pre-trained V+L models demonstrate performance improvement in the thoracic findings classification task. We conduct an ablation study to analyze the contribution of certain model components and validate the advantage of joint embedding over text-only embedding. We also visualize attention maps to illustrate the attention mechanism of V+L models.

preprint2020arXiv

Finite Blocklength Analysis of Gaussian Random Coding in AWGN Channels under Covert Constraint

This paper considers the achievability and converse bounds on the maximal channel coding rate at a given blocklength and error probability over AWGN channels. The problem stems from covert communication with Gaussian codewords. By re-visiting [18], we first present new and more general achievability bounds for random coding schemes under maximal or average probability of error requirements. Such general bounds are then applied to covert communication in AWGN channels where codewords are generated from Gaussian distribution while meeting the maximal power constraint. Further comparison is made between the new achievability bounds and existing one with deterministic codebooks.

preprint2020arXiv

Med2Meta: Learning Representations of Medical Concepts with Meta-Embeddings

Distributed representations of medical concepts have been used to support downstream clinical tasks recently. Electronic Health Records (EHR) capture different aspects of patients' hospital encounters and serve as a rich source for augmenting clinical decision making by learning robust medical concept embeddings. However, the same medical concept can be recorded in different modalities (e.g., clinical notes, lab results)-with each capturing salient information unique to that modality-and a holistic representation calls for relevant feature ensemble from all information sources. We hypothesize that representations learned from heterogeneous data types would lead to performance enhancement on various clinical informatics and predictive modeling tasks. To this end, our proposed approach makes use of meta-embeddings, embeddings aggregated from learned embeddings. Firstly, modality-specific embeddings for each medical concept is learned with graph autoencoders. The ensemble of all the embeddings is then modeled as a meta-embedding learning problem to incorporate their correlating and complementary information through a joint reconstruction. Empirical results of our model on both quantitative and qualitative clinical evaluations have shown improvements over state-of-the-art embedding models, thus validating our hypothesis.

preprint2020arXiv

Mixture-based Multiple Imputation Model for Clinical Data with a Temporal Dimension

The problem of missing values in multivariable time series is a key challenge in many applications such as clinical data mining. Although many imputation methods show their effectiveness in many applications, few of them are designed to accommodate clinical multivariable time series. In this work, we propose a multiple imputation model that capture both cross-sectional information and temporal correlations. We integrate Gaussian processes with mixture models and introduce individualized mixing weights to handle the variance of predictive confidence of Gaussian process models. The proposed model is compared with several state-of-the-art imputation algorithms on both real-world and synthetic datasets. Experiments show that our best model can provide more accurate imputation than the benchmarks on all of our datasets.

preprint2020arXiv

Open-Set Recognition with Gaussian Mixture Variational Autoencoders

In inference, open-set classification is to either classify a sample into a known class from training or reject it as an unknown class. Existing deep open-set classifiers train explicit closed-set classifiers, in some cases disjointly utilizing reconstruction, which we find dilutes the latent representation's ability to distinguish unknown classes. In contrast, we train our model to cooperatively learn reconstruction and perform class-based clustering in the latent space. With this, our Gaussian mixture variational autoencoder (GMVAE) achieves more accurate and robust open-set classification results, with an average F1 improvement of 29.5%, through extensive experiments aided by analytical results.

preprint2020arXiv

Total Variation Distance Based Performance Analysis of Covert Communication over AWGN Channels in Non-asymptotic Regime

This paper investigates covert communication over an additive white Gaussian noise (AWGN) channel in finite block length regime on the assumption of Gaussian codebooks. We first review some achievability and converse bounds on the throughput under maximal power constraint. From these bounds and the analysis of TVD at the adversary, the first and second asymptotics of covert communication are investigated by the help of some divergences inequalities. Furthermore, the analytic solution of TVD, and approximation expansions which can be easily evaluated with given snr (signal noise ratio) are presented. In this way, the proper power level for covert communication can be approximated with given covert constraint of TVD, which leads to more accurate estimation of the power compared with preceding bounds. Further elaboration on the effect of such asymptotic characteristics on the primary channels's throughput in finite block regime is also provided. The results will be very helpful for understanding the behavior of the total variation distance and practical covert communication.

preprint2020arXiv

Towrad 5G Air Interface Technology: Sparse Code Muliple Access

The fifth generation wireless networks focus on the design of low latency, high data rate, high reliability, and massive connectivity communications. Non-orthogonal multiple access (NOMA) is an essential enabling technology to accommodate the wide range of communication requirements. By coordinating the massive devices within the same resource block on power domain, frequency domain or code domain, NOMA is superior to conventional orthogonal multiple access in terms of the network connectivity, the throughputs of system and etc. Sparse code multiple access (SCMA) is a kind of multi-carrier code domain NOMA and has been studied extensively. The challenges for designing a high quality SCMA system is to seek the feasible encoding and decoding schemes to meet the desired requirements. In this article, we present some recent progresses towards the design of multi-dimensional codebooks, the practical low complexity decoder, as well as the Grant-Free multiple access for SCMA system. In particular, we show how the SCMA codebooks construction are motived by the combined design of multi-dimensional constellation and factor graphs. In addition, various low complexity SCMA decoders are also reviewed with a special focus on sphere decoding. Moreover, based on the framework of belief propagation, the SCMA Grant-Free transmission is introduced and the problem of collision resolution is also discussed.

preprint2016arXiv

A Low Complexity Detection Algorithm for SCMA

Sparse code multiple access (SCMA) is a new multiple access technique which supports massive connectivity. Compared with the current Long Term Evolution (LTE) system, it enables the overloading of active users on limited orthogonal resources and thus meets the requirement of the fifth generation (5G) wireless networks. However, the computation complexity of existing detection algorithms increases exponentially with $d_f$ (the degree of the resource nodes). Although the codebooks are designed to have low density, the detection still takes considerable time. The parameter $d_f$ must be designed to be very small, which largely limits the choice of codebooks. In this paper, a new detection algorithm is proposed by discretizing the probability distribution functions (PDFs) in the layer nodes (variable nodes). Given $M$ as the size of one codebook, the detection complexity of each resource node (function node) is reduced from $O(d_f M^{d_f})$ to $O(d_f^3 \ln (d_f))$. Its detection accuracy can quickly approach that of the previous detection algorithms with the decrease of sampling interval in discretization.

preprint2016arXiv

An Integrated Spectrum and Information Market for Green Cognitive Communications

A database-assisted TV white space network can achieve the goal of green cognitive communication by effectively reducing the energy consumption in cognitive communications. The success of such a novel network relies on a proper business model that provides incentives for all parties involved. In this paper, we propose an integrated spectrum and information market for a database-assisted TV white space network, where the geo-location database serves as both the spectrum market platform and the information market platform. We study the interactions among the database, the spectrum licensee, and unlicensed users by modelling the system as a three-stage sequential decision process. In Stage I, the database and the licensee negotiate regarding the commission for the licensee to use the spectrum market platform. In Stage II, the database and the licensee compete for selling information or channels to unlicensed users. In Stage III, unlicensed users determine whether they should buy exclusive usage right of licensed channels from the licensee or information regarding unlicensed channels from the database. Analyzing such a three-stage model is challenging due to the co-existence of both positive and negative network externalities in the information market. Despite of this, we are able to characterize how the network externalities affect the equilibrium behaviors of all parties involved. We analytically show that in this integrated market, the licensee can never get a market share more than half. Our numerical results further show that the proposed integrated market can improve the network profit up to 87%, compared with a pure information market.

preprint2016arXiv

HySIM: A Hybrid Spectrum and Information Market for TV White Space Networks

We propose a hybrid spectrum and information market for a database-assisted TV white space network, where the geo-location database serves as both a spectrum market platform and an information market platform. We study the inter- actions among the database operator, the spectrum licensee, and unlicensed users systematically, using a three-layer hierarchical model. In Layer I, the database and the licensee negotiate the commission fee that the licensee pays for using the spectrum market platform. In Layer II, the database and the licensee compete for selling information or channels to unlicensed users. In Layer III, unlicensed users determine whether they should buy the exclusive usage right of licensed channels from the licensee, or the information regarding unlicensed channels from the database. Analyzing such a three-layer model is challenging due to the co-existence of both positive and negative network externalities in the information market. We characterize how the network externalities affect the equilibrium behaviours of all parties involved. Our numerical results show that the proposed hybrid market can improve the network profit up to 87%, compared with a pure information market. Meanwhile, the achieved network profit is very close to the coordinated benchmark solution (the gap is less than 4% in our simulation).

preprint2016arXiv

Parametric Prediction from Parametric Agents

We consider a problem of prediction based on opinions elicited from heterogeneous rational agents with private information. Making an accurate prediction with a minimal cost requires a joint design of the incentive mechanism and the prediction algorithm. Such a problem lies at the nexus of statistical learning theory and game theory, and arises in many domains such as consumer surveys and mobile crowdsourcing. In order to elicit heterogeneous agents' private information and incentivize agents with different capabilities to act in the principal's best interest, we design an optimal joint incentive mechanism and prediction algorithm called COPE (COst and Prediction Elicitation), the analysis of which offers several valuable engineering insights. First, when the costs incurred by the agents are linear in the exerted effort, COPE corresponds to a "crowd contending" mechanism, where the principal only employs the agent with the highest capability. Second, when the costs are quadratic, COPE corresponds to a "crowd-sourcing" mechanism that employs multiple agents with different capabilities at the same time. Numerical simulations show that COPE improves the principal's profit and the network profit significantly (larger than 30% in our simulations), comparing to those mechanisms that assume all agents have equal capabilities.

preprint2015arXiv

MINE GOLD to Deliver Green Cognitive Communications

Geo-location database-assisted TV white space network reduces the need of energy-intensive processes (such as spectrum sensing), hence can achieve green cognitive communication effectively. The success of such a network relies on a proper business model that provides incentives for all parties involved. In this paper, we propose MINE GOLD (a Model of INformation markEt for GeO-Location Database), which enables databases to sell the spectrum information to unlicensed white space devices (WSDs) for profit. Specifically, we focus on an oligopoly information market with multiple databases, and study the interactions among databases and WSDs using a two-stage hierarchical model. In Stage I, databases compete to sell information to WSDs by optimizing their information prices. In Stage II, each WSD decides whether and from which database to purchase the information, to maximize his benefit of using the TV white space. We first characterize how the WSDs' purchasing behaviors dynamically evolve, and what is the equilibrium point under fixed information prices from the databases. We then analyze how the system parameters and the databases' pricing decisions affect the market equilibrium, and what is the equilibrium of the database price competition. Our numerical results show that, perhaps counter-intuitively, the databases' aggregate revenue is not monotonic with the number of databases. Moreover, numerical results show that a large degree of positive network externality would improve the databases' revenues and the system performance.

preprint2015arXiv

Spectrum Reservation Contract Design in TV White Space Networks

In this paper, we study a broker-based TV white space market, where unlicensed white space devices (WSDs) purchase white space spectrum from TV licensees via a third-party geo-location database (DB), which serves as a spectrum broker, reserving spectrum from TV licensees and then reselling the reserved spectrum to WSDs. We propose a contract-theoretic framework for the database's spectrum reservation under demand stochasticity and information asymmetry. In such a framework, the database offers a set of contract items in the form of reservation amount and the corresponding payment, and each WSD chooses the best contract item based on its private information. We systematically study the optimal reservation contract design (that maximizes the database's expected profit) under two different risk-bearing schemes: DB-bearing-risk and WSD-bearing-risk, depending on who (the database or the WSDs) will bear the risk of over reservation. Counter-intuitively, we show that the optimal contract under DB-bearing-risk leads to a higher profit for the database and a higher total network profit.

preprint2015arXiv

Ω-deformation of B-twisted gauge theories and the 3d-3d correspondence

We study Ω-deformation of B-twisted gauge theories in two dimensions. As an application, we construct an Ω-deformed, topologically twisted five-dimensional maximally supersymmetric Yang-Mills theory on the product of a Riemann surface $Σ$ and a three-manifold $M$, and show that when $Σ$ is a disk, this theory is equivalent to analytically continued Chern-Simons theory on $M$. Based on these results, we establish a correspondence between three-dimensional $\mathcal{N} = 2$ superconformal theories and analytically continued Chern-Simons theory. Furthermore, we argue that there is a mirror symmetry between Ω-deformed two-dimensional theories.

preprint2014arXiv

A Topological Chern-Simons Sigma Model and New Invariants of Three-Manifolds

We construct a topological Chern-Simons sigma model on a Riemannian three-manifold M with gauge group G whose hyperkahler target space X is equipped with a G-action. Via a perturbative computation of its partition function, we obtain new topological invariants of M that define new weight systems which are characterized by both Lie algebra structure and hyperkahler geometry. In canonically quantizing the sigma model, we find that the partition function on certain M can be expressed in terms of Chern-Simons knot invariants of M and the intersection number of certain G-equivariant cycles in the moduli space of G-covariant maps from M to X. We also construct supersymmetric Wilson loop operators, and via a perturbative computation of their expectation value, we obtain new knot invariants of M that define new knot weight systems which are also characterized by both Lie algebra structure and hyperkahler geometry.

preprint2014arXiv

N=2 supersymmetric gauge theories and quantum integrable systems

We study N=2 supersymmetric gauge theories on the product of a two-sphere and a cylinder. We show that the low-energy dynamics of a BPS sector of such a theory is described by a quantum integrable system, with the Planck constant set by the inverse of the radius of the sphere. If the sphere is replaced with a hemisphere, then our system reduces to an integrable system of the type studied by Nekrasov and Shatashvili. In this case we establish a correspondence between the effective prepotential of the gauge theory and the Yang-Yang function of the integrable system.

preprint2014arXiv

Relative generalized Hamming weights of one-point algebraic geometric codes

Security of linear ramp secret sharing schemes can be characterized by the relative generalized Hamming weights of the involved codes. In this paper we elaborate on the implication of these parameters and we devise a method to estimate their value for general one-point algebraic geometric codes. As it is demonstrated, for Hermitian codes our bound is often tight. Furthermore, for these codes the relative generalized Hamming weights are often much larger than the corresponding generalized Hamming weights.

preprint2014arXiv

Transformation-optics macroscopic visible-light cloaking beyond two dimensions

Transformation optics, a recent geometrical design strategy of controlling light by combining Maxwell's principles of electromagnetism with Einstein's general relativity, promises without precedent an invisibility cloaking device that can render a macroscopic object invisible in three dimensions. However, most previous proof-of-concept transformation-optics cloaking devices focused predominantly on two dimensions, whereas detection of a macroscopic object along its third dimension was always unfailing. Here, we report the first experimental demonstration of transformation-optics macroscopic visible-light cloaking beyond two dimensions. This almost-three-dimensional cloak exhibits three-dimensional (3D) invisibility for illumination near its center (i.e. with a limited field of view), and its ideal wide-angle invisibility performance is preserved in multiple two-dimensional (2D) planes intersecting in the 3D space. Both light ray trajectories and optical path lengths have been verified experimentally at the macroscopic scale, which provides unique evidence on the geometrical nature of transformation optics.

preprint2013arXiv

A note on the five valued conjectures of Johansen and Helleseth and zeta functions

For the complete five-valued cross-correlation distribution between two $m$-sequences ${s_t}$ and ${s_{dt}}$ of period $2^m-1$ that differ by the decimation $d={{2^{2k}+1}\over {2^k+1}}$ where $m$ is odd and $\mbox{gcd}(k,m)=1$, Johansen and Hellseth expressed it in terms of some exponential sums. And two conjectures are presented that are of interest in their own right. In this correspondence we study these conjectures for the particular case where $k=3$, and the cases $k=1,2$ can also be analyzed in a similar process. When $k>3$, the degrees of the relevant polynomials will become higher. Here the multiplicity of the biggest absolute value of the cross-correlation is no more than one-sixth of the multiplicity corresponding the smallest absolute value.

preprint2013arXiv

On the bounds and achievability about the ODPC of $\mathcal{GRM}(2,m)^*$ over prime field for increasing message length

The optimum distance profiles of linear block codes were studied for increasing or decreasing message length while keeping the minimum distances as large as possible, especially for Golay codes and the second-order Reed-Muller codes, etc. Cyclic codes have more efficient encoding and decoding algorithms. In this paper, we investigate the optimum distance profiles with respect to the cyclic subcode chains (ODPCs) of the punctured generalized second-order Reed-Muller codes $\mathcal{GRM}(2,m)^*$ which were applied in Power Control in OFDM Modulations in channels with synchronization, and so on. For this, two standards are considered in the inverse dictionary order, i.e., for increasing message length. Four lower bounds and upper bounds on ODPC are presented, where the lower bounds almost achieve the corresponding upper bounds in some sense. The discussions are over nonbinary prime field.

preprint2013arXiv

On the Optimum Cyclic Subcode Chains of $\mathcal{RM}(2,m)^*$ for Increasing Message Length

The distance profiles of linear block codes can be employed to design variational coding scheme for encoding message with variational length and getting lower decoding error probability by large minimum Hamming distance. %, e.g. the design of TFCI in CDMA and the researches on the second-order Reed-Muller code $\mathcal{RM}(2,m)$, etc. Considering convenience for encoding, we focus on the distance profiles with respect to cyclic subcode chains (DPCs) of cyclic codes over $GF(q)$ with length $n$ such that $\mbox{gcd}(n,q) = 1$. In this paper the optimum DPCs and the corresponding optimum cyclic subcode chains are investigated on the punctured second-order Reed-Muller code $\mathcal{RM}(2,m)^*$ for increasing message length, where two standards on the optimums are studied according to the rhythm of increase.

preprint2013arXiv

The Weight Distributions of a Class of Cyclic Codes with Three Nonzeros over F3

Cyclic codes have efficient encoding and decoding algorithms. The decoding error probability and the undetected error probability are usually bounded by or given from the weight distributions of the codes. Most researches are about the determination of the weight distributions of cyclic codes with few nonzeros, by using quadratic form and exponential sum but limited to low moments. In this paper, we focus on the application of higher moments of the exponential sum to determine the weight distributions of a class of ternary cyclic codes with three nonzeros, combining with not only quadratic form but also MacWilliams' identities. Another application of this paper is to emphasize the computer algebra system Magma for the investigation of the higher moments. In the end, the result is verified by one example using Matlab.

preprint2013arXiv

The weight distributions of some cyclic codes with three or four nonzeros over F3

Because of efficient encoding and decoding algorithms, cyclic codes are an important family of linear block codes, and have applications in communica- tion and storage systems. However, their weight distributions are known only for a few cases mainly on the codes with one or two nonzeros. In this paper, the weight distributions of two classes of cyclic codes with three or four nonzeros are determined.

preprint2012arXiv

Degraded Broadcast Channel with Side Information, Confidential Messages and Noiseless Feedback

In this paper, first, we investigate the model of degraded broadcast channel with side information and confidential messages. This work is from Steinberg's work on the degraded broadcast channel with causal and noncausal side information, and Csisz$\acute{a}$r-Körner's work on broadcast channel with confidential messages. Inner and outer bounds on the capacity-equivocation regions are provided for the noncausal and causal cases. Superposition coding and double-binning technique are used in the corresponding achievability proofs. Then, we investigate the degraded broadcast channel with side information, confidential messages and noiseless feedback. The noiseless feedback is from the non-degraded receiver to the channel encoder. Inner and outer bounds on the capacity-equivocation region are provided for the noncausal case, and the capacity-equivocation region is determined for the causal case. Compared with the model without feedback, we find that the noiseless feedback helps to enlarge the inner bounds for both causal and noncausal cases. In the achievability proof of the feedback model, the noiseless feedback is used as a secret key shared by the non-degraded receiver and the transmitter, and therefore, the code construction for the feedback model is a combination of superposition coding, Gel'fand-Pinsker's binning, block Markov coding and Ahlswede-Cai's secret key on the feedback system.

preprint2012arXiv

Information-Theoretical Security for Several Models of Multiple-Access Channel

Several security models of multiple-access channel (MAC) are investigated. First, we study the degraded MAC with confidential messages, where two users transmit their confidential messages (no common message) to a destination, and each user obtains a degraded version of the output of the MAC. Each user views the other user as a eavesdropper, and wishes to keep its confidential message as secret as possible from the other user. Measuring each user's uncertainty about the other user's confidential message by equivocation, the inner and outer bounds on the capacity-equivocation region for this model have been provided. The result is further explained via the binary and Gaussian examples. Second, the discrete memoryless multiple-access wiretap channel (MAC-WT) is studied, where two users transmit their corresponding confidential messages (no common message) to a legitimate receiver, while an additional wiretapper wishes to obtain the messages via a wiretap channel. This new model is considered into two cases: the general MAC-WT with cooperative encoders, and the degraded MAC-WT with non-cooperative encoders. The capacity-equivocation region is totally determined for the cooperative case, and inner and outer bounds on the capacity-equivocation region are provided for the non-cooperative case. For both cases, the results are further explained via the binary examples.

preprint2011arXiv

Broadband phase-preserved optical elevator

Phase-preserved optical elevator is an optical device to lift up an entire plane virtually without distortion in light path or phase. Using transformation optics, we have predicted and observed the realization of such a broadband phase-preserved optical elevator, made of a natural homogeneous birefringent crystal without resorting to absorptive and narrowband metamaterials involving time-consuming nano-fabrication. In our demonstration, the optical elevator is designed to lift a sheet upwards, and the phase is verified to be preserved always. The camouflage capability is also demonstrated in the presence of adjacent objects of the same scale at will. The elevating device functions in different surrounding media over the wavelength range of 400-700 nm. Our work opens up prospects for studies of light trapping, solar energy, illusion optics, communication, and imaging.

preprint2011arXiv

Evaluation of a Bundling Technique for Parallel Coordinates

We describe a technique for bundled curve representations in parallel-coordinates plots and present a controlled user study evaluating their effectiveness. Replacing the traditional C^0 polygonal lines by C^1 continuous piecewise Bezier curves makes it easier to visually trace data points through each coordinate axis. The resulting Bezier curves can then be bundled to visualize data with given cluster structures. Curve bundles are efficient to compute, provide visual separation between data clusters, reduce visual clutter, and present a clearer overview of the dataset. A controlled user study with 14 participants confirmed the effectiveness of curve bundling for parallel-coordinates visualization: 1) compared to polygonal lines, it is equally capable of revealing correlations between neighboring data attributes; 2) its geometric cues can be effective in displaying cluster information. For some datasets curve bundling allows the color perceptual channel to be applied to other data attributes, while for complex cluster patterns, bundling and color can represent clustering far more clearly than either alone.

preprint2011arXiv

Macroscopic Invisibility Cloak for Visible Light

Invisibility cloaks, a subject that usually occurs in science fiction and myths, have attracted wide interest recently because of their possible realization. The biggest challenge to true invisibility is known to be the cloaking of a macroscopic object in the broad range of wavelengths visible to the human eye. Here we experimentally solve this problem by incorporating the principle of transformation optics into a conventional optical lens fabrication with low-cost materials and simple manufacturing techniques. A transparent cloak made of two pieces of calcite is created. This cloak is able to conceal a macroscopic object with a maximum height of 2 mm, larger than 3500 free-space-wavelength, inside a transparent liquid environment. Its working bandwidth encompassing red, green and blue light is also demonstrated.

Yuan Luo

What is connected

Connect this record

See the researcher in context

Building this map preview

41 published item(s)

Mechanism Design for Federated Learning with Non-Monotonic Network Effects

Accelerating Iterated Persistent Homology Computations with Warm Starts

AKI-BERT: a Pre-trained Clinical Language Model for Early Prediction of Acute Kidney Injury

Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences

Deep Learning Reveals Patterns of Diverse and Changing Sentiments Towards COVID-19 Vaccines Based on 11 Million Tweets

Distribution Preserving Graph Representation Learning

ImageGCN: Multi-Relational Image Graph Convolutional Networks for Disease Identification with Chest X-rays

Machine Learning Applications in Lung Cancer Diagnosis, Treatment and Prognosis

MedGCN: Medication recommendation and lab test imputation via graph convolutional networks

Multimodal Machine Learning in Precision Health

Open-Set Recognition of Breast Cancer Treatments

Topology-Preserving Dimensionality Reduction via Interleaving Optimization

Deep Learning-Based Anomaly Detection in Cyber-Physical Systems: Progress and Opportunities

A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports

Finite Blocklength Analysis of Gaussian Random Coding in AWGN Channels under Covert Constraint

Med2Meta: Learning Representations of Medical Concepts with Meta-Embeddings

Mixture-based Multiple Imputation Model for Clinical Data with a Temporal Dimension

Open-Set Recognition with Gaussian Mixture Variational Autoencoders

Total Variation Distance Based Performance Analysis of Covert Communication over AWGN Channels in Non-asymptotic Regime

Towrad 5G Air Interface Technology: Sparse Code Muliple Access

A Low Complexity Detection Algorithm for SCMA

An Integrated Spectrum and Information Market for Green Cognitive Communications

HySIM: A Hybrid Spectrum and Information Market for TV White Space Networks

Parametric Prediction from Parametric Agents

MINE GOLD to Deliver Green Cognitive Communications

Spectrum Reservation Contract Design in TV White Space Networks

Ω-deformation of B-twisted gauge theories and the 3d-3d correspondence

A Topological Chern-Simons Sigma Model and New Invariants of Three-Manifolds

N=2 supersymmetric gauge theories and quantum integrable systems

Relative generalized Hamming weights of one-point algebraic geometric codes

Transformation-optics macroscopic visible-light cloaking beyond two dimensions

A note on the five valued conjectures of Johansen and Helleseth and zeta functions

On the bounds and achievability about the ODPC of $\mathcal{GRM}(2,m)^*$ over prime field for increasing message length

On the Optimum Cyclic Subcode Chains of $\mathcal{RM}(2,m)^*$ for Increasing Message Length

The Weight Distributions of a Class of Cyclic Codes with Three Nonzeros over F3

The weight distributions of some cyclic codes with three or four nonzeros over F3

Degraded Broadcast Channel with Side Information, Confidential Messages and Noiseless Feedback

Information-Theoretical Security for Several Models of Multiple-Access Channel

Broadband phase-preserved optical elevator

Evaluation of a Bundling Technique for Parallel Coordinates

Macroscopic Invisibility Cloak for Visible Light