Source author record

Yuan Zhang

Yuan Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Catalog footprint

What is connected

87 works

45 topics

4 close collaborators


preprint, 2026, arXiv

Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

We present JoyAI-Image, a unified multimodal foundation model for visual understanding, text-to-image generation, and instruction-guided image editing. JoyAI-Image couples a spatially enhanced Multimodal Large Language Model (MLLM) with a Multimodal Diffusion Transformer (MMDiT), allowing perception and generation to interact through a shared multimodal interface. Around this architecture, we build a scalable training recipe that combines unified instruction tuning, long-text rendering supervision, spatially grounded data, and both general and spatial editing signals. This design gives the model broad multimodal capability while strengthening geometry-aware reasoning and controllable visual synthesis. Experiments across understanding, generation, long-text rendering, and editing benchmarks show that JoyAI-Image achieves state-of-the-art or highly competitive performance. More importantly, the bidirectional loop between enhanced understanding, controllable spatial editing, and novel-view-assisted reasoning enables the model to move beyond general visual competence toward stronger spatial intelligence. These results suggest a promising path for unified visual models in downstream applications such as vision-language-action systems and world models.

preprint, 2026, arXiv

Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control

Natural language is an intuitive interface for humanoid robots, yet streaming whole-body control requires control representations that are executable now and anticipatory of future physical transitions. Existing language-conditioned humanoid systems typically generate kinematic references that a low-level tracker must repair reactively, or use latent/action policies whose outputs do not explicitly encode upcoming contact changes, support transfers, and balance preparation. We propose DAJI (Dynamics-Aligned Joint Intent), a hierarchical framework that learns an anticipatory joint-intent interface between language generation and closed-loop control. DAJI-Act distills a future-aware teacher into a deployable diffusion action policy through student-driven rollouts, while DAJI-Flow autoregressively generates future intent chunks from language and intent history. Experiments show that DAJI achieves strong results in anticipatory latent learning, single-instruction generation, and streaming instruction following, reaching 94.42% rollout success on HumanML3D-style generation and 0.152 subsequence FID on BABEL.

preprint, 2026, arXiv

First-Order Efficiency for Probabilistic Value Estimation via A Statistical Viewpoint

Probabilistic values, including Shapley values and semivalues, provide a model-agnostic framework to attribute the behavior of a black-box model to data points or features, with a wide range of applications including explainable artificial intelligence and data valuation. However, their exact computation requires utility evaluations over exponentially many coalitions, making Monte Carlo approximation essential in modern machine learning applications. Existing estimators are often developed through different identification strategies, including weighted averages, self-normalized weighting, regression adjustment, and weighted least squares. Our key observation is that these seemingly distinct constructions share a common first-order error structure, in which the leading term is an augmented inverse-probability weighted influence term determined by the sampling law and a working surrogate function. This first-order representation yields an explicit expression for the leading mean squared error (MSE), which characterizes how the sampling law and the surrogate jointly determine statistical efficiency. Guided by this criterion, we propose an Efficiency-Aware Surrogate-adjusted Estimator (EASE) that directly chooses the sampling law and surrogate to minimize the first-order MSE. We demonstrate that EASE consistently outperforms state-of-the-art estimators for various probabilistic values.
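For orientation, the Monte Carlo approximation mentioned in this abstract can be illustrated with the textbook permutation-sampling estimator of the Shapley value. This is a minimal sketch and not the EASE estimator from the paper; the function name `shapley_permutation_estimate`, the toy additive utility, and all parameter values are assumptions made for illustration.

```python
import random
from typing import Callable, FrozenSet, List


def shapley_permutation_estimate(
    n_players: int,
    utility: Callable[[FrozenSet[int]], float],
    n_permutations: int = 2000,
    seed: int = 0,
) -> List[float]:
    """Estimate Shapley values by averaging marginal contributions
    over uniformly sampled player orderings (hypothetical helper)."""
    rng = random.Random(seed)
    totals = [0.0] * n_players
    players = list(range(n_players))
    for _ in range(n_permutations):
        rng.shuffle(players)
        coalition = set()
        previous = utility(frozenset(coalition))
        for player in players:
            # Marginal contribution of `player` when joining the coalition
            # formed by the players that precede it in this ordering.
            coalition.add(player)
            current = utility(frozenset(coalition))
            totals[player] += current - previous
            previous = current
    return [total / n_permutations for total in totals]


# Toy additive game (an assumption for illustration): the utility of a
# coalition is the sum of its members' weights, so each player's exact
# Shapley value equals its own weight.
weights = [1.0, 2.0, 3.0]


def additive_utility(coalition: FrozenSet[int]) -> float:
    return sum(weights[i] for i in coalition)


estimates = shapley_permutation_estimate(3, additive_utility, n_permutations=200)
print(estimates)
```

In the additive toy game every marginal contribution equals the player's weight, so the estimate is exact; for general utilities the estimator is noisy, and that variance is what the abstract's efficiency analysis targets.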

preprint, 2026, arXiv

FlowAct-R1: Towards Interactive Humanoid Video Generation

Interactive humanoid video generation aims to synthesize lifelike visual agents that can engage with humans through continuous and responsive video. Despite recent advances in video synthesis, existing methods often grapple with the trade-off between high-fidelity synthesis and real-time interaction requirements. In this paper, we propose FlowAct-R1, a framework specifically designed for real-time interactive humanoid video generation. Built upon an MMDiT architecture, FlowAct-R1 enables the streaming synthesis of video with arbitrary durations while maintaining low-latency responsiveness. We introduce a chunkwise diffusion forcing strategy, complemented by a novel self-forcing variant, to alleviate error accumulation and ensure long-term temporal consistency during continuous interaction. By leveraging efficient distillation and system-level optimizations, our framework achieves a stable 25 fps at 480p resolution with a time-to-first-frame (TTFF) of only around 1.5 seconds. The proposed method provides holistic and fine-grained full-body control, enabling the agent to transition naturally between diverse behavioral states in interactive scenarios. Experimental results demonstrate that FlowAct-R1 achieves exceptional behavioral vividness and perceptual realism, while maintaining robust generalization across diverse character styles.

preprint, 2026, arXiv

Generalized Priority-Aware Shapley Value

Shapley value and its priority-aware extensions are widely used for valuation in machine learning, but existing methods require pairwise priority to be binary and acyclic, a restriction spectacularly violated in real-data examples such as aggregated human preferences and multi-criterion comparisons. We introduce the generalized priority-aware Shapley value (GPASV), a random order value defined on arbitrary directed weighted priority graphs, in which pairwise edges penalize rather than forbid order violations. GPASV covers a range of classical models as boundary cases. We establish GPASV through an axiomatic characterization, develop the associated computational methods, and introduce a priority sweeping diagnostic extending PASV's. We apply GPASV to LLM ensemble valuation on the cyclic Chatbot Arena preference graph, illustrating that priority-aware valuation is not a one-button operation: different balances of pairwise graph priority versus individual soft priority produce substantively different valuations of the same data.

preprint, 2026, arXiv

SEED: Targeted Data Selection by Weighted Independent Set

Data selection seeks to identify a compact yet informative subset from large-scale training corpora, balancing sample quality against collection diversity. We formulate this problem as a Weighted Independent Set (WIS) on a similarity graph, where nodes represent data samples weighted by influence, and edges connect semantically redundant pairs. This formulation naturally yields subsets that are simultaneously high-quality and diverse. However, two challenges arise in practice: naive node weights fail to distinguish informative signals from gradient noise, and edge construction under heterogeneous domain distributions produces structurally imbalanced graphs that bias selection toward sparse regions. To address these issues, we introduce two principled refinements from a unified graph perspective: (1) node value calibration that restricts influence estimation to the bilateral salient subspace to ground node importance in task-relevant signals rather than surface-level statistics; (2) local scale normalization that adapts edge thresholds to local neighborhood density, mitigating graph imbalance induced by cross-domain distribution shifts. Together, these components yield a robust and scalable data selection pipeline dubbed SEED. We further construct Honeybee-Remake-SEED-200K, a compact multimodal dataset curated by SEED. Extensive experiments show that SEED consistently outperforms state-of-the-art methods on instruction tuning, visual instruction tuning, and semantic segmentation across diverse model families.
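The Weighted Independent Set formulation described in this abstract can be illustrated with a simple greedy heuristic: repeatedly keep the highest-weight sample and discard its redundant neighbors. This is a sketch under stated assumptions and not the SEED pipeline; the function name, the toy weights, and the toy edges are hypothetical, and the abstract does not say which WIS solver the authors use.

```python
from typing import Dict, List, Set, Tuple


def greedy_weighted_independent_set(
    weights: Dict[str, float],
    edges: List[Tuple[str, str]],
) -> List[str]:
    """Greedy heuristic for WIS (hypothetical helper): visit nodes in
    order of decreasing weight and keep a node only if none of its
    neighbors has been kept already."""
    neighbors: Dict[str, Set[str]] = {node: set() for node in weights}
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    selected: List[str] = []
    blocked: Set[str] = set()
    # Ties in weight are broken by node name so the result is deterministic.
    for node in sorted(weights, key=lambda n: (-weights[n], n)):
        if node in blocked:
            continue
        selected.append(node)
        blocked.add(node)
        blocked |= neighbors[node]  # redundant neighbors can no longer be picked
    return selected


# Toy similarity graph (assumed values): "a"/"b" are near-duplicates,
# and so are "c"/"d". Weights stand in for per-sample influence scores.
toy_weights = {"a": 0.9, "b": 0.8, "c": 0.5, "d": 0.4}
toy_edges = [("a", "b"), ("c", "d")]

subset = greedy_weighted_independent_set(toy_weights, toy_edges)
print(subset)
```

The greedy rule gives no optimality guarantee in general; it is shown only to make concrete how node weights encode quality and edges encode redundancy.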

preprint, 2026, arXiv

Teacher-Feature Drifting: One-Step Diffusion Distillation with Pretrained Diffusion Representations

Sampling from pretrained diffusion and flow-matching models typically requires many forward passes to generate diverse and high-fidelity images. Existing distillation methods often rely on multiple auxiliary networks, carefully designed training stages, or complex optimization pipelines. In this work, we revisit the recently proposed Drifting Model objective and show that a single drifting loss can be directly used to simplify one-step distillation. A key observation is that the pretrained diffusion teacher itself already provides a strong representation space. Unlike the original Drifting Model, which relies on an additional pretrained feature extractor, we use intermediate hidden states of the pretrained teacher model as the feature representation. This removes the need for training or introducing an extra representation network while preserving a semantically meaningful feature geometry for drifting. Furthermore, we introduce a lightweight mode coverage loss to mitigate mode collapse during distillation and encourage the student generator to cover diverse teacher-supported regions. Extensive experiments on ImageNet and SDXL demonstrate that our method achieves efficient one-step generation with competitive image quality and diversity, with FID scores of 1.58 on ImageNet-64$\times$64 and 18.4 on SDXL, while substantially simplifying the overall distillation framework.

preprint, 2026, arXiv

TextLDM: Language Modeling with Continuous Latent Diffusion

Diffusion Transformers (DiT) trained with flow matching in a VAE latent space have unified visual generation across images and videos. A natural next step toward a single architecture for both generation (visual synthesis) and understanding (text generation) is to apply this framework to language modeling. We propose TextLDM, which transfers the visual latent diffusion recipe to text generation with minimal architectural modification. A Transformer-based VAE maps discrete tokens to continuous latents, enhanced by Representation Alignment (REPA) with a frozen pretrained language model to produce representations effective for conditional denoising. A standard DiT then performs flow matching in this latent space, identical in architecture to its visual counterpart. The central challenge we address is obtaining high-quality continuous text representations: we find that reconstruction fidelity alone is insufficient, and that aligning latent features with a pretrained language model via REPA is critical for downstream generation quality. Trained from scratch on OpenWebText2, TextLDM substantially outperforms prior diffusion language models and matches GPT-2 under the same settings. Our results establish that the visual DiT recipe transfers effectively to language, taking a concrete step toward unified diffusion architectures for multimodal generation and understanding.

preprint, 2026, arXiv

Toward Natural and Companionable Virtual Agents via Cross-Temporal Emotional Modeling

Recent advances in foundation models have enabled conversational agents that aim for sustained companionship rather than mere task completion. Yet most remain unable to support natural, long-term companion-like interactions, resulting in experiences that feel episodic and inauthentic. We argue that current agents overlook cross-temporal modeling of agents' social behaviors and internal emotions: generated behaviors rarely influence an agent's emotional state, and emotional states seldom shape subsequent behaviors. We present Cross-Temporal Emotion Modeling (CTEM), a framework that links long-term behavioral history to moment-to-moment emotional expression. CTEM establishes a closed loop where past experiences update an evolving emotional state; this state conditions immediate interactions; and user feedback continually revises both memory and emotional state, enabling reflection and anticipation. We instantiate CTEM as Auri, a companion agent on an instant-messaging platform, and report a 21-day in-the-wild study showing that CTEM improves perceived naturalness, coherence, and emotional harmony.

preprint, 2026, arXiv

Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising

We propose X-WAM, a Unified 4D World Model that combines real-time robotic action execution and high-fidelity 4D world synthesis (video + 3D reconstruction) in a single framework, addressing the critical limitations of prior unified world models (e.g., UWM) that only model 2D pixel-space and fail to balance action efficiency and world modeling quality. To leverage the strong visual priors of pretrained video diffusion models, X-WAM imagines the future world by predicting multi-view RGB-D videos, and obtains spatial information efficiently through a lightweight structural adaptation: replicating the final few blocks of the pretrained Diffusion Transformer into a dedicated depth prediction branch for the reconstruction of future spatial information. Moreover, we propose Asynchronous Noise Sampling (ANS) to jointly optimize generation quality and action decoding efficiency. ANS applies a specialized asynchronous denoising schedule during inference, which rapidly decodes actions with fewer steps to enable efficient real-time execution, while dedicating the full sequence of steps to generate high-fidelity video. Rather than entirely decoupling the timesteps during training, ANS samples from their joint distribution to align with the inference distribution. Pretrained on over 5,800 hours of robotic data, X-WAM achieves average success rates of 79.2% and 90.7% on the RoboCasa and RoboTwin 2.0 benchmarks, respectively, while producing high-fidelity 4D reconstruction and generation surpassing existing methods in both visual and geometric metrics.

preprint, 2024, arXiv

Optimal Nonparametric Inference on Network Effects with Dependent Edges

Testing network effects in weighted directed networks is a foundational problem in econometrics, sociology, and psychology. Yet, the prevalent edge dependency poses a significant methodological challenge. Most existing methods are model-based and come with stringent assumptions, limiting their applicability. In response, we introduce a novel, fully nonparametric framework that requires only minimal regularity assumptions. While inspired by recent developments in the $U$-statistic literature (arXiv:1712.00771, arXiv:2004.06615), our approach notably broadens their scope. Specifically, we identify and carefully address the challenge of indeterminate degeneracy in the test statistics, a problem that the aforementioned tools do not handle. We establish a Berry-Esseen-type bound for the accuracy of type-I error rate control. Using original analysis, we also prove the minimax optimality of our test's power. Simulations underscore the superiority of our method in computation speed, accuracy, and numerical robustness compared to competing methods. We also apply our method to the U.S. faculty hiring network data and discover intriguing findings.

preprint, 2024, arXiv

Theoretical Study on Superradiant Raman Scattering with Rubidium Atoms in An Optical Cavity

Superradiant Raman scattering of rubidium atoms was explored in an experiment [Nature 484, 78 (2012)] to prove the concept of the superradiant laser, which attracts significant attention in quantum metrology due to its expected ultra-narrow linewidth, down to the millihertz level. To better understand the physics involved in this experiment, we have developed a quantum master equation theory by treating the rubidium atoms as three-level systems, and coupling them with a dressed laser and an optical cavity. Our simulations show different superradiant Raman scattering pulses for systems in the crossover and strong coupling regimes, and a shifted and broader spectrum of the steady-state Raman scattering. Thus, our studies provide a unified view of the superradiant Raman scattering pulses, and an alternative explanation for the broad spectrum of the steady-state Raman scattering observed in the experiment. In the future, our theory can be readily applied to study other interesting phenomena relying on superradiant Raman scattering, such as magnetic field sensing, real-time tracking of quantum phase, and the Dicke phase transition of non-equilibrium dynamics.

preprint, 2023, arXiv

SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning

Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in the global reward game; however, its underlying mechanism is not yet fully understood. This paper studies a theoretical framework for value factorisation with interpretability via Shapley value theory. We generalise the Shapley value to the Markov convex game, calling the result the Markov Shapley value (MSV), and apply it as a value factorisation method in the global reward game, which is obtained by the equivalence between the two games. Based on the properties of MSV, we derive the Shapley-Bellman optimality equation (SBOE) to evaluate the optimal MSV, which corresponds to an optimal joint deterministic policy. Furthermore, we propose the Shapley-Bellman operator (SBO), which is proved to solve the SBOE. With a stochastic approximation and some transformations, a new MARL algorithm called Shapley Q-learning (SHAQ) is established, the implementation of which is guided by the theoretical results of SBO and MSV. We also discuss the relationship between SHAQ and relevant value factorisation methods. In the experiments, SHAQ exhibits not only superior performance on all tasks but also interpretability that agrees with the theoretical analysis. The implementation of this paper is available at https://github.com/hsvgbkhgbv/shapley-q-learning.

preprint, 2022, arXiv

Asymptotic theory in network models with covariates and a growing number of node parameters

We propose a general model that jointly characterizes degree heterogeneity and homophily in weighted, undirected networks. We present a moment estimation method using node degrees and homophily statistics. We establish consistency and asymptotic normality of our estimator using novel analysis. We apply our general framework to three applications, including both exponential family and non-exponential family models. Comprehensive numerical studies and a data example also demonstrate the usefulness of our method.

preprint, 2022, arXiv

Capacity Analysis of Holographic MIMO Channels with Practical Constraints

Holographic Multiple-Input and Multiple-Output (MIMO) is envisioned as a promising technology to realize unprecedented spectral efficiency by integrating a large number of antennas into a compact space. Most research on holographic MIMO is based on isotropic scattering environments, and the antenna gain is assumed to be unlimited by deployment space. However, the channel might not satisfy isotropic scattering because of generalized angle distributions, and in reality the antenna gain is limited by the array aperture. In this letter, we analyze the holographic MIMO channel capacity under practical angle distribution and array aperture constraints. First, we calculate the spectral density for generalized angle distributions by introducing a wavenumber domain-based method. Then, the capacity under generalized angle distributions is analyzed and two different aperture schemes are considered. Finally, numerical results show that the capacity is clearly affected by the angle distribution at high signal-to-noise ratio (SNR) but hardly affected at low SNR, and that the capacity will not increase indefinitely with antenna density due to the array aperture constraint.

preprint, 2022, arXiv

Cavity Quantum Electrodynamics Effects of Optically Cooled Nitrogen-Vacancy Centers Coupled to a High Frequency Microwave Resonator

Recent experiments demonstrated the cooling of a microwave mode of a high-quality dielectric resonator coupled to optically cooled nitrogen-vacancy (NV) spins in diamond. Our recent theoretical study [arXiv:2110.10950] pointed out that the cooled NV spins can be used to realize cavity quantum electrodynamics (CQED) effects at room temperature. In this article, we propose to modify the setup used in a recent diamond maser experiment [Nature 555, 493-496 (2018)], which features a higher spin transition frequency, a lower spin-dephasing rate and a stronger NV spins-resonator coupling, to realize better microwave mode cooling and the room-temperature CQED effects. To describe more precisely the optical spin cooling and the collective spin-resonator coupling, we extend the standard Jaynes-Cummings model to account for the rich electronic and spin levels of the NV centers. Our calculations show that for the proposed setup it is possible to cool the microwave mode from $293$ K (room temperature) to $116$ K, which is about $72$ K lower than the previous records, and to study the intriguing dynamics of the CQED effects under the weak-to-strong coupling transition by varying the laser power. With simple modifications, our model can be applied to, e.g., other solid-state spins or triplet spins of pentacene molecules, and to investigate other effects, such as the operations of pulsed and continuous-wave masing.

preprint, 2022, arXiv

Controllable Semantic Parsing via Retrieval Augmentation

In practical applications of semantic parsing, we often want to rapidly change the behavior of the parser, such as enabling it to handle queries in a new domain, or changing its predictions on certain targeted queries. While we can introduce new training examples exhibiting the target behavior, a mechanism for enacting such behavior changes without expensive model re-training would be preferable. To this end, we propose ControllAble Semantic Parser via Exemplar Retrieval (CASPER). Given an input query, the parser retrieves related exemplars from a retrieval index, augments them to the query, and then applies a generative seq2seq model to produce an output parse. The exemplars act as a control mechanism over the generic generative model: by manipulating the retrieval index or how the augmented query is constructed, we can manipulate the behavior of the parser. On the MTOP dataset, in addition to achieving state-of-the-art on the standard setup, we show that CASPER can parse queries in a new domain, adapt the prediction toward the specified patterns, or adapt to new semantic schemas without having to further re-train the model.

preprint, 2022, arXiv

Cross-Scale Vector Quantization for Scalable Neural Speech Coding

Bitrate scalability is a desirable feature for audio coding in real-time communications. Existing neural audio codecs usually enforce a specific bitrate during training, so a different model needs to be trained for each target bitrate. This increases the memory footprint at both the sender and the receiver side, and transcoding is often needed to support multiple receivers. In this paper, we introduce a cross-scale scalable vector quantization scheme (CSVQ), in which multi-scale features are encoded progressively with stepwise feature fusion and refinement. In this way, a coarse-level signal is reconstructed if only a portion of the bitstream is received, and the quality improves progressively as more bits become available. The proposed CSVQ scheme can be flexibly applied to any neural audio coding network with a mirrored auto-encoder structure to achieve bitrate scalability. Subjective results show that the proposed scheme outperforms the classical residual VQ (RVQ) with scalability. Moreover, the proposed CSVQ at 3 kbps outperforms Opus at 9 kbps and Lyra at 3 kbps, and it provides a graceful quality boost as the bitrate increases.

preprint, 2022, arXiv

End-to-End Neural Speech Coding for Real-Time Communications

Deep-learning-based methods have shown their advantages over traditional ones in audio coding, but limited attention has been paid to real-time communications (RTC). This paper proposes TFNet, an end-to-end neural speech codec with low latency for RTC. It follows an encoder-temporal filtering-decoder paradigm that has seldom been investigated in audio coding. An interleaved structure is proposed for temporal filtering to capture both short-term and long-term temporal dependencies. Furthermore, with end-to-end optimization, TFNet is jointly optimized with speech enhancement and packet loss concealment, yielding a one-for-all network for three tasks. Both subjective and objective results demonstrate the efficiency of the proposed TFNet.

preprint, 2022, arXiv

In Defense of Kalman Filtering for Polyp Tracking from Colonoscopy Videos

Real-time and robust automatic detection of polyps from colonoscopy videos is an essential task for helping to improve the performance of doctors during this exam. The current focus of the field is on the development of accurate but inefficient detectors that will not enable a real-time application. We advocate that the field should instead focus on the development of simple and efficient detectors that can be combined with effective trackers to allow the implementation of real-time polyp detectors. In this paper, we propose a Kalman filtering tracker that can work together with powerful but efficient detectors, enabling the implementation of real-time polyp detectors. In particular, we show that the combination of our Kalman filtering with the detector PP-YOLO achieves state-of-the-art (SOTA) detection accuracy and real-time processing. More specifically, our approach has SOTA results on the CVC-ClinicDB dataset, with a recall of 0.740, precision of 0.869, $F_1$ score of 0.799, and an average precision (AP) of 0.837, and can run in real time (i.e., 30 frames per second). We also evaluate our method on a subset of Hyper-Kvasir annotated by our clinical collaborators, obtaining SOTA results with a recall of 0.956, precision of 0.875, $F_1$ score of 0.914, and AP of 0.952, while running in real time.
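As a quick arithmetic check, the $F_1$ scores quoted in this abstract follow from the reported precision and recall through the standard harmonic-mean definition $F_1 = 2PR/(P+R)$. The short sketch below recomputes them; the helper name `f1_score` is an assumption for illustration and is not taken from the paper's code.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall values as quoted in the abstract.
f1_cvc = f1_score(precision=0.869, recall=0.740)           # CVC-ClinicDB
f1_hyper_kvasir = f1_score(precision=0.875, recall=0.956)  # Hyper-Kvasir subset

print(round(f1_cvc, 3), round(f1_hyper_kvasir, 3))  # 0.799 0.914
```

Both recomputed values agree with the reported scores to three decimal places.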

preprint, 2022, arXiv

KuaiRand: An Unbiased Sequential Recommendation Dataset with Randomly Exposed Videos

Recommender systems deployed in real-world applications can have inherent exposure bias, which leads to biased logged data that plagues researchers. A fundamental way to address this thorny problem is to collect users' interactions on randomly exposed items, i.e., missing-at-random data. A few works have asked certain users to rate or select randomly recommended items, e.g., Yahoo!, Coat, and OpenBandit. However, these datasets are either too small in size or lack key information, such as unique user IDs or the features of users/items. In this work, we present KuaiRand, an unbiased sequential recommendation dataset containing millions of intervened interactions on randomly exposed videos, collected from the video-sharing mobile app Kuaishou. Different from existing datasets, KuaiRand records 12 kinds of user feedback signals (e.g., click, like, and view time) on randomly exposed videos inserted in the recommendation feeds over two weeks. To facilitate model learning, we further collect rich features of users and items as well as users' behavior history. By releasing this dataset, we enable research on advanced debiasing in large-scale recommendation scenarios for the first time. Also, with its distinctive features, KuaiRand can support various other research directions such as interactive recommendation, long sequential behavior modeling, and multi-task learning. The dataset and its news will be available at https://kuairand.com.

preprint, 2022, arXiv

LBCF: A Large-Scale Budget-Constrained Causal Forest Algorithm

Offering incentives (e.g., coupons at Amazon, discounts at Uber and video bonuses at TikTok) to users is a common strategy used by online platforms to increase user engagement and platform revenue. Despite their proven effectiveness, these marketing incentives incur an inevitable cost and might result in a low ROI (Return on Investment) if not used properly. On the other hand, different users respond differently to these incentives: for instance, some users never buy certain products without coupons, while others do anyway. Thus, how to select the right amount of incentives (i.e., treatment) for each user under budget constraints is an important research problem with great practical implications. In this paper, we call this the budget-constrained treatment selection (BTS) problem. The challenge is how to efficiently solve the BTS problem on a large-scale dataset and achieve improved results over existing techniques. We propose a novel tree-based treatment selection technique under budget constraints, called the Large-Scale Budget-Constrained Causal Forest (LBCF) algorithm, which is also an efficient treatment selection algorithm suitable for modern distributed computing systems. A novel offline evaluation method is also proposed to overcome an intrinsic challenge in assessing solutions' performance for the BTS problem on randomized control trial (RCT) data. We deploy our approach in a real-world scenario on a large-scale video platform, where the platform gives away bonuses in order to increase users' campaign engagement duration. The simulation analysis and the offline and online experiments all show that our method outperforms various tree-based state-of-the-art baselines. The proposed approach is currently serving hundreds of millions of users on the platform and has achieved one of the largest improvements in recent months.

preprint, 2022, arXiv

Learning Multi-granularity User Intent Unit for Session-based Recommendation

Session-based recommendation aims to predict a user's next action based on previous actions in the current session. The major challenge is to capture authentic and complete user preferences in the entire session. Recent work utilizes graph structure to represent the entire session and adopts Graph Neural Networks to encode session information. This modeling choice has proved effective and achieved remarkable results. However, most existing studies only consider each item within the session independently and do not capture session semantics from a high-level perspective. This limitation often leads to severe information loss and increases the difficulty of capturing long-range dependencies within a session. Intuitively, compared with individual items, a session snippet, i.e., a group of locally consecutive items, is able to provide supplemental user intents that are hardly captured by existing methods. In this work, we propose to learn multi-granularity consecutive user intent units to improve recommendation performance. Specifically, we propose the Multi-granularity Intent Heterogeneous Session Graph, which captures the interactions between intent units of different granularity and relieves the burden of long-range dependency. Moreover, we propose the Intent Fusion Ranking (IFR) module to compose the recommendation results from user intents of various granularity. Compared with current methods that only leverage intents from individual items, IFR benefits from user intents of different granularity to generate a more accurate and comprehensive session representation, thus eventually boosting recommendation performance. We conduct extensive experiments on five session-based recommendation datasets, and the results demonstrate the effectiveness of our method.

preprint2022arXiv

MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image

In this work, we propose a framework for single-view hand mesh reconstruction, which can simultaneously achieve high reconstruction accuracy, fast inference speed, and temporal coherence. Specifically, for 2D encoding, we propose lightweight yet effective stacked structures. Regarding 3D decoding, we provide an efficient graph operator, namely depth-separable spiral convolution. Moreover, we present a novel feature lifting module for bridging the gap between 2D and 3D representations. This module begins with a map-based position regression (MapReg) block to integrate the merits of both heatmap encoding and position regression paradigms for improved 2D accuracy and temporal coherence. Furthermore, MapReg is followed by pose pooling and pose-to-vertex lifting approaches, which transform 2D pose encodings to semantic features of 3D vertices. Overall, our hand reconstruction framework, called MobRecon, comprises affordable computational costs and miniature model size, which reaches a high inference speed of 83FPS on Apple A14 CPU. Extensive experiments on popular datasets such as FreiHAND, RHD, and HO3Dv2 demonstrate that our MobRecon achieves superior performance on reconstruction accuracy and temporal coherence. Our code is publicly available at https://github.com/SeanChenxy/HandMesh.

preprint2022arXiv

Optimal $L^p$ regularity for $\bar\partial$ on the Hartogs triangle

In this paper, we prove weighted $L^p$ estimates for the canonical solutions on product domains. As an application, we show that if $p\in [4, \infty)$, the $\bar\partial$ equation on the Hartogs triangle with $L^p$ data admits $L^p$ solutions with the desired estimates. For any $ε>0$, by constructing an example with $L^p$ data but having no $L^{p+ε}$ solutions, we verify the sharpness of the $L^p$ regularity on the Hartogs triangle.

preprint2022arXiv

Toward a Human-Centered AI-assisted Colonoscopy System

AI-assisted colonoscopy has received lots of attention in the last decade. Several randomised clinical trials in the previous two years showed exciting results of the improving detection rate of polyps. However, current commercial AI-assisted colonoscopy systems focus on providing visual assistance for detecting polyps during colonoscopy. There is a lack of understanding of the needs of gastroenterologists and the usability issues of these systems. This paper aims to introduce the recent development and deployment of commercial AI-assisted colonoscopy systems to the HCI community, identify gaps between the expectation of the clinicians and the capabilities of the commercial systems, and highlight some unique challenges in Australia.

preprint2022arXiv

Unique continuation for $\bar\partial$ with square-integrable potentials

In this paper, we investigate the unique continuation property for the inequality $|\bar\partial u| \le V|u|$, where $u$ is a vector-valued function from a domain in $\mathbb C^n$ to $\mathbb C^N$, and the potential $V\in L^2$. We show that the strong unique continuation property holds when $n=1$, and the weak unique continuation property holds when $n\ge 2$. In both cases, the $L^2$ integrability condition on the potential is optimal.

preprint2022arXiv

WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma

Lung cancer is the leading cause of cancer death worldwide, and adenocarcinoma (LUAD) is the most common subtype. Exploiting the potential value of histopathology images can promote precision medicine in oncology. Tissue segmentation is the basic upstream task of histopathology image analysis. Existing deep learning models have achieved superior segmentation performance but require sufficient pixel-level annotations, which are time-consuming and expensive to obtain. To enrich the label resources of LUAD and to alleviate the annotation effort, we organize the WSSS4LUAD challenge to call for outstanding weakly-supervised semantic segmentation (WSSS) techniques for histopathology images of LUAD. Participants have to design an algorithm to segment tumor epithelial, tumor-associated stroma and normal tissue with only patch-level labels. This challenge includes 10,091 patch-level annotations (the training set) and over 130 million labeled pixels (the validation and test sets), from 87 WSIs (67 from GDPH, 20 from TCGA). All the labels were generated by a pathologist-in-the-loop pipeline with the help of AI models and checked by the label review board. Among 532 registrations, 28 teams submitted results in the test phase, with over 1,000 submissions. Finally, the first-place team achieved an mIoU of 0.8413 (tumor: 0.8389, stroma: 0.7931, normal: 0.8919). According to the technical reports of the top-tier teams, CAM is still the most popular approach in WSSS. CutMix data augmentation has been widely adopted to generate more reliable samples. With the success of this challenge, we believe that WSSS approaches with patch-level annotations can be a complement to the traditional pixel annotations while reducing the annotation effort. The entire dataset has been released to encourage more research on computational pathology in LUAD and more novel WSSS techniques.

preprint2021arXiv

Active Frequency Measurement on Superradiant Strontium Clock Transitions

We develop a stochastic mean-field theory to describe active frequency measurements of pulsed superradiant emission, studied in recent experiments with strontium-87 atoms trapped in an optical lattice inside an optical cavity [M. Norcia, et al., Phys. Rev. X 8, 21036 (2018)]. Our theory reveals the intriguing dynamics of atomic ensembles with multiple transition frequencies, and it reproduces the superradiant beats signal, noisy power spectra, and frequency uncertainty in remarkable agreement with the experiments. Moreover, by reducing the number of atoms, elongating the superradiant pulses and shortening the experimental duty cycle, we predict a short-term frequency uncertainty $9\times10^{-16} \sqrt{τ/s}$, which makes active frequency measurements with superradiant transitions comparable with the record performance of current frequency standards [M. Schioppo, et al., Nat. Photonics, 11, 48 (2017)]. Our theory combines cavity-quantum electrodynamics and quantum measurement theory, and it can be readily applied to explore conditional quantum dynamics and describe frequency measurements for other processes such as steady-state superradiance and superradiant Raman lasing.

preprint2021arXiv

Cavity Quantum Electrodynamics Effects with Nitrogen Vacancy Center Spins in Diamond and Microwave Resonators at Room Temperature

Cavity quantum electrodynamics (C-QED) effects, such as Rabi splitting, Rabi oscillations and superradiance, have been demonstrated with nitrogen vacancy center spins in diamond in microwave resonators at cryogenic temperature. In this article we explore the possibility to realize strong collective coupling and the resulting C-QED effects with ensembles of spins at room temperature. Thermal excitation of the individual spins by the hot environment leads to population of collective Dicke states with low symmetry and a reduced collective spin-microwave field coupling. However, we show with simulations that the thermal excitation can be compensated by spin-cooling via optical pumping. The resulting population of Dicke states with higher symmetry implies strong coupling with currently available high-quality resonators and enables C-QED effects at room temperature with potential applications in quantum sensing and quantum information processing.

preprint2021arXiv

Hölder estimates for the $\bar\partial$ problem for $(p,q)$ forms on product domains

The purpose of this paper is to study Hölder estimates for the $\bar\partial$ problem for $(p,q)$ forms on products of general planar domains. As indicated by an example of Stein and Kerzman, solutions to the $\bar\partial$ problem on product domains in $\mathbb C^n (n\ge 2)$ do not gain regularity in Hölder spaces. Making use of an integral representation of Nijenhuis and Woolf, we show that given a $\bar\partial$-closed $(p,q)$ form with $C^{k,α}$ components, $0\le p\le n, 1\le q\le n$, $k\in \mathbb Z^+\cup \{0\}, 0<α\le 1$, there is a $C^{k, α'}$ solution to the $\bar\partial$ problem on product domains for any $0<α'<α$ with the desired Hölder estimate.

preprint2021arXiv

Investigating the integrate and fire model as the limit of a random discharge model: a stochastic analysis perspective

In the mean-field integrate-and-fire model, the dynamics of a typical neuron within a large network is modeled as a diffusion-jump stochastic process whose jump takes place once the voltage reaches a threshold. In this work, the main goal is to establish the convergence relationship between the regularized process and the original one. In the regularized process, the jump mechanism is replaced by a Poisson dynamic, and the jump intensity within the classically forbidden domain goes to infinity as the regularization parameter vanishes. On the macroscopic level, the Fokker-Planck equation for the process with random discharges (i.e., Poisson jumps) is defined on the whole space, while the equation for the limit process is on the half space. However, with the iteration scheme, the difficulty due to the domain differences is greatly mitigated and the convergence of the stochastic process and the firing rates can be established. Moreover, we find a polynomial-order convergence for the distribution by a re-normalization argument in probability theory. Finally, by numerical experiments, we quantitatively explore the rate and the asymptotic behavior of the convergence for both linear and nonlinear models.

preprint2021arXiv

Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System

Designing task-oriented dialogue systems is a challenging research topic, since it needs not only to generate utterances fulfilling user requests but also to guarantee their comprehensibility. Many previous works trained end-to-end (E2E) models with supervised learning (SL); however, the bias in annotated system utterances remains a bottleneck. Reinforcement learning (RL) deals with the problem by using non-differentiable evaluation metrics (e.g., the success rate) as rewards. Nonetheless, existing works with RL showed that the comprehensibility of generated system utterances could be corrupted when improving the performance on fulfilling user requests. In our work, we (1) propose modelling the hierarchical structure between dialogue policy and natural language generator (NLG) with the option framework, called HDNO, where the latent dialogue act is applied to avoid designing specific dialogue act representations; (2) train HDNO via hierarchical reinforcement learning (HRL), and suggest asynchronous updates between dialogue policy and NLG during training to theoretically guarantee their convergence to a local maximizer; and (3) propose using a discriminator modelled with language models as an additional reward to further improve the comprehensibility. We test HDNO on MultiWoz 2.0 and MultiWoz 2.1, datasets of multi-domain dialogues, in comparison with a word-level E2E model trained with RL, LaRL and HDSA, showing improvements in performance as evaluated by automatic evaluation metrics and human evaluation. Finally, we demonstrate the semantic meanings of latent dialogue acts to show the explainability of HDNO.

preprint2021arXiv

Optimal Hölder regularity for the $\bar\partial$ problem on product domains in $\mathbb C^2$

The note concerns the $\bar\partial$ problem on product domains in $\mathbb C^2$. We show that there exists a bounded solution operator from $C^{k, α}$ into itself, $k\in \mathbb Z^+\cup \{0\}, 0<α< 1$. The regularity result is optimal in view of an example of Stein-Kerzman.

preprint2021arXiv

Some Rigorous Results on the Phase Transition of Finitary Random Interlacement

In this paper, we show several rigorous results on the phase transition of Finitary Random Interlacement (FRI). For the high intensity regime, we show the existence of a critical fiber length, and give the exact asymptotic of it as intensity goes to infinity. At the same time, our result for the low intensity regime proves the global existence of a non-trivial phase transition with respect to the system intensity.

preprint2021arXiv

Structural Controllability of Networked Relative Coupling Systems

This paper studies the controllability of networked relative coupling systems (NRCSs), in which subsystems are of fixed high-order linear dynamics and coupled through relative variables depending on their neighbors, from a structural perspective. The purpose is to explore conditions for subsystem dynamics and network topologies under which for almost all weights of the subsystem interaction links, the corresponding numerical NRCSs are controllable, which is called structurally controllable. Three types of subsystem interaction fashions are considered: 1) each subsystem is single-input-single-output (SISO), 2) each subsystem is multiple-input-multiple-output (MIMO), and the weights for all channels between two subsystems are identical, and 3) each subsystem is MIMO, but different channels between two subsystems can be weighted differently. We show that all parameter-dependent modes of the NRCSs are generically controllable under some necessary connectivity conditions. We then derive necessary and/or sufficient conditions for structural controllability depending on subsystem dynamics and network topologies' connectivity in a decoupled form for all the three interaction fashions. We also extend our results to handle certain subsystem heterogeneities and demonstrate their direct applications on some practical systems, including the mass-spring-damper system and the power network.

preprint2021arXiv

Weighted Sylvester sums on the Frobenius set in more variables

Let $a_1,a_2,\dots,a_k$ be positive integers with $\gcd(a_1,a_2,\dots,a_k)=1$. Let ${\rm NR}={\rm NR}(a_1,a_2,\dots,a_k)$ denote the set of positive integers nonrepresentable in terms of $a_1,a_2,\dots,a_k$. The largest nonrepresentable integer $\max{\rm NR}$, the number of nonrepresentable positive integers $\sum_{n\in{\rm NR}}1$ and the sum of nonrepresentable positive integers $\sum_{n\in{\rm NR}}n$ have been widely studied for a long time as related to the famous Frobenius problem. In this paper by using Eulerian numbers, we give formulas for the weighted sum $\sum_{n\in{\rm NR}}λ^{n}n^μ$, where $μ$ is a nonnegative integer and $λ$ is a complex number. We also examine power sums of nonrepresentable numbers and some formulae for three variables. Several examples illustrate and support our results.
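The quantities named in this abstract can be checked by brute force on small inputs. The following is a minimal sketch, not the paper's Eulerian-number formulas: it enumerates ${\rm NR}$ directly and evaluates $\sum_{n\in{\rm NR}}λ^{n}n^μ$ term by term. The function names `nonrepresentable` and `weighted_sylvester_sum` are hypothetical and chosen only for illustration.

```python
from functools import reduce
from math import gcd


def nonrepresentable(gens):
    """Sorted positive integers that are NOT non-negative integer
    combinations of the generators (hypothetical helper name)."""
    if reduce(gcd, gens) != 1:
        raise ValueError("generators must have gcd 1, otherwise NR is infinite")
    smallest = min(gens)
    representable = [True]  # index 0: zero is the empty combination
    missing = []
    run = 0  # current run of consecutive representable integers
    n = 0
    # Once `smallest` consecutive integers are representable, every larger
    # integer is too (keep adding `smallest`), so the search can stop.
    while run < smallest:
        n += 1
        ok = any(a <= n and representable[n - a] for a in gens)
        representable.append(ok)
        if ok:
            run += 1
        else:
            run = 0
            missing.append(n)
    return missing


def weighted_sylvester_sum(gens, lam, mu):
    """Brute-force value of sum over n in NR of lam**n * n**mu."""
    return sum(lam ** n * n ** mu for n in nonrepresentable(gens))


nr = nonrepresentable([3, 5])
print(nr)                                     # [1, 2, 4, 7]
print(max(nr))                                # 7, the largest nonrepresentable integer
print(weighted_sylvester_sum([3, 5], 1, 0))   # 4, the number of elements of NR
print(weighted_sylvester_sum([3, 5], 1, 1))   # 14, the sum of the elements of NR
```

Setting $λ=1$ with $μ=0$ or $μ=1$ recovers the classical count and sum, which makes the enumeration a convenient cross-check for any closed formula on small cases.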

preprint2020arXiv

Cauchy singular integral operator with parameters in Log-Hölder spaces

This paper is motivated by a claim in the classical textbook of Muskhelishvili concerning the Cauchy singular integral operator $S$ on Hölder functions with parameters. Contrary to the claim, a counterexample was constructed by Tumanov which shows that $S$ with parameters fails to maintain the same Hölder regularity with respect to the parameters. In view of the example, the behavior of the Cauchy singular integral operator with parameters between a type of Log-Hölder spaces is investigated to obtain the sharp norm estimates. At the end of the paper, we discuss its application to the $\bar\partial$ problem on product domains.

preprint2020arXiv

Characterization of complementing pairs of $({\mathbb Z}_{\geq 0})^n$

Let $A, B, C$ be subsets of an abelian group $G$. A pair $(A, B)$ is called a $C$-pair if $A, B\subset C$ and $C$ is the direct sum of $A$ and $B$. The $({\mathbb Z}_{\geq 0})$-pairs are characterized by de Bruijn in 1950 and the $({\mathbb Z}_{\geq 0})^2$-pairs are characterized by Niven in 1971. In this paper, we characterize the $({\mathbb Z}_{\geq 0})^n$-pairs for all $n\geq 1$. We show that every $({\mathbb Z}_{\geq 0})^n$-pair is characterized by a weighted tree if it is primitive, that is, it is not a Cartesian product of a $({\mathbb Z}_{\geq 0})^p$-pair and a $({\mathbb Z}_{\geq 0})^q$-pair of lower dimensions.
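The definition of a $C$-pair can be tested directly on finite sets. The sketch below is an illustration under stated assumptions, not the paper's tree characterization: it checks that $A, B\subset C$ and that every element of $C$ is written as $a+b$ in exactly one way. The helper name `is_c_pair` and the finite example (integers $0,\dots,15$ split by even and odd binary digit positions) are chosen here for illustration.

```python
from collections import Counter


def is_c_pair(A, B, C):
    """True if A and B are subsets of C and C is the direct sum of A and B,
    i.e. each c in C equals a + b for exactly one (a, b) in A x B."""
    A, B, C = set(A), set(B), set(C)
    if not (A <= C and B <= C):
        return False
    sums = Counter(a + b for a in A for b in B)
    return set(sums) == C and all(count == 1 for count in sums.values())


# A uses binary digit positions 0 and 2, B uses positions 1 and 3.
A = {0, 1, 4, 5}
B = {0, 2, 8, 10}
print(is_c_pair(A, B, range(16)))           # True
print(is_c_pair({0, 1}, {0, 1}, range(3)))  # False: 1 = 0 + 1 = 1 + 0
```

The finite check only covers a truncation; a genuine $({\mathbb Z}_{\geq 0})$-pair involves infinitely many elements and cannot be verified by enumeration alone.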

preprint2020arXiv

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

Applying artificial intelligence techniques in medical imaging is one of the most promising areas in medicine. However, most of the recent success in this area highly relies on large amounts of carefully annotated data, whereas annotating medical images is a costly process. In this paper, we propose a novel method, called FocalMix, which, to the best of our knowledge, is the first to leverage recent advances in semi-supervised learning (SSL) for 3D medical image detection. We conducted extensive experiments on two widely used datasets for lung nodule detection, LUNA16 and NLST. Results show that our proposed SSL methods can achieve a substantial improvement of up to 17.3% over state-of-the-art supervised learning approaches with 400 unlabeled CT scans.

preprint2020arXiv

Generic Detectability and Isolability of Topology Failures in Networked Linear Systems

This paper studies the possibility of detecting and isolating topology failures (including link failures and node failures) of a networked system from subsystem measurements, in which subsystems are of fixed high-order linear dynamics, and the exact interaction weights among them are unknown. We prove that in such class of networked systems with the same network topologies, the detectability and isolability of a given topology failure (set) are generic properties, indicating that it is the network topology that dominates the property of being detectable or isolable for a failure (set). We first give algebraic conditions for detectability and isolability of arbitrary parameter perturbations for a lumped plant, and then derive graph-theoretical necessary and sufficient conditions for generic detectability and isolability of topology failures for the networked systems. On the basis of these results, we consider the problems of deploying the smallest set of sensors for generic detectability and isolability. We reduce the associated sensor placement problems to the hitting set problems, which can be effectively solved by greedy algorithms with guaranteed approximation performances.
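The last step of the abstract relies on greedy algorithms for the hitting set problem. A minimal sketch of the standard greedy heuristic follows; it is the generic textbook procedure, not the paper's specific reduction from sensor placement, and the function name `greedy_hitting_set` and the toy input are hypothetical.

```python
def greedy_hitting_set(sets):
    """Repeatedly pick the element that hits the most sets not yet hit.
    Ties are broken by the smallest element so the output is deterministic."""
    remaining = [set(s) for s in sets]
    if any(not s for s in remaining):
        raise ValueError("an empty set can never be hit")
    chosen = []
    while remaining:
        counts = {}
        for s in remaining:
            for element in s:
                counts[element] = counts.get(element, 0) + 1
        # max() keeps the first maximum, and sorted() puts the smallest first.
        best = max(sorted(counts), key=counts.get)
        chosen.append(best)
        remaining = [s for s in remaining if best not in s]
    return chosen


# Toy instance: each set lists candidate sensor locations (illustrative only).
print(greedy_hitting_set([{1, 2}, {2, 3}, {3, 4}]))  # [2, 3]
```

The greedy choice does not always return a minimum hitting set; it is used because it comes with a logarithmic approximation guarantee, as for greedy set cover.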

preprint2020arXiv

Improving Monocular Depth Estimation by Leveraging Structural Awareness and Complementary Datasets

Monocular depth estimation plays a crucial role in 3D recognition and understanding. One key limitation of existing approaches lies in their lack of structural information exploitation, which leads to inaccurate spatial layout, discontinuous surface, and ambiguous boundaries. In this paper, we tackle this problem in three aspects. First, to exploit the spatial relationship of visual features, we propose a structure-aware neural network with spatial attention blocks. These blocks guide the network attention to global structures or local details across different feature layers. Second, we introduce a global focal relative loss for uniform point pairs to enhance spatial constraint in the prediction, and explicitly increase the penalty on errors in depth-wise discontinuous regions, which helps preserve the sharpness of estimation results. Finally, based on analysis of failure cases for prior methods, we collect a new Hard Case (HC) Depth dataset of challenging scenes, such as special lighting conditions, dynamic objects, and tilted camera angles. The new dataset is leveraged by an informed learning curriculum that mixes training examples incrementally to handle diverse data distributions. Experimental results show that our method outperforms state-of-the-art approaches by a large margin in terms of both prediction accuracy on NYUDv2 dataset and generalization performance on unseen datasets.

preprint2020arXiv

Lipschitz classification of Bedford-McMullen carpets with uniform horizontal fibers

Let ${\cal M}_{t,v,r}(n,m)$, $2\leq m<n$, be the collection of self-affine carpets with expanding matrix $\operatorname{diag}(n,m)$ which are totally disconnected, possessing vacant rows and with uniform horizontal fibers. In this paper, we introduce a notion of structure tree of a metric space, and thanks to this new notion, we completely characterize when two carpets in ${\cal M}_{t,v,r}(n,m)$ are Lipschitz equivalent.

preprint2020arXiv

Mapping Natural Language Instructions to Mobile UI Action Sequences

We present a new problem: grounding natural language instructions to mobile user interface actions, and create three new datasets for it. For full task evaluation, we create PIXELHELP, a corpus that pairs English instructions with actions performed by people on a mobile UI emulator. To scale training, we decouple the language and action data by (a) annotating action phrase spans in HowTo instructions and (b) synthesizing grounded descriptions of actions for mobile user interfaces. We use a Transformer to extract action phrase tuples from long-range natural language instructions. A grounding Transformer then contextually represents UI objects using both their content and screen position and connects them to object descriptions. Given a starting screen and instruction, our model achieves 70.59% accuracy on predicting complete ground-truth action sequences in PIXELHELP.

preprint2020arXiv

Neural Inheritance Relation Guided One-Shot Layer Assignment Search

Layer assignment is seldom picked out as an independent research topic in neural architecture search. In this paper, for the first time, we systematically investigate the impact of different layer assignments on network performance by building an architecture dataset of layer assignment on CIFAR-100. Through analyzing this dataset, we discover a neural inheritance relation among the networks with different layer assignments, that is, the optimal layer assignments for deeper networks always inherit from those for shallow networks. Inspired by this neural inheritance relation, we propose an efficient one-shot layer assignment search approach via inherited sampling. Specifically, the optimal layer assignment searched in the shallow network can be provided as a strong sampling prior to train and search the deeper ones in the supernet, which greatly reduces the network search space. Comprehensive experiments carried out on CIFAR-100 illustrate the efficiency of our proposed method. Our search results are strongly consistent with the optimal ones directly selected from the architecture dataset. To further confirm the generalization of our proposed method, we also conduct experiments on Tiny-ImageNet and ImageNet. Our searched results are remarkably superior to the handcrafted ones under unchanged computational budgets. The neural inheritance relation discovered in this paper can provide insights for universal neural architecture search.

preprint2020arXiv

On (non-)monotonicity and phase diagram of finitary random interlacement

In this paper, we study the evolution of a Finitary Random Interlacement (FRI) with respect to the expected length of each fiber. In contrast to the previously proved phase transition between sufficiently large and small fiber length, we show that for $d=3,4$, FRI is NOT stochastically monotone as fiber length increases. At the same time, numerical evidence still strongly supports the existence of a unique and sharp phase transition on the existence of a unique infinite cluster, while the critical value for phase transition is estimated to be an inversely proportional function with respect to the system intensity.

preprint2020arXiv

On Chemical Distance and Local Uniqueness of a Sufficiently Supercritical Finitary Random Interlacement

In this paper, we study geometric properties of the unique infinite cluster $Γ$ in a sufficiently supercritical Finitary Random Interlacement $\mathcal{FI}^{u,T}$ in $\mathbb{Z}^d, \ d\ge 3$. We prove that the chemical distance in $Γ$ is, with stretched exponentially high probability, of the same order as the Euclidean distance in $\mathbb{Z}^d$. This also implies a shape theorem parallel to those for Bernoulli percolation and random interlacements. We also prove local uniqueness of $\mathcal{FI}^{u,T}$, which says that any two large clusters in $\mathcal{FI}^{u,T}$ "close to each other" will, with stretched exponentially high probability, be connected to each other within the same order of the distance between them.

preprint2020arXiv

On some threshold-one attractive interacting particle systems on homogeneous trees

In this paper, we consider the threshold-one contact process and the threshold-one voter model w/o spontaneous death on homogeneous trees $\mathbb{T}_d$, $d\ge 2$. Mainly inspired by the corresponding arguments for ordinary contact processes, we prove that the complete convergence theorem holds for these three systems under strong survival. When the systems survive weakly, complete convergence may also hold under certain transition and/or initial conditions.

preprint2020arXiv

Optomechanical Collective Effects in Surface-Enhanced Raman Scattering from Many Molecules

The interaction between molecules is commonly ignored in surface-enhanced Raman scattering (SERS). Under this assumption, the total SERS signal is described as the sum of the individual contributions of each molecule treated independently. We adopt here an optomechanical description of SERS within a cavity quantum electrodynamics framework to study how collective effects emerge from the quantum correlations of distinct molecules. We derive analytical expressions for identical molecules and implement numerical simulations to analyze two types of collective phenomena: (i) a decrease of the laser intensity threshold to observe strong non-linearities as the number of molecules increases, within intense illumination, and (ii) identification of superradiance in the SERS signal, namely a quadratic scaling with the number of molecules. The laser intensity required to observe the latter in the anti-Stokes scattering is relatively moderate, which makes it particularly accessible to experiments. Our results also show that collective phenomena can survive in the presence of moderate homogeneous and inhomogeneous broadening.

preprint2020arXiv

Structural Controllability of Undirected Diffusive Networks with Vector-Weighted Edges

In this paper, controllability of undirected networked systems with diffusively coupled subsystems is considered, where each subsystem is of identically fixed general high-order single-input-multi-output dynamics. The underlying graph of the network topology is vector-weighted, rather than scalar-weighted. The aim is to find conditions under which the networked system is structurally controllable, i.e., for almost all vector values for interaction links of the network topology, the corresponding system is controllable. It is proven that the networked system is structurally controllable if and only if each subsystem is controllable and observable, and the network topology is globally input-reachable. These conditions are further extended to the cases with multi-input-multi-output subsystems and matrix-weighted edges, or where both directed and undirected interaction links exist.

preprint2020arXiv

Tilings of convex polyhedral cones and topological properties of self-affine tiles

Let $\mathbf{a}_1,\dots, \mathbf{a}_r$ be vectors in a half-space of $\mathbb{R}^n$. We call $C=\mathbf{a}_1\mathbb{R}^++\cdots+\mathbf{a}_r \mathbb{R}^+$ a convex polyhedral cone, and call $\{\mathbf{a}_1,\dots, \mathbf{a}_r\}$ a generator set of $C$. A generator set with the minimal cardinality is called a frame. We investigate the translation tilings of convex polyhedral cones. Let $T\subset \mathbb{R}^n$ be a compact set such that $T$ is the closure of its interior, and $\mathcal{J}\subset \mathbb{R}^n$ be a discrete set. We say $(T,\mathcal{J})$ is a translation tiling of $C$ if $T+\mathcal{J}=C$ and any two translations of $T$ in $T+\mathcal{J}$ are disjoint in Lebesgue measure. We show that if the cardinality of a frame of $C$ is larger than $\dim C$, the dimension of $C$, then $C$ does not admit any translation tiling; if the cardinality of a frame of $C$ equals $\dim C$, then the translation tilings of $C$ can be reduced to the translation tilings of $(\mathbb{Z}^+)^n$. As an application, we characterize all the self-affine tiles possessing polyhedral corners, which generalizes a result of Odlyzko [A. M. Odlyzko, Non-negative digit sets in positional number systems, Proc. London Math. Soc. 37 (1978), 213-229].

preprint2019arXiv

Stationary DLA is well defined

In this paper, we construct an infinite stationary Diffusion Limited Aggregation (SDLA) on the upper half planar lattice, growing from an infinite line, with local growth rate proportional to the stationary harmonic measure. We prove that the SDLA is ergodic with respect to integer left-right translations.

preprint2019arXiv

The Surprising Accuracy of Benford's Law in Mathematics

Benford's law is an empirical "law" governing the frequency of leading digits in numerical data sets. Surprisingly, for mathematical sequences the predictions derived from it can be uncannily accurate. For example, among the first billion powers of $2$, exactly $301029995$ begin with digit 1, while the Benford prediction for this count is $10^9\log_{10}2=301029995.66\dots$. Similar "perfect hits" can be observed in other instances, such as the digit $1$ and $2$ counts for the first billion powers of $3$. We prove results that explain many, but not all, of these surprising accuracies, and we relate the observed behavior to classical results in Diophantine approximation as well as recent deep conjectures in this area.
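The comparison described in this abstract can be reproduced on a much smaller scale with exact integer arithmetic. This is a minimal sketch, assuming that "the first $N$ powers" means $b^1,\dots,b^N$ and using the standard Benford frequency $\log_{10}(1+1/d)$ for leading digit $d$; the function names are hypothetical, and the billion-term count quoted above is far outside what this direct loop can reach.

```python
from math import log10


def count_leading_digit(base, n_terms, digit):
    """Count how many of base**1, ..., base**n_terms start with `digit`.
    Exact big-integer arithmetic; keep n_terms to a few thousand, since
    recent Python versions limit int-to-str conversion to 4300 digits."""
    count = 0
    power = 1
    target = str(digit)
    for _ in range(n_terms):
        power *= base
        if str(power)[0] == target:
            count += 1
    return count


def benford_prediction(n_terms, digit):
    """Benford's expected count: n_terms * log10(1 + 1/digit)."""
    return n_terms * log10(1 + 1 / digit)


# Among 2, 4, ..., 2**20 the powers starting with 1 are
# 16, 128, 1024, 16384, 131072 and 1048576.
print(count_leading_digit(2, 20, 1))  # 6
print(benford_prediction(20, 1))      # about 6.02
```

For digit 1 the prediction reduces to $N\log_{10}2$, which matches the formula in the abstract.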

preprint2016arXiv

Connectivity properties of Branching Interlacements

We consider connectivity properties of the Branching Interlacements model in $\mathbb{Z}^d,~d\ge5$, recently introduced by Angel, Ráth and Zhu in 2016. Using stochastic dimension techniques we show that every two vertices visited by the branching interlacements are connected via at most $\lceil d/4\rceil$ conditioned critical branching random walks from the underlying Poisson process, and that this upper bound is sharp. In particular every such two branching random walks intersect if and only if $5\le d\le 8$. The stochastic dimension of branching random walk result is of independent interest. We additionally obtain heat kernel bounds for branching random walks conditioned on survival.

preprint2016arXiv

Feasibility study of online tuning of the luminosity in a circular collider with the robust conjugate direction search method

The robust conjugate direction search (RCDS) method has high tolerance to noise in beam experiments. It has been demonstrated that this method can be used to optimize the machine performance of a light source online. In our study, taking BEPCII as an example, the feasibility of online tuning of the luminosity in a circular collider is explored, through numerical simulation and preliminary online experiments. It is shown that the luminosity that is artificially decreased by a deviation of beam orbital offset from optimal trajectory can be recovered with this method.

preprint2016arXiv

Stack-propagation: Improved Representation Learning for Syntax

Traditional syntax models typically leverage part-of-speech (POS) information by constructing features from hand-tuned templates. We demonstrate that a better approach is to utilize POS tags as a regularizer of learned representations. We propose a simple method for learning a stacked pipeline of models which we call "stack-propagation". We apply this to dependency parsing and tagging, where we use the hidden layer of the tagger network as a representation of the input tokens for the parser. At test time, our parser does not require predicted POS tags. On 19 languages from the Universal Dependencies, our method is 1.3% (absolute) more accurate than a state-of-the-art graph-based approach and 2.7% more accurate than the most comparable greedy model.

preprint2016arXiv

The Evolving Voter Model on Thick Graphs

In the evolving voter model, when an individual interacts with a neighbor having an opinion different from theirs, they will with probability $1-α$ imitate the neighbor but with probability $ α$ will sever the connection and choose a new neighbor at random (i) from the graph or (ii) from those with the same opinion. Durrett et al. used simulation and heuristics to study these dynamics on sparse graphs. Recently Basu and Sly have studied this system with $1-α= ν/N$ on a dense Erdős-Rényi graph $G(N,1/2)$ and rigorously proved that there is a phase transition from rapid disconnection into components with a single opinion to prolonged persistence of discordant edges as $ν$ increases. In this paper, we consider the intermediate situation of Erdős-Rényi random graphs with average degree $L=N^a$ where $0 < a < 1$. Most of the paper is devoted to a rigorous analysis of an approximation of the dynamics called the approximate master equation. Using ideas of \cite{LMR} and \cite{Silk} we are able to analyze these dynamics in great detail.

preprint2016arXiv

Theoretical Study of Plasmonic Lasing in Junctions with many Molecules

We calculate the quantum state of the plasmon field excited by an ensemble of molecular emitters, which are driven by exchange of electrons with metallic nano-particle electrodes. Assuming identical emitters that are coupled collectively to the plasmon mode but are otherwise subject to independent relaxation channels, we show that symmetry constraints on the total system density matrix imply a drastic reduction in the numerical complexity. For $N_{\text{m}}$ three-level molecules we may thus represent the density matrix by a number of terms scaling as $(N_{\text{m}}+8)!/(8!N_{\text{m}}!)$ instead of $9^{N_{\text{m}}}$, and this allows exact simulations of up to $N_{\text{m}}=10$ molecules. Our simulations demonstrate that many emitters compensate strong plasmon damping and lead to the population of high plasmon number states and a narrowed linewidth of the plasmon field. For large $N_{\text{m}}$, our exact results are reproduced by an approximate approach based on the plasmon reduced density matrix. With this approach, we have extended the simulations to more than $50$ molecules and shown that the plasmon number state population follows a Poisson-like distribution. An alternative approach based on nonlinear rate equations for the molecular state populations and the mean plasmon number also reproduces the main lasing characteristics of the system.
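The size of the reduction quoted above can be checked with a few lines of arithmetic. The sketch below only evaluates the two counts stated in the abstract, $(N_{\text{m}}+8)!/(8!N_{\text{m}}!)$ and $9^{N_{\text{m}}}$; the function names and the `n_basis` parameter are illustrative assumptions, not part of the paper.

```python
from math import comb


def symmetric_terms(n_molecules: int, n_basis: int = 9) -> int:
    """Count (N + 8)! / (8! N!) = C(N + 8, 8) when n_basis = 9.

    n_basis = 9 is taken from the 9^N count in the abstract; the
    parameter itself is a hypothetical generalization for illustration.
    """
    return comb(n_molecules + n_basis - 1, n_basis - 1)


def full_terms(n_molecules: int, n_basis: int = 9) -> int:
    """Count 9^N, the number of terms without the symmetry reduction."""
    return n_basis ** n_molecules


if __name__ == "__main__":
    for n in (1, 5, 10):
        print(n, symmetric_terms(n), full_terms(n))
```

For $N_{\text{m}}=10$ this gives 43758 terms instead of 3486784401, which is consistent with the abstract's statement that exact simulation becomes feasible at that size.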

preprint2016arXiv

Transceiver Design for Cooperative Non-Orthogonal Multiple Access Systems with Wireless Energy Transfer

In this paper, an energy harvesting (EH) based cooperative non-orthogonal multiple access (NOMA) system is considered, where node S simultaneously sends independent signals to a stronger node R and a weaker node D. We focus on the scenario in which the direct link between S and D is too weak to meet the quality of service (QoS) of D. Based on the NOMA principle, node R, the stronger user, has prior knowledge about the information of the weaker user, node D. To satisfy the targeted rate of D, R also serves as an EH decode-and-forward (DF) relay to forward the traffic from S to D. In the sense of an equivalent cognitive radio concept, node R, viewed as a secondary user, helps boost the performance of D in exchange for receiving its own information from S. Specifically, transmitter beamforming design, power splitting ratio optimization and receiver filter design to maximize the rate of node R are studied under the predefined QoS constraint of D and the power constraint of S. Since the problem is non-convex, we propose an iterative approach to solve it. Moreover, to reduce the computational complexity, a zero-forcing (ZF) based solution is also presented. Simulation results demonstrate that both proposed schemes outperform direct transmission.

preprint2015arXiv

Coexistence of grass, saplings and trees in the Staver-Levin forest model

In this paper, we consider two attractive stochastic spatial models in which each site can be in state 0, 1 or 2: Krone's model in which 0 = vacant, 1 = juvenile and 2 = a mature individual capable of giving birth, and the Staver-Levin forest model in which 0 = grass, 1 = sapling and 2 = tree. Our first result shows that if $(0,0)$ is an unstable fixed point of the mean-field ODE for densities of 1's and 2's then when the range of interaction is large, there is positive probability of survival starting from a finite set and a stationary distribution in which all three types are present. The result we obtain in this way is asymptotically sharp for Krone's model. However, in the Staver-Levin forest model, if $(0,0)$ is attracting then there may also be another stable fixed point for the ODE, and in some of these cases there is a nontrivial stationary distribution.

preprint2015arXiv

Community Detection in Networks with Node Features

Many methods have been proposed for community detection in networks, but most of them do not take into account additional information on the nodes that is often available in practice. In this paper, we propose a new joint community detection criterion that uses both the network edge information and the node features to detect community structures. One advantage our method has over existing joint detection approaches is the flexibility of learning the impact of different features which may differ across communities. Another advantage is the flexibility of choosing the amount of influence the feature information has on communities. The method is asymptotically consistent under the block model with additional assumptions on the feature distributions, and performs well on simulated and real networks.

preprint2015arXiv

Continuous solutions of nonlinear Cauchy-Riemann equations and pseudoholomorphic curves in normal coordinates

We establish elliptic regularity for nonlinear inhomogeneous Cauchy-Riemann equations under minimal assumptions, and give a counterexample in a borderline case. In some cases where the inhomogeneous term has a separable factorization, the solution set can be explicitly calculated. The methods also give local parametric formulas for pseudoholomorphic curves with respect to some continuous almost complex structures.

preprint2015arXiv

Convergence of Stochastic Interacting Particle Systems in Probability under a Sobolev Norm

In this paper, we consider particle systems with interaction and Brownian motion. We prove that when the initial data is from the sampling of Chorin's method, i.e., the initial vertices are on lattice points $hi\in \mathbb{R}^d$ with mass $ρ_0(hi) h^d$, where $ρ_0$ is some initial density function, then the regularized empirical measure of the interacting particle system converges in probability to the corresponding mean-field partial differential equation with initial density $ρ_0$, under the Sobolev norm of $L^\infty(L^2)\cap L^2(H^1)$. Our result holds for all systems in which the interacting function is bounded, Lipschitz continuous and satisfies a certain regularity condition. If we further regularize the interacting particle system, it also holds for some of the most important systems whose interacting functions are not. For systems with repulsive Coulomb interaction, this convergence holds globally on any interval $[0,t]$. For systems with attractive Newton force as interacting function, we have convergence within the largest existence time of the regular solution of the corresponding Keller-Segel equation.

preprint2015arXiv

Detecting Overlapping Communities in Networks Using Spectral Methods

Community detection is a fundamental problem in network analysis which is made more challenging by overlaps between communities which often occur in practice. Here we propose a general, flexible, and interpretable generative model for overlapping communities, which can be thought of as a generalization of the degree-corrected stochastic block model. We develop an efficient spectral algorithm for estimating the community memberships, which deals with the overlaps by employing the K-medians algorithm rather than the usual K-means for clustering in the spectral domain. We show that the algorithm is asymptotically consistent when networks are not too sparse and the overlaps between communities not too large. Numerical experiments on both simulated networks and many real social networks demonstrate that our method performs very well compared to a number of benchmark methods for overlapping community detection.

preprint2015arXiv

Mass Dependence of the Entropy Product and Sum

For black holes with multiple horizons, the area product of all horizons has been proven to be mass independent in many cases. Counterexamples have also been found in some cases. In this paper, we first prove a theorem derived from the first law of black hole thermodynamics and a mathematical lemma related to the Vandermonde determinant. With these arguments, we develop general criteria for the mass independence of the entropy product as well as the entropy sum. In particular, if a $d$-dimensional spacetime is spherically symmetric and its radial metric function $f(r)$ is a Laurent series in $r$ with the lowest power $-m$ and the highest power $n$, we find the criteria are extremely simple: The entropy product is mass independent if and only if $m\geq d-2$ and $n\geq4-d$. The entropy sum is mass independent if and only if $m\geq d-2$ and $n\geq 2$. Compared to previous works, our method does not require an exact expression of the metric. Our arguments turn out to be useful even for rotating black holes. By applying our theorem and lemma to a Myers-Perry black hole with spacetime dimension $d$, we show that the entropy product/sum is mass independent for all $d>4$, while it is mass dependent only for $d=4$, i.e., the Kerr solution.
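The two criteria stated above are simple inequalities, so they can be written as predicates. The sketch below is only a transcription of the abstract's conditions for the spherically symmetric case; the function names are hypothetical, and the example values ($d=4$, $m=2$, $n=0$) are our reading of a Reissner-Nordström-type metric function $f(r)=1-2M/r+Q^2/r^2$, not a case taken from the abstract.

```python
def entropy_product_mass_independent(d: int, m: int, n: int) -> bool:
    """Criterion from the abstract: m >= d - 2 and n >= 4 - d.

    d: spacetime dimension; f(r) is a Laurent series in r whose lowest
    power is -m and whose highest power is n.
    """
    return m >= d - 2 and n >= 4 - d


def entropy_sum_mass_independent(d: int, m: int, n: int) -> bool:
    """Criterion from the abstract: m >= d - 2 and n >= 2."""
    return m >= d - 2 and n >= 2


if __name__ == "__main__":
    # Assumed example: f(r) = 1 - 2M/r + Q^2/r^2 in d = 4, so m = 2, n = 0.
    print(entropy_product_mass_independent(4, 2, 0))
    print(entropy_sum_mass_independent(4, 2, 0))
```

Under these assumptions the product criterion is satisfied while the sum criterion is not, since the highest power $n=0$ falls short of 2.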

preprint2015arXiv

On Marine Mammal Acoustic Detection Performance Bounds

Since the spectrogram does not preserve phase information contained in the original data, any algorithm based on the spectrogram is not likely to be optimum for detection. In this paper, we present the Short Time Fourier Transform detector to detect marine mammals in the time-frequency plane. The detector uses phase information for detection. We evaluate this detector by comparing it to the existing spectrogram based detectors for different SNRs and various environments including a known ocean, uncertain ocean, and mean ocean. The results show that this detector outperforms the spectrogram based detector. Simulations are presented using the polynomial phase signal model of the North Atlantic Right Whale (NARW), along with the bellhop ray tracing model.

preprint2015arXiv

Privacy-preserving Network Functionality Outsourcing

Since the advent of software defined networks (SDN), there have been many attempts to outsource the complex and costly local network functionality, i.e. the middlebox, to the cloud in the same way as outsourcing computation and storage. The privacy issues, however, may thwart the enterprises' willingness to adopt this innovation since the underlying configurations of these middleboxes may leak crucial and confidential information which can be utilized by attackers. To address this new problem, we use firewall as a sample functionality and propose the first privacy preserving outsourcing framework and schemes in SDN. The basic technique that we exploit is a ground-breaking tool in cryptography, the cryptographic multilinear map. In contrast to the infeasibility in efficiency if a naive approach is adopted, we devise practical schemes that can outsource the middlebox as a blackbox after obfuscating it such that the cloud provider can efficiently perform the same functionality without knowing its underlying private configurations. Both theoretical analysis and experiments on real-world firewall rules demonstrate that our schemes are secure, accurate, and practical.

preprint2015arXiv

Weak Convergence of a Seasonally Forced Stochastic Epidemic Model

In this study we extend the results of Kurtz (1970, 1971) to show the weak convergence of epidemic processes that include explicit time dependence, specifically where the transmission parameter, $β(t)$, carries a time dependency. We first show that when population size goes to infinity, the time inhomogeneous process converges weakly to the solution of the mean-field ODE. Our second result is that, under proper scaling, the central limit type fluctuations converge to a diffusion process.

preprint2014arXiv

Higher dimensional Frobenius problem and Lipschitz equivalence of Cantor sets

The higher dimensional Frobenius problem was introduced by a preceding paper [Fan, Rao and Zhang, Higher dimensional Frobenius problem: maximal saturated cones, growth function and rigidity, Preprint 2014]. In this paper, we investigate the Lipschitz equivalence of dust-like self-similar sets in $\mathbb R^d$. For any self-similar set, we associate with it a higher dimensional Frobenius problem, and we show that the directional growth function of the associated higher dimensional Frobenius problem is a Lipschitz invariant. As an application, we solve the Lipschitz equivalence problem when two dust-like self-similar sets $E$ and $F$ have coplanar ratios, by showing that they are Lipschitz equivalent if and only if the contraction vector of the $p$-th iteration of $E$ is a permutation of that of the $q$-th iteration of $F$ for some $p, q\geq 1$. This partially answers a question raised by Falconer and Marsh [On the Lipschitz equivalence of Cantor sets, Mathematika, 39 (1992), 223--233].

preprint2014arXiv

Higher dimensional Frobenius problem: Maximal saturated cone, growth function and rigidity

We consider $m$ integral vectors $X_1,...,X_m \in \mathbb{Z}^s$ located in a half-space of $\mathbb{R}^s$ ($m\ge s\geq 1$) and study the structure of the additive semi-group $X_1 \mathbb{N} +... + X_m \mathbb{N}$. We introduce and study maximal saturated cone and directional growth function which describe some aspects of the structure of the semi-group. When the vectors $X_1, ..., X_m$ are located in a fixed hyperplane, we obtain an explicit formula for the directional growth function and we show that this function completely characterizes the defining data $(X_1, ..., X_m)$ of the semi-group. The last result will be applied to the study of Lipschitz equivalence of Cantor sets (see [H. Rao and Y. Zhang, Higher dimensional Frobenius problem and Lipschitz equivalence of Cantor sets, Preprint 2014]).

preprint2014arXiv

On the CR transversality of holomorphic maps into hyperquadrics

Let $M_\ell$ be a smooth Levi-nondegenerate hypersurface of signature $\ell$ in $\mathbf C^n$ with $ n\ge 3$, and write $H_\ell^N$ for the standard hyperquadric of the same signature in $\mathbf C^N$ with $N-n< \frac{n-1}{2}$. Let $F$ be a holomorphic map sending $M_\ell$ into $H_\ell^N$. Assume $F$ does not send a neighborhood of $M_\ell$ in $\mathbf C^n$ into $H_\ell^N$. We show that $F$ is necessarily CR transversal to $M_\ell$ at any point. Equivalently, we show that $F$ is a local CR embedding from $M_\ell$ into $H_\ell^N$.

preprint2014arXiv

Some rigorous results for the stacked contact process

The stacked contact process is a stochastic model for the spread of an infection within a population of hosts located on the $d$-dimensional integer lattice. Regardless of whether they are healthy or infected, hosts give birth and die at the same rate and in accordance with the evolution rules of the neutral multitype contact process. The infection is transmitted both vertically from infected parents to their offspring and horizontally from infected hosts to nearby healthy hosts. The population survives if and only if the common birth rate of healthy and infected hosts exceeds the critical value of the basic contact process. The main purpose of this work is to study the existence of a phase transition between extinction and persistence of the infection in the parameter region where the hosts survive.

preprint2014arXiv

Testing cosmic censorship conjecture near extremal black holes with cosmological constants

It has been shown previously that an extremal Reissner-Nordström or an extremal Kerr black hole cannot be overcharged or overspun by a test particle if radiative and self-force effects are neglected. In this paper, we consider extremal charged and rotating black holes with cosmological constants. By studying the motion of test particles, we find the following results: An extremal Reissner-Nordström anti-de Sitter (RN-AdS) black hole can be overcharged by a test particle but an extremal Reissner-Nordström de Sitter (RN-dS) black hole cannot be overcharged. We also show that both extremal Kerr-de-Sitter (Kerr-dS) and Kerr-anti-de-Sitter (Kerr-AdS) black holes can be overspun by a test particle, implying a possible breakdown of the cosmic censorship conjecture. For the Kerr-AdS case, the overspinning requires that the energy of the particle be negative, reminiscent of the Penrose process. In contrast to the extremal RN and Kerr black holes, in which cases the cosmic censorship is upheld, our results suggest some subtle relations between the cosmological constants and the cosmic censorship. We also discuss the effect of radiation reaction for the Kerr-dS case and find that the magnitude of energy loss due to gravitational radiation may not be enough to prevent the violation of the cosmic censorship.

preprint2013arXiv

Analytical and Numerical Characterizations of Shannon Ordering for Discrete Memoryless Channels

This paper studies several problems concerning channel inclusion, which is a partial ordering between discrete memoryless channels (DMCs) proposed by Shannon. Specifically, majorization-based conditions are derived for channel inclusion between certain DMCs. Furthermore, under general conditions, channel equivalence defined through Shannon ordering is shown to be the same as permutation of input and output symbols. The determination of channel inclusion is considered as a convex optimization problem, and the sparsity of the weights related to the representation of the worse DMC in terms of the better one is revealed when channel inclusion holds between two DMCs. For the exploitation of this sparsity, an effective iterative algorithm is established based on modifying the orthogonal matching pursuit algorithm.

preprint2013arXiv

CR singular images of generic submanifolds under holomorphic maps

The purpose of this paper is to organize some results on the local geometry of CR singular real-analytic manifolds that are images of CR manifolds via a CR map that is a diffeomorphism onto its image. We find a necessary (sufficient in dimension 2) condition for the diffeomorphism to extend to a finite holomorphic map. The multiplicity of this map is a biholomorphic invariant that is precisely the Moser invariant of the image when it is a Bishop surface with vanishing Bishop invariant. In higher dimensions, we study Levi-flat CR singular images and we prove that the set of CR singular points must be large, and in the case of codimension 2, necessarily Levi-flat or complex. We also show that there exist real-analytic CR functions on such images that satisfy the tangential CR conditions at the singular points, yet fail to extend to holomorphic functions in a neighborhood. We provide many examples to illustrate the phenomena that arise.

preprint2013arXiv

Destroying extremal Kerr-Newman black holes with test particles

It has been shown that a nearly extremal black hole can be overcharged or overspun by a test particle if radiative and self-force effects are neglected, indicating that the cosmic censorship might fail. In contrast, the existing evidence in the literature suggests that an extremal black hole cannot be overcharged or overspun in a similar process. In this paper, we show explicitly that even an exactly extremal black hole can be destroyed by a test particle, leading to a possible violation of the cosmic censorship. By considering higher order terms, which were neglected in previous analyses, we show that the violation is generic for any extremal Kerr-Newman black hole with nonvanishing charge and angular momentum. We also find that the allowed parameter range for the particle is very narrow, indicating that radiative and self-force effects should be considered and may prevent violation of the cosmic censorship.

preprint2013arXiv

Earth Occultation Imaging Applied to BATSE -- Application to a Combined BATSE-GBM Survey of the Hard X-Ray Sky

A combined BATSE-GBM hard X-ray catalog is presented based on Earth Occultation Imaging applied to a reanalysis of BATSE data. An imaging approach has been developed for the reanalysis of Earth Occultation analysis of BATSE data. The standard occultation analysis depends on a predetermined catalog of potential sources, so that a real source not present in the catalog may induce systematic errors when source counts associated with an uncatalogued source are incorrectly attributed to catalog sources. The goal of the imaging analysis is to find a complete set of hard X-ray sources, including sources not in the original BATSE occultation catalog. Using the imaging technique, we have identified 15 known sources and 17 unidentified sources and added them to the BATSE occultation catalog. The resulting expanded BATSE catalog of sources observed during 1991-2000 is compared to the ongoing GBM survey.

preprint2013arXiv

On the existence of solutions to nonlinear systems of higher order Poisson type

In this paper, we study the existence of solutions to higher order Poisson type systems. In detail, we prove a Residue type phenomenon for the fundamental solution of Laplacian in $\mathbb{R}^n, n\ge 3$. This is analogous to the Residue theorem for the Cauchy kernel in $\mathbb{C}$. With the aid of the Residue type formula for the fundamental solution, we derive the higher order derivative formula for the Newtonian potential and obtain its appropriate $C^{k, α}$ estimates. The existence of solutions to higher order Poisson type nonlinear systems is concluded as an application of the fixed point theorem.

preprint2013arXiv

Optimization Approach to Parametric Tuning of Power System Stabilizer Based on Trajectory Sensitivity Analysis

This paper proposes a transient-based optimal parametric tuning method for power system stabilizers (PSS) based on trajectory sensitivity (TS) analysis of hybrid systems, such as a hybrid power system (HPS). The main objective is to explore a systematic optimization approach for PSS under large disturbances of the HPS, where nonlinear features cannot be ignored; traditional eigenvalue-based small-signal optimizations, by contrast, neglect the higher order terms of the Taylor series of the system state equations. In contrast to previous work, the proposed TS optimal method focuses on the gradient information of the objective function with respect to the decision variables, obtained from the trajectory sensitivity of the HPS to the PSS parameters, and optimizes the PSS parameters with the conjugate gradient method. First, the traditional parametric tuning methods of PSS are introduced. Then, the systematic mathematical models and transient trajectory simulation are presented by introducing switching/reset events in terms of triggering hypersurfaces so as to formulate the optimization problem using TS analysis. Finally, a case study of the IEEE three-machine nine-bus standard test system is discussed in detail to exemplify the practicality and effectiveness of the proposed method.

preprint2013arXiv

Sustainable Ecosystem Planning Based on Discrete Stochastic Dynamic Programming and Evolutionary Game Theory

This paper proposes a discrete stochastic dynamic programming (SDP) model for sustainable ecosystem (SE) planning of the Loess Plateau in Northwestern China, and analyzes ecological resource planning with an evolutionary game model in the decision-making process. The main objective is to explore a new approach to SE planning from the viewpoint of discrete SDP and evolutionary game theory, with a specific application in the area of ecological resource planning such as water management problems. In contrast to previous work, the proposed SDP method focuses on the transition probability matrix of the ecosystem in a statistical sense, and uses the DP algorithm to obtain the optimal ecological resource planning strategies among multiple subsystems, then analyzes the impacts of decisions between different users. First, the application background and the concept of SE planning are introduced. Then, a brief overview of existing theory for analyzing sustainable ecosystems is presented. Furthermore, an SDP-based mathematical model and its application to water resource planning of central areas of the Loess Plateau are presented as an example. Finally, a supplementary analysis of the impacts between different users in SE planning, treated as a game, is provided.

preprint2012arXiv

Calculation of Droplet Size and Formation Time in Electrohydrodynamic Based Pulsatile Drug Delivery System

Electrohydrodynamic (EHD) generation, a commonly used method in BioMEMS, has played a significant role in pulsed-release drug delivery systems for a decade. In this paper, an EHD based drug delivery system is designed, which can be used to generate a single drug droplet as small as 2.83 nL in 8.5 ms with a total device size of $2\times2\times3$ mm$^3$ and an externally supplied voltage of 1500 V. Theoretically, we derive the expressions for the size and the formation time of a droplet generated by the EHD method, while taking into account the drug supply rate, properties of the liquid, gap between electrodes, nozzle size, and charged droplet neutralization. This work demonstrates a repeatable, stable and controllable droplet generation and delivery system based on the EHD method.

preprint2012arXiv

Pulsatile Drug Delivery System Based on Electrohydrodynamic Method

Electrohydrodynamic (EHD) generation, a commonly used method in BioMEMS, has played a significant role in pulsatile drug delivery systems for a decade. In this paper, an EHD based drug delivery system is designed, which can be used to generate a single drug droplet as small as 2.83 nL in 8.5 ms with a total device size of $2\times2\times3$ mm$^3$ and an externally supplied voltage of 1500 V. Theoretically, we derive the expressions for the size and the formation time of a droplet generated by the EHD method, while taking into account the drug supply rate, properties of the liquid, gap between two electrodes, nozzle size, and charged droplet neutralization. This work demonstrates a repeatable, stable and controllable droplet generation and delivery system based on the EHD method, experimentally as well as theoretically.

preprint2011arXiv

Applications of Stochastic Ordering to Wireless Communications

Stochastic orders are binary relations defined on probability distributions which capture intuitive notions like being larger or being more variable. This paper introduces stochastic ordering of instantaneous SNRs of fading channels as a tool to compare the performance of communication systems over different channels. Stochastic orders unify existing performance metrics such as ergodic capacity, and metrics based on error rate functions for commonly used modulation schemes through their relation with convex, and completely monotonic (c.m.) functions. Toward this goal, performance metrics such as instantaneous error rates of M-QAM and M-PSK modulations are shown to be c.m. functions of the instantaneous SNR, while metrics such as the instantaneous capacity are seen to have a completely monotonic derivative (c.m.d.). It is shown that the commonly used parametric fading distributions for modeling line of sight (LoS) exhibit a monotonicity in the LoS parameter with respect to the stochastic Laplace transform order. Using stochastic orders, the average performance of systems involving multiple random variables is compared over different channels, even when closed form expressions for such averages are not tractable. These include diversity combining schemes, relay networks, and signal detection over fading channels with non-Gaussian additive noise, which are investigated herein. Simulations are also provided to corroborate our results.

preprint2011arXiv

Applications of Tauberian Theorem for High-SNR Analysis of Performance over Fading Channels

This paper derives high-SNR asymptotic average error rates over fading channels by relating them to the outage probability, under mild assumptions. The analysis is based on the Tauberian theorem for Laplace-Stieltjes transforms which is grounded on the notion of regular variation, and applies to a wider range of channel distributions than existing approaches. The theory of regular variation is argued to be the proper mathematical framework for finding sufficient and necessary conditions for outage events to dominate high-SNR error rate performance. It is proved that the diversity order being $d$ and the cumulative distribution function (CDF) of the channel power gain having variation exponent $d$ at 0 imply each other, provided that the instantaneous error rate is upper-bounded by an exponential function of the instantaneous SNR. High-SNR asymptotic average error rates are derived for specific instantaneous error rates. Compared to existing approaches in the literature, the asymptotic expressions are related to the channel distribution in a much simpler manner herein, and related with outage more intuitively. The high-SNR asymptotic error rate is also characterized under diversity combining schemes with the channel power gain of each branch having a regularly varying CDF. Numerical results are shown to corroborate our theoretical analysis.

preprint2011arXiv

Asymptotic Capacity Analysis for Adaptive Transmission Schemes under General Fading Distributions

Asymptotic comparisons of ergodic channel capacity at high and low signal-to-noise ratios (SNRs) are provided for several adaptive transmission schemes over fading channels with general distributions, including optimal power and rate adaptation, rate adaptation only, channel inversion and its variants. Analysis of the high-SNR pre-log constants of the ergodic capacity reveals the existence of constant capacity difference gaps among the schemes with a pre-log constant of 1. Closed-form expressions for these high-SNR capacity difference gaps are derived, which are proportional to the SNR loss between these schemes in dB scale. The largest one of these gaps is found to be between the optimal power and rate adaptation scheme and the channel inversion scheme. Based on these expressions it is shown that the presence of space diversity or multi-user diversity makes channel inversion arbitrarily close to achieving optimal capacity at high SNR with sufficiently large number of antennas or users. A low-SNR analysis also reveals that the presence of fading provably always improves capacity at sufficiently low SNR, compared to the additive white Gaussian noise (AWGN) case. Numerical results are shown to corroborate our analytical results.

preprint2011arXiv

Multi-User Diversity with Random Number of Users

Multi-user diversity is considered when the number of users in the system is random. The complete monotonicity of the error rate as a function of the (deterministic) number of users is established and it is proved that randomization of the number of users always leads to deterioration of average system performance at any average SNR. Further, using stochastic ordering theory, a framework for comparison of system performance for different user distributions is provided. For Poisson distributed users, the difference in error rate of the random and deterministic number of users cases is shown to asymptotically approach zero as the average number of users goes to infinity for any fixed average SNR. In contrast, for a finite average number of users and high SNR, it is found that randomization of the number of users deteriorates performance significantly, and the diversity order under fading is dominated by the smallest possible number of users. For Poisson distributed users communicating over Rayleigh faded channels, further closed-form results are provided for average error rate, and the asymptotic scaling law for ergodic capacity is also provided. Simulation results are provided to corroborate our analytical findings.
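The effect of randomizing the number of users can be illustrated with a deliberately simplified stand-in for the paper's error-rate analysis. The sketch below assumes the best of $N$ users is selected, the users see i.i.d. Rayleigh fading, and performance is measured by outage probability rather than error rate; these modelling choices and all function names are assumptions for illustration. With single-user outage probability $p$, a fixed number of users gives $p^N$, while a Poisson number with mean $\lambda$ gives $E[p^N]=e^{-\lambda(1-p)}$, which is never smaller than $p^\lambda$.

```python
import math


def single_user_outage_rayleigh(threshold_snr: float, average_snr: float) -> float:
    """Outage probability of one user whose SNR is exponential (Rayleigh fading)."""
    return 1.0 - math.exp(-threshold_snr / average_snr)


def outage_fixed(n_users: float, p_single: float) -> float:
    """Outage of best-user selection with a deterministic number of users: p^N."""
    return p_single ** n_users


def outage_poisson(mean_users: float, p_single: float) -> float:
    """Outage averaged over N ~ Poisson(mean_users): E[p^N] = exp(-mean * (1 - p))."""
    return math.exp(-mean_users * (1.0 - p_single))


if __name__ == "__main__":
    p = single_user_outage_rayleigh(threshold_snr=1.0, average_snr=10.0)
    for mean in (2, 5, 10, 20):
        # Random user count is never better than the deterministic count
        # with the same mean, and the gap shrinks as the mean grows.
        print(mean, outage_fixed(mean, p), outage_poisson(mean, p))
```

In this toy model the Poisson outage floors at $e^{-\lambda}$ as the SNR grows, because zero users may be present, which mirrors the abstract's remark that the smallest possible number of users dominates at high SNR.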

preprint2011arXiv

Rigidity for local holomorphic isometric embeddings from $\mathbb{B}^n$ into $\mathbb{B}^{N_1}\times\cdots\times\mathbb{B}^{N_m}$ up to conformal factors

In this article, we study local holomorphic isometric embeddings from $\mathbb{B}^n$ into $\mathbb{B}^{N_1}\times\cdots\times\mathbb{B}^{N_m}$ with respect to the normalized Bergman metrics up to conformal factors. Assume that each conformal factor is smooth Nash algebraic. Then each component of the map is a multi-valued holomorphic map between complex Euclidean spaces by the algebraic extension theorem derived along the lines of Mok and Mok-Ng. Applying holomorphic continuation and analyzing real analytic subvarieties carefully, we show that each component is either a constant map or a proper holomorphic map between balls. Applying a linearity criterion of Huang, we conclude the total geodesy of non-constant components.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Source provenance

Where this author record came from

arxiv, confidence 95%

external id: arxiv:2605.04128:author:6:yuan-zhang

Imported May 20, 2026. Synced May 21, 2026.

arxiv, confidence 95%

external id: arxiv:2605.15018:author:4:yuan-zhang

Imported May 20, 2026. Synced May 21, 2026.

arxiv, confidence 95%

external id: arxiv:2605.15691:author:1:yuan-zhang

Imported May 20, 2026. Synced May 21, 2026.

arxiv, confidence 95%

external id: arxiv:2604.26694:author:8:yuan-zhang