Source author record

Hao Huang

Hao Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

42works

23topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

GEM: Gaussian Evolution Model for Occupancy Forecasting and Motion Planning

Future 3D semantic occupancy forecasting and motion planning are central to autonomous driving, as they require models to reason about how surrounding scenes evolve and how the ego vehicle should act. Existing occupancy world models commonly discretize scenes into latent embeddings, volumetric features, or quantized tokens, and forecast future states through fixed-step autoregressive generation. This limits temporal flexibility, obscures scene evolution, accumulates errors over long horizons, and poorly matches the continuous-time dynamics of real driving scenes. We propose GEM, a Gaussian Evolution Model for non-autoregressive occupancy world modeling, where driving scenes are represented as explicit continuous 4D Gaussian primitives with learned dynamics. Instead of rolling out future occupancy states step by step, GEM directly queries the Gaussian world representation at arbitrary timestamps and splats the corresponding conditional 3D Gaussians into semantic occupancy volumes. This enables efficient forecasting over the full horizon while retaining a compact and interpretable scene representation. By decoupling spatial geometry, temporal support, and primitive motion, GEM makes the predicted world easier to inspect, as each primitive's evolution can be followed continuously over time. The same representation also supports motion planning by predicting future ego trajectories from the learned Gaussian world. Extensive experiments show that GEM achieves state-of-the-art future semantic occupancy forecasting and strong motion planning performance, while providing flexible temporal querying.

preprint2025arXiv

OxygenREC: An Instruction-Following Generative Framework for E-commerce Recommendation

Traditional recommendation systems suffer from inconsistency in multi-stage optimization objectives. Generative Recommendation (GR) mitigates them through an end-to-end framework; however, existing methods still rely on matching mechanisms based on inductive patterns. Although responsive, they lack the ability to uncover complex user intents that require deductive reasoning based on world knowledge. Meanwhile, LLMs show strong deep reasoning capabilities, but their latency and computational costs remain challenging for industrial applications. More critically, there are performance bottlenecks in multi-scenario scalability: as shown in Figure 1, existing solutions require independent training and deployment for each scenario, leading to low resource utilization and high maintenance costs-a challenge unaddressed in GR literature. To address these, we present OxygenREC, an industrial recommendation system that leverages Fast-Slow Thinking to deliver deep reasoning with strict latency and multi-scenario requirements of real-world environments. First, we adopt a Fast-Slow Thinking architecture. Slow thinking uses a near-line LLM pipeline to synthesize Contextual Reasoning Instructions, while fast thinking employs a high-efficiency encoder-decoder backbone for real-time generation. Second, to ensure reasoning instructions effectively enhance recommendation generation, we introduce a semantic alignment mechanism with Instruction-Guided Retrieval (IGR) to filter intent-relevant historical behaviors and use a Query-to-Item (Q2I) loss for instruction-item consistency. Finally, to resolve multi-scenario scalability, we transform scenario information into controllable instructions, using unified reward mapping and Soft Adaptive Group Clip Policy Optimization (SA-GCPO) to align policies with diverse business objectives, realizing a train-once-deploy-everywhere paradigm.

preprint2022arXiv

A non-uniform extension of Baranyai's Theorem

A celebrated theorem of Baranyai states that when $k$ divides $n$, the family $K_n^k$ of all $k$-subsets of an $n$-element set can be partitioned into perfect matchings. In other words, $K_n^k$ is $1$-factorable. In this paper, we determine all $n, k$, such that the family $K_n^{\le k}$ consisting of subsets of $[n]$ of size up to $k$ is $1$-factorable, and thus extend Baranyai's Theorem to the non-uniform setting. In particular, our result implies that for fixed $k$ and sufficiently large $n$, $K_n^{\le k}$ is $1$-factorable if and only if $n \equiv 0$ or $-1 \pmod k$.

preprint2022arXiv

An Analysis of the Power Imbalance on the Uplink of Power-Domain NOMA

This paper analyzes the power imbalance factor on the uplink of a 2-user Power-domain NOMA system and reveals that the minimum value of the average error probability is achieved when the user signals are perfectly balanced in terms of power as in Multi-User MIMO with power control. The analytic result is obtained by analyzing the pairwise error probability and exploiting a symmetry property of the error events. This result is supported by computer simulations using the QPSK and 16QAM signal formats and uncorrelated Rayleigh fading channels. This finding leads to the questioning of the basic philosophy of Power-domain NOMA and suggests that the best strategy for uncorrelated channels is to perfectly balance the average signal powers received from the users and to use a maximum likelihood receiver for their detection.

preprint2022arXiv

Analysis of the Power Imbalance in Power-Domain NOMA on Correlated Rayleigh Fading Channels

This paper analyzes the power imbalance issue in power-domain NOMA (PD-NOMA) in the presence of channel correlations, typically encountered on the downlink of cellular systems when the base station antennas have an insufficient separation. In a recent paper, the authors analyzed this issue for a typical uplink scenario with uncorrelated channels, and the study revealed an astounding result that the optimum in terms of average error probability is achieved when the user signals are perfectly balanced in terms of power as in multi-user MIMO with power control. This result led to some questioning of the concept of PD-NOMA for uncorrelated Rayleigh fading channels. In the present paper, we make a similar analysis for the downlink, and the study gives a very clear insight into the influence of the power imbalance at different levels of channel correlation. First, with full correlation (user signals transmitted from the same antenna), the PD-NOMA concept reduces to simple signal constellation design. The optimum is achieved when the power imbalance between the user signals is such that the resulting constellation has uniform spacing. Any deviation from this optimum will lead to a hierarchical constellation with performance loss. Also, this optimum power imbalance is shown to hold for a range of strong channel correlations, but for moderate and low correlation values perfectly power balanced NOMA takes over as in the presence of uncorrelated channels.

preprint2022arXiv

Fine-Grained Predicates Learning for Scene Graph Generation

The performance of current Scene Graph Generation models is severely hampered by some hard-to-distinguish predicates, e.g., "woman-on/standing on/walking on-beach" or "woman-near/looking at/in front of-child". While general SGG models are prone to predict head predicates and existing re-balancing strategies prefer tail categories, none of them can appropriately handle these hard-to-distinguish predicates. To tackle this issue, inspired by fine-grained image classification, which focuses on differentiating among hard-to-distinguish object classes, we propose a method named Fine-Grained Predicates Learning (FGPL) which aims at differentiating among hard-to-distinguish predicates for Scene Graph Generation task. Specifically, we first introduce a Predicate Lattice that helps SGG models to figure out fine-grained predicate pairs. Then, utilizing the Predicate Lattice, we propose a Category Discriminating Loss and an Entity Discriminating Loss, which both contribute to distinguishing fine-grained predicates while maintaining learned discriminatory power over recognizable ones. The proposed model-agnostic strategy significantly boosts the performances of three benchmark models (Transformer, VCTree, and Motif) by 22.8\%, 24.1\% and 21.7\% of Mean Recall (mR@100) on the Predicate Classification sub-task, respectively. Our model also outperforms state-of-the-art methods by a large margin (i.e., 6.1\%, 4.6\%, and 3.2\% of Mean Recall (mR@100)) on the Visual Genome dataset.

preprint2022arXiv

Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder

Intermediate layer output (ILO) regularization by means of multitask training on encoder side has been shown to be an effective approach to yielding improved results on a wide range of end-to-end ASR frameworks. In this paper, we propose a novel method to do ILO regularized training differently. Instead of using conventional multitask methods that entail more training overhead, we directly make the intermediate layer output as input to the decoder, that is, our decoder not only accepts the output of the final encoder layer as input, it also takes the output of the encoder ILO as input during training. With the proposed method, as both encoder and decoder are simultaneously "regularized", the network is more sufficiently trained, consistently leading to improved results, over the ILO-based CTC method, as well as over the original attention-based modeling method without the proposed method employed.

preprint2022arXiv

Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition

Internal Language Model Estimation (ILME) based language model (LM) fusion has been shown significantly improved recognition results over conventional shallow fusion in both intra-domain and cross-domain speech recognition tasks. In this paper, we attempt to apply our ILME method to cross-domain code-switching speech recognition (CSSR) work. Specifically, our curiosity comes from several aspects. First, we are curious about how effective the ILME-based LM fusion is for both intra-domain and cross-domain CSSR tasks. We verify this with or without merging two code-switching domains. More importantly, we train an end-to-end (E2E) speech recognition model by means of merging two monolingual data sets and observe the efficacy of the proposed ILME-based LM fusion for CSSR. Experimental results on SEAME that is from Southeast Asian and another Chinese Mainland CS data set demonstrate the effectiveness of the proposed ILME-based LM fusion method.

preprint2022arXiv

Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition

In Uyghur speech, consonant and vowel reduction are often encountered, especially in spontaneous speech with high speech rate, which will cause a degradation of speech recognition performance. To solve this problem, we propose an effective phone mask training method for Conformer-based Uyghur end-to-end (E2E) speech recognition. The idea is to randomly mask off a certain percentage features of phones during model training, which simulates the above verbal phenomena and facilitates E2E model to learn more contextual information. According to experiments, the above issues can be greatly alleviated. In addition, deep investigations are carried out into different units in masking, which shows the effectiveness of our proposed masking unit. We also further study the masking method and optimize filling strategy of phone mask. Finally, compared with Conformer-based E2E baseline without mask training, our model demonstrates about 5.51% relative Word Error Rate (WER) reduction on reading speech and 12.92% on spontaneous speech, respectively. The above approach has also been verified on test-set of open-source data THUYG-20, which shows 20% relative improvements.

preprint2022arXiv

PM-MMUT: Boosted Phone-Mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition

Consonant and vowel reduction are often encountered in speech, which might cause performance degradation in automatic speech recognition (ASR). Our recently proposed learning strategy based on masking, Phone Masking Training (PMT), alleviates the impact of such phenomenon in Uyghur ASR. Although PMT achieves remarkably improvements, there still exists room for further gains due to the granularity mismatch between the masking unit of PMT (phoneme) and the modeling unit (word-piece). To boost the performance of PMT, we propose multi-modeling unit training (MMUT) architecture fusion with PMT (PM-MMUT). The idea of MMUT framework is to split the Encoder into two parts including acoustic feature sequences to phoneme-level representation (AF-to-PLR) and phoneme-level representation to word-piece-level representation (PLR-to-WPLR). It allows AF-to-PLR to be optimized by an intermediate phoneme-based CTC loss to learn the rich phoneme-level context information brought by PMT. Experimental results on Uyghur ASR show that the proposed approaches outperform obviously the pure PMT. We also conduct experiments on the 960-hour Librispeech benchmark using ESPnet1, which achieves about 10% relative WER reduction on all the test set without LM fusion comparing with the latest official ESPnet1 pre-trained model.

preprint2021arXiv

Man-in-The-Middle Attacks and Defense in a Power System Cyber-Physical Testbed

Man-in-The-Middle (MiTM) attacks present numerous threats to a smart grid. In a MiTM attack, an intruder embeds itself within a conversation between two devices to either eavesdrop or impersonate one of the devices, making it appear to be a normal exchange of information. Thus, the intruder can perform false data injection (FDI) and false command injection (FCI) attacks that can compromise power system operations, such as state estimation, economic dispatch, and automatic generation control (AGC). Very few researchers have focused on MiTM methods that are difficult to detect within a smart grid. To address this, we are designing and implementing multi-stage MiTM intrusions in an emulation-based cyber-physical power system testbed against a large-scale synthetic grid model to demonstrate how such attacks can cause physical contingencies such as misguided operation and false measurements. MiTM intrusions create FCI, FDI, and replay attacks in this synthetic power grid. This work enables stakeholders to defend against these stealthy attacks, and we present detection mechanisms that are developed using multiple alerts from intrusion detection systems and network monitoring tools. Our contribution will enable other smart grid security researchers and industry to develop further detection mechanisms for inconspicuous MiTM attacks.

preprint2021arXiv

Multi-Source Data Fusion for Cyberattack Detection in Power Systems

Cyberattacks can cause a severe impact on power systems unless detected early. However, accurate and timely detection in critical infrastructure systems presents challenges, e.g., due to zero-day vulnerability exploitations and the cyber-physical nature of the system coupled with the need for high reliability and resilience of the physical system. Conventional rule-based and anomaly-based intrusion detection system (IDS) tools are insufficient for detecting zero-day cyber intrusions in the industrial control system (ICS) networks. Hence, in this work, we show that fusing information from multiple data sources can help identify cyber-induced incidents and reduce false positives. Specifically, we present how to recognize and address the barriers that can prevent the accurate use of multiple data sources for fusion-based detection. We perform multi-source data fusion for training IDS in a cyber-physical power system testbed where we collect cyber and physical side data from multiple sensors emulating real-world data sources that would be found in a utility and synthesizes these into features for algorithms to detect intrusions. Results are presented using the proposed data fusion application to infer False Data and Command injection-based Man-in- The-Middle (MiTM) attacks. Post collection, the data fusion application uses time-synchronized merge and extracts features followed by pre-processing such as imputation and encoding before training supervised, semi-supervised, and unsupervised learning models to evaluate the performance of the IDS. A major finding is the improvement of detection accuracy by fusion of features from cyber, security, and physical domains. Additionally, we observed the co-training technique performs at par with supervised learning methods when fed with our features.

preprint2020arXiv

A 16-channel fiber array-coupled superconducting single-photon detector array with average system detection efficiency over 60% at telecom wavelength

We report a compact, scalable, and high-performance superconducting nanowire single-photon detector (SNSPD) array by using a multichannel optical fiber array-coupled configuration. For single pixels with an active area of 18 um in diameter and illuminated at the telecom wavelength of 1550 nm, we achieved a pixel yield of 13/16 on one chip, an average system detection efficiency of 69% at a dark count rate of 160 cps, a minimum timing jitter of 74 ps, and a maximum count rate of ~40 Mcps. The optical crosstalk coefficient between adjacent channels is better than -60 dB. The performance of the fiber array-coupled detectors is comparable with a standalone detector coupled to a single fiber. Our method is promising for the development of scalable, high-performance, and high-yield SNSPDs.

preprint2020arXiv

Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems

In this paper, we present a series of complementary approaches to improve the recognition of underrepresented named entities (NE) in hybrid ASR systems without compromising overall word error rate performance. The underrepresented words correspond to rare or out-of-vocabulary (OOV) words in the training data, and thereby can't be modeled reliably. We begin with graphemic lexicon which allows to drop the necessity of phonetic models in hybrid ASR. We study it under different settings and demonstrate its effectiveness in dealing with underrepresented NEs. Next, we study the impact of neural language model (LM) with letter-based features derived to handle infrequent words. After that, we attempt to enrich representations of underrepresented NEs in pretrained neural LM by borrowing the embedding representations of rich-represented words. This let us gain significant performance improvement on underrepresented NE recognition. Finally, we boost the likelihood scores of utterances containing NEs in the word lattices rescored by neural LMs and gain further performance improvement. The combination of the aforementioned approaches improves NE recognition by up to 42% relatively.

preprint2020arXiv

Design and Evaluation of A Cyber-Physical Resilient Power System Testbed

A power system is a complex cyber-physical system whose security is critical to its function. A major challenge is to model and analyze its communication pathways with respect to cyber threats. To achieve this, the design and evaluation of a cyber-physical power system (CPPS) testbed called Resilient Energy Systems Lab (RESLab) is presented that captures realistic cyber, physical, and protection system features. RESLab is architected to be a fundamental tool for studying and improving the resilience of complex CPPS to cyber threats. The cyber network is emulated using Common Open Research Emulator (CORE) that acts as a gateway for the physical and protection devices to communicate. The physical grid is simulated in the dynamic time frame using PowerWorld Dynamic Studio (PWDS). The protection components are modeled with both PWDS and physical devices including the SEL Real-Time Automation Controller (RTAC). Distributed Network Protocol 3 (DNP3) is used to monitor and control the grid. Then, exemplifying the design and validation of these tools, this paper presents four case studies on cyber-attack and defense using RESLab, where we demonstrate false data and command injection using Man-in-the-Middle and Denial of Service attacks and validate them on a large-scale synthetic electric grid.

preprint2020arXiv

Gathering GitHub OSS Requirements from Q&A Community: an Empirical Study

Cross-community collaboration can exploit the expertise and knowledges of crowds in different communities. Recently increasing users in open source software (OSS) community like GitHub attempt to gather software requirements from question and answer (Q&A) communities such as Stack Overflow (SO). In order to investigate this emerging crosscommunity collaboration phenomenon, the paper presents an exploratory study on cross-community requirements gathering of OSS projects in GitHub. We manually sample 3266 practice cases and quantitatively analyze the popularity of the phenomenon, the characteristics of the gathered requirements, and collaboration behaviors of cross-community. Some important findings are obtained: more than half of the requirements gathered from SO are enhancements and the majority of the gathered requirements arenon-functionalrequirements.Inaddition,OSSdeveloperscan directlyobtainrelatedsolutionsandcontributionsofthegathered requirements from SO in the gathering process.

preprint2020arXiv

Generalized Contingency Analysis Based on Graph Theory and Line Outage Distribution Factor

Identifying the multiple critical components in power systems whose absence together has severe impact on system performance is a crucial problem for power systems known as (N-x) contingency analysis. However, the inherent combinatorial feature of the N-x contingency analysis problem incurs by the increase of x in the (N-x) term, making the problem intractable for even relatively small test systems. We present a new framework for identifying the N-x contingencies that captures both topology and physics of the network. Graph theory provides many ways to measure power grid graphs, i.e. buses as nodes and lines as edges, allowing researchers to characterize system structure and optimize algorithms. This paper proposes a scalable approach based on the group betweenness centrality (GBC) concept that measures the impact of multiple components in the electric power grid as well as line outage distribution factors (LODFs) that find the lines whose loss has the highest impact on the power flow in the network. The proposed approach is a quick and efficient solution for identifying the most critical lines in power networks. The proposed approach is validated using various test cases, and results show that the proposed approach is able to quickly identify multiple contingencies that result in violations.

preprint2020arXiv

Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech

Accent conversion (AC) transforms a non-native speaker's accent into a native accent while maintaining the speaker's voice timbre. In this paper, we propose approaches to improving accent conversion applicability, as well as quality. First of all, we assume no reference speech is available at the conversion stage, and hence we employ an end-to-end text-to-speech system that is trained on native speech to generate native reference speech. To improve the quality and accent of the converted speech, we introduce reference encoders which make us capable of utilizing multi-source information. This is motivated by acoustic features extracted from native reference and linguistic information, which are complementary to conventional phonetic posteriorgrams (PPGs), so they can be concatenated as features to improve a baseline system based only on PPGs. Moreover, we optimize model architecture using GMM-based attention instead of windowed attention to elevate synthesized performance. Experimental results indicate when the proposed techniques are applied the integrated system significantly raises the scores of acoustic quality (30$\%$ relative increase in mean opinion score) and native accent (68$\%$ relative preference) while retaining the voice identity of the non-native speaker.

preprint2020arXiv

Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-switching Speech Recognition

In this paper, we conduct data selection analysis in building an English-Mandarin code-switching (CS) speech recognition (CSSR) system, which is aimed for a real CSSR contest in China. The overall training sets have three subsets, i.e., a code-switching data set, an English (LibriSpeech) and a Mandarin data set respectively. The code-switching data are Mandarin dominated. First of all, it is found using the overall data yields worse results, and hence data selection study is necessary. Then to exploit monolingual data, we find data matching is crucial. Mandarin data is closely matched with the Mandarin part in the code-switching data, while English data is not. However, Mandarin data only helps on those utterances that are significantly Mandarin-dominated. Besides, there is a balance point, over which more monolingual data will divert the CSSR system, degrading results. Finally, we analyze the effectiveness of combining monolingual data to train a CSSR system with the HMM-DNN hybrid framework. The CSSR system can perform within-utterance code-switch recognition, but it still has a margin with the one trained on code-switching data.

preprint2020arXiv

On local Turán problems

Since its formulation, Turán's hypergraph problems have been among the most challenging open problems in extremal combinatorics. One of them is the following: given a $3$-uniform hypergraph $\mathcal{F}$ on $n$ vertices in which any five vertices span at least one edge, prove that $|\mathcal{F}| \ge (1/4 -o(1))\binom{n}{3}$. The construction showing that this bound would be best possible is simply $\binom{X}{3} \cup \binom{Y}{3}$ where $X$ and $Y$ evenly partition the vertex set. This construction has the following more general $(2p+1, p+1)$-property: any set of $2p+1$ vertices spans a complete sub-hypergraph on $p+1$ vertices. One of our main results says that, quite surprisingly, for all $p>2$ the $(2p+1,p+1)$-property implies the conjectured lower bound.

preprint2020arXiv

Precise tuning of the superconducting properties of Mn-doped Al films for transition edge sensors by ion-implantation

Magnetic impurities in metallic superconductors are important for both fundamental and applied sciences. In this study, we focused on dilute Mn-doped aluminum (AlMn) films, which are common superconducting materials used to make transition edge sensors (TES). We developed a multi-energy ion-implantation technique to make AlMn films. Compared with frequently used sputtering techniques, ion-implantation provides more precise and reliable control of the Mn doping concentration in the AlMn films.The ion implantation also enables us to quantitatively analyze the superconducting transition temperature curve as a function of the Mn doping concentration. We found that Mn dopants act as magnetic impurities and suppression of superconductivity is counteracted by the antiferromagnetic Ruderman Kittel Kasuya Yosida interaction among Mn dopants. The RKKY interaction can be tuned through defect engineering in the ion-implantation process and through post-implantation annealing.

preprint2020arXiv

Robust Image Matching By Dynamic Feature Selection

Estimating dense correspondences between images is a long-standing image under-standing task. Recent works introduce convolutional neural networks (CNNs) to extract high-level feature maps and find correspondences through feature matching. However,high-level feature maps are in low spatial resolution and therefore insufficient to provide accurate and fine-grained features to distinguish intra-class variations for correspondence matching. To address this problem, we generate robust features by dynamically selecting features at different scales. To resolve two critical issues in feature selection,i.e.,how many and which scales of features to be selected, we frame the feature selection process as a sequential Markov decision-making process (MDP) and introduce an optimal selection strategy using reinforcement learning (RL). We define an RL environment for image matching in which each individual action either requires new features or terminates the selection episode by referring a matching score. Deep neural networks are incorporated into our method and trained for decision making. Experimental results show that our method achieves comparable/superior performance with state-of-the-art methods on three benchmarks, demonstrating the effectiveness of our feature selection strategy.

preprint2016arXiv

A note on the double-critical graph conjecture

A connected $n$-chromatic graph $G$ is double-critical if for all the edges $xy$ of $G$, the graph $G-x-y$ is $(n-2)$-chromatic. In 1966, Erd\H os and Lovász conjectured that the only double-critical $n$-chromatic graph is $K_n$. This conjecture remains unresolved for $n \ge 6.$ In this short note, we verify this conjecture for claw-free graphs $G$ of chromatic number $6$.

preprint2016arXiv

Degree versions of the Erdos-Ko-Rado Theorem and Erdos hypergraph matching conjecture

We use an algebraic method to prove a degree version of the celebrated Erd\H os-Ko-Rado theorem: given $n>2k$, every intersecting $k$-uniform hypergraph $H$ on $n$ vertices contains a vertex that lies on at most $\binom{n-2}{k-2}$ edges. This result can be viewed as a special case of the degree version of a well-known conjecture of Erdős on hypergraph matchings. Improving the work of Bollobás, Daykin, and Erd\H os from 1976, we show that given integers $n, k, s$ with $n\ge 3k^2 s$, every $k$-uniform hypergraph $H$ on $n$ vertices with minimum vertex degree greater than $\binom{n-1}{k-1}-\binom{n-s}{k-1}$ contains $s$ disjoint edges.

preprint2015arXiv

Large Supports are required for Well-Supported Nash Equilibria

We prove that for any constant $k$ and any $ε<1$, there exist bimatrix win-lose games for which every $ε$-WSNE requires supports of cardinality greater than $k$. To do this, we provide a graph-theoretic characterization of win-lose games that possess $ε$-WSNE with constant cardinality supports. We then apply a result in additive number theory of Haight to construct win-lose games that do not satisfy the requirements of the characterization. These constructions disprove graph theoretic conjectures of Daskalakis, Mehta and Papadimitriou, and Myers.

preprint2015arXiv

On graphs decomposable into induced matchings of linear sizes

We call a graph $G$ an $(r,t)$-Ruzsa-Szemerédi graph if its edge set can be partitioned into $t$ edge-disjoint induced matchings, each of size $r$. These graphs were introduced in 1978 and has been extensively studied since then. In this paper, we consider the case when $r=cn$. For $c>1/4$, we determine the maximum possible $t$ which is a constant depending only on $c$. On the other hand, when $c=1/4$, there could be as many as $Ω(\log n)$ induced matchings. We prove that this bound is tight up to a constant factor. Finally, when $c$ is fixed strictly between $1/5$ and $1/4$, we give a short proof that the number $t$ of induced matchings is $O(n/\log n)$. We are also able to further improve the upper bound to $o(n/\log n)$ for fixed $c> 1/4-b$ for some positive constant $b$.

preprint2014arXiv

4 X 20 Gbit/s mode division multiplexing over free space using vector modes and a q-plate mode (de)multiplexer

Vector modes are spatial modes that have spatially inhomogeneous states of polarization, such as, radial and azimuthal polarization. They can produce smaller spot sizes and stronger longitudinal polarization components upon focusing. As a result, they are used for many applications, including optical trapping and nanoscale imaging. In this work, vector modes are used to increase the information capacity of free space optical communication via the method of optical communication referred to as mode division multiplexing. A mode (de)multiplexer for vector modes based on a liquid crystal technology referred to as a q-plate is introduced. As a proof of principle, using the mode (de)multiplexer four vector modes each carrying a 20 Gbit/s quadrature phase shift keying signal on a single wavelength channel (~1550nm), comprising an aggregate 80 Gbit/s, were transmitted ~1m over the lab table with <-16.4 dB (<2%) mode crosstalk. Bit error rates for all vector modes were measured at the forward error correction threshold with power penalties < 3.41dB.

preprint2014arXiv

Maximizing the number of nonnegative subsets

Given a set of $n$ real numbers, if the sum of elements of every subset of size larger than $k$ is negative, what is the maximum number of subsets of nonnegative sum? In this note we show that the answer is $\binom{n-1}{k-1} + \binom{n-1}{k-2} + \cdots + \binom{n-1}{0}+1$, settling a problem of Tsukerman. We provide two proofs, the first establishes and applies a weighted version of Hall's Theorem and the second is based on an extension of the nonuniform Erdős-Ko-Rado Theorem.

preprint2014arXiv

More on the bipartite decomposition of random graphs

For a graph $G=(V,E)$, let $bc(G)$ denote the minimum number of pairwise edge disjoint complete bipartite subgraphs of $G$ so that each edge of $G$ belongs to exactly one of them. It is easy to see that for every graph $G$, $bc(G) \leq n -α(G)$, where $α(G)$ is the maximum size of an independent set of $G$. Erdős conjectured in the 80s that for almost every graph $G$ equality holds, i.e., that for the random graph $G(n,0.5)$, $bc(G)=n-α(G)$ with high probability, that is, with probability that tends to 1 as $n$ tends to infinity. The first author showed that this is slightly false, proving that for most values of $n$ tending to infinity and for $G=G(n,0.5)$, $bc(G) \leq n-α(G)-1$ with high probability. We prove a stronger bound: there exists an absolute constant $c>0$ so that $bc(G) \leq n-(1+c)α(G)$ with high probability.

preprint2014arXiv

Performance Metrics and Design Parameters for a Free-space Communication Link Based on Multiplexing of Multiple Orbital-Angular-Momentum Beams

We study the design parameters for an orbital angular momentum (OAM) multiplexed free-space data link. Power loss, channel crosstalk and power penalty of the link are analyzed in the case of misalignment between the transmitter and receiver (lateral displacement, receiver angular error, or transmitter pointing error). The relationship among the system power loss and link distance, transmitted beam size and receiver aperture size are discussed based on the beam divergence due to free space propagation. We also describe the trade-offs for different receiver aperture sizes and mode spacing of the transmitted OAM beams under given lateral displacements or receiver angular errors. Through simulations and some experiments, we show that (1) a system with a larger transmitted beam size and a larger receiver aperture is more tolerant to the lateral displacement but less tolerant to the receiver angular error; (2) a system with a larger mode spacing, which uses larger OAM charges, suffers more system power loss but less channel crosstalk; thus, a system with a small mode spacing shows lower system power penalty when system power loss dominates (e.g., small lateral displacement or receiver angular error) while that with a larger mode spacing shows lower power penalty when channel crosstalk dominates (e.g., larger lateral displacement or receiver angular error); (3) the effects of lateral displacement and receiver angular error are not necessarily independent; as an example of them combined, the effects of the transmitter pointing error on the system are also investigated.

preprint2014arXiv

The minimum number of nonnegative edges in hypergraphs

An r-unform n-vertex hypergraph H is said to have the Manickam-Miklós-Singhi (MMS) property if for every assignment of weights to its vertices with nonnegative sum, the number of edges whose total weight is nonnegative is at least the minimum degree of H. In this paper we show that for n>10r^3, every r-uniform n-vertex hypergraph with equal codegrees has the MMS property, and the bound on n is essentially tight up to a constant factor. This result has two immediate corollaries. First it shows that every set of n>10k^3 real numbers with nonnegative sum has at least $\binom{n-1}{k-1}$ nonnegative k-sums, verifying the Manickam-Miklós-Singhi conjecture for this range. More importantly, it implies the vector space Manickam-Miklós-Singhi conjecture which states that for n >= 4k and any weighting on the 1-dimensional subspaces of F_q^n with nonnegative sum, the number of nonnegative k-dimensional subspaces is at least ${n-1 \brack k-1}_q$. We also discuss two additional generalizations, which can be regarded as analogues of the Erdős-Ko-Rado theorem on k-intersecting families.

preprint2013arXiv

Deterministic bootstrap percolation in high dimensional grids

In this paper, we study the k-neighbor bootstrap percolation process on the d-dimensional grid [n]^d, and show that the minimum number of initial vertices that percolate is (1-d/k)n^d + O(n^{d-1})$ when d<=k<=2d. This confirms a conjecture of Pete.

preprint2013arXiv

On the maximum induced density of directed stars and related problems

Let k>=3 be an integer, we prove that the maximum induced density of the k-vertex directed star in a directed graph is attained by an iterated blow-up construction. This confirms a conjecture by Falgas-Ravry and Vaughan, who proved this for k=3, 4. This question provides the first known instance of density problem for which one can prove extremality of an iterated blow-up construction. We also study the inducibility of complete bipartite digraphs and discuss other related problems.

preprint2012arXiv

A problem of Erdős on the minimum number of $k$-cliques

Fifty years ago Erdős asked to determine the minimum number of $k$-cliques in a graph on $n$ vertices with independence number less than l. He conjectured that this minimum is achieved by the disjoint union of $l-1$ complete graphs of size $\frac{n}{l-1}$. This conjecture was disproved by Nikiforov who showed that the balanced blow-up of a 5-cycle has fewer 4-cliques than the union of 2 complete graphs of size $\frac{n}{2}$. In this paper we solve Erdős' problem for $(k,l)=(3,4)$ and $(k,l)=(4,3)$. Using stability arguments we also characterize the precise structure of extremal examples, confirming Erdős' conjecture for $(k,l)=(3,4)$ and showing that a blow-up of a 5-cycle gives the minimum for $(k,l)=(4,3)$.

preprint2012arXiv

Large feedback arc sets, high minimum degree subgraphs, and long cycles in Eulerian digraphs

A minimum feedback arc set of a directed graph $G$ is a smallest set of arcs whose removal makes $G$ acyclic. Its cardinality is denoted by $β(G)$. We show that an Eulerian digraph with $n$ vertices and $m$ arcs has $β(G) \ge m^2/2n^2+m/2n$, and this bound is optimal for infinitely many $m, n$. Using this result we prove that an Eulerian digraph contains a cycle of length at most $6n^2/m$, and has an Eulerian subgraph with minimum degree at least $m^2/24n^3$. Both estimates are tight up to a constant factor. Finally, motivated by a conjecture of Bollobás and Scott, we also show how to find long cycles in Eulerian digraphs.

preprint2012arXiv

Large matchings in uniform hypergraphs and the conjectures of Erdos and Samuels

In this paper we study conditions which guarantee the existence of perfect matchings and perfect fractional matchings in uniform hypergraphs. We reduce this problem to an old conjecture by Erdős on estimating the maximum number of edges in a hypergraph when the (fractional) matching number is given, which we are able to solve in some special cases using probabilistic techniques. Based on these results, we obtain some general theorems on the minimum $d$-degree ensuring the existence of perfect (fractional) matchings. In particular, we asymptotically determine the minimum vertex degree which guarantees a perfect matching in 4-uniform and 5-uniform hypergraphs. We also discuss an application to a problem of finding an optimal data allocation in a distributed storage system.

preprint2011arXiv

Bandwidth theorem for random graphs

A graph $G$ is said to have \textit{bandwidth} at most $b$, if there exists a labeling of the vertices by $1,2,..., n$, so that $|i - j| \leq b$ whenever $\{i,j\}$ is an edge of $G$. Recently, Böttcher, Schacht, and Taraz verified a conjecture of Bollobás and Komlós which says that for every positive $r,Δ,γ$, there exists $β$ such that if $H$ is an $n$-vertex $r$-chromatic graph with maximum degree at most $Δ$ which has bandwidth at most $βn$, then any graph $G$ on $n$ vertices with minimum degree at least $(1 - 1/r + γ)n$ contains a copy of $H$ for large enough $n$. In this paper, we extend this theorem to dense random graphs. For bipartite $H$, this answers an open question of Böttcher, Kohayakawa, and Taraz. It appears that for non-bipartite $H$ the direct extension is not possible, and one needs in addition that some vertices of $H$ have independent neighborhoods. We also obtain an asymptotically tight bound for the maximum number of vertex disjoint copies of a fixed $r$-chromatic graph $H_0$ which one can find in a spanning subgraph of $G(n,p)$ with minimum degree $(1-1/r + γ)np$.

preprint2011arXiv

Nonnegative k-sums, fractional covers, and probability of small deviations

More than twenty years ago, Manickam, Miklós, and Singhi conjectured that for any integers $n, k$ satisfying $n \geq 4k$, every set of $n$ real numbers with nonnegative sum has at least $\binom{n-1}{k-1}$ $k$-element subsets whose sum is also nonnegative. In this paper we discuss the connection of this problem with matchings and fractional covers of hypergraphs, and with the question of estimating the probability that the sum of nonnegative independent random variables exceeds its expectation by a given amount. Using these connections together with some probabilistic techniques, we verify the conjecture for $n \geq 33k^2$. This substantially improves the best previously known exponential lower bound $n \geq e^{ck \log\log k}$. In addition we prove a tight stability result showing that for every $k$ and all sufficiently large $n$, every set of $n$ reals with a nonnegative sum that does not contain a member whose sum with any other $k-1$ members is nonnegative, contains at least $\binom{n-1}{k-1}+\binom{n-k-1}{k-1}-1$ subsets of cardinality $k$ with nonnegative sum.

preprint2011arXiv

Quasi-randomness of graph balanced cut properties

Quasi-random graphs can be informally described as graphs whose edge distribution closely resembles that of a truly random graph of the same edge density. Recently, Shapira and Yuster proved the following result on quasi-randomness of graphs. Let $k \ge 2$ be a fixed integer, $α_1,...,α_k$ be positive reals satisfying $\sum_{i} α_i = 1$ and $(α_1,..., α_k) \neq (1/k,...,1/k)$, and $G$ be a graph on $n$ vertices. If for every partition of the vertices of $G$ into sets $V_1,..., V_k$ of size $α_1 n,..., α_k n$, the number of complete graphs on $k$ vertices which have exactly one vertex in each of these sets is similar to what we would expect in a random graph, then the graph is quasi-random. However, the method of quasi-random hypergraphs they used did not provide enough information to resolve the case $(1/k,..., 1/k)$ for graphs. In their work, Shapira and Yuster asked whether this case also forces the graph to be quasi-random. Janson also posed the same question in his study of quasi-randomness under the framework of graph limits. In this paper, we positively answer their question.

preprint2011arXiv

Rapid Mid-Infrared Variability in Protostellar Disks

Spectral energy distribution (SED) in protostellar disks is determined by the disks'internal dissipation and reprocessing of irradiation from their host stars. Around T Tauri stars, most mid-infrared (MIR) radiation (in a few to a few ten μm wavelength range) emerge from regions around a fraction to a few AU's. This region is interesting because it contains both the habitable zone and the snow line. Recent observations reveal that SED variations, in the MIR wavelength range. These variations are puzzling because they occur on time scale (a few days) which is much shorter than the dynamical (months to years) time scale at 1AU to a few AU's. They are probably caused by shadows casted by inner onto outer disk regions. Interaction between disks and their misaligned magnetized host stars can lead to warped structure and periodic SED modulations. Rapid aperiodic SED variations may also be induced by observed X-ray flares from T Tauri stars. These flares can significantly modulate the ionization fraction of the gas and the net charge carried by the grains near the surface of the inner disk. The newly charged grains may be accelerated by the stellar or disk magnetic field and adjust their distances from the midplane. Shadows casted by these grains attenuates the flux of stellar photons irradiated onto regions at several AU's from the central stars. We use this model to account for the observed rapid aperiodic SED variabilities. We suggest regular monitoring of SED variations will not only provide valuable information on the distribution of the disk aspect ratio near the habitable zone but also provide a probe on the interaction between the inner regions of the disk with the magnetosphere of their host stars.

preprint2011arXiv

The size of a hypergraph and its matching number

More than forty years ago, Erdős conjectured that for any T <= N/K, every K-uniform hypergraph on N vertices without T disjoint edges has at most max{\binom{KT-1}{K}, \binom{N}{K} - \binom{N-T+1}{K}} edges. Although this appears to be a basic instance of the hypergraph Turán problem (with a T-edge matching as the excluded hypergraph), progress on this question has remained elusive. In this paper, we verify this conjecture for all T < N/(3K^2). This improves upon the best previously known range T = O(N/K^3), which dates back to the 1970's.

preprint2010arXiv

A counterexample to the Alon-Saks-Seymour conjecture and related problems

Consider a graph obtained by taking edge disjoint union of $k$ complete bipartite graphs. Alon, Saks and Seymour conjectured that such graph has chromatic number at most $k+1$. This well known conjecture remained open for almost twenty years. In this paper, we construct a counterexample to this conjecture and discuss several related problems in combinatorial geometry and communication complexity.

Hao Huang

What is connected

Connect this record

See the researcher in context

Building this map preview

42 published item(s)

GEM: Gaussian Evolution Model for Occupancy Forecasting and Motion Planning

OxygenREC: An Instruction-Following Generative Framework for E-commerce Recommendation

A non-uniform extension of Baranyai's Theorem

An Analysis of the Power Imbalance on the Uplink of Power-Domain NOMA

Analysis of the Power Imbalance in Power-Domain NOMA on Correlated Rayleigh Fading Channels

Fine-Grained Predicates Learning for Scene Graph Generation

Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder

Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition

Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition

PM-MMUT: Boosted Phone-Mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition

Man-in-The-Middle Attacks and Defense in a Power System Cyber-Physical Testbed

Multi-Source Data Fusion for Cyberattack Detection in Power Systems

A 16-channel fiber array-coupled superconducting single-photon detector array with average system detection efficiency over 60% at telecom wavelength

Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems

Design and Evaluation of A Cyber-Physical Resilient Power System Testbed

Gathering GitHub OSS Requirements from Q&A Community: an Empirical Study

Generalized Contingency Analysis Based on Graph Theory and Line Outage Distribution Factor

Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech

Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-switching Speech Recognition

On local Turán problems

Precise tuning of the superconducting properties of Mn-doped Al films for transition edge sensors by ion-implantation

Robust Image Matching By Dynamic Feature Selection

A note on the double-critical graph conjecture

Degree versions of the Erdos-Ko-Rado Theorem and Erdos hypergraph matching conjecture

Large Supports are required for Well-Supported Nash Equilibria

On graphs decomposable into induced matchings of linear sizes

4 X 20 Gbit/s mode division multiplexing over free space using vector modes and a q-plate mode (de)multiplexer

Maximizing the number of nonnegative subsets

More on the bipartite decomposition of random graphs

Performance Metrics and Design Parameters for a Free-space Communication Link Based on Multiplexing of Multiple Orbital-Angular-Momentum Beams

The minimum number of nonnegative edges in hypergraphs

Deterministic bootstrap percolation in high dimensional grids

On the maximum induced density of directed stars and related problems

A problem of Erdős on the minimum number of $k$-cliques

Large feedback arc sets, high minimum degree subgraphs, and long cycles in Eulerian digraphs

Large matchings in uniform hypergraphs and the conjectures of Erdos and Samuels

Bandwidth theorem for random graphs

Nonnegative k-sums, fractional covers, and probability of small deviations

Quasi-randomness of graph balanced cut properties

Rapid Mid-Infrared Variability in Protostellar Disks

The size of a hypergraph and its matching number

A counterexample to the Alon-Saks-Seymour conjecture and related problems