Source author record

Yi Song

Yi Song appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Information Theory math.IT Computation and Language math.DS cond-mat.mtrl-sci eess.SP eess.SY Machine Learning math.CO nlin.CD Software Engineering Systems and Control

Catalog footprint

What is connected

14works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Enumeration of weighted plane trees by a permutation model

This work addresses an enumeration problem on weighted bi-colored plane trees with prescribed vertex data, with all vertices labeled distinctly. We give a bijection proof of the enumeration formula originally due to Kochetkov, hence affirmatively answer a question of Adrianov-Pakovich-Zvonkin. The argument is purely combinatorial and totally constructive, remaining valid for real-valued edge weights. A central process is a geometric construction that directly encodes each tree as a permutation. We also exhibit algebraic relationships between the enumeration problem, the partial order on partitions of vertices and the Stirling numbers of the second kind. Some computation examples are presented as appendices.

preprint2022arXiv

A Comprehensive Empirical Investigation on Failure Clustering in Parallel Debugging

The clustering technique has attracted a lot of attention as a promising strategy for parallel debugging in multi-fault scenarios, this heuristic approach (i.e., failure indexing or fault isolation) enables developers to perform multiple debugging tasks simultaneously through dividing failed test cases into several disjoint groups. When using statement ranking representation to model failures for better clustering, several factors influence clustering effectiveness, including the risk evaluation formula (REF), the number of faults (NOF), the fault type (FT), and the number of successful test cases paired with one individual failed test case (NSP1F). In this paper, we present the first comprehensive empirical study of how these four factors influence clustering effectiveness. We conduct extensive controlled experiments on 1060 faulty versions of 228 simulated faults and 141 real faults, and the results reveal that: 1) GP19 is highly competitive across all REFs, 2) clustering effectiveness decreases as NOF increases, 3) higher clustering effectiveness is easier to achieve when a program contains only predicate faults, and 4) clustering effectiveness remains when the scale of NSP1F is reduced to 20%.

preprint2022arXiv

Channel State Acquisition in FDD Massive MIMO: Rate-Distortion Bound and Effectiveness of "Analog" Feedback

We consider the problem of estimating channel fading coefficients (modeled as a correlated Gaussian vector) via Downlink (DL) training and Uplink (UL) feedback in wideband FDD massive MIMO systems. Using rate-distortion theory, we derive optimal bounds on the achievable channel state estimation error in terms of the number of training pilots in DL ($β_{tr}$) and feedback dimension in UL ($β_{fb}$), with random, spatially isotropic pilots. It is shown that when the number of training pilots exceeds the channel covariance rank ($r$), the optimal rate-distortion feedback strategy achieves an estimation error decay of $Θ(SNR^{-α})$ in estimating the channel state, where $α= min (β_{fb}/r , 1)$ is the so-called quality scaling exponent. We also discuss an "analog" feedback strategy, showing that it can achieve the optimal quality scaling exponent for a wide range of training and feedback dimensions with no channel covariance knowledge and simple signal processing at the user side. Our findings are supported by numerical simulations comparing various strategies in terms of channel state mean squared error and achievable ergodic sum-rate in DL with zero-forcing precoding.

preprint2022arXiv

FDD Massive MIMO Channel Training Optimal Rate Distortion Bounds and the Efficiency of one-shot Schemes

We study the problem of providing channel state information (CSI) at the transmitter in multi-user massive MIMO systems operating in frequency division duplexing (FDD). The wideband MIMO channel is a vector-valued random process correlated in time, space (antennas), and frequency (subcarriers). The base station (BS) broadcasts periodically beta_tr pilot symbols from its M antenna ports to K single-antenna users (UEs). Correspondingly, the K UEs send feedback messages about their channel state using beta_fb symbols in the uplink (UL). Using results from remote rate-distortion theory, we show that, as snr reaches infty, the optimal feedback strategy achieves a channel state estimation mean squared error (MSE) that behaves as Theta(1) if beta_tr < r and as Theta(snr^(-alpha)) when beta_tr >=r, where alpha = min(beta_fb/r, 1), where r is the rank of the channel covariance matrix. The MSE-optimal rate-distortion strategy implies encoding of long sequences of channel states, which would yield completely stale CSI and therefore poor multiuser precoding performance. Hence, we consider three practical one-shot CSI strategies with minimum one-slot delay and analyze their large-SNR channel estimation MSE behavior. These are: (1) digital feedback via entropy-coded scalar quantization (ECSQ), (2) analog feedback (AF), and (3) local channel estimation at the UEs and digital feedback. These schemes have different requirements in terms of knowledge of the channel statistics at the UE and at the BS. In particular, the latter strategy requires no statistical knowledge and is closely inspired by a CSI feedback scheme currently proposed in 3GPP standardization.

preprint2022arXiv

Many-Class Text Classification with Matching

In this work, we formulate \textbf{T}ext \textbf{C}lassification as a \textbf{M}atching problem between the text and the labels, and propose a simple yet effective framework named TCM. Compared with previous text classification approaches, TCM takes advantage of the fine-grained semantic information of the classification labels, which helps distinguish each class better when the class number is large, especially in low-resource scenarios. TCM is also easy to implement and is compatible with various large pretrained language models. We evaluate TCM on 4 text classification datasets (each with 20+ labels) in both few-shot and full-data settings, and this model demonstrates significant improvements over other text classification paradigms. We also conduct extensive experiments with different variants of TCM and discuss the underlying factors of its success. Our method and analyses offer a new perspective on text classification.

preprint2021arXiv

Robust Kalman filter-based dynamic state estimation of natural gas pipeline networks

To obtain the accurate transient states of the big scale natural gas pipeline networks under the bad data and non-zero mean noises conditions, a robust Kalman filter-based dynamic state estimation method is proposed using the linearized gas pipeline transient flow equations in this paper. Firstly, the dynamic state estimation model is built. Since the gas pipeline transient flow equations are less than the states, the boundary conditions are used as supplementary constraints to predict the transient states. To increase the measurement redundancy, the zero mass flow rate constraints at the sink nodes are taken as virtual measurements. Secondly, to ensure the stability under bad data condition, the robust Kalman filter algorithm is proposed by introducing a time-varying scalar matrix to regulate the measurement error variances correctly according to the innovation vector at every time step. At last, the proposed method is applied to a 30-node gas pipeline networks in several kinds of measurement conditions. The simulation shows that the proposed robust dynamic state estimation can decrease the effects of bad data and achieve better estimating results.

preprint2020arXiv

An Exploratory Study of Argumentative Writing by Young Students: A Transformer-based Approach

We present a computational exploration of argument critique writing by young students. Middle school students were asked to criticize an argument presented in the prompt, focusing on identifying and explaining the reasoning flaws. This task resembles an established college-level argument critique task. Lexical and discourse features that utilize detailed domain knowledge to identify critiques exist for the college task but do not perform well on the young students data. Instead, transformer-based architecture (e.g., BERT) fine-tuned on a large corpus of critique essays from the college task performs much better (over 20% improvement in F1 score). Analysis of the performance of various configurations of the system suggests that while children's writing does not exhibit the standard discourse structure of an argumentative essay, it does share basic local sequential structures with the more mature writers.

preprint2015arXiv

Parallel Stitching of Two-Dimensional Materials

Diverse parallel stitched two-dimensional heterostructures are synthesized, including metal-semiconductor (graphene-MoS2), semiconductor-semiconductor (WS2-MoS2), and insulator-semiconductor (hBN-MoS2), directly through selective sowing of aromatic molecules as the seeds in chemical vapor deposition (CVD) method. Our methodology enables the large-scale fabrication of lateral heterostructures with arbitrary patterns, and clean and precisely aligned interfaces, which offers tremendous potential for its application in integrated circuits.

preprint2011arXiv

On the Spectrum Handoff for Cognitive Radio Ad Hoc Networks without Common Control Channel

Cognitive radio (CR) technology is a promising solution to enhance the spectrum utilization by enabling unlicensed users to exploit the spectrum in an opportunistic manner. Since unlicensed users are temporary visitors to the licensed spectrum, they are required to vacate the spectrum when a licensed user reclaims it. Due to the randomness of the appearance of licensed users, disruptions to both licensed and unlicensed communications are often difficult to prevent. In this chapter, a proactive spectrum handoff framework for CR ad hoc networks is proposed to address these concerns. In the proposed framework, channel switching policies and a proactive spectrum handoff protocol are proposed to let unlicensed users vacate a channel before a licensed user utilizes it to avoid unwanted interference. Network coordination schemes for unlicensed users are also incorporated into the spectrum handoff protocol design to realize channel rendezvous. Moreover, a distributed channel selection scheme to eliminate collisions among unlicensed users is proposed. In our proposed framework, unlicensed users coordinate with each other without using a common control channel. We compare our proposed proactive spectrum handoff protocol with a reactive spectrum handoff protocol, under which unlicensed users switch channels after collisions with licensed transmissions occur. Simulation results show that our proactive spectrum handoff outperforms the reactive spectrum handoff approach in terms of higher throughput and fewer collisions to licensed users. In addition, we propose a novel three dimensional discrete-time Markov chain to characterize the process of reactive spectrum handoffs and analyze the performance of unlicensed users. We validate the numerical results obtained from our proposed Markov model against simulation and investigate other parameters of interest in the spectrum handoff scenario.

preprint2011arXiv

Optimal Power Control for Concurrent Transmissions of Location-aware Mobile Cognitive Radio Ad Hoc Networks

In a cognitive radio (CR) network, CR users intend to operate over the same spectrum band licensed to legacy networks. A tradeoff exists between protecting the communications in legacy networks and maximizing the throughput of CR transmissions, especially when CR links are unstable due to the mobility of CR users. Because of the non-zero probability of false detection and implementation complexity of spectrum sensing, in this paper, we investigate a sensing-free spectrum sharing scenario for mobile CR ad hoc networks to improve the frequency reuse by incorporating the location awareness capability in CR networks. We propose an optimal power control algorithm for the CR transmitter to maximize the concurrent transmission region of CR users especially in mobile scenarios. Under the proposed power control algorithm, the mobile CR network achieves maximized throughput without causing harmful interference to primary users in the legacy network. Simulation results show that the proposed optimal power control algorithm outperforms the algorithm with the fixed power policy in terms of increasing the packet delivery ratio in the network.

preprint2011arXiv

Performance Analysis of Spectrum Handoff for Cognitive Radio Ad Hoc Networks without Common Control Channel under Homogeneous Primary Traffic

Cognitive radio (CR) technology is regarded as a promising solution to the spectrum scarcity problem. Due to the spectrum varying nature of CR networks, unlicensed users are required to perform spectrum handoffs when licensed users reuse the spectrum. In this paper, we study the performance of the spectrum handoff process in a CR ad hoc network under homogeneous primary traffic. We propose a novel three dimensional discrete-time Markov chain to characterize the process of spectrum handoffs and analyze the performance of unlicensed users. Since in real CR networks, a dedicated common control channel is not practical, in our model, we implement a network coordination scheme where no dedicated common control channel is needed. Moreover, in wireless communications, collisions among simultaneous transmissions cannot be immediately detected and the whole collided packets need to be retransmitted, which greatly affects the network performance. With this observation, we also consider the retransmissions of the collided packets in our proposed discrete-time Markov chain. In addition, besides the random channel selection scheme, we study the impact of different channel selection schemes on the performance of the spectrum handoff process. Furthermore, we also consider the spectrum sensing delay in our proposed Markov model and investigate its effect on the network performance. We validate the numerical results obtained from our proposed Markov model against simulation and investigate other parameters of interest in the spectrum handoff scenario. Our proposed analytical model can be applied to various practical network scenarios. It also provides new insights on the process of spectrum handoffs. Currently, no existing analysis has considered the comprehensive aspects of spectrum handoff as what we consider in this paper.

preprint2007arXiv

Dynamical Systems on Three Manifolds Part I: Knots, Links and Chaos

In this paper, we give an explicit construction of dynamical systems (defined within a solid torus) containing any knot (or link) and arbitrarily knotted chaos. The first is achieved by expressing the knots in terms of braids, defining a system containing the braids and extending periodically to obtain a system naturally defined on a torus and which contains the given knotted trajectories. To get explicit differential equations for dynamical systems containing the braids, we will use a certain function to define a tube neigbourhood of the braid. The second one, generating chaotic systems, is realized by modeling the Smale horseshoe.

preprint2007arXiv

Dynamical Systems On Three Manifolds Part II: 3-Manifolds,Heegaard Splittings and Three-Dimensional Systems

The global behaviour of nonlinear systems is extremely important in control and systems theory since the usual local theories will only give information about a system in some neighbourhood of an operating point. Away from that point, the system may have totally different behaviour and so the theory developed for the local system will be useless for the global one. In this paper we shall consider the analytical and topological structure of systems on 2- and 3- manifolds and show that it is possible to obtain systems with 'arbitrarily strange' behaviour, i.e., arbitrary numbers of chaotic regimes which are knotted and linked in arbitrary ways. We shall do this by considering Heegaard Splittings of these manifolds and the resulting systems defined on the boundaries.

preprint2007arXiv

Inversely Unstable Solutions of Two-Dimensional Systems on Genus-p Surfaces and the Topology of Knotted Attractors

In this paper, we will show that a periodic nonlinear, time-varying dissipative system that is defined on a genus-p surface contains one or more invariant sets which act as attractors. Moreover, we shall generalize a result in [Martins, 2004] and give conditions under which these invariant sets are not homeomorphic to a circle individually, which implies the existence of chaotic behaviour. This is achieved by studying the appearance of inversely unstable solutions within each invariant set.

Yi Song

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

Enumeration of weighted plane trees by a permutation model

A Comprehensive Empirical Investigation on Failure Clustering in Parallel Debugging

Channel State Acquisition in FDD Massive MIMO: Rate-Distortion Bound and Effectiveness of "Analog" Feedback

FDD Massive MIMO Channel Training Optimal Rate Distortion Bounds and the Efficiency of one-shot Schemes

Many-Class Text Classification with Matching

Robust Kalman filter-based dynamic state estimation of natural gas pipeline networks

An Exploratory Study of Argumentative Writing by Young Students: A Transformer-based Approach

Parallel Stitching of Two-Dimensional Materials

On the Spectrum Handoff for Cognitive Radio Ad Hoc Networks without Common Control Channel

Optimal Power Control for Concurrent Transmissions of Location-aware Mobile Cognitive Radio Ad Hoc Networks

Performance Analysis of Spectrum Handoff for Cognitive Radio Ad Hoc Networks without Common Control Channel under Homogeneous Primary Traffic

Dynamical Systems on Three Manifolds Part I: Knots, Links and Chaos

Dynamical Systems On Three Manifolds Part II: 3-Manifolds,Heegaard Splittings and Three-Dimensional Systems

Inversely Unstable Solutions of Two-Dimensional Systems on Genus-p Surfaces and the Topology of Knotted Attractors