Source author record

Song Yang

Song Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

math.AG, Machine Learning, math.DG, quant-ph, Computer Vision, cond-mat.mes-hall, math.AT, Artificial Intelligence, cond-mat.mtrl-sci, Data Structures and Algorithms, eess.AS, eess.IV, math.RA, math.SG, Methodology, Networking and Internet Architecture, physics.chem-ph, physics.optics, Sound

Catalog footprint

What is connected

16 works

19 topics

4 close collaborators


preprint, 2023, arXiv

Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding

Temporal sentence grounding (TSG) aims to identify the temporal boundary of a specific segment from an untrimmed video by a sentence query. All existing works first utilize a sparse sampling strategy to extract a fixed number of video frames and then conduct multi-modal interactions with the query sentence for reasoning. However, we argue that these methods have overlooked two indispensable issues: 1) Boundary-bias: The annotated target segment generally refers to two specific frames as corresponding start and end timestamps. The video downsampling process may lose these two frames and take the adjacent irrelevant frames as new boundaries. 2) Reasoning-bias: Such incorrect new boundary frames also lead to reasoning bias during frame-query interaction, reducing the generalization ability of the model. To alleviate the above limitations, in this paper, we propose a novel Siamese Sampling and Reasoning Network (SSRN) for TSG, which introduces a siamese sampling mechanism to generate additional contextual frames to enrich and refine the new boundaries. Specifically, a reasoning strategy is developed to learn the inter-relationship among these frames and generate soft labels on boundaries for more accurate frame-query reasoning. Such a mechanism is also able to supplement the absent consecutive visual semantics to the sampled sparse frames for fine-grained activity understanding. Extensive experiments demonstrate the effectiveness of SSRN on three challenging datasets.
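The boundary-bias issue described in this abstract can be illustrated with a small sketch. This is not SSRN and not the authors' code: the midpoint-based uniform sampling, the function names `sparse_sample` and `snap_boundaries`, and the frame numbers are all assumptions chosen for illustration. It only shows how annotated start and end frames can be replaced by neighbouring frames once a video is sparsely sampled.

```python
def sparse_sample(num_frames, num_samples):
    """Uniformly pick `num_samples` frame indices (segment midpoints) from a video.

    Hypothetical sampling scheme, assumed here for illustration only.
    """
    step = num_frames / num_samples
    return [int(step * i + step / 2) for i in range(num_samples)]


def snap_boundaries(start, end, sampled):
    """Map annotated boundary frames to the nearest frames that survived sampling."""
    new_start = min(sampled, key=lambda f: abs(f - start))
    new_end = min(sampled, key=lambda f: abs(f - end))
    return new_start, new_end


# A 300-frame video reduced to 10 frames; the annotation says frames 47..182.
sampled = sparse_sample(num_frames=300, num_samples=10)
new_start, new_end = snap_boundaries(start=47, end=182, sampled=sampled)

print("sampled frames:", sampled)
print("annotated boundaries:", (47, 182))
print("boundaries after sampling:", (new_start, new_end))
```

In this toy setting neither annotated frame is kept, so the segment boundaries shift to adjacent frames, which is the effect the abstract calls boundary-bias.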

preprint, 2022, arXiv

Bott-Chern hypercohomology and bimeromorphic invariants

The aim of this article is to study the geometry of Bott-Chern hypercohomology from the bimeromorphic point of view. We construct some new bimeromorphic invariants involving the cohomology for the sheaf of germs of pluriharmonic functions, the truncated holomorphic de Rham cohomology, and the de Rham cohomology. To define these invariants, using a sheaf-theoretic approach, we establish a blow-up formula together with a canonical morphism for the Bott-Chern hypercohomology. In particular, we compute the invariants of some compact complex threefolds, such as Iwasawa manifolds and quintic threefolds.

preprint, 2022, arXiv

Hodge cohomology on blow-ups along subvarieties

We establish a blow-up formula for Hodge cohomology of locally free sheaves on smooth proper varieties over an algebraically closed field of positive characteristic. For this, we introduce a notion of relative Hodge sheaves and study their behavior under blow-ups along smooth centers. In particular, as an application, we study the blow-up invariance of the $E_2$-degeneracy of the Hochschild--Kostant--Rosenberg spectral sequence for smooth proper varieties.

preprint, 2022, arXiv

Replacing the Framingham-based equation for prediction of cardiovascular disease risk and adverse outcome by using artificial intelligence and retinal imaging

Purpose: To create and evaluate the accuracy of an artificial intelligence deep learning platform (ORAiCLE) capable of using only retinal fundus images to predict both an individual's overall 5-year cardiovascular disease (CVD) risk and the relative contribution of the component risk factors that comprise this risk. Methods: We used 165,907 retinal images from a database of 47,236 patient visits. Initially, each image was paired with biometric data (age, ethnicity, sex, presence and duration of diabetes, and HDL/LDL ratios) as well as any CVD event within 5 years of the retinal image acquisition. A risk score based on Framingham equations was calculated. The real CVD event rate was also determined for the individuals and overall population. Finally, ORAiCLE was trained using only age, ethnicity, sex plus retinal images. Results: Compared to the Framingham-based score, ORAiCLE was up to 12% more accurate in predicting a cardiovascular event in the next 5 years, especially for the highest risk group of people. The reliability and accuracy of each of the restrictive models were inferior to ORAiCLE's performance, indicating that it was using both sets of data to derive its final results. Conclusion: Retinal photography is inexpensive and only minimal training is required to acquire images, as fully automated, inexpensive camera systems are now widely available. As such, AI-based CVD risk algorithms such as ORAiCLE promise to make CV health screening more accurate, more affordable and more accessible for all. Furthermore, ORAiCLE's unique ability to assess the relative contribution of the components that comprise an individual's overall risk would inform treatment decisions based on the specific needs of an individual, thereby increasing the likelihood of positive health outcomes.

preprint, 2022, arXiv

Space Meets Time: Local Spacetime Neural Network For Traffic Flow Forecasting

Traffic flow forecasting is a crucial task in urban computing. The challenge arises as traffic flows often exhibit intrinsic and latent spatio-temporal correlations that cannot be identified by extracting the spatial and temporal patterns of traffic data separately. We argue that such correlations are universal and play a pivotal role in traffic flow. We put forward spacetime interval learning as a paradigm to explicitly capture these correlations through a unified analysis of both spatial and temporal features. Unlike the state-of-the-art methods, which are restricted to a particular road network, we model the universal spatio-temporal correlations that are transferable from city to city. To this end, we propose a new spacetime interval learning framework that constructs a local-spacetime context of a traffic sensor comprising the data from its neighbors within close time points. Based on this idea, we introduce the local spacetime neural network (STNN), which employs novel spacetime convolution and attention mechanisms to learn the universal spatio-temporal correlations. The proposed STNN captures local traffic patterns, which do not depend on a specific network structure. As a result, a trained STNN model can be applied to any unseen traffic network. We evaluate the proposed STNN on two public real-world traffic datasets and a simulated dataset on dynamic networks. The experiment results show that STNN not only improves prediction accuracy by 4% over state-of-the-art methods, but is also effective when the traffic network undergoes dynamic changes and shows superior generalization capability.

preprint, 2021, arXiv

Mono-elemental saturable absorber in mode-locked fiber laser: A review

Two-dimensional mono-elemental material is an excellent saturable absorber candidate with low saturation intensity, large modulation depth, high nonlinearities, and fast recovery time of excited carriers. Typically, these mono-elemental materials with two-dimensional structure possess a bandgap that is tunable from metallic to semiconducting depending on the number of layers. The successful application of these materials as saturable absorbers has advanced the development of mode-locked fiber lasers. Therefore, this review is intended to provide up-to-date information on the development of mono-elemental saturable absorbers for advances in mode-locked fiber lasers, with emphasis on their material properties, synthesis process and material characterization. Meanwhile, issues and challenges of the research topic are highlighted and addressed with several concrete recommendations.

preprint, 2020, arXiv

Multimodal Learning For Classroom Activity Detection

Classroom activity detection (CAD) focuses on accurately classifying whether the teacher or student is speaking and recording the length of individual utterances during a class. A CAD solution helps teachers get instant feedback on their pedagogical instructions. This greatly improves educators' teaching skills and hence leads to students' achievement. However, CAD is very challenging because (1) the CAD model needs to generalize well enough across different teachers and students; (2) data from both vocal and language modalities has to be wisely fused so that they can be complementary; and (3) the solution shouldn't heavily rely on additional recording devices. In this paper, we address the above challenges by using a novel attention based neural framework. Our framework not only extracts both speech and language information, but also utilizes an attention mechanism to capture long-term semantic dependence. Our framework is device-free and is able to take any classroom recording as input. The proposed CAD learning framework is evaluated in two real-world education applications. The experimental results demonstrate the benefits of our approach on learning attention based neural networks from classroom data with different modalities, and show our approach is able to outperform state-of-the-art baselines in terms of various evaluation metrics.

preprint, 2019, arXiv

Chaos Phase Induced Mass-producible Monolayer Two-dimensional Material

Crystal phase is well studied and presents a periodic atom arrangement in a three-dimensional lattice, but the "amorphous phase" is poorly understood. Here, by starting from a cage-like bicyclocalix[2]arene[2]triazines building block, a brand-new 2D MOF is constructed with extremely weak interlaminar interaction existing between two adjacent 2D-crystal layers. Inter-layer slip happens under external disturbance and leads to the loss of periodicity in one dimension of the crystal lattice, resulting in an interim phase between the crystal and amorphous phase - the chaos phase, non-periodic at the microscopic scale but orderly at the mesoscopic scale. This chaos phase 2D MOF is a disordered self-assembly of black-phosphorus like 3D-layer, which has excellent mechanical strength and a thickness of 1.15 nm. The bulky 2D-MOF material is readily exfoliated into monolayer nanosheets at gram scale with unprecedented evenness and homogeneity, as well as previously unattained lateral size (>10 μm), which presents the first mass-producible monolayer 2D material and can form a wafer-scale film on a substrate.

preprint, 2018, arXiv

Bott-Chern blow-up formula and bimeromorphic invariance of the $\partial\bar{\partial}$-Lemma for threefolds

The purpose of this paper is to study the bimeromorphic invariants of compact complex manifolds in terms of Bott-Chern cohomology. We prove a blow-up formula for Bott-Chern cohomology. As an application, we show that for compact complex threefolds the non-Kählerness degrees, introduced by Angella-Tomassini [Invent. Math. 192, (2013), 71-81], are bimeromorphic invariants. Consequently, the $\partial\bar{\partial}$-Lemma on threefolds admits the bimeromorphic invariance.

preprint, 2016, arXiv

Locally conformal symplectic blow-ups

In this paper, we study the blow-up of a locally conformal symplectic manifold. We show that there exists a locally conformal symplectic structure on the blow-up of a locally conformal symplectic manifold along a compact induced symplectic submanifold.

preprint, 2016, arXiv

Optimization Problems in Correlated Networks

Solving the shortest path and the min-cut problems is key to achieving high-performance and robust communication networks. Those problems have often been studied in deterministic and independent networks, both in their original formulations and in several constrained variants. However, in real-world networks, link weights (e.g., delay, bandwidth, failure probability) are often correlated due to spatial or temporal reasons, and these correlated link weights together behave in a different manner and are not always additive. In this paper, we first propose two correlated link-weight models, namely (i) the deterministic correlated model and (ii) the (log-concave) stochastic correlated model. Subsequently, we study the shortest path problem and the min-cut problem under these two correlated models. We prove that these two problems are NP-hard under the deterministic correlated model, and cannot even be approximated to arbitrary degree in polynomial time. However, these two problems are polynomial-time solvable under the (constrained) nodal deterministic correlated model, and can be solved by convex optimization under the (log-concave) stochastic correlated model.

preprint, 2015, arXiv

Batalin-Vilkovisky algebras and the noncommutative Poincare duality of Koszul Calabi-Yau algebras

Let $A$ be a Koszul Calabi-Yau algebra. We show that there exists an isomorphism of Batalin-Vilkovisky algebras between the Hochschild cohomology ring of $A$ and that of its Koszul dual algebra $A^!$. This confirms (a generalization of) a conjecture of R. Rouquier.

preprint, 2012, arXiv

Efficient Semiparametric Estimation of Short-term and Long-term Hazard Ratios with Right-Censored Data

The proportional hazards assumption in the commonly used Cox model for censored failure time data is often violated in scientific studies. Yang and Prentice (2005) proposed a novel semiparametric two-sample model that includes the proportional hazards model and the proportional odds model as sub-models, and accommodates crossing survival curves. The model leaves the baseline hazard unspecified and the two model parameters can be interpreted as the short-term and long-term hazard ratios. Inference procedures were developed based on a pseudo score approach. Although extension to accommodate covariates was mentioned, no formal procedures have been provided or proved. Furthermore, the pseudo score approach may not be asymptotically efficient. We study the extension of the short-term and long-term hazard ratio model of Yang and Prentice (2005) to accommodate potentially time-dependent covariates. We develop efficient likelihood-based estimation and inference procedures. The nonparametric maximum likelihood estimators are shown to be consistent, asymptotically normal, and asymptotically efficient. Extensive simulation studies demonstrate that the proposed methods perform well in practical settings. The proposed method captured the phenomenon of crossing hazards in a cancer clinical trial and identified a genetic marker with significant long-term effect missed by using the proportional hazards model on age-at-onset of alcoholism in a genetic study.
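For orientation, the two-sample model referred to in this abstract is usually written as below. This is a recalled statement of the Yang and Prentice (2005) form, not text from the abstract, and the symbols $\theta_1$, $\theta_2$, $\lambda_C$, $\lambda_T$ and $S_C$ are notation assumed here (control and treatment hazards, and the control survival function).

```latex
\[
  \frac{\lambda_T(t)}{\lambda_C(t)}
  \;=\;
  \frac{\theta_1 \theta_2}{\theta_1 + (\theta_2 - \theta_1)\, S_C(t)},
  \qquad \theta_1, \theta_2 > 0 .
\]
% Short-term limit: S_C(t) -> 1 as t -> 0, so the ratio tends to theta_1.
% Long-term limit:  S_C(t) -> 0 as t -> infinity, so the ratio tends to theta_2.
% theta_1 = theta_2 gives a constant ratio (proportional hazards);
% theta_2 = 1 gives theta_1 / (theta_1 + (1 - theta_1) S_C(t)) (proportional odds).
```

Under this form the hazard ratio moves monotonically from $\theta_1$ to $\theta_2$, so the two survival curves can cross when one parameter is below 1 and the other above 1, which is the situation the abstract says the Cox model cannot accommodate.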

preprint, 2011, arXiv

Entanglement Enhanced Information Transfer through Strongly Correlated Systems and its Application to Optical Lattices

We show that the inherent entanglement of the ground state of strongly correlated systems can be exploited for both classical and quantum communications. Our strategy is based on a single qubit rotation which encodes information in the entangled nature of the ground state. In classical communication, our mechanism conveys more than one bit of information in each shot, just as dense coding does, without demanding long range entanglement. In our scheme for quantum communication, which may more appropriately be considered as a remote state preparation, the quality is higher than the highly studied attaching scenarios. Moreover, we propose to implement this new way of communication in optical lattices where all the requirements of our proposal have already been achieved.

preprint, 2011, arXiv

Multipartite continuous-variable entanglement distillation using local squeezing and only one photon-subtraction operation

In this paper, we study entanglement distillation of multipartite continuous-variable Gaussian entangled states. Following Opatrný et al.'s photon subtraction (PS) scheme, the probability of successful distillation decreases exponentially with the number of parties $N$. However, here, we shall propose an entanglement distillation scheme whose success probability scales as a constant with $N$. Our protocol employs several local squeezers, but it requires only a single PS operation. Using the logarithmic negativity as a measure of entanglement, we find that both the success probability and the distilled entanglement can be improved at the same time. Moreover, an $N$-mode transfer theorem (transferring states from phase space to Hilbert space) is presented.

preprint, 2010, arXiv

Spin State Transfer in Laterally Coupled Quantum Dot Chains with Disorders

Quantum dot arrays are a promising medium for transferring quantum information between two distant points without resorting to mobile qubits. Here we study the two most common disorders, namely hyperfine interaction and exchange coupling fluctuations, in quantum dot arrays and their effects on quantum communication through these chains. Our results show that the hyperfine interaction is more destructive than the exchange coupling fluctuations. The average optimal time for communication is not affected by any disorder in the system, and our simulations show that anti-ferromagnetic chains are much more resistant than the ferromagnetic ones against both kinds of disorder. Even when time modulation of a coupling and optimal control are employed to improve the transmission, the anti-ferromagnetic chain performs much better. We have assumed the quasi-static approximation for hyperfine interaction and time dependent fluctuations in the exchange couplings. In particular, for studying exchange coupling fluctuations we have considered static disorder, white noise and $1/f$ noise.
