Source author record

Shenghua Liu

Shenghua Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Social and Information Networks Machine Learning Artificial Intelligence cond-mat.mtrl-sci cond-mat.supr-con physics.soc-ph

Catalog footprint

What is connected

7works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Learning node embeddings via summary graphs: a brief theoretical analysis

Graph representation learning plays an important role in many graph mining applications, but learning embeddings of large-scale graphs remains a problem. Recent works try to improve scalability via graph summarization -- i.e., they learn embeddings on a smaller summary graph, and then restore the node embeddings of the original graph. However, all existing works depend on heuristic designs and lack theoretical analysis. Different from existing works, we contribute an in-depth theoretical analysis of three specific embedding learning methods based on introduced kernel matrix, and reveal that learning embeddings via graph summarization is actually learning embeddings on a approximate graph constructed by the configuration model. We also give analysis about approximation error. To the best of our knowledge, this is the first work to give theoretical analysis of this approach. Furthermore, our analysis framework gives interpretation of some existing methods and provides great insights for future work on this problem.

preprint2022arXiv

MonLAD: Money Laundering Agents Detection in Transaction Streams

Given a stream of money transactions between accounts in a bank, how can we accurately detect money laundering agent accounts and suspected behaviors in real-time? Money laundering agents try to hide the origin of illegally obtained money by dispersive multiple small transactions and evade detection by smart strategies. Therefore, it is challenging to accurately catch such fraudsters in an unsupervised manner. Existing approaches do not consider the characteristics of those agent accounts and are not suitable to the streaming settings. Therefore, we propose MonLAD and MonLAD-W to detect money laundering agent accounts in a transaction stream by keeping track of their residuals and other features; we devise AnoScore algorithm to find anomalies based on the robust measure of statistical deviation. Experimental results show that MonLAD outperforms the state-of-the-art baselines on real-world data and finds various suspicious behavior patterns of money laundering. Additionally, several detected suspected accounts have been manually-verified as agents in real money laundering scenario.

preprint2022arXiv

Multi-scale Anomaly Detection for Big Time Series of Industrial Sensors

Given a multivariate big time series, can we detect anomalies as soon as they occur? Many existing works detect anomalies by learning how much a time series deviates away from what it should be in the reconstruction framework. However, most models have to cut the big time series into small pieces empirically since optimization algorithms cannot afford such a long series. The question is raised: do such cuts pollute the inherent semantic segments, like incorrect punctuation in sentences? Therefore, we propose a reconstruction-based anomaly detection method, MissGAN, iteratively learning to decode and encode naturally smooth time series in coarse segments, and finding out a finer segment from low-dimensional representations based on HMM. As a result, learning from multi-scale segments, MissGAN can reconstruct a meaningful and robust time series, with the help of adversarial regularization and extra conditional states. MissGAN does not need labels or only needs labels of normal instances, making it widely applicable. Experiments on industrial datasets of real water network sensors show our MissGAN outperforms the baselines with scalability. Besides, we use a case study on the CMU Motion dataset to demonstrate that our model can well distinguish unexpected gestures from a given conditional motion.

preprint2020arXiv

Summarizing graphs using the configuration model

Given a large graph, how can we summarize it with fewer nodes and edges while maintaining its key properties, such as spectral property? Although graphs play more and more important roles in many real-world applications, the growth of their size presents great challenges to graph analysis. As a solution, graph summarization, which aims to find a compact representation that preserves the important properties of a given graph, has received much attention, and numerous algorithms have been developed for it. However, most of the algorithms adopt the uniform reconstruction scheme, which is based on an unrealistic assumption that edges are uniformly distributed. In this work, we propose a novel and realistic reconstruction scheme, which preserves the degree of nodes, and we develop an efficient graph summarization algorithm called DPGS based on the Minimum Description Length principle. We theoretically analyze the difference between the original and summary graphs from a spectral perspective, and we perform extensive experiments on multiple real-world datasets. The results show that DPGS yields compact representation that preserves the essential properties of the original graph.

preprint2016arXiv

Efficient Thermal Conductance in Organometallic Perovskite CH3NH3PbI3 Films

Perovskite-based optoelectronic devices have shown great promise for solar conversion and other optoelectronic applications, but their long-term performance instability is regarded as a major obstacle to their widespread deployment. Previous works have shown that the ultralow thermal conductivity and inefficient heat spreading might put an intrinsic limit on the lifetime of perovskite devices. Here, we report the observation of a remarkably efficient thermal conductance, with conductivity of 11.2 +/- 0.8 W m^-1 K^-1 at room temperature, in densely-packed perovskite CH3NH3PbI3 films, via noncontact time-domain thermal reflectance measurements. The temperature-dependent experiments suggest the important roles of organic cations and structural phase transitions, which are further confirmed by temperature-dependent Raman spectra. The thermal conductivity at room temperature observed here is over one order of magnitude larger than that in the early report, suggesting that perovskite device performance will not be limited by thermal stability.

preprint2015arXiv

Learning user-specific latent influence and susceptibility from information cascades

Predicting cascade dynamics has important implications for understanding information propagation and launching viral marketing. Previous works mainly adopt a pair-wise manner, modeling the propagation probability between pairs of users using n^2 independent parameters for n users. Consequently, these models suffer from severe overfitting problem, specially for pairs of users without direct interactions, limiting their prediction accuracy. Here we propose to model the cascade dynamics by learning two low-dimensional user-specific vectors from observed cascades, capturing their influence and susceptibility respectively. This model requires much less parameters and thus could combat overfitting problem. Moreover, this model could naturally model context-dependent factors like cumulative effect in information propagation. Extensive experiments on synthetic dataset and a large-scale microblogging dataset demonstrate that this model outperforms the existing pair-wise models at predicting cascade dynamics, cascade size, and "who will be retweeted".

preprint2013arXiv

Orbital-Selective Mottness in KxFe2-ySe2 Superconductors Revealed by Pump-Probe Spectroscopy

We report transient optical signatures of the orbital-selective Mottness in superconducting KxFe2-ySe2 crystals by using dual-color pump-probe spectroscopy. Besides multi-exponential decay recovery dynamics of photo-induced quasiparticles, a damped oscillatory component due to coherent acoustic phonons emerges when the superconducting phase is suppressed by increasing the temperature or excitation power. The oscillatory component diminishes with significant enhancement of a slow decay component upon raising temperature to 150-160 K. These results are in consistence with the picture of orbital-selective Mott phase transition, indicating a vital role played by electron correlation in the iron-based superconductors.

Shenghua Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

Learning node embeddings via summary graphs: a brief theoretical analysis

MonLAD: Money Laundering Agents Detection in Transaction Streams

Multi-scale Anomaly Detection for Big Time Series of Industrial Sensors

Summarizing graphs using the configuration model

Efficient Thermal Conductance in Organometallic Perovskite CH3NH3PbI3 Films

Learning user-specific latent influence and susceptibility from information cascades

Orbital-Selective Mottness in KxFe2-ySe2 Superconductors Revealed by Pump-Probe Spectroscopy