Source author record

Jinho Lee

Jinho Lee appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Computer Vision cond-mat.supr-con Artificial Intelligence cond-mat.mtrl-sci cond-mat.str-el Hardware Architecture Computational Engineering, Finance, and Science cond-mat.mes-hall Data Structures and Algorithms Distributed, Parallel, and Cluster Computing Multiagent Systems q-fin.CP q-fin.GN q-fin.PM

Catalog footprint

What is connected

16works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

AGIS: Fast Approximate Graph Pattern Mining with Structure-Informed Sampling

Approximate Graph Pattern Mining (AGPM) is essential for analyzing large-scale graphs where exact counting is computationally prohibitive. While there exist numerous sampling-based AGPM systems, they all rely on uniform sampling and overlook the underlying probability distribution. This limitation restricts their scalability to a broader range of patterns. In this paper, we introduce AGIS, an extremely fast AGPM system capable of counting arbitrary patterns from huge graphs. AGIS employs structure-informed neighbor sampling, a novel sampling technique that deviates from uniformness but allocates specific sampling probabilities based on the pattern structure. We first derive the ideal sampling distribution for AGPM and then present a practical method to approximate it. Furthermore, we develop a method that balances convergence speed and computational overhead, determining when to use the approximated distribution. Experimental results demonstrate that AGIS significantly outperforms the state-of-the-art AGPM system, achieving 28.5x geometric mean speedup and more than 100,000x speedup in specific cases. Furthermore, AGIS is the only AGPM system that scales to graphs with tens of billions of edges and robustly handles diverse patterns, successfully providing accurate estimates within seconds. We will open-source AGIS to encourage further research in this field.

preprint2022arXiv

ETF Portfolio Construction via Neural Network trained on Financial Statement Data

Recently, the application of advanced machine learning methods for asset management has become one of the most intriguing topics. Unfortunately, the application of these methods, such as deep neural networks, is difficult due to the data shortage problem. To address this issue, we propose a novel approach using neural networks to construct a portfolio of exchange traded funds (ETFs) based on the financial statement data of their components. Although a number of ETFs and ETF-managed portfolios have emerged in the past few decades, the ability to apply neural networks to manage ETF portfolios is limited since the number and historical existence of ETFs are relatively smaller and shorter, respectively, than those of individual stocks. Therefore, we use the data of individual stocks to train our neural networks to predict the future performance of individual stocks and use these predictions and the portfolio deposit file (PDF) to construct a portfolio of ETFs. Multiple experiments have been performed, and we have found that our proposed method outperforms the baselines. We believe that our approach can be more beneficial when managing recently listed ETFs, such as thematic ETFs, of which there is relatively limited historical data for training advanced machine learning methods.

preprint2022arXiv

It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher

Model quantization is considered as a promising method to greatly reduce the resource requirements of deep neural networks. To deal with the performance drop induced by quantization errors, a popular method is to use training data to fine-tune quantized networks. In real-world environments, however, such a method is frequently infeasible because training data is unavailable due to security, privacy, or confidentiality concerns. Zero-shot quantization addresses such problems, usually by taking information from the weights of a full-precision teacher network to compensate the performance drop of the quantized networks. In this paper, we first analyze the loss surface of state-of-the-art zero-shot quantization techniques and provide several findings. In contrast to usual knowledge distillation problems, zero-shot quantization often suffers from 1) the difficulty of optimizing multiple loss terms together, and 2) the poor generalization capability due to the use of synthetic samples. Furthermore, we observe that many weights fail to cross the rounding threshold during training the quantized networks even when it is necessary to do so for better performance. Based on the observations, we propose AIT, a simple yet powerful technique for zero-shot quantization, which addresses the aforementioned two problems in the following way: AIT i) uses a KL distance loss only without a cross-entropy loss, and ii) manipulates gradients to guarantee that a certain portion of weights are properly updated after crossing the rounding thresholds. Experiments show that AIT outperforms the performance of many existing methods by a great margin, taking over the overall state-of-the-art position in the field.

preprint2022arXiv

Shai-am: A Machine Learning Platform for Investment Strategies

The finance industry has adopted machine learning (ML) as a form of quantitative research to support better investment decisions, yet there are several challenges often overlooked in practice. (1) ML code tends to be unstructured and ad hoc, which hinders cooperation with others. (2) Resource requirements and dependencies vary depending on which algorithm is used, so a flexible and scalable system is needed. (3) It is difficult for domain experts in traditional finance to apply their experience and knowledge in ML-based strategies unless they acquire expertise in recent technologies. This paper presents Shai-am, an ML platform integrated with our own Python framework. The platform leverages existing modern open-source technologies, managing containerized pipelines for ML-based strategies with unified interfaces to solve the aforementioned issues. Each strategy implements the interface defined in the core framework. The framework is designed to enhance reusability and readability, facilitating collaborative work in quantitative research. Shai-am aims to be a pure AI asset manager for solving various tasks in financial markets.

preprint2021arXiv

DANCE: Differentiable Accelerator/Network Co-Exploration

To cope with the ever-increasing computational demand of the DNN execution, recent neural architecture search (NAS) algorithms consider hardware cost metrics into account, such as GPU latency. To further pursue a fast, efficient execution, DNN-specialized hardware accelerators are being designed for multiple purposes, which far-exceeds the efficiency of the GPUs. However, those hardware-related metrics have been proven to exhibit non-linear relationships with the network architectures. Therefore it became a chicken-and-egg problem to optimize the network against the accelerator, or to optimize the accelerator against the network. In such circumstances, this work presents DANCE, a differentiable approach towards the co-exploration of the hardware accelerator and network architecture design. At the heart of DANCE is a differentiable evaluator network. By modeling the hardware evaluation software with a neural network, the relation between the accelerator architecture and the hardware metrics becomes differentiable, allowing the search to be performed with backpropagation. Compared to the naive existing approaches, our method performs co-exploration in a significantly shorter time, while achieving superior accuracy and hardware cost metrics.

preprint2021arXiv

GradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent

In this paper, we present GradPIM, a processing-in-memory architecture which accelerates parameter updates of deep neural networks training. As one of processing-in-memory techniques that could be realized in the near future, we propose an incremental, simple architectural design that does not invade the existing memory protocol. Extending DDR4 SDRAM to utilize bank-group parallelism makes our operation designs in processing-in-memory (PIM) module efficient in terms of hardware cost and performance. Our experimental results show that the proposed architecture can improve the performance of DNN training and greatly reduce memory bandwidth requirement while posing only a minimal amount of overhead to the protocol and DRAM area.

preprint2021arXiv

Homogeneous superconducting gap in DBCO synthesized by oxide molecular beam epitaxy

Much of what is known about high-temperature cuprate superconductors stems from studies based on two surface analytical tools, angle-resolved photoemission spectroscopy (ARPES) and spectroscopic imaging scanning tunneling microscopy (SI-STM). A question of general interest is whether and when the surface properties probed by ARPES and SI-STM are representative of the intrinsic properties of bulk materials. We find this question is prominent in thin films of a rarely studied cuprate DBCO. We synthesize DBCO films by oxide molecular beam epitaxy and study them by in situ ARPES and SI-STM. Both ARPES and SI-STM show that the surface DBCO layer is different from the bulk of the film. It is heavily underdoped, while the doping level in the bulk is close to optimal doping evidenced by bulk-sensitive mutual inductance measurements. ARPES shows the typical electronic structure of a heavily underdoped CuO2 plane and two sets of one-dimensional bands originating from the CuO chains with one of them gapped. SI-STM reveals two different energy scales in the local density of states, with one corresponding to the superconductivity and the other one to the pseudogap. While the pseudogap shows large variations over the length scale of a few nanometers, the superconducting gap is very homogeneous. This indicates that the pseudogap and superconductivity are of different origins.

preprint2020arXiv

Atomic-scale Electronic Structure of the Cuprate Pair Density Wave State Coexisting with Superconductivity

The defining characteristic of hole-doped cuprates is $d$-wave high temperature superconductivity. However, intense theoretical interest is now focused on whether a pair density wave state (PDW) could coexist with cuprate superconductivity (D. F. Agterberg et al., Annual Review of Condensed Matter Physics 11, 231 (2020)). Here, we use a strong-coupling mean-field theory of cuprates, to model the atomic-scale electronic structure of an eight-unit-cell periodic, $d$-symmetry form factor, pair density wave (PDW) state coexisting with $d$-wave superconductivity (DSC). From this PDW+DSC model, the atomically-resolved density of Bogoliubov quasiparticle states N(r,E) is predicted at the terminal BiO surface of Bi$_2$Sr$_2$CaCu$_2$O$_8$ and compared with high-precision electronic visualization experiments using spectroscopic imaging STM. The PDW+DSC model predictions include the intra-unit-cell structure and periodic modulations of N(r,E), the modulations of the coherence peak energy $Δ_p$ (r), and the characteristics of Bogoliubov quasiparticle interference in scattering-wavevector space (q-space). Consistency between all these predictions and the corresponding experiments indicates that lightly hole-doped Bi$_2$Sr$_2$CaCu$_2$O$_8$ does contain a PDW+DSC state. Moreover, in the model the PDW+DSC state becomes unstable to a pure DSC state at a critical hole density p*, with empirically equivalent phenomena occurring in the experiments. All these results are consistent with a picture in which the cuprate translational symmetry breaking state is a PDW, the observed charge modulations are its consequence, the antinodal pseudogap is that of the PDW state, and the cuprate critical point at p* ~ 19% occurs due to disappearance of this PDW.

preprint2020arXiv

MAPS: Multi-agent Reinforcement Learning-based Portfolio Management System

Generating an investment strategy using advanced deep learning methods in stock markets has recently been a topic of interest. Most existing deep learning methods focus on proposing an optimal model or network architecture by maximizing return. However, these models often fail to consider and adapt to the continuously changing market conditions. In this paper, we propose the Multi-Agent reinforcement learning-based Portfolio management System (MAPS). MAPS is a cooperative system in which each agent is an independent "investor" creating its own portfolio. In the training procedure, each agent is guided to act as diversely as possible while maximizing its own return with a carefully designed loss function. As a result, MAPS as a system ends up with a diversified portfolio. Experiment results with 12 years of US market data show that MAPS outperforms most of the baselines in terms of Sharpe ratio. Furthermore, our results show that adding more agents to our system would allow us to get a higher Sharpe ratio by lowering risk with a more diversified portfolio.

preprint2020arXiv

SimEx: Express Prediction of Inter-dataset Similarity by a Fleet of Autoencoders

Knowing the similarity between sets of data has a number of positive implications in training an effective model, such as assisting an informed selection out of known datasets favorable to model transfer or data augmentation problems with an unknown dataset. Common practices to estimate the similarity between data include comparing in the original sample space, comparing in the embedding space from a model performing a certain task, or fine-tuning a pretrained model with different datasets and evaluating the performance changes therefrom. However, these practices would suffer from shallow comparisons, task-specific biases, or extensive time and computations required to perform comparisons. We present SimEx, a new method for early prediction of inter-dataset similarity using a set of pretrained autoencoders each of which is dedicated to reconstructing a specific part of known data. Specifically, our method takes unknown data samples as input to those pretrained autoencoders, and evaluate the difference between the reconstructed output samples against their original input samples. Our intuition is that, the more similarity exists between the unknown data samples and the part of known data that an autoencoder was trained with, the better chances there could be that this autoencoder makes use of its trained knowledge, reconstructing output samples closer to the originals. We demonstrate that our method achieves more than 10x speed-up in predicting inter-dataset similarity compared to common similarity-estimating practices. We also demonstrate that the inter-dataset similarity estimated by our method is well-correlated with common practices and outperforms the baselines approaches of comparing at sample- or embedding-spaces, without newly training anything at the comparison time.

preprint2020arXiv

SoFAr: Shortcut-based Fractal Architectures for Binary Convolutional Neural Networks

Binary Convolutional Neural Networks (BCNNs) can significantly improve the efficiency of Deep Convolutional Neural Networks (DCNNs) for their deployment on resource-constrained platforms, such as mobile and embedded systems. However, the accuracy degradation of BCNNs is still considerable compared with their full precision counterpart, impeding their practical deployment. Because of the inevitable binarization error in the forward propagation and gradient mismatch problem in the backward propagation, it is nontrivial to train BCNNs to achieve satisfactory accuracy. To ease the difficulty of training, the shortcut-based BCNNs, such as residual connection-based Bi-real ResNet and dense connection-based BinaryDenseNet, introduce additional shortcuts in addition to the shortcuts already present in their full precision counterparts. Furthermore, fractal architectures have been also been used to improve the training process of full-precision DCNNs since the fractal structure triggers effects akin to deep supervision and lateral student-teacher information flow. Inspired by the shortcuts and fractal architectures, we propose two Shortcut-based Fractal Architectures (SoFAr) specifically designed for BCNNs: 1. residual connection-based fractal architectures for binary ResNet, and 2. dense connection-based fractal architectures for binary DenseNet. Our proposed SoFAr combines the adoption of shortcuts and the fractal architectures in one unified model, which is helpful in the training of BCNNs. Results show that our proposed SoFAr achieves better accuracy compared with shortcut-based BCNNs. Specifically, the Top-1 accuracy of our proposed RF-c4d8 ResNet37(41) and DRF-c2d2 DenseNet51(53) on ImageNet outperforms Bi-real ResNet18(64) and BinaryDenseNet51(32) by 3.29% and 1.41%, respectively, with the same computational complexity overhead.

preprint2016arXiv

Detection of a Cooper-Pair Density Wave in Bi$_{2}$Sr$_{2}$CaCu$_{2}$O$_{8+x}$

The quantum condensate of Cooper-pairs forming a superconductor was originally conceived to be translationally invariant. In theory, however, pairs can exist with finite momentum $Q$ and thereby generate states with spatially modulating Cooper-pair density. While never observed directly in any superconductor, such a state has been created in ultra-cold $^{6}$Li gas. It is now widely hypothesized that the cuprate pseudogap phase contains such a 'pair density wave' (PDW) state. Here we use nanometer resolution scanned Josephson tunneling microscopy (SJTM) to image Cooper-pair tunneling from a $d$-wave superconducting STM tip to the condensate of Bi$_{2}$Sr$_{2}$CaCu$_{2}$O$_{8+x}$. Condensate visualization capabilities are demonstrated directly using the Cooper-pair density variations surrounding Zn impurity atoms and at the Bi$_{2}$Sr$_{2}$CaCu$_{2}$O$_{8+x}$ crystal-supermodulation. Then, by using Fourier analysis of SJTM images, we discover the direct signature of a Cooper-pair density modulation at wavevectors $Q_{p} \approx (0.25,0)2π/ a_{0}$;$(0,0.25)2π/ a_{0}$ in Bi$_{2}$Sr$_{2}$CaCu$_{2}$O$_{8+x}$. The amplitude of these modulations is ~5% of the homogenous condensate density and their form factor exhibits primarily $s$/$s'$-symmetry. This phenomenology is expected within Ginzburg-Landau theory when a charge density wave with $d$-symmetry form factor and wave vector $Q_{c}=Q_{p}$ coexists with a homogeneous $d$-symmetry superconductor ; it is also encompassed by several contemporary microscopic theories for the pseudogap phase.

preprint2016arXiv

Globally Optimal Object Tracking with Fully Convolutional Networks

Tracking is one of the most important but still difficult tasks in computer vision and pattern recognition. The main difficulties in the tracking field are appearance variation and occlusion. Most traditional tracking methods set the parameters or templates to track target objects in advance and should be modified accordingly. Thus, we propose a new and robust tracking method using a Fully Convolutional Network (FCN) to obtain an object probability map and Dynamic Programming (DP) to seek the globally optimal path through all frames of video. Our proposed method solves the object appearance variation problem with the use of a FCN and deals with occlusion by DP. We show that our method is effective in tracking various single objects through video frames.

preprint2014arXiv

Imaging Dirac-Mass Disorder from Magnetic Dopant-Atoms in the Ferromagnetic Topological Insulator Cr$_x$(Bi$_{0.1}$Sb$_{0.9}$)$_{2-x}$Te$_3$

To achieve and utilize the most exotic electronic phenomena predicted for the surface states of 3D topological insulators (TI),it is necessary to open a "Dirac-mass gap" in their spectrum by breaking time-reversal symmetry. Use of magnetic dopant atoms to generate a ferromagnetic state is the most widely used approach. But it is unknown how the spatial arrangements of the magnetic dopant atoms influence the Dirac-mass gap at the atomic scale or, conversely, whether the ferromagnetic interactions between dopant atoms are influenced by the topological surface states. Here we image the locations of the magnetic (Cr) dopant atoms in the ferromagnetic TI Cr$_{0.08}$(Bi$_{0.1}$Sb$_{0.9}$)$_{1.92}$Te$_3$. Simultaneous visualization of the Dirac-mass gap $Δ(r)$ reveals its intense disorder, which we demonstrate directly is related to fluctuations in $n(r)$, the Cr atom areal density in the termination layer. We find the relationship of surface-state Fermi wavevectors to the anisotropic structure of $Δ(r)$ consistent with predictions for surface ferromagnetism mediated by those states. Moreover, despite the intense Dirac-mass disorder, the anticipated relationship $Δ(r)\propto n(r)$ is confirmed throughout, and exhibits an electron-dopant interaction energy $J^*$=145$meV\cdot nm^2$. These observations reveal how magnetic dopant atoms actually generate the TI mass gap locally and that, to achieve the novel physics expected of time-reversal-symmetry breaking TI materials, control of the resulting Dirac-mass gap disorder will be essential.

preprint2014arXiv

Simultaneous Transitions in Cuprate Momentum-Space Topology and Electronic Symmetry Breaking

The existence of electronic symmetry breaking in the underdoped cuprates, and its disappearance with increased hole-density $p$, are now widely reported. However, the relationship between this transition and the momentum space ($\vec{k}$-space) electronic structure underpinning the superconductivity has not been established. Here we visualize the $\vec{Q}$=0 (intra-unit-cell) and $\vec{Q}\neq$0 (density wave) broken-symmetry states simultaneously with the coherent $\vec{k}$-space topology, for Bi$_2$Sr$_2$CaCu$_2$O$_{8+d}$ samples spanning the phase diagram 0.06$\leq p \leq$0.23. We show that the electronic symmetry breaking tendencies weaken with increasing $p$ and disappear close to $p_c$=0.19. Concomitantly, the coherent $\vec{k}$-space topology undergoes an abrupt transition, from arcs to closed contours, at the same $p_c$. These data reveal that the $\vec{k}$-space topology transformation in cuprates is linked intimately with the disappearance of the electronic symmetry breaking at a concealed critical point.

preprint2014arXiv

Spin-dependent polaron formation dynamics in Eu$_{0.75}$Y$_{0.25}$MnO$_3$ probed by femtosecond pump-probe spectroscopy

We present a femtosecond optical pump-probe study of the multiferroic manganite Eu$_{0.75}$Y$_{0.25}$MnO$_3$. The optical response of the material at pump energies of 1.55 and 3.1 eV is dominated by the $d$-$d$ and $p$-$d$ transitions of the Mn$^{3+}$ ions. The relaxation of photoexcited electrons includes the relaxation of the Jahn-Teller distortion and polaron trapping at Mn$^{2+}$ and Mn$^{4+}$ sites. Ultrafast switching of superexchange interactions due to modulated $e_g$ orbital occupancy creates a localized spin excitation, which then decays on a time scale of tens of picoseconds at low temperatures. The localized spin state decay appears as a tremendous increase in the amplitude of the photoinduced reflectance, due to the strong coupling of optical transitions to the spin-spin correlations in the crystalline $a$-$b$ plane.

Jinho Lee

What is connected

Connect this record

See the researcher in context

Building this map preview

16 published item(s)

AGIS: Fast Approximate Graph Pattern Mining with Structure-Informed Sampling

ETF Portfolio Construction via Neural Network trained on Financial Statement Data

It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher

Shai-am: A Machine Learning Platform for Investment Strategies

DANCE: Differentiable Accelerator/Network Co-Exploration

GradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent

Homogeneous superconducting gap in DBCO synthesized by oxide molecular beam epitaxy

Atomic-scale Electronic Structure of the Cuprate Pair Density Wave State Coexisting with Superconductivity

MAPS: Multi-agent Reinforcement Learning-based Portfolio Management System

SimEx: Express Prediction of Inter-dataset Similarity by a Fleet of Autoencoders

SoFAr: Shortcut-based Fractal Architectures for Binary Convolutional Neural Networks

Detection of a Cooper-Pair Density Wave in Bi$_{2}$Sr$_{2}$CaCu$_{2}$O$_{8+x}$

Globally Optimal Object Tracking with Fully Convolutional Networks

Imaging Dirac-Mass Disorder from Magnetic Dopant-Atoms in the Ferromagnetic Topological Insulator Cr$_x$(Bi$_{0.1}$Sb$_{0.9}$)$_{2-x}$Te$_3$

Simultaneous Transitions in Cuprate Momentum-Space Topology and Electronic Symmetry Breaking

Spin-dependent polaron formation dynamics in Eu$_{0.75}$Y$_{0.25}$MnO$_3$ probed by femtosecond pump-probe spectroscopy