Source author record

Ye Luo

Ye Luo appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher. Unclaimed source record.

Catalog footprint

What is connected

18 works

20 topics

4 close collaborators


preprint, 2026, arXiv

Uncertainty-Adjusted Sorting for Asset Pricing with Machine Learning

Machine learning is central to empirical asset pricing, but portfolio construction still relies on point predictions and largely ignores asset-specific estimation uncertainty. We propose a simple change: sort assets using uncertainty-adjusted prediction bounds instead of point predictions alone. Across a broad set of ML models and a U.S. equity panel, this approach improves portfolio performance relative to point-prediction sorting. These gains persist even when bounds are built from partial or misspecified uncertainty information. They arise mainly from reduced volatility and are strongest for flexible machine learning models. Identification and robustness exercises show that these improvements are driven by asset-level rather than time or aggregate predictive uncertainty.
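As a rough illustration of the sorting idea described in this abstract, the sketch below ranks assets by a prediction bound instead of the point prediction. The bound form (prediction minus or plus `k` times a per-asset standard error), the function names, and the toy numbers are all assumptions made for illustration; the abstract does not specify how the bounds are built.

```python
from typing import Dict, List, Tuple


def point_sort(preds: Dict[str, float], n: int) -> Tuple[List[str], List[str]]:
    """Long the n assets with the highest point predictions, short the n lowest."""
    ranked = sorted(preds, key=lambda a: preds[a], reverse=True)
    return ranked[:n], ranked[-n:]


def bound_sort(
    preds: Dict[str, float], stderr: Dict[str, float], n: int, k: float = 1.0
) -> Tuple[List[str], List[str]]:
    """Long the n highest lower bounds, short the n lowest upper bounds.

    Assumed (hypothetical) bound form: pred -/+ k * stderr.
    """
    lower = {a: preds[a] - k * stderr[a] for a in preds}
    upper = {a: preds[a] + k * stderr[a] for a in preds}
    longs = sorted(preds, key=lambda a: lower[a], reverse=True)[:n]
    shorts = sorted(preds, key=lambda a: upper[a])[:n]
    return longs, shorts


# Toy predicted returns and per-asset uncertainty (made-up numbers).
preds = {"A": 0.030, "B": 0.025, "C": 0.010, "D": -0.020, "E": -0.015}
stderr = {"A": 0.040, "B": 0.005, "C": 0.004, "D": 0.030, "E": 0.003}

print(point_sort(preds, 1))          # picks the extreme point predictions
print(bound_sort(preds, stderr, 1))  # penalizes imprecisely estimated assets
```

In this toy example, assets A and D have the most extreme point predictions but also the largest standard errors, so the bound-based sort replaces them with the more precisely estimated B and E.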

preprint, 2022, arXiv

Equalized Focal Loss for Dense Long-Tailed Object Detection

Despite the recent success of long-tailed object detection, almost all long-tailed object detectors are developed based on the two-stage paradigm. In practice, one-stage detectors are more prevalent in the industry because they have a simple and fast pipeline that is easy to deploy. However, in the long-tailed scenario, this line of work has not been explored so far. In this paper, we investigate whether one-stage detectors can perform well in this case. We discover the primary obstacle that prevents one-stage detectors from achieving excellent performance is: categories suffer from different degrees of positive-negative imbalance problems under the long-tailed data distribution. The conventional focal loss balances the training process with the same modulating factor for all categories, thus failing to handle the long-tailed problem. To address this issue, we propose the Equalized Focal Loss (EFL) that rebalances the loss contribution of positive and negative samples of different categories independently according to their imbalance degrees. Specifically, EFL adopts a category-relevant modulating factor which can be adjusted dynamically by the training status of different categories. Extensive experiments conducted on the challenging LVIS v1 benchmark demonstrate the effectiveness of our proposed method. With an end-to-end training pipeline, EFL achieves 29.2% in terms of overall AP and obtains significant performance improvements on rare categories, surpassing all existing state-of-the-art methods. The code is available at https://github.com/ModelTC/EOD.
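To make the contrast in this abstract concrete, the sketch below shows the conventional binary focal loss with a single focusing parameter, and a variant in which each category supplies its own focusing parameter. This is only an illustration of a "category-relevant modulating factor": the per-category values in `gammas` are passed in by hand, and the rule EFL uses to derive them from the training status is not reproduced here. The function names are hypothetical.

```python
import math
from typing import Sequence


def focal_loss(p: float, y: int, gamma: float, alpha: float = 0.25) -> float:
    """Conventional binary focal loss for one prediction.

    p is the predicted probability that the category is present (0 < p < 1),
    y is the 0/1 label, gamma is the focusing parameter.
    """
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


def per_category_focal_loss(
    probs: Sequence[float],
    labels: Sequence[int],
    gammas: Sequence[float],
    alpha: float = 0.25,
) -> float:
    """Sum of focal losses where category j uses its own gammas[j].

    Assumption: gammas are given; how they would be adapted during training
    is outside this sketch.
    """
    return sum(focal_loss(p, y, g, alpha) for p, y, g in zip(probs, labels, gammas))


# One sample scored against three categories (made-up numbers).
probs = [0.9, 0.2, 0.05]
labels = [1, 0, 0]
print(per_category_focal_loss(probs, labels, gammas=[2.0, 2.0, 2.0]))  # shared factor
print(per_category_focal_loss(probs, labels, gammas=[2.0, 3.0, 4.0]))  # per-category
```

With `gamma = 0` the modulating factor is 1 and the loss reduces to alpha-weighted cross-entropy, which is a convenient sanity check.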

preprint, 2022, arXiv

High-Dimensional $L_2$Boosting: Rate of Convergence

Boosting is one of the most significant developments in machine learning. This paper studies the rate of convergence of $L_2$Boosting, which is tailored for regression, in a high-dimensional setting. Moreover, we introduce so-called "post-Boosting". This is a post-selection estimator which applies ordinary least squares to the variables selected in the first stage by $L_2$Boosting. Another variant is "Orthogonal Boosting", where after each step an orthogonal projection is conducted. We show that both post-$L_2$Boosting and the orthogonal boosting achieve the same rate of convergence as LASSO in a sparse, high-dimensional setting. We show that the rate of convergence of the classical $L_2$Boosting depends on the design matrix described by a sparse eigenvalue constant. To show the latter results, we derive new approximation results for the pure greedy algorithm, based on analyzing the revisiting behavior of $L_2$Boosting. We also introduce feasible rules for early stopping, which can be easily implemented and used in applied work. Our results also allow a direct comparison between LASSO and boosting which has been missing from the literature. Finally, we present simulation studies and applications to illustrate the relevance of our theoretical results and to provide insights into the practical aspects of boosting. In these simulation studies, post-$L_2$Boosting clearly outperforms LASSO.
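For readers unfamiliar with the procedure, the sketch below implements a generic componentwise $L_2$Boosting loop (the pure greedy algorithm when the step size is 1): at each step the current residual is regressed on every single column, and the column that reduces the residual sum of squares the most is updated. The function name, the fixed number of steps, and the optional step size `nu` are assumptions for illustration; the early-stopping rules from the paper are not reproduced. The returned set of selected columns is what a post-Boosting step would refit by ordinary least squares.

```python
from typing import List, Set, Tuple


def l2_boost(
    X: List[List[float]], y: List[float], steps: int, nu: float = 1.0
) -> Tuple[List[float], Set[int]]:
    """Componentwise L2Boosting on a design given as a list of rows.

    Assumes no column of X is identically zero.
    Returns (coefficients, indices of selected columns).
    """
    n, p = len(X), len(X[0])
    cols = [[X[i][j] for i in range(n)] for j in range(p)]
    norms = [sum(v * v for v in c) for c in cols]
    beta = [0.0] * p
    resid = list(y)
    selected: Set[int] = set()
    for _ in range(steps):
        # Least-squares coefficient of the current residual on each single column.
        coefs = [
            sum(cols[j][i] * resid[i] for i in range(n)) / norms[j] for j in range(p)
        ]
        # Fitting column j alone lowers the residual sum of squares by coef^2 * norm.
        j = max(range(p), key=lambda k: coefs[k] ** 2 * norms[k])
        beta[j] += nu * coefs[j]
        resid = [resid[i] - nu * coefs[j] * cols[j][i] for i in range(n)]
        selected.add(j)
    return beta, selected


# Toy orthogonal design: two steps recover the two nonzero coefficients.
X = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [0.0, 0.0, 0.0]]
y = [3.0, 0.0, -2.0, 0.0]
beta, selected = l2_boost(X, y, steps=2)
print(beta, selected)
```

With correlated columns the loop can revisit the same column several times, which is the "revisiting behavior" the abstract refers to.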

preprint, 2021, arXiv

Deblurring Processor for Motion-Blurred Faces Based on Generative Adversarial Networks

Low-quality face image restoration is a popular research direction in computer vision. It can serve as a preprocessing step for tasks such as face detection and face recognition. At present, much work addresses the problem of low-quality faces under various environmental conditions. This paper focuses on the restoration of motion-blurred faces. In increasingly abundant mobile scenes, the fast recovery of motion-blurred faces can bring substantial speed improvements in tasks such as face matching. To achieve this goal, a deblurring method for motion-blurred facial image signals based on generative adversarial networks (GANs) is proposed. It uses an end-to-end method to train a sharp image generator, i.e., a processor for motion-blurred facial images. This paper introduces progress in the processing of motion-blurred images, the development of GANs, and some basic concepts. It then gives the details of the network structure and the training optimization design of the image processor. We then conduct a motion-blur image generation experiment on a general facial data set, use the resulting pairs of blurred and sharp face images to train and test the processor GAN, and give some visual displays. Finally, MTCNN is used to detect faces in the images generated by the deblurring processor, and the results are compared with those for the blurred images. The results show that the deblurring processor significantly improves motion-blurred pictures, both visually and in terms of face detection evaluation metrics.

preprint, 2021, arXiv

L-SNet: from Region Localization to Scale Invariant Medical Image Segmentation

Coarse-to-fine models and cascade segmentation architectures are widely adopted to solve the problem of large scale variations in medical image segmentation. However, those methods have two primary limitations: the first-stage segmentation becomes a performance bottleneck; the lack of overall differentiability makes the training process of two stages asynchronous and inconsistent. In this paper, we propose a differentiable two-stage network architecture to tackle these problems. In the first stage, a localization network (L-Net) locates Regions of Interest (RoIs) in a detection fashion; in the second stage, a segmentation network (S-Net) performs fine segmentation on the recalibrated RoIs; a RoI recalibration module between L-Net and S-Net eliminates the inconsistencies. Experimental results on the public dataset show that our method outperforms state-of-the-art coarse-to-fine models with negligible computation overheads.

preprint, 2021, arXiv

Min-Max-Plus Neural Networks

We present a new model of neural networks called Min-Max-Plus Neural Networks (MMP-NNs) based on operations in tropical arithmetic. In general, an MMP-NN is composed of three types of alternately stacked layers, namely linear layers, min-plus layers and max-plus layers. Specifically, the latter two types of layers constitute the nonlinear part of the network which is trainable and more sophisticated compared to the nonlinear part of conventional neural networks. In addition, we show that with higher capability of nonlinearity expression, MMP-NNs are universal approximators of continuous functions, even when the number of multiplication operations is tremendously reduced (possibly to none in certain extreme cases). Furthermore, we formulate the backpropagation algorithm in the training process of MMP-NNs and introduce an algorithm of normalization to improve the rate of convergence in training.
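The three layer types named in this abstract can be sketched with the standard tropical matrix-vector products, in which addition replaces multiplication and min (or max) replaces summation. The function names and the tiny weights below are hypothetical, and the exact layer parameterization used in the paper may differ; this is only meant to show that the min-plus and max-plus layers use no multiplications.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def linear(x: Vector, W: Matrix, b: Vector) -> Vector:
    """Ordinary affine layer: y_i = sum_j W[i][j] * x_j + b_i."""
    return [sum(w * v for w, v in zip(row, x)) + bi for row, bi in zip(W, b)]


def min_plus(x: Vector, W: Matrix) -> Vector:
    """Min-plus layer: y_i = min_j (W[i][j] + x_j)."""
    return [min(w + v for w, v in zip(row, x)) for row in W]


def max_plus(x: Vector, W: Matrix) -> Vector:
    """Max-plus layer: y_i = max_j (W[i][j] + x_j)."""
    return [max(w + v for w, v in zip(row, x)) for row in W]


# Toy forward pass: linear -> min-plus -> max-plus (made-up weights).
x = [1.0, 4.0]
W = [[0.0, -5.0], [2.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]

h = linear(x, identity, [0.0, 0.0])
print(min_plus(h, W))               # [-1.0, 3.0]
print(max_plus(h, W))               # [1.0, 5.0]
print(max_plus(min_plus(h, W), W))  # [-1.0, 4.0]
```

Each output of a min-plus or max-plus layer is piecewise linear in the inputs, which is where the nonlinearity of the network comes from in this sketch.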

preprint, 2021, arXiv

Shape-Enforcing Operators for Point and Interval Estimators

A common problem in econometrics, statistics, and machine learning is to estimate and make inference on functions that satisfy shape restrictions. For example, distribution functions are nondecreasing and range between zero and one, height growth charts are nondecreasing in age, and production functions are nondecreasing and quasi-concave in input quantities. We propose a method to enforce these restrictions ex post on point and interval estimates of the target function by applying functional operators. If an operator satisfies certain properties that we make precise, the shape-enforced point estimates are closer to the target function than the original point estimates and the shape-enforced interval estimates have greater coverage and shorter length than the original interval estimates. We show that these properties hold for six different operators that cover commonly used shape restrictions in practice: range, convexity, monotonicity, monotone convexity, quasi-convexity, and monotone quasi-convexity. We illustrate the results with two empirical applications to the estimation of a height growth chart for infants in India and a production function for chemical firms in China.
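A minimal sketch of the ex post idea, using the distribution-function example from this abstract: estimates on an ordered grid are first clipped to [0, 1] (a range operator) and then sorted (a simple monotonicity-enforcing rearrangement), and the same operator is applied to the point estimate and to both interval endpoints. The function names and numbers are hypothetical, and treating sorting as the monotonicity operator is an assumption made here; the abstract does not say which monotonicity operator the paper studies.

```python
from typing import List, Tuple


def enforce_range(values: List[float], lo: float = 0.0, hi: float = 1.0) -> List[float]:
    """Clip every value into [lo, hi]."""
    return [min(max(v, lo), hi) for v in values]


def rearrange(values: List[float]) -> List[float]:
    """Make values nondecreasing along an ordered grid by sorting them."""
    return sorted(values)


def enforce_cdf_shape(
    point: List[float], lower: List[float], upper: List[float]
) -> Tuple[List[float], List[float], List[float]]:
    """Apply the same operator to the point estimate and both interval endpoints."""
    def op(f: List[float]) -> List[float]:
        return rearrange(enforce_range(f))
    return op(point), op(lower), op(upper)


# Made-up estimates of a distribution function on a 5-point grid.
point = [0.10, 0.35, 0.30, 0.80, 1.05]
lower = [-0.05, 0.20, 0.15, 0.65, 0.90]
upper = [0.25, 0.50, 0.45, 0.95, 1.20]

p, l, u = enforce_cdf_shape(point, lower, upper)
print(p)  # [0.1, 0.3, 0.35, 0.8, 1.0]
print(l)  # [0.0, 0.15, 0.2, 0.65, 0.9]
print(u)  # [0.25, 0.45, 0.5, 0.95, 1.0]
```

In this example the shape-enforced band is nondecreasing, stays inside [0, 1], and is no wider than the original band at any grid point.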

preprint, 2020, arXiv

Adaptive Discrete Smoothing for High-Dimensional and Nonlinear Panel Data

In this paper we develop a data-driven smoothing technique for high-dimensional and non-linear panel data models. We allow for individual specific (non-linear) functions and estimation with econometric or machine learning methods by using weighted observations from other individuals. The weights are determined in a data-driven way, depend on the similarity between the corresponding functions, and are measured based on initial estimates. The key feature of such a procedure is that it clusters individuals based on the distance / similarity between them, estimated in a first stage. Our estimation method can be combined with various statistical estimation procedures, in particular modern machine learning methods, which are especially fruitful in the high-dimensional case and with complex, heterogeneous data. The approach can be interpreted as a "soft-clustering" in comparison to traditional "hard clustering" that assigns each individual to exactly one group. We conduct a simulation study which shows that the prediction can be greatly improved by using our estimator. Finally, we analyze a big data set from didichuxing.com, a leading company in the transportation industry, to analyze and predict the gap between supply and demand based on a large set of covariates. Our estimator clearly performs much better in out-of-sample prediction compared to existing linear panel data estimators.

preprint, 2020, arXiv

QMCPACK: Advances in the development, efficiency, and application of auxiliary field and real-space variational and diffusion Quantum Monte Carlo

We review recent advances in the capabilities of the open source ab initio Quantum Monte Carlo (QMC) package QMCPACK and the workflow tool Nexus used for greater efficiency and reproducibility. The auxiliary field QMC (AFQMC) implementation has been greatly expanded to include k-point symmetries, tensor-hypercontraction, and accelerated graphical processing unit (GPU) support. These scaling and memory reductions greatly increase the number of orbitals that can practically be included in AFQMC calculations, increasing accuracy. Advances in real space methods include techniques for accurate computation of band gaps and for systematically improving the nodal surface of ground state wavefunctions. Results of these calculations can be used to validate application of more approximate electronic structure methods including GW and density functional based techniques. To provide an improved foundation for these calculations we utilize a new set of correlation-consistent effective core potentials (pseudopotentials) that are more accurate than previous sets; these can also be applied in quantum-chemical and other many-body applications, not only QMC. These advances increase the efficiency, accuracy, and range of properties that can be studied in both molecules and materials with QMC and QMCPACK.

preprint, 2020, arXiv

TurboRVB: a many-body toolkit for ab initio electronic simulations by quantum Monte Carlo

TurboRVB is a computational package for ab initio Quantum Monte Carlo (QMC) simulations of both molecular and bulk electronic systems. The code implements two types of well established QMC algorithms: Variational Monte Carlo (VMC), and Diffusion Monte Carlo in its robust and efficient lattice regularized variant. A key feature of the code is the possibility of using strongly correlated many-body wave functions. The electronic wave function (WF) is obtained by applying a Jastrow factor, which takes into account dynamical correlations, to the most general mean-field ground state, written either as an antisymmetrized geminal product with spin-singlet pairing, or as a Pfaffian, including both singlet and triplet correlations. This wave function can be viewed as an efficient implementation of the so-called resonating valence bond (RVB) ansatz, first proposed by L. Pauling and P. W. Anderson in quantum chemistry and condensed matter physics, respectively. The RVB ansatz implemented in TurboRVB has a large variational freedom, including the Jastrow correlated Slater determinant as its simplest, but nontrivial case. Moreover, it has the remarkable advantage of retaining an affordable computational cost, proportional to that of evaluating a single Slater determinant. The code implements the adjoint algorithmic differentiation that enables a very efficient evaluation of energy derivatives, comprising the ionic forces. Thus, one can perform structural optimizations and molecular dynamics in the canonical NVT ensemble at the VMC level. For the electronic part, a full WF optimization is made possible thanks to state-of-the-art stochastic algorithms for energy minimization. The code has been efficiently parallelized by using a hybrid MPI-OpenMP protocol, which is also an ideal environment for exploiting the computational power of modern GPU accelerators.

preprint, 2016, arXiv

Detecting "Smart" Spammers On Social Network: A Topic Model Approach

Spammer detection on social networks is a challenging problem. Rigid anti-spam rules have resulted in the emergence of "smart" spammers. They resemble legitimate users and are difficult to identify. In this paper, we present a novel spammer classification approach based on Latent Dirichlet Allocation (LDA), a topic model. Our approach extracts both the local and the global information of topic distribution patterns, which capture the essence of spamming. Tested on one benchmark dataset and one self-collected dataset, our proposed method outperforms other state-of-the-art methods in terms of averaged F1-score.

preprint, 2016, arXiv

Phase Stability of TiO$_2$ Polymorphs from Diffusion Quantum Monte Carlo

Titanium dioxide, TiO$_2$, has multiple applications in catalysis, energy conversion and memristive devices because of its electronic structure. Most of these applications utilize the naturally existing phases: rutile, anatase and brookite. Despite the simple form of TiO$_2$ and its wide uses, there is long-standing disagreement between theory and experiment on the energetic ordering of these phases that has never been resolved. We present the first analysis of phase stability at zero temperature using the highly accurate many-body fixed node diffusion Quantum Monte Carlo (QMC) method. We also include the effects of temperature by calculating the Helmholtz free energy including both internal energy and vibrational contributions from density functional perturbation theory based quasi harmonic phonon calculations. Our QMC calculations find that anatase is the most stable phase at zero temperature, consistent with many previous mean-field calculations. However, at elevated temperatures, rutile becomes the most stable phase. For all finite temperatures, brookite is always the least stable phase.

preprint, 2015, arXiv

Ab-initio molecular dynamics simulation of liquid water by Quantum Monte Carlo

Although liquid water is ubiquitous in chemical reactions at the roots of life and climate on the earth, the prediction of its properties by high-level ab initio molecular dynamics simulations still represents a formidable task for quantum chemistry. In this article we present a room temperature simulation of liquid water based on the potential energy surface obtained by a many-body wave function through quantum Monte Carlo (QMC) methods. The simulated properties are in good agreement with recent neutron scattering and X-ray experiments, particularly concerning the position of the oxygen-oxygen peak in the radial distribution function, at variance with previous Density Functional Theory attempts. Given the excellent performance of QMC on large scale supercomputers, this work opens new perspectives for predictive and reliable ab-initio simulations of complex chemical systems.

preprint, 2014, arXiv

Static and dynamical correlation in diradical molecules by Quantum Monte Carlo using the Jastrow Antisymmetrized Geminal Power ansatz

Diradical molecules are essential species involved in many organic and inorganic chemical reactions. The computational study of their electronic structure is often challenging, because a reliable description of the correlation, and in particular of the static one, requires multi-reference techniques. The Jastrow correlated Antisymmetrized Geminal Power (JAGP) is a compact and efficient wave function ansatz, based on the valence-bond representation, which can be used within Quantum Monte Carlo (QMC) approaches. The AGP part can be rewritten in terms of molecular orbitals, obtaining a multi-determinant expansion with zero-seniority number. In the present work we demonstrate the capability of the JAGP ansatz to correctly describe the electronic structure of two diradical prototypes: the orthogonally twisted ethylene, C2H4, and the methylene, CH2, representing respectively a homosymmetric and heterosymmetric system. On the other hand, we show that the simple ansatz of a Jastrow correlated Single Determinant (JSD) wave function is unable to provide an accurate description of the electronic structure in these diradical molecules, both at variational level and, more remarkably, in the fixed-nodes projection schemes showing that a poor description of the static correlation yields an inaccurate nodal surface. The suitability of JAGP to correctly describe diradicals with a computational cost comparable with that of a JSD calculation, in combination with a favorable scalability of QMC algorithms with the system size, opens new perspectives in the ab initio study of large diradical systems, like the transition states in cycloaddition reactions and the thermal isomerization of biological chromophores.

preprint, 2013, arXiv

Molecular properties by Quantum Monte Carlo: an investigation on the role of the wave function ansatz and the basis set in the water molecule

Quantum Monte Carlo methods are accurate and promising many-body techniques for electronic structure calculations which, in recent years, have attracted growing interest thanks to their favorable scaling with the system size and their efficient parallelization, particularly suited for modern high performance computing facilities. The ansatz of the wave function and its variational flexibility are crucial points for both the accurate description of molecular properties and the capabilities of the method to tackle large systems. In this paper, we extensively analyze, using different variational ansatzes, several properties of the water molecule, namely: the total energy, the dipole and quadrupole moments, the ionization and atomization energies, the equilibrium configuration, and the harmonic and fundamental frequencies of vibration. The investigation mainly focuses on variational Monte Carlo calculations, although several lattice regularized diffusion Monte Carlo calculations are also reported. Through a systematic study, we provide a useful guide to the choice of the wave function, the pseudopotential, and the basis set for QMC calculations. We also introduce a new strategy for the definition of the atomic orbitals involved in the Jastrow-Antisymmetrised Geminal Power wave function, in order to drastically reduce the number of variational parameters. This scheme significantly improves the efficiency of QMC energy minimization in the case of large basis sets.

preprint, 2013, arXiv

Tropical Convexity and Canonical Projections

Using a potential theory on metric graphs "Gamma", we introduce the notion of tropical convexity to the space "RDiv^d(Gamma)" of effective R-divisors of degree d on "Gamma" and show that a natural metric can be defined on "RDiv^d(Gamma)". In addition, we extend the notion of reduced divisors which is conventionally defined in a complete linear system |D| with respect to a single point in "Gamma". In our general setting, a reduced divisor is defined uniquely as an R-divisor in a compact tropical convex subset "T" of "RDiv^d(Gamma)" with respect to a certain R-divisor "E" of the same degree d. In this sense, we consider reduced divisors as canonical projections onto "T". We also investigate some basic properties of tropical convex sets using techniques developed from general reduced divisors.

preprint, 2012, arXiv

Ultra-compact On-Chip Plasmonic Light Concentrator

We present a novel approach for achieving a tightly concentrated optical field by a hybrid photonic-plasmonic device in an integrated platform, which is a triangle-shaped metal taper mounted on top of a dielectric waveguide. This device, which we call a plasmonic light concentrator (PLC), can simultaneously achieve side-coupling of light energy from the dielectric waveguide to the plasmonic region and light focusing into the apex of the metal taper (at the scale of ~10 nm). For demonstration, we numerically investigate a PLC, which is a metal (Au) taper on a dielectric (Si3N4) waveguide at working wavelengths around 800 nm. We show that three major effects (mode beat, nanofocusing, and weak resonance) interplay to generate this light concentration phenomenon and govern the performance of the device. By coordinating these effects, the PLC can be designed to be super compact while maintaining high efficiency over a wide band. In particular, we demonstrate that under optimized size parameters and wavelength a field concentration factor (FCF), which is the ratio of the norm of the electric field at the apex over the average norm of the electric field in the inputting waveguide, of about 13 can be achieved with the length of the device less than 1 µm for a moderate tip radius of 20 nm. Moreover, we show that an FCF of 5-10 is achievable over a wavelength range of 700-1100 nm when the length of the device is further reduced to about 400 nm.

preprint, 2009, arXiv

Rank-determining sets of metric graphs

A metric graph is a geometric realization of a finite graph by identifying each edge with a real interval. A divisor on a metric graph $Γ$ is an element of the free abelian group on $Γ$. The rank of a divisor on a metric graph is a concept appearing in the Riemann-Roch theorem for metric graphs (or tropical curves) due to Gathmann and Kerber, and Mikhalkin and Zharkov. We define a rank-determining set of a metric graph $Γ$ to be a subset $A$ of $Γ$ such that the rank of a divisor $D$ on $Γ$ is always equal to the rank of $D$ restricted on $A$. We show constructively in this paper that there exist finite rank-determining sets. In addition, we investigate the properties of rank-determining sets in general and formulate a criterion for rank-determining sets. Our analysis is based on an algorithm to derive the $v_0$-reduced divisor from any effective divisor in the same linear system.

