Researcher profile

Oleg Smirnov

Oleg Smirnov contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
20works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

20 published item(s)

preprint2026arXiv

Towards Unified Approaches in Self-Supervised Event Stream Modeling: Progress and Prospects

The proliferation of digital interactions across diverse domains, such as healthcare, e-commerce, gaming, and finance, has resulted in the generation of vast volumes of event stream (ES) data. ES data comprises continuous sequences of timestamped events that encapsulate detailed contextual information relevant to each domain. While ES data holds significant potential for extracting actionable insights and enhancing decision-making, its effective utilization is hindered by challenges such as the scarcity of labeled data and the fragmented nature of existing research efforts. Self-Supervised Learning (SSL) has emerged as a promising paradigm to address these challenges by enabling the extraction of meaningful representations from unlabeled ES data. In this survey, we systematically review and synthesize SSL methodologies tailored for ES modeling across multiple domains, bridging the gaps between domain-specific approaches that have traditionally operated in isolation. We present a comprehensive taxonomy of SSL techniques, encompassing both predictive and contrastive paradigms, and analyze their applicability and effectiveness within different application contexts. Furthermore, we identify critical gaps in current research and propose a future research agenda aimed at developing scalable, domain-agnostic SSL frameworks for ES modeling. By unifying disparate research efforts and highlighting cross-domain synergies, this survey aims to accelerate innovation, improve reproducibility, and expand the applicability of SSL to diverse real-world ES challenges.

preprint2022arXiv

IntelliTC: Automating Type Changes in IntelliJ IDEA

Developers often change types of program elements. Such refactoring often involves updating not only the type of the element itself, but also the API of all type-dependent references in the code, thus it is tedious and time-consuming. Despite type changes being more frequent than renamings, just a few current IDE tools provide partially-automated support only for a small set of hard-coded types. Researchers have recently proposed a data-driven approach to inferring API rewrite rules for type change patterns in Java using code commits history. In this paper, we build upon these recent advances and introduce IntelliTC - a tool to perform Java type change refactoring. We implemented it as a plugin for IntelliJ IDEA, a popular Java IDE developed by JetBrains. We present 3 different ways of providing support for such a refactoring from the standpoint of the user experience: Classic mode, Suggested Refactoring, and Inspection mode. To evaluate these modalities of using IntelliTC, we surveyed 22 experienced software developers. They positively rated the usefulness of the tool. The source code and distribution of the plugin are available on GitHub: https://github.com/JetBrains-Research/data-driven-type-migration. A demonstration video is on YouTube: https://youtu.be/pdcfvADA1PY.

preprint2022arXiv

LADUMA: Discovery of a luminous OH megamaser at $z > 0.5$

In the local Universe, OH megamasers (OHMs) are detected almost exclusively in infrared-luminous galaxies, with a prevalence that increases with IR luminosity, suggesting that they trace gas-rich galaxy mergers. Given the proximity of the rest frequencies of OH and the hyperfine transition of neutral atomic hydrogen (HI), radio surveys to probe the cosmic evolution of HI in galaxies also offer exciting prospects for exploiting OHMs to probe the cosmic history of gas-rich mergers. Using observations for the Looking At the Distant Universe with the MeerKAT Array (LADUMA) deep HI survey, we report the first untargeted detection of an OHM at $z > 0.5$, LADUMA J033046.20$-$275518.1 (nicknamed "Nkalakatha"). The host system, WISEA J033046.26$-$275518.3, is an infrared-luminous radio galaxy whose optical redshift $z \approx 0.52$ confirms the MeerKAT emission line detection as OH at a redshift $z_{\rm OH} = 0.5225 \pm 0.0001$ rather than HI at lower redshift. The detected spectral line has 18.4$σ$ peak significance, a width of $459 \pm 59\,{\rm km\,s^{-1}}$, and an integrated luminosity of $(6.31 \pm 0.18\,{\rm [statistical]}\,\pm 0.31\,{\rm [systematic]}) \times 10^3\,L_\odot$, placing it among the most luminous OHMs known. The galaxy's far-infrared luminosity $L_{\rm FIR} = (1.576 \pm 0.013) \times 10^{12}\,L_\odot$ marks it as an ultra-luminous infrared galaxy; its ratio of OH and infrared luminosities is similar to those for lower-redshift OHMs. A comparison between optical and OH redshifts offers a slight indication of an OH outflow. This detection represents the first step towards a systematic exploitation of OHMs as a tracer of galaxy growth at high redshifts.

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

MeqSilhouette v2: Spectrally-resolved polarimetric synthetic data generation for the Event Horizon Telescope

We present MeqSilhouette v2.0 (MeqSv2), a fully polarimetric, time-and frequency-resolved synthetic data generation software for simulating millimetre (mm) wavelength very long baseline interferometry (VLBI) observations with heterogeneous arrays. Synthetic data are a critical component in understanding real observations, testing calibration and imaging algorithms, and predicting performance metrics of existing or proposed sites. MeqSv2 applies physics-based instrumental and atmospheric signal corruptions constrained by empirically-derived site and station parameters to the data. The new version is capable of applying instrumental polarization effects and various other spectrally-resolved effects using the Radio Interferometry Measurement Equation (RIME) formalism and produces synthetic data compatible with calibration pipelines designed to process real data. We demonstrate the various corruption capabilities of MeqSv2 using different arrays, with a focus on the effect of complex bandpass gains on closure quantities for the EHT at 230 GHz. We validate the frequency-dependent polarization leakage implementation by performing polarization self-calibration of synthetic EHT data using PolSolve. We also note the potential applications for cm-wavelength VLBI array analysis and design and future directions.

preprint2022arXiv

Performance evaluation of baseline-dependent averaging based onfull-scale SKA1-LOW simulation

The Square Kilometre Array (SKA) is the largest radio interferometer under construction in the world. Its immense amount of visibility data poses a considerable challenge to the subsequent processing by the science data processor (SDP). Baseline dependent averaging (BDA), which reduces the amount of visibility data based on the baseline distribution of the radio interferometer, has become a focus of SKA SDP development. This paper developed and implemented a full-featured BDA module based on Radio Astronomy Simulation, Calibration and Imaging Library (RASCIL). Simulated observations were then performed with RASCIL based on a full-scale SKA1-LOW configuration. The performance of the BDA was systematically investigated and evaluated based on the simulated data. The experimental results presented that the amount of visibility data is reduced by about 50\% to 85\% for different time intervals ($Δt_{max}$). In addition, different $Δt_{max}$ have a significant effect on the imaging quality. The smaller the $Δt_{max}$, the smaller the degradation of the imaging quality.

preprint2022arXiv

RFI Identification Based On Deep-Learning]{A Robust RFI Identification For Radio Interferometry based on a Convolutional Neural Network

The rapid development of new generation radio interferometers such as the Square Kilometer Array (SKA) has opened up unprecedented opportunities for astronomical research. However, anthropogenic Radio Frequency Interference (RFI) from communication technologies and other human activities severely affects the fidelity of observational data. It also significantly reduces the sensitivity of the telescopes. We proposed a robust Convolutional Neural Network (CNN) model to identify RFI based on machine learning methods. We overlaid RFI on the simulation data of SKA1-LOW to construct three visibility function datasets. One dataset was used for modeling, and the other two were used for validating the model's usability. The experimental results show that the Area Under the Curve (AUC) reaches 0.93, with satisfactory accuracy and precision. We then further investigated the effectiveness of the model by identifying the RFI in the actual observational data from LOFAR and MeerKAT. The results show that the model performs well. The overall effectiveness is comparable to AOFlagger software and provides an improvement over existing methods in some instances.

preprint2022arXiv

WS-Snapshot: An effective algorithm for wide-field and large-scale imaging

The Square Kilometre Array (SKA) is the largest radio interferometer under construction in the world. The high accuracy, wide-field and large size imaging significantly challenge the construction of the Science Data Processor (SDP) of SKA. We propose a hybrid imaging method based on improved W-Stacking and snapshots. The w range is reduced by fitting the snapshot $uv$ plane, thus effectively enhancing the performance of the improved W-Stacking algorithm. We present a detailed implementation of WS-Snapshot. With full-scale SKA1-LOW simulations, we present the imaging performance and imaging quality results for different parameter cases. The results show that the WS-Snapshot method enables more efficient distributed processing and significantly reduces the computational time overhead within an acceptable accuracy range, which would be crucial for subsequent SKA science studies.

preprint2021arXiv

Comparison of classical and Bayesian imaging in radio interferometry

CLEAN, the commonly employed imaging algorithm in radio interferometry, suffers from a number of shortcomings: in its basic version it does not have the concept of diffuse flux, and the common practice of convolving the CLEAN components with the CLEAN beam erases the potential for super-resolution; it does not output uncertainty information; it produces images with unphysical negative flux regions; and its results are highly dependent on the so-called weighting scheme as well as on any human choice of CLEAN masks to guiding the imaging. Here, we present the Bayesian imaging algorithm resolve which solves the above problems and naturally leads to super-resolution. We take a VLA observation of Cygnus~A at four different frequencies and image it with single-scale CLEAN, multi-scale CLEAN and resolve. Alongside the sky brightness distribution resolve estimates a baseline-dependent correction function for the noise budget, the Bayesian equivalent of weighting schemes. We report noise correction factors between 0.4 and 429. The enhancements achieved by resolve come at the cost of higher computational effort.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2021arXiv

Xova: Baseline-Dependent Time and Channel Averaging for Radio Interferometry

Xova is a software package that implements baseline-dependent time and channel averaging on Measurement Set data. The uv-samples along a baseline track are aggregated into a bin until a specified decorrelation tolerance is exceeded. The degree of decorrelation in the bin correspondingly determines the amount of channel and timeslot averaging that is suitable for samples in the bin. This necessarily implies that the number of channels and timeslots varies per bin and the output data loses the rectilinear input shape of the input data.

preprint2020arXiv

A probabilistic approach to phase calibration: I. Effects of source structure on fringe-fitting

We propose a probabilistic framework for performing simultaneous estimation of source structure and fringe-fitting parameters in Very Long Baseline Interferometry (VLBI) observations. As a first step, we demonstrate this technique through the analysis of synthetic short-duration Event Horizon Telescope (EHT) observations of various geometric source models at 230 GHz, in the presence of baseline-dependent thermal noise. We perform Bayesian parameter estimation and model selection between the different source models to obtain reliable uncertainty estimates and correlations between various source and fringe-fitting related model parameters. We also compare the Bayesian posteriors with those obtained using widely-used VLBI data reduction packages such as CASA and AIPS, by fringe-fitting 200 Monte Carlo simulations of each source model with different noise realisations, to obtain distributions of the Maximum A Posteriori (MAP) estimates. We find that, in the presence of resolved asymmetric source structure and a given array geometry, the traditional practice of fringe-fitting with a point source model yields appreciable offsets in the estimated phase residuals, potentially biasing or limiting the dynamic range of the starting model used for self-calibration. Simultaneously estimating the source structure earlier in the calibration process with formal uncertainties improves the precision and accuracy of fringe-fitting and establishes the potential of the available data especially when there is little prior information. We also note the potential applications of this method to astrometry and geodesy for specific science cases and the planned improvements to the computational performance and analyses of more complex source distributions.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO&#39;s features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO&#39;s potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.

preprint2015arXiv

Weak Lensing Simulations for the SKA

Weak gravitational lensing measurements are traditionally made at optical wavelengths where many highly resolved galaxy images are readily available. However, the Square Kilometre Array (SKA) holds great promise for this type of measurement at radio wavelengths owing to its greatly increased sensitivity and resolution over typical radio surveys. The key to successful weak lensing experiments is in measuring the shapes of detected sources to high accuracy. In this document we describe a simulation pipeline designed to simulate radio images of the quality required for weak lensing, and will be typical of SKA observations. We provide as input, images with realistic galaxy shapes which are then simulated to produce images as they would have been observed with a given radio interferometer. We exploit this pipeline to investigate various stages of a weak lensing experiment in order to better understand the effects that may impact shape measurement. We first show how the proposed SKA1-Mid array configurations perform when we compare the (known) input and output ellipticities. We then investigate how making small changes to these array configurations impact on this input-outut ellipticity comparison. We also demonstrate how alternative configurations for SKA1-Mid that are smaller in extent, and with a faster survey speeds produce similar performance to those originally proposed. We then show how a notional SKA configuration performs in the same shape measurement challenge. Finally, we describe ongoing efforts to utilise our simulation pipeline to address questions relating to how applicable current (mostly originating from optical data) shape measurement techniques are to future radio surveys. As an alternative to such image plane techniques, we lastly discuss a shape measurement technique based on the shapelets formalism that reconstructs the source shapes directly from the visibility data.

preprint2014arXiv

A JVLA 10~degree^2 deep survey

(Abridged)One of the fundamental challenges for astrophysics in the 21st century is finding a way to untangle the physical processes that govern galaxy formation and evolution. Given the importance and scope of this problem, the multi-wavelength astronomical community has used the past decade to build up a wealth of information over specific extragalactic deep fields to address key questions in galaxy formation and evolution. These fields generally cover at least 10square degrees to facilitate the investigation of the rarest, typically most massive, galaxies and AGN. Furthermore, such areal coverage allows the environments to be fully accounted for, thereby linking the single halo to the two-halo terms in the halo occupation distribution. Surveys at radio wavelengths have begun to lag behind those at other wavelengths, especially in this medium-deep survey tier. However, the survey speed offered by the JVLA means that we can now reach a point where we can begin to obtain commensurate data at radio wavelengths to those which already exists from the X-ray through to the far-infrared over ~10 square degrees. We therefore present the case for a 10 square degree survey to 1.5uJy at L-band in A or B Array, requiring ~4000 hours to provide census of star-formation and AGN-accretion activity in the Universe. For example, the observations will allow galaxies forming stars at 10Msolar/yr to be detected out to z~1 and luminous infrared galaxies (1000Msolar/yr to be found out to z~6. Furthermore, the survey area ensures that we will have enough cosmic volume to find these rare sources at all epochs. The bandwidth will allow us to determine the polarisation properties galaxies in the high-redshift Universe as a function of stellar mass, morphology and redshift.

preprint2014arXiv

Morphological classification of radio sources for galaxy evolution and cosmology with SKA-MID

Morphologically classifying radio sources in continuum images with the SKA has the potential to address some of the key questions in cosmology and galaxy evolution. In particular, we may use different classes of radio sources as independent tracers of the dark-matter density field, and thus overcome cosmic variance in measuring large-scale structure, while on the galaxy evolution side we could measure the mechanical feedback from FRII and FRI jets. This work makes use of a \texttt{MeqTrees}-based simulations framework to forecast the ability of the SKA to recover true source morphologies at high redshifts. A suite of high resolution images containing realistic continuum source distributions with different morphologies (FRI, FRII, starburst galaxies) is fed through an SKA Phase 1 simulator, then analysed to determine the sensitivity limits at which the morphologies can still be distinguished. We also explore how changing the antenna distribution affects these results.

preprint2014arXiv

Non-thermal emission from galaxy clusters: feasibility study with SKA1

Galaxy clusters are known to host a variety of extended radio sources: tailed radio galaxies whose shape is modelled by the interaction with the intra-cluster medium (ICM); radio bubbles filling cavities in the ICM distribution and rising buoyantly through the thermal gas; diffuse giant radio sources (&#34;halos&#34; and &#34;relics&#34;) revealing the presence of relativistic electrons and magnetic fields in the intra-cluster volume. It is currently the subject of an active debate how the non-thermal components that we observe at radio wavelengths affect the physical properties of the ICM and depend on the dynamical state of galaxy clusters. In this work we start our SKA1 feasibility study of the &#34;radio cluster zoo&#34; through simulations of a typical radio-loud cluster, hosting several bright tailed radio galaxies and a diffuse radio halo. Realistic simulations of SKA1 observations are obtained through the MeqTrees software. A new deconvolution algorithm, based on sparse representations and optimised for the detection of faint diffuse astronomical sources, is tested and compared to the classical CLEAN method.

preprint2014arXiv

Very Long Baseline Interferometry with the SKA

Adding VLBI capability to the SKA arrays will greatly broaden the science of the SKA, and is feasible within the current specifications. SKA-VLBI can be initially implemented by providing phased-array outputs for SKA1-MID and SKA1-SUR and using these extremely sensitive stations with other radio telescopes, and in SKA2 by realising a distributed configuration providing baselines up to thousands of km, merging it with existing VLBI networks. The motivation for and the possible realization of SKA-VLBI is described in this paper.

preprint2009arXiv

Nuclear physics for geo-neutrino studies

Geo-neutrino studies are based on theoretical estimates of geo-neutrino spectra. We propose a method for a direct measurement of the energy distribution of antineutrinos from decays of long-lived radioactive isotopes. We present preliminary results for the geo-neutrinos from Bi-214 decay, a process which accounts for about one half of the total geo-neutrino signal. The feeding probability of the lowest state of Bi-214 - the most important for geo-neutrino signal - is found to be p_0 = 0.177 \pm 0.004 (stat) ^{+0.003}_{-0.001} (sys), under the hypothesis of Universal Neutrino Spectrum Shape (UNSS). This value is consistent with the (indirect) estimate of the Table of Isotopes (ToI). We show that achievable larger statistics and reduction of systematics should allow to test possible distortions of the neutrino spectrum from that predicted using the UNSS hypothesis. Implications on the geo-neutrino signal are discussed.