Researcher profile

Oleg Vasilyev

Oleg Vasilyev contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2022arXiv

Consistency and Coherence from Points of Contextual Similarity

Factual consistency is one of important summary evaluation dimensions, especially as summary generation becomes more fluent and coherent. The ESTIME measure, recently proposed specifically for factual consistency, achieves high correlations with human expert scores both for consistency and fluency, while in principle being restricted to evaluating such text-summary pairs that have high dictionary overlap. This is not a problem for current styles of summarization, but it may become an obstacle for future summarization systems, or for evaluating arbitrary claims against the text. In this work we generalize the method, and make a variant of the measure applicable to any text-summary pairs. As ESTIME uses points of contextual similarity, it provides insights into usefulness of information taken from different BERT layers. We observe that useful information exists in almost all of the layers except the several lowest ones. For consistency and fluency - qualities focused on local text details - the most useful layers are close to the top (but not at the top); for coherence and relevance we found a more complicated and interesting picture.

preprint2020arXiv

Is human scoring the best criteria for summary evaluation?

Normally, summary quality measures are compared with quality scores produced by human annotators. A higher correlation with human scores is considered to be a fair indicator of a better measure. We discuss observations that cast doubt on this view. We attempt to show a possibility of an alternative indicator. Given a family of measures, we explore a criterion of selecting the best measure not relying on correlations with human scores. Our observations for the BLANC family of measures suggest that the criterion is universal across very different styles of summaries.

preprint2020arXiv

Zero-shot topic generation

We present an approach to generating topics using a model trained only for document title generation, with zero examples of topics given during training. We leverage features that capture the relevance of a candidate span in a document for the generation of a title for that document. The output is a weighted collection of the phrases that are most relevant for describing the document and distinguishing it within a corpus, without requiring access to the rest of the corpus. We conducted a double-blind trial in which human annotators scored the quality of our machine-generated topics along with original human-written topics associated with news articles from The Guardian and The Huffington Post. The results show that our zero-shot model generates topic labels for news documents that are on average equal to or higher quality than those written by humans, as judged by humans.

preprint2019arXiv

Tracer Diffusion on a Crowded Random Manhattan Lattice

We study by extensive numerical simulations the dynamics of a hard-core tracer particle (TP) in presence of two competing types of disorder - frozen convection flows on a square random Manhattan lattice and a crowded dynamical environment formed by a lattice gas of mobile hard-core particles. The latter perform lattice random walks, constrained by a single-occupancy condition of each lattice site, and are either insensitive to random flows (model A) or choose the jump directions as dictated by the local directionality of bonds of the random Manhattan lattice (model B). We focus on the TP disorder-averaged mean-squared displacement, (which shows a super-diffusive behaviour $\sim t^{4/3}$, $t$ being time, in all the cases studied here), on higher moments of the TP displacement, and on the probability distribution of the TP position $X$ along the $x$-axis. Our analysis evidences that in absence of the lattice gas particles the latter has a Gaussian central part $\sim \exp(- u^2)$, where $u = X/t^{2/3}$, and exhibits slower-than-Gaussian tails $\sim \exp(-|u|^{4/3})$ for sufficiently large $t$ and $u$. Numerical data convincingly demonstrate that in presence of a crowded environment the central Gaussian part and non-Gaussian tails of the distribution persist for both models.

preprint2010arXiv

Ballistic deposition patterns beneath a growing KPZ interface

We consider a (1+1)-dimensional ballistic deposition process with next-nearest neighbor interaction, which belongs to the KPZ universality class, and introduce for this discrete model a variational formulation similar to that for the randomly forced continuous Burgers equation. This allows to identify the characteristic structures in the bulk of a growing aggregate ("clusters" and "crevices") with minimizers and shocks in the Burgers turbulence, and to introduce a new kind of equipped Airy process for ballistic growth. We dub it the "hairy Airy process" and investigate its statistics numerically. We also identify scaling laws that characterize the ballistic deposition patterns in the bulk: the law of "thinning" of the forest of clusters with increasing height, the law of transversal fluctuations of cluster boundaries, and the size distribution of clusters. The corresponding critical exponents are determined exactly based on the analogy with the Burgers turbulence and simple scaling considerations.