Researcher profile

Lior Shamir

Lior Shamir contributes to research discovery and scholarly infrastructure.

Trust snapshot

Quick read

Trust 21 (Emerging), Verification L1, unclaimed author
16 works, 0 followers, 7 topics, 4 close collaborators

Published work

16 published item(s)

preprint, 2025, arXiv

CornViT: A Multi-Stage Convolutional Vision Transformer Framework for Hierarchical Corn Kernel Analysis

Accurate grading of corn kernels is critical for seed certification, directional seeding, and breeding, yet it is still predominantly performed by manual inspection. This work introduces CornViT, a three-stage Convolutional Vision Transformer (CvT) framework that emulates the hierarchical reasoning of human seed analysts for single-kernel evaluation. Three sequential CvT-13 classifiers operate on 384x384 RGB images: Stage 1 distinguishes pure from impure kernels; Stage 2 categorizes pure kernels into flat and round morphologies; and Stage 3 determines the embryo orientation (up vs. down) for pure, flat kernels. Starting from a public corn seed image collection, we manually relabeled and filtered images to construct three stage-specific datasets: 7265 kernels for purity, 3859 pure kernels for morphology, and 1960 pure-flat kernels for embryo orientation, all released as benchmarks. Head-only fine-tuning of ImageNet-22k pretrained CvT-13 backbones yields test accuracies of 93.76% for purity, 94.11% for shape, and 91.12% for embryo-orientation detection. Under identical training conditions, ResNet-50 reaches only 76.56 to 81.02 percent, whereas DenseNet-121 attains 86.56 to 89.38 percent accuracy. These results highlight the advantages of convolution-augmented self-attention for kernel analysis. To facilitate adoption, we deploy CornViT in a Flask-based web application that performs stage-wise inference and exposes interpretable outputs through a browser interface. Together, the CornViT framework, curated datasets, and web application provide a deployable solution for automated corn kernel quality assessment in seed quality workflows. Source code and data are publicly available.
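
The stage-wise flow described in this abstract can be sketched as a simple cascade. This is a minimal illustration only: the function `grade_kernel`, the label strings, and the lambda stand-ins are hypothetical names introduced here, and the real stages would be the fine-tuned CvT-13 classifiers, which are not reproduced.

```python
from typing import Callable, Dict, Optional

# A stage classifier maps an image (any object in this sketch) to a label string.
Classifier = Callable[[object], str]


def grade_kernel(
    image: object,
    purity: Classifier,
    shape: Classifier,
    embryo: Classifier,
) -> Dict[str, Optional[str]]:
    """Run the three stages in order, stopping once a later stage does not apply."""
    result: Dict[str, Optional[str]] = {"purity": None, "shape": None, "embryo": None}

    # Stage 1: pure vs. impure. Impure kernels are not analyzed further.
    result["purity"] = purity(image)
    if result["purity"] != "pure":
        return result

    # Stage 2: flat vs. round, applied to pure kernels only.
    result["shape"] = shape(image)
    if result["shape"] != "flat":
        return result

    # Stage 3: embryo orientation (up vs. down), applied to pure, flat kernels only.
    result["embryo"] = embryo(image)
    return result


# Hypothetical stand-ins for the three trained models.
demo = grade_kernel("kernel.png", lambda img: "pure", lambda img: "flat", lambda img: "up")
impure = grade_kernel("kernel.png", lambda img: "impure", lambda img: "flat", lambda img: "up")
```

Returning `None` for stages that were skipped keeps the output shape fixed, which is one convenient choice for a web front end; other designs are equally plausible.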

preprint, 2022, arXiv

A Possible Large-scale Alignment of Galaxy Spin Directions -- Analysis of 10 Datasets from SDSS, Pan-STARRS, and HST

Multiple observations made by several different telescopes have shown asymmetry between the number of spiral galaxies rotating in opposite directions in different parts of the sky. One of the immediate questions regarding the possible asymmetry of the spin directions is whether the distribution forms a cosmological-scale axis. This paper analyzes and compares 10 different datasets published in the past decade, collected by SDSS, Pan-STARRS, and Hubble Space Telescope. The datasets contain spiral galaxies separated by their spin direction, and the distribution can show dipole axes. The analysis shows that the directions of the most probable dipole axes are consistent in datasets that have similar average redshift, but different between datasets that have different average redshift. The analysis also shows that the location of the most probable axis correlates with the average redshift of the galaxies in the datasets. That is, the location of the most probable axis shifts when the redshift gets higher, and the correlation is statistically significant. This provides a certain indication of a drift in a possible axis formed by the distribution of galaxy spin directions, or a cosmological scale structure that peaks at a certain distance from Earth.

preprint, 2022, arXiv

Analysis of $\sim10^6$ spiral galaxies from four telescopes shows large-scale patterns of asymmetry in galaxy spin directions

The ability to collect unprecedented amounts of astronomical data has enabled the study of scientific questions that were impractical to study in the pre-information era. This study uses large datasets collected by four different robotic telescopes to profile the large-scale distribution of the spin directions of spiral galaxies. These datasets cover the Northern and Southern hemispheres, in addition to data acquired from space by the Hubble Space Telescope. The data were annotated automatically by a fully symmetric algorithm, as well as manually through a long labor-intensive process, leading to a dataset of nearly $10^6$ galaxies. The data show possible patterns of asymmetric distribution of the spin directions, and the patterns agree between the different telescopes. The profiles also agree when using automatic or manual annotation of the galaxies, showing very similar large-scale patterns. Combining all data from all telescopes allows the most comprehensive analysis of its kind to date in terms of both the number of galaxies and the footprint size. The results show a statistically significant profile that is consistent across all telescopes. The instruments used in this study are DECam, HST, SDSS, and Pan-STARRS. The paper also discusses possible sources of bias, and analyzes the design of previous work that showed different results. Further research will be required to understand and validate these preliminary observations.

preprint, 2022, arXiv

Analysis of spin directions of galaxies in the DESI Legacy Survey

The DESI Legacy Survey is a digital sky survey with a large footprint compared to other Earth-based surveys, covering both the Northern and Southern hemispheres. This paper shows the distribution of the spin directions of spiral galaxies imaged by the DESI Legacy Survey. A simple analysis of dividing nearly $1.3\cdot10^6$ spiral galaxies into two hemispheres shows a higher number of galaxies spinning counterclockwise in the Northern hemisphere, and a higher number of galaxies spinning clockwise in the Southern hemisphere. That distribution is consistent with previous observations, but uses a far larger number of galaxies and a larger footprint. The larger footprint allows a comprehensive analysis without the need to fit the distribution into an a priori model, making this study different from all previous analyses of this kind. Fitting the spin directions of the galaxies to cosine dependence shows a dipole axis alignment with a probability of $P<10^{-5}$. The analysis is done with a trivial selection of the galaxies, as well as a simple explainable annotation algorithm that does not make use of any form of machine learning, deep learning, or pattern recognition. While further work will be required, these results are aligned with previous studies suggesting the possibility of a large-scale alignment of galaxy angular momentum.

preprint, 2022, arXiv

Asymmetry in galaxy spin directions -- analysis of data from DES and comparison to four other sky surveys

The paper shows an analysis of the large-scale distribution of galaxy spin directions of 739,286 galaxies imaged by DES. The distribution of the spin directions of the galaxies exhibits a large-scale dipole axis. Comparison of the location of the dipole axis to a similar analysis with data from SDSS, Pan-STARRS, and DESI Legacy Survey shows that all sky surveys exhibit dipole axes within 52$^\circ$ or less from each other, well within 1$\sigma$ error. While non-random distribution is unexpected, the findings are consistent across all sky surveys, regardless of the telescope or whether the data were annotated manually or automatically. Possible errors that can lead to the observation are discussed. The paper also discusses previous studies showing opposite conclusions, and analyzes the decisions that led to these results. Although the observation is provocative, and further research will be required, the existing evidence justifies considering the contention that galaxy spin directions as observed from Earth are not necessarily randomly distributed. Possible explanations can be related to mature cosmological theories, but also to the internal structure of galaxies.

preprint, 2022, arXiv

Large-scale asymmetry in galaxy spin directions -- analysis of galaxies with spectra in DES, SDSS, and DESI Legacy Survey

Multiple previous studies using several different probes have shown considerable evidence for the existence of cosmological-scale anisotropy and a Hubble-scale axis. One of the probes that show such evidence is the distribution of the directions toward which galaxies spin. The advantage of the analysis of the distribution of galaxy spin directions compared to the CMB anisotropy is that the ratio of galaxy spin directions is a relative measurement, and therefore less sensitive to background contamination such as Milky Way obstruction. Another advantage is that many spiral galaxies have spectra, which makes it possible to analyze the location of such an axis relative to Earth. This paper shows an analysis of the distribution of the spin directions of over 90K galaxies with spectra. That analysis is also compared to previous analyses using the Earth-based SDSS, Pan-STARRS, and DESI Legacy Survey, as well as space-based data collected by HST. The results show very good agreement between the distribution patterns observed with the different telescopes. The dipole or quadrupole axes formed by the spin directions of the galaxies with spectra do not necessarily go directly through Earth.

preprint, 2022, arXiv

New evidence and analysis of cosmological-scale asymmetry in galaxy spin directions

In the past several decades, multiple cosmological theories that are based on the contention that the Universe has a major axis have been proposed. Such theories can be based on the geometry of the Universe, or multiverse theories such as black hole cosmology. The contention of a cosmological-scale axis is supported by certain evidence such as the dipole axis formed by the CMB distribution. Here I study another form of cosmological-scale axis, based on the distribution of the spin direction of spiral galaxies. Data from four different telescopes is analyzed, showing nearly identical axis profiles when the distribution of the redshifts of the galaxies is similar.

preprint, 2022, arXiv

Self-Supervised Approach to Addressing Zero-Shot Learning Problem

In recent years, self-supervised learning has had significant success in applications involving computer vision and natural language processing. The type of pretext task is important to this boost in performance. One common pretext task is the measure of similarity and dissimilarity between pairs of images. In this scenario, the two images that make up the negative pair are visibly different to humans. However, in entomology, species are nearly indistinguishable and thus hard to differentiate. In this study, we explored the performance of a Siamese neural network using contrastive loss by learning to push apart the embeddings of dissimilar bumblebee species pairs and pull together similar embeddings. Our experimental results show a 61% F1-score on zero-shot instances, a performance showing 11% improvement on samples of classes that share intersections with the training set.
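
As an illustration of the push-apart and pull-together behavior mentioned in this abstract, the sketch below implements one widely used pairwise form of contrastive loss. The abstract does not state which exact formulation, margin, or distance the authors used, so the squared Euclidean form, the `margin` default, and the function names are assumptions made here.

```python
import math
from typing import Sequence


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two embedding vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_loss(
    emb_a: Sequence[float],
    emb_b: Sequence[float],
    same_species: bool,
    margin: float = 1.0,  # assumed value, not taken from the paper
) -> float:
    """Pairwise contrastive loss for a single pair of embeddings."""
    d = euclidean(emb_a, emb_b)
    if same_species:
        # Similar pair: any distance is penalized, pulling embeddings together.
        return d ** 2
    # Dissimilar pair: penalized only while closer than the margin, pushing them apart.
    return max(0.0, margin - d) ** 2
```

With this form, a dissimilar pair that is already farther apart than the margin contributes zero loss, so training effort concentrates on the hard, nearly indistinguishable pairs.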

preprint, 2022, arXiv

Systematic biases when using deep neural networks for annotating large catalogs of astronomical images

Deep convolutional neural networks (DCNNs) have become the most common solution for automatic image annotation due to their non-parametric nature, good performance, and their accessibility through libraries such as TensorFlow. Among other fields, DCNNs are also a common approach to the annotation of large astronomical image databases acquired by digital sky surveys. One of the main downsides of DCNNs is the complex non-intuitive rules that make DCNNs act as a "black box", providing annotations in a manner that is unclear to the user. Therefore, the user is often not able to know what information is used by the DCNNs for the classification. Here we demonstrate that the training of a DCNN is sensitive to the context of the training data such as the location of the objects in the sky. We show that for basic classification of elliptical and spiral galaxies, the sky location of the galaxies used for training affects the behavior of the algorithm, and leads to a small but consistent and statistically significant bias. That bias exhibits itself in the form of cosmological-scale anisotropy in the distribution of basic galaxy morphology. Therefore, while DCNNs are powerful tools for annotating images of extended sources, the construction of training sets for galaxy morphology should take into consideration more aspects than the visual appearance of the object. In any case, catalogs created with deep neural networks that exhibit signs of cosmological anisotropy should be interpreted with the possibility of consistent bias.

preprint, 2021, arXiv

AdeNet: Deep learning architecture that identifies damaged electrical insulators in power lines

Ceramic insulators are important to electronic systems, designed and installed to protect humans from the danger of high voltage electric current. However, insulators are not immortal, and natural deterioration can gradually damage them. Therefore, the condition of insulators must be continually monitored, which is normally done using UAVs. UAVs collect many images of insulators, and these images are then analyzed to identify those that are damaged. Here we describe AdeNet as a deep neural network designed to identify damaged insulators, and test multiple approaches to automatic analysis of the condition of insulators. Several deep neural networks were tested, as were shallow learning methods. The best results (88.8%) were achieved using AdeNet without transfer learning. AdeNet also reduced the false negative rate to $\sim$7%. While the method cannot fully replace human inspection, its high throughput can reduce the amount of labor required to monitor lines for damaged insulators and provide early warning to replace damaged insulators.

preprint, 2021, arXiv

Analysis of the alignment of non-random patterns of spin directions in populations of spiral galaxies

Observations of non-random distribution of galaxies with opposite spin directions have recently attracted considerable attention. Here, a method for identifying cosine dependence in a dataset of galaxies annotated by their spin directions is described in the light of different aspects that can impact the statistical analysis of the data. These aspects include the presence of duplicate objects in a dataset, errors in the galaxy annotation process, and non-random distribution of the asymmetry that does not necessarily form a dipole or quadrupole axis. The results show that duplicate objects in the dataset can artificially increase the likelihood of cosine dependence detected in the data, but a very high number of duplicate objects is required to lead to a false detection of an axis. Inaccuracy in galaxy annotations has a relatively minor impact on the identification of cosine dependence when the error is randomly distributed between clockwise and counterclockwise galaxies. However, when the error is not random, even a small bias of 1% leads to a statistically significant cosine dependence that peaks at the celestial pole. Experiments with artificial datasets in which the distribution was not random showed strong cosine dependence even when the data did not form a full dipole axis alignment. The analysis when using the unmodified data shows an asymmetry profile similar to the profile shown in multiple previous studies using several different telescopes.
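
To make the idea of cosine dependence concrete, the sketch below scores how strongly spin directions follow the cosine of the angular distance from a candidate axis. It uses a plain Pearson correlation as a stand-in statistic; the abstract does not specify the fitting procedure, so this is not the paper's method, and the function names and the toy galaxy list are hypothetical.

```python
import math
from typing import List, Tuple

# Each galaxy: (right ascension in degrees, declination in degrees, spin as +1 or -1).
Galaxy = Tuple[float, float, int]


def cos_angular_distance(ra1: float, dec1: float, ra2: float, dec2: float) -> float:
    """Cosine of the angle between two sky positions given in degrees."""
    a1, d1, a2, d2 = map(math.radians, (ra1, dec1, ra2, dec2))
    return math.sin(d1) * math.sin(d2) + math.cos(d1) * math.cos(d2) * math.cos(a1 - a2)


def cosine_dependence(galaxies: List[Galaxy], axis_ra: float, axis_dec: float) -> float:
    """Pearson correlation between spin and cos(phi) measured from a candidate axis."""
    xs = [cos_angular_distance(ra, dec, axis_ra, axis_dec) for ra, dec, _ in galaxies]
    ys = [float(spin) for _, _, spin in galaxies]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # correlation is undefined when either variable is constant
    return cov / math.sqrt(var_x * var_y)


# Toy data (made up): +1 spins in the north, -1 spins in the south.
toy = [(0.0, 60.0, 1), (90.0, 30.0, 1), (180.0, -30.0, -1), (270.0, -60.0, -1)]
north = cosine_dependence(toy, axis_ra=0.0, axis_dec=90.0)
south = cosine_dependence(toy, axis_ra=0.0, axis_dec=-90.0)
```

Scanning `cosine_dependence` over a grid of candidate (RA, Dec) positions and comparing the best score against scores from randomly reassigned spins is one way to look for a most likely axis under these assumptions.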

preprint, 2021, arXiv

Automatic identification of outliers in Hubble Space Telescope galaxy images

Rare extragalactic objects can carry substantial information about the past, present, and future universe. Given the size of astronomical databases in the information era, it can be assumed that very many outlier galaxies are included in existing and future astronomical databases. However, manual search for these objects is impractical due to the required labor, and therefore the ability to detect such objects largely depends on computer algorithms. This paper describes an unsupervised machine learning algorithm for automatic detection of outlier galaxy images, and its application to several Hubble Space Telescope fields. The algorithm does not require training, and therefore is not dependent on the preparation of clean training sets. The application of the algorithm to a large collection of galaxies detected a variety of outlier galaxy images. The algorithm is not perfect in the sense that not all objects detected by the algorithm are indeed considered outliers, but it reduces the dataset by two orders of magnitude to allow practical manual identification. The catalogue contains 147 objects that would be very difficult to identify without using automation.

preprint, 2020, arXiv

Asymmetry between galaxies with different spin patterns: A comparison between COSMOS, SDSS, and Pan-STARRS

Previous observations of a large number of galaxies show differences between the photometry of spiral galaxies with clockwise spin patterns and spiral galaxies with counterclockwise spin patterns. In this study the mean magnitude of a large number of clockwise galaxies is compared to the mean magnitude of a large number of counterclockwise galaxies. The observed difference between clockwise and counterclockwise spiral galaxies imaged by the space-based COSMOS survey is compared to the differences between clockwise and counterclockwise galaxies imaged by the Earth-based SDSS and Pan-STARRS around the same field. The annotation of clockwise and counterclockwise galaxies is a fully automatic process that does not involve human intervention, and in all experiments both clockwise and counterclockwise galaxies are separated from the same fields. The comparison shows that the same asymmetry was identified by all three telescopes, providing strong evidence that the rotation direction of a spiral galaxy is linked to its luminosity as measured from Earth. Analysis of the luminosity difference using a large number of galaxies from different parts of the sky shows that the difference between clockwise and counterclockwise galaxies changes with the direction of observation, and is oriented around an axis.

preprint, 2020, arXiv

Eliminating self-selection: Using data science for authentic undergraduate research in a first-year introductory course

Research experience and mentoring have been identified as an effective intervention for increasing student engagement and retention in the STEM fields, with high impact on students from underserved populations. However, one-on-one mentoring is limited by the number of available faculty, and in certain cases also by the availability of funding for stipends. One-on-one mentoring is further limited by the selection and self-selection of students. Since research positions are often competitive, they are often taken by the best-performing students. More importantly, many students who do not see themselves as the top students of their class, or do not identify themselves as researchers, might not apply, and that self-selection can have the highest impact on non-traditional students. To address the obstacles of scalability, selection, and self-selection, we designed a data science research experience for undergraduates as part of an introductory computer science course. Through the intervention, the students are exposed to authentic research as early as their first semester. The intervention is inclusive in the sense that all students registered to the course participate in the research, with no process of selection or self-selection. The research is focused on analytics of large text databases. Using discovery-enabling software tools, the students analyze a corpus of congressional speeches, and identify patterns of differences between Democratic speeches and Republican speeches, differences between speeches for and against certain bills, and differences between speeches about bills that passed and bills that did not pass. In the beginning of the research experience all students follow the same protocol and use the same data, and then each group of students works on its own research project as part of the final project of the course.

preprint, 2020, arXiv

Large-scale asymmetry between clockwise and counterclockwise galaxies revisited

The ability of digital sky surveys to collect and store very large amounts of data provides completely new ways to study the local universe. Perhaps one of the most provocative observations reported with such tools is the asymmetry between galaxies with clockwise and counterclockwise spin patterns. Here I use $\sim1.7\cdot10^5$ spiral galaxies from SDSS and sort them by their spin patterns (clockwise or counterclockwise) to identify and profile a possible large-scale pattern of the distribution of galaxy spin patterns as observed from Earth. The analysis shows asymmetry between the number of clockwise and counterclockwise spiral galaxies imaged by SDSS, and a dipole axis. These findings largely agree with previous reports using smaller datasets. The probability that the difference between the numbers of clockwise and counterclockwise galaxies occurs by chance is $P<4\cdot10^{-9}$, and the probability that an asymmetry axis occurs by mere chance is $P<1.4\cdot10^{-5}$.
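
The chance probability of an imbalance between clockwise and counterclockwise counts can be estimated in several ways. The sketch below uses a two-sided normal approximation to a fair-coin binomial; the abstract does not say which test the author applied or give the underlying counts, so the function name, the choice of test, and the example counts are assumptions for illustration only.

```python
import math


def asymmetry_p_value(n_cw: int, n_ccw: int) -> float:
    """Two-sided probability of a count imbalance at least this large under a 50/50 null.

    Uses the normal approximation to the binomial, which is reasonable for large samples.
    """
    n = n_cw + n_ccw
    if n == 0:
        raise ValueError("need at least one galaxy")
    # Standardized excess of clockwise galaxies over the expected n/2.
    z = (n_cw - n / 2) / math.sqrt(n / 4)
    # Two-sided tail probability of a standard normal variable.
    return math.erfc(abs(z) / math.sqrt(2))


# Made-up counts, not taken from the paper.
p_balanced = asymmetry_p_value(50, 50)
p_skewed = asymmetry_p_value(60, 40)
```

For very small p-values or small samples an exact binomial computation would be the safer choice; the approximation is kept here only for brevity.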

preprint, 2020, arXiv

Patterns of galaxy spin directions in SDSS and Pan-STARRS show parity violation and multipoles

The distribution of spin directions of $\sim6.4\cdot10^4$ SDSS spiral galaxies with spectra was examined, and compared to the distribution of $\sim3.3\cdot10^4$ Pan-STARRS galaxies. The analysis shows a statistically significant asymmetry between the number of SDSS galaxies with opposite spin directions, and the magnitude and direction of the asymmetry change with the direction of observation and with the redshift. The redshift dependence shows that the distribution of the spin direction of SDSS galaxies becomes more asymmetric as the redshift gets higher. Fitting the distribution of the galaxy spin directions to a quadrupole alignment provides fitness with statistical significance $>5\sigma$, which grows to $>8\sigma$ when just galaxies with $z>0.15$ are used. Similar analysis with Pan-STARRS galaxies provides dipole and quadrupole alignments nearly identical to the analysis of SDSS galaxies, showing that the source of the asymmetry is not necessarily a certain unknown flaw in a specific telescope system. While these observations are clearly provocative, there is no known error that could exhibit itself in such form. The data analysis process is fully automatic, and uses deterministic and symmetric algorithms with defined rules. It does not involve either manual analysis that can lead to human perceptual bias, or machine learning that can capture human biases or other subtle differences that are difficult to identify due to the complex nature of machine learning processes. Also, an error in the galaxy annotation process is expected to show consistent bias in all parts of the sky, rather than change with the direction of observation to form a clear and definable pattern.