Researcher profile

Ivan Garibay

Ivan Garibay contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

ConRetroBert: EMA Stabilized Dual Encoders for Template-Based Single-Step Retrosynthesis

Template based single step retrosynthesis predicts reactants by selecting and applying an explicit reaction template, making each prediction traceable to a chemical transformation rule. This is useful for synthesis planning, but template based methods are often viewed as less competitive than template free models because template prediction is commonly formulated as global classification over a long tailed rule library. We argue that this weakness is not inherent to templates, but to the learning formulation. We present ConRetroBert, a dual encoder framework that reframes template based retrosynthesis as dense product template retrieval followed by candidate set listwise ranking. Stage 1 uses contrastive pretraining to learn a shared embedding space between products and reaction templates. Stage 2 refines template ranking over mined hard negative candidate sets with a multi positive listwise objective. To enable template side adaptation without destabilizing hard negative mining, ConRetroBert uses a slow moving exponential moving average template encoder for retrieval bank construction while updating the live template encoder through the ranking loss. On the local USPTO-50k benchmark, Stage 2 candidate set ranking improves top-1 reaction accuracy from 50.5% to 61.3%, while EMA stabilized template adaptation further improves it to 62.4%. Fine tuning from a leakage controlled USPTO-Full checkpoint reaches 75.4% top-1 accuracy on USPTO-50k. We also show that retrieval based template prediction is strong in the long tail of rare templates, and that many correct reactant predictions arise from alternative explicit templates rather than only the recorded positive label. Code and data are available at https://github.com/JahidBasher/ConRetroBert.

preprint2022arXiv

Dominant Set-based Active Learning for Text Classification and its Application to Online Social Media

Recent advances in natural language processing (NLP) in online social media are evidently owed to large-scale datasets. However, labeling, storing, and processing a large number of textual data points, e.g., tweets, has remained challenging. On top of that, in applications such as hate speech detection, labeling a sufficiently large dataset containing offensive content can be mentally and emotionally taxing for human annotators. Thus, NLP methods that can make the best use of significantly less labeled data points are of great interest. In this paper, we present a novel pool-based active learning method that can be used for the training of large unlabeled corpus with minimum annotation cost. For that, we propose to find the dominant sets of local clusters in the feature space. These sets represent maximally cohesive structures in the data. Then, the samples that do not belong to any of the dominant sets are selected to be used to train the model, as they represent the boundaries of the local clusters and are more challenging to classify. Our proposed method does not have any parameters to be tuned, making it dataset-independent, and it can approximately achieve the same classification accuracy as full training data, with significantly fewer data points. Additionally, our method achieves a higher performance in comparison to the state-of-the-art active learning strategies. Furthermore, our proposed algorithm is able to incorporate conventional active learning scores, such as uncertainty-based scores, into its selection criteria. We show the effectiveness of our method on different datasets and using different neural network architectures.

preprint2022arXiv

Resilience from Diversity: Population-based approach to harden models against adversarial attacks

Traditional deep learning networks (DNN) exhibit intriguing vulnerabilities that allow an attacker to force them to fail at their task. Notorious attacks such as the Fast Gradient Sign Method (FGSM) and the more powerful Projected Gradient Descent (PGD) generate adversarial samples by adding a magnitude of perturbation $ε$ to the input's computed gradient, resulting in a deterioration of the effectiveness of the model's classification. This work introduces a model that is resilient to adversarial attacks. Our model leverages an established mechanism of defense which utilizes randomness and a population of DNNs. More precisely, our model consists of a population of $n$ diverse submodels, each one of them trained to individually obtain a high accuracy for the task at hand, while forced to maintain meaningful differences in their weights. Each time our model receives a classification query, it selects a submodel from its population at random to answer the query. To counter the attack transferability, diversity is introduced and maintained in the population of submodels. Thus introducing the concept of counter linking weights. A Counter-Linked Model (CLM) consists of a population of DNNs of the same architecture where a periodic random similarity examination is conducted during the simultaneous training to guarantee diversity while maintaining accuracy. Though the randomization technique proved to be resilient against adversarial attacks, we show that by retraining the DNNs ensemble or training them from the start with counter linking would enhance the robustness by around 20\% when tested on the MNIST dataset and at least 15\% when tested on the CIFAR-10 dataset. When CLM is coupled with adversarial training, this defense mechanism achieves state-of-the-art robustness.

preprint2021arXiv

Deep Agent: Studying the Dynamics of Information Spread and Evolution in Social Networks

This paper explains the design of a social network analysis framework, developed under DARPA's SocialSim program, with novel architecture that models human emotional, cognitive and social factors. Our framework is both theory and data-driven, and utilizes domain expertise. Our simulation effort helps in understanding how information flows and evolves in social media platforms. We focused on modeling three information domains: cryptocurrencies, cyber threats, and software vulnerabilities for the three interrelated social environments: GitHub, Reddit, and Twitter. We participated in the SocialSim DARPA Challenge in December 2018, in which our models were subjected to extensive performance evaluation for accuracy, generalizability, explainability, and experimental power. This paper reports the main concepts and models, utilized in our social media modeling effort in developing a multi-resolution simulation at the user, community, population, and content levels.

preprint2020arXiv

A Stance Data Set on Polarized Conversations on Twitter about the Efficacy of Hydroxychloroquine as a Treatment for COVID-19

At the time of this study, the SARS-CoV-2 virus that caused the COVID-19 pandemic has spread significantly across the world. Considering the uncertainty about policies, health risks, financial difficulties, etc. the online media, specially the Twitter platform, is experiencing a high volume of activity related to this pandemic. Among the hot topics, the polarized debates about unconfirmed medicines for the treatment and prevention of the disease have attracted significant attention from online media users. In this work, we present a stance data set, COVID-CQ, of user-generated content on Twitter in the context of COVID-19. We investigated more than 14 thousand tweets and manually annotated the opinions of the tweet initiators regarding the use of "chloroquine" and "hydroxychloroquine" for the treatment or prevention of COVID-19. To the best of our knowledge, COVID-CQ is the first data set of Twitter users' stances in the context of the COVID-19 pandemic, and the largest Twitter data set on users' stances towards a claim, in any domain. We have made this data set available to the research community via GitHub. We expect this data set to be useful for many research purposes, including stance detection, evolution and dynamics of opinions regarding this outbreak, and changes in opinions in response to the exogenous shocks such as policy decisions and events.

preprint2020arXiv

CD-SEIZ: Cognition-Driven SEIZ Compartmental Model for the Prediction of Information Cascades on Twitter

Information spreading social media platforms has become ubiquitous in our lives due to viral information propagation regardless of its veracity. Some information cascades turn out to be viral since they circulated rapidly on the Internet. The uncontrollable virality of manipulated or disorientated true information (fake news) might be quite harmful, while the spread of the true news is advantageous, especially in emergencies. We tackle the problem of predicting information cascades by presenting a novel variant of SEIZ (Susceptible/ Exposed/ Infected/ Skeptics) model that outperforms the original version by taking into account the cognitive processing depth of users. We define an information cascade as the set of social media users' reactions to the original content which requires at least minimal physical and cognitive effort; therefore, we considered retweet/ reply/ quote (mention) activities and tested our framework on the Syrian White Helmets Twitter data set from April 1st, 2018 to April 30th, 2019. In the prediction of cascade pattern via traditional compartmental models, all the activities are grouped, and their summation is taken into account; however, transition rates between compartments should vary according to the activity type since their requirements of physical and cognitive efforts are not same. Based on this assumption, we design a cognition-driven SEIZ (CD-SEIZ) model in the prediction of information cascades on Twitter. We tested SIS, SEIZ, and CD-SEIZ models on 1000 Twitter cascades and found that CD-SEIZ has a significantly low fitting error and provides a statistically more accurate estimation.

preprint2020arXiv

Effects of Assortativity on Consensus Formation with Heterogeneous Agents

Despite the widespread use of Barabasi's scale-free networks and Erdos-Renyi networks of which degree correlation (assortativity) is neutral, numerous studies demonstrated that online social networks tend to show assortative mixing (positive degree correlation), while non-social networks show a disassortative mixing (negative degree correlation). First, we analyzed the variability in the assortativity coefficients of different groups of the same platform by using three different subreddits in Reddit. Our data analysis results showed that Reddit is disassortative, and assortativity coefficients of the aforementioned subreddits are computed as -0.0384, -0.0588 and -0.1107, respectively. Motivated by the variability in the results even in the same platform, we decided to investigate the sensitivity of dynamics of consensus formation to the assortativity of the network. We concluded that the system is more likely to reach a consensus when the network is disassortatively mixed or neutral; however, the likelihood of the consensus significantly decreases when the network is assortatively mixed. Surprisingly, the time elapsed until all nodes fix their opinions is slightly lower when the network is neutral compared to either assortative or disassortative networks. These results are more pronounced when the thresholds of agents are more heterogeneously distributed.

preprint2020arXiv

Effects of Non-Cognitive Factors on Post-Secondary Persistence of Deaf Students: An Agent-Based Modeling Approach

Post-secondary education persistence is the likelihood of a student remaining in post-secondary education. Although statistics show that post-secondary persistence for deaf students has increased recently, there are still many obstacles obstructing students from completing their post-secondary degree goals. Therefore, increasing the persistence rate is crucial to increase education and work goals for deaf students. In this work, we present an agent-based model using NetLogo software for the persistence phenomena of deaf students. We consider four non-cognitive factors: having clear goals, social integration, social skills, and academic experience, which influence the departure decision of deaf students. Progress and results of this work suggest that agent-based modeling approaches promise to give better understanding of what will increase persistence.

preprint2020arXiv

Facial Expression Phoenix (FePh): An Annotated Sequenced Dataset for Facial and Emotion-Specified Expressions in Sign Language

Facial expressions are important parts of both gesture and sign language recognition systems. Despite the recent advances in both fields, annotated facial expression datasets in the context of sign language are still scarce resources. In this manuscript, we introduce an annotated sequenced facial expression dataset in the context of sign language, comprising over $3000$ facial images extracted from the daily news and weather forecast of the public tv-station PHOENIX. Unlike the majority of currently existing facial expression datasets, FePh provides sequenced semi-blurry facial images with different head poses, orientations, and movements. In addition, in the majority of images, identities are mouthing the words, which makes the data more challenging. To annotate this dataset we consider primary, secondary, and tertiary dyads of seven basic emotions of "sad", "surprise", "fear", "angry", "neutral", "disgust", and "happy". We also considered the "None" class if the image's facial expression could not be described by any of the aforementioned emotions. Although we provide FePh as a facial expression dataset of signers in sign language, it has a wider application in gesture recognition and Human Computer Interaction (HCI) systems.

preprint2020arXiv

Growing green: the role of path dependency and structural jumps in the green economy expansion

Existing research argues that countries increase their production basket by adding products which require similar capabilities to those they already produce, a process referred to as path dependency. Green economic growth is a global movement that seeks to achieve economic expansion while at the same time mitigating environmental risks. We postulate that countries engaging in green economic growth are motivated to invest strategically to develop new capabilities that will help them transition to a green economy. As a result, they could potentially increase their production baskets not only by a path dependent process but also by the non path dependent process we term, high investment structural jumps. The main objective of this research is to determine whether countries increase their green production basket mainly by a process of path dependency, or alternatively, by a process of structural jumps. We analyze data from 65 countries and over a period from years 2007 to 2017. We focus on China as our main case study. The results of this research show that countries not only increase their green production baskets based on their available capabilities, following path dependency, but also expand to products that path dependency does not predict by investing in innovating and developing new environmental related technologies.

preprint2020arXiv

Quantifying Latent Moral Foundations in Twitter Narratives: The Case of the Syrian White Helmets Misinformation

For years, many studies employed sentiment analysis to understand the reasoning behind people's choices and feelings, their communication styles, and the communities which they belong to. We argue that gaining more in-depth insight into moral dimensions coupled with sentiment analysis can potentially provide superior results. Understanding moral foundations can yield powerful results in terms of perceiving the intended meaning of the text data, as the concept of morality provides additional information on the unobservable characteristics of information processing and non-conscious cognitive processes. Therefore, we studied latent moral loadings of Syrian White Helmets-related tweets of Twitter users from April 1st, 2018 to April 30th, 2019. For the operationalization and quantification of moral rhetoric in tweets, we use Extended Moral Foundations Dictionary in which five psychological dimensions (Harm/Care, Fairness/Reciprocity, In-group/Loyalty, Authority/Respect and Purity/Sanctity) are considered. We show that people tend to share more tweets involving the virtue moral rhetoric than the tweets involving the vice rhetoric. We observe that the pattern of the moral rhetoric of tweets among these five dimensions are very similar during different time periods, while the strength of the five dimension is time-variant. Even though there is no significant difference between the use of Fairness/Reciprocity, In-group/Loyalty or Purity/Sanctity rhetoric, the less use of Harm/Care rhetoric is significant and remarkable. Besides, the strength of the moral rhetoric and the polarization in morality across people are mostly observed in tweets involving Harm/Care rhetoric despite the number of tweets involving the Harm/Care dimension is low.

preprint2020arXiv

Resistance of communities against disinformation

The spread of disinformation is considered a big threat to societies and has recently received unprecedented attention. In this paper we propose an agent-based model to simulate dissemination of a conspiracy in a population. The model is able to compare the resistance of different network structures against the activity of conspirators. Results show that connectedness of network structure and centrality of conspirators are of crucial importance in preventing conspiracies from becoming widespread.

preprint2020arXiv

The Degree-Dependent Threshold Model: Towards a Better Understanding of Opinion Dynamics on Online Social Networks

With the rapid growth of online social media, people become increasingly overwhelmed by the volume and the content of the information present in the environment. The threshold model is currently one of the most common methods to capture the effect of people on others' opinions and emotions. Although many studies employ and try to improve upon the threshold model, the search for an appropriate threshold function for defining human behavior is an essential and yet unattained quest. The definition of heterogeneity in thresholds of individuals is oftentimes poorly defined, which leads to the rather simplistic use of uniform and binary functions, albeit they are far from representing the reality. In this study, we use Twitter data of size 30,704,025 tweets to mimic the adoption of a new opinion. Our results show that the threshold is not only correlated with the out-degree of nodes, which contradicts other studies but also correlated with nodes' in-degree. Therefore, we simulated two cases in which thresholds are out-degree and in-degree dependent, separately. We concluded that the system is more likely to reach a consensus when thresholds are in-degree dependent; however, the time elapsed until all nodes fix their opinions is significantly higher in this case. Additionally, we did not observe a notable effect of mean-degree on either the average opinion or the fixation time of opinions for both cases, and increasing seed size has a negative effect on reaching a consensus. Although threshold heterogeneity has a slight influence on the average opinion, the positive effect of heterogeneity on reaching a consensus is more pronounced when thresholds are in-degree dependent.

preprint2020arXiv

The Effects of Information Overload on Online Conversation Dynamics

The inhibiting effects of information overload on the behavior of online social media users, can affect the population-level characteristics of information dissemination through online conversations. We introduce a mechanistic, agent-based model of information overload and investigate the effects of information overload threshold and rate of information loss on observed online phenomena. We find that conversation volume and participation are lowest under high information overload thresholds and mid-range rates of information loss. Calibrating the model to user responsiveness data on Twitter, we replicate and explain several observed phenomena: 1) Responsiveness is sensitive to information overload threshold at high rates of information loss; 2) Information overload threshold and rate of information loss are Pareto-optimal and users may experience overload at inflows exceeding 30 notifications per hour; 3) Local abundance of small cascades of modest global popularity and local scarcity of larger cascades of high global popularity explains why overloaded users receive, but do not respond to large, highly popular cascades; 4) Users typically work with 7 notifications per hour; 5) Over-exposure to information can suppress the likelihood of response by overloading users, contrary to analogies to biologically-inspired viral spread. Reconceptualizing information spread with the mechanisms of information overload creates a richer representation of online conversation dynamics, enabling a deeper understanding of how (dis)information is transmitted over social media.

preprint2011arXiv

Transformation Networks: How Innovation and the Availability of Technology can Increase Economic Performance

A transformation network describes how one set of resources can be transformed into another via technological processes. Transformation networks in economics are useful because they can highlight areas for future innovations, both in terms of new products, new production techniques, or better efficiency. They also make it easy to detect areas where an economy might be fragile. In this paper, we use computational simulations to investigate how the density of a transformation network affects the economic performance, as measured by the gross domestic product (GDP), of an artificial economy. Our results show that on average, the GDP of our economy increases as the density of the transformation network increases. We also find that while the average performance increases, the maximum possible performance decreases and the minimum possible performance increases.