Source author record

Sujoy Ganguly

Sujoy Ganguly appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Machine Learning Computer Vision Databases Graphics Biological Physics cond-mat.soft cond-mat.stat-mech Neurons and Cognition physics.flu-dyn Software Engineering

Catalog footprint

What is connected

6works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning

The creation and destruction of agents in cooperative multi-agent reinforcement learning (MARL) is a critically under-explored area of research. Current MARL algorithms often assume that the number of agents within a group remains fixed throughout an experiment. However, in many practical problems, an agent may terminate before their teammates. This early termination issue presents a challenge: the terminated agent must learn from the group's success or failure which occurs beyond its own existence. We refer to propagating value from rewards earned by remaining teammates to terminated agents as the Posthumous Credit Assignment problem. Current MARL methods handle this problem by placing these agents in an absorbing state until the entire group of agents reaches a termination condition. Although absorbing states enable existing algorithms and APIs to handle terminated agents without modification, practical training efficiency and resource use problems exist. In this work, we first demonstrate that sample complexity increases with the quantity of absorbing states in a toy supervised learning task for a fully connected network, while attention is more robust to variable size input. Then, we present a novel architecture for an existing state-of-the-art MARL algorithm which uses attention instead of a fully connected layer with absorbing states. Finally, we demonstrate that this novel architecture significantly outperforms the standard architecture on tasks in which agents are created or destroyed within episodes as well as standard multi-agent coordination tasks.

preprint2022arXiv

PeopleSansPeople: A Synthetic Data Generator for Human-Centric Computer Vision

In recent years, person detection and human pose estimation have made great strides, helped by large-scale labeled datasets. However, these datasets had no guarantees or analysis of human activities, poses, or context diversity. Additionally, privacy, legal, safety, and ethical concerns may limit the ability to collect more human data. An emerging alternative to real-world data that alleviates some of these issues is synthetic data. However, creation of synthetic data generators is incredibly challenging and prevents researchers from exploring their usefulness. Therefore, we release a human-centric synthetic data generator PeopleSansPeople which contains simulation-ready 3D human assets, a parameterized lighting and camera system, and generates 2D and 3D bounding box, instance and semantic segmentation, and COCO pose labels. Using PeopleSansPeople, we performed benchmark synthetic data training using a Detectron2 Keypoint R-CNN variant [1]. We found that pre-training a network using synthetic data and fine-tuning on various sizes of real-world data resulted in a keypoint AP increase of $+38.03$ ($44.43 \pm 0.17$ vs. $6.40$) for few-shot transfer (limited subsets of COCO-person train [2]), and an increase of $+1.47$ ($63.47 \pm 0.19$ vs. $62.00$) for abundant real data regimes, outperforming models trained with the same real data alone. We also found that our models outperformed those pre-trained with ImageNet with a keypoint AP increase of $+22.53$ ($44.43 \pm 0.17$ vs. $21.90$) for few-shot transfer and $+1.07$ ($63.47 \pm 0.19$ vs. $62.40$) for abundant real data regimes. This freely-available data generator should enable a wide range of research into the emerging field of simulation to real transfer learning in the critical area of human-centric computer vision.

preprint2022arXiv

PSP-HDRI$+$: A Synthetic Dataset Generator for Pre-Training of Human-Centric Computer Vision Models

We introduce a new synthetic data generator PSP-HDRI$+$ that proves to be a superior pre-training alternative to ImageNet and other large-scale synthetic data counterparts. We demonstrate that pre-training with our synthetic data will yield a more general model that performs better than alternatives even when tested on out-of-distribution (OOD) sets. Furthermore, using ablation studies guided by person keypoint estimation metrics with an off-the-shelf model architecture, we show how to manipulate our synthetic data generator to further improve model performance.

preprint2021arXiv

Technology Readiness Levels for Machine Learning Systems

The development and deployment of machine learning (ML) systems can be executed easily with modern tools, but the process is typically rushed and means-to-an-end. The lack of diligence can lead to technical debt, scope creep and misaligned objectives, model misuse and failures, and expensive consequences. Engineering systems, on the other hand, follow well-defined processes and testing standards to streamline development for high-quality, reliable results. The extreme is spacecraft systems, where mission critical measures and robustness are ingrained in the development process. Drawing on experience in both spacecraft engineering and ML (from research through product across domain areas), we have developed a proven systems engineering approach for machine learning development and deployment. Our "Machine Learning Technology Readiness Levels" (MLTRL) framework defines a principled process to ensure robust, reliable, and responsible systems while being streamlined for ML workflows, including key distinctions from traditional software engineering. Even more, MLTRL defines a lingua franca for people across teams and organizations to work collaboratively on artificial intelligence and machine learning technologies. Here we describe the framework and elucidate it with several real world use-cases of developing ML methods from basic research through productization and deployment, in areas such as medical diagnostics, consumer computer vision, satellite imagery, and particle physics.

preprint2016arXiv

Morphology of Fly Larval Class IV Dendrites Accords with a Random Branching and Contact Based Branch Deletion Model

Dendrites are branched neuronal processes that receive input signals from other neurons or the outside world [1]. To maintain connectivity as the organism grows, dendrites must also continue to grow. For example, the dendrites in the peripheral nervous system continue to grow and branch to maintain proper coverage of their receptor fields [2, 3, 4, 5]. One such neuron is the Drosophila melanogaster class IV dendritic arborization neuron [6]. The dendritic arbors of these neurons tile the larval surface [7], where they detect localized noxious stimuli, such as jabs from parasitic wasps [8]. In the present study, we used a novel measure, the hitting probability, to show that the class IV neuron forms a tight mesh that covers the larval surface. Furthermore, we found that the mesh size remains largely unchanged during the larval stages, despite a dramatic increase in overall size of the neuron and the larva. We also found that the class IV dendrites are dense (assayed with the fractal dimension) and uniform (assayed with the lacunarity) throughout the larval stages. To understand how the class IV neuron maintains its morphology during larval development, we constructed a mathematical model based on random branching and self-avoidance. We found that if the branching rate is uniform in space and time and that if all contacting branches are deleted, we can reproduce the branch length distribution, mesh size and density of the class IV dendrites throughout the larval stages. Thus, a simple set of statistical rules can generate and maintain a complex branching morphology during growth.

preprint2011arXiv

Fluid dynamics and noise in bacterial cell-cell and cell-surface scattering

Bacterial processes ranging from gene expression to motility and biofilm formation are constantly challenged by internal and external noise. While the importance of stochastic fluctuations has been appreciated for chemotaxis, it is currently believed that deterministic long-range fluid dynamical effects govern cell-cell and cell-surface scattering - the elementary events that lead to swarming and collective swimming in active suspensions and to the formation of biofilms. Here, we report the first direct measurements of the bacterial flow field generated by individual swimming Escherichia coli both far from and near to a solid surface. These experiments allowed us to examine the relative importance of fluid dynamics and rotational diffusion for bacteria. For cell-cell interactions it is shown that thermal and intrinsic stochasticity drown the effects of long-range fluid dynamics, implying that physical interactions between bacteria are determined by steric collisions and near-field lubrication forces. This dominance of short-range forces closely links collective motion in bacterial suspensions to self-organization in driven granular systems, assemblages of biofilaments, and animal flocks. For the scattering of bacteria with surfaces, long-range fluid dynamical interactions are also shown to be negligible before collisions; however, once the bacterium swims along the surface within a few microns after an aligning collision, hydrodynamic effects can contribute to the experimentally observed, long residence times. As these results are based on purely mechanical properties, they apply to a wide range of microorganisms.

Sujoy Ganguly

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning

PeopleSansPeople: A Synthetic Data Generator for Human-Centric Computer Vision

PSP-HDRI$+$: A Synthetic Dataset Generator for Pre-Training of Human-Centric Computer Vision Models

Technology Readiness Levels for Machine Learning Systems

Morphology of Fly Larval Class IV Dendrites Accords with a Random Branching and Contact Based Branch Deletion Model

Fluid dynamics and noise in bacterial cell-cell and cell-surface scattering