Source author record

Abhishek Basu

Abhishek Basu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Networking and Internet Architecture

Catalog footprint

What is connected

2works

2topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SafeDiffusion-R1: Online Reward Steering for Safe Diffusion Post-Training

Diffusion models have been widely studied for removing unsafe content learned during pre-training. Existing methods require expensive supervised data, either unsafe-text paired with safe-image groundtruth or negative/positive image pairs, making them impractical to scale. Furthermore, offline reinforcement learning and supervised fine-tuning approaches that generate synthetic data offline suffer from catastrophic forgetting, degrading generation quality. We propose a novel online reinforcement learning framework that addresses both data scarcity and model degradation through post-training with Group Relative Policy Optimization (GRPO) on both negative and positive text prompts. To eliminate the need for fine-tuning specialized safe/unsafe reward models, we introduce a \textit{steering reward mechanism} that exploits an inherent property of CLIP embeddings: steering text representations toward positive safety directions and away from negative ones in the embedding space. Our online-policy approach enables the model to learn from diverse prompts, including explicit unsafe content, without catastrophic forgetting. Extensive experiments demonstrate that our method reduces inappropriate content to 18.07\% (vs. 48.9\% for SD v1.4) and nudity detections to 15 (vs. 646 baseline) while improving compositional generation quality from 42.08\% to 47.83\% on GenEval. Remarkably, these safety gains generalize to out-of-domain unsafe prompts across seven harm categories, achieving state-of-the-art performance without supervised paired data or reward tuning. Github: https://github.com/MAXNORM8650/SafeDiffusion-R1.

preprint2020arXiv

Employing WiFi Direct to Build a Wireless Network over both 2.4 GHz and 5.8 GHz bands

Almost all the WiFi networks today provide single band (either 2.4 GHz or 5.8 GHz) wireless communication functionality for connected mobile nodes. In a single band network, the interference depends on number of nodes in the network and the presence of other networks in the proximity. As the number of nodes in a Network increases, the interference in the network also increases which reduces the throughput of the network. If there are two single band networks, one operating in 2.4 GHz frequency band and other in 5.8 GHz frequency band, then nodes in network 1 will not cause any interference with nodes in network 2. This can be used as a basic idea to implement a network where the nodes in the same network use both the frequency bands to minimize the interference between nodes. We show that building dual band multi-hop network gives better performance in terms of throughput compared to that of a network with single frequency band.