Source author record

Sahil Gupta

Sahil Gupta appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision eess.IV Machine Learning physics.class-ph

Catalog footprint

What is connected

3works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Novel View Synthesis using DDIM Inversion

Synthesizing novel views from a single input image is a challenging task. It requires extrapolating the 3D structure of a scene while inferring details in occluded regions, and maintaining geometric consistency across viewpoints. Many existing methods must fine-tune large diffusion backbones using multiple views or train a diffusion model from scratch, which is extremely expensive. Additionally, they suffer from blurry reconstruction and poor generalization. This gap presents the opportunity to explore an explicit lightweight view translation framework that can directly utilize the high-fidelity generative capabilities of a pretrained diffusion model while reconstructing a scene from a novel view. Given the DDIM-inverted latent of a single input image, we employ a camera pose-conditioned translation U-Net, TUNet, to predict the inverted latent corresponding to the desired target view. However, the image sampled using the predicted latent may result in a blurry reconstruction. To this end, we propose a novel fusion strategy that exploits the inherent noise correlation structure observed in DDIM inversion. The proposed fusion strategy helps preserve the texture and fine-grained details. To synthesize the novel view, we use the fused latent as the initial condition for DDIM sampling, leveraging the generative prior of the pretrained diffusion model. Extensive experiments on MVImgNet demonstrate that our method outperforms existing methods.

preprint2020arXiv

Particle sliding on a turntable in the presence of frictional forces

Motion of a point particle sliding on a turntable is studied. The equations of motion are derived assuming that the table exerts frictional force on the particle, which is of constant magnitude and directed opposite to the direction of motion of the particle relative to the turntable. After expressing the equations in terms of dimensionless variables, some of the general properties of the solutions are discussed. Approximate analytic solutions are found for the cases in which (i) the particle is released from rest with respect to the lab frame and, (ii) the particle is released from rest with respect to the turntable. The equations are then solved numerically to get a more complete understanding of the motion. It is found that one can define an escape speed for the particle which is the minimum speed required to get the particle to move off to infinity. The escape speed is a function of the distance from the center of the turntable and for a given distance from the center, it depends on the direction of initial velocity. A qualitative explanation of this behavior is given in terms of the fictitious forces. Numerical study also indicates an alternative way for measuring the coefficient of friction between the particle and the turntable.

preprint2020arXiv

Style is a Distribution of Features

Neural style transfer (NST) is a powerful image generation technique that uses a convolutional neural network (CNN) to merge the content of one image with the style of another. Contemporary methods of NST use first or second order statistics of the CNN's features to achieve transfers with relatively little computational cost. However, these methods cannot fully extract the style from the CNN's features. We present a new algorithm for style transfer that fully extracts the style from the features by redefining the style loss as the Wasserstein distance between the distribution of features. Thus, we set a new standard in style transfer quality. In addition, we state two important interpretations of NST. The first is a re-emphasis from Li et al., which states that style is simply the distribution of features. The second states that NST is a type of generative adversarial network (GAN) problem.

Sahil Gupta

What is connected

Connect this record

See the researcher in context

Building this map preview

3 published item(s)

Novel View Synthesis using DDIM Inversion

Particle sliding on a turntable in the presence of frictional forces

Style is a Distribution of Features