Source author record

Siddhartha Sen

Siddhartha Sen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

hep-th, cond-mat.mes-hall, math-ph, math.MP, Artificial Intelligence, Machine Learning, cond-mat.other, cs.CY, Data Structures and Algorithms, Networking and Internet Architecture, Biomolecules, cond-mat.mtrl-sci, cond-mat.soft, cond-mat.stat-mech, gr-qc, math.AG, Multimedia, nlin.CD, Quantitative Methods

Catalog footprint

What is connected

26 works

19 topics

4 close collaborators


preprint, 2022, arXiv

Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in Chess

The advent of machine learning models that surpass human decision-making ability in complex domains has initiated a movement towards building AI systems that interact with humans. Many building blocks are essential for this activity, with a central one being the algorithmic characterization of human behavior. While much of the existing work focuses on aggregate human behavior, an important long-range goal is to develop behavioral models that specialize to individual people and can differentiate among them. To formalize this process, we study the problem of behavioral stylometry, in which the task is to identify a decision-maker from their decisions alone. We present a transformer-based approach to behavioral stylometry in the context of chess, where one attempts to identify the player who played a set of games. Our method operates in a few-shot classification framework, and can correctly identify a player from among thousands of candidate players with 98% accuracy given only 100 labeled games. Even when trained on amateur play, our method generalises to out-of-distribution samples of Grandmaster players, despite the dramatic differences between amateur and world-class players. Finally, we consider more broadly what our resulting embeddings reveal about human style in chess, as well as the potential ethical implications of powerful methods for identifying individuals from behavioral data.
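The identification step can be sketched as follows, assuming each game has already been mapped to a fixed-length vector by some encoder (the transformer itself is not reproduced here). The centroid-plus-cosine-similarity matching rule, the function names, and the toy three-dimensional vectors are illustrative assumptions, not the authors' implementation.

```python
import math

def centroid(vectors):
    """Average a list of equal-length embedding vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def identify(reference_games, query_games):
    """reference_games: {player: [embedding, ...]}, labeled games per candidate.
    query_games: embeddings of games played by one unknown player.
    Returns the candidate whose centroid is most similar to the query centroid."""
    q = centroid(query_games)
    return max(reference_games,
               key=lambda p: cosine(centroid(reference_games[p]), q))

# Hypothetical embeddings for two candidate players.
reference = {
    "player_a": [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]],
    "player_b": [[0.1, 0.9, 0.2], [0.0, 0.8, 0.3]],
}
unknown = [[0.2, 0.7, 0.3], [0.1, 1.0, 0.1]]
print(identify(reference, unknown))
```

Averaging over many games before comparing is what makes a few-shot setup of this kind tolerant of individual atypical games; whether the paper uses exactly this rule is not stated in the abstract.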

preprint, 2022, arXiv

Learning Models of Individual Behavior in Chess

AI systems that can capture human-like behavior are becoming increasingly useful in situations where humans may want to learn from these systems, collaborate with them, or engage with them as partners for an extended duration. In order to develop human-oriented AI systems, the problem of predicting human actions -- as opposed to predicting optimal actions -- has received considerable attention. Existing work has focused on capturing human behavior in an aggregate sense, which potentially limits the benefit any particular individual could gain from interaction with these systems. We extend this line of work by developing highly accurate predictive models of individual human behavior in chess. Chess is a rich domain for exploring human-AI interaction because it combines a unique set of properties: AI systems achieved superhuman performance many years ago, and yet humans still interact with them closely, both as opponents and as preparation tools, and there is an enormous corpus of recorded data on individual player games. Starting with Maia, an open-source version of AlphaZero trained on a population of human players, we demonstrate that we can significantly improve prediction accuracy of a particular player's moves by applying a series of fine-tuning methods. Furthermore, our personalized models can be used to perform stylometry -- predicting who made a given set of moves -- indicating that they capture human decision-making at an individual level. Our work demonstrates a way to bring AI systems into better alignment with the behavior of individual people, which could lead to large improvements in human-AI interaction.

preprint, 2022, arXiv

Measuring the Effect of Training Data on Deep Learning Predictions via Randomized Experiments

We develop a new, principled algorithm for estimating the contribution of training data points to the behavior of a deep learning model, such as a specific prediction it makes. Our algorithm estimates the AME, a quantity that measures the expected (average) marginal effect of adding a data point to a subset of the training data, sampled from a given distribution. When subsets are sampled from the uniform distribution, the AME reduces to the well-known Shapley value. Our approach is inspired by causal inference and randomized experiments: we sample different subsets of the training data to train multiple submodels, and evaluate each submodel's behavior. We then use a LASSO regression to jointly estimate the AME of each data point, based on the subset compositions. Under sparsity assumptions ($k \ll N$ datapoints have large AME), our estimator requires only $O(k\log N)$ randomized submodel trainings, improving upon the best prior Shapley value estimators.
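The reduction to the Shapley value can be checked directly. The sketch below assumes a particular sampling scheme: an inclusion probability $p$ is drawn uniformly from $(0,1)$, and each of the other $N-1$ points is then included independently with probability $p$. That scheme and the utility notation $U$ are assumptions made here for illustration.

```latex
\mathrm{AME}_i
  = \int_0^1 \sum_{S \subseteq [N]\setminus\{i\}}
      p^{|S|}\,(1-p)^{N-1-|S|}\,
      \bigl[\,U(S\cup\{i\}) - U(S)\,\bigr]\,dp .

% The integral over p is a Beta function:
\int_0^1 p^{s}\,(1-p)^{N-1-s}\,dp = \frac{s!\,(N-1-s)!}{N!} .

% Substituting s = |S| gives the Shapley weights:
\mathrm{AME}_i
  = \sum_{S \subseteq [N]\setminus\{i\}}
      \frac{|S|!\,(N-1-|S|)!}{N!}\,
      \bigl[\,U(S\cup\{i\}) - U(S)\,\bigr] .
```

The last line is the standard Shapley value of point $i$ for the utility $U$.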

preprint, 2022, arXiv

Mimetic Models: Ethical Implications of AI that Acts Like You

An emerging theme in artificial intelligence research is the creation of models to simulate the decisions and behavior of specific people, in domains including game-playing, text generation, and artistic expression. These models go beyond earlier approaches in the way they are tailored to individuals, and the way they are designed for interaction rather than simply the reproduction of fixed, pre-computed behaviors. We refer to these as mimetic models, and in this paper we develop a framework for characterizing the ethical and social issues raised by their growing availability. Our framework includes a number of distinct scenarios for the use of such models, and considers the impacts on a range of different participants, including the target being modeled, the operator who deploys the model, and the entities that interact with it.

preprint, 2020, arXiv

A New Abstraction for Internet QoE Optimization

A perennial quest in networking research is how to achieve higher quality of experience (QoE) for users without incurring more resources. This work revisits an important yet often overlooked piece of the puzzle: what should the QoE abstraction be? A QoE abstraction is a representation of application quality that describes how decisions affect QoE. The conventional wisdom has relied on developing hand-crafted quality metrics (e.g., video rebuffering events, web page loading time) that are specialized to each application, content, and setting. We argue that in many cases, it may be fundamentally hard to capture a user's perception of quality using a list of hand-crafted metrics, and that expanding the metric list may lead to unnecessary complexity in the QoE model without a commensurate gain. Instead, we advocate for a new approach based on a new QoE abstraction called visual rendering. Rather than a list of metrics, we model the process of quality perception as a user watching a continuous "video" (visual rendering) of all the pixels on their screen. The key advantage of visual rendering is that it captures the full experience of a user with the same abstraction for all applications. This new abstraction opens new opportunities (e.g., the possibility of end-to-end deep learning models that infer QoE directly from a visual rendering) but it also gives rise to new research challenges (e.g., how to emulate the effect on visual rendering of an application decision). This paper makes the case for visual rendering as a unifying abstraction for Internet QoE and outlines a new research agenda to unleash its opportunities.

preprint, 2020, arXiv

Aligning Superhuman AI with Human Behavior: Chess as a Model System

As artificial intelligence becomes increasingly intelligent, in some cases achieving superhuman performance, there is growing potential for humans to learn from and collaborate with algorithms. However, the ways in which AI systems approach problems are often different from the ways people do, and thus may be uninterpretable and hard to learn from. A crucial step in bridging this gap between human and artificial intelligence is modeling the granular actions that constitute human behavior, rather than simply matching aggregate human performance. We pursue this goal in a model system with a long history in artificial intelligence: chess. The aggregate performance of a chess player unfolds as they make decisions over the course of a game. The hundreds of millions of games played online by players at every skill level form a rich source of data in which these decisions, and their exact context, are recorded in minute detail. Applying existing chess engines to this data, including an open-source implementation of AlphaZero, we find that they do not predict human moves well. We develop and introduce Maia, a customized version of AlphaZero trained on human chess games, that predicts human moves at a much higher accuracy than existing engines, and can achieve maximum accuracy when predicting decisions made by players at a specific skill level in a tuneable way. For a dual task of predicting whether a human will make a large mistake on the next move, we develop a deep neural network that significantly outperforms competitive baselines. Taken together, our results suggest that there is substantial promise in designing artificial intelligence systems with human collaboration in mind by first accurately modeling granular human decision-making.

preprint, 2020, arXiv

SENSEI: Aligning Video Streaming Quality with Dynamic User Sensitivity

This paper aims to improve video streaming by leveraging a simple observation: users are more sensitive to low quality in certain parts of a video than in others. For instance, rebuffering during key moments of a sports video (e.g., before a goal is scored) is more annoying than rebuffering during normal gameplay. Such dynamic quality sensitivity, however, is rarely captured by current approaches, which predict QoE (quality-of-experience) using one-size-fits-all heuristics that are too simplistic to understand the nuances of video content. Instead of proposing yet another heuristic, we take a different approach: we run a separate crowdsourcing experiment for each video to derive users' quality sensitivity at different parts of the video. Of course, the cost of doing this at scale can be prohibitive, but we show that careful experiment design combined with a suite of pruning techniques can make the cost negligible compared to how much content providers invest in content generation and distribution. Our ability to accurately profile time-varying user sensitivity inspires a new approach: dynamically aligning higher (lower) quality with higher (lower) sensitivity periods. We present a new video streaming system called SENSEI that incorporates dynamic quality sensitivity into existing quality adaptation algorithms. We apply SENSEI to two state-of-the-art adaptation algorithms. SENSEI can take seemingly unusual actions: e.g., lowering bitrate (or initiating a rebuffering event) even when bandwidth is sufficient so that it can maintain a higher bitrate without rebuffering when quality sensitivity becomes higher in the near future. Compared to state-of-the-art approaches, SENSEI improves QoE by 15.1% or achieves the same QoE with 26.8% less bandwidth on average.
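The idea of aligning quality with sensitivity can be illustrated with a toy calculation. The sketch assumes QoE is a sensitivity-weighted average of per-chunk quality scores; that linear form, the function name, and all numbers are hypothetical and are not taken from the SENSEI system.

```python
def weighted_qoe(quality, sensitivity):
    """Sensitivity-weighted average of per-chunk quality scores."""
    total = sum(sensitivity)
    return sum(q * w for q, w in zip(quality, sensitivity)) / total

# Hypothetical per-chunk sensitivity weights: the third chunk is a key moment.
sensitivity = [1.0, 1.0, 3.0, 1.0]

# Two schedules that spend the same total quality budget (both sum to 10).
uniform_plan = [2.5, 2.5, 2.5, 2.5]
aligned_plan = [2.0, 2.0, 4.0, 2.0]

print(weighted_qoe(uniform_plan, sensitivity))
print(weighted_qoe(aligned_plan, sensitivity))
```

Under this toy model the aligned plan scores higher with the same budget, which is the intuition behind lowering quality in low-sensitivity periods to save bandwidth for high-sensitivity ones.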

preprint, 2015, arXiv

Coulomb screening in graphene with topological defects

We analyze the screening of an external Coulomb charge in a gapless graphene cone, which is taken as a prototype of a topological defect. In the subcritical regime, the induced charge is calculated using both the Green's function and the Friedel sum rule. The dependence of the polarization charge on the Coulomb strength obtained from the Green's function clearly shows the effect of the conical defect and indicates that the critical charge itself depends on the sample topology. A similar analysis using the Friedel sum rule indicates that the two results agree for low values of the Coulomb charge but differ at higher strengths, especially in the presence of the conical defect. For a given subcritical charge, the transport cross-section has a higher value in the presence of the conical defect. In the supercritical regime we show that the coefficient of the power-law tail of the polarization charge density can be expressed as a summation of functions which vary log-periodically with the distance from the Coulomb impurity. The period of variation depends on the conical defect. In the presence of the conical defect, the Fano resonances begin to appear in the transport cross-section for a lower value of the Coulomb charge. For both the sub- and supercritical regimes we derive the dependence of the LDOS on the conical defect. The effects of generalized boundary conditions on the physical observables are also discussed.

preprint, 2014, arXiv

A Back-to-Basics Empirical Study of Priority Queues

The theory community has proposed several new heap variants in the recent past which have remained largely untested experimentally. We take the field back to the drawing board, with straightforward implementations of both classic and novel structures using only standard, well-known optimizations. We study the behavior of each structure on a variety of inputs, including artificial workloads, workloads generated by running algorithms on real map data, and workloads from a discrete event simulator used in recent systems networking research. We provide observations about which characteristics are most strongly correlated with performance. For example, we find that the L1 cache miss rate appears to be strongly correlated with wall-clock time. We also provide observations about how the input sequence affects the relative performance of the different heap variants. For example, we show (both theoretically and in practice) that certain random insertion-deletion sequences are degenerate and can lead to misleading results. Overall, our findings suggest that while the conventional wisdom holds in some cases, it is sorely mistaken in others.
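As a point of reference for what a "straightforward implementation" of a classic structure might look like, here is a minimal array-based binary min-heap with a random insert/delete-min workload driver. The class name, the 60/40 operation mix, and the key range are assumptions chosen for illustration; none of this is code from the study.

```python
import random

class BinaryHeap:
    """Array-based binary min-heap using only the implicit parent/child layout."""

    def __init__(self):
        self.items = []

    def __len__(self):
        return len(self.items)

    def push(self, key):
        items = self.items
        items.append(key)
        i = len(items) - 1
        # Sift up: swap with the parent while the heap order is violated.
        while i > 0:
            parent = (i - 1) // 2
            if items[parent] <= items[i]:
                break
            items[parent], items[i] = items[i], items[parent]
            i = parent

    def pop_min(self):
        items = self.items
        top = items[0]          # raises IndexError on an empty heap
        last = items.pop()
        if items:
            items[0] = last
            i, n = 0, len(items)
            # Sift down: swap with the smaller child while the heap order is violated.
            while True:
                child = 2 * i + 1
                if child >= n:
                    break
                if child + 1 < n and items[child + 1] < items[child]:
                    child += 1
                if items[i] <= items[child]:
                    break
                items[i], items[child] = items[child], items[i]
                i = child
        return top

def run_workload(ops, seed=0):
    """Apply a random mix of inserts and delete-mins; return the popped keys."""
    rng = random.Random(seed)
    heap, popped = BinaryHeap(), []
    for _ in range(ops):
        if len(heap) == 0 or rng.random() < 0.6:
            heap.push(rng.randrange(1000))
        else:
            popped.append(heap.pop_min())
    return popped

print(len(run_workload(10_000)))
```

A driver like this is exactly the kind of artificial workload whose choice of operation mix can skew comparisons, which is why the abstract's warning about degenerate random sequences matters.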

preprint, 2014, arXiv

Friedel sum rule in the presence of topological defects for graphene

The Friedel sum rule is extended to deal with topological defects for the case of a graphene cone in the presence of an external Coulomb charge. We study how the number of states changes due to both the topological defect and the Coulomb charge. Our analysis addresses both subcritical and supercritical values of the Coulomb charge. We also discuss the experimental implications of introducing a self-adjoint extension of the system Hamiltonian. We argue that the boundary conditions following from the self-adjoint extension encode the effect of short-range interactions present in the system.

preprint, 2014, arXiv

On the stability and magnetic properties of surface nanobubbles in water

A model for gas nanobubbles is proposed in which their remarkable stability is attributed to a qualitatively different form of water covering the nanobubble surface layer, which reduces the diffusion coefficient by a factor of 10^9. It is shown that this new form of water is created by the interaction between the electrons of water molecules and the zero-point vacuum electromagnetic field. The model gives an estimate for the lifetime of surface nanobubbles, explains why they are not influenced by surfactants, and predicts that they should exhibit nonlinear paramagnetism.

preprint, 2013, arXiv

An Effective Theory of Superfluid Turbulence from Local Scale Invariance

We construct a three-dimensional locally scale invariant version of the free Schroedinger field theory. It is shown to provide an effective theory of superfluid turbulence in Helium IV.

preprint, 2013, arXiv

Scale invariance and superfluid turbulence

We construct a Schroedinger field theory invariant under local spatial scaling. It is shown to provide an effective theory of superfluid turbulence by deriving, analytically, the observed Kolmogorov 5/3 law and to lead to a Biot-Savart interaction between the observed filament excitations of the system as well.
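For orientation, the Kolmogorov 5/3 law referred to here is the scaling of the energy spectrum $E(k)$ in the inertial range. The classical dimensional argument for it is sketched below; this is the textbook reasoning, not the field-theoretic derivation of the paper, and the symbols $\varepsilon$ (energy dissipation rate per unit mass) and $C$ (a dimensionless constant) are notation introduced here.

```latex
% Dimensions: [E(k)] = L^3 T^{-2}, \quad [\varepsilon] = L^2 T^{-3}, \quad [k] = L^{-1}.
% Ansatz: E(k) = C\,\varepsilon^{a} k^{b}.
% Matching powers of T:  -2 = -3a        \;\Rightarrow\; a = 2/3.
% Matching powers of L:   3 = 2a - b     \;\Rightarrow\; b = -5/3.
E(k) = C\,\varepsilon^{2/3}\,k^{-5/3}
```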

preprint, 2012, arXiv

An Algebraic Geometry Method for Calculating DOS for 2D tight binding models

An algebraic geometry method is used to calculate the moments of the electron density of states as a function of the energy for lattices in the tight binding approximation. Interpreting the moments as the Mellin transform of the density allows writing down a formula for the density as an inverse Mellin transform. The method is illustrated by working out the density function for the two-dimensional square and honeycomb lattices.
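To make the moments concrete: for a nearest-neighbour tight-binding model with the hopping amplitude set to 1 (an assumption made here), the $n$-th moment of the density of states per site equals, up to sign, the number of closed $n$-step walks from a site back to itself. The sketch below counts such walks on the square lattice by brute-force dynamic programming and compares the result with the known closed form $\binom{2k}{k}^2$. It illustrates the quantity being computed, not the algebraic geometry method of the paper.

```python
from collections import defaultdict
from math import comb

# Nearest-neighbour steps on the square lattice.
STEPS = [(1, 0), (-1, 0), (0, 1), (0, -1)]

def closed_walks(n):
    """Number of n-step nearest-neighbour walks that start and end at the origin."""
    counts = {(0, 0): 1}
    for _ in range(n):
        nxt = defaultdict(int)
        for (x, y), c in counts.items():
            for dx, dy in STEPS:
                nxt[(x + dx, y + dy)] += c
        counts = nxt
    return counts.get((0, 0), 0)

# Even moments of the square-lattice density of states versus the closed form.
for k in range(1, 5):
    print(2 * k, closed_walks(2 * k), comb(2 * k, k) ** 2)
```

Odd moments vanish because the square lattice is bipartite, so no walk of odd length can return to its starting site.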

preprint, 2012, arXiv

Effect of topological defects and Coulomb charge on the low energy quantum dynamics of gapped graphene

We study the combined effect of a conical topological defect and a Coulomb charge impurity on the dynamics of Dirac fermions in gapped graphene. Beyond a certain strength of the Coulomb charge, quantum instability sets in, which demarcates the boundary between sub- and supercritical values of the charge. In the subcritical regime, for certain values of the system parameters, the allowed boundary conditions in a gapped graphene cone can be classified in terms of a single real parameter. We show that observables such as the local density of states, scattering phase shifts and the bound state spectra are sensitive to the value of this real parameter, which is interesting from an empirical point of view. For a supercritical Coulomb charge, we analyze the system with a regularized potential as well as with a zigzag boundary condition and find the effect of the sample topology on the observable features of the system.

preprint, 2011, arXiv

Incremental Cycle Detection, Topological Ordering, and Strong Component Maintenance

We present two on-line algorithms for maintaining a topological order of a directed $n$-vertex acyclic graph as arcs are added, and detecting a cycle when one is created. Our first algorithm handles $m$ arc additions in $O(m^{3/2})$ time. For sparse graphs ($m/n = O(1)$), this bound improves the best previous bound by a logarithmic factor, and is tight to within a constant factor among algorithms satisfying a natural locality property. Our second algorithm handles an arbitrary sequence of arc additions in $O(n^{5/2})$ time. For sufficiently dense graphs, this bound improves the best previous bound by a polynomial factor. Our bound may be far from tight: we show that the algorithm can take $\Omega(n^2 2^{\sqrt{2\lg n}})$ time by relating its performance to a generalization of the $k$-levels problem of combinatorial geometry. A completely different algorithm running in $\Theta(n^2 \log n)$ time was given recently by Bender, Fineman, and Gilbert. We extend both of our algorithms to the maintenance of strong components, without affecting the asymptotic time bounds.
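The problem itself can be illustrated with a much simpler baseline than either algorithm in the abstract: when a new arc violates the current order, search only the affected window of positions, report a cycle if the search closes one, and otherwise permute the visited vertices. This bounded-search-and-reorder scheme is similar in spirit to the Pearce-Kelly approach and does not achieve the bounds quoted above; the class and method names are hypothetical.

```python
class IncrementalTopoOrder:
    """Maintains a topological order of vertices 0..n-1 under arc insertions."""

    def __init__(self, n):
        self.succ = [set() for _ in range(n)]
        self.pred = [set() for _ in range(n)]
        self.pos = list(range(n))  # pos[v] = position of v in the current order

    def _reach(self, start, adj, in_window):
        """Vertices reachable from start via adj without leaving the position window."""
        seen, stack = {start}, [start]
        while stack:
            x = stack.pop()
            for y in adj[x]:
                if y not in seen and in_window(y):
                    seen.add(y)
                    stack.append(y)
        return seen

    def add_arc(self, u, v):
        """Insert arc u -> v. Returns False, leaving the graph unchanged,
        if the arc would create a cycle."""
        if u == v:
            return False
        lo, hi = self.pos[v], self.pos[u]
        if lo < hi:  # v currently precedes u, so the order must be repaired
            fwd = self._reach(v, self.succ, lambda w: self.pos[w] <= hi)
            if u in fwd:
                return False  # v already reaches u: the new arc closes a cycle
            bwd = self._reach(u, self.pred, lambda w: self.pos[w] >= lo)
            # Put everything that reaches u before everything reachable from v,
            # reusing the same positions and preserving relative order in each group.
            key = self.pos.__getitem__
            moved = sorted(bwd, key=key) + sorted(fwd, key=key)
            slots = sorted(self.pos[w] for w in moved)
            for w, p in zip(moved, slots):
                self.pos[w] = p
        self.succ[u].add(v)
        self.pred[v].add(u)
        return True

    def order(self):
        """Vertices listed in the current topological order."""
        return sorted(range(len(self.pos)), key=self.pos.__getitem__)

g = IncrementalTopoOrder(4)
print(g.add_arc(2, 3), g.add_arc(3, 0), g.order())
print(g.add_arc(0, 2))  # would close the cycle 2 -> 3 -> 0 -> 2
```

Restricting both searches to positions between the two endpoints is what keeps the repair local; the abstract's locality property concerns a related restriction on which vertices an algorithm may reorder.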

preprint, 2010, arXiv

Effect of topology on the critical charge in graphene

We show that the critical charge for the Dirac excitations in gapless graphene depends on the spatial topology of the sample. In particular, for graphene cones, the effective value of the critical charge can tend towards zero for a suitable angle of the conical sample. We discuss the nature of the scattering phase shifts, quasi-bound state energies and local density of states for a gapless graphene cone and determine the dependence of these physical quantities on the sample topology.

preprint, 2010, arXiv

Geometric Finiteness, Holography and Quasinormal Modes for the Warped AdS_3 Black Hole

We show that there exists a precise kinematical notion of holography for the Euclidean warped $AdS_3$ black hole. This follows from the fact that the Euclidean warped $AdS_3$ black hole spacetime is a geometrically finite hyperbolic manifold. For such manifolds a theorem of Sullivan provides a one-to-one correspondence between the hyperbolic structure in the bulk and the conformal structure of its boundary. Using this theorem we obtain the holographic quasinormal modes for the warped $AdS_3$ black hole.

preprint, 2010, arXiv

The Central Charge of the Warped AdS^3 Black Hole

The AdS/CFT conjecture offers the possibility of a quantum description for a black hole in terms of a CFT. This has led to the study of general AdS^3 type black holes with a view to constructing an explicit toy quantum black hole model. Such a CFT description would be characterized by its central charge and the dimensions of its primary fields. Recently the expressions for the central charges (C_L, C_R) of the CFT dual to the warped AdS^3 have been determined using asymptotic symmetry arguments. The central charges depend, as expected, on the warping factor. We show that topological arguments, used by Witten to constrain central charges for the BTZ black hole, can be generalized to deal with the warped AdS^3 case. Topology constrains the warping factor to take rational values, while quasinormal modes are conjectured to give the dimensions of primary fields. We find that in the limit when warping is large, or when it takes special rational values, the system tends to Witten's conjectured unique CFTs with central charges that are multiples of 24.

preprint, 2010, arXiv

Turbulent flow in graphene

We demonstrate the possibility of a turbulent flow of electrons in graphene in the hydrodynamic region, by calculating the corresponding turbulent probability density function. This is used to calculate the contribution of the turbulent flow to the conductivity within a quantum Boltzmann approach. The dependence of the conductivity on the system parameters arising from the turbulent flow is very different from that due to scattering.

preprint, 2009, arXiv

Effects of Quantum Vacuum Fluctuations of the Electric Field on DNA Condensation

By assuming that not only counter-ions but DNA molecules as well are thermally distributed according to a Boltzmann law, we propose a modified Poisson-Boltzmann equation at the classical level as a starting point to compute the effects of quantum fluctuations of the electric field on the interaction among DNA-cation complexes. The latter are modeled here as infinite one-dimensional wires ($\delta$-functions). Our goal is to single out such quantum-vacuum-driven interaction from the counterion-induced and water-related interactions. We obtain a universal, frustration-free Casimir-like (codimension 2) interaction that extensive numerical analysis shows to be a good candidate to explain the formation and stability of DNA aggregates. Such Casimir energy is computed for a variety of configurations of up to 19 DNA strands in a hexagonal array. It is found to be strongly many-body.

preprint, 2009, arXiv

Scattering in graphene with impurities: A low energy effective theory

We analyze the scattering sector of the Hamiltonians for both gapless and gapped graphene in the presence of a charge impurity using the 2D Dirac equation, which is applicable in the long wavelength limit. We show that for a certain range of the system parameters, the combined effect of the short-range interactions due to the charge impurity can be modelled using a single real parameter appearing in the boundary conditions. The phase shifts and the scattering matrix depend explicitly on this parameter. We argue that this parameter for graphene can be fixed empirically, through measurements of observables that depend on the scattering data.

preprint, 2004, arXiv

Quantum Weak Turbulence

The study of the phenomenon of quantum weak turbulence is extended by determining the quasiparticle spectrum associated with such a system using a Green's function approach. The quasiparticle spectrum calculated establishes the dissipative regime and the inertial regime, hence a Kolmogorov type of picture.

preprint, 2003, arXiv

Black Hole Decay as Geodesic Motion

We show that a formalism for analyzing the near-horizon conformal symmetry of Schwarzschild black holes using a scalar field probe is capable of describing black hole decay. The equation governing black hole decay can be identified as the geodesic equation in the space of black hole masses. This provides a novel geometric interpretation for the decay of black holes. Moreover, this approach predicts a precise correction term to the usual expression for the decay rate of black holes.

preprint, 1999, arXiv

Equivalence of the Self-Dual Model and Maxwell-Chern-Simons Theory on Arbitrary Manifolds

Using a group-invariant version of the Faddeev-Popov method we explicitly obtain the partition functions of the Self-Dual Model and Maxwell-Chern-Simons theory. We show that their ratio coincides with the partition function of abelian Chern-Simons theory to within a phase factor depending on the geometrical properties of the manifold.

preprint, 1993, arXiv

Symplectic Manifolds, Coherent States and Semiclassical Approximation

We describe the symplectic structure and Hamiltonian dynamics for a class of Grassmannian manifolds. Using the two-dimensional sphere ($S^2$) and disc ($D^2$) as illustrative cases, we write their path integral representations using coherent state techniques. These path integrals can be evaluated exactly by semiclassical methods, thus providing examples of localisation formulae. Along the way, we also give a local coordinate description for a class of Grassmannians.
