Source author record

Vedant Misra

Vedant Misra appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Machine Learning, Artificial Intelligence, Computation and Language, gr-qc, physics.soc-ph, q-fin.GN, q-fin.ST, q-fin.TR

Catalog footprint

What is connected

4 works

8 topics

4 close collaborators


Preprint, 2022, arXiv

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

In this paper we propose to study generalization of neural networks on small algorithmically generated datasets. In this setting, questions about data efficiency, memorization, generalization, and speed of learning can be studied in great detail. In some situations we show that neural networks learn through a process of "grokking" a pattern in the data, improving generalization performance from random chance level to perfect generalization, and that this improvement in generalization can happen well past the point of overfitting. We also study generalization as a function of dataset size and find that smaller datasets require increasing amounts of optimization for generalization. We argue that these datasets provide a fertile ground for studying a poorly understood aspect of deep learning: generalization of overparametrized neural networks beyond memorization of the finite training dataset.
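To make "small algorithmically generated datasets" concrete, here is a minimal sketch of one such dataset. It assumes a modular addition task and a random train/validation split. The modulus, the split fraction, and the function names are illustrative choices, not details taken from the abstract above.

```python
import random
from typing import List, Tuple

# One example is (a, b, label), where label = (a + b) mod modulus.
Example = Tuple[int, int, int]


def make_modular_addition_dataset(modulus: int) -> List[Example]:
    """Enumerate every pair (a, b) with its label (a + b) mod modulus."""
    return [
        (a, b, (a + b) % modulus)
        for a in range(modulus)
        for b in range(modulus)
    ]


def split_dataset(
    examples: List[Example], train_fraction: float, seed: int = 0
) -> Tuple[List[Example], List[Example]]:
    """Shuffle deterministically, then split into train and validation sets."""
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical settings: a tiny modulus so the whole table has 49 entries.
examples = make_modular_addition_dataset(7)
train, val = split_dataset(examples, train_fraction=0.5)
print(len(examples), len(train), len(val))
```

Because the full table is finite and enumerable, the training fraction directly controls dataset size, which is the quantity the abstract relates to the amount of optimization needed for generalization.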

Preprint, 2022, arXiv

Solving Quantitative Reasoning Problems with Language Models

Language models have achieved remarkable performance on a wide range of tasks that require natural language understanding. Nevertheless, state-of-the-art models have generally struggled with tasks that require quantitative reasoning, such as solving mathematics, science, and engineering problems at the college level. To help close this gap, we introduce Minerva, a large language model pretrained on general natural language data and further trained on technical content. The model achieves state-of-the-art performance on technical benchmarks without the use of external tools. We also evaluate our model on over two hundred undergraduate-level problems in physics, biology, chemistry, economics, and other sciences that require quantitative reasoning, and find that the model can correctly answer nearly a third of them.

Preprint, 2012, arXiv

Evidence of market manipulation in the financial crisis

We provide direct evidence of market manipulation at the beginning of the financial crisis in November 2007. The type of manipulation, a "bear raid," would have been prevented by a regulation that was repealed by the Securities and Exchange Commission in July 2007. The regulation, the uptick rule, was designed to prevent manipulation and promote stability and was in force from 1938 as a key part of the government response to the 1929 market crash and its aftermath. On November 1, 2007, Citigroup experienced an unusual increase in trading volume and decrease in price. Our analysis of financial industry data shows that this decline coincided with an anomalous increase in borrowed shares, the selling of which would be a large fraction of the total trading volume. The selling of borrowed shares cannot be explained by news events as there is no corresponding increase in selling by share owners. A similar number of shares were returned on a single day six days later. The magnitude and coincidence of borrowing and returning of shares is evidence of a concerted effort to drive down Citigroup's stock price and achieve a profit, i.e., a bear raid. Interpretations and analyses of financial markets should consider the possibility that the intentional actions of individual actors or coordinated groups can impact market behavior. Markets are not sufficiently transparent to reveal even major market manipulation events. Our results point to the need for regulations that prevent intentional actions that cause markets to deviate from equilibrium and contribute to crashes. Enforcement actions cannot reverse severe damage to the economic system. The current "alternative" uptick rule which is only in effect for stocks dropping by over 10% in a single day is insufficient. Prevention may be achieved through improved availability of market data and the original uptick rule or other transaction limitations.

Preprint, 2010, arXiv

Rational Orbits around Charged Black Holes

We show that all eccentric timelike orbits in Reissner-Nordström spacetime can be classified using a taxonomy that draws upon an isomorphism between periodic orbits and the set of rational numbers. By virtue of the fact that the rationals are dense, the taxonomy can be used to approximate aperiodic orbits with periodic orbits. This may help reduce computational overhead for calculations in gravitational wave astronomy. Our dynamical systems approach enables us to study orbits for both charged and uncharged particles in spite of the fact that charged particle orbits around a charged black hole do not admit a simple one-dimensional effective potential description. Finally, we show that comparing periodic orbits in the RN and Schwarzschild geometries enables us to distinguish charged and uncharged spacetimes by looking only at the orbital dynamics.
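The density argument in the abstract above can be illustrated with a small sketch: any real number can be approximated arbitrarily well by rationals, with the error shrinking as larger denominators are allowed. This shows only that number-theoretic fact, not the paper's orbit taxonomy. The target value and the function name are hypothetical, chosen for illustration.

```python
import math
from fractions import Fraction
from typing import List


def rational_approximations(value: float, max_denominators: List[int]) -> List[Fraction]:
    """Return the closest rational to value for each denominator bound."""
    return [Fraction(value).limit_denominator(d) for d in max_denominators]


# Hypothetical irrational target standing in for an aperiodic orbit's label.
target = math.sqrt(2) - 1
bounds = [5, 50, 500]
approx = rational_approximations(target, bounds)
errors = [abs(float(q) - target) for q in approx]

for bound, q, err in zip(bounds, approx, errors):
    print(f"denominator <= {bound}: {q} (error {err:.2e})")
```

Raising the denominator bound never increases the error, because each larger bound admits every candidate the smaller bound did.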