Source author record

Michael Hahn

Michael Hahn appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.SR Computation and Language physics.atom-ph physics.space-ph Artificial Intelligence Machine Learning physics.plasm-ph astro-ph.IM Cryptography and Security

Catalog footprint

What is connected

16works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Barriers to Universal Reasoning With Transformers (And How to Overcome Them)

Chain-of-Thought (CoT) has been shown to empirically improve Transformers' performance, and theoretically increase their expressivity to Turing completeness. However, whether Transformers can learn to generalize to CoT traces longer than those seen during training is understudied. We use recent theoretical frameworks for Transformer length generalization and find that -- under standard positional encodings and a finite alphabet -- Transformers with CoT cannot solve problems beyond $TC^0$, i.e. the expressivity benefits do not hold under the stricter requirement of length-generalizable learnability. However, if we allow the vocabulary to grow with problem size, we attain a length-generalizable simulation of Turing machines where the CoT trace length is linear in the simulated runtime up to a constant. Our construction overcomes two core obstacles to reliable length generalization: repeated copying and last-occurrence retrieval. We assign each tape position a unique signpost token, and log only value changes to enable recovery of the current tape symbol through counts circumventing both barriers. Further, we empirically show that the use of such signpost tokens and value change encodings provide actionable guidance to improve length generalization on hard problems.

preprint2026arXiv

How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning

In-context learning (ICL) excels at new tasks from minimal examples, yet we still lack a mechanistic explanation of how few-shot prompts shape a model's function vector (FV)--a causal activation direction that drives task behavior on the ICL query. Across tasks and models, an $n$-shot FV is well-approximated by a linear combination of example-level sub-FVs, suggesting additive and composable contributions from individual demonstrations. Beyond additivity, we show that models contextualize individual examples' representations based on prior examples to adaptively reweight which demonstrations dominate the FV: attention shifts toward examples that are more informative and less ambiguous under the context. Finally, a causal decomposition separates Query-Key routing from Value updates, finding that contextualization's most consistent contributions to FV quality arise from Query-Key alignment--particularly in ambiguous settings--while Value-mediated effects are more heterogeneous. Together, these results unify additive superposition with context-dependent attention reweighting into a mechanistic, testable account of how few-shot prompts implement tasks.

preprint2026arXiv

SafeReview: Defending LLM-based Review Systems Against Adversarial Hidden Prompts

As Large Language Models (LLMs) are increasingly integrated into academic peer review, their vulnerability to adversarial prompts -- adversarial instructions embedded in submissions to manipulate outcomes -- emerges as a critical threat to scholarly integrity. To counter this, we propose a novel adversarial framework where a Generator model, trained to create sophisticated attack prompts, is jointly optimized with a Defender model tasked with their detection. This system is trained using a loss function inspired by Information Retrieval Generative Adversarial Networks, which fosters a dynamic co-evolution between the two models, forcing the Defender to develop robust capabilities against continuously improving attack strategies. The resulting framework demonstrates significantly enhanced resilience to novel and evolving threats compared to static defenses, thereby establishing a critical foundation for securing the integrity of peer review.

preprint2026arXiv

Tug-of-war between idioms' figurative and literal interpretations in LLMs

Idioms present a unique challenge for language models due to their non-compositional figurative interpretations, which often strongly diverge from the idiom's literal interpretation. In this paper, we employ causal tracing to systematically analyze how pretrained causal transformers deal with this ambiguity. We localize three mechanisms: (i) Early sublayers and specific attention heads retrieve an idiom's figurative interpretation, while suppressing its literal interpretation. (ii) When disambiguating context precedes the idiom, the model leverages it from the earliest layer and later layers refine the interpretation if the context conflicts with the retrieved interpretation. (iii) Then, selective, competing pathways carry both interpretations: an intermediate pathway prioritizes the figurative interpretation and a parallel direct route favors the literal interpretation, ensuring that both readings remain available. Our findings provide mechanistic evidence for idiom comprehension in autoregressive transformers.

preprint2022arXiv

Crosslinguistic word order variation reflects evolutionary pressures of dependency and information locality

Languages vary considerably in syntactic structure. About 40% of the world's languages have subject-verb-object order, and about 40% have subject-object-verb order. Extensive work has sought to explain this word order variation across languages. However, the existing approaches are not able to explain coherently the frequency distribution and evolution of word order in individual languages. We propose that variation in word order reflects different ways of balancing competing pressures of dependency locality and information locality, whereby languages favor placing elements together when they are syntactically related or contextually informative about each other. Using data from 80 languages in 17 language families and phylogenetic modeling, we demonstrate that languages evolve to balance these pressures, such that word order change is accompanied by change in the frequency distribution of the syntactic structures which speakers communicate to maintain overall efficiency. Variability in word order thus reflects different ways in which languages resolve these evolutionary pressures. We identify relevant characteristics that result from this joint optimization, particularly the frequency with which subjects and objects are expressed together for the same verb. Our findings suggest that syntactic structure and usage across languages co-adapt to support efficient communication under limited cognitive resources.

preprint2022arXiv

Evidence for Parameteric Decay Instability in the Lower Solar Atmosphere

We find evidence for the first observation of the parametric decay instability (PDI) in the lower solar atmosphere. Specifically, we find that the power spectrum of density fluctuations near the solar transition region resembles the power spectrum of the velocity fluctuations, but with the frequency axis scaled up by about a factor of two. These results are from an analysis of the Si IV lines observed by the Interface Region Imaging Spectrometer (IRIS) in the transition region of a polar coronal hole. We also find that the density fluctuations have radial velocity of about 75 km/s and that the velocity fluctuations are much faster with an estimated speed of 250 km/s, as is expected for sound waves and Alfvén waves, respectively, in the transition region. Theoretical calculations show that this frequency relationship is consistent with those expected from PDI for the plasma conditions of the observed region. These measurements suggest an interaction between sound waves and Alfvén waves in the transition region that is evidence for the parametric decay instability.

preprint2019arXiv

Laboratory Calibrations of Fe XII-XIV Line-Intensity Ratios for Electron Density Diagnostics

We have used an electron beam ion trap to measure electron-density-diagnostic line-intensity ratios for extreme ultraviolet lines from F XII, XIII, and XIV at wavelengths of 185-205 255-276 Angstroms. These ratios can be used as density diagnostics for astrophysical spectra and are especially relevant to solar physics. We found that density diagnostics using the Fe XIII 196.53/202.04 and the Fe XIV 264.79/274.21 and 270.52A/274.21 line ratios are reliable using the atomic data calculated with the Flexible Atomic Code. On the other hand, we found a large discrepancy between the FAC theory and experiment for the commonly used Fe XII (186.85 + 186.88)/195.12 line ratio. These FAC theory calculations give similar results to the data tabulated in CHIANTI, which are commonly used to analyze solar observations. Our results suggest that the discrepancies seen between solar coronal density measurements using the Fe XII (186.85 + 186.88)/195.12 and Fe XIII 196.54/202.04 line ratios are likely due to issues with the atomic calculations for Fe XII.

preprint2019arXiv

Measured reduction in Alfvén wave energy propagating through longitudinal gradients scaled to match solar coronal holes

We have explored the effectiveness of a longitudinal gradient in Alfvén speed in reducing the energy of propagating Alfvén waves under conditions scaled to match solar coronal holes. The experiments were conducted in the Large Plasma Device at the University of California, Los Angeles. Our results show that the energy of the transmitted Alfvén wave decreases as the inhomogeneity parameter, $λ/L_{\rm A}$, increases. Here, $λ$ is the wavelength of the Alfvén wave and $L_{\rm A}$ is the scale length of Alfvén speed gradient. For gradients similar to those in coronal holes, the waves are observed to lose a factor of $\approx 5$ more energy than they do when propagating through a uniform plasma without a gradient. We have carried out further experiments and analyses to constrain the cause of wave energy reduction in the gradient. The loss of Alfvén wave energy from mode coupling is unlikely, as we have not detected any other modes. Contrary to theoretical expectations, the reduction in the energy of the transmitted wave is not accompanied by a detectable reflected wave. Nonlinear effects are ruled out as the amplitude of the initial wave is too small and the wave frequency well below the ion cyclotron frequency. Since the total energy must be conserved, it is possible that the lost wave energy is being deposited in the plasma. Further studies are needed to explore where the energy is going.

preprint2016arXiv

Inferring the Coronal Density Irregularity from EUV Spectra

Understanding the density structure of the solar corona is important for modeling both coronal heating and the solar wind. Direct measurements are difficult because of line-of-sight integration and possible unresolved structures. We present a new method for quantifying such structure using density-sensitive EUV line intensities to derive a density irregularity parameter, a relative measure of the amount of structure along the line of sight. We also present a simple model to relate the inferred irregularities to physical quantities, such as the filling factor and density contrast. For quiet Sun regions and interplume regions of coronal holes, we find a density contrast of at least a factor of three to ten and corresponding filling factors of about 10-20%. Our results are in rough agreement with other estimates of the density structures in these regions. The irregularity diagnostic provides a useful relative measure of unresolved structure in various regions of the corona.

preprint2015arXiv

A Simple Method for Modeling Collision Processes in Plasmas with a Kappa Energy Distribution

We demonstrate that a nonthermal distribution of particles described by a kappa distribution can be accurately approximated by a weighted sum of Maxwell-Boltzmann distributions. We apply this method to modeling collision processes in kappa-distribution plasmas, with a particular focus on atomic processes important for solar physics. The relevant collision process rate coefficients are generated by summing appropriately weighted Maxwellian rate coefficients. This method reproduces the rate coefficients for a kappa distribution to an estimated accuracy of better than 5%. This is equal to or better than the accuracy of rate coefficients generated using "reverse engineering" methods, which attempt to extract the needed cross sections from the published Maxwellian rate coefficient data and then reconvolve the extracted cross sections with the desired kappa distribution. Our approach of summing Maxwellian rate coefficients is easy to implement using existing spectral analysis software. Moreover, the weights in the sum of the Maxwell-Boltzmann distribution rate coefficients can be found for any value of the parameter kappa, thereby enabling one to model plasmas with a time-varying kappa. Tabulated Maxwellian fitting parameters are given for specific values of kappa from 1.7 to 100. We also provide polynomial fits to these parameters over this entire range. Several applications of our technique are presented, including the plasma equilibrium charge state distribution (CSD), predicting line ratios, modeling the influence of electron impact multiple ionization on the equilibrium CSD of kappa-distribution plasmas, and calculating the time-varying CSD of plasmas during a solar flare.

preprint2015arXiv

Relative Abundance Measurements in Plumes and Interplumes

We present measurements of relative elemental abundances in plumes and interplumes. Plumes are bright, narrow structures in coronal holes that extend along open magnetic field lines far out into the corona. Previous work has found that in some coronal structures the abundances of elements with a low first ionization potential (FIP) < 10 eV are enhanced relative to their photospheric abundances. This coronal-to-photospheric abundance ratio, commonly called the FIP bias, is typically 1 for element with a high-FIP (> 10 eV). We have used EIS spectroscopic observations made on 2007 March 13 and 14 over an ~24 hour period to characterize abundance variations in plumes and interplumes. To assess their elemental composition, we have used a differential emission measure (DEM) analysis, which accounts for the thermal structure of the observed plasma. We have used lines from ions of iron, silicon, and sulfur. From these we have estimated the ratio of the iron and silicon FIP bias relative to that for sulfur. From the results, we have created FIP-bias-ratio maps. We find that the FIP-bias ratio is sometimes higher in plumes than in interplumes and that this enhancement can be time dependent. These results may help to identify whether plumes or interplumes contribute to the fast solar wind observed in situ and may also provides constraints on the formation and heating mechanisms of plumes.

preprint2014arXiv

Electron Impact Ionization of Stored Highly Charged Ions

Accurate cross section data for electron impact ionization (EII) are needed in order to interpret the spectra of collisionally ionized plasmas both in astrophysics and in the laboratory. Models and spectroscopic diagnostics of such plasmas rely on accurate ionization balance calculations, which depend, in turn, on the underlying rates for EII and electron-ion recombination. EII measurements have been carried out using the TSR storage ring located at the Max-Planck-Institut fuer Kernphysik in Heidelberg, Germany. Storage ring measurements are largely free of metastable contamination, resulting in unambiguous EII data, unlike what is encountered with other experimental geometries. As it is impractical to perform experiments for every ion, theory must provide the bulk of the necessary EII data. In order to guide theory, TSR experiments have focused on providing at least one measurement for every isoelectronic sequence. EII data have been measured for ions from 13 isoelectronic sequences: Li-like silicon and chlorine, Be-like sulfur, B-like magnesium, and F-like through K-like iron. These experimental results provide an important benchmark for EII theory.

preprint2014arXiv

Evidence for Wave Heating of the Quiet Sun Corona

We have measured the energy and dissipation of Alfvenic waves in the quiet Sun. A magnetic field was used to infer the location and orientation of the magnetic field lines along which the waves are expected to travel. The waves were measured using spectral lines to infer the wave amplitude. The waves cause a non-thermal broadening of the spectral lines, which can be expressed as a non-thermal velocity v_nt. By combining the spectroscopic measurements with this magnetic field model we were able to trace the variation of v_nt along the magnetic field. At the footpoints of the quiet Sun loops we find that waves inject an energy flux in the range of 1.2-5.2 x 10^5 erg cm^-2 s^-1. At the minimum of this range, this amounts to more than 80% of the energy needed to heat the quiet Sun. We also find that these waves are dissipated over a region centered on the top of the loops. The position along the loop where the damping begins is strongly correlated with the length of the loop, implying that the damping mechanism depends on the global loop properties rather than on local collisional dissipation.

preprint2014arXiv

Influence of Electron-Impact Multiple Ionization on Equilibrium and Dynamic Charge State Distributions: A Case Study Using Iron

We describe the influence of electron-impact multiple ionization (EIMI) on the ionization balance of collisionally ionized plasmas. We are unaware of any previous ionization balance calculations that have included EIMI, which is usually assumed to be unimportant. Here, we incorporate EIMI cross-section data into calculations of both equilibrium and non-equilibrium charge-state distributions (CSDs). For equilibrium CSDs, we find that EIMI has only a small effect and can usually be ignored. However, for non-equilibrium plasmas the influence of EIMI can be important. In particular, we find that for plasmas in which the temperature oscillates there are significant differences in the CSD when including versus neglecting EIMI. These results have implications for modeling and spectroscopy of impulsively heated plasmas, such as nanoflare heating of the solar corona.

preprint2013arXiv

Observational Quantification of the Energy Dissipated by Alfvén Waves in a Polar Coronal Hole: Evidence that Waves Drive the Fast Solar Wind

We present a measurement of the energy carried and dissipated by Alfvén waves in a polar coronal hole. Alfvén waves have been proposed as the energy source that heats the corona and drives the solar wind. Previous work has shown that line widths decrease with height in coronal holes, which is a signature of wave damping, but have been unable to quantify the energy lost by the waves. This is because line widths depend on both the non-thermal velocity v_nt and the ion temperature T_i. We have implemented a means to separate the T_i and v_nt contributions using the observation that at low heights the waves are undamped and the ion temperatures do not change with height. This enables us to determine the amount of energy carried by the waves at low heights, which is proportional to v_nt. We find the initial energy flux density present was 6.7 +/- 0.7 x 10^5 erg cm^-2 s^-1, which is sufficient to heat the coronal hole and acccelerate the solar wind during the 2007 - 2009 solar minimum. Additionally, we find that about 85% of this energy is dissipated below 1.5 R_sun, sufficiently low that thermal conduction can transport the energy throughout the coronal hole, heating it and driving the fast solar wind. The remaining energy is roughly consistent with what models show is needed to provide the extended heating above the sonic point for the fast solar wind. We have also studied T_i, which we found to be in the range of 1 - 2 MK, depending on the ion species.

preprint2012arXiv

Evidence of Wave Damping at Low Heights in a Polar Coronal Hole

We have measured the widths of spectral lines from a polar coronal hole using the Extreme Ultraviolet Imaging Spectrometer onboard Hinode. Polar coronal holes are regions of open magnetic field and the source of the fast solar wind. We find that the line widths decrease at relatively low heights. Previous observations have attributed such decreases to systematic effects, but we find that such effects are too small to explain our results. We conclude that the line narrowing is real. The non-thermal line widths are believed to be proportional to the amplitude of Alfven waves propagating along these open field lines. Our results suggest that Alfven waves are damped at unexpectedly low heights in a polar coronal hole. We derive an estimate on the upper limit for the energy dissipated between 1.1 and 1.3 solar radii and find that it is enough to account for up to 70% of that required to heat the polar coronal hole and accelerate the solar wind.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint

Fields this researcher appears in

astro-ph.SR Computation and Language physics.atom-ph physics.space-ph Artificial Intelligence Machine Learning physics.plasm-ph astro-ph.IM Cryptography and Security

Source provenance

Where this author record came from

arxivconfidence 95%

external id: arxiv:2605.16591:author:4:michael-hahn

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2604.26506:author:6:michael-hahn

Imported May 20, 2026Synced May 20, 2026

arxivconfidence 95%

external id: arxiv:2604.25800:author:5:michael-hahn

Imported May 20, 2026Synced May 20, 2026

9 works

Daniel Wolf Savin

Researcher

Daniel Wolf Savin contributes to research discovery and scholarly infrastructure.

Open to collaborate

1 works

Aleksandra Bakalova

Researcher

Aleksandra Bakalova contributes to research discovery and scholarly infrastructure.

Open to collaborate

1 works

Alexander Koller

Researcher

Alexander Koller contributes to research discovery and scholarly infrastructure.

Open to collaborate

1 works

Chengwei Qin

Researcher

Chengwei Qin contributes to research discovery and scholarly infrastructure.

Open to collaborate

Michael Hahn

What is connected

Connect this record

See the researcher in context

Building this map preview

16 published item(s)

Barriers to Universal Reasoning With Transformers (And How to Overcome Them)

How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning

SafeReview: Defending LLM-based Review Systems Against Adversarial Hidden Prompts

Tug-of-war between idioms' figurative and literal interpretations in LLMs

Crosslinguistic word order variation reflects evolutionary pressures of dependency and information locality

Evidence for Parameteric Decay Instability in the Lower Solar Atmosphere

Laboratory Calibrations of Fe XII-XIV Line-Intensity Ratios for Electron Density Diagnostics

Measured reduction in Alfvén wave energy propagating through longitudinal gradients scaled to match solar coronal holes

Inferring the Coronal Density Irregularity from EUV Spectra

A Simple Method for Modeling Collision Processes in Plasmas with a Kappa Energy Distribution

Relative Abundance Measurements in Plumes and Interplumes

Electron Impact Ionization of Stored Highly Charged Ions

Evidence for Wave Heating of the Quiet Sun Corona

Influence of Electron-Impact Multiple Ionization on Equilibrium and Dynamic Charge State Distributions: A Case Study Using Iron

Observational Quantification of the Energy Dissipated by Alfvén Waves in a Polar Coronal Hole: Evidence that Waves Drive the Fast Solar Wind

Evidence of Wave Damping at Low Heights in a Polar Coronal Hole