Source author record

Peter Staar

Peter Staar appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.str-el Machine Learning Artificial Intelligence Information Retrieval quant-ph

Catalog footprint

What is connected

9works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

BusiNet -- a Light and Fast Text Detection Network for Business Documents

For digitizing or indexing physical documents, Optical Character Recognition (OCR), the process of extracting textual information from scanned documents, is a vital technology. When a document is visually damaged or contains non-textual elements, existing technologies can yield poor results, as erroneous detection results can greatly affect the quality of OCR. In this paper we present a detection network dubbed BusiNet aimed at OCR of business documents. Business documents often include sensitive information and as such they cannot be uploaded to a cloud service for OCR. BusiNet was designed to be fast and light so it could run locally preventing privacy issues. Furthermore, BusiNet is built to handle scanned document corruption and noise using a specialized synthetic dataset. The model is made robust to unseen noise by employing adversarial training strategies. We perform an evaluation on publicly available datasets demonstrating the usefulness and broad applicability of our model.

preprint2022arXiv

TableFormer: Table Structure Understanding with Transformers

Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, Knowledge Graph's, etc, since they enhance their predictive capabilities. Unfortunately, tables come in a large variety of shapes and sizes. Furthermore, they can have complex column/row-header configurations, multiline rows, different variety of separation lines, missing entries, etc. As such, the correct identification of the table-structure from an image is a non-trivial task. In this paper, we present a new table-structure identification model. The latter improves the latest end-to-end deep learning model (i.e. encoder-dual-decoder from PubTabNet) in two significant ways. First, we introduce a new object detection decoder for table-cells. In this way, we can obtain the content of the table-cells from programmatic PDF's directly from the PDF source and avoid the training of the custom OCR decoders. This architectural change leads to more accurate table-content extraction and allows us to tackle non-english tables. Second, we replace the LSTM decoders with transformer based decoders. This upgrade improves significantly the previous state-of-the-art tree-editing-distance-score (TEDS) from 91% to 98.5% on simple tables and from 88.7% to 95% on complex tables.

preprint2022arXiv

Unsupervised Domain Generalization by Learning a Bridge Across Domains

The ability to generalize learned representations across significantly different visual domains, such as between real photos, clipart, paintings, and sketches, is a fundamental capacity of the human visual system. In this paper, different from most cross-domain works that utilize some (or full) source domain supervision, we approach a relatively new and very practical Unsupervised Domain Generalization (UDG) setup of having no training supervision in neither source nor target domains. Our approach is based on self-supervised learning of a Bridge Across Domains (BrAD) - an auxiliary bridge domain accompanied by a set of semantics preserving visual (image-to-image) mappings to BrAD from each of the training domains. The BrAD and mappings to it are learned jointly (end-to-end) with a contrastive self-supervised representation model that semantically aligns each of the domains to its BrAD-projection, and hence implicitly drives all the domains (seen or unseen) to semantically align to each other. In this work, we show how using an edge-regularized BrAD our approach achieves significant gains across multiple benchmarks and a range of tasks, including UDG, Few-shot UDA, and unsupervised generalization across multi-domain datasets (including generalization to unseen domains and classes).

preprint2021arXiv

Robust PDF Document Conversion Using Recurrent Neural Networks

The number of published PDF documents has increased exponentially in recent decades. There is a growing need to make their rich content discoverable to information retrieval tools. In this paper, we present a novel approach to document structure recovery in PDF using recurrent neural networks to process the low-level PDF data representation directly, instead of relying on a visual re-interpretation of the rendered PDF page, as has been proposed in previous literature. We demonstrate how a sequence of PDF printing commands can be used as input into a neural network and how the network can learn to classify each printing command according to its structural function in the page. This approach has three advantages: First, it can distinguish among more fine-grained labels (typically 10-20 labels as opposed to 1-5 with visual methods), which results in a more accurate and detailed document structure resolution. Second, it can take into account the text flow across pages more naturally compared to visual methods because it can concatenate the printing commands of sequential pages. Last, our proposed method needs less memory and it is computationally less expensive than visual methods. This allows us to deploy such models in production environments at a much lower cost. Through extensive architectural search in combination with advanced feature engineering, we were able to implement a model that yields a weighted average F1 score of 97% across 17 distinct structural labels. The best model we achieved is currently served in production environments on our Corpus Conversion Service (CCS), which was presented at KDD18 (arXiv:1806.02284). This model enhances the capabilities of CCS significantly, as it eliminates the need for human annotated label ground-truth for every unseen document layout. This proved particularly useful when applied to a huge corpus of PDF articles related to COVID-19.

preprint2016arXiv

Optimizing qubit resources for quantum chemistry simulations in second quantization on a quantum computer

Quantum chemistry simulations on a quantum computer suffer from the overhead needed for encoding the fermionic problem in a bosonic system of qubits. By exploiting the block diagonality of a fermionic Hamiltonian, we show that the number of required qubits can be reduced by a factor of two or more. There is no need to go into the basis of the Hilbert space for this reduction because all operations can be performed in the operator space. The scheme is conceived as a pre-computational step that would be performed on a classical computer prior to the actual quantum simulation. We apply this scheme to reduce the number of qubits necessary to simulate both the Hamiltonian of the two-site Fermi-Hubbard model and the hydrogen molecule. Both quantum systems can then be simulated with a two-qubit quantum computer.

preprint2014arXiv

Two-particle correlations in a dynamic cluster approximation with continuous momentum dependence: Superconductivity in the 2D Hubbard model

The DCA$^+$ algortihm was recently introduced to extend the dynamic cluster approximation (DCA) with a continuous lattice self-energy in order to achieve better convergence with cluster size. Here we extend the DCA$^+$ algorithm to the calculation of two-particle correlation functions by introducing irreducible vertex functions with continuous momentum dependence consistent with the DCA$^+$ self-energy. This enables a significantly more controlled and reliable study of phase transitions than with the DCA. We test the new method by calculating the superconducting transition temperature $T_{c}$ in the attractive Hubbard model and show that it reproduces previous high-precision determinantal quantum Monte Carlo results. We then calculate $T_c$ in the doped repulsive Hubbard model, for which previous DCA calculations could only access the weak-coupling ($U=4t$) regime for large clusters. We show that the new algorithm provides access to much larger clusters and delivers asymptotically converged results for $T_c$ for both the weak ($U=4t$) and intermediate ($U=7t$) coupling regimes, and thereby enables the accurate determination of the exact infinite cluster size result.

preprint2013arXiv

DCA$^+$: Dynamical Cluster Approximation with continuous lattice self-energy

The dynamical cluster approximation (DCA) is a systematic extension beyond the single site approximation in dynamical mean field theory (DMFT), to include spatially non-local correlations in quantum many-body simulations of strongly correlated systems. We extend the DCA with a continuous lattice self-energy in oder to achieve better convergence with cluster size. The new method, which we call DCA$^+$, cures the cluster shape dependence problems of the DCA, without suffering from causality violations of previous attempts to interpolate the cluster self-energy. A practical approach based on standard inference techniques is given to deduce the continuous lattice self-energy from an interpolated cluster self-energy. We study the pseudogap region of a hole-doped two-dimensional Hubbard model and find that in the DCA$^+$ algorithm, the self-energy and pseudo-gap temperature $T^*$ converge monotonously with cluster size. Introduction of a continuous lattice self-energy eliminates artificial long-rage correlations and thus significantly reduces the sign problem of the quantum Monte Carlo cluster solver in the DCA$^+$ algorithm compared to the normal DCA. Simulations with much larger cluster sizes thus become feasible, which, along with the improved convergence in cluster size, raises hope that precise extrapolations to the exact infinite cluster size limit can be reached for other physical quantities as well.

preprint2013arXiv

The Continuous-Pole-Expansion method to obtain spectra of electronic lattice models

We present a new algorithm to analytically continue the self-energy of quantum many-body systems from Matsubara frequencies to the real axis. The method allows straightforward, unambiguous computation of electronic spectra for lattice models of strongly correlated systems from self-energy data that has been collected with state-of-the are continuous time solvers within dynamical mean field simulations. Using well-known analytical properties of the self-energy, the analytic continuation is cast into a constrained minimization problem that can be formulated as a quadratic programmable optimization with linear constraints. The algorithm is validated against exactly solvable finite size problems, showing that all features of the spectral function near the Femi level are very well reproduced and coarse features are reproduced for all energies. The method is applied to two well known lattice problems, the two-dimensional Hubbard model at half filling where the momentum dependence of the gap formation is studied, as well as a multi-band model of NiO, for which the spectral function can be directly compared to experiment. Agreement with results published results is very good.

preprint2011arXiv

Sub-matrix updates for the Continuous-Time Auxiliary Field algorithm

We present a sub-matrix update algorithm for the continuous-time auxiliary field method that allows the simulation of large lattice and impurity problems. The algorithm takes optimal advantage of modern CPU architectures by consistently using matrix instead of vector operations, resulting in a speedup of a factor of $\approx 8$ and thereby allowing access to larger systems and lower temperature. We illustrate the power of our algorithm at the example of a cluster dynamical mean field simulation of the Néel transition in the three-dimensional Hubbard model, where we show momentum dependent self-energies for clusters with up to 100 sites.

Peter Staar

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

BusiNet -- a Light and Fast Text Detection Network for Business Documents

TableFormer: Table Structure Understanding with Transformers

Unsupervised Domain Generalization by Learning a Bridge Across Domains

Robust PDF Document Conversion Using Recurrent Neural Networks

Optimizing qubit resources for quantum chemistry simulations in second quantization on a quantum computer

Two-particle correlations in a dynamic cluster approximation with continuous momentum dependence: Superconductivity in the 2D Hubbard model

DCA$^+$: Dynamical Cluster Approximation with continuous lattice self-energy

The Continuous-Pole-Expansion method to obtain spectra of electronic lattice models

Sub-matrix updates for the Continuous-Time Auxiliary Field algorithm