Source author record

Arnim Bleier

Arnim Bleier appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Digital Libraries Social and Information Networks Computation and Language cs.CY Machine Learning physics.soc-ph Artificial Intelligence Information Retrieval

Catalog footprint

What is connected

8works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2016arXiv

A System for Probabilistic Linking of Thesauri and Classification Systems

This paper presents a system which creates and visualizes probabilistic semantic links between concepts in a thesaurus and classes in a classification system. For creating the links, we build on the Polylingual Labeled Topic Model (PLL-TM). PLL-TM identifies probable thesaurus descriptors for each class in the classification system by using information from the natural language text of documents, their assigned thesaurus descriptors and their designated classes. The links are then presented to users of the system in an interactive visualization, providing them with an automatically generated overview of the relations between the thesaurus and the classification system.

preprint2014arXiv

Social Media Monitoring of the Campaigns for the 2013 German Bundestag Elections on Facebook and Twitter

As more and more people use social media to communicate their view and perception of elections, researchers have increasingly been collecting and analyzing data from social media platforms. Our research focuses on social media communication related to the 2013 election of the German parlia-ment [translation: Bundestagswahl 2013]. We constructed several social media datasets using data from Facebook and Twitter. First, we identified the most relevant candidates (n=2,346) and checked whether they maintained social media accounts. The Facebook data was collected in November 2013 for the period of January 2009 to October 2013. On Facebook we identified 1,408 Facebook walls containing approximately 469,000 posts. Twitter data was collected between June and December 2013 finishing with the constitution of the government. On Twitter we identified 1,009 candidates and 76 other agents, for example, journalists. We estimated the number of relevant tweets to exceed eight million for the period from July 27 to September 27 alone. In this document we summarize past research in the literature, discuss possibilities for research with our data set, explain the data collection procedures, and provide a description of the data and a discussion of issues for archiving and dissemination of social media data.

preprint2014arXiv

When Politicians Talk: Assessing Online Conversational Practices of Political Parties on Twitter

Assessing political conversations in social media requires a deeper understanding of the underlying practices and styles that drive these conversations. In this paper, we present a computational approach for assessing online conversational practices of political parties. Following a deductive approach, we devise a number of quantitative measures from a discussion of theoretical constructs in sociological theory. The resulting measures make different - mostly qualitative - aspects of online conversational practices amenable to computation. We evaluate our computational approach by applying it in a case study. In particular, we study online conversational practices of German politicians on Twitter during the German federal election 2013. We find that political parties share some interesting patterns of behavior, but also exhibit some unique and interesting idiosyncrasies. Our work sheds light on (i) how complex cultural phenomena such as online conversational practices are amenable to quantification and (ii) the way social media such as Twitter are utilized by political parties.

preprint2013arXiv

Author Name Co-Mention Analysis: Testing a Poor Man's Author Co-Citation Analysis Method

As a social science information service for the German language countries, we document research projects, publications, and data in relevant fields. At the same time, we aim to provide well-founded bibliometric studies of these fields. Performing a citation analysis on an area of the German social sciences is, however, a serious challenge given the low and likely significantly biased coverage of these fields in the standard citation databases. Citations, and especially author citations, play a highly significant role in that literature, however. In this work in progress, we report preliminary methods and results for an author name co-mention analysis of a large fragment of a particularly interesting corpus of German sociology: a quarter century's worth of the full-text proceedings of the Deutsche Gesellschaft fuer Soziologie (DGS), which celebrated its 100th anniversary meeting in 2012. Results are encouraging for this poor cousin of author co-citation analysis, but considerable refinements, especially of the underlying computational infrastructure for full-text analysis, appear advisable for full-scale deployment of this method.

preprint2013arXiv

Practical Collapsed Stochastic Variational Inference for the HDP

Recent advances have made it feasible to apply the stochastic variational paradigm to a collapsed representation of latent Dirichlet allocation (LDA). While the stochastic variational paradigm has successfully been applied to an uncollapsed representation of the hierarchical Dirichlet process (HDP), no attempts to apply this type of inference in a collapsed setting of non-parametric topic modeling have been put forward so far. In this paper we explore such a collapsed stochastic variational Bayes inference for the HDP. The proposed online algorithm is easy to implement and accounts for the inference of hyper-parameters. First experiments show a promising improvement in predictive performance.

preprint2013arXiv

Towards an Author-Topic-Term-Model Visualization of 100 Years of German Sociological Society Proceedings

Author co-citation studies employ factor analysis to reduce high-dimensional co-citation matrices to low-dimensional and possibly interpretable factors, but these studies do not use any information from the text bodies of publications. We hypothesise that term frequencies may yield useful information for scientometric analysis. In our work we ask if word features in combination with Bayesian analysis allow well-founded science mapping studies. This work goes back to the roots of Mosteller and Wallace's (1964) statistical text analysis using word frequency features and a Bayesian inference approach, tough with different goals. To answer our research question we (i) introduce a new data set on which the experiments are carried out, (ii) describe the Bayesian model employed for inference and (iii) present first results of the analysis.

preprint2013arXiv

When Politicians Tweet: A Study on the Members of the German Federal Diet

In this preliminary study we compare the characteristics of retweets and replies on more than 350,000 messages collected by following members of the German Federal Diet on Twitter. We find significant differences in the characteristics pointing to distinct types of usages for retweets and replies. Using time series and regression analysis we observe that the likelihood of a politician using replies increases with typical leisure times while retweets occur constant over time. Including formal references increases the probability of a message being retweeted but drops its chance of being replied. This hints to a more professional use for retweets while replies tend to have a personal connotation.

preprint2012arXiv

A simple non-parametric Topic Mixture for Authors and Documents

This article reviews the Author-Topic Model and presents a new non-parametric extension based on the Hierarchical Dirichlet Process. The extension is especially suitable when no prior information about the number of components necessary is available. A blocked Gibbs sampler is described and focus put on staying as close as possible to the original model with only the minimum of theoretical and implementation overhead necessary.

Arnim Bleier

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

A System for Probabilistic Linking of Thesauri and Classification Systems

Social Media Monitoring of the Campaigns for the 2013 German Bundestag Elections on Facebook and Twitter

When Politicians Talk: Assessing Online Conversational Practices of Political Parties on Twitter

Author Name Co-Mention Analysis: Testing a Poor Man's Author Co-Citation Analysis Method

Practical Collapsed Stochastic Variational Inference for the HDP

Towards an Author-Topic-Term-Model Visualization of 100 Years of German Sociological Society Proceedings

When Politicians Tweet: A Study on the Members of the German Federal Diet

A simple non-parametric Topic Mixture for Authors and Documents