Source author record

Ioannis Anagnostopoulos

Ioannis Anagnostopoulos appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Social and Information Networks Information Retrieval physics.soc-ph cs.CY Databases

Catalog footprint

What is connected

8works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2016arXiv

A Query Language for Multi-version Data Web Archives

The Data Web refers to the vast and rapidly increasing quantity of scientific, corporate, government and crowd-sourced data published in the form of Linked Open Data, which encourages the uniform representation of heterogeneous data items on the web and the creation of links between them. The growing availability of open linked datasets has brought forth significant new challenges regarding their proper preservation and the management of evolving information within them. In this paper, we focus on the evolution and preservation challenges related to publishing and preserving evolving linked data across time. We discuss the main problems regarding their proper modelling and querying and provide a conceptual model and a query language for modelling and retrieving evolving data along with changes affecting them. We present in details the syntax of the query language and demonstrate its functionality over a real-world use case of evolving linked dataset from the biological domain.

preprint2015arXiv

Discovering similar Twitter accounts using semantics

On daily basis, millions of Twitter accounts post a vast number of tweets including numerous Twitter entities (mentions, replies, hashtags, photos, URLs). Many of these entities are used in common by many accounts. The more common entities are found in the messages of two different accounts, the more similar, in terms of content or interest, they tend to be. Towards this direction, we introduce a methodology for discovering and suggesting similar Twitter accounts, based entirely on their disseminated content in terms of Twitter entities used. The methodology is based exclusively on semantic representation protocols and related technologies. An ontological schema is also described towards the semantification of the Twitter accounts and their entities.

preprint2014arXiv

Exploratory Analysis of a Terabyte Scale Web Corpus

In this paper we present a preliminary analysis over the largest publicly accessible web dataset: the Common Crawl Corpus. We measure nine web characteristics from two levels of granularity using MapReduce and we comment on the initial observations over a fraction of it. To the best of our knowledge two of the characteristics, the language distribution and the HTML version of pages have not been analyzed in previous work, while the specific dataset has been only analyzed on page level.

preprint2014arXiv

InfluenceTracker: Rating the impact of a Twitter account

We describe a methodology of rating the influence of a Twitter ac-count in this famous microblogging service. We then evaluate it over real ac-counts, under the belief that influence is not only a matter of quantity (amount of followers), but also a mixture of quality measures that reflect interaction, awareness, and visibility in the social sphere. The authors of this paper have created InfluenceTracker, a publicly available website where anyone can rate and compare the recent activity of any Twitter account.

preprint2014arXiv

Lifespan and propagation of information in On-line Social Networks a Case Study

Since 1950, information flows have been in the centre of scientific research. Up until internet penetration in the late 90s, these studies were based over traditional offline social networks. Several observations in offline information flows studies, such as two-step flow of communication and the importance of weak ties, were verified in several online studies, showing that the diffused information flows from one Online Social Network (OSN) to several others. Within that flow, information is shared to and reproduced by the users of each network. Furthermore, the original content is enhanced or weakened according to its topic, the dynamic and exposure of each OSNs. In such a concept, each OSN is considered a layer of information flows that interacts with each other. In this paper, we examine such flows in several social networks, as well as their diffusion and lifespan across multiple OSNs, in terms of user-generated content. Our results verify the perception of content and information connection in various OSNs.

preprint2014arXiv

Semantifying Twitter: the influenceTracker ontology

In this paper, we propose an ontology schema towards semantification provision of Twitter social analytics. The ontology is deployed over a publicly available service that measures how influential a Twitter account is, by combining its social activity and interaction over Twittersphere. Apart from influential quantity and quality measures, the service provides a SPARQL endpoint where users can perform advance semantic queries through the RDFized Twitter entities (mentions, replies, hashtags, photos, URLs) over the semantic graph.

preprint2013arXiv

Real Time Enhanced Random Sampling of Online Social Networks

Social graphs can be easily extracted from Online Social Networks. However these networks are getting larger from day to day. Sampling methods used to evaluate graph information cannot accurately extract graph properties. Furthermore Social Networks are limiting the access to their data, making the crawling process even harder. A novel approach on Random Sampling is proposed, considering both limitation and resources. We evaluate this proposal with 4 different settings on 5 different Test Graphs, crawled directly from Twitter. Through comparing the results we observe the pros and cons of its method as well as their resource allocation. Concluding we present their best area of application.

preprint2012arXiv

A methodology for internal Web ethics

The vigorous impact of the Web in time and space arises from the fact that it motivates massive creation, editing and distribution of information by Users with little knowledge. This unprecedented continuum provides novel opportunities for innovation but also puts under jeopardy its survival as a stable construct that nurtures a complex system of connections. We examine the Web as an ethics determined space by demonstrating Hayek's theory of freedom in a three-leveled Web: technological, contextualized and economic. Our approach accounts for the co-dependence of code and values, and assumes that the Web is a self-contained system that exists in and by itself. This view of internal Web ethics directly connects the concept of freedom with issues like centralization of traffic and data control, rights on visiting log file, custom User profiles and the interplay among function, structure and morality of the Web. It is also demonstrated, in the case of Net Neutrality, that generic freedom-coercion trade-offs are incomplete in treating specific cases at work.

Ioannis Anagnostopoulos

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

A Query Language for Multi-version Data Web Archives

Discovering similar Twitter accounts using semantics

Exploratory Analysis of a Terabyte Scale Web Corpus

InfluenceTracker: Rating the impact of a Twitter account

Lifespan and propagation of information in On-line Social Networks a Case Study

Semantifying Twitter: the influenceTracker ontology

Real Time Enhanced Random Sampling of Online Social Networks

A methodology for internal Web ethics