You are hereResearch

Research


My current research is the large scale analysis of news-media content. Media analysis has been the field of social scientists for a long time. Their studies typically focus on detecting specific biases over limited number of news-outlets and for small time periods, since most of the work is performed manually. Nowadays, the on-line presence of most mainstream media, the advances of modern machine learning and data mining techniques allows the automated analysis of vast numbers of outlets in large time frames. I work on detecting, explaining and understanding the patterns that can be found in the news media content.

Examples of my research findings and achievements:

Reconstruction of the news-outlets network, based on the sharing of content of news-outlets

Relevant publications: [ECML/PKDD2010,WI-IAT2010]



An illustration of the global media network. Nodes are news-outlets that cover the same stories more than expected by chance. Outlets from the same country are coloured the same.
Prediction of popularity of news articles based on SVM ranking.

Relevant publications: [AIAI2010]



Prediction accuracies of "popular" vs. "non-popular" articles for ten news outlets.
Comparing countries based on their media content.

Relevant publications: [CIP2010]



Example of `Citations' network of EU countries for July 2009. Country A points to country B if B is mentioned `frequently' in news outlets of country A. .
Measuring biases in media content.

Relevant publications: [WAPA2010]



Comparison of readability of different topics in large scale-across a vast amount of news articles, for hundreds of outlets and for over a year of continuous monitoring.
Learning of network topology.


Relevant publications: [ECML/PKDD2010]

Three reference networks are constructed based on nodes' properties of the news-outlets network, i.e. language and country of origin, and media type. We show that they are able to predict network topology under a supervised learning scheme based on Generalised Linear Models.
Our group created a demo, namely Found in Translation, that presents some aspects of my work. My contributions were on text categorisation and building the demo pipeline.


Relevant publication:[ECML/PKDD2010B].



We constantly monitor and compare the EU countries based on the topic biases that their media choose to cover.

- Multilabel classification of multilingual news articles [WI-IAT2010, ECML/PKDD2010B].
- Correlation of the network topology to other known networks