BD team seminar
Online Refresh Strategies for Content Based Feed Aggregation
Roxana Horincar

15 March 2013, 14:00 - 15:30
Room/Building: 445/PCRI-N
Contact : jesus.camacho-rodriguez@lri.fr

Abstract:
With the rapid growth of data sources, services, and devices connected to the Internet, web content available online is becoming increasingly diverse and dynamic. To disseminate evolving and short-lived information efficiently, many web applications publish new information as RSS and Atom documents, which are then collected and transformed by RSS aggregators such as Google Reader or Yahoo! News. I address the problem of large-scale aggregation of highly dynamic information sources, focusing on the design of optimal refresh strategies for large collections of RSS feed documents.

First, I introduce two quality measures specific to RSS aggregation, which reflect the information completeness and the average freshness of the result feeds. Then, I propose a best-effort feed refresh strategy that achieves the highest aggregation quality among existing policies that use the same average number of refreshes. This strategy relies on online change estimation models developed after an in-depth analysis of the temporal publication characteristics of a representative collection of real-world RSS feeds. The presented methods have been implemented and tested on synthetic and real-world RSS feed data sets.
