New Book: Modeling and Data Mining in Blogosphere

Friday, July 31, 2009

A new book on data mining for social media has been released. It is authored by Nitin Agarwal (University of Arkansas at Little Rock) and Huan Liu (Arizona State University). Its description reads: "This book offers a comprehensive overview of the various concepts and research issues about blogs or weblogs. It introduces techniques and approaches, tools and applications, and evaluation methodologies with examples and case studies."

ISBN: 9781598299083 paperback
ISBN: 9781598299090 ebook

Online version available:

Table of Contents:

  • Chapter 1: Modeling Blogosphere
  • Chapter 2: Blog Clustering and Community Discovery
  • Chapter 3: Influence and Trust
  • Chapter 4: Spam Filtering in Blogosphere
  • Chapter 5: Data Collection and Evaluation
  • Appendix A: Tools in Blogosphere
  • Appendix B: API Examples

The Lemur Query Log Project

Wednesday, July 29, 2009

Jose Maria Gomez has written on his blog about the Lemur Query Log Project, a very interesting initiative led by Dr. Bruce Croft. The Lemur Query Log Project features a toolbar that collects queries and related navigation data from users and sends them to a database, building a massive query log that may benefit the IR research community.

Information Retrieval, like most subdisciplines related to intelligent information access, relies on the availability of data, and more specifically on test datasets. That is why projects like the Lemur Query Log are so important for future research and development.