Frauds in Science

. Saturday, June 13, 2009
0 comments

A month ago I wrote a post in my Spanish Intelligent Systems blog about frauds in science. In this post I resume what I wrote because I think it can be a good initial point for a debate about the present and future of Science.

Talking about frauds in science can be quite long, there are many little things in the actual scientific process that should be corrected (fake conferences, strange publishing processes, etc.), but I'll focus on big frauds,.

In August 2005, PLoS Medicine published "Why Most Published Research Findings are False", dealing with bad experimental design which conduces to wrong research findings. As stated in this paper, the scientific process is much focused in nobel research, and there is almost no support for research trying to replicate previous results, trying to corroborate previous findings.

In the paper "Repairing research integrity" published in Nature, June 2008, Sandra Titus and her team analyzes the integrity on scientific studies. Based on a survey over more than 4.000 researchers from over 600 institutiones, the results showed more than 200 cases of bad conduct in some scientific study, a number much higher than the registered previously by ORI (Office of Research Integrity). More than 60% of the total meant to data falsification, being plagiarism the next more usual bad conduct detected. Some of these frauds are detected on time, as the Kristin Roovers case, that was discovered by the editors of The Journal of Clinical Investigations when he sent a paper containing some images that had been manipulated with Photoshop.

There exists some regions where frauds are even a bigger problem. This is the case of China, where more than 60% of PhD students admit they have plagiarize some work. This represents a really big problem for China's research and even for the whole scientific community.

Recently, another big fraud in science was discovered, when The Scientist informed that Elsevier, one of the biggest scientific publishers, has several agreements with companiers to publish scientific journals that the companies use to promote their products. The first case detected was the Australasian Journal of Bone and Joint Medicine, where a paper was published promoting a product from Merck, a company that paid Elsevier for designing this journal. Summer Johnson writes about this big fraud in Bioethics, a really recommended lecture.

The scientific community must react to all these things, if we want to preserve the image of science but, what can we do? I think there are several options that could improve the scientific process:

1.- Open Access. The Elsevier case must make us to think that letting companies like Elsevier to control scientific publishing is not a good idea. Open Access seems a good way to prevent science from the desires and interests of big publishing companies. It is also a good way to assure an egalitarian access to scientific results.

2.- We should try to help iniciatives refering to negative results (or less important ones). Journals like Journal of Interesting Negative Results in Natural Language Processing and Machine Learning, Journal of Negative Results on Biomedicine, or Journal of Negative Results, are doing a good job publishing that kind of results.

3.- It also seems very important to improve working conditions of researchers. For instance, in Spain a lot of researchers earn less money that if they were working in a supermarket or driving a taxi, occupations with less responsabilities and less impact in the society. Who can care about doing high quality research if can't give his/her family a decent living?

4.- It is also needed to take up again scientific ethics. As researchers we must value what science word means. Science is not about publishing papers, science is all about improving the global knowledge, science is something really great.

Wikipedia Page Traffic Statistics for DataMiners

.
0 comments

Gregory Piatetsky pointed in KDnuggets Twitter account the release of a data package containing 7 months of hourly pageview statistics for all articles in Wikipedia. This dataset has a compressed size over 320 GB, over 1 TB uncompressed, and includes 7 months of hourly page traffic statistics for over 2.5 Million Wikipedia articles. All text content, statistics and link data in the dataset are licensed under GFDL (GNU Free Document License).

1st International Workshop on Mining Social Media

. Wednesday, May 06, 2009
2 comments

I'm very glad to announce the MSM09 workshop that we are organizing in Sevilla (Spain), November 9. I hope to see some of you there :D

-------------------------------------------------------
CALL FOR PAPERS

1st International Workshop on Mining Social Media
http://www.socialgamingplatform.com/msm09/
Workshop at CAEPIA 2009 (http://www.lsi.us.es/caepia09)
November 9, 2009, Sevilla, Spain
Submission deadline: July 31, 2009
-------------------------------------------------------

OVERVIEW

Social Media are technological tools that allow users sharing and discuss information. Most Social Media are Internet based applications that manage textual information, as blogs (Blogger, Wordpress), microblogging (Twitter, Pownce), wikis (Wikipedia), forums, or Social Networks (Facebook, MySpace, LinkedIn). But there also exist other Social Media Internet applications where users share more than text, as photo sharing tools (Flickr, Picasa), video sharing (YouTube, Vimeo), livecasting (Ustream), or audio and music sharing (last.fm, ccMixter, FreeSound). More recent Social Media includes virtual worlds (Second Life), online gaming (World of Warcraft, WarHammer Online), game sharing (Miniclip.com) and Mobile Social Media like Nomad Social Networks where users share their current position in the Real World.

Social Media have been able to shift the way information is generated and consumed. At first, information was generated by one person and “consumed” by many people, but now the information is generated by many people and consumed by many people, changing the needs in information access and management. It is also noticeable that Social Media applications manage huge quantities of users and data: Facebook and MySpace manage between 100 and 150 million users, it is estimated that 1 million blog posts are generated each day, microblogging services like Twitter generates 3 million messages each day, YouTube manages more that 150.000 million videos, etc. All these points make clear that Social Media is an excellent application field for data miners.


CALL FOR PARTICIPATION

The Mining Social Media workshop aims to bring practitioners but also researchers with a specific focus on the application of existent or novel Data Mining techniques into the field of Social Media. We encourage the submission of experimental papers where Data Mining techniques are applied into existent Social Media, but also more theoretical papers that show clear application in real Social Media applications. The interesting topics include blog post analysis, blog comments analysis, blog spam, recommender systems for Social Media, clickstream analysis, relevance analysis, spam users detection, behavior analysis, contextual mining, Social Media user segmentation, route analysis for nomad Social Networks, multimedia mining, search in Social Media, etc.

All the submissions must be practical or theoretical applications of Data Mining or Information Management to Social Media (blogs, wikis, social networks, social services, e-mail, etc.).

A. Data Mining for Social Networks
* Novel Algorithms
* Association Rules
* Mining semi-structured data
* Classification and Ranking
* Clustering
* Text Mining
* Machine Learning
* Privacy Preserved Data Mining
* Statistical Methods
* Temporal and spatial data mining
* Parallel and Distributed Data Mining
* Interactive and Online Mining
* Data and Knowledge Visualization
* Multimedia mining (audio/video)
* Ensemble Methods
* Web Mining
* Graph Mining
* Link Mining
B. Information Management for Social Networks
* Recommender Systems
* Information Retrieval
* Sentiment Analysis
* Natural Language Processing
* Question Answering
* Semantic Processing
* Graph Analysis and Complex Networks
* Social Network Analysis
C. Possible applications
* E-Mail Spam Detection
* Blog/Social Networks Spam Detection
* Community Detection
* Users/content recommenders
* Trends discovery
* Blogs/Social Networks Community Dynamics
* User Reviews Ranking
* Blogs/Social Networks Contributions Summarization
* Abuse/Fraud Detection
* User Profile Modeling
* Event Detection and Tracking in Social Media

We welcome contributions through research papers and industrial reports/case studies on applications. We also welcome work-in-progress contributions, as well papers discussing potential research directions.


IMPORTANT DATES

* Deadline for submission: 31 July 2009
* Notification of acceptance: 17 September 2009
* Workshop: 9 November 2009


INVITED SPEAKER

* William W. Cohen (Carnegie Mellon University)


INDUSTRY PANEL

The Industry Panel features panelists from several Social Media companies, who will give their vision about needs, opportunities and current solutions of Data Mining and related techniques to real Social Media solutions. Up to date, Strands (http://www.strands.com) and Tuenti (http://www.tuenti.com) have confirmed their presence in this panel.


SUBMISSION OF PAPERS

Submissions must be anonymous. Papers must be sent in a PDF file, written in English and not exceed 12 pages including figures, references, etc. and should be formatted according to the Springer LNAI guidelines. Papers will be reviewed by at least two PC members, and accepted papers will be published in the workshop proceedings. We are currently studying the possibility of publishing Springer post-proceedings and a special issue on a Data Mining journal.


SPONSORS (confirmed as of May 3, 2009):

* MAVIR (http://www.mavir.net)
* UEM (http://www.uem.es)
* Fundación Madri+D para el Conocimiento (http://www.madrimasd.org)
* STRANDS (http://www.strands.com)


ORGANIZATION COMMITEE

* Jose Carlos Cortizo (Social Gaming Platform, Universidad Europea de Madrid) - primary contact (josecarlos.cortizo@wipley.com)
* Francisco Manuel Carrero (Social Gaming Platform, Universidad Europea de Madrid)
* Jose Maria Gomez (Optenet)
* Enrique Puertas (Universidad Europea de Madrid)
* Borja Monsalve (Social Gaming Platform, Universidad Europea de Madrid)


PROGRAM COMMITEE

* Nitin Agarwal (University of Arkansas at Little Rock, USA)
* Aris Anagnostopoulos (Sapienza Università di Roma , Italy)
* Raul Arrabales (Universidad Carlos III de Madrid, Spain)
* Dominik Benz (University of Kassel, Germany)
* Paolo Boldi (University of Milano, Italy)
* Iván Cantador (Universidad Autónoma de Madrid & University of Glasgow, United Kingdom)
* Francisco Manuel Carrero (Social Gaming Platform, Spain)
* Pablo Castells (Universidad Autónoma de Madrid, Spain)
* Meeyoung Cha (Max Planck Institute, Germany)
* Yun Chi (Nec Laboratories America, USA)
* Jose Carlos Cortizo (Social Gaming Platform, Spain)
* Debora Donato (Yahoo! Research, Spain)
* Cesar Estebanez (Universidad Carlos III de Madrid, Spain)
* Jose Maria Gomez (Optenet, Spain)
* Antonio Gulli (Ask.com, Italy)
* Viet Ha-Thuc (University of Iowa, USA)
* José Antonio Iglesias Martínez (Universidad Carlos III de Madrid, Spain)
* Akshay Java (MSN Microsoft, USA)
* Gueorgi Kossinets (Google, USA)
* Zornitsa Kozareva (Universidad de Alicante, Spain)
* Beate Krause (University of Kassel, Germany)
* Emmanuelle Lebhar (CNRS & Universidad de Chile, Chile)
* Agapito Ledezma Espino (Universidad Carlos III de Madrid, Spain)
* Kristina Lerman (USC Information Sciences Institute, California, USA)
* Ee-Peng Lim (Singapore Management University, Singapore)
* Stéphane Marchand-Maillet (University of Geneva, Switzerland)
* Luis Martin (Universidad Politecnica de Madrid, Spain)
* Borja Monsalve (Social Gaming Platform, Spain)
* Cesar de Pablo (Universidad Carlos III de Madrid, Spain)
* Manos Papagelis (University of Toronto, Canada)
* Victor Peinado (Universidad Nacional de Educación a Distancia, Spain)
* Enrique Puertas (Universidad Europea de Madrid, Spain)
* Josep Lluis de la Rosa (Universidad de Girona & Rensselaer Polytechnic Institute, USA)
* Sara Owsley Sood (Pomona College, USA)
* Markus Strohmaier (Graz University of Technology, Austria)
* Yaiza Temprado (Universidad Europea de Madrid, Spain)
* Marc Torrens (Strands, Spain)



The Socialization of the WWW

. Thursday, April 23, 2009
0 comments

This year, WWW2009 is being held in Madrid, a great event and a great place to celebrate the 20th Aniversary of the WWW. The WWW Conference is a great place to study the current opportunities and problems on the Web setup, and that’s why I use to read the abstracts of the accepted papers each years. And there is a visible evolution towards the Social Web in the last 3 years.

In WWW 2007 at Banff (Canada), there was no Social track, only a session called “Mining in Social Networks” under Data Mining track, were 3 papers were presented, but addressing Social Networks as a general paradigm of communication among people. WWW 2008 at Beining (China) had a Social Networks track, containing 3 sessions: “Analysis of Social Networks and Online Interaction Spaces”, “Discovery and Evolution of Communities” and “Applications and Infrastructures for Web 2.0”, and some other papers addressing Social Media as “Finding the Right Facts in the Crowd: Factoid Question Answering over Social Media”.

WWW 2009 clearly shows that the Web is Social. There exist a “Social Networks and Web 2.0” track addressing Social Media and Collaborative Web, with 4 sessions (Recommender Systems, Interactions in Social Communities, Difusion and Search in Social Networks and Photos and Web 2.0) and 12 papers, but there are also, at least, 5 papers in other tracks that address other relevant topics like privacy from a Social Media point of view:
More than 12% of the papers accepted in WWW2009 are related to Social Media, and that’s a very good symptom of the Socialization of the Web.

Aware of Ranking Scam

. Wednesday, April 22, 2009
0 comments

In the comments of a previous post, Jijung Tang links a very interesting information about online anonymous ranking sites. He points out some considerations about cs.conference-ranking.net and cs-conference-ranking.org. A must read post.

When I wrote the post about Conference Rankings, I didn't take care about who was behind cs-conference-ranking.org. I was said it was a good reference, and much of the conferences seem to be in a logical order. But Jijung is right, it's weird to find IAENG conferences among the most relevant conferences, and to hide the information about the creators of the ranking is not a good signal.

Social Media, Data Mining & Machine Learning

.
0 comments

I've been more than two months without updating the blog. A lot of work with my new company, trying to find funding for starting-up, and developing tons and tons of code lines. Now I'll try to update the blog once per week of once each two weeks, at least, but as my research interests have shifted a little bit to Social Media applications, I've decided to change the name of the blog from "Business Intelligence, Data Mining & Machine Learning" o "Social Media, Data Mining & Machine Learning". Probably the name of the blog is not so important, but working now on Social Media I feel more comfortable with that keywork in the title.

SSMS 2009 - Summer School on Multimedia Semantics

.
0 comments

** SSMS 2009 - SUMMER SCHOOL ON MULTIMEDIA SEMANTICS ** -- University of Koblenz, Germany --

Managing and Modeling of Multimedia and User Generated Content in Web 3.0

http://www.smart-society.net/ssms09 23-28 August 2009, Koblenz, Germany Application deadline: 15 May 2009 (Friday)

INTRODUCTION

We are pleased to announce 4th edition of Summer School on Multimedia Semantics. This summer school series successfully started in 2006, each year offering top level education at great European locations (Kallithea, Greece; Glasgow, UK; Crete, Greece) for students from all over the world. Summer School on Multimedia Semantics is intended for PhD and Master students, who want to learn more on use of semantics in various media. We offer lectures by leading researchers in the field. Organized student poster sessions will create unique networking opportunities. They are excellent place to present your work, get comments and feedback from senior researchers, or exchange ideas with other participants.

PROGRAM HIGHLIGHTS:

  • Social Media Modeling
  • Audio Processing and Semantics
  • Video Analysis and Semantic Retrieval
  • Multimedia Personalization

LECTURERS:

  • Prof. Dr. Karlheinz Brandenburg (IDMT Fraunhofer, Germany)
  • Prof. Dr. Andreas Nuernberger (University of Magdeburg, Germany)
  • Prof. Dr. Lynda Hardman (Centrum Wiskunde & Informatica, Netherlands)
  • Prof. Dr. Steffen Staab (University of Koblenz-Landau, Germany)
  • Prof. Dr. Andreas Hotho (University of Kassel, Germany)
  • Dr. Marcel Worring (University of Amsterdam, Netherlands)
  • Prof. Dr. Fabio Ciravegna (University of Sheffield, England)
  • Dr. Yiannis Kompatsiaris (ITI, Greece)
  • Prof. Dr. Noel O`Connor (Dublin City University, Ireland)
  • Dr. Ansgar Scherp (University of Koblenz-Landau, Germany)

APPLICATION PROCEDURE:

We welcome applicants from anywhere in the world. Summer school is mainly addressed for graduate students (Masters and PhD). Each participant is required to bring a poster presenting her/his work. Organized poster sessions will allow to exchange ideas and get feedback on your research. Applications must be send directly to Ruth Götten ( confsec@uni-koblenz.de ) - PDF, ?MsWord or plain text are preferred.

Application must include: - your name, - organization, - name of your supervisor(s), - abstract of your work (limit: 1 page)

More details available at: http://www.smart-society.net/ssms09/index.php?page=registration

CONTACT: confsec@uni-koblenz.de

VENUE:

Koblenz (which means: merging of the rivers) is situated in the picturesque valley of the Rhine and Moselle. Middle Rhine Valley is a UNESCO World Heritage site ( http://www.welterbe-mittelrheintal.de ) City is by four mountain ranges with many vineyards in the valley, its narrow alleyways and happy atmosphere, Koblenz is welcoming town for guests from all over the world. The town has old traditions, reaching over 2000 years to the times of the Roman empire. More at http://en.wikipedia.org/wiki/Koblenz

ACCOMMODATION AND SOCIAL EVENTS:

The Summer School classes will be held at the Campus of the University Koblenz-Landau ( http://www.uni-koblenz.de ). Suggested accommodation is provided by "Contel" Hotel ( http://www.contel-koblenz.de ) in Koblenz, which is conveniently located within walking distance from the Koblenz campus. Various other options for housing in different price ranges are also available - check SSMS'09 website for more details. Study hard and enjoy summer! Each
day of intensive study is followed by a different social event. Participants and lecturers will find many opportunities to talk, socialize and have more direct contact outside of the classroom. On Wednesday we plan a longer, half-day trip outside of Koblenz.

CALENDAR:

  • Application deadline: 15 May 2009 Acceptance notification: 1 June 2009
  • Registration deadline: 15 June 2009 Summer School: 23 - 28 August

COST:

The tuition fee for each student is 350 EUR. It covers lectures, meals, excursion and other organized social events. Accommodation is NOT included in the tuition. If you have questions, please contact Ruth Götten at confsec@uni-koblenz.de

STUDENT SCHOLARSHIPS

We offer a limited number of student scholarships that cover the tuition fee for the summer school. To apply for a scholarship, please attach a letter to your application (limit 1 page) explaining what you expect to learn during the summer school in Koblenz, motivate why you apply for this scholarship and clarify why you cannot use other funds.

ADVISORY BOARD:

  • Dr. Yiannis Kompatsiaris, ITI, Greece
  • Prof. Dr. Ebroul Izquierdo, Queen Mary University London, England
  • Prof. Dr. Lynda Hardman, CWI, The Netherlands
  • Prof. Dr. Fabio Ciravegna, University of Sheffield, England
  • Prof. Dr. Alan Smeaton, Dublin City University, Ireland

ORGANIZATIONAL COMMITTEE:

  • Prof. Dr. Steffen Staab
  • Dr. Ansgar Scherp
  • Dr. Marcin Grzegorzek
  • Dr. Maciej Janik
  • Ruth Götten, Dipl.-Päd.

IEEE International Conference on Data Mining 2009

. Saturday, January 31, 2009
4 comments

December 6-9, 2009 Miami, U.S.A.

The IEEE International Conference on Data Mining (ICDM) has established itself as the world's premier research conference in data mining. The 2009 edition of ICDM provides a leading forum for presentation of original research results, as well as exchange and dissemination of innovative, practical development experiences.

The conference covers all aspects of data mining, including algorithms, software and systems, and applications. In addition, ICDM draws researchers and application developers from a wide range of data mining related areas such as statistics, machine learning, pattern recognition, databases and data warehousing, data visualization, knowledge-based systems, and high performance computing.

By promoting novel, high quality research findings, and innovative solutions to challenging data mining problems, the conference seeks to continuously advance the state-of-the-art in data mining.

Besides the technical program, the conference will feature workshops, tutorials, panels, and the ICDM data mining contest.

Topics of Interest

  • Data mining foundations
    • Novel data mining algorithms in traditional areas (such as classification, regression, clustering, probabilistic modeling, pattern discovery, and association analysis)
    • Models and algorithms for new, structured, data types, such as arising in chemistry, biology, environment, and other scientific domains
    • Developing a unifying theory of data mining
    • Mining sequences and sequential data
    • Mining spatial and temporal datasets
    • Mining textual and unstructured datasets
    • Distributed data mining
    • High performance implementations of data mining algorithms
    • Privacy and anonymity-preserving data analysis
  • Mining in emerging domains
    • Stream data mining
    • Mining moving object data, RFID data, and data from sensor networks
    • Ubiquitous knowledge discovery
    • Mining multi-agent data
    • Mining and link analysis in networked settings: web, social and computer networks, and online communities
    • Mining the semantic web
    • Data mining in electronic commerce, such as recommendation, sponsored web search, advertising, and marketing tasks
  • Methodological aspects and the KDD process
    • Data pre-processing, data reduction, feature selection, and feature transformation
    • Quality assessment, interestingness analysis, and post-processing
    • Statistical foundations for robust and scalable data mining
    • Handling imbalanced data
    • Automating the mining process and other process related issues
    • Dealing with cost sensitive data and loss models
    • Human-machine interaction and visual data mining
    • Integration of data warehousing, OLAP and data mining
    • Data mining query languages
    • Security and data integrity
  • Integrated KDD applications, systems, and experiences
    • Bioinformatics, computational chemistry, ecoinformatics
    • Computational finance, online trading, and analysis of markets
    • Intrusion detection, fraud prevention, and surveillance
    • Healthcare, epidemic modeling, and clinical research
    • Customer relationship management
    • Telecommunications, network and systems management
    • Sustainable mobility and intelligent transportation systems

Important Dates

  • April 13, 2009 - Deadline for workshop proposals
  • June 26, 2009 - Deadline for paper submission, tutorial submission, and panel proposals
  • September 4, 2009 - Notification to authors
  • September 28, 2009 - Deadline for camera-ready copies
  • December 6-9, 2009 Conference

Information Access vs. Information Retrieval

. Tuesday, January 27, 2009
0 comments

Jose Maria Gomez publishes a very interesting post about the differences of Information Access and Information Retrieval that are not so clear for a lot of people, including researchers of areas distant from IR or IA.

The Future of Social Networks

. Thursday, January 08, 2009
0 comments

Before Christmas, I wrote a post in my Spanish blog Sistemas Inteligentes (Intelligent Systems) containing some reflexions about the future of Social Networks. I try to resume the main idea and translate to English in this post, as I think Social Networks, and all Social Media are a really interesing field to KDD.

As stated by Wikipedia, "Emergence is the way complex systems and patterns arise out of a multiplicity of relatively simple interactions. Emergence is central to the theories of integrative levels and of complex systems". Some strong ideas in AI are connected with emergence, like Swarm Intelligence that "is a type of artificial intelligence based on the collective behavior of decentralized, self-organized systems. The expression was introduced by Gerardo Beni and Jing Wang in 1989, in the context of cellular robotic systems".


But now, what does emergence have in common with Social Networks? For me there is a clear similarity: both emergent systems and social networks present a group of individuals interacting among them to form something bigger. In emergent systems that group of, let's say stupid or limited, individuals are able to connect among themselves to create some kind of cooperative organism that is more intelligent than the union of the intelligences of the individuals. In today's Social Networks (refering to the Social Networks applications like Facebook or MySpace) we have a really better initial state, we have a group of intelligent individual cooperating among them, but the result is not the expected, because the global information is just the union (or even less, as some information may be duplicated) of the information generated by each user. But it's even worse, the global intelligence of the system is almost null, as today's Social Network systems are all about information and are not trying to create a superior layer of the system by processing all that information and creating real knowledge.

For me, it's clear that the future of Social Networks is about developing systems that generates added value to the users by processing all the information and connexions. Next years, Social Networks will need to use KDD techniques and I'me sure Social Media will become the next big application field for KDD and Machine Learning.

CFP: 25th Conference of the SEPLN (Spanish Society for Natural Language Processing)

. Thursday, December 18, 2008
0 comments

September 8-10, 2009
Palacio Miramar, Donostia - San Sebastián
http://ixa2.si.ehu.es/sepln2009/

INTRODUCTION

The 25th edition of the Annual Conference of the Spanish Society for Natural Language Processing (SEPLN) will take place in the Miramar Palace in San Sebastian on September 8, 9 and 10, 2008.

We also expect to organise three satellite workshops during the week of the conference (see list of workshops).

The huge amount of information available in digital format and in different languages demands systems that enable us to access this vast library in an increasingly more structured way.

In this same area, there is a renewed interest in improving information accessibility and information exploitation in multilingual environments. Many of the formal foundations for dealing appropriately with these necessities have been, and are still being established in the area of Natural Language Processing and its many branches:

  • Information extraction and retrieval, Questions answering systems,
  • Machine Translation, Automatic analysis of textual content, Text
  • Generation, and Speech recognition and synthesis.

The aim of the conference is to provide a forum for discussion and communication where the latest research work and developments in the field of Natural Language Processing (NLP) can be presented by scientific and business communities. The conference also aims at exposing new possibilities of real applications and R&D projects in this field.

Moreover, as in previous editions, there is the intention of identifying future guidelines or paths for basic research and foreseen software applications, in order to compare them against the market needs. Finally, the conference intends to be an appropriate forum in helping new professionals to become active members in this field.


TOPICS

Researchers and companies are encouraged to send communications, project abstracts or demonstrations related to any of the following language technology topics:
  • Linguistic, mathematic and psycholinguistic models of language
  • Corpus linguistics
  • Development of linguistic resources and tools
  • Grammars and formalisms for morphological and syntactic analysis
  • Semantics, pragmatics and discourse
  • Lexical ambiguity resolution
  • Machine Learning in NLP
  • Monolingual and multilingual text generation
  • Machine translation
  • Speech synthesis and recognition
  • Monolingual and multilingual information extraction and retrieval
  • Question answering systems
  • Automatic textual content analysis
  • Text summarization
  • NLP-based generation of teaching resources
  • NLP for languages with limited resources
  • NLP industrial applications

STRUCTURE OF THE CONFERENCE

The conference will last three days, and will consist of sessions devoted to presenting papers, posters, ongoing research projects, prototype product demonstrations or products connected with topics addressed in the conference. Besides, we expect to organize three satellite workshops during the week of the conference.


SUBMISSIONS

The proposal must be submitted earlier than April 24, 2009 and they must meet certain format and style requirements.

Both the delivery and revision of proposals will be done exclusively in PDF electronic format via the Myreview system. We recommend using the LaTeX and Word templates that can be downloaded from the conference webpage.

Besides, the proposals will have to comply the following requirements, depending if they are communications, demos or projects.


COMMUNICATIONS

Authors are encouraged to send theoretical or system-related proposals.

The proposals must include the following sections:

  • A title of the communication.
  • The complete names of the authors, their affiliations, address, and e-mail (anonymous in the submitted proposal).
  • An abstract in English and Spanish (maximum 150 words), including a list of keywords or related topics.
  • The proposal can be written and presented in Spanish or English, and its overall maximum length will be 8 pages, excluding references, which can take up an additional whole page at the most.
  • The documents must not include headings or footnotes.

The papers proposed will be assessed at least by three reviewers, and can be accepted to be presented either as posters or as communications, depending on the program necessities. However, no distinction will be made between communications and posters in the printed version of the SEPLN magazine.


PROJECTS AND DEMOS

As in previous editions, the organizers encourage participants to give oral presentations of R&D projects and demos of systems or tools related to the NLP field. For oral presentations on R&D projects to be accepted, the following information must be included:

  • Project title
  • Name, affiliation, address, e¬mail and phone number of the project director
  • Funding institutions
  • Groups participating in the project
  • Abstract (2 pages maximum)

For demonstrations to be accepted, the following information is mandatory:

  • Demo title
  • Name, affiliation, e-mail and phone number of the authors
  • Abstract (2 pages maximum)
  • Time estimation for the whole presentation

IMPORTANT DATES

  • April 24, 2009: Deadline for submitting papers, projects and demos
  • May 25, 2009: Notification of acceptance
  • June 19, 2009: Deadline for submitting the final version
  • July 15, 2009: Deadline for early registration
  • Sept. 7, 2009: Workshops
  • Sept. 8, 9 & 10: 25th SEPLN Conference

SCIENTIFIC COMMITTEE

Chairman: Kepa Sarasola (Euskal Herriko Unibertsitatea)

Members:

* Itziar Aduriz (Universitat de Barcelona)
* José Gabriel Amores (Universidad de Sevilla)
* Jose Maria Arriola (Euskal Herriko Unibertsitatea)
* Xabier Artola (Euskal Herriko Unibertsitatea)
* Toni Badía (Universitat Pompeu Fabra)
* Manuel de Buenaga (Universidad Europea de Madrid)
* Irene Castellón (Universitat de Barcelona)
* Arantza Díaz de Ilarraza (Euskal Herriko Unibertsitatea)
* Víctor Díaz Madrigal (Universidad de Sevilla)
* Antonio Ferrández (Universitat d'Alacant)
* Mikel Forcada (Universitat d'Alacant)
* Ana García-Serrano (Universidad Politécnica de Madrid)
* Alexander Gelbukh (Instituto Politécnico Nacional. México)
* Koldo Gojenola (Euskal Herriko Unibertsitatea)
* Xavier Gómez Guinovart (Universidade de Vigo)
* Julio Gonzalo (UNED)
* José Miguel Goñi (Universidad Politécnica de Madrid)
* José Carlos González (Universidad Politécnica de Madrid)
* Montserrat Marichalar (Euskal Herriko Unibertsitatea)
* José Mariño (Universitat Politècnica de Catalunya)
* M. Antonia Martí (Universitat de Barcelona)
* María Teresa Martín (Universidad de Jaén)
* Patricio Martínez (Universitat d'Alacant)
* Paloma Martínez (Universidad Carlos III, Madrid)
* Raquel Martínez (UNED)
* Ruslan Mitkov (Universidad de Wolverhampton)
* Manuel Montes y Gómez (Instituto Nacional de Astrofísica, Óptica y
Electrónica. México)
* Lidia Moreno (Universitat Politècnica de València)
* Lluís Padró (Universitat Politècnica de Catalunya)
* Ramón López Cózar (Universidad de Granada)
* Manuel Palomar (Universitat d'Alacant)
* Ferrán Pla (Universitat Politècnica de València)
* German Rigau (Euskal Herriko Unibertsitatea)
* Horacio Rodríguez (Universitat Politècnica de Catalunya)
* Leonel Ruiz Miyares (Centro de Lingüística Aplicada de Santiago de
Cuba)
* Emilio Sanchís (Universitat Politècnica de València)
* Kepa Sarasola (Euskal Herriko Unibertsitatea)

* Mariona Taulé (Universitat de Barcelona)
* L. Alfonso Ureña (Universidad de Jaén)
* Felisa Verdejo (UNED)
* Manuel Vilares (Universidad de A Coruña)
* Luis Villaseñor-Pineda (Instituto Nacional de Astrofísica, Óptica y
Electrónica. México)


CONTACT INFORMATION

All the information about the Conference is available in the 25th SEPLN Conference website: http://ixa2.si.ehu.es/sepln2009/ E-mail: sepln2009@ehu.es

CFP: 21st International Joint Conference on Artificial Intelligence (IJCAI-09)

. Monday, December 08, 2008
0 comments

The IJCAI-09 Program Committee invites submissions of technical papers for IJCAI-09, to be held in Pasadena, CA, USA, July 11-17, 2009. Submissions are invited on significant, original, and previously unpublished research on all aspects of artificial intelligence.

The theme of IJCAI-09 is "The Interdisciplinary Reach of Artificial Intelligence," with a focus on the broad impact of artificial intelligence on science, engineering, medicine, social sciences, arts and humanities. The conference will include invited talks, workshops, tutorials, and other events dedicated to this theme.
  • Important dates for authors of technical papers:
  • Electronic abstract submission: January 7, 2009 (11:59PM, PST)
  • Electronic paper submission: January 12, 2009 (11:59PM, PST)
  • Author feedback period: March 13-16, 2009 (11:59PM, PDT). Please note: Daylight savings time starts on March 8.
  • Author notification of acceptance/rejection: March 31, 2009
  • Camera-ready copy due: April 14, 2009
  • Technical sessions: July 13-17, 2009

Submission Details

Submitted papers must be formatted according to IJCAI guidelines and submitted electronically through the IJCAI-09 paper submission site. Full instructions for submission, including formatting guidelines and electronic templates for paper submission, are available on the IJCAI-09 website: http://www.ijcai-09.org (see the link titled Submission Details). Submitting authors will be required to register with the IJCAI-09 paper submission software (this will be linked from the IJCAI-09 website during the first week of December, 2008).

Papers may be accepted for either oral or poster presentation; papers accepted for either form of presentation will not be distinguished in the conference proceedings, nor will designation of oral or poster presentation be made on the quality of the contribution. Instead, these distinctions will be made in the interests of overall program coherence and quality.

To facilitate review, the paper title, author names, contact details, and a brief abstract must be submitted electronically by Jan. 7, 2009 (11:59 PST). No paper will be accepted for review unless an accompanying abstract is received by the deadline. Technical papers are due electronically on Jan. 12, 2009 (11:59 PST). Authors bear full responsibility for compliance with submission standards. Submissions received after the deadline or that do not meet the length or formatting requirements will not be accepted for review. No email or fax submissions will be accepted. Notification of receipt of the electronically submitted papers will be emailed to the designated contact author soon after receipt. If there are problems with the electronic submission, the program chair will contact the designated author by email. The last day for inquiries regarding lost submissions is Jan. 19, 2009. Notification of acceptance or rejection of submitted papers will be emailed to the designated author by March 31, 2009. The opportunity to respond to preliminary reviews will be made available to authors prior to this date, during the period March 13-16, 2009.

Guidelines for such responses, along with details of the reviewing process will be posted on the IJCAI-09 website. Camera-ready copy of accepted papers must be received by the publisher by April 14, 2009. Note: at least one author of each accepted paper is required to attend the conference to present the work. Authors will be required to confirm their acceptance of this requirement at the time of submission.

Authors who do not have access to the web should contact the program chair at pcchair09@ijcai.org no later than December 15, 2008 for alternate submission instructions.

Content Areas

To facilitate the reviewing process, authors will be required to choose two to four appropriate content area keywords from the list provided by the IJCAI-09 submission software, which will be part of the online paper registration process. Authors are encouraged to select the most specific keywords that accurately describe the main aspects of their contributions. General categories should only be used if specific categories do not apply or do not accurately reflect the main contributions. Each keyword is placed within one of ten 10 major themes; however, many of the keywords cut across multiple themes, and authors should feel free to select any keyword descriptive of the contribution, even if the major theme within which is it categorized is not the most appropriate. A list of keywords is appended to the end of this call.

The major themes are:

Agent-based and Multi-agent Systems
Constraints, Satisfiability, and Search
Knowledge Representation, Reasoning and Logic
Machine Learning
Multidisciplinary Topics And Applications
Natural Language Processing
Planning and Scheduling
Robotics and Vision
Uncertainty in AI
Web and Knowledge-based Information Systems

Policy on Multiple Submissions

IJCAI will not accept any paper which, at the time of submission, is under review for or has already been published or accepted for publication in a journal or another conference. Authors are also required not to submit their papers elsewhere during IJCAI's review period. These restrictions apply only to journals and conferences, not to workshops and similar specialized presentations with a limited audience and without archival proceedings. Authors will be required to confirm that their submissions conform to these requirements at the time
of submission.

Paper Length and Format

Submitted technical papers must be no longer than six pages, including all figures and references, and must be formatted according to posted IJCAI-09 guidelines. Specifically, papers must be formatted for "letter-size" (8.5" x 11") paper, in double-column format with a 10pt font. Electronic templates for the LaTeX typesetting package, as well as a Word template, that conform to IJCAI-09 guidelines will be made available at the conference website (see above) during the first week of December, as will further details on formatting.

Authors are required to submit their electronic papers in PDF format. Files in Postscript (ps), or any other format will not be accepted.

Submitted papers must not exceed six (6) formatted pages, including references and figures. This six-page limit will be strictly enforced: over-length papers will not be considered for review. Each accepted paper will be allowed six pages in the proceedings; up to two additional pages may be purchased at a price of $275 per page. In order to make blind reviewing possible, authors must omit their names and affiliations from the paper. Also, while the references should include all published literature relevant to the paper, including previous works of the authors, it should not include unpublished works. When referring to one's own work, use the third person rather than the first person. For example, say "Previously, Foo and Bar [7] have shown that...", rather than "In our previous work [7] we have shown that..." For accepted papers, such identifying information can be added to the final camera-ready version for publication.

Review Process

Papers will be subject to blind peer review. Selection criteria include accuracy and originality of ideas, clarity and significance of results and quality of the presentation. Each paper will be assigned to three Program Committee members, one Senior Program Committee member and one Area Chair for review. The reviewing process will include a short period for the authors to view reviews and respond to technical questions on the submitted work raised by the reviewers before final decisions are made. The decision of the Program Committee will be final and cannot be appealed.

Papers accepted for the conference will be scheduled for oral or poster presentation and will be printed in the proceedings. At least one author of each accepted paper will be required to attend the conference to present the work.

Please send inquiries about paper submissions to ijcai09@aaai.org.

Inquiries about the conference program can be directed to:

Craig Boutilier
Program Chair, IJCAI-09
Department of Computer Science
University of Toronto
Toronto, ON, M5S 3H5, CANADA
Email: pcchair09@ijcai.org

For further information please visit the conference web site: http://www.ijcai-09.org

List of keywords:

Agent-based and Multi-agent Systems
  • Agent/AI Theories and Architectures
  • Agent-based Simulation and Emergent Behavior
  • Agent Communication
  • Argumentation
  • Auctions And Market-Based Systems
  • Coordination And Collaboration
  • Distributed AI
  • E-Commerce
  • Game Theory
  • Information/Mobile/Software Agents
  • Multiagent Learning
  • Multiagent Planning
  • Multiagent Systems (General/other)
  • Negotiation And Contract-Based Systems
  • Social Choice Theory

Constraints, Satisfiability, and Search
  • Applications
  • Constraint Optimization
  • Constraint Satisfaction (General/other)
  • Distributed Search/CSP/Optimization
  • Dynamic Programming
  • Search, SAT, CSP: Evaluation and Analysis
  • Global Constraints
  • Heuristic Search
  • Search, SAT, CSP: Meta-heuristics
  • Meta-Reasoning
  • Quantifier Formulations
  • Satisfiability (General/other)
  • SAT and CSP: Modeling/Formulations
  • Search (General/other)
  • SAT and CSP: Solvers and Tools

Knowledge Representation, Reasoning and Logic
  • Action, Change and Causality
  • Automated Reasoning and Theorem Proving
  • Belief Change
  • Common-Sense Reasoning
  • Computational Complexity of Reasoning
  • Description Logics and Ontologies
  • Diagnosis and Abductive Reasoning
  • Geometric, Spatial, and Temporal Reasoning
  • Knowledge Representation Languages
  • Knowledge Representation (General/other)
  • Logic Programming
  • Many-Valued And Fuzzy Logics
  • Nonmonotonic Reasoning
  • Preferences
  • Qualitative Reasoning
  • Reasoning with Beliefs

Machine Learning
  • Active Learning
  • Case-based Reasoning
  • Classification
  • Cost-Sensitive Learning
  • Data Mining
  • Ensemble Methods
  • Evolutionary Computation
  • Feature Selection/Construction
  • Kernel Methods
  • Learning Graphical Models
  • Learning Preferences/Rankings
  • Learning Theory
  • Machine Learning (General/other)
  • Neural Networks
  • Online Learning
  • Reinforcement Learning
  • Relational Learning
  • Time-series/Data Streams
  • Transfer, Adaptation, Multi-task Learning
  • Semi-Supervised/Unsupervised Learning
  • Structured Learning

Multidisciplinary Topics And Applications
  • AI and Natural Sciences
  • AI and Social Sciences
  • Art And Music
  • Autonomic Computing
  • Cognitive Modeling
  • Computational Biology
  • Computer Games
  • Computer-Aided Education
  • Database Systems
  • Philosophical and Ethical Issues
  • Human-Computer Interaction
  • Intelligent User Interfaces
  • Interactive Entertainment
  • Personalization and User Modeling
  • Real-Time Systems
  • Security and Privacy
  • Validation and Verification

Natural-Language Processing
  • Dialogue
  • Discourse
  • Information Extraction
  • Information Retrieval
  • Machine Translation
  • Morphology and Phonology
  • Natural Language Generation
  • Natural Language Semantics
  • Natural Language Summarization
  • Natural Language Syntax
  • Natural Language Processing (General/other)
  • Psycholinguistics
  • Question Answering
  • Speech Recognition And Understanding
  • Text Classification

Planning and Scheduling
  • Activity and Plan Recognition
  • Hybrid Systems
  • Markov Decisions Processes
  • Model-Based Reasoning
  • POMDPs
  • Plan Execution And Monitoring
  • Plan/Workflow Analysis
  • Planning Algorithms
  • Planning under Uncertainty
  • Planning (General/other)
  • Scheduling
  • Theoretical Foundations of Planning

Robotics and Vision
  • Behavior And Control
  • Cognitive Robotics
  • Human Robot Interaction
  • Localization, Mapping, State Estimation
  • Manipulation
  • Motion and Path Planning
  • Multi-Robot Systems
  • Robotics
  • Sensor Networks
  • Vision and Perception

Uncertainty in AI
  • Approximate Probabilistic Inference
  • Bayesian Networks
  • Decision/Utility Theory
  • Exact Probabilistic Inference
  • Graphical Models
  • Preference Elicitation
  • Sequential Decision Making
  • Uncertainty Representations
  • Uncertainty in AI (General/other)

Web and Knowledge-based Information Systems
  • Information Extraction
  • Information Integration
  • Information Retrieval
  • Knowledge Acquisition
  • Knowledge Engineering
  • Knowledge-based Systems (General/other)
  • Ontologies
  • Recommender Systems
  • Semantic Web
  • Social Networks
  • Source Wrapping
  • Web Mining
  • Web Search
  • Web Technologies (General/other)

Call for ICML/UAI/COLT 2009 Workshop Proposals

. Monday, December 01, 2008
0 comments

The ICML, UAI, and COLT conferences will be colocated in Montreal June 14-21 2009. We solict proposals for workshops to be held during a single joint workshop day on June 18. This date lies between ICML (June 14-17) and UAI/COLT (June 19-21). Workshops will be selected on the basis of their interest to the attendees of one or more of the conferences.

The goal of the workshops is to provide an informal forum for researchers to discuss important research questions and challenges. Controversial issues, open problems, and comparisons of competing approaches are encouraged. Representation of alternative viewpoints and panel-style discussions are also encouraged.

Organization

The format, style, and content of accepted workshops is under the control of the workshop organizers and largely autonomous from the main conferences. The workshops will be seven hours long and split into morning and afternoon sessions. Workshop organizers will be expected to manage the workshop content, specify the workshop format, be present to moderate the discussion and panels, invite experts in the domain, and maintain a website for the workshop. Workshop registration will be handled centrally by the main conferences with a single uniform registration fee and with registrants allowed to attend workshops other than the one they register for.

Submission Instructions

Proposals should specify clearly all of the following:

  • the workshop's title (what is it called?)
  • topic (what is it about?)
  • motivation (why a workshop on this topic?)
  • impact and expected outcomes (what will having the workshop do?)
  • potential invited speakers (who might come?)
  • a list of related publications (where can we learn more?)
  • main workshop organizer (who is making it happen?)
  • other organizers (who else is making it happen?)
  • workshop URL (where will interested parties get more information?)
  • relevant conferences (which of ICML, UAI, and COLT would it appeal to?)

Please also provide brief CVs of all organizers. This information should be sent by email (in plain text or pdf format) to Icml-uai-colt-workshops09@ssli.ee.washington.edu by 19 Jan 2009.

16 PhD Scholarships in bioinformatics and robotics

.
0 comments

I just published in "Computer Science PhD" 16 PhD Scholarships for working on 16 individual projects in the fields of bioinformatics and robotics at the Graduate School for Computing in Medicine and Life Sciences (Germany). PhD scholarships amount to 1250 € per month. Students with a master's degree (or its equivalent) in computer science, mathematics, or engineering are invited to apply for admission. The application deadline is January 15, 2009.

ICML 2008 Call for Papers

. Thursday, November 27, 2008
0 comments

The 26th International Conference On Machine Learning (ICML-2009)
June 14-18, 2009, Montreal, Canada

This call for papers extends the preliminary call by including the conference website, icml2009.org, and the list of area chairs and topic descriptors, www.cs.rutgers.edu/~mlittman/icml09/ac.html . Please browse the list of area chairs to get a sense of the scope and coverage of this year's conference. We encourage a broad range of submissions!

ICML 2009 invites submission of engagingly written papers on substantial, original, and previously unpublished research in *all* aspects of machine learning. We welcome submissions of innovative work on systems that are self adaptive, systems that improve their own performance, or systems that apply logical, statistical, probabilistic or other formalisms to the analysis of data, to the learning of predictive models, or to interaction with the environment. We welcome innovative applications, theoretical contributions, carefully evaluated empirical studies, and we particularly welcome work that combines all of these elements. We also encourage submissions that bridge the gap between machine learning and other fields of research. ICML 2009 will be held in Montreal, Canada, June 14-18, 2009, and will be co-located with the Uncertainty in Artificial Intelligence Conference (UAI), and the Conference on Learning Theory (COLT), and Multidisciplinary Symposium on Reinforcement Learning (MSRL).

DATES (Note slightly earlier schedule than 2008):

  • January 26: Full paper submissions due (no separate abstract date)
  • February 27: First round reviews available
  • March 10: Author responses due
  • April 6: Acceptance notification
  • April 20: Final camera-ready version due
  • June 14: ICML Tutorials
  • June 15-17 ICML Conference
  • June 18: Joint Workshops Day, ICML/UAI/COLT; MSRL
Format of the Conference

The conference will include three days of technical presentations, one day of tutorials and one day of workshops. Accepted papers will each have an oral presentation as well as a poster in an evening poster session. There will also be talks by several invited speakers and a banquet.

Awards

Awards will be given for Best Paper(s), Best Student Paper(s) (first-authored by a student), Best Application Paper, 10-year Best Paper (most influential paper of ICML 1999).

Submission

Submission format, details and style files will soon be available on the ICML 2009 website (icml2009.org). Submission of papers and the management of the paper reviewing process will be entirely electronic.

Review Process (New for 2009!)

Our review process this year will be slightly different from previous years to further encourage innovative papers on a variety of topics. Authors will indicate a preference for an area chair to handle their papers via an inverse bidding process. It is crucial for authors to familiarize themselves with the 2009 area chairs and their topic descriptiors (http://www.cs.rutgers.edu/~mlittman/icml09/ac.html). The goal is to ensure each submission is considered by reviewers appropriate to the paper's intended contribution. Each submitted paper will receive two first round reviews. As in recent years, authors will have the opportunity to see and respond to the reviews before a final decision is made. Papers that receive at least one positive review in the first round will receive one or more additional reviews. Final decisions will be made using the input from all reviewers, the author feedback, the assigned area chair, and programme co-chairs. Reviewing for ICML 2009 will be blind to the identities of the authors. No conditional accepts will be granted this year.

ICML 2009 will not accept any paper that is substantially similar to another paper that is currently under review or has already been accepted for publication in a journal or another conference. The programme co-chairs will consider making an exception for papers published in substantially disjoint communities (application conferences, for example), as long as the submitted papers are themselves clearly targeted to a machine-learning audience. Please clearly indicate which contributions are novel and which are previous work, either by the authors or others. If a paper submitted to ICML 2009 and another already published or already submitted paper contain substantial overlap in content and the content is not clearly indicated (anonymously) as being previous work, then the ICML submission may be rejected on the grounds of being a dual submission.

Similarly, authors must withdraw their papers if they submit an overlapping paper elsewhere during ICML's review period.

With your help, we expect another excellent conference!

-The ICML2009 Organizational Team


General Chair:
Andrea Danyluk (Williams College)
Programme co-chairs:
Leon Bottou (NEC Research)
Michael Littman (Rutgers University)
Local Arrangements Chair:
Doina Precup (McGill University)