Sao Carlos STIL 2009
September 8-11, 2009
São Carlos/SP, Brazil

Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo

TIL


The 7th Brazilian Symposium in Information and Human Language Technology
 

II International Workshop on Web and Text Intelligence (WTI)

 

September 11, 2009

 

Preface | Committees | Papers

 

Preface

 

Web and text intelligence are related areas that seek to improve human computer interaction in general and providing users with tools to explore and analyze information available on the Internet. Both areas benefit from knowledge, concepts and techniques from artificial intelligence, statistics, linguistics, among other fields. Web and Text Mining are hot application areas for AI and a source of inspiration for new methods and algorithms.

 

Known developments include self adaptive web sites, usage monitoring, web site personalization, information retrieval, information extraction, automatic Web site organization, large document collection mining and exploration, visualization and usability.

 

The II International Workshop on Web and Text Intelligence held in São Carlos – Brazil, associated to STIL 2009, gathered researchers who work on methods and theories (and their applications) that help us understand the Web and build automatic tools for better exploiting its complex world and who also develop new contributions dealing with text mining applications. This workshop is a follow-up of the I International Workshop on Web and Text Intelligence that took place in Salvador, Brazil, in October 2008 as a workshop associated to SBIA08.

 

The one day event gathered researchers, who presented state of the art research and proposed specific techniques and tools for enhancing systems and applications related to the Web and to textual document processing. Topics covered include ontology learning, ontology-based visualization, ontology-based information extraction applied to non-textual semantic annotation, information extraction from bibliographical references of scientific papers, text classification, semi-supervised approaches applied to relevance feedback for information retrieval, features and terms analysis for Portuguese corpus pre-processing, document clustering, and social network visualization. Papers accepted as poster presentations deal with the tasks of authorship identification, information extraction for use in web collaborative systems, and ontology learning.

 

Papers accepted for full presentation have been selected after a blind crossed peer-review process, with an acceptance rate of 50%. We would like to thank our Program Committee members for their valuable contribution to the final result of the workshop. We also thank the organizers of STIL for the opportunity of having WTI 09 collocated with STIL. Our gratitude goes also to our institutions, ICMC from the University of São Paulo and LIAAD-INESC Porto L.A. from the University of Porto. The support of both the FCT funded project Site-O-Matic (POSC/EIA/58367/2004) and the CNPq funded project "Modelagem Computacional de Sistemas Complexos Utilizando Mineração de Dados, Imagens e Textos" (550963/2007-3) have been also very important for the success of this event.

 

Organizers


Alípio M. Jorge, University of Porto

Alneu A. Lopes, University of Sao Paulo

Solange O. Rezende, University of Sao Paulo

 

Program Committee


Alípio Jorge, U. Porto, Portugal

Alneu Lopes, USP, Brazil

Ana Carolina Lorena, UFABC Brazil

Carlos Soares, U. Porto, Portugal

Flávia de Almeida Barros, UFPE, Brazil

Gael Harry Dias, UBI, Portugal

João Luís Garcia Rosa, USP, Brazil

José Luís Borges, U. Porto, Portugal

José Paulo Leal, U. Porto, Portugal

Lubos Popelinsky, Mazarik University, Czech Republic

Maarten van Someren, U. Amsterdam, Netherlands

Maria Carolina Monard, USP, Brazil

Maria Cristina Ferreira de Oliveira, USP, Brazil

Mario J. Silva, Universidade de Lisboa, Portugal

Paulo Azevedo, U. Minho, Portugal

Pavel Brazdil, U. Porto, Portugal

Ricardo Bastos Cavalcante Prudêncio UFPE, Brazil

Rosane Minghim USP, Brazil

Solange Rezende, USP, Brazil

Thiago Alexandre Salgueiro Pardo, USP, Brazil

Zhao Liang, USP, Brazil


Organizing Committee


Bruno Magalhães Nogueira, USP, Brazil, brunomn at icmc.usp.br

Fabiano Fernandez dos Santos, USP, Brazil, fabianof at icmc.usp.br

Ricardo M. Marcacini, USP, Brazil, marcacini at grad.icmc.usp.br

 

Full Papers

 

Experiments on Meta-data Generation of Web Business Charts
Horacio Saggion (University of Sheffield - United Kingdom)
 
Wordnet-based metrics do not seem to help document clustering
Alexandre Passos (UNICAMP), Jacques Wainer (UNICAMP)
 
Information Extraction from Tagged Bibliographical References
Alberto Cáceres Álvarez (ICMC-USP), Alneu de Andrade Lopes (ICMC-USP)
 
Exploração visual multidimensional de redes sociais
G. F. Andery (ICMC-USP), A. A. Lopes (ICMC-USP), R. Minghim (ICMC-USP)
 
Class-Test: Classificação automática de textos para auxiliar a criação de suítes de teste
Leonardo S. Lima (UFPE), Flávia A. Barros (UFPE), Ricardo B. C. Prudêncio (UFPE)
 
An Experiment Using Markov Logic Networks to Extract Ontology Concepts from Text
Lucas Drumond (UFMA), Rosario Girardi (UFMA)
 
Utilizando Co-Training para Realimentação de Relevância na WEB
Matheus Victor Brum Soares (ICMC-USP), Ronaldo C. Prati (UFABC), Maria Carolina Monard (ICMC-USP)
 
Content Based Visual Mining of Document Collections Using Ontologies
Katia Romero Felizardo (ICMC-USP), Rafael Messias Martins (ICMC-USP), José Carlos Maldonado (ICMC-USP), Alneu de Andrade Lopes (ICMC-USP), Rosane Minghim (ICMC-USP)
 
O Efeito do uso de Diferentes Formas de Geração de Termos na Compreensibilidade e Representatividade dos Termos em Coleções Textuais na Língua Portuguesa
Merley Conrado (ICMC-USP), Ricardo M. Marcacini (ICMC-USP), Maria F. Moura (Embrapa), Solange O. Rezende (ICMC-USP)

 

Extended Abstracts

 

O Uso de Dicionário de Atributos Estilométricos na Identificação de Autoria de Textos de Língua Portuguesa
Paulo Júnior Varela (PUC-PR), Edson J. R. Justino (PUC-PR), Luiz E. S. Oliveira (PUC-PR)
 
Enfoque probabilístico para la construcción de ontologías
Isidra Ocampo-Guzman (CINVESTAV - México), Ivan Lopez-Arevalo (CINVESTAV - México), Edgar Tello-Leal (UAT - México), Victor Sosa-Sosa (CINVESTAV - México)
 
Desenvolvimento de Sistemas de Extração de Informações para Ambientes Colaborativos na Web
Douglas Nogueira (UNIFOR), Vladia Pinheiro (UFC), Vasco Furtado (UNIFOR), Tarcisio Pequeno (UNIFOR)

 

Preface | Committees | Papers