Français Anglais
Accueil Annuaire Plan du site
Accueil > Production scientifique > Résultat majeur
Production scientifique
Résultat majeur : ENTITY DISCOVERY AND ANNOTATION IN TABLES
ENTITY DISCOVERY AND ANNOTATION IN TABLES
07 janvier 2013

G. Quercini and C. Reynaud. International Conference on Extending Database Technology
The Web is rich of tables (e.g., HTML tables, spreadsheets, Google Fusion Tables) that host a considerable wealth of high-quality relational data. Not surprinsingly, they have been increasingly drawing the attention of numerous researchers, especially from the information retrieval and extraction community; unlike unstructured texts, indeed, tables usually favour the automatic extraction of data because of their regular structure and properties. The data extraction is usually complemented by the annotation of the table, which finds its semantics by identifying a type for each column, the relations between columns, if any, and the entities that occur in each cell. In this paper, we focus on the problem of discovering and annotating entities in tables. More specially, entity annotation refers to the task of assigning a label (e.g. "restaurant", "museum") to a phrase (e.g. "T.G.I. Friday's", "Metropolitan Museum of Art") that denotes an entity.
Compared to existing approaches, we tackle this problem in a pragmatic way, which is motivated by specific application needs; in particular, we focus on Google Fusion Tables, which is a rapidly growing collection of tables with rich and high-quality data. The main novelty of our approach is that it does not rely on a pre-compiled reference catalogue of annotated entities, typically extracted from ontologies such as Yago and DBpedia, which limits the annotations to the sole entities that belong to the catalogue. Instead, we train an algorithm to look for information on previously unseen entities on the Web so as to annotate them with the correct type.



Activités de recherche
  ° Bases de données
  ° Intégration d'informations

Equipe
  ° Intelligence Artificielle et Systèmes d'Inférence

Contact
  [aucun]
Résultats majeurs
HOW FAST CAN YOU CONVERGE TOWARDS A CONSENSUS VALUE?
28 octobre 2021
In their recent work, Matthias Fuegger (LMF), Thomas Nowak (LISN), and Manfred Schwarz (TU Wien) stu

MODEL TRANSFORMATION AS CONSERVATIVE THEORY-TRANSFORMATION
30 octobre 2020
We present a new technique to construct tool support for domain-specific languages (DSLs) inside the

BEST STUDENT PAPER AWARD (ML) AT ECML 2019
20 septembre 2019
Guillaume Doquet (A&O), Best Student Paper Award (category Machine Learning) at ECML 2019.

BEST PAPER AWARD - HPCS 2019 - ON SERVER-SIDE FILE ACCESS PATTERN MATCHING
17 juillet 2019
Francieli Zanon Boito¹ , Ramon Nou², Laércio Lima Pilla³, Jean Luca Bez⁴, Jean-François Méhaut¹, T

BEST FULL PAPER AWARD EDM 2019 - EDUCATIONAL DATA MINING
05 juillet 2019
DAS3H: Modeling Student Learning and Forgetting for Optimally Scheduling Distributed Practice of Ski