AIDA: Accurate Online Disambiguation of Named Entities in Text and Tables

News

To stay up to date with AIDA news and releases, send a mail to: aida-news-subscribe@lists.mpi-inf.mpg.de

Overview

AIDA is a framework and online tool for entity detection and disambiguation. Given a natural-language text or a Web table, it maps mentions of ambiguous names onto canonical entities (e.g., individual people or places) registered in the YAGO2 knowledge base.

You can try AIDA on any text you like in the online demo. If you are interested in using AIDA programmatically, the source code is available at github.com/yago-naga/aida.

To experimentally verify the quality of AIDA, we annotated nearly 1,400 newswire articles with the entities mentioned in each article. This collection is available for download (see Downloads).

Further Information

If you need any further information, please contact us via mail: NAME_OF_PROJECT@mpi-inf.mpg.de

Downloads and Datasets

The AIDA source code is available on github.com/yago-naga/aida. For AIDA to work, you will need to download our YAGO-based entity repository from our download page and import it into a PostgreSQL server.


Find all datasets related to AIDA in our downloads area.

AIDA JSON Web Service

We provide an HTTP JSON web service for AIDA so that you can try it out without the hassle of setting it up. It's as easy as:


curl --data text="Dylan was born in Duluth." https://gate.d5.mpi-inf.mpg.de/aida/service/disambiguate


More information is available in our web service description.
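For illustration, the same request can be issued from the Python standard library. This is only a sketch: it assumes nothing beyond what the curl call above shows, namely that the service accepts a form-encoded `text` parameter and returns JSON.

```python
import json
import urllib.parse
import urllib.request

SERVICE_URL = "https://gate.d5.mpi-inf.mpg.de/aida/service/disambiguate"

def build_request(text):
    """Form-encode the text parameter, mirroring the curl --data call."""
    data = urllib.parse.urlencode({"text": text}).encode("utf-8")
    return urllib.request.Request(SERVICE_URL, data=data)

def disambiguate(text):
    """POST the text to the AIDA web service and parse the JSON reply."""
    with urllib.request.urlopen(build_request(text)) as response:
        return json.loads(response.read().decode("utf-8"))

# Usage (requires network access; the reply's JSON fields are documented
# in the web service description):
#   result = disambiguate("Dylan was born in Duluth.")
```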

Please do not use it for comparisons in scientific papers or for running-time experiments, as the service changes continuously. If you want to compare against AIDA in your research, please download it and set it up on your own machines.

Publications

  • AIDA-Social: Entity Linking on the Social Stream
    Yusra Ibrahim, Mohamed Amir Yosef, Gerhard Weikum
    In: Proceedings of the 7th International Workshop on Exploiting Semantic Annotations in Information Retrieval, p. 17-19. ESAIR 2014, Shanghai, China, 2014
  • AIDArabic: A Named-Entity Disambiguation Framework for Arabic Text
    Mohamed Amir Yosef, Marc Spaniol, Gerhard Weikum
    In: Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, p. 187-195. ANLP 2014, Doha, Qatar, 2014
  • Discovering Emerging Entities with Ambiguous Names
    Johannes Hoffart, Yasemin Altun, Gerhard Weikum
    In: Proceedings of the 23rd International World Wide Web Conference, p. 385–395. WWW 2014, Seoul, South Korea, 2014
  • AIDA-light: High-Throughput Named-Entity Disambiguation
    Dat Ba Nguyen, Johannes Hoffart, Martin Theobald, Gerhard Weikum
    In: Linked Data on the Web, WWW 2014, Seoul, South Korea, 2014
  • KORE: Keyphrase Overlap Relatedness for Entity Disambiguation
    Johannes Hoffart, Stephan Seufert, Dat Ba Nguyen, Martin Theobald, and Gerhard Weikum
    In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, p. 545-554, CIKM 2012, Maui, USA, 2012
  • AIDA: An Online Tool for Accurate Disambiguation of Named Entities in Text and Tables
    Mohamed Amir Yosef, Johannes Hoffart, Ilaria Bordino, Marc Spaniol, Gerhard Weikum
    In: Proceedings of the 37th International Conference on Very Large Databases, VLDB 2011, p. 1450–1453, Seattle, WA, 2011
  • Robust Disambiguation of Named Entities in Text
    Johannes Hoffart, Mohamed Amir Yosef, Ilaria Bordino, Hagen Fürstenau, Manfred Pinkal, Marc Spaniol, Bilyana Taneva, Stefan Thater, Gerhard Weikum
    In: Conference on Empirical Methods in Natural Language Processing, p. 782–792, Edinburgh, Scotland, 2011
    For scientific work, please cite this paper.

Demo

AIDA can be tested online, with different methods and configurations for entity disambiguation: AIDA Web Demo

Please use either Firefox or Chrome to view the demo.

Results

In the EMNLP 2011 paper, the results are given on the subset of 228 CoNLL testb documents which could be processed by all the competitor methods (documents 1270testb, 1308testb, and 1349testb are missing). We did this for the sake of comparability. The results of our AIDA methods on all 231 CoNLL testb documents are given below. The short names are the same as in the paper.
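As a reminder of how the two reported measures differ, here is a minimal sketch of the standard definitions: macro precision@1 averages the per-document precision (each document counts equally), while micro precision@1 pools all mentions across documents (each mention counts equally). The example data is made up for illustration.

```python
def macro_precision_at_1(docs):
    """Average of per-document precision: every document has equal weight."""
    per_doc = [correct / total for correct, total in docs]
    return sum(per_doc) / len(per_doc)

def micro_precision_at_1(docs):
    """Pooled precision over all mentions: large documents dominate."""
    correct = sum(c for c, _ in docs)
    total = sum(t for _, t in docs)
    return correct / total

# Each pair is (correctly disambiguated mentions, total mentions) per document.
docs = [(9, 10), (1, 10), (50, 50)]
```

With this toy data, the macro score is (0.9 + 0.1 + 1.0) / 3 while the micro score is 60 / 70, which is why the two rows in the tables below can differ noticeably.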

Measure             | sim-k | r-prior sim-k | r-prior sim-k coh | r-prior sim-k r-coh | prior
Macro Precision@1.0 | 76.65 | 80.81         | 80.86             | 82.02               | 71.36
Micro Precision@1.0 | 76.65 | 80.06         | 82.24             | 82.29               | 66.55
AIDA results in % on all 231 CoNLL-YAGO testb documents (using the original EMNLP 2011 code)


Measure             | sim-k | r-prior sim-k | r-prior sim-k coh | r-prior sim-k r-coh | prior
Macro Precision@1.0 | 76.00 | 81.03         | 80.67             | 81.66               | 75.16
Micro Precision@1.0 | 76.61 | 80.56         | 82.05             | 82.54               | 70.46
AIDA results in % on all 231 CoNLL-YAGO testb documents (using the code re-engineered in early 2013)