DBpedia Spotlight for collective linking of entities in HTML pages

Hector

Abstract

DBpedia Spotlight is a tool that automatically annotates mentions of DBpedia resources in text documents. As more and more content is published on the Web, it would be valuable to make DBpedia Spotlight functionality available so that users can conveniently annotate web pages while browsing. In addition, recent research indicates that collective disambiguation (treating the disambiguation decisions for related mentions in a context as a whole) performs better than disambiguation based on context alone. Introducing collective disambiguation techniques to DBpedia Spotlight may therefore help improve overall annotation quality.

Additional Information

The basic implementation will be based on the work of Han et al.:

Xianpei Han, Le Sun, and Jun Zhao. 2011. Collective entity linking in web text: a graph-based method. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval (SIGIR '11). ACM, New York, NY, USA
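To make the idea of collective disambiguation concrete, here is a toy sketch in Python. It is not the formulation from the paper above and not DBpedia Spotlight code: it only illustrates the general graph-based approach, under the assumption that each mention comes with candidate entities and a local (context-based) score, and that pairs of entities have a relatedness weight. The function name, the parameters restart and iterations, and all scores are hypothetical. Evidence starts on the mentions, is propagated through the graph by a random walk with restart, and each mention is then assigned the candidate with the highest product of local score and propagated evidence.

```python
from collections import defaultdict


def collective_disambiguate(candidates, relatedness, restart=0.1, iterations=50):
    """Toy collective disambiguation (illustrative assumption, not a real API).

    candidates:  {mention: {entity: local_score}}
    relatedness: {(entity_a, entity_b): weight}, treated as symmetric
    """
    # Build a weighted graph: mention -> entity and entity <-> entity edges.
    graph = defaultdict(dict)
    for mention, scored in candidates.items():
        for entity, local_score in scored.items():
            graph[("mention", mention)][("entity", entity)] = local_score
    for (a, b), weight in relatedness.items():
        graph[("entity", a)][("entity", b)] = weight
        graph[("entity", b)][("entity", a)] = weight

    # Initial evidence is spread uniformly over the mentions.
    initial = {("mention", m): 1.0 / len(candidates) for m in candidates}
    evidence = dict(initial)

    # Random walk with restart: push evidence along normalised edge weights.
    for _ in range(iterations):
        updated = defaultdict(float)
        for node, mass in evidence.items():
            neighbours = graph.get(node, {})
            total = sum(neighbours.values())
            if total == 0:
                continue  # nodes without outgoing edges pass nothing on
            for neighbour, weight in neighbours.items():
                updated[neighbour] += (1 - restart) * mass * weight / total
        for node, mass in initial.items():
            updated[node] += restart * mass
        evidence = updated

    # Pick, per mention, the candidate maximising local score * evidence.
    return {
        mention: max(
            scored,
            key=lambda e: scored[e] * evidence.get(("entity", e), 0.0),
        )
        for mention, scored in candidates.items()
    }


# Made-up example: "Jordan" is ambiguous on local context alone (0.5 vs 0.5).
candidates = {
    "Jordan": {"Michael_Jordan": 0.5, "Jordan_(country)": 0.5},
    "Bulls": {"Chicago_Bulls": 0.9, "Bull": 0.1},
}
relatedness = {("Michael_Jordan", "Chicago_Bulls"): 0.8}

result = collective_disambiguate(candidates, relatedness)
print(result)
```

In this example the two candidates for "Jordan" tie on local score, so a purely context-based decision would be arbitrary. The relatedness edge to Chicago_Bulls, which is strongly supported by the mention "Bulls", breaks the tie in favour of Michael_Jordan.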

Code samples