All Works

Entity linking of tweets based on dominant entity candidates

Yue Feng, Thomson Reuters
Fattane Zarrinkalam, Ryerson University
Ebrahim Bagheri, Ryerson University
Hossein Fani, Ryerson University
Feras Al-Obeidat, Zayed University

Document Type

Article

Source of Publication

Social Network Analysis and Mining

Publication Date

12-1-2018

Abstract

© 2018, Springer-Verlag GmbH Austria, part of Springer Nature. Entity linking, also known as semantic annotation, of textual content has received increasing attention. Recent work in this area has focused on entity linking for text with special characteristics, such as search queries and tweets. The semantic annotation of tweets has proven especially challenging given the informal nature of the writing and the short length of the text. In this paper, we propose a method for entity linking on tweets that is built on one primary hypothesis: while there are formally many possible entity candidates for an ambiguous mention in a tweet, as listed on the corresponding Wikipedia disambiguation page, only a few of them are likely to be used in the context of Twitter. Based on this hypothesis, we propose a method to identify such dominant entity candidates for each ambiguous mention and use them in the annotation process. In particular, our proposed work integrates two phases: (i) dominant entity candidate detection, which applies community detection methods to find the dominant candidates of ambiguous mentions; and (ii) named entity disambiguation, which links a tweet to entities in Wikipedia by considering only the identified dominant entity candidates. Our investigations show that (1) only very few entity candidates for each ambiguous mention in a tweet need to be considered when performing disambiguation, which limits the candidate search space and hence noticeably reduces entity linking time; and (2) limiting the search space to a subset of disambiguation options not only improves entity linking execution time but also improves the accuracy of the entity linking process when the main entity candidates of each mention are mined from a temporally aligned corpus. We show that our proposed method offers results competitive with state-of-the-art methods in terms of precision and recall on widely used gold standard datasets while significantly reducing the time for processing each tweet.
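The two phases described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the candidate inventory, the toy corpus, the keyword-overlap scoring, and all function names are hypothetical, and connected components of a word co-occurrence graph are used here as a simple stand-in for the community detection methods the paper applies.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical candidate inventory: mention -> {entity: descriptive keywords}.
CANDIDATES = {
    "apple": {
        "Apple_Inc.": {"iphone", "mac", "ios", "launch"},
        "Apple_(fruit)": {"pie", "juice", "tree", "eat"},
        "Apple_Records": {"beatles", "label", "album"},
    }
}

# Hypothetical tweet corpus, assumed to be temporally aligned with the input.
CORPUS = [
    "apple iphone launch today",
    "new apple iphone launch rumor",
    "apple pie juice recipe",
    "fresh apple pie juice",
    "apple mac ios update",
]


def tokens(text):
    """Lowercased whitespace tokens of a tweet, as a set."""
    return set(text.lower().split())


def communities(tweets, mention, min_count=2):
    """Stand-in for community detection: connected components of the
    co-occurrence graph of words that appear alongside `mention`."""
    pair_counts = defaultdict(int)
    for tweet in tweets:
        words = tokens(tweet)
        if mention in words:
            for a, b in combinations(sorted(words - {mention}), 2):
                pair_counts[(a, b)] += 1
    graph = defaultdict(set)
    for (a, b), count in pair_counts.items():
        if count >= min_count:  # keep only repeated co-occurrences
            graph[a].add(b)
            graph[b].add(a)
    seen, result = set(), []
    for start in sorted(graph):
        if start in seen:
            continue
        component, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node not in component:
                component.add(node)
                stack.extend(graph[node] - component)
        seen |= component
        result.append(component)
    return result


def dominant_candidates(tweets, mention, min_count=2):
    """Phase (i): keep candidates whose keywords match some community."""
    entities = CANDIDATES[mention]
    dominant = set()
    for community in communities(tweets, mention, min_count):
        best = max(sorted(entities), key=lambda e: len(entities[e] & community))
        if entities[best] & community:
            dominant.add(best)
    return dominant


def link(tweet, mention, dominant):
    """Phase (ii): disambiguate using only the dominant candidates."""
    words = tokens(tweet)
    entities = CANDIDATES[mention]
    ranked = sorted(dominant, key=lambda e: (-len(entities[e] & words), e))
    if ranked and entities[ranked[0]] & words:
        return ranked[0]
    return None  # no dominant candidate shares context with the tweet
```

In this toy setting, `dominant_candidates(CORPUS, "apple")` drops `Apple_Records` because no community in the corpus matches its keywords, so `link` compares a new tweet against two candidates instead of three. That reduction of the search space is the effect the abstract attributes to the dominant-candidate hypothesis.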

DOI Link

10.1007/s13278-018-0523-0

ISSN

1869-5450

Publisher

Springer-Verlag Wien

Disciplines

Computer Sciences

Keywords

Disambiguation, Entity linking, Linked data, Microblogging

Scopus ID

85049165546

Recommended Citation

Feng, Yue; Zarrinkalam, Fattane; Bagheri, Ebrahim; Fani, Hossein; and Al-Obeidat, Feras, "Entity linking of tweets based on dominant entity candidates" (2018). All Works. 1512.
https://zuscholars.zu.ac.ae/works/1512

Indexed in Scopus

yes
