Handling uncertainty in information extraction
This position paper proposes an interactive approach for developing information extractors based on the ontology definition process with knowledge about possible (in)correctness of annotations.
Habib, Mena B., Keulen, Maurice van
Using Contexts and Constraints for Improved Geotagging of Human Trafficking Webpages
Extracting geographical tags from webpages is a well-motivated application in many domains. In illicit domains with unusual language models, like human trafficking, extracting geotags with both high precision and recall is a challenging problem.
Kapoor, Rahul +2 more
A study of the tourism web coverage in Switzerland
This paper discusses experiments that were performed to understand the geographic and linguistic coverage of web resources focusing on tourism-related themes in Switzerland.
Venkateswaran, R
Extracting and Analyzing Semantic Relatedness between Cities Using News Articles
News articles capture a variety of topics about our society. They reflect not only the socioeconomic activities that happened in our physical world, but also some of the cultures, human interests, and public concerns that exist only in the perceptions of
Hu, Yingjie, Shaw, Shih-Lung, Ye, Xinyue
Toponym disambiguation in historical documents using network analysis of qualitative relationships
In this paper we use network analysis to identify qualitative "neighbors" for toponyms in an eighteenth-century French encyclopedia; the method could apply to any entry-based text with annotated toponyms. It draws on relations in a corpus of articles, which improves disambiguation at a later stage with an external resource.
Moncla, Ludovic +4 more
Word sense discrimination in information retrieval: a spectral clustering-based approach
Word sense ambiguity has been identified as a cause of poor precision in information retrieval (IR) systems. Word sense disambiguation and discrimination methods have been defined to help systems choose which documents should be ...
Chifu, Adrian-Gabriel +3 more
TALP-UPC at MediaEval 2014 Placing Task: Combining geographical knowledge bases and language models for large-scale textual georeferencing
This paper describes our georeferencing approaches, experiments, and results at the MediaEval 2014 Placing Task evaluation. The task consists of predicting the most probable geographical coordinates of Flickr images and videos using their visual, audio and
Ferrés Domènech, Daniel +1 more
A geo-temporal information extraction service for processing descriptive metadata in digital libraries
In the context of digital map libraries, resources are usually described according to metadata records that define the relevant subject, location, time-span, format and keywords.
Borbinha, José +3 more
Application of Text Summarization techniques to the Geographical Information Retrieval task
Automatic Text Summarization has been shown to be useful for Natural Language Processing tasks such as Question Answering or Text Classification and other related fields of computer science such as Information Retrieval.
Lloret, Elena +3 more
Assessing the Veracity of Methods for Extracting Place Semantics from Flickr Tags
The volume and potential value of user generated content (UGC) is ever growing. Multiply sourced, its value is greatly increased by the inclusion of metadata that adequately and accurately describes that content – particularly if such data are to be ...
Chaudhry, Omair, Mackaness, William

