Results 1 to 10 of about 721,212 (219)
This chapter reveals the overview of text mining; text mining, patent analysis, and keyword selection; text mining and sentiment analysis in modern marketing; text mining applications in the biomedical sciences; and the multifaceted applications of text mining.
Kristen Cooper
doaj +2 more sources
Text Mining Infrastructure in R [PDF]
During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining applications within R.
Kurt Hornik, Ingo Feinerer, David Meyer
doaj +1 more source
One of the fastest-growing fields in bioinformatics is text mining: the application of natural language processing techniques to problems of knowledge management and discovery, using large collections of biological or biomedical text such as MEDLINE.
Andrew B, Clegg, Adrian J, Shepherd
+5 more sources
A large dataset of scientific text reuse in Open-Access publications
We present the Webis-STEREO-21 dataset, a massive collection of Scientific Text Reuse in Open-access publications. It contains 91 million cases of reused text passages found in 4.2 million unique open-access publications.
Lukas Gienapp +4 more
doaj +1 more source
Text mining is the process of analyzing naturally occurring text for the purpose of discovering and capturing semantic information for insertion and storage in what I'll call a Knowledge Organization Structure (KOS) with the ultimate goal of enabling knowledge discovery via either textual or visual access for use in a wide range of significant ...
Meier, Marco C., Beckh, Michael
openaire +3 more sources
Comparing neural models for nested and overlapping biomedical event detection
Background Nested and overlapping events are particularly frequent and informative structures in biomedical event extraction. However, state-of-the-art neural models either neglect those structures during learning or use syntactic features and external ...
Kurt Espinosa +5 more
doaj +1 more source
Automatic identification of suicide notes with a transformer-based deep learning model
Suicide is one of the leading causes of death worldwide. At the same time, the widespread use of social media has led to an increase in people posting their suicide notes online. Therefore, designing a learning model that can aid the detection of suicide
Tianlin Zhang +2 more
doaj +1 more source
Natural language processing applied to mental illness detection: a narrative review
Mental illness is highly prevalent nowadays, constituting a major cause of distress in people’s life with impact on society’s health and well-being. Mental illness is a complex multi-factorial disease associated with individual risk factors and a variety
Tianlin Zhang +3 more
doaj +1 more source
MuSe: The Musical Sentiment Dataset
The MuSe (Music Sentiment) dataset contains sentiment information for 90,001 songs. We computed scores for the affective dimensions of valence, dominance, and arousal, based on the user-generated tags that are available for each song via Last.fm.
Christopher Akiki, Manuel Burghardt
doaj +1 more source
COPIOUS: A gold standard corpus of named entities towards extracting species occurrence from biodiversity literature [PDF]
Background Species occurrence records are very important in the biodiversity domain. While several available corpora contain only annotations of species names or habitats and geographical locations, there is no consolidated corpus that covers all ...
Nhung Nguyen +2 more
doaj +2 more sources

