Results 61 to 70 of about 1,551,832 (303)
Extracting a Topic Specific Dataset from a Twitter Archive [PDF]
Datasets extracted from the microblogging service Twitter are often generated using specific query terms or hashtags. We describe how a dataset produced using the query term ‘syria’ can be increased in size to include tweets on the topic of Syria that do not contain that query term.
Beatrice Alex+4 more
openaire +2 more sources
Unsupervised machine learning is utilized as a part of the process of topic modeling to discover dormant topics hidden within a large number of documents.
Nassera HABBAT+3 more
doaj +1 more source
Consensus molecular subtypes (CMS1‐4) have been identified to study colorectal cancer heterogeneity and serve as potential biomarkers. In this study, we developed and evaluated NanoCMSer, a NanoString‐based classifier using 55 genes, optimized for FF and FFPE to facilitate the clinical evaluation of CMS subtyping.
Arezo Torang+10 more
wiley +1 more source
Recurrent Coupled Topic Modeling over Sequential Documents [PDF]
The abundant sequential documents such as online archival, social media and news feeds are streamingly updated, where each chunk of documents is incorporated with smoothly evolving yet dependent topics. Such digital texts have attracted extensive research on dynamic topic modeling to infer hidden evolving topics and their temporal dependencies. However,
arxiv
Large multidimensional digital images of cancer tissue are becoming prolific, but many challenges exist to automatically extract relevant information from them using computational tools. We describe publicly available resources that have been developed jointly by expert and non‐expert computational biologists working together during a virtual hackathon
Sandhya Prabhakaran+16 more
wiley +1 more source
Finding Topic Experts in the Twitter Dataset Using LDA Algorithm
In microblogging services like Twitter, the expert judgment problem has gained increasing attention in social media. Twitter is a new type of social media that provides a publicly available way for users to publish 140-character short messages (tweets). However, previous methods cannot be directly applied to twitter experts finding problems.
Ashwini Anandrao Shirolkar+1 more
openaire +1 more source
Targeted metabolomics reveals novel diagnostic biomarkers for colorectal cancer
This study employed targeted metabolomic profiling to identify 302 distinct metabolites present in platelet‐rich plasma (PRP), revealing aberrant metabolic profiles amongst individuals diagnosed with colorectal cancer (CRC). Compared to carcinoembryonic antigen (CEA) and cancer antigen 19‐9 (CA199), our metabolite panel showed improved sensitivity ...
Zuojian Hu+7 more
wiley +1 more source
Zero-Shot Stance Detection: A Dataset and Model using Generalized Topic Representations [PDF]
Emily Allaway, Kathleen McKeown
openalex +3 more sources
Using Topic Modeling Methods for Short-Text Data: A Comparative Analysis
With the growth of online social network platforms and applications, large amounts of textual user-generated content are created daily in the form of comments, reviews, and short-text messages.
Rania Albalawi+2 more
doaj +1 more source
Recurrent Embedded Topic Model
In this paper we propose the Recurrent Embedded Topic Model (RETM) which is a modification of the Embedded Topic Modelling (ETM) by reusing the Continuous Bag of Words (CBOW) that the model had implemented and applying it to a recurrent neural network ...
Carlos Vargas, Hiram Ponce
doaj +1 more source