Clustering narrow-domain short texts, such as academic abstracts, is an extremely difficult clustering problem. First, short texts lead to low word frequency and sparseness, making clustering results highly unstable and inaccurate; second, the narrow domain leads to great overlap of insignificant words and makes it hard to distinguish between ...
Changzhou Li +9 more
openaire +1 more source
Neuropeptides contain more chemical information than other classical neurotransmitters and have multiple receptor recognition sites. These characteristics allow neuropeptides to have a correspondingly higher selectivity for nerve receptors and fewer side
Di Liu, Zhengkui Lin, Cangzhi Jia
doaj +1 more source
Unstructured Text Documents Summarization With Multi-Stage Clustering
In natural language processing, text summarization is an important application used to extract desired information by reducing large text. Existing studies use keyword-based algorithms for grouping text, which do not give the documents' actual theme. Our
Muhammad Yahya Saeed +3 more
doaj +1 more source
Study on Title Encoding Methods for e-Commerce Downstream Tasks
In an e-Commerce marketplace there are usually many downstream tasks that have (relatively) fewer available resources than the few mainstream priority tasks, like recommendation or search.
Cristian Cardellino, Rafael Carrascosa
doaj +1 more source
Large scale text mining for deriving useful insights: A case study focused on microbiome
Text mining has been shown to be an auxiliary but key driver for modeling, data harmonization, and interpretation in bio-medicine. Scientific literature holds a wealth of information and embodies cumulative knowledge and remains the core basis on which ...
Syed Ashif Jardary Al Ahmed +8 more
doaj +1 more source
The ability to stop malware as soon as it starts spreading will always play an important role in defending computer systems. It would be a huge benefit for organizations as well as society if intelligent defense systems could themselves detect and ...
Kien Tran, Hiroshi Sato, Masao Kubo
doaj +1 more source
Mobile applications (apps) on iOS and Android devices are mostly maintained and updated via the Apple App Store and Google Play, respectively, where users can provide reviews describing their satisfaction with particular apps.
Xiaozhou Li +3 more
doaj +1 more source
Ion channels are the second largest drug target family. Ion channel dysfunction may lead to a number of diseases such as Alzheimer’s disease, epilepsy, cephalagra, and type II diabetes.
Jie Zheng, Xuan Xiao, Wang-Ren Qiu
doaj +1 more source
My last column ended with some comments about Kuhn and word2vec. Word2vec has racked up plenty of citations because it satisfies both of Kuhn’s conditions for emerging trends: (1) a few initial (promising, if not convincing) successes that motivate early adopters (students) to do more, as well as (2) leaving plenty of room for early adopters ...
openaire +1 more source
Vietnamese Text Classification Algorithm using Long Short Term Memory and Word2Vec
In the context of the ongoing fourth industrial revolution and the rapid development of computer science, the amount of textual information has become huge. So, prior to applying the seemingly appropriate methodologies and techniques to the above data processing ...
Huu Nguyen Phat, Nguyen Thi Minh Anh
doaj +1 more source

