Results 91 to 100 of about 1,460 (118)
Some of the next articles are maybe not open access.

Chinese Story Generation with FastText Transformer Network

2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), 2019
The sequence transformer models are based on complex recurrent neural network or convolutional networks that include an encoder and a decoder. High-accuracy models are usually represented by used connect the encoder and decoder through an attention mechanism. Story generation is an important thing.
Jhe-Wei Lin, Yu-Che Gao, Rong-Guey Chang
openaire   +1 more source

Devise Sparse Compression Schedulers to Enhance FastText Methods

49th International Conference on Parallel Processing - ICPP : Workshops, 2020
In natural language processing(NLP), the general way to understand the meaning of a word is via word embedding. The word embedding training model can convert words into multidimensional vectors and make the words that do not know “meaning” into vectors with “meaning”. Famous word embedding training models, include models such as FastText, Word2Vec, and
Chen-Ting Chao   +5 more
openaire   +1 more source

Typo Correction in Domain-Specific Texts Using FastText

2020 Innovations in Intelligent Systems and Applications Conference (ASYU), 2020
Analyzing customer reviews are quite important for customer satisfaction. Customer reviews might contain spelling mistakes, which causes data pollution and decreases the efficiency of the analyzes. In this study, a domain-specific solution is proposed by using the data related to tourism.
Ahmet Tugrul Bayrak, Bekir Berker Turker
openaire   +1 more source

Performance Comparison of Word2vec and fastText Embedding Models

Journal of Digital Contents Society, 2020
Word2vec 임베딩 모델은 단순하고 성능이 우수하기 때문에, 자연어 처리 분야에서 가장 널리 쓰이는 모델 중 하나이지만 몇 가지 한계도 있다. 이런 한계를 극복하기 위해 일반적인 언어에 적용 가능한 fastText 임베딩 모델이 제안되었고, 이후 한국어에 적합한 특정한 fastText 모델도 제안되었다. 본 연구는 유사도 검사, 유추 검사 및 감정 분석을 통해 몇 가지 word2vec 및 fastText 모델의 성능을 비교 평가하는 것을 목표로 한다. fastText 모델을 제안한 이전 연구의 결과와는 달리, 최소한 유추 검사와 감정 분석의 측면에서는 fastText 모델이 word2vec 모델보다 더 우수하다고 단정 지을
Hyungsuc Kang, Janghoon Yang
openaire   +1 more source

Hate Speech and Abusive Language Classification using fastText

2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI), 2019
Hate speeches are defined as utterances, writings, actions, or performances that are intended to incite violence or prejudice against a person on the basis of the characteristics of a particular group that he or she is representing, such as race, ethnicity.
Guntur Budi Herwanto   +3 more
openaire   +1 more source

Detecting Webshell Based on Random Forest with FastText

Proceedings of the 2018 International Conference on Computing and Artificial Intelligence, 2018
Web-based remote access Trojan (or webshell) is a kind of tool for network intrusion, which can be uploaded to a website to access web service management authority. Once attacker injected successfully, it can cause great damage so that it is crucial to detect webshell effectively. Webshells are flexible and changeable by using of obfuscation techniques,
Yong Fang   +3 more
openaire   +1 more source

Subwords-Only Alternatives to fastText for Morphologically Rich Languages

Programming and Computer Software, 2021
In this work, we present purely subword-based alternatives to fastText word embedding algorithm The alternatives are modifications of the original fastText model, but rely on subword information only, eliminating the reliance on word-level vectors and at the same time helping to dramatically reduce the size of embeddings.
Tsolak Ghukasyan   +2 more
openaire   +1 more source

fastText (sub)word Vectors

Computational implementations of semantic knowledge represent the meaning of words as numerical vectors, derived from their usage in (natural) language. This methodology, known as distributional semantics, has seen substantial advancements, such as the extension reviewed in this article: fastText.
Bonandrini, R, Gatti, D.
openaire   +1 more source

CLASSIFICATION OF CYBERATTACKS USING THE FASTTEXT MODEL

Scientific and Practical Journal "Materials of Scientific Conferences of the Petro Mohyla Black Sea National University"
The paper presents a research of parsing User Agent Strings (UAS) for machine learning and examines the features of relevant software libraries. A more accurate and structured method for identifying cyberattack characteristics is implemented without relying on complex regular expressions.
Dmytro Zavorotnii, Ivan Burlachenko
openaire   +1 more source

Compromised Tweet Detection Using Siamese Networks and fastText Representations

2019 15th International Conference on Network and Service Management (CNSM), 2019
The aim of this work is to detect compromised users of tweets based on their writing styles. In this paper, we use Siamese Networks to learn a representation of user tweets that allows us to classify them based on a limited amount of ground truth data.
Mihir Joshi   +2 more
openaire   +1 more source

Home - About - Disclaimer - Privacy