Is a Pretrained Model the Answer to Situational Awareness Detection on Social Media? [PDF]
Social media can be valuable for extracting information about an event or incident on the ground. However, the vast amount of content shared, and the linguistic variants of languages used on social media make it challenging to identify important ...
Lee, Kahhe, Lo, Siaw Ling, Zhang, Yuhao
core +4 more sources
An Investigation of Structures Responsible for Gender Bias in BERT and DistilBERT
In recent years, large Transformer-based Pre-trained Language Models (PLM) have changed the Natural Language Processing (NLP) landscape, by pushing the performance boundaries of the state-of-the-art on a wide variety of tasks. However, this performance gain goes along with an increase in complexity, and as a result, the size of such models (up to ...
Thibaud Leteno +3 more
openaire +2 more sources
A Feasible and Explainable Network Traffic Classifier Utilizing DistilBERT
While user-oriented service industries are rapidly growing, various network devices provide these services through different access paths. Accordingly, the network flow is also increasing explosively. As demand for management related to limited network resources increases, the network traffic classification grows to prominence.
Chang-Yui Shin +3 more
openaire +2 more sources
Analysing Natural Language Processing Techniques: A Comparative Study of NLTK, spaCy, BERT, and DistilBERT on Customer Query Datasets [PDF]
Sentiment analysis within customer queries stems from its critical role in shaping the perception of a company’s brand. Poor handling of customer queries may lead to adverse consequences.
De Camillis, Patrizia
core +1 more source
A PERFORMANCE ANALYSIS OF SUGGESTION MINING IN MOBILE APP REVIEWS USING LARGE LANGUAGE MODELS TRANSFER LEARNING TECHNIQUES [PDF]
One of the enduring issues with obtaining user feedback is to find effective ways of actionable improvement recommendations from this feedback for quality mobile app development.
Makarand Lotan Mali , Nitin N. Patil
doaj +1 more source
Authorship identification using ensemble learning
With time, textual data is proliferating, primarily through the publications of articles. With this rapid increase in textual data, anonymous content is also increasing.
Ahmed Abbasi +5 more
doaj +1 more source
Efficient Document Re-Ranking for Transformers by Precomputing Term Representations
Deep pretrained transformer networks are effective at various ranking tasks, such as question answering and ad-hoc document ranking. However, their computational expenses deem them cost-prohibitive in practice.
Frieder, Ophir +5 more
core +1 more source
Multilingual Fine-Grained Named Entity Recognition [PDF]
The “MultiCoNER II Multilingual Complex Named Entity Recognition” task\footnote[1]{\url{https://multiconer.github.io}} within SemEval 2023 competition focuses on identifying complex named entities (NEs), such as the titles of creative works (e.g., songs,
Viorica-Camelia Lupancu, Adrian Iftene
doaj +1 more source
Data Augmentation Methods for Enhancing Robustness in Text Classification Tasks
Text classification is widely studied in natural language processing (NLP). Deep learning models, including large pre-trained models like BERT and DistilBERT, have achieved impressive results in text classification tasks.
Huidong Tang +2 more
doaj +1 more source
Evaluating the Performance Impact of Fine-Tuning Optimization Strategies on Pre-Trained DistilBERT Models Towards Hate Speech Detection in Social Media [PDF]
Hate speech can be defined as forms of expression that incite hatred or encourage violence towards a person or group based on race, religion, gender, or sexual orientation.
McGovern, Aidan
core +1 more source

