Slovak morphological tokenizer using the Byte-Pair Encoding algorithm [PDF]
This study introduces a new approach to text tokenization, SlovaK Morphological Tokenizer (SKMT), which integrates the morphology of the Slovak language into the training process using the Byte-Pair Encoding (BPE) algorithm.
Dávid Držík, Frantisek Forgac
doaj +3 more sources
Predicate signatures from pair encodings via dual system proof technique
Recently, Attrapadung (Eurocrypt 2014) proposed a generic framework for fully (adaptively) secure predicate encryption (PE) based on a new primitive, called pair encodings.
Nandi Mridul, Pandit Tapas
doaj +2 more sources
GraphBPE: Molecular Graphs Meet Byte-Pair Encoding
accepted by ICML 2024 AI for Science ...
Yuchen Shen, Barnabás Póczos
openaire +3 more sources
Brain Activation during Memory Encoding in Type 2 Diabetes Mellitus: A Discordant Twin Pair Study [PDF]
Type 2 diabetes mellitus increases the risk of dementia and neuronal dysfunction may occur years before perceptible cognitive decline. We aimed to study the impact of type 2 diabetes on brain activation during memory encoding in middle-aged people ...
Amanda G. Wood +7 more
doaj +3 more sources
MGANSL: multi-network representation generating with generative adversarial network for synthetic lethality prediction [PDF]
Background: Cancer is a complex disease that arises from the simultaneous mutations of multiple biological molecules. An effective therapeutic strategy is to exploit synthetic lethality (SL) by targeting the SL partner of cancer driver genes ...
Jinxin Li +5 more
doaj +2 more sources
Edge-Deployable Fish Feeding-State Quantification and Recognition via Frame-Pair Motion Encoding and EfficientFeedingNet [PDF]
Accurate feeding-state monitoring is essential for improving feeding management, reducing feed waste, and supporting water quality and fish welfare in aquaculture.
Yuchen Xiao +7 more
doaj +2 more sources
Relativistically invariant encoding of quantum information revisited
In this work, we provide a detailed analysis of the issue of encoding of quantum information which is invariant with respect to arbitrary Lorentz transformations. We significantly extend already known results and provide compliments where necessary.
Konrad Schlichtholz, Marcin Markiewicz
doaj +2 more sources
IMPLEMENTASI ALGORITMA BYTE PAIR ENCODING UNTUK KOMPRESI FILE
Data compression is a field that focuses on forming a small output file from a large file. The need for compression has given birth to several methods that can be implemented in communication activities on the network as well as on storage activities ...
Ruslida, Ahmad Meidy +2 more
core +1 more source
Gene encoding the large subunit of As(III) oxidase (AioA), an important component of the microbial As(III) oxidation system, is a widely used biomarker to characterize As(III)-oxidizing communities in the environment.
Min Hu +9 more
doaj +1 more source
Byte Pair Encoding is Suboptimal for Language Model Pretraining [PDF]
The success of pretrained transformer language models (LMs) in natural language processing has led to a wide range of pretraining setups. In particular, these models employ a variety of subword tokenization methods, most notably byte-pair encoding (BPE) (Sennrich et al., 2016; Gage, 1994), the WordPiece method (Schuster and Nakajima, 2012), and unigram
Kaj Bostrom, Greg Durrett
openaire +2 more sources

