stringi: Fast and Portable Character String Processing in R
Effective processing of character strings is required at various stages of data analysis pipelines: from data cleansing and preparation, through information extraction, to report generation. Pattern searching, string collation and sorting, normalization,
Marek Gagolewski
doaj +1 more source
Enhancing Regular Expressions For Polish Text Processing
The paper presents proposition of regular expressions engine based on the modified Thompson’salgorithm dedicated to the Polish language processing.
Krzysztof Dorosz, Anna Szczerbińska
doaj +1 more source
Evaluation of the quality of the Voluntary Geographic Information for the road network in Bogotá D.C
The production of Voluntary Geographic Information has been growing considerably and continues to be an active area of research. However, the lack of knowledge about the quality of information generated on a voluntary and participatory basis raises ...
Luis A. Niño Beltran +3 more
doaj +1 more source
Automatic repair of regular expressions
We introduce RFixer, a tool for repairing complex regular expressions using examples and only consider regular expressions without non-regular operators (e.g., negative lookahead).
Rong Pan +3 more
semanticscholar +1 more source
A search for improved performance in regular expressions [PDF]
The primary aim of automated performance improvement is to reduce the running time of programs while maintaining (or improving on) functionality. In this paper, Genetic Programming is used to find performance improvements in regular expressions for an ...
Brendan Cody-Kenny +5 more
semanticscholar +1 more source
Determine point-to-point networking interactions using regular expressions
As Internet growth and becoming more popular, the number of concurrent data flows start to increasing, which makes sense in bandwidth requested. Providers and corporate customers need ability to identify point-to-point interactions.
Konstantin S. Deev, Yuriy V. Boyko
doaj +1 more source
Pattern Matching in YARA: Improved Aho-Corasick Algorithm
YARA is a tool for pattern matching used by malware analysts all over the world. YARA can scan files, as well as process memory. It allows us to define sequences of symbols as text strings, hexadecimal strings and regular expressions. However, the use of
Dominika Regeciova +2 more
doaj +1 more source
Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure. [PDF]
Garvin JH +11 more
europepmc +3 more sources
Derivatives of Regular Expressions with Lookahead
: Lookahead is an extension of regular expressions that has been adopted in many implementations and is widely used. Lookahead represents what is allowed as the rest of input. Morihata developed a conversion from regular expressions with lookahead (REwLA)
Takayuki Miyazaki, Yasuhiko Minamide
semanticscholar +1 more source
Regular Expression Based Medical Text Classification Using Constructive Heuristic Approach
Medical text classification assigns medical related text into different categories such as topics or disease types. Machine learning based techniques have been widely used to perform such tasks despite the obvious drawback in such “black box ...
Menglin Cui +5 more
doaj +1 more source

