Results 161 to 170 of about 2,430,336 (342)
Czech-English Parallel Corpus 1.0 (CzEng 1.0)
CzEng 1.0 is the fourth release of a sentence-parallel Czech-English corpus compiled at the Institute of Formal and Applied Linguistics (ÚFAL) freely available for non-commercial research purposes. CzEng 1.0 contains 15 million parallel sentences (233
Žabokrtský, Zdeněk +9 more
core
A corpus-based investigation on adverbs in portuguese/english journalistic translations [PDF]
Dissertação (mestrado) - Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão, Programa de Pós-Graduação em Letras/Inglês e Literatura Correspondente, Florianópolis, 2009The main goal of this study is showing the differences between ...
Vaz, Maria Augusta Salvia
core
Delineating gender/sex‐related studies through bibliometric analysis
Abstract The multidisciplinary and socially grounded nature of Women's/Gender/Feminist Studies poses unique challenges for bibliometric analysis, as it extends beyond conventional disciplinary boundaries. This paper makes three key contributions: (1) We propose a novel retrieval method for constructing a corpus of scholarly documents in research areas ...
Natsumi S. Shokida +2 more
wiley +1 more source
Finnish-English parallel corpus fienWaC 1.0
The fienWaC corpus version 1.0 consists of parallel Finnish-English texts crawled from the .fi top-level domain for Finland. The corpus was built with Spidextor (https://github.com/abumatran/spidextor), a tool that glues together the output of SpiderLing
Esplà-Gomis, Miquel +4 more
core
Twenty-first century Corpus Workbench: Updating a query architecture for the new millennium [PDF]
Corpus Workbench (CWB) is a widely-used architecture for corpus analysis, originally designed at the IMS, University of Stuttgart (Christ 1994). It consists of a set of tools for indexing, managing and querying very large corpora with multiple layers of ...
Hardie, Andrew, Evert, Stefan
core
Abstract Content‐based citation analysis seeks to capture the meaning and functions of citations but continues to face unresolved methodological challenges. This study analyzes a stratified sample of library and information science publications to examine how citance segmentation and annotator expertise influence the consistency of classification ...
Zehra Taşkın
wiley +1 more source
Kacenka : parallel corpus of English and Czech texts
Parallel corpus, 3,297,283 words. The idea was to create a small parallel corpus which would enable to work with entire texts in translation analysis rather then short extracts.
core +1 more source
A corpus based approach to generalise a chatbot system [PDF]
Chatbot tools are computer programs which interact with users using natural languages. Most developers built their systems aiming to fool users that they are talking with real humans.
Abu Shawar, Bayan Aref
core
Abstract Overfoaming remains a critical challenge in industrial bioprocesses, compromising mass transfer, operational stability, and downstream efficiency in bioreactors and wastewater treatment systems. This study provides a bibliometric and scientometric assessment of nanoparticle‐enabled foam control to map technological trends and identify research
Antonio Átila Menezes Ferreira +6 more
wiley +1 more source
Parallel Corpus (EN-LT) of EUR-Lex Documents That Include Terms with the Adjective 'Green' (ELEXIS)
Bilingual parallel corpus of the EU English documents containing terms with the adjective 'green' and their Lithuanian translations. The size of the corpus is 4,447,683 words in English, and 3,649,812 words in Lithuanian.
Karlonaitė, Radvilė +1 more
core

