Parallel corpus - Open Access .click

Results 161 to 170 of about 2,430,336 (342)

Czech-English Parallel Corpus 1.0 (CzEng 1.0)

, 2011
CzEng 1.0 is the fourth release of a sentence-parallel Czech-English corpus compiled at the Institute of Formal and Applied Linguistics (ÚFAL) freely available for non-commercial research purposes. CzEng 1.0 contains 15 million parallel sentences (233
Žabokrtský, Zdeněk +9 more
core

A corpus-based investigation on adverbs in portuguese/english journalistic translations [PDF]

, 2012
Dissertação (mestrado) - Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão, Programa de Pós-Graduação em Letras/Inglês e Literatura Correspondente, Florianópolis, 2009The main goal of this study is showing the differences between ...
Vaz, Maria Augusta Salvia
core

Delineating gender/sex‐related studies through bibliometric analysis

Journal of the Association for Information Science and Technology, EarlyView.
Abstract The multidisciplinary and socially grounded nature of Women's/Gender/Feminist Studies poses unique challenges for bibliometric analysis, as it extends beyond conventional disciplinary boundaries. This paper makes three key contributions: (1) We propose a novel retrieval method for constructing a corpus of scholarly documents in research areas ...
Natsumi S. Shokida, Diego Kozlowski, Vincent Larivière +2 more
wiley +1 more source

Finnish-English parallel corpus fienWaC 1.0

, 2016
The fienWaC corpus version 1.0 consists of parallel Finnish-English texts crawled from the .fi top-level domain for Finland. The corpus was built with Spidextor (https://github.com/abumatran/spidextor), a tool that glues together the output of SpiderLing
Esplà-Gomis, Miquel +4 more
core

Twenty-first century Corpus Workbench: Updating a query architecture for the new millennium [PDF]

, 2011
Corpus Workbench (CWB) is a widely-used architecture for corpus analysis, originally designed at the IMS, University of Stuttgart (Christ 1994). It consists of a set of tools for indexing, managing and querying very large corpora with multiple layers of ...
Hardie, Andrew, Evert, Stefan
core

Methodological challenges in content‐based citation analysis: Expertise, reliability, and the primacy of citance identification

Journal of the Association for Information Science and Technology, EarlyView.
Abstract Content‐based citation analysis seeks to capture the meaning and functions of citations but continues to face unresolved methodological challenges. This study analyzes a stratified sample of library and information science publications to examine how citance segmentation and annotator expertise influence the consistency of classification ...
Zehra Taşkın
wiley +1 more source

Kacenka : parallel corpus of English and Czech texts

, 1997
Parallel corpus, 3,297,283 words. The idea was to create a small parallel corpus which would enable to work with entire texts in translation analysis rather then short extracts.

core +1 more source

A corpus based approach to generalise a chatbot system [PDF]

, 2005
Chatbot tools are computer programs which interact with users using natural languages. Most developers built their systems aiming to fool users that they are talking with real humans.
Abu Shawar, Bayan Aref
core

Emerging research directions in nanoparticle‐based foam control for bioprocessing: a bibliometric approach

Biofuels, Bioproducts and Biorefining, EarlyView.
Abstract Overfoaming remains a critical challenge in industrial bioprocesses, compromising mass transfer, operational stability, and downstream efficiency in bioreactors and wastewater treatment systems. This study provides a bibliometric and scientometric assessment of nanoparticle‐enabled foam control to map technological trends and identify research
Antonio Átila Menezes Ferreira +6 more
wiley +1 more source

Parallel Corpus (EN-LT) of EUR-Lex Documents That Include Terms with the Adjective 'Green' (ELEXIS)

, 2021
Bilingual parallel corpus of the EU English documents containing terms with the adjective 'green' and their Lithuanian translations. The size of the corpus is 4,447,683 words in English, and 3,649,812 words in Lithuanian.
Karlonaitė, Radvilė, Rackevičienė, Sigita +1 more
core

natural language processing
fos: computer and information sciences
corpus linguistics

computation and language cs.cl
computer science - computation and language
translation

languages and literatures
parallel corpora
neural machine translation