Czech-English Parallel Corpus 1.0 (CzEng 1.0)

open access: yes, 2011
CzEng 1.0 is the fourth release of a sentence-parallel Czech-English corpus compiled at the Institute of Formal and Applied Linguistics (ÚFAL) freely available for non-commercial research purposes. CzEng 1.0 contains 15 million parallel sentences (233
Žabokrtský, Zdeněk   +9 more
core  

A corpus-based investigation on adverbs in Portuguese/English journalistic translations [PDF]

open access: yes, 2012
Master's dissertation - Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão, Programa de Pós-Graduação em Letras/Inglês e Literatura Correspondente, Florianópolis, 2009. The main goal of this study is to show the differences between ...
Vaz, Maria Augusta Salvia
core  

Delineating gender/sex‐related studies through bibliometric analysis

open access: yes, Journal of the Association for Information Science and Technology, EarlyView.
Abstract The multidisciplinary and socially grounded nature of Women's/Gender/Feminist Studies poses unique challenges for bibliometric analysis, as it extends beyond conventional disciplinary boundaries. This paper makes three key contributions: (1) We propose a novel retrieval method for constructing a corpus of scholarly documents in research areas ...
Natsumi S. Shokida   +2 more
wiley   +1 more source

Finnish-English parallel corpus fienWaC 1.0

open access: yes, 2016
The fienWaC corpus version 1.0 consists of parallel Finnish-English texts crawled from the .fi top-level domain for Finland. The corpus was built with Spidextor (https://github.com/abumatran/spidextor), a tool that glues together the output of SpiderLing
Esplà-Gomis, Miquel   +4 more
core  

Twenty-first century Corpus Workbench: Updating a query architecture for the new millennium [PDF]

open access: yes, 2011
Corpus Workbench (CWB) is a widely-used architecture for corpus analysis, originally designed at the IMS, University of Stuttgart (Christ 1994). It consists of a set of tools for indexing, managing and querying very large corpora with multiple layers of ...
Hardie, Andrew, Evert, Stefan
core  

Methodological challenges in content‐based citation analysis: Expertise, reliability, and the primacy of citance identification

open access: yes, Journal of the Association for Information Science and Technology, EarlyView.
Abstract Content‐based citation analysis seeks to capture the meaning and functions of citations but continues to face unresolved methodological challenges. This study analyzes a stratified sample of library and information science publications to examine how citance segmentation and annotator expertise influence the consistency of classification ...
Zehra Taşkın
wiley   +1 more source

Kacenka : parallel corpus of English and Czech texts

open access: yes, 1997
Parallel corpus, 3,297,283 words. The idea was to create a small parallel corpus that would make it possible to work with entire texts in translation analysis rather than short extracts.

core   +1 more source

A corpus based approach to generalise a chatbot system [PDF]

open access: yes, 2005
Chatbot tools are computer programs that interact with users in natural language. Most developers built their systems with the aim of fooling users into believing that they are talking with real humans.
Abu Shawar, Bayan Aref
core  

Emerging research directions in nanoparticle‐based foam control for bioprocessing: a bibliometric approach

open access: yes, Biofuels, Bioproducts and Biorefining, EarlyView.
Abstract Overfoaming remains a critical challenge in industrial bioprocesses, compromising mass transfer, operational stability, and downstream efficiency in bioreactors and wastewater treatment systems. This study provides a bibliometric and scientometric assessment of nanoparticle‐enabled foam control to map technological trends and identify research
Antonio Átila Menezes Ferreira   +6 more
wiley   +1 more source

Parallel Corpus (EN-LT) of EUR-Lex Documents That Include Terms with the Adjective 'Green' (ELEXIS)

open access: yes, 2021
Bilingual parallel corpus of the EU English documents containing terms with the adjective 'green' and their Lithuanian translations. The size of the corpus is 4,447,683 words in English, and 3,649,812 words in Lithuanian.
Karlonaitė, Radvilė   +1 more
core  
