Results 91 to 100 of about 975,756 (347)

PlantGFM: A Genomic Foundation Model for Discovery and Creation of Plant Genes

open access: yesAdvanced Science, EarlyView.
A plant genomic foundation model pre‐trained on 12 species enables both accurate gene prediction and de novo gene design. Through AI‐human knowledge screening, seven designed sequences showed transcriptional activity in plants, with two expressing stable proteins—demonstrating the first DNA‐RNA‐protein expression of LLM‐generated genes in plants and ...
Changhao Li   +10 more
wiley   +1 more source

PhosSight: A Unified Deep Learning Framework Boosting and Accelerating Phosphoproteome Identification to Enable Biological Discoveries

open access: yesAdvanced Science, EarlyView.
PhosSight is a unified deep‐learning framework for phosphoproteome identification, featured by a phosphorylation‐aware detectability predictor. It improves identification sensitivity in DDA through deep re‐localization and rescoring, accelerates DIA searches by detectability‐guided spectral library pruning, and expands phosphoproteome coverage to ...
Ben Wang   +10 more
wiley   +1 more source

Annotating and Analyzing Learner Production with UAM Corpus Tool

open access: yesTecnologías para la Investigación en Segundas Lenguas
The increasing integration of learner corpora into Second Language Acquisition (SLA) research has created the need for annotation tools that support systematic, transparent, and multi-layer analyses of learner production.
Aída García-Tejada, Teresa Quesada
doaj   +1 more source

Arabic learner corpus and its potential role in teaching Arabic to non-native speakers [PDF]

open access: yes
The literature on learner corpora (Al-Sulaiti, 2010; Granger & Dumont, 2012) shows that there is a need to compile an Arabic learner corpus, which can be used in research on Arabic language learning and teaching.
Alfaifi, AYG, Atwell, ES
core  

Oral reading tasks as proficiency indicators: insights from a learner corpus study

open access: yes, 2023
This study aims to explore the potential of oral reading tasks to establish learners’ proficiency when compiling learner corpora. Informed by research on oral reading fluency, we selected a text containing a variety of linguistic features and submitted ...
Cilibrasi, Luca   +2 more
core   +1 more source

Automated Extraction of Multicomponent Alloy Data Using Large Language Models for Sustainable Design

open access: yesAdvanced Science, EarlyView.
A large language model (LLM) based pipeline is developed to automatically extract a comprehensive and accurate multicomponent alloy database from literature corpus. The extracted dataset is integrated with sustainability indicators to identify potential alloys that outperform existing industrial benchmark materials in terms of both performance and ...
Aravindan Kamatchi Sundaram   +4 more
wiley   +1 more source

Millest räägivad eesti õppijakeele käändeasendused?

open access: yesLähivõrdlusi, 2011
The article expounds on some trends witnessed in the use of object cases in Estonian. It is a synchronous research basing on Estonian learner language and standard language corpus-driven and corpus-based comparative language usage analysis.
Pille Eslon
doaj   +1 more source

SCIL: A Spanish Corpus of Italian Learners

open access: yesProcedia - Social and Behavioral Sciences, 2013
AbstractIn the last 15 years research into the acquisition of Spanish as a Foreign/Second language has seen a growing interest in the building of learner corpora but, in most cases, they collect Spanish interlanguage of English-speaking learners. SCIL is a longitudinal Spanish Corpus of Italian Learners and consists of 457 written compositions (124,186
openaire   +2 more sources

Defining Frailty in Chinese‐Language Biomedical Literature (2014–2024): A Decade of Conceptual Evolution

open access: yesAGING MEDICINE, EarlyView.
Frailty definitions in Chinese‐language biomedical literature increasingly align with international frameworks, while retaining conceptual diversity, underscoring the need for multidimensional, cross‐cultural research and integration with traditional Chinese medicine for culturally sensitive clinical practice.
Haodong Wei   +5 more
wiley   +1 more source

MERLIN Written Learner Corpus for Czech, German, Italian 1.0

open access: yes, 2014
The MERLIN corpus is a written learner corpus for Czech, German, and Italian that has been designed to illustrate the Common European Framework of Reference for Languages (CEFR) with authentic learner data.
Boyd, Adriane   +19 more
core  

Home - About - Disclaimer - Privacy