Results 291 to 300 of about 17,660,936 (332)
Some of the next articles are maybe not open access.

NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark

Conference on Empirical Methods in Natural Language Processing, 2023
In this position paper, we argue that the classical evaluation on Natural Language Processing (NLP) tasks using annotated benchmarks is in trouble. The worst kind of data contamination happens when a Large Language Model (LLM) is trained on the test ...
Oscar Sainz   +5 more
semanticscholar   +1 more source

Anomaly Detection in Time Series: A Comprehensive Evaluation

Proceedings of the VLDB Endowment, 2022
Detecting anomalous subsequences in time series data is an important task in areas ranging from manufacturing processes over finance applications to health care monitoring.
Sebastian Schmidl   +2 more
semanticscholar   +1 more source

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization

Annual Meeting of the Association for Computational Linguistics
Scientific data visualization plays a crucial role in research by enabling the direct display of complex information and assisting researchers in identifying implicit patterns.
Zhiyu Yang   +12 more
semanticscholar   +1 more source

Bibliometric indicators to evaluate scientific activity

Radiología (English Edition), 2021
Bibliometric indicators have been devised to quantify scientific production and to try to evaluate its impact in the community. In general, bibliometric indicators can be classified according to whether the unit of analysis is the author (individual or group) or journal.
C, García-Villar, J M, García-Santos
openaire   +2 more sources

Evaluation of students’ scientific process skills through reflective worksheets in the inquiry-based learning environments

, 2020
This study aimed to evaluate how 7th-grade students’ scientific process skills changed in the inquiry-based learning environment through reflective worksheets. For this purpose, four inquiry-based activities related to electrical circuits were developed.
Ayfer Mutlu
semanticscholar   +1 more source

ESP: A Scientific Evaluation

The American Journal of Psychology, 1966
Over the last years, extrasensory perception has been increasingly accepted by the scientific community. Its special interest to physicians probably centers around communication, since ESP implies that unspoken feelings or ideas can directly influence others.
John Beloff, C. E. M. Hansel
openaire   +2 more sources

Evaluating Scientific Evidence

2006
Scientific evidence is crucial in a burgeoning number of litigated cases, legislative enactments, regulatory decisions, and scholarly arguments. Evaluating Scientific Evidence explores the question of what counts as scientific knowledge, a question that has become a focus of heated courtroom and scholarly debate, not only in the United States, but in ...
openaire   +2 more sources

Scientific Approach to Job Evaluation

Hospital Topics, 1967
(1967). Scientific Approach to Job Evaluation. Hospital Topics: Vol. 45, No. 10, pp. 46-48.
openaire   +2 more sources

Evaluating scientific personnel

Electrical Engineering, 1957
A performance rating system is described for one of the largest industrial laboratories in the world, employing an extremely heterogeneous scientific personnel whose contributions in many cases are intangible. This situation has many unique facets, involving as it does the comparison of “horses and apples,” and psychological factors in human judgment.
openaire   +1 more source

Evaluation as Scientific Research

Evaluation Review, 1988
Ideal characteristics of a well established area of scientific inquiry are parsimony, generality, coherence, uniqueness, clarity of boundaries, and potential for cumulative inquiry. The role of evaluation as grazing area for varied species of social science and the entrepreneurial environments of practice have led it to try to define itself by method ...
openaire   +1 more source

Home - About - Disclaimer - Privacy