Research Progress and Implementation of FAIR Principles for Scientific Data Management [PDF]
[Purpose/Significance] With the development of data-intensive scientific research paradigm, the effective discovery and reuse of scientific data is of great significance to the sharing of research results.
CHEN Shuxian, LIU Guifeng, LIU Qiong
doaj +1 more source
ChatGPT for Teaching and Learning: An Experience from Data Science Education [PDF]
ChatGPT, an implementation and application of large language models, has gained significant popularity since its initial release. Researchers have been exploring ways to harness the practical benefits of ChatGPT in real-world scenarios. Educational researchers have investigated its potential in various subjects, e.g., programming, mathematics, finance,
arxiv +1 more source
Prospecting (in) the data sciences [PDF]
Data science is characterized by engaging heterogeneous data to tackle real world questions and problems. But data science has no data of its own and must seek it within real world domains. We call this search for data “prospecting” and argue that the dynamics of prospecting are pervasive in, even characteristic of, data science.
Stephen C Slota+3 more
openaire +4 more sources
Trust in Data Science: Collaboration, Translation, and Accountability in Corporate Data Science Projects [PDF]
The trustworthiness of data science systems in applied and real-world settings emerges from the resolution of specific tensions through situated, pragmatic, and ongoing forms of work. Drawing on research in CSCW, critical data studies, and history and sociology of science, and six months of immersive ethnographic fieldwork with a corporate data science
arxiv +1 more source
Big-Data-Driven Materials Science and its FAIR Data Infrastructure [PDF]
This chapter addresses the forth paradigm of materials research -- big-data driven materials science. Its concepts and state-of-the-art are described, and its challenges and chances are discussed.
A Agrawal+74 more
core +3 more sources
Opinionated practices for teaching reproducibility: motivation, guided instruction and practice [PDF]
In the data science courses at the University of British Columbia, we define data science as the study, development and practice of reproducible and auditable processes to obtain insight from data. While reproducibility is core to our definition, most data science learners enter the field with other aspects of data science in mind, for example ...
arxiv +1 more source
Developing a population-scale harmonised ethnicity-spine in Wales.
The COVID-19 pandemic has placed a spotlight on existing and enduring health inequalities experienced by different ethnic groups. There has been a longstanding call to generate and improve the use of ethnicity data available across different data ...
Ashley Akbari+10 more
doaj +1 more source
Defining data science: a new field of inquiry [PDF]
Data science is not a science. It is a research paradigm. Its power, scope, and scale will surpass science, our most powerful research paradigm, to enable knowledge discovery and change our world. We have yet to understand and define it, vital to realizing its potential and managing its risks. Modern data science is in its infancy.
arxiv
Data-driven predictions in the science of science
The desire to predict discoveries—to have some idea, in advance, of what will be discovered, by whom, when, and where—pervades nearly all aspects of modern science, from individual scientists to publishers, from funding agencies to hiring committees. In this Essay, we survey the emerging and interdisciplinary field of the “science of science” and what ...
Aaron Clauset+4 more
openaire +4 more sources
ICSU and the Challanges of Data and Information Management for International Science
The International Council for Science (ICSU) vision explicitly recognises the value of data and information to science and particularly emphasises the urgent requirement for universal and equitable access to high quality scientific data and information ...
Peter Fox, Ray Harris
doaj +1 more source