What is Open Science/Open Research?
The terms "open science" and "open data", and the idea of open research more broadly, are becoming ubiquitous in academic discourse. Far from being merely a trendy concept, though, open research is an important ideal and set of principles and practices for the scholarly community. But what exactly do we mean when we say open research, open science, and open data?
Benefits of Open Research
RDM and Open Research
How do research data management best practices contribute to open research? The ideals of open research go beyond merely making research and associated data available, however important that aspect may be. Shared research and data should also be well-organized, well-documented, readily understandable and interpreted, and reusable.
We believe in shifting attitudes, to go "beyond compliance" toward an ethos of stewardship and expanded sense of responsibility for producing and sharing research that has been conducted and managed with care. The principles and practices of research data management serve as recommendations and guidelines for how research data should be handled so that it reflects and embodies open ideals.
Moving Toward Reproducibility
Modes of analysis that make use of computation, code, and other sorts of processing on digital data are trending toward ubiquity in modern research. Because of the affordances of computational analysis, there is an opportunity to ensure that such research is as reproducible as possible. While the word 'reproducible' can mean different things in the context of research, here we want to focus on the idea of computational reproducibility, defined by the NASEM as "obtaining consistent computational results using the same input data, computational steps, methods, code, and conditions of analysis."
Some recommended practices for designing and using reproducible workflows are ones we have discussed elsewhere in this guide, such as organizing your files into a clear directory structure and creating detailed documentation. There are also practices specific to this topic, including:
Literate Programming
One highly useful concept for creating reproducible workflows in your research is literate programming. Literate programming is an approach to writing code that embeds snippets or chunks of executable code in a document that contains natural language explanations of the operations and analysis.
Jupyter notebooks is one popular application for writing "executable documents" in this style, which supports Python, R, and Julia. RStudio offers R Markdown, which works similarly to Jupyter notebooks and supports several languages. Quarto is an outgrowth of R Markdown that bills itself as an "open-source scientific and technical publishing system" which combines support for literate programming with interactive/dynamic features and additional integrations.
If you use R for your research, one powerful and flexible combination that may help you design reproducible workflows is using RStudio (and the capabilities of R Markdown mentioned previously) together with Git and GitHub. There is an R package called 'usethis' that allows you to execute Git commands from the R console and view commits in the RStudio window. This will allow you to create a repository and perform version control all from the same program that you write your analytical code in. For more information on this, see the RStudio & Version Control link below.
Workflow Capture
Another available option for reproducible workflow development is workflow capture software (also sometimes referred to as integrated workflow programs or other similar monikers). These programs were created to allow researchers to design workflows in a systematic way. Some examples of such software include Kepler and Taverna. This software is highly specialized and beyond the scope of this guide.
The Open Science Framework (OSF) is an open source, web-based project management tool created and maintained by the Center for Open Science (COS), a non-profit devoting effort to "proactively reform the norms and reward system in science and elevate rigor, transparency, sharing, and reproducibility."
The functionality of OSF is centered around projects, which are collaborative workspaces that offer various features including uploading files, version control, setting permissions for collaborators, and integrations with various external tools.
(Image sourced from the OSF website, which is licensed under a CC BY-SA 4.0 license)
OSF Features
If your research project is collaborative and you want a platform that will support your work, or if you have an interest in sharing parts of your research (such as methodology, plan for analysis, etc.) publicly while the project is ongoing, OSF may be a good choice for you. Some useful features of OSF include:
The University of Virginia is an OSF Institutions member, meaning you can sign in using your UVA credentials and affiliate your projects with UVA.
Interested in using OSF to manage your project or as a collaboration platform and have questions? Please reach out to us for more information.