Using a Replication
This section of the documentation is for people using a replication package built with the Boa Study Template.
If you downloaded a replication package that utilizes the Boa Study Template, please continue reading for more details on how to use that package.
We strongly encourage you to utilize Visual Studio
Code with the Boa Language and Infrastructure
Extension installed. If you have that extension, you can avoid
running terminal commands (like make
) and instead simply open the
study-config.json
file to control what files you download, which analyses you
run, etc.
If you plan to extend the replication package, beyond simply re-generating the paper's figures and tables, you will want to read up on Using the Template.
Requirements
You need a GNU Make compatible build system. Tested on GNU Make 3.81, but should work with newer versions.
If you plan to re-run any of the analyses, you will also need Python. See the Python requirements for more information.
Note that the Boa jobs themselves should be marked public, so you do not need a Boa user to view the actual jobs/output via the website. However, the Boa API requires a user/password to use it, so programmatically downloading (even public jobs) currently requires authenticating. You can, however, manually download each of the query outputs from the public URLs.
Docker Support
If you have Docker installed, you can use the provided Dockerfile
to build an
image capable of running all the scripts. This is the easiest way to get a
working environment. To build and run the image, run:
make run-docker
Once inside the container, you can run the make reproduce
command to
re-generate the figures and tables from the original paper. This will use the
cached data and should avoid having to download output from Boa.
File Organization
The organization of a replication package is the same as in the original study. See the page on Paths for more information.
Getting Boa Output
NOTE: This step is only needed if you don't already have the output
downloaded! If you downloaded data.zip
, you can skip this step!
The first step is to run Boa queries to generate output TXT data for further
processing: make txt
Processing the Boa Output
The Boa output is in a custom format, so first we convert it all into standard
CSV format: make csvs
If you use the make data
command instead of manually obtaining the outputs,
you do not need to do anything else as it will call this target for you.
Generating Figures and Tables
To generate all the figures and tables for the paper, you need to run the analysis for each specific research question on the output from Boa. There is also a helper target to run all analyses:
make analysis
Note that this triggers download of any missing Boa query outputs. Or you can run all analyses on only the cached data:
make reproduce
If you want to run individual analyses, you can also do so. The specific target names will vary based on the specific replication package, but often they are named based on research question:
make rq1
make rq2
You can also run a single analysis on the cached data by adding -reproduce
to
the target name:
make rq1-reproduce
make rq2-reproduce