Published tax research (PBO costings, Grattan reform papers, Tax
Institute briefs) has a reproducibility bar that goes beyond “I called
ato_individuals() and summed column X.” Reviewers need to
verify that the data you used is exactly the data you say you used.
ato provides four features to meet that bar:
ind <- ato_individuals_postcode(
year = c("2020-21", "2021-22", "2022-23"),
state = "NSW"
)
companies <- ato_companies(year = "2022-23", table = "industry")
tax_gap <- ato_tax_gaps()Each ato_tbl prints with the snapshot pin and SHA-256
digest in its provenance header.
A DOI makes “retrieved from data.gov.au on 2026-04-24” citable and
immutable. Your paper then cites
doi:10.5281/zenodo.XXXXXXXX instead of a URL that might
rotate.
dep <- ato_deposit_zenodo(
title = "ATO data snapshot for working paper v1",
creators = list(list(name = "Author, A.", orcid = "0000-0000-0000-0000")),
upload = FALSE # dry run; inspect payload first
)
dep$payload$metadata$title
# When ready to actually deposit:
# Sys.setenv(ZENODO_TOKEN = "...your token...")
# dep <- ato_deposit_zenodo(upload = TRUE)
# dep$doi_prereserve