Introduction
Version 1.0.1 | Last Updated May 31, 2022
This handbook is for California State employees who want to publish open data on the State’s Open Data Portal (https://data.ca.gov).
The guidance on uploading and publishing data (sections 4 and 5) only applies to direct publishing on https://data.ca.gov. However, the rest of the guidance establishes minimum expectations for preparing data for publishing.
You can see a list of State organizations that maintain their own open data portals. Reach out to your portal administrator or data coordinator if you have questions about publishing on those portals.
This guidance will evolve and grow with feedback. Throughout the handbook, a megaphone emoji (📣) marks places where we are seeking feedback on additional guidance. Send any feedback to opendata@state.ca.gov.
Before diving in, it's important to understand a common definition of open data. This handbook guides you through publishing in a way that is consistent with this definition.
The Open Knowledge Foundation has developed a standard open data and content definition, summarized below.
Open data is data that can be freely used, shared, and built on by anyone, anywhere, for any purpose.
Building on this, open data must be openly licensed, accessible, machine readable, and published in an open format.
You can read more about what open data is, what it is not, and the value propositions for open data in the reference section of this handbook.
Publishing a new open dataset takes some planning and coordination, but it doesn’t have to be difficult. This handbook is designed to provide a reference guide you can return to as you go.
Get started by:
Skimming the handbook to familiarize yourself with its content
Starting immediately with the pre-publishing checklist
Bookmarking and coming back as you move through getting your data ready for publishing
The handbook is divided into sections that line up with a general publishing process. The diagram below shows those steps, which are also listed with links to each section.
Publishing process steps
Review the pre-publishing checklist. Summarizes things to get started on and be aware of early in the process to minimize surprises later on.
Prepare data for publishing. Guidance on preparing your dataset for publishing, including identifying and carrying out any necessary cleaning and merging of data.
Create metadata and data dictionary. Guidance on minimum metadata and documentation needed to make the dataset useful to others.
Upload the dataset. Guidance on uploading the dataset to the open data portal for those publishing directly to data.ca.gov.
Get final publishing approval. Guidance on getting final approval to make the dataset publicly available.
Update and maintain the dataset. Guidance on ways to make sure the dataset is updated and maintained appropriately.