California Open Data Publisher's Handbook
Documents and ResourcesCA Open Data Portal
  • Introduction
  • ☑️1. Review the Pre-Publishing Checklist
  • 📈2. Prepare Data for Publishing
  • 📙3. Create Metadata and Data Dictionary
  • 🔼4. Upload the Dataset
  • 👍5. Get Final Publishing Approval
  • 🔄6. Update and Maintain the Dataset
  • 📣Feedback & Help
  • Reference
    • The What and Why of Open Data
    • Open Data Portals Managed by State Entities
    • Data Preparation and Formatting Guidance
      • Column Headers and Order
      • Date and Time
      • Text
      • Numeric
      • Addresses
    • Metadata Field Definitions
    • Data Dictionary: What to Include
    • Detailed Steps for Uploading Data to the Portal
    • Email Templates
    • Glossary
    • Acknowledgements
    • Version and Changelog
Powered by GitBook
On this page
  • Identify your publishing team
  • Key open data publishing roles defined
  • Find the person that fills each of the roles
  • Form your team
  • Start documenting your data
  • Identify possible needs for publishing support
  • Considerations for automated publishing
  • Considerations for manual publishing

Was this helpful?

Edit on GitHub
Export as PDF

1. Review the Pre-Publishing Checklist

PreviousIntroductionNext2. Prepare Data for Publishing

Last updated 1 year ago

Was this helpful?

Identify your publishing team

It takes a team to deliver high quality open data. There are 3 roles that will help you ensure publishing moves forward. They are pictured below and described in the checklist following. In some cases a single person may fulfill multiple roles, but you shouldn't be on your own entirely.

Key open data publishing roles defined

The Data Steward is the person most knowledgeable about the data including the sources, collection methods, and limitations.

The Data Coordinator acts as a liaison between internal Information Technology staff, organizational programs and leadership, and portal managers.

The Data Custodian is the person most knowledgeable about how the data is stored and protected and have technical knowledge on how to query and extract data.

They prepare data for publishing on the portal and work with Data Custodians for any system access needs and work with the Data Coordinator for publishing approval.

They are best positioned to convey to the appropriate parties any specific needs of the open data portal and program. They are trusted partners in open data within their organization.

They advise and help with data access and navigate technical options for automation.

Find the person that fills each of the roles

Others may be needed as you move through the process

These three roles are core, but you may need to bring in others as you move through publishing and approval. Legal and communications staff, for example, may be needed in your process. You can rely on your data coordinator to advise on who else to involve.

Form your team

Start documenting your data

Start identifying the fields you want to publish. This will help you and others clarify what you are planning to publish. Keep your documentation template (linked below) somewhere safe as you'll return to it again as you move toward publishing data.

Hold on to your data documentation template, you'll need it later

Identify possible needs for publishing support

If your dataset needs to be published on a regular schedule, it's good to start thinking of what to do about that now. Even if you don't need automation, thinking about how to help others publish the data early will save you headaches later.

Considerations for automated publishing

When data is published more frequently than quarterly, we highly recommend automation. Each organization will vary in its approach, but you can work with your IT department to investigate whether automation is possible.

If you're updating your dataset quarterly or more frequently:

Pro tip

Consider alternate publishing strategies like initially publishing manually and then following up with automation when resources are ready.

Considerations for manual publishing

If automation is not possible, or this is a dataset that gets updated infrequently (like once a year), you may update data manually. You'll still want to make sure the approach to publishing is well documented so you can easily cross-train others on updates. Start with the following:

Data Steward

Data Coordinator

Data Custodian

By the time you publish, you should create meaningful definitions, which is covered in step . You'll use this same template in that section. You will eventually copy certain elements over when you upload your dataset on the portal in section .

Discuss with your data custodian and other IT contacts what's possible. .

As you prepare your data, document your steps for producing the dataset. Going through a process to prepare can help clarify the business rules and logic that your team will need to automate. has more on this.

As you prepare your data, document your steps for producing the dataset. Create a data update procedure document that is easy for you and others to understand. has more on this.

3 Create Metadata and Data Dictionary
4 Upload the Dataset
Step 2. Prepare Data for Publishing
Step 2. Prepare Data for Publishing
☑️
send an email to the open data team to verify
Page cover image
27KB
California Open Data Metadata Template.xlsx
you can use this email template as a starting point
See this template email for a starting point