Data Publishing Principles

Assume the data must stand on its own

Once out in the wild, we don’t really have much control over how data will be used or who may rely on it. The documentation should reflect this; we should include as much information as possible while maintaining backreferences to the data’s source.

Publish the right amount of data

Pick an appropriate subset of the data to publish, based on volume, expected usage, and refresh/update frequency. For example, GTFS Schedule is fairly low volume and slow to change, so updating weekly or monthly is more than sufficient.