This documentation is for the deprecated Transitland v1 platform.
Feeds and feed versions
Transitland is built on publicly available GTFS data contributed by our user community to the Transitland Atlas. Detailed information is kept on each Feed, and updated whenever a new version of each Feed is discovered. Feed versions are archived for download (as .zip
files) and imported into the Transitland Datastore for API querying by operators, stops, routes, schedules, etc.
Feeds
A Feed represents a unique GTFS data source. Each Feed has a URL to a publicly accessible GTFS archive, a mapping of GTFS agency_id
values to Transitland Operators, the geographic extent of the Feed, and the details of the Feed's license.
Feed data model
Attribute | Type | Description |
---|---|---|
onestop_id | Onestop ID | Feed Onestop ID |
url | URL | Publicly accessible GTFS archive |
name | String | Feed name |
feed_format | Enum | Currently: gtfs |
geometry | Geometry | Convex hull of Stops in the Feed |
last_fetched_at | DateTime | Last time the Feed was retrieved |
last_imported_at | DateTime | Last time the Feed was imported |
license_name | String | License name, such as MIT |
license_use_without_attribution | Enum | yes , no , unknown |
license_create_derived_product | Enum | yes , no , unknown |
license_redistribute | Enum | yes , no , unknown |
license_attribution_text | String | Required attribution text |
license_url | URL | URL to Feed License |
tags | Object | Tags |
feed_versions | Feed Versions | Feed Version IDs (SHA1) for this Feed |
active_feed_version | Feed Version | Active Feed Version ID |
operators_in_feed | Object array | Mapping of gtfs agency_id s to Operators |
changesets_imported_from_this_feed | Changesets | Changesets created from Feed |
Feeds API
Endpoint: /api/v1/feeds
Feed license information
To learn more about how Transitland classifies the licenses associated with a feed, see this overview of Transitland legal and licensing issues.
Feed versions
Approximately once per day, the URL for each Feed is checked. When a new version of the Feed is found, a Feed Version is created. The ID for each Feed Version is the SHA1 checksum of the GTFS archive.
Feed versions data model
Attribute | Type | Description |
---|---|---|
sha1 | SHA1 | SHA1 checksum of the GTFS archive |
md5 | MD5 | MD5 checksum of GTFS archive |
feed | Onestop ID | Parent Feed Onestop ID |
fetched_at | DateTime | Time was originally fetched |
url | URL | URL when fetched |
download_url | URL | Archived copy of Feed Version, if allowed |
feedvalidator_url | URL | Archived Google feedvalidator.py report |
earliest_calendar_date | Date | First day of scheduled service |
latest_calendar_date | Date | Last day of scheduled service |
imported_at | DateTime | Last time Feed Version was imported |
import_level | Integer | Import level (0-4) |
import_status | Enum | Import status, such as most_recent_succeeded |
feed_version_imports | IDs | Feed Version Import IDs |
feed_version_infos | IDs | Feed Version Info IDs |
is_active_feed_version | Boolean | true if Feed Version is active |
changesets_imported_from_this_feed_version | Changesets | Changesets created from Feed Version |
Feed versions API
Endpoint: /api/v1/feed_versions
Active feed version
The most recent version of a feed that has been imported into the Transitland Datastore is marked as active. The schedule API endpoint only allows querying of the trips and calendars in the active feed version.
The FeedMaintenance service within Transitland Datastore automatically decides when to import a newly fetched feed version. If no need feed version is available when existing ScheduleStopPairs are about to expire, the FeedMaintenance service will extend them into the future.
Most recent feed version
Are you looking for the most recently fetched Feed Version? This is not necessarily the Feed Version where is_active_feed_version=true
.
To query for the most recently fetched version of a feed, use an API query like: https://transit.land/api/v1/feed_versions?feed_onestop_id=f-9v6-capitalmetro&per_page=1&sort_key=fetched_at&sort_order=desc
(replacing feed_onestop_id
as appropriate).
To directly download a copy of the most recently fetched version of a feed, use: https://transit.land/api/v1/feeds/f-9v6-capitalmetro/download_latest_feed_version
(replacing the Onestop ID as appropriate). Note that downloading is not allowed for Feeds where license_redistribute=no
.
Feed version reports
The Transitland Datastore creates a number of validation and statistical reports for each Feed Version. The currently defined types of reports are:
FeedVersionInfoStatistics
: General statisticsFeedVersionInfoConveyalValidation
: Conveyal gtfs-lib validation results
Additionally, the results of Google's feedvalidator.py HTML output will be stored on the Feed Version as feedvalidator_url
when available. In the future, this may instead be stored as an additional type of report.
Feed version report data model
Attribute | Type | Description |
---|---|---|
id | Integer | ID |
type | Enum | Report type |
feed_version_sha1 | Feed Version | Parent Feed Version |
feed_onestop_id | Onestop ID | Parent Feed |
data | JSON | JSON blob containing report data |
Feed version report API
Endpoint: /api/v1/feed_version_infos
Query parameter | Type | Description | Example |
---|---|---|---|
feed_onestop_id | Onestop ID | Filter by Feed | Caltrain |
feed_version_sha1 | Feed Version | Filter by Feed Version | Caltrain, single Feed Version |
type | Enum | Filter by report type | Caltrain statistics |
FeedVersionInfoStatistics reports
This report contains details about the files in the GTFS archive and basic statistics about the CSV columns and values.
filenames
: The filenames present in the directory of the archive containing the CSV files.statistics
: Data for each GTFS CSV file and column, with thetotal
number of rows with data for that column, theunique
number of values encountered, as well as themin
andmax
values.scheduled_service
: Key-value data for the number of seconds of scheduled service for each date the Feed has scheduled trips.
Example FeedVersionInfoStatistics report
FeedVersionInfoConveyalValidation reports
This report contains the JSON output of Conveyal's gtfs-lib validator.
Example FeedVersionInfoConveyalValidation report
Google feedvalidator.py reports
The HTML output of Google feedvalidator.py. Currently, this is stored on the actual Feed Version record as feedvalidator_url
as a link to a copy of the report stored on S3.
Example Feed Version with Google feedvalidator.py report
Feed version update statistics
The Feeds API also provides simple statistics characterizing when & how a Feed is updated. This includes the total number of Feed Versions, the average number of days between publication of new Feed Versions, the average number of service days in each Feed Version schedule, and the average number of days of overlapping schedule between subsequent Feed Versions. Note: because the fetched_at
value is used to sort the Feed Versions, manually uploaded files are excluded from statistics.
Endpoint: /api/v1/feeds/:feed_onestop_id/feed_version_update_statistics
Response
Attribute | Type | Description |
---|---|---|
feed_onestop_id | Onestop ID | Feed Onestop ID |
feed_versions_total | Integer | Total number of Feed Versions |
feed_versions_filtered | Integer | Total, excluding manually updated Feed Versions |
feed_versions_filtered_sha1 | Feed Versions | Filtered Feed Versions ordered by fetched_at |
fetched_at_frequency | Integer | Average days between new Feed Versions |
scheduled_service_duration_average | Float | Average number of days in each schedule |
scheduled_service_overlap_average | Float | Average number of days overlap between subsequent schedules |