Problem
We need to align how we represent the organizations related to a dataset with Google's expectations, for the purposes of SEO.
The current draft data model for Dataset API Discovery includes:
"publisher": {
  "@type": "Organization",
  "name": "Fusion Lifestyle",
  "description": "Fusion Lifestyle was established in April 2000 ...",
  "email": "info@fusion-lifestyle.com",
  "legalName": "Fusion Lifestyle",
  "logo": {
    "@type": "ImageObject",
    "url": "https://res.cloudinary.com/gladstone/image/upload/fusion-lifestyle-live/ydokan4mlia7zigqd79d"
  },
  "url": "https://www.fusion-lifestyle.com/"
},
"bookingService": {
  "@type": "BookingService",
  "name": "Gladstone360",
  "url": "https://www.gladstonesoftware.co.uk",
  "softwareVersion": "3.0.2"
},
bookingService is a property in the OpenActive namespace, and is not recognised by Google. It is also not clear whether the publisher property is being used correctly in this context.
Considerations
Google's Structured Data Documentation
Google's Structured Data Documentation recommends the use of the property creator to represent "The creator or author of this dataset", and does not provide specific references for other properties (though it points to schema.org for more information).
schema.org
schema.org includes several options for attributing the roles of organizations relating to a schema:Dataset:
creator - The creator/author of this CreativeWork
maintainer - A maintainer of a Dataset
publisher - The publisher of the creative work.
sourceOrganization - and others
Existing OpenActive dataset sites
The dataset site text reads:
This data is owned by <a href="{{publisher.url}}">{{publisher.legalName}}</a> and is licensed under the Creative Commons Attribution Licence (CC-BY v4.0) for anyone to access, use and share; using attribution "<a href="{{url}}"><span>{{publisher.name}}</span></a>".
Platform: <a href="{{bookingService.url}}">{{bookingService.name}} {{bookingService.softwareVersion}}</a>.
Note that single database systems generally set bookingService to match publisher, as they are the same.
Proposal
Note: this proposal doesn't yet consider maintainer. Could it be useful to include it within the JSON-LD as a duplicate of creator (if set)?
Multiple database systems
For multiple database systems, where there is one dataset site per activity provider:
creator - The activity provider
publisher - The booking system
This data is owned by <a href="{{creator.url}}">{{creator.legalName}}</a> and is licensed under the Creative Commons Attribution Licence (CC-BY v4.0) for anyone to access, use and share; using attribution "<a href="{{url}}"><span>{{creator.name}}</span></a>".
Platform: <a href="{{publisher.url}}">{{publisher.name}} {{publisher.softwareVersion}}</a>.
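To make the mapping for multiple database systems concrete, here is a minimal Python sketch that converts the current draft shape into the proposed one. The helper name `to_multiple_database_model` is hypothetical, and the sketch assumes the booking system object is carried over unchanged (including its `@type`, which this proposal does not specify).

```python
import json

# Current draft shape, trimmed from the example in the Problem section.
draft = {
    "publisher": {
        "@type": "Organization",
        "name": "Fusion Lifestyle",
        "legalName": "Fusion Lifestyle",
        "url": "https://www.fusion-lifestyle.com/",
    },
    "bookingService": {
        "@type": "BookingService",
        "name": "Gladstone360",
        "url": "https://www.gladstonesoftware.co.uk",
        "softwareVersion": "3.0.2",
    },
}


def to_multiple_database_model(dataset: dict) -> dict:
    """Hypothetical helper: move the activity provider to `creator` and
    the booking system to `publisher`, dropping `bookingService`."""
    proposed = {
        key: value
        for key, value in dataset.items()
        if key not in ("publisher", "bookingService")
    }
    # The activity provider was previously held in `publisher`.
    proposed["creator"] = dict(dataset["publisher"])
    # Assumption: the booking system object is copied as-is, including
    # its "@type"; the proposal does not say which type it should use.
    proposed["publisher"] = dict(dataset["bookingService"])
    return proposed


proposed = to_multiple_database_model(draft)
print(json.dumps(proposed, indent=2))
```

After the conversion, `creator` holds the activity provider and `publisher` holds the booking system, which is what the template above reads from.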
Single database systems
For single database systems, where there is one dataset site that contains data from multiple activity providers:
publisher - The booking system
This data is owned by <a href="{{publisher.url}}">{{publisher.legalName}}</a> and is licensed under the Creative Commons Attribution Licence (CC-BY v4.0) for anyone to access, use and share; using attribution "<a href="{{url}}"><span>{{publisher.name}}</span></a>".
(Note: in this proposal the "Platform" reference is removed from the HTML for single database systems.)
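The difference between the two cases can be sketched as a single rendering function that emits the "Platform" line only when a creator is present. This is an illustrative sketch, not the dataset site's actual template logic: `render_attribution` is a hypothetical name, and the example organization and URLs are made up.

```python
from html import escape

LICENCE_TEXT = (
    "and is licensed under the Creative Commons Attribution Licence "
    "(CC-BY v4.0) for anyone to access, use and share"
)


def render_attribution(dataset: dict) -> str:
    """Hypothetical renderer for the dataset site text in this proposal."""
    # With a creator present (multiple database systems) the creator owns
    # the data; otherwise (single database systems) the publisher does.
    owner = dataset.get("creator") or dataset["publisher"]
    lines = [
        f'This data is owned by <a href="{escape(owner["url"])}">'
        f'{escape(owner["legalName"])}</a> {LICENCE_TEXT}; using attribution '
        f'"<a href="{escape(dataset["url"])}"><span>{escape(owner["name"])}'
        f'</span></a>".'
    ]
    # The "Platform" line is only rendered when creator and publisher differ
    # in role, i.e. for multiple database systems.
    if "creator" in dataset:
        platform = dataset["publisher"]
        lines.append(
            f'Platform: <a href="{escape(platform["url"])}">'
            f'{escape(platform["name"])} '
            f'{escape(platform["softwareVersion"])}</a>.'
        )
    return "\n".join(lines)


# Made-up single database example: only `publisher` is set.
single = {
    "url": "https://data.example.com/openactive/",
    "publisher": {
        "name": "Example Bookings",
        "legalName": "Example Bookings Ltd",
        "url": "https://www.example.com/",
    },
}
single_html = render_attribution(single)
print(single_html)
```

Keying the "Platform" line on the presence of `creator` keeps one template for both kinds of system, at the cost of relying on single database systems never setting `creator`.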
Implementation note
We need to ensure the embedded DCAT markup reflects this change.