Thanks for this, Dan. I hadn’t realised that the CSE's schema.org support didn’t include JSON-LD; that’s useful to know.

Apart from using the Common Crawl datasets, do you have any other suggestions for finding examples of markup that include JSON-LD?

Owen

Owen Stephens
Owen Stephens Consulting
Web: http://www.ostephens.com
Email: [log in to unmask]
Telephone: 0121 288 6936

On 31 Jan 2017, at 14:09, Dan Brickley <[log in to unmask]> wrote:

Really glad to see this thread. A quick qualifier, which hopefully won't put you off... AFAIK the Custom Search schema.org implementation (unlike almost everything we're doing at Google) doesn't understand JSON-LD, so it won't pick up everything. --Dan
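
Since the CSE won't surface JSON-LD, one way to check an individual page is to pull out its <script type="application/ld+json"> blocks directly. What follows is a minimal Python sketch using only the standard library. The names JsonLdExtractor and find_types are made up for illustration, and it only inspects top-level @type values, so anything nested (for example under @graph) would be missed.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect parsed contents of <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.items.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks rather than fail the whole page


def find_types(html, wanted=("Dataset", "DataCatalog", "DataDownload")):
    """Return the wanted schema.org types declared at the top level of any JSON-LD block."""
    parser = JsonLdExtractor()
    parser.feed(html)
    found = []
    for item in parser.items:
        nodes = item if isinstance(item, list) else [item]
        for node in nodes:
            if not isinstance(node, dict):
                continue
            types = node.get("@type", [])
            if isinstance(types, str):
                types = [types]
            found.extend(t for t in types if t in wanted)
    return found
```

Feeding it the HTML of a page (fetched however you prefer) gives a list such as ["Dataset"] when the page declares one, or an empty list when no matching JSON-LD is present.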

On 31 January 2017 at 13:57, Owen Stephens <[log in to unmask]> wrote:
Hi Mia,

To explore this a bit I’ve just set up a Google CSE which includes sites using one of the following schema.org types:

DataCatalog
Dataset
DataDownload

The CSE is available at https://cse.google.com/cse/publicurl?cx=012667802720864841810:rtkqc9vdej0

Searching with ‘site:ac.uk’ finds a lot of stuff from Durham - e.g. https://collections.durham.ac.uk/files/8p58pc92q
Parsing out the microdata from this URL (using Mida[1]) shows they are using Dataset:

>>mida https://collections.durham.ac.uk/files/8p58pc92q#.WJCVyxicZE4
Parsing: https://collections.durham.ac.uk/files/8p58pc92q#.WJCVyxicZE4
---
:type: http://schema.org/Dataset
:properties:
  funder:
  - :type: http://schema.org/Thing
  - :type: http://schema.org/Thing
  description:
  - Supporting data for the paper "Muon-spin relaxation study of the double perovskite
    insulators Sr2BOsO6 (B = Fe, Y, ln)"
  keywords:
  - muon-spin relaxation
  - magnetism
  publisher:
  - :type: http://schema.org/Organization
    :properties:
      name:
      - Durham University
  accountablePerson:
  - :type: http://schema.org/Person
    :properties:
      name:
      - T. Lancaster
  datePublished:
  - 11 January 2016, 14:01:11
  dateModified:
  - 17 May 2016, 14:05:37
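
For anyone without Ruby to hand, a rough equivalent of the first step (finding which schema.org types a page declares in microdata) can be sketched in Python with the standard library. This is not how Mida works internally, just an illustration; ItemTypeCollector and microdata_types are hypothetical names, and the sketch lists types only, without building the nested property tree shown above.

```python
from html.parser import HTMLParser


class ItemTypeCollector(HTMLParser):
    """Record the itemtype of every element that carries itemscope."""

    def __init__(self):
        super().__init__()
        self.types = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if "itemscope" in attr_map and attr_map.get("itemtype"):
            # itemtype may hold several space-separated URLs
            self.types.extend(attr_map["itemtype"].split())


def microdata_types(html):
    """Return all microdata item types found in the HTML, in document order."""
    collector = ItemTypeCollector()
    collector.feed(html)
    return collector.types
```

Run against a page like the Durham one, this would be expected to list http://schema.org/Dataset along with the nested Organization and Person types.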


Searching for ‘Museum’ gives me a number of sites including https://data.imls.gov, which marks up results with schema.org microdata. The output is long, so just an extract here:

>>mida https://data.imls.gov
Parsing: https://data.imls.gov
---
:type: http://schema.org/Dataset
:properties:
  name:
  - 'Library Systems: FY 1992-2012 Public Libraries Survey Trends (Administrative
    Entity)'
  url:
  - https://data.imls.gov/Public-Libraries-Survey/Library-Systems-FY-1992-2012-Public-Libraries-Surv/uzd7-qfwt
  sameAs:
  - https://data.imls.gov/d/uzd7-qfwt
  keywords:
  - Public Libraries Survey
  - trend
  - administrative entity
  - public library
  description:
  - Find key information on trends in library systems around the United States from
    FY 1992 to FY 2012.These data include imputed values for libraries that did not
    submit information in the annual data collections. Imputation is a procedure for
    estimating a value for a specific data item where the response is missing.Download
    PLS data files to see imputation flag variables or learn more on the imputation
    methods used in each fiscal year at https://www.imls.gov/research-evaluation/data-collection/public-libraries-survey/explore-pls-data/pls-data
  distribution:
  - :type: http://schema.org/DataDownload
    :properties:
      fileFormat:
      - text/csv
      - text/csv
      - application/json
      - application/rdf+xml
      - application/rss+xml
      - text/tab-separated-values
      - application/xml
      url:
      - https://data.imls.gov/api/views/uzd7-qfwt.csv?accessType=DOWNLOAD
      - https://data.imls.gov/api/views/uzd7-qfwt.csv?accessType=DOWNLOAD&bom=true
      - https://data.imls.gov/api/views/uzd7-qfwt.json?accessType=DOWNLOAD
      - https://data.imls.gov/api/views/uzd7-qfwt.rdf?accessType=DOWNLOAD
      - https://data.imls.gov/api/views/uzd7-qfwt.rss?accessType=DOWNLOAD
      - https://data.imls.gov/api/views/uzd7-qfwt.tsv?accessType=DOWNLOAD
      - https://data.imls.gov/api/views/uzd7-qfwt.xml?accessType=DOWNLOAD
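
Note that the extract reports the formats and the URLs as two separate lists under a single DataDownload. If you want one format per download link, a small Python sketch like the one below pairs them up. It assumes the two lists are positionally aligned, which looks true of the output above but is not guaranteed by the markup; pair_downloads is a made-up helper name.

```python
def pair_downloads(props):
    """Pair each fileFormat with the url at the same position.

    Assumes the two lists are aligned; raises if their lengths differ.
    """
    formats = props.get("fileFormat", [])
    urls = props.get("url", [])
    if len(formats) != len(urls):
        raise ValueError("fileFormat and url lists differ in length")
    return list(zip(formats, urls))


# First three entries from the extract above
distribution = {
    "fileFormat": ["text/csv", "text/csv", "application/json"],
    "url": [
        "https://data.imls.gov/api/views/uzd7-qfwt.csv?accessType=DOWNLOAD",
        "https://data.imls.gov/api/views/uzd7-qfwt.csv?accessType=DOWNLOAD&bom=true",
        "https://data.imls.gov/api/views/uzd7-qfwt.json?accessType=DOWNLOAD",
    ],
}

for file_format, url in pair_downloads(distribution):
    print(file_format, url)
```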


Hope this is helpful

Owen

[1] https://github.com/LawrenceWoodman/mida

Owen Stephens
Owen Stephens Consulting
Web: http://www.ostephens.com
Email: [log in to unmask]
Telephone: 0121 288 6936

> On 31 Jan 2017, at 13:36, Adrian Stevenson <[log in to unmask]> wrote:
>
> Hi Mia
>
> The short answer is no, but in case it’s of interest, I am looking at implementing schema.org markup for the Archives Hub service at the moment (https://archiveshub.jisc.ac.uk/). I am also working on a Jisc project looking to provide a UK research data discovery service (see https://rdds.jiscinvolve.org/wp/), and this is certainly on our radar for that as you might expect, although it’s low priority at the moment as we still have many other things to sort out. In case the archives side is of interest, there’s a W3C group https://www.w3.org/community/architypes/ but it’s gone a bit quiet so I’m wondering whether to try and pursue things there or some other way. Give me a shout if you want to get together at some point to compare notes, and I’m of course interested in your lessons to share :)
>
> Ade
> _____________________________
> Adrian Stevenson
> Senior Technical Coordinator
> Jisc Manchester
> 6th Floor, Churchgate House
> 56 Oxford Street
> Manchester
> M1 6EU
>
> Email: [log in to unmask]
> Tel: +44 (0) 161 413 7561
> http://www.twitter.com/adrianstevenson
> http://uk.linkedin.com/in/adrianstevenson/
>
>> On 31 Jan 2017, at 12:09, Mia <[log in to unmask]> wrote:
>>
>> Hello!
>>
>> As described in Facilitating the discovery of public datasets (https://research.googleblog.com/2017/01/facilitating-discovery-of-public.html), Google have new guidelines (https://developers.google.com/search/docs/data-types/datasets) to help data providers describe their datasets in a structured way that their search engine can understand.
>>
>> It's something we should think about for data.bl.uk, so in the hope that someone else has lessons to share, has anyone tried describing their stuff with http://schema.org/Dataset?
>>
>> Cheers,
>>
>> Mia
>>
>> --------------------------------------------
>> http://openobjects.org.uk/
>> http://twitter.com/mia_out
>> Check out my book! http://bit.ly/CrowdsourcingOurCulturalHeritage
>> P.S. I mostly use this address for list mail and don't check it daily
>> **************************************************************** website: http://museumscomputergroup.org.uk/ Twitter: http://www.twitter.com/ukmcg Facebook: http://www.facebook.com/museumscomputergroup [un]subscribe: http://museumscomputergroup.org.uk/email-list/ ****************************************************************
>
> Jisc is a registered charity (number 1149740) and a company limited by guarantee which is registered in England under Company No. 5747339, VAT No. GB 197 0632 86. Jisc’s registered office is: One Castlepark, Tower Hill, Bristol, BS2 0JA. T 0203 697 5800.
>
> Jisc Services Limited is a wholly owned Jisc subsidiary and a company limited by guarantee which is registered in England under company number 2881024, VAT number GB 197 0632 86. The registered office is: One Castle Park, Tower Hill, Bristol BS2 0JA. T 0203 697 5800.
>

****************************************************************
       website:  http://museumscomputergroup.org.uk/
       Twitter:  http://www.twitter.com/ukmcg
      Facebook:  http://www.facebook.com/museumscomputergroup
 [un]subscribe:  http://museumscomputergroup.org.uk/email-list/
****************************************************************
