Print

Print


Thanks for this Dan - I hadn’t realised that CSE didn’t include schema.org using JSON-LD that’s useful to know.

Apart from using the Common Web Crawl datasets do you have any other suggestions for finding examples of markup that would include the JSON-LD?

Owen

Owen Stephens
Owen Stephens Consulting
Web: http://www.ostephens.com
Email: [log in to unmask]
Telephone: 0121 288 6936

> On 31 Jan 2017, at 14:09, Dan Brickley <[log in to unmask]> wrote:
> 
> Really glad to see this thread. A quick qualifier, which hopefully won't put you off... AFAIK the Custom Search schema.org <http://schema.org/> implementation (unlike almost everything we're doing at Google) doesn't understand JSON-LD, so it won't pick up everything. --Dan
> 
> On 31 January 2017 at 13:57, Owen Stephens <[log in to unmask] <mailto:[log in to unmask]>> wrote:
> Hi Mia,
> 
> To explore this a bit I’ve just setup a Google CSE which includes sites using one of the following schema.org <http://schema.org/> types:
> 
> DataCatalog
> Dataset
> DataDownload
> 
> The CSE is available at https://cse.google.com/cse/publicurl?cx=012667802720864841810:rtkqc9vdej0 <https://cse.google.com/cse/publicurl?cx=012667802720864841810:rtkqc9vdej0>
> 
> Searching with ‘site:ac.uk <http://ac.uk/>’ finds a lot of stuff from Durham - e.g. https://collections.durham.ac.uk/files/8p58pc92q <https://collections.durham.ac.uk/files/8p58pc92q>
> Parsing out the microdata from this URL (using Mida[1]) shows they are using Dataset:
> 
> >>mida https://collections.durham.ac.uk/files/8p58pc92q#.WJCVyxicZE4 <https://collections.durham.ac.uk/files/8p58pc92q#.WJCVyxicZE4>
> Parsing: https://collections.durham.ac.uk/files/8p58pc92q#.WJCVyxicZE4 <https://collections.durham.ac.uk/files/8p58pc92q#.WJCVyxicZE4>
> ---
> :type: http://schema.org/Dataset <http://schema.org/Dataset>
> :properties:
>   funder:
>   - :type: http://schema.org/Thing <http://schema.org/Thing>
>   - :type: http://schema.org/Thing <http://schema.org/Thing>
>   description:
>   - Supporting data for the paper "Muon-spin relaxation study of the double perovskite
>     insulators Sr2BOsO6 (B = Fe, Y, ln)"
>   keywords:
>   - muon-spin relaxation
>   - magnetism
>   publisher:
>   - :type: http://schema.org/Organization <http://schema.org/Organization>
>     :properties:
>       name:
>       - Durham University
>   accountablePerson:
>   - :type: http://schema.org/Person <http://schema.org/Person>
>     :properties:
>       name:
>       - T. Lancaster
>   datePublished:
>   - 11 January 2016, 14:01:11
>   dateModified:
>   - 17 May 2016, 14:05:37
> 
> 
> Searching for ‘Museum’ gives me a number of sites including https://data.imls.gov <https://data.imls.gov/> - which marks up results with schema.org <http://schema.org/> microdata - so just an extract here:
> 
> >>mida https://data.imls.gov <https://data.imls.gov/>
> Parsing: https://data.imls.gov <https://data.imls.gov/>
> ---
> :type: http://schema.org/Dataset <http://schema.org/Dataset>
> :properties:
>   name:
>   - 'Library Systems: FY 1992-2012 Public Libraries Survey Trends (Administrative
>     Entity)'
>   url:
>   - https://data.imls.gov/Public-Libraries-Survey/Library-Systems-FY-1992-2012-Public-Libraries-Surv/uzd7-qfwt <https://data.imls.gov/Public-Libraries-Survey/Library-Systems-FY-1992-2012-Public-Libraries-Surv/uzd7-qfwt>
>   sameAs:
>   - https://data.imls.gov/d/uzd7-qfwt <https://data.imls.gov/d/uzd7-qfwt>
>   keywords:
>   - Public Libraries Survey
>   - trend
>   - administrative entity
>   - public library
>   description:
>   - Find key information on trends in library systems around the United States from
>     FY 1992 to FY 2012.These data include imputed values for libraries that did not
>     submit information in the annual data collections. Imputation is a procedure for
>     estimating a value for a specific data item where the response is missing.Download
>     PLS data files to see imputation flag variables or learn more on the imputation
>     methods used in each fiscal year at https://www.imls.gov/research-evaluation/data-collection/public-libraries-survey/explore-pls-data/pls-data <https://www.imls.gov/research-evaluation/data-collection/public-libraries-survey/explore-pls-data/pls-data>
>   distribution:
>   - :type: http://schema.org/DataDownload <http://schema.org/DataDownload>
>     :properties:
>       fileFormat:
>       - text/csv
>       - text/csv
>       - application/json
>       - application/rdf+xml
>       - application/rss+xml
>       - text/tab-separated-values
>       - application/xml
>       url:
>       - https://data.imls.gov/api/views/uzd7-qfwt.csv?accessType=DOWNLOAD <https://data.imls.gov/api/views/uzd7-qfwt.csv?accessType=DOWNLOAD>
>       - https://data.imls.gov/api/views/uzd7-qfwt.csv?accessType=DOWNLOAD&bom=true <https://data.imls.gov/api/views/uzd7-qfwt.csv?accessType=DOWNLOAD&bom=true>
>       - https://data.imls.gov/api/views/uzd7-qfwt.json?accessType=DOWNLOAD <https://data.imls.gov/api/views/uzd7-qfwt.json?accessType=DOWNLOAD>
>       - https://data.imls.gov/api/views/uzd7-qfwt.rdf?accessType=DOWNLOAD <https://data.imls.gov/api/views/uzd7-qfwt.rdf?accessType=DOWNLOAD>
>       - https://data.imls.gov/api/views/uzd7-qfwt.rss?accessType=DOWNLOAD <https://data.imls.gov/api/views/uzd7-qfwt.rss?accessType=DOWNLOAD>
>       - https://data.imls.gov/api/views/uzd7-qfwt.tsv?accessType=DOWNLOAD <https://data.imls.gov/api/views/uzd7-qfwt.tsv?accessType=DOWNLOAD>
>       - https://data.imls.gov/api/views/uzd7-qfwt.xml?accessType=DOWNLOAD <https://data.imls.gov/api/views/uzd7-qfwt.xml?accessType=DOWNLOAD>
> —
> 
> Hope this is helpful
> 
> Owen
> 
> [1] https://github.com/LawrenceWoodman/mida <https://github.com/LawrenceWoodman/mida>
> 
> Owen Stephens
> Owen Stephens Consulting
> Web: http://www.ostephens.com <http://www.ostephens.com/>
> Email: [log in to unmask] <mailto:[log in to unmask]>
> Telephone: 0121 288 6936
> 
> > On 31 Jan 2017, at 13:36, Adrian Stevenson <[log in to unmask] <mailto:[log in to unmask]>> wrote:
> >
> > Hi Mia
> >
> > The short answer is no, but in case it’s of interest, I am looking at implementing schema.org <http://schema.org/> markup for the Archives Hub service at the moment (https://archiveshub.jisc.ac.uk/ <https://archiveshub.jisc.ac.uk/>). I am also working on a Jisc project looking to provide a UK research data discovery service (see https://rdds.jiscinvolve.org/wp/ <https://rdds.jiscinvolve.org/wp/>), and this is certainly on our radar for that as you might expect, although it’s low priority at the moment as we still have many other things to sort out. In case the archives side is of interest, there’s a W3C group https://www.w3.org/community/architypes/ <https://www.w3.org/community/architypes/> but it’s gone a bit quiet so I’m wondering whether to try and pursue things there or some other way. Give me a shout if you want to get together at some point to compare notes, and I’m of course interested in your lessons to share :)
> >
> > Ade
> > _____________________________
> > Adrian Stevenson
> > Senior Technical Coordinator
> > Jisc Manchester
> > 6th Floor, Churchgate House
> > 56 Oxford Street
> > Manchester
> > M1 6EU
> >
> > Email: [log in to unmask] <mailto:[log in to unmask]>
> > Tel: +44 (0) 161 413 7561 <tel:%2B44%20%280%29%20161%20413%207561>
> > http://www.twitter.com/adrianstevenson <http://www.twitter.com/adrianstevenson>
> > http://uk.linkedin.com/in/adrianstevenson/ <http://uk.linkedin.com/in/adrianstevenson/>
> >
> >> On 31 Jan 2017, at 12:09, Mia <[log in to unmask] <mailto:[log in to unmask]>> wrote:
> >>
> >> Hello!
> >>
> >> As described in Facilitating the discovery of public datasets (https://research.googleblog.com/2017/01/facilitating-discovery-of-public.html <https://research.googleblog.com/2017/01/facilitating-discovery-of-public.html>), Google have new guidelines (https://developers.google.com/search/docs/data-types/datasets <https://developers.google.com/search/docs/data-types/datasets>) to help data providers describe their datasets in a structured way that their search engine can understand.
> >>
> >> It's something we should think about for data.bl.uk <http://data.bl.uk/>, so in the hope that someone else has lessons to share, has anyone tried describing their stuff with http://schema.org/Dataset <http://schema.org/Dataset>?
> >>
> >> Cheers,
> >>
> >> Mia
> >>
> >> --------------------------------------------
> >> http://openobjects.org.uk/ <http://openobjects.org.uk/>
> >> http://twitter.com/mia_out <http://twitter.com/mia_out>
> >> Check out my book! http://bit.ly/CrowdsourcingOurCulturalHeritage <http://bit.ly/CrowdsourcingOurCulturalHeritage>
> >> P.S. I mostly use this address for list mail and don't check it daily
> >> **************************************************************** website: http://museumscomputergroup.org.uk/ <http://museumscomputergroup.org.uk/> Twitter: http://www.twitter.com/ukmcg <http://www.twitter.com/ukmcg> Facebook: http://www.facebook.com/museumscomputergroup <http://www.facebook.com/museumscomputergroup> [un]subscribe: http://museumscomputergroup.org.uk/email-list/ <http://museumscomputergroup.org.uk/email-list/> ****************************************************************
> >
> > Jisc is a registered charity (number 1149740) and a company limited by guarantee which is registered in England under Company No. 5747339, VAT No. GB 197 0632 86. Jisc’s registered office is: One Castlepark, Tower Hill, Bristol, BS2 0JA. T 0203 697 5800.
> >
> > Jisc Services Limited is a wholly owned Jisc subsidiary and a company limited by guarantee which is registered in England under company number 2881024, VAT number GB 197 0632 86. The registered office is: One Castle Park, Tower Hill, Bristol BS2 0JA. T 0203 697 5800.
> >
> > ****************************************************************
> >       website:  http://museumscomputergroup.org.uk/ <http://museumscomputergroup.org.uk/>
> >       Twitter:  http://www.twitter.com/ukmcg <http://www.twitter.com/ukmcg>
> >      Facebook:  http://www.facebook.com/museumscomputergroup <http://www.facebook.com/museumscomputergroup>
> > [un]subscribe:  http://museumscomputergroup.org.uk/email-list/ <http://museumscomputergroup.org.uk/email-list/>
> > ****************************************************************
> 
> ****************************************************************
>        website:  http://museumscomputergroup.org.uk/ <http://museumscomputergroup.org.uk/>
>        Twitter:  http://www.twitter.com/ukmcg <http://www.twitter.com/ukmcg>
>       Facebook:  http://www.facebook.com/museumscomputergroup <http://www.facebook.com/museumscomputergroup>
>  [un]subscribe:  http://museumscomputergroup.org.uk/email-list/ <http://museumscomputergroup.org.uk/email-list/>
> ****************************************************************
> 
> **************************************************************** website: http://museumscomputergroup.org.uk/ <http://museumscomputergroup.org.uk/> Twitter: http://www.twitter.com/ukmcg <http://www.twitter.com/ukmcg> Facebook: http://www.facebook.com/museumscomputergroup <http://www.facebook.com/museumscomputergroup>[un]subscribe: http://museumscomputergroup.org.uk/email-list/ <http://museumscomputergroup.org.uk/email-list/> ****************************************************************


****************************************************************
       website:  http://museumscomputergroup.org.uk/
       Twitter:  http://www.twitter.com/ukmcg
      Facebook:  http://www.facebook.com/museumscomputergroup
 [un]subscribe:  http://museumscomputergroup.org.uk/email-list/
****************************************************************