Exploring Experiments with Consuming JSON-LD and Primo's Linked Data


Delve into the world of JSON-LD consumption and experiments with Primo's latest linked data. Learn about using tools like JQ, understanding JSON-LD validity, and exploring Primo's new RESTful Analytics Search API and more. Discover the importance of identifying collection strengths, topic modeling, and recommendation systems in library analytics.


Uploaded on Sep 16, 2024



Presentation Transcript


  1. Consuming JSON-LD: Experiments with Primo's Latest Linked Data Corey Harper 2015-09-05 IGeLU 2015 Developers Day

  2. Curl | jq | less to Beta Search REST API: curl "http://bobcatdev.library.nyu.edu/primo_library/libweb/webservices/rest/v1/pnxs?q=any,contains,monkeys&inst=NYU" | jq -C '.docs' | less -r
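The same request can be sketched in Python. This is a minimal illustration, not the presenter's code: the helper names `build_search_url` and `extract_docs` and the sample response body are hypothetical, and the base URL is simply the one shown on the slide, which may no longer be reachable.

```python
import json
from urllib.parse import urlencode

# Base URL copied from the slide; the beta endpoint may no longer exist.
BASE_URL = ("http://bobcatdev.library.nyu.edu/primo_library/libweb/"
            "webservices/rest/v1/pnxs")


def build_search_url(term, inst="NYU", base_url=BASE_URL):
    """Build the search URL from the curl command (q=any,contains,<term>)."""
    query = {"q": f"any,contains,{term}", "inst": inst}
    # safe="," leaves the commas in the q parameter unescaped, as on the slide.
    return f"{base_url}?{urlencode(query, safe=',')}"


def extract_docs(response_text):
    """Rough equivalent of jq '.docs': the 'docs' value, or None if absent."""
    return json.loads(response_text).get("docs")


# Hypothetical response body, used only to exercise extract_docs offline.
sample = '{"docs": [{"title": "Monkeys"}], "total": 1}'
print(build_search_url("monkeys"))
print(extract_docs(sample))
```

To query a live server, the URL from `build_search_url` could be passed to `urllib.request.urlopen` and the decoded body handed to `extract_docs`.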

  3. Ruby RDF & JSON-LD
     require 'rdf'
     require 'json/ld'
     require 'rdf/turtle'
     require 'openssl'
     resource = RDF::Resource(RDF::URI.new("http://bobcatdev.library.nyu.edu/primo_library/libweb/webservices/rest/v1/pnxs/L/nyu_aleph001770007?inst=NYU"))
     graph = RDF::Graph.new << JSON::LD::API.toRdf(resource)
     graph.dump(:ttl, prefixes: {list prefixes})

  4. What just happened? This doesn't actually work. Not _quite_ valid JSON-LD. It needs an actual context. Add the context yourself, and you get errors when validating / linting. http://json-ld.org/playground/ https://github.com/ExLibrisGroup/Primo.PNX-context/blob/master/PNX-context.json
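"Adding the context yourself" can be sketched as attaching an `@context` key to the parsed record before handing it to a JSON-LD processor. The helper names and the one-term inline context below are hypothetical stand-ins, not the contents of PNX-context.json, and the sketch does no validation, so a record patched this way may still fail linting as the slide describes.

```python
import copy


def has_context(doc):
    """True if the parsed JSON document carries a top-level '@context'."""
    return isinstance(doc, dict) and "@context" in doc


def add_context(doc, context):
    """Return a copy of doc with '@context' set; the input is not modified.

    context may be an inline mapping or a URL string, both of which
    JSON-LD allows as the value of '@context'.
    """
    result = {"@context": context}
    for key, value in copy.deepcopy(doc).items():
        if key != "@context":
            result[key] = value
    return result


# Hypothetical record and a hypothetical one-term context (not the PNX one).
record = {"title": "Monkeys"}
context = {"title": "http://purl.org/dc/terms/title"}
patched = add_context(record, context)
print(has_context(record), has_context(patched))
```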

  5. Still works in JQ (but maybe not JSON Tools?)

  6. Consuming JSON (-LD?): Experiments with (and Without) Primo's New RESTful Analytics Search API

  7. Distribution of Titles

  8. Why do this stuff? Understanding your collections; Understanding queries and usage; Identifying Strengths; Topic Modeling; Clustering; Recommendation systems (Automatic Classification?)

  9. On Github (Also, DPLA Examples) https://github.com/chrpr/dpla-analytics/blob/master/primo/REST-api.ipynb

  10. Tree Map of Title Words
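The counts behind a tree map of title words could be produced along these lines. This is a sketch under assumptions: the function name, the tokenizing regex, the stopword set, and the two sample titles are all illustrative, and the linked notebook may do this differently.

```python
import re
from collections import Counter


def title_word_counts(titles, stopwords=frozenset()):
    """Count lowercased words across titles, skipping any stopwords."""
    counts = Counter()
    for title in titles:
        words = re.findall(r"[a-z0-9']+", title.lower())
        counts.update(w for w in words if w not in stopwords)
    return counts


# Hypothetical titles, for illustration only.
titles = ["Monkeys of the World", "The World of Apes"]
counts = title_word_counts(titles, stopwords={"of", "the"})
print(counts.most_common(3))
```

Each word's count would then set the area of its rectangle in the tree map.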

  11. Data Science Venn Diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram 2015-04-18 Harper - Can Metadata be Quantified? - DPLAFest 2015 14

  12. Analytics Writ Large. APIs have much potential; Limited in scope (for now). NYU External Data Warehousing: Analyze Query Logs; Analyze Dedup Merger; Analyze URLs within (and outside of) Aleph

  13. Toolkit: Tableau (Business Intelligence); R (Stats Analysis); Python (SciPy, NumPy, Pandas, etc.; NLTK); jq, awk, sed, grep, sort, uniq, tr, wc, etc.

  14. Data Quality Control

  15. Duplicate OCLC # Analysis
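A duplicate OCLC number check amounts to grouping record identifiers by OCLC number and keeping the groups with more than one member. The record layout and identifiers below are hypothetical; the slide does not show how the analysis was actually run.

```python
from collections import defaultdict


def duplicate_oclc_numbers(records):
    """Map each OCLC number found on more than one record to those record ids.

    records is an iterable of (record_id, list_of_oclc_numbers) pairs.
    """
    by_oclc = defaultdict(list)
    for record_id, oclc_numbers in records:
        # set() ignores a number repeated within a single record.
        for number in set(oclc_numbers):
            by_oclc[number].append(record_id)
    return {num: ids for num, ids in by_oclc.items() if len(ids) > 1}


# Hypothetical records, for illustration only.
records = [
    ("rec1", ["12345"]),
    ("rec2", ["12345", "67890"]),
    ("rec3", ["99999", "99999"]),
]
duplicates = duplicate_oclc_numbers(records)
print(duplicates)
```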

  16. Collection Management Decisions. Warehouse combines: Primo dedupmrg & frbr matches; Ebook SUSHI/Counter Stats; Aleph Circ Stats. Offsite & De-accessioning Decisions. Regression analysis to demo correlations

  17. CRISP-DM. "CRISP-DM Process Diagram" by Kenneth Jensen. Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons. http://commons.wikimedia.org/wiki/File:CRISP-DM_Process_Diagram.png

  18. Possibilities & Next Steps. Exploratory Data Analysis. More. Answering questions about data quality; More Topic Maps; Bi- and Tri-gram Tokenization, Hapaxes; Data Cleanup and QA; Processing incoming batch data. Integrate with other data streams: Google Analytics, AppDynamics, Kibana; Cross system logs, searches, etc.
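Bi- and tri-gram tokenization and hapaxes (tokens that occur exactly once) can be sketched with the standard library alone. The function names and the sample phrase are illustrative; NLTK, listed in the toolkit, offers comparable helpers such as `nltk.ngrams` and `FreqDist.hapaxes`.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def hapaxes(tokens):
    """Return the tokens that occur exactly once, in first-seen order."""
    counts = Counter(tokens)
    return [token for token in counts if counts[token] == 1]


# Hypothetical title text, split on whitespace for simplicity.
tokens = "the history of the book".split()
bigrams = ngrams(tokens, 2)
trigrams = ngrams(tokens, 3)
print(bigrams)
print(trigrams)
print(hapaxes(tokens))
```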

  19. Thanks! corey.harper@nyu.edu 212.998.2479 @chrpr
