Exploratory Search at DePaul University Library

exploratory search beyond the work level n.w
1 / 12
Embed
Share

Explore the innovative approach of DePaul University Library in conducting large-scale reading behavior analysis through the Reading Chicago Reading project. Discover how technical services staff proposed using APIs to enhance the search for OBOC English language manifestations in the HTRC Extracted Features dataset.

  • DePaul University Library
  • Reading Chicago Reading
  • Digital Humanities
  • APIs
  • Information Retrieval

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. EXPLORATORY SEARCH BEYOND THE WORK LEVEL Tami Luedtke, Technical Services Coordinator Ana Lucic, Digital Scholarship Librarian Megan Bernal, Associate University Librarian for Information Technology & Discovery Services DePaul University Library, Chicago, IL, May 11, 2017

  2. CPL LIAISON Jennifer Lizak PRINCIPAL INVESTIGATORS Robin Burke John Shanahan STUDENTS Zack Budde Marco Ferri Nicolas Nascimento Samantha Okrasinski Nasim Sonboli Jorge Staudt Mihaela Stoica Tim Zhang Dan Aasland Guan Yingting Hyunyou Choi Megan Bernal Ana Lucic SSRC SUPPORT Jessica Bishop-Royse Nandhini Gulasingam READING CHICAGO READING project web site: https://dh.depaul.press/reading-chicago/about/

  3. PROJECT BACKGROUND Reading Chicago Reading (RCR) explores large scale reading behavior through the analysis of the One Book One Chicago (OBOC) program s book finalists and: 1) Chicago Public Library circulation data 2) Demographic data about diverse Chicago communities Next step: 3) Adding text features extracted from the OBOC book finalists

  4. INFORMATION NEED RCR asked DePaul library staff to consider the following information need(s) and recommend potential solutions: For given work(s), about which author and title are known, are there any OBOC English language manifestations available in the HTRC Extracted Features dataset? Do any open programmatic tools exist that can help digital humanities scholars and librarians find and obtain this content in an mostly seamless and automated way? If not, do separate open programmatic tools exist that RCR could link together to achieve this end?

  5. A SERIES OF APIS Technical Services staff considered these information needs and proposed using a combination of OCLC and HathiTrust APIs to progress from known author and title metadata to the desired output.

  6. THE ADVENTURES OF AUGIE MARCH BY SAUL BELLOW OCLC CLASSIFY

  7. THE ADVENTURES OF AUGIE MARCH BY SAUL BELLOW - - OCLC LINKED DATA

  8. THE ADVENTURES OF AUGIE MARCH BY SAUL BELLOW HATHITRUST BIB API

  9. THE ADVENTURES OF AUGIE MARCH BY SAUL BELLOW HTRC FEATURE READER

  10. HTRC EXTRACTED FEATURES DATASET IN JSON Although we have established that a certain volume of a work is available in the HathiTrust extracted features dataset, is this the volume we are interested in? "tokenPosCount":{"lion":{"NN":1},"despotic":{"JJ":1},"down":{"RP":2," RB":1},"overfed":{"JJ":1},"mother":{"NN":1},"for":{"IN":5},"knuckles":{"NNS":1}, "Grandma":{"NNP":4},"fate":{"NN":1},"any":{"DT":1},"disguise":{"VB":1},"door":{" NN":1},"in":{"IN":4},"myself":{"PRP":1},"have":{"VB":1,"VBP":2},"learned":{"VBD":1},"lay": {"VBD":1},"is":{"VBZ":4},"belonged":{"VBD":1},"his":{"PRP$":2},"knows":{"VBZ":1},"every one":{"NN":1},"song":{"NN":1},"am":{"VBP":1},"hands":{"NNS":2},"elder":{"JJR":1},".":{".":1 3},"but":{"CC":1},"She":{"PRP":3},"fineness":{"NN":1},"what":{"WP":2},"teach":{"VB":1},"c ared":{"VBD":1},"embroidered":{"VBN":1},"Mahchy":{"NNP":2},"if":{"IN":1},"Winnie":{"N NP":4},"My":{"PRP$":2},"own":{"JJ":2},"free- style":{"JJ":1},"felt":{"VBN":1},"wore":{"VBD":1},"up":{"RB":1},"so":{"RB":1},"rest":{"NN":1}, "ran":{"VBD":1},"younger":{"JJR":1},"adjoining":{"VBG":1},"Lausch":{"NNP":2},"had":{"VB D":1},"governed":{"JJ":1},"parents":{"NNS":1},"accuracy":{"NN":1},"us":{"PRP":

  11. TAKEAWAYS Complexity stems from the following factors: any number of possible manifestations/items of a work may or may not be available on a system many APIs assume knowledge of item level IDs as a starting point for obtaining data This type of automated known-work search to find/explore any available item" for computational purposes is not very common yet. We expect more requests as digital humanities and other researchers seek entire machine readable texts and text metadata for computational analysis. Technical services librarians can use their knowledge of metadata to assist digital humanities scholars and other researchers in determining useful API searches/calls utilizing known metadata and for retrieving additional textual metadata that might be of use. RCR project developers hope to make alpha prototype program(s) available on GitHub soon.

  12. THANK YOU FOR LISTENING! Please contact us with questions, suggestions, or comments at: Tami Luedtke: tluedtke@depaul.edu Ana Lucic: alucic@depaul.edu Megan Bernal: mbernal2@depaul.edu

Related


More Related Content