{"id":82,"date":"2014-10-28T10:07:38","date_gmt":"2014-10-28T14:07:38","guid":{"rendered":"https:\/\/juliehardesty.com\/notions\/?p=82"},"modified":"2014-10-28T10:07:38","modified_gmt":"2014-10-28T14:07:38","slug":"all-the-slices-of-pie","status":"publish","type":"post","link":"https:\/\/juliehardesty.com\/notions\/all-the-slices-of-pie\/","title":{"rendered":"All the slices of pie"},"content":{"rendered":"<h2>Readings on combining and exposing library data sets<\/h2>\n<p>I feel like I&#8217;m seeing calls across a variety of subject domains\u00a0for sharing data and making it easily available and reusable. National funding models in the U.S. are beginning to <a href=\"http:\/\/www.nlm.nih.gov\/NIHbmic\/nih_data_sharing_policies.html\">require<\/a> <a href=\"http:\/\/www.neh.gov\/grants\/manage\/general-terms-and-conditions-awards-awards-issued-may-2009-or-later#intangible\">sharing<\/a> of data so this idea of providing your data for others to use is kind of catching on.<\/p>\n<p>I also finally read Aaron Swartz\u2019s posthumously published \u201c<a href=\"http:\/\/www.morganclaypool.com\/doi\/pdf\/10.2200\/S00481ED1V01Y201302WBE005\">A Programmable Web: An Unfinished Work<\/a>,\u201d which is an important read for a multitude of reasons. He makes his own call for exposing data in ways that make it easy for people to grab data they want or get all of the data and make use of it however they want (Chs. 5-7). His ideas implement this around JSON and web-based technology. I like that but I think there\u2019s probably also still a place for XML in\u00a0exchanging data in a standardized way or communicating data at an institutional level (feeding our data into DPLA, for example).<\/p>\n<p>With a goal of combining our library data for discovery, access, and reuse, I&#8217;ve been trying to uncover a literature review of sorts on combining data sets within a library context. I\u2019ve come upon ideas about how to evaluate and compare data sets for commonalities and how to think about providing data in ways that are actually useful and understandable to researchers outside of the library context. Following is the current state of an annotated bibliography, plus some delicious slices of pie because, well, pie:<\/p>\n<div id=\"attachment_86\" style=\"width: 650px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-86\" class=\"wp-image-86 size-full\" title=\"Slice of Cherry Blueberry Pie\" src=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/2811905230_9766b893d7_o.jpg\" alt=\"Slice of Cherry Blueberry Pie\" width=\"640\" height=\"480\" srcset=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/2811905230_9766b893d7_o.jpg 640w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/2811905230_9766b893d7_o-300x225.jpg 300w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/2811905230_9766b893d7_o-400x300.jpg 400w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><p id=\"caption-attachment-86\" class=\"wp-caption-text\">Slice of Cherry Blueberry Pie by <a href=\"https:\/\/flic.kr\/p\/5htKsE\">digidi via flickr<\/a><\/p><\/div>\n<p>Abed, Alea. (2014). Podcast: Project Blacklight, Hydra and libraries in the digital age. <i>Lucidworks.<\/i> <a href=\"http:\/\/www.lucidworks.com\/blog\/podcast-project-blacklight-hydra-and-libraries-in-the-digital-age\/\">http:\/\/www.lucidworks.com\/blog\/podcast-project-blacklight-hydra-and-libraries-in-the-digital-age\/<\/a><\/p>\n<p>Bess Sadler from Stanford University discusses <a href=\"http:\/\/projecthydra.org\">Project Hydra<\/a> and what is happening in new developments. They are trying to improve discovery and access for digital libraries by adding a technology stack onto the inventory system that has been digital repositories up to now. Also improving this inventorysystem by providing self-deposit interfaces. Two new areas of work highlighted were <a href=\"http:\/\/geoblacklight.org\">GeoBlacklight<\/a> for GIS data and displaying archival collections effectively in Blacklight.<\/p>\n<div id=\"attachment_89\" style=\"width: 650px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-89\" class=\"wp-image-89\" title=\"Maple-Bourbon Pumpkin Pie\" src=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/8177516875_7274b59694_b-300x277.jpg\" alt=\"Maple-Bourbon Pumpkin Pie\" width=\"640\" height=\"591\" srcset=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/8177516875_7274b59694_b-300x277.jpg 300w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/8177516875_7274b59694_b-324x300.jpg 324w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/8177516875_7274b59694_b.jpg 1024w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><p id=\"caption-attachment-89\" class=\"wp-caption-text\">Maple-Bourbon Pumpkin Pie by <a href=\"https:\/\/flic.kr\/p\/dsBUSg\">djwtwo via flickr<\/a><\/p><\/div>\n<p>Breeding, Marshall. (2005). Plotting a new course for metasearch. <i>Computers in Libraries, <\/i>25:2, pp. 27-29.<\/p>\n<p>Breeding makes the case for a giant central search of content instead of federated searching (searching against multiple targets). This provides a single access point instead of multiple search interfaces and lessens the burden of searching multiple targets and needing multiple indexes. Making this switch can be difficult since different providers don\u2019t always make metadata openly available for combining.<\/p>\n<p>Emde, Judith Z., Sara E. Morris, and Monica Claassen-Wilson. (2009). Testing an academic library website for usability with faculty and graduate students. <i>Evidence Based Library and Information Practice, <\/i>4:4, pp. 24-36.<\/p>\n<p>This article describes findings from a usability study of a library website. Findings include that graduate students tend to get results that are too broad from federated searching. They have to use quotation marks to be precise and results can be too mixed, making it hard to tell what is what. Federated searching is most helpful to graduate students to point out resources or databases they have not previously used. Another finding was that graduate students want subject-specific searching or limited combined subject searching, not cross-subject searching. Subject-specific resource help is most useful when given within a context, such as a course.<\/p>\n<p>Hofmann, Melissa A. and Sharon Q. Yang. (2011). How next-gen r u? A review of academic OPACs in the United States and Canada. <i>Computers in Libraries<\/i> 31:6, pp. 26-29.<\/p>\n<p>Initial study that was followed up in 2011 found that of 260 academic libraries surveyed, very few were using federated searching to combine data sources and most were still only offering catalog searching. If there was a discovery layer tool in use, it tended to provide faceted navigation.<\/p>\n<p>Hofmann, Melissa A. (2012). \u201cDiscovering\u201d what\u2019s changed: a revisit of the OPACs of 260 academic libraries. <i>Library Hi Tech<\/i> 30:2, pp. 253-274.<\/p>\n<p>In this 2011 follow-up to a 2009 study that found that discovery layers were not in wide use among academic online library catalogs, more institutions are using discovery layers but there are weaknesses in what these tools can do in terms of unified one-stop searching, recommended items, and relevancy display based on circulation statistics. Interest is shown in the <a href=\"http:\/\/www.extensiblecatalog.org\">eXtensible Catalog (XC) Metadata Toolkit<\/a>\u00a0because it \u201caggregates metadata from various silos, normalizes (cleans-up) metadata of varying levels of quality, and transform[s]\u2026 metadata into a consistent format for use in the discovery layer.\u201d [p. 261]<\/p>\n<div id=\"attachment_88\" style=\"width: 650px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-88\" class=\"wp-image-88\" title=\"Dutch Apple Pie a la mode\" src=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/3949201495_922cb89379_b-300x209.jpg\" alt=\"Dutch Apple Pie a la mode\" width=\"640\" height=\"446\" srcset=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/3949201495_922cb89379_b-300x209.jpg 300w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/3949201495_922cb89379_b-430x300.jpg 430w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/3949201495_922cb89379_b.jpg 1024w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><p id=\"caption-attachment-88\" class=\"wp-caption-text\">Dutch Apple Pie a la mode by <a href=\"https:\/\/flic.kr\/p\/71YG1X\">mattmendoza via flickr<\/a><\/p><\/div>\n<p>Johnson, Thomas. (2013). Indexing linked bibliographic data with JSON-LD, BibJSON and Elasticsearch. <i>The Code4Lib Journal<\/i>, 19.\u00a0<a href=\"http:\/\/journal.code4lib.org\/articles\/7949\">http:\/\/journal.code4lib.org\/articles\/7949<\/a><\/p>\n<p>This article describes using JSON to map RDF into JSON-LD (linked data). The main point of interest for me is that indexes were not actually combined but kept separate. This helped to include context along with the index and allowed for different mappings based on discrepancies between data sources. There were no performance issues querying across multiple indexes using JSON.<\/p>\n<div id=\"attachment_87\" style=\"width: 650px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-87\" class=\"wp-image-87\" title=\"All We are saying is Give Pie a Chance\" src=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/3816813507_d4c4c489e5_b-300x271.jpg\" alt=\"All We are saying is Give Pie a Chance\" width=\"640\" height=\"580\" srcset=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/3816813507_d4c4c489e5_b-300x271.jpg 300w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/3816813507_d4c4c489e5_b-331x300.jpg 331w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/3816813507_d4c4c489e5_b.jpg 1024w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><p id=\"caption-attachment-87\" class=\"wp-caption-text\">All We are saying is Give Pie a Chance by <a href=\"https:\/\/flic.kr\/p\/6PhaFr\">bitzcelt via flickr<\/a><\/p><\/div>\n<p>Kipp, Margaret E. I. (2005). Complementary ordiscrete contexts in online indexing: A comparison of user, creator, and intermediary keywords. <i>Canadian Journal of Information &amp; Library Sciences<\/i> 29:4, pp. 419-436.<\/p>\n<p>This article describes a study comparing descriptors assigned by different actors in the metadata creation process. 165 articles from CiteULike (a bookmarking web service\u00a0similar to de.li.cio.us) were compared based on user-provided tags, author-provided keywords, and intermediary-provided descriptors using the Voorbij scale along with structured thesauri from INSPEC and Library Literature to identify broader, narrower, and related terms. The study found that user tags are quite different from author- and intermediary-provided descriptors and can supplement a controlled vocabulary entryway to content. Additionally, providing both abbreviations and long-form terms helped to expand content use to interdisciplinary research.<\/p>\n<p>Limani, Fidan and Vladimir Radevski. (2013). Enrichment of digital libraries with Web 2.0: Resources for enhanced user search experience. <i>8th Annual South-East European Doctoral Student Conference: Infusing Research and Knowledge in South-East Europe.<\/i> South-East European Research Center: Thessaloniki, Greece, 2013. pp. 294-300.<\/p>\n<p>This article proposes connecting \u201ctraditional\u201d scientific research resources (indexed, categorized, and searchable) with scientific Web 2.0 data (socially maintained scholarly library services like blogs and wikis) by tagging those Web 2.0 data sources with authoritative links. This introduces Semantic Web connections to tie together these data sources and expose digital library collections more effectively, reducing the \u201csearch span and effort\u201d on the part of the user. [p. 299]<\/p>\n<div id=\"attachment_84\" style=\"width: 650px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-84\" class=\"wp-image-84\" title=\"key lime pie\" src=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/207708005_33b2e22777_b-300x200.jpg\" alt=\"key lime pie\" width=\"640\" height=\"427\" srcset=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/207708005_33b2e22777_b-300x200.jpg 300w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/207708005_33b2e22777_b-449x300.jpg 449w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/207708005_33b2e22777_b.jpg 1024w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><p id=\"caption-attachment-84\" class=\"wp-caption-text\">key lime pie by <a href=\"https:\/\/flic.kr\/p\/jmymv\">roboppy via flickr<\/a><\/p><\/div>\n<p>Stephens, Owen. (2011). Mashups and open data in libraries. <i>Serials: The Journal for the Serials Community<\/i> 24:3, pp. 245-250.<\/p>\n<p>Stephens argues that making data open involves more than just licensing &#8211; it should refer to \u201cthe ease with which data can be used, taking into consideration aspects such as format and access mechanisms.\u201d [p. 246] The most common ways library data is shared are via XML, JSON, and, increasingly, RDF but these \u201cformats offered are usually familiar only to those who specialize in library data.\u201d [p. 247] Offering APIs to access data makes it easier to understand and use the data, allowing mashups to occur and new ways to use data possible.<\/p>\n<p>Thomas, Marliese, Dana M. Caudle, and Cecilia M. Schmitz. (2009). To tag or not to tag? <i>Library Hi Tech, <\/i>27:3, pp. 411-434.<\/p>\n<p>This article describes a study comparing user-contributed tags to controlled vocabulary subject headings (LCSH) to identify broader, narrower, and related terms to identify new terms via the tags that can be brought in to enhance controlled vocabulary used in a system (a \u201ccollabulary\u201d). Kipp\u2019s modification of Voorbij scale was used to look at tags compared to hierarchical relationships from a thesaurus. Tagging is generally for personal use (such as finding something later) so there needs to be an incentive to create tags.<\/p>\n<div id=\"attachment_85\" style=\"width: 594px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-85\" class=\"wp-image-85 size-full\" title=\"Apple Pie\" src=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/244921874_44ec1cbfa9_o.jpg\" alt=\"Apple Pie\" width=\"584\" height=\"600\" srcset=\"https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/244921874_44ec1cbfa9_o.jpg 584w, https:\/\/juliehardesty.com\/notions\/wp-content\/uploads\/2014\/10\/244921874_44ec1cbfa9_o-292x300.jpg 292w\" sizes=\"auto, (max-width: 584px) 100vw, 584px\" \/><p id=\"caption-attachment-85\" class=\"wp-caption-text\">Apple Pie by <a href=\"https:\/\/flic.kr\/p\/nDhJW\">belochkavita via flickr<\/a><\/p><\/div>\n<p>Tillett, Barbara B. (2000). Authority control on the web. In: <i>Bicentennial Conference on Bibliographic Control for the New Millennium: Confronting the Challenges of Networked Resources and the Web (Washington DC, November 15-17, 2000)<\/i>.<\/p>\n<p>This report discusses the concept of a \u201cmandatory minimal set of data elements\u2026 in all authority records to facilitate international exchange or use\u201d [p. 5] It shows growing support for authority control to manage different sources of common metadata and the idea of common core data points for aligning and relating records from different sources.<\/p>\n<p>Voorbij, Henk J. (1998). Title keywords and subject descriptors: A comparison of subject search entries of books in the humanities and social sciences. <i>Journal of Documentation, <\/i>54:4, pp. 466-476.<\/p>\n<p>This article describes results from two studies &#8211; one where librarians compared subject descriptors and words in titles for 475 catalog records and rated them on a scale of 1 (subject is the same as the title) to 7 (subject is not at all in the title) and a second where librarians searched on subject and title words for the same topic. Findings suggest that subject descriptors enhanced recall for searches and 37% of the first study\u2019s records were enhanced by subject descriptors. [The scale used for comparison has been used in other studies (Thomas, et al., 2009; Kipp, 2006) with variations in what is being compared but focusing on comparing different types of metadata.]<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Readings on combining and exposing library data sets I feel like I&#8217;m seeing calls across a variety of subject domains\u00a0for sharing data and making it easily available and reusable. National funding models in the U.S. are beginning to require sharing &hellip; <a href=\"https:\/\/juliehardesty.com\/notions\/all-the-slices-of-pie\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3,8],"tags":[],"class_list":["post-82","post","type-post","status-publish","format-standard","hentry","category-metadata","category-readings"],"_links":{"self":[{"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/posts\/82","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/comments?post=82"}],"version-history":[{"count":10,"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/posts\/82\/revisions"}],"predecessor-version":[{"id":112,"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/posts\/82\/revisions\/112"}],"wp:attachment":[{"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/media?parent=82"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/categories?post=82"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/juliehardesty.com\/notions\/wp-json\/wp\/v2\/tags?post=82"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}