IMLS Digital Collections and Content

Photo Wall

Posted on January 7, 2010 by kfenlon

A preview of our IMLS DCC Flickr Photostream

Filed under: Flickr Feasibility Study, Interface, Uncategorized

IMLS DCC on Flickr

Posted on November 12, 2009 by kfenlon

We are excited to announce that IMLS DCC has joined Flickr. Our first set of photographs — from Flora (IL) Public Library’s Charles Overstreet Collection — has been uploaded to the photosharing portal as part of the Flickr Feasibility Study (pdf), an IMLS DCC initiative that began this past summer with the goal of increasing the availability and exposure of the rare, historical photos in our collections. We will add to our photostream weekly, so keep checking back for new photos from a variety of collections.

Slated to appear tomorrow in our photostream is a selection from Indiana University’s Charles W. Cushman Photograph Collection, featuring vibrant photos of vintage cars, Chicago’s historic Maxwell Street, and San Francisco in the 1960s.

The Springfield Aviation Company Collection (from Lincoln Library, The Public Library of Springfield, Illinois) and Mining and Mother Jones in Mount Olive (from Mount Olive (IL) Public Library) will contribute photos to our stream over the next few weeks.

Filed under: Flickr Feasibility Study

A few updates

Posted on October 28, 2009 by rjurban

Progress Report to the Chief Officers of State Library Agencies (COSLA)

This week I’ve been attending the 2009 COSLA meeting in Nevada. Thanks to IMLS DCC Advisory Board member Jim Scheppke, I was invited to provide the COSLA Networking Committee with an update on our progress that I’ve shared in the slides below.

Filed under: Collection and Item Metadata Relationships (CIMR), Dissemination

Recent papers available in IDEALS

Posted on May 8, 2009 by amyjacks

The following recent papers from the IMLS DCC team are now available in IDEALS (Illinois Digital Environment for Access to Learning and Scholarship):

Interim Performance Report for Oct. 2008 – March 2009
http://hdl.handle.net/2142/11720

Zheng, Wu (2009). Exploring Hidden Connections among Historic Images. Displayed at iConference 2009, Chapel Hill, North Carolina, Feb. 8-11, 2009.
http://hdl.handle.net/2142/9608

Filed under: Collection and Item Metadata Relationships (CIMR), Dissemination

Visualizing Opening History

Posted on May 7, 2009 by amyjacks

One of the ongoing discussions in the Digital Collection Evaluation (DiCE) subgroup is how to visualize the subject, geographic, and temporal concentrations of the entire Opening History collection. Inspired by Richard’s previous post (Putting IMLS DCC on the Map), I started with the Geomap visualization from the Google Visualization API and created a map showing the spatial coverage of Opening History collections. The Geomap visualization maps state names to locations on the map, so all I needed to do was create a SQL query for each state.

Spatial Coverage visualization
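
For readers who want to try something similar, here is a minimal sketch of how per-state counts for a map like this might be produced. It is not the code behind the map above: the table name, column names, and sample rows are all made up for illustration, and it uses a single GROUP BY query in place of one query per state.

```python
import sqlite3

# Hypothetical schema: one row per (collection, state) coverage statement.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE collection_coverage (collection_id INTEGER, state TEXT)")
conn.executemany(
    "INSERT INTO collection_coverage VALUES (?, ?)",
    # Made-up sample rows; collection 2 covers two states.
    [(1, "Illinois"), (2, "Illinois"), (2, "Indiana"), (3, "Ohio")],
)


def collections_per_state(connection):
    """Return [(state, count), ...], counting each collection once per state."""
    cursor = connection.execute(
        "SELECT state, COUNT(DISTINCT collection_id) "
        "FROM collection_coverage GROUP BY state ORDER BY state"
    )
    return cursor.fetchall()


# A header row followed by one [region, value] row per state,
# which is the general shape a geographic chart expects as input.
geomap_rows = [["State", "Collections"]] + [
    list(row) for row in collections_per_state(conn)
]
print(geomap_rows)
```

Grouping in the database keeps the number of queries constant as more states are added; the per-state approach described above gives the same counts.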

I also created a similar map of the number of collections from hosting institutions in each state.

Hosting Institutions visualization

Encouraged by the ease of creating these visualizations, I continued with the Google Visualization Column Chart to show the temporal coverage of Opening History collections. The IMLS DCC collection description application profile uses a controlled vocabulary of date ranges to indicate the temporal coverage of each collection, so I created a SQL query for each date range to determine the number of collections within that range.

Temporal Coverage visualization
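
The counting step behind a chart like this can be sketched in a few lines. The date-range labels and the sample collections below are invented for illustration and are not the actual IMLS DCC vocabulary; the point is only that a controlled vocabulary makes the tally straightforward and lets empty ranges show up as zero.

```python
from collections import Counter

# Hypothetical controlled vocabulary, listed in chronological order.
DATE_RANGES = ["1800-1849", "1850-1899", "1900-1949", "1950-1999"]

# Made-up sample: each collection lists the date ranges its materials cover.
collections = {
    "Collection A": ["1850-1899", "1900-1949"],
    "Collection B": ["1900-1949"],
    "Collection C": ["1900-1949", "1950-1999"],
}


def temporal_coverage(collection_ranges, vocabulary):
    """Count collections per date range, keeping the vocabulary's order."""
    counts = Counter(
        label
        for ranges in collection_ranges.values()
        for label in set(ranges)  # count a collection once per range
    )
    # Ranges with no collections are reported as 0 so the chart shows gaps.
    return [(label, counts.get(label, 0)) for label in vocabulary]


for label, count in temporal_coverage(collections, DATE_RANGES):
    print(label, count)
```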

The next step will be to integrate these types of visualizations into the Opening History interface so that visitors can quickly understand the geographic and temporal strengths of Opening History.

Filed under: Digital Collection Evaluation (DiCE), Interface | Tagged: collection-level metadata, Interface, Opening History, visualizations

Putting IMLS DCC on the Map

Posted on April 27, 2009 by rjurban

I recently attended the Museums & the Web 2009 conference in Indianapolis, IN. Prof. Mike Twidale and I were there to do a live patchwork prototyping demo of the IMLS DCC Collection Dashboard concept. We had a great crowd of attendees in our booth who provided us with lots of great ideas for next steps (more on that, and a similar demo we did at HASTAC III later). But I also participated in several “unconference” conversations about the semantic web and open/linked data.

At the moment, information from the IMLS DCC is only available via the website and via our OAI-PMH data providers (one for collection-level records, and another for item-level records). While these are great for sharing records between repositories, they don’t necessarily make the information that we have accessible to cool web services like Yahoo! Pipes. Mia Ridge, at the Science Museum in London (and keeper of the Museum API wiki), issued a challenge for us to DO ONE THING before April was over. So here’s my attempt at DOING ONE THING with IMLS DCC (admittedly just a baby step).
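
For anyone unfamiliar with OAI-PMH, a harvest request is an ordinary HTTP GET with a `verb` parameter. The sketch below only builds such a URL; the base URL is a placeholder and not the address of an IMLS DCC data provider, and the helper name is my own. `ListRecords`, `metadataPrefix`, `set`, and `oai_dc` are part of the protocol itself.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL (no network call is made)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        # Optional selective harvesting by set.
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


# Placeholder base URL, for illustration only.
url = list_records_url("https://example.org/oai")
print(url)
```

The response is XML meant for other repositories to harvest, which is why it is less convenient for mashup-style web services than a simple feed.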

One of the services I learned about at MW2009 is Dapper, a tool that will screenscrape HTML pages to produce various kinds of output that you can share with APIs (application programming interfaces). Dapper fits nicely within our Patchwork Prototyping toolbox, as it lets us play with some IMLS DCC data in ways that we couldn’t before, without having to actually build an IMLS DCC API first. One of the desirables that came up in both our MW2009 and HASTAC demonstrations was being able to see IMLS DCC collections on a map. So here we go…

First I screenscraped the list of IMLS DCC Collections By Title page. Dapper then allowed me to create:

I took the Atom feed and passed it to the location extractor in Yahoo! Pipes to generate a map.
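
To give a sense of what a downstream tool has to work with, here is a small sketch that reads entry titles and summaries out of an Atom feed. The feed content is invented sample data, not output from Dapper, and the function name is hypothetical; a location extractor would then look for place names in text like this.

```python
import xml.etree.ElementTree as ET

# Atom elements live in this namespace.
ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}

# Made-up sample feed, standing in for a scraped list of collections.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Collections by title (sample)</title>
  <entry>
    <title>Example Collection One</title>
    <summary>Photographs from Springfield, Illinois</summary>
  </entry>
  <entry>
    <title>Example Collection Two</title>
    <summary>Letters from Columbus, Ohio</summary>
  </entry>
</feed>"""


def entry_texts(feed_xml):
    """Return (title, summary) pairs for every entry in an Atom feed."""
    root = ET.fromstring(feed_xml)
    pairs = []
    for entry in root.findall("atom:entry", ATOM_NS):
        title = entry.findtext("atom:title", default="", namespaces=ATOM_NS)
        summary = entry.findtext("atom:summary", default="", namespaces=ATOM_NS)
        pairs.append((title, summary))
    return pairs


entries = entry_texts(SAMPLE_FEED)
for title, summary in entries:
    print(title, "|", summary)
```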

This is just a first baby step towards building other widgets for a collections dashboard! It needs some work (only a certain number of collections will appear on the map at any one time – you need to browse through the list to see more collections), but the idea behind the DO ONE THING challenge was to take some simple steps to build momentum.

A special thanks to colleague Piotr Adamczyk and his MuseumPipes blog for inspiration!

Filed under: Interface | Tagged: collections dashboard, Dapper, mw2009, Yahoo! Pipes

Opening History and the State Library of Ohio

Posted on April 23, 2009 by amyjacks

The State Library of Ohio recently added Opening History to the Ohio Digital and Special Collections section of the State Library website.

See their press release for more information about Ohio’s involvement in Opening History.

Filed under: Digital Collection Evaluation (DiCE) | Tagged: Ohio, Publicity

Patchwork Prototyping a Collection Dashboard

Posted on April 14, 2009 by rjurban

The IMLS Digital Collections and Content Interface research group is kicking off a new line of inquiry this week that will explore how we might build a “Collections Dashboard” for the DCC.

The Problem

According to user studies that we’ve conducted, users rarely find the full-text collection descriptions that we provide very helpful. The long screens of text scare them away and don’t really help them find what they are looking for. In the current iteration of the interface, if you stumble across an interesting item, it can be difficult even to find your way back to a collection-level description. The problem here seems to be that the notion of how and why collection-level descriptions are created is based on an old model that looks like this:

A Traditional Path to Items

But increasingly, the way we find things, particularly in online environments, looks more like this:

A Digital Path to Items

Nina Simon takes this notion one step further by suggesting that we increasingly come at things indirectly through our social network.

In both of the latter cases a user may lack any understanding of institutional or collection context and may be left wondering just where they’ve ended up. Because the IMLS DCC is an aggregation of other people’s metadata, orienting the user of an item towards these contexts can be even more difficult. At present the IMLS DCC contains records from more than 500 collections from 240 different repositories, for a total of more than 900,000 item-level metadata records. Simply flattening this out into a large blob of item-level metadata separates items from their contexts. (Even Google has its PageRank, which organizes what appears at the top of your results list according to each page’s place in the networked world.)

For certain kinds of users, this kind of context isn’t really what they are interested in. They’ll be happy to find an item and move on to their next search. But for the students and scholars that are our primary focus in this part of the grant, context can be a very important part of their research process. A recent study of scholars who use physical object collections, conducted by the UK’s Research Information Network (RIN), illustrates the problem nicely. Collection-level descriptions, such as those offered by the Cornucopia project, offered insufficient information to meet the scholars’ needs. But interestingly, this same set of scholars said that item-level descriptions lacked information about the contexts that make these items meaningful and valuable for their research. How can we restore that sense of item-level granularity while maintaining the rich contexts that these items come from?

A Solution

One of the main goals of the current phase of the IMLS DCC project (and particularly for the Collection-Item Metadata Relationships research group) has been to take advantage of collection-level and item-level metadata when used together as mutually supportive forms of description. For the interface group, we’ve been asking ourselves what this might mean in light of our usability studies that suggest the long textual descriptions scare people off.

What if we could provide users of the system a quick, easy way to get a 10,000 foot view of a collection? From this vantage point, individual items fall back to reveal the larger contours of a collection landscape. What are the high points? Where are there gaps? Does this look like a promising place to dig deeper for the kinds of items that will answer my research questions? What kind of landscape does this item come from? Will this collection lead me to find other things like it?

When we visit a physical collection all these kinds of information contexts come for free. We know that we’re under the dome of the Library of Congress or foraging in a tightly packed storeroom at the Early American Museum. I can walk down the ranges of my library and count off how many shelves the E 302 Collected Works of American Statesmen takes up. I can gauge how much work it will be to browse through 6 linear feet of archival materials or 600. I know it would take me days, if not weeks to tour the Louvre, but only a few hours to visit my university gallery. In our digital collections it can be hard to tell how vast, how diverse or how cohesive any one collection might be – let alone an aggregation of more than 500.

In order to do this we’ve borrowed the idea of “information dashboards” that are commonly found in enterprise settings where executives need a high-level overview of underlying processes (see Stephen Few’s book Information Dashboard Design). The Indianapolis Museum of Art was the first to apply this idea in a cultural heritage setting, but like its forebears the IMA dashboard focuses on some of the dynamic processes at work in a museum setting. For the IMLS DCC Collection Dashboard, we’d like to extend this metaphor to represent the key features of a collection in a visualization that is quick and easy to understand.

Prof. Mike Twidale and I have set up a temporary demonstration space here where our evolving prototypes will be posted. Watch this blog space for more information and for opportunities to participate virtually in the design. We would particularly like feedback and comments from scholars who use historical collections about what high-level collection features are most useful for assessing a collection’s value for your research.

You are also invited to participate at the following upcoming conference venues:

Next Post: I’ll talk about the “patchwork prototyping” method we’re using to attack this problem.

Filed under: Dissemination, Interface | Tagged: Collection Dashboard, Patchwork Prototyping

Collection / Item Metadata Relationships questions

Posted on February 24, 2009 by amyjacks

During the Spring 2009 term, the Collection/Item Metadata Relationships (CIMR) working group will be taking a bottom-up approach to looking at our relationship categories amongst the collection and item-level metadata held by the project. We would welcome any advice or feedback on these initiatives:

Testing Category Rules
~~~~~~~~~~~~~~~~~~
We intend to develop a small-scale testbed (using a sample of collection-level and item-level metadata) in order to test rules based on the categories identified by the group to date (a/v-propagation, v-propagation, v-constraint). At the moment, we are considering using Protégé since it already includes support for Dublin Core metadata, SWRL rule-sets and database connectivity. What other environments would advisory board members recommend for testing categories?

Time and Space Relationships
~~~~~~~~~~~~~~~~~~~~~~~~~~
Based on analysis completed by the DiCE Research Group, we will be focusing on relationships for Time (dates) and Space (coverage) this term, following up on earlier discussions about what it means to be temporally or geographically “within.” Preliminary work completed last fall suggests there are interesting correlations between CLD:coverage and ILD:dateCreated. This leads us to examine more complex forms of relationships between collections and items. For example, a place name may be tied to a particular time/date, requiring categories that bind multiple elements together, instead of the one-to-one relationships we’ve looked at so far. We are also looking for tools that might help us translate appropriate metadata elements from existing textual forms to other representations, e.g. translating place names into geographic coordinates that would allow reasoning about “withinness.”
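
As a rough illustration of what reasoning about “withinness” could look like once place names and dates have been translated into coordinates and year ranges, consider the sketch below. It is not a tool the group has adopted: the class names are hypothetical, the bounding box for Illinois is approximate, and a real gazetteer would supply shapes rather than rectangles.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    """A simple latitude/longitude rectangle (ignores antimeridian crossing)."""
    south: float
    west: float
    north: float
    east: float

    def contains_point(self, lat, lon):
        return self.south <= lat <= self.north and self.west <= lon <= self.east


@dataclass(frozen=True)
class YearRange:
    start: int
    end: int

    def contains(self, other):
        """True if the other range falls entirely inside this one."""
        return self.start <= other.start and other.end <= self.end


# Approximate, illustrative values only.
illinois = BoundingBox(south=36.97, west=-91.51, north=42.51, east=-87.02)
collection_coverage = YearRange(1900, 1949)

# An item made in Springfield, IL, in 1925 is spatially and temporally "within".
print(illinois.contains_point(39.80, -89.65))               # True
print(collection_coverage.contains(YearRange(1925, 1925)))  # True

# An item from Indianapolis, IN, falls outside the box to the east.
print(illinois.contains_point(39.77, -86.16))               # False
```

Binding a place to a time, as in the example in the paragraph above, would mean testing both conditions together instead of each element on its own.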

Naturally, the CIDOC-CRM has much to offer on the topic of time and place. Are there ways we could use it to our advantage when working with the time-space elements found in Dublin Core?

CIMR Best Practice Recommendations
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Currently, collection-level and item-level metadata are created without much consideration of whether they could be used in mutually supportive ways. What lessons learned from the CIMR working group research can inform other guiding documents, such as the Best Practices for OAI Data Provider Implementations and Shareable Metadata (http://webservices.itcs.umich.edu/mediawiki/oaibp/?PublicTOC)?

Filed under: Collection and Item Metadata Relationships (CIMR)

Metasearch questions

Posted on February 23, 2009 by amyjacks

One of the objectives of the IMLS DCC project is to explore ways to more effectively integrate the metadata records we harvest from libraries, museums and archives with other digital content resources. We especially are striving to bridge the gap between primary source digitized content (most of the metadata we aggregate describes such content) and related secondary resources available in digital format (e.g., contemporary journal literature on the same or related topics, much of which is licensed). We are exploring a variety of tools and services that can facilitate this process, with particular attention to metasearch technologies. We’d be especially interested in your advice on how to make further advances in our research on this front.

So far, building on prior UIUC Library metasearch research and applications (e.g., our EasySearch application; see http://search.grainger.uiuc.edu/searchaid/easy_search_summary.html), we have added limited metasearch functionality to our Opening History portal (http://imlsdcc.grainger.uiuc.edu/history/). On this portal, when you do an item-level search, we return search results for your query from Academic Search Premier, America: History and Life, (Elsevier) Scopus, and Google Book (when not blocked). A few specific issues on which we’d appreciate your comments:

1. What other targets would be relevant for us to pull in through metasearch, especially those indexing digital secondary sources relevant to the portal’s topic thrust (American history)? We’re also very interested in other portals, repositories, or resources recognized in various communities, e.g., museums and historical societies covering American history, which we might be able to tap (even if we have to do some screen scraping).

2. Should we consider adding metasearch functionality at other levels of the portal – e.g., to allow users looking at full records to do metasearches not of their original query, but of terms and indexing in found records?

3. Beyond standards-based Z39.50, SRU/SRW, and XML Gateway implementations of metasearch functionality, what other similar kinds of services should we be looking to exploit?

4. Should we look at making our metadata aggregation a metasearch target for other portals? Again, are there existing portals in this domain that seek to exploit resources like ours? If so, which metasearch service protocols would be most critical, and how might we register or otherwise advertise availability?

5. What else, in terms of the services we support or exploit, should we be looking at to improve our integration with other relevant digital repositories and portals?
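
As background for question 3, a standards-based SRU search is just an HTTP GET carrying a CQL query. The sketch below builds such a request URL; the base URL is a placeholder and not an endpoint we operate, and the helper name is hypothetical. The `operation`, `version`, `query`, and `maximumRecords` parameters come from the SRU specification.

```python
from urllib.parse import urlencode


def sru_search_url(base_url, cql_query, maximum_records=10):
    """Build an SRU searchRetrieve request URL (no network call is made)."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",
        "query": cql_query,  # a CQL query string
        "maximumRecords": maximum_records,
    }
    return base_url + "?" + urlencode(params)


# Placeholder endpoint and an illustrative Dublin Core title search.
search_url = sru_search_url("https://example.org/sru", 'dc.title = "coal mining"')
print(search_url)
```

Exposing our own aggregation as a target (question 4) would mean answering requests of this shape as well as sending them.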

Filed under: Metasearch