Home » Posts tagged 'zanran'
Tag Archives: zanran
I’ve always appreciated the Office of Scientific and Technical Information (OSTI) for the way they manage and give access to .gov scientific information – and for the fact that they’ve long shared their metadata freely and allowed libraries to download their MARC records in bulk. Now they’ve added *another* feature to their search for which I’ve long wished; OSTI has added a figure and table search to their engine! Now if we could only get GPO to add this feature to GOVINFO. Imagine having a zanran-style search for tabular data and images in all govt documents?!
Thanks again OSTI!
OSTI.GOV has introduced a search for figure and table images included in DOE’s collection of scientific and technical information. This innovative new feature allows users to search for and retrieve documents as usual, but the associated images are also retrieved, and can be viewed with the corresponding document or in a separate tab for images only. Currently, over 5,000 documents have been mined for images, resulting in over 41,000 available for searching.
To populate and power this image search, relevant figures and tables are extracted from full-text documents using modified open-source software, and then the images and associated metadata are carefully curated in-house to make them findable at OSTI.GOV. Emphasis has been placed on extracting visual materials from some of the newest full-text records in OSTI.GOV, specifically journal articles accepted manuscripts that have recently been released from embargo.