File Format Analysis and Preservation Planning for Born Digital Collections
Note: There have been new actions to this contract opportunity. To view the most recent action, please click here.
Looking for contract opportunity help?
APEX Accelerators are an official government contracting resource for small businesses. Find your local APEX Accelerator (opens in new window) for free government expertise related to contract opportunities.
APEX Accelerators are funded in part through a cooperative agreement with the Department of Defense.
The APEX Accelerators program was formerly known as the Procurement Technical Assistance Program (opens in new window) (PTAP).
General Information
- Contract Opportunity Type: Solicitation (Original)
- Original Published Date: Apr 08, 2022 02:09 pm EDT
- Original Date Offers Due: Apr 25, 2022 04:00 pm EDT
- Inactive Policy: 15 days after date offers due
- Original Inactive Date: May 10, 2022
- Initiative:
- None
Classification
- Original Set Aside:
- Product Service Code: DF01 - IT AND TELECOM - IT MANAGEMENT SUPPORT SERVICES (LABOR)
- NAICS Code:
- 541512 - Computer Systems Design Services
- Place of Performance: Washington , DC 20540USA
Description
Digital Collections Management and Services requires contract support to analyze the technical characteristics of complex heterogeneous eBook and eJournal content accessible in the Library of Congress’ (LOC/ Library) onsite access platform Stacks to inform preservation planning. This content is published, born digital material acquired from a wide range of publishers through the Cataloging in Publication Program (CIP), and Copyright Deposit through the U.S. Copyright Office.
The contractor shall analyze the technical characteristics of complex heterogeneous eBook and eJournal content accessible in the Library’s onsite access platform Stacks to inform preservation planning. Using specialized tools such as Apache Tika, this research helps understand the structure and composition of over 50,000 ePub files, 100,000 PDF files, and a small number of XML/ONIX for Books, JATS and HTML files. Many of these sets of files contain embedded data such as audio, video and other interactive features that are not fully transparent. This research will inform action plans for access and preservation.
The deliverables from the project shall include:
- Comparison matrixes for characterization tools and tools for rendering for eBook and eJournal supporting formats
- Gap analysis for unmet needs in tools for specific formats
- Report detailing process and outcomes with LOC-focused and community wide recommendations
- Meeting (method to be determined) with LOC staff and selected community members to discuss project results and recommendations
Attachments/Links
Contact Information
Contracting Office Address
- 101 Independence Ave SE LA 325
- Washington , DC 20540
- USA
Primary Point of Contact
- Riley Leonhardt
- rleonhardt@loc.gov
- Phone Number 2029406833
Secondary Point of Contact
- Michael F. Schuman
- mschuman@loc.gov
- Phone Number 2027071088
History
- May 10, 2022 11:56 pm EDTSolicitation (Updated)
- Apr 08, 2022 02:09 pm EDTSolicitation (Original)