Open main menu

In information technology, a repository is "a central place in which an aggregation of data is kept and maintained in an organized way, usually in computer storage."[1] It "may be just the aggregation of data itself into some accessible place of storage or it may also imply some ability to selectively extract data."[1]

Universal digital libraryEdit

The concept of a Universal digital library was described as "within reach" by a 2012 LA Times article[2] ("A universal digital library is within reach") which told about Google's attempts to "mass-digitize" what are termed "orphan works" - out-of-print copyrighted works.

The U.S. Copyright Office and the European Union have been working on this. Google has reached agreements in France which "lets the publisher choose which works can be scanned or sold." By contrast, Google has been trying in the USA for a "free to digitize and sell any works unless the copyright holders opted out" deal and is still unsuccessful.[3]

Information repositoryEdit

Attempts to develop what was called an "information repository" have been underway for decades:

  • In 1989 IBM tried to have OfficeVision combine mainframes and PCs to enable "an information repository."[4]
  • In 2003 Microsoft introduced OneNote as an extension to Microsoft Office 2003 which would support "a personal information repository."[5]
  • In 1996 an 1898-founded library obtaiend additional funding to expand its mission, and become a major "local resource center and regional information repository."[6] The New York Times described them as "the second largest in the New York City region, second only to the New York Public Library on Fifth Avenue." Their services include "a computer information center devoted to outside-item requests."

Federated information repositoryEdit

A federated information repository is an easy way to deploy a secondary tier of data storage that can comprise multiple, networked data storage technologies running on diverse operating systems, where data that no longer needs to be in primary storage is protected, classified according to captured metadata, processed, de-duplicated, and then purged, automatically, based on data service level objectives and requirements. In federated information repositories, data storage resources are virtualized as composite storage sets and operate as a federated environment.[7]

Federated information repositories were developed to mitigate problems arising from data proliferation and eliminate the need for separately deployed data storage solutions because of the concurrent deployment of diverse storage technologies running diverse operating systems. They feature centralized management for all deployed data storage resources. They are self-contained, support heterogeneous storage resources, support resource management to add, maintain, recycle, and terminate media, track of off-line media, and operate autonomously.[8]

Automated data managementEdit

Since one of the main reasons for the implementation of an federated nformation repository is to reduce the maintenance workload placed on IT staff by traditional data storage systems, federated information repositories are automated. Automation is accomplished via policies that can process data based on time, events, data age, and data content. Policies manage the following:

  • File system space management
  • Irrelevant data elimination (mp3, games, etc.)
  • Secondary storage resource management

Data is processed according to media type, storage pool, and storage technology.

Because federated information repositories are intended to reduce IT staff workload, they are designed to be easy to deploy and offer configuration flexibility, virtually limitless extensibility, redundancy, and reliable failover.

Data recoveryEdit

Federated information repositories feature robust, client based data search and recovery capabilities that, based on permissions, enable end users to search the information repository, view information repository contents, including data on off-line media, and recover individual files or multiple files to either their original network computer or another network computer.[8]

ReferencesEdit

  1. ^ a b Rouse, Margaret (April 2005). "Definition: repository". whatis.com. TechTarget. Retrieved 1 May 2019.
  2. ^ {{cite newspaper |newspaper=[[The Los Angeles Times |url=https://www.latimes.com/opinion/la-xpm-2012-may-01-la-oe-samuelson-google-books-and-copyright-20120501-story.html |title=A universal digital library is within reach |author=Pamela Samuelson |date=May 1, 2012}}
  3. ^ Eric Pfanner (August 25, 2011). "In France, Publisher and Google Reach Deal". The New York Times.
  4. ^ "IBM software to integrate systems". The New York Times. May 17, 1989.
  5. ^ John Markoff (December 11, 2003). "For Doodlers and Pack Rates, a Multi-Media Binder". NYTimes.com.
  6. ^ F. Romall (May 12, 1996). "Mt. Vernon Library Marks Its 100th Year". The New York Times.
  7. ^ Armstrong, Mark (9 August 2007). "Benefits of a Federated Information Repository as a Secondary Storage Tier". SNIA Enterprise Information World 2007 Conference. Storage Networking Industry Association (SNIA). Retrieved 1 May 2019.
  8. ^ a b "Area Under Surveillance". SoleraTec. SoleraTec LLC. 2019. Phoenix RSM: (Record, Store, Manage), Surveillance Video Management (information repository), Ultra-fast Search and Playback (content-based search queries). Retrieved 6 May 2019.