Premium
Ingest—A simple program for performing distributed relational database operations
Author(s) -
Silberberg David
Publication year - 1992
Publication title -
software: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.437
H-Index - 70
eISSN - 1097-024X
pISSN - 0038-0644
DOI - 10.1002/spe.4380220603
Subject(s) - computer science , database , identifier , row , joins , relational database , flat file database , information retrieval , computer file , versioning file system , programming language , stub file
The Hubble Space Telescope (HST) and ground system produce approximately 600 gigabytes of data per year in the form of many types of datasets. Because of the formidable size of the total data stream, the datasets are impractical to manage with a conventional database system. Therefore, they are archived onto an optical disk juke‐box system. A smaller HST Catalog exists to describe the archived data and manage access to the HST archived data. When the Catalog keywords are queried, the resulting records point to file names stored in the archive. This allows users to request datasets by their descriptive keywords instead of file names. The Catalog is populated by data from several external databases via the Ingest program. Ingest normalizes and/or joins rows of multiple external database tables and writes the new rows to the HST Catalog. Secondarily, Ingest parses data values, translates data values, and creates row identifiers for each row to be written to the HST Catalog. The Ingest process is driven by translation tables residing in the object database which can be altered on‐the‐fly. This paper describes the design of the HST Catalog Data Ingest program in more detail. Ingest has proven to be a powerful tool in the HST environment where functional requirements are many, database structures are enormous and both evolve rapidly.