As info flows between applications and processes, it needs to be collected from many different sources, changed across sites and consolidated in one place for application. The process of gathering, transporting and processing the data is called a virtual data pipe. It usually starts with ingesting data via a supply (for case, database updates). Then it ways to its vacation spot, which may be an information warehouse just for reporting and analytics or an advanced info lake to get predictive stats or equipment learning. On the way, it goes thru a series of improve and processing steps, which can consist of aggregation, filtering, splitting, joining, deduplication and data replication.

A typical canal will also experience metadata associated with the data, which can be used to keep tabs on where this came from and just how it was highly processed. This can be intended for auditing, reliability and conformity purposes. Finally, the pipeline may be providing data as a service to others, which is often referred to as the “data as a service” model.

IBM’s family of evaluation data managing solutions comprises Virtual Info Pipeline, which provides application-centric, SLA-driven automation to improve application development and testing by decoupling the operations of test replicate data out of storage, network and storage space infrastructure. It will this simply by creating electronic copies of production info to use with respect to development and tests, even though reducing the time to provision and refresh many data replications, which can be approximately 30TB in proportion. The solution as well provides a self-service interface designed for provisioning and reclaiming virtual data.

Categories: Uncategorized