Pipeline & Partition Parallelism in IBM Info Sphere Datastage - Pharma Jobs

Sunday, January 19, 2014

Pipeline & Partition Parallelism in IBM Info Sphere Datastage


Pipeline parallelism in IBM Info Sphere Datastage:
Pipe:
Pipe is a channel through which data moves from one stage to another stage

Pipeline parallelism: It’s a technique of simultaneously processing Extraction, Transformation and, Loading

Partition Parallelism in IBM Info Sphere Datastage:
Partitioning:
Partioning is a technique of dividing the data into chunks
Data stage supports 8 types of partitions
Partioning plays a important role in data stage
Every stage in Data stage associated with default partitioning technique
Defualt partinining technique is Hash

Note:
Selection of portioning techniques is based on
1 .Data(Volume ,Type                                              
2 .Stage
3. No of key Columns
5.Key column data type

 Partitioning techniques are grouped in to two categories
1.Key Based
2.Key Less

Key Based Partitiong techniques:
1.Hash
2.Modulo
3.Range
4.DB2

Key Less Partioning techniques:
1.Random
2.Round robin
3.Entire

4.Same







Pipeline parallelism in IBM Info Sphere Datastage, Partition Parallelism in IBM Info Sphere Datastage, IBM Info Sphere Datastage Tutorial, IBM Info Sphere Quality Stage in Datastage Tutorial, IBM Info Sphere Informatio Analyzer Tutorial, IBM Info Sphere Datastage,  Data Integration Information

No comments:

Post a Comment