Pipeline parallelism in IBM Info Sphere Datastage:
Pipe:
Pipe is a channel through which data moves from one stage to another stage
Pipeline parallelism: It’s a technique of simultaneously processing Extraction, Transformation and, Loading
Partition Parallelism in IBM Info Sphere Datastage:
Partitioning:
Partioning is a technique of dividing the data into chunks
Data stage supports 8 types of partitions
Partioning plays a important role in data stage
Every stage in Data stage associated with default partitioning technique
Defualt partinining technique is Hash
Note:
Selection of portioning techniques is based on
1 .Data(Volume ,Type
2 .Stage
3. No of key Columns
5.Key column data type
Partitioning techniques are grouped in to two categories
1.Key Based
2.Key Less
Key Based Partitiong techniques:
1.Hash
2.Modulo
3.Range
4.DB2
Key Less Partioning techniques:
1.Random
2.Round robin
3.Entire
4.Same
Click Here for IBM Info Sphere Datastage Tutorial
Click Here for IBM Info Sphere Quality Stage in Datastage Tutorial
Click Here for IBM Info Sphere Informatio Analyzer Tutorial
Click Here for More IBM Info Sphere Datastage & Data Integration Information
Pipeline parallelism in IBM Info Sphere Datastage, Partition Parallelism in IBM Info Sphere Datastage, IBM Info Sphere Datastage Tutorial, IBM Info Sphere Quality Stage in Datastage Tutorial, IBM Info Sphere Informatio Analyzer Tutorial, IBM Info Sphere Datastage, Data Integration Information
No comments:
Post a Comment