BusinessObjects Data Integrator XI R2: Accelerated: Extracting, Transforming, and Loading Data

Duration 4 Days

AudienceThis course is designed for individuals responsible for implementing ETL projects (batch-mode), administering and managing projects taht involve Data Integrator

Prerequisites
    Experience with these products or technologies will be helpful:
  • Knowledge of data warehousing and ETL concepts, SQL language
  • Experience with Microsoft SQL Server
  • Experience using functions, elementary procedural programming and flow-of-control statements, for example: If then Else and While Loop statements
  • Data Warehouse Architecture and Modeling: There Are No Guarantees
  • Architecture/Modeling: Advance Dimension Topics Surrogate Keys: It's Time for Time and Slowly Changing Dimensions
  • Architecture/Modeling-Industry-and Applicatoin-Specific Issues: Think Globally, Act Locally
  • Data Staging and Data Quality: Dealing with Dirty Data

    It is recommended you review these articles prior to attending the course: http://www.rkimball.com/html/articles.html
  • Data Warehouse Fundamental: TCO Starts with the End User and Fact Tables and Dimension Tables
  • Data Warehouse Architecture and Modeling: There Are No Guarantees
  • Architecture/Modeling: Advance Dimension Topics Surrogate Keys: It's Time for Time and Slowly Changing Dimensions
  • Architecture/Modeling- Industry-and Application-Specific Issues: Think Globally, Act Locally
  • Data Staging and Data Quality: Dealing with Dirty Data

SynopsisBusinessObjects Data Integrator XI R2 Accelerated enables the implementation of ETL (Extract, Transform, and Load) projects from disparate data sources to deliver more timely and accurate data that end users in an organization can trust. In this four-day course, you will learn about batch data transformation jobs, techniques for capturing changes in data, handling errors, mulit-user environments tasks, administering seriver and migration basics, The course also shows you how to use tools to audit, profile data, and manage metadata to assist in the implementation of an ETL project.

Activities in this course focus on the tools and features discussed in each lessons and enable you to create dimensions and a fact table, You will have a chance to apply different concepts learned from a few combined lesson in two one-hour comprehensive workshops. At the end of the course you will be able to put into practice data flow design concepts in a final two-three hour workshop.

As a business benefit, by being able to create efficient ETL projects, you can use the transformed data to help improve operational and supply chain effciences, enhance customer relationships creat new revenue opportunities, and optimize return on investment from enterprise applications.

Objectives

TopicsData Warehousing Concepts
  • Desribe dimensional modeling

    Understanding Data Integrator
  • Describe components, management tools, and the development process
  • Explain object relationship

    Defining Source and Target Metadata
  • Create a database datastore and import metadata
  • Create a new file format and handle errors in file formats

    Validating, Tracing and Debugging Jobs
  • Create a project, Job Work Flow and Data Flow with the Query transform
  • Understand the Target Table Editor

    Creating a Batch Job
  • Understand push-down operations
  • Desribe descriptions and annotations
  • Validate and trace Jobs
  • Use the View Data and the Interactive Debugger

    Using Built-in Transforms and Nested Data
  • Use the Case Merge, Validation, Data_Transfer, and Date_Generation transforms
  • Import metadata from XML and DTD documents
  • Udnerstand operations on nested data

    Using Built-in Functions
  • Define built-in functions
  • Use the Gen_Row_Num_BY_Group,Is_Group_Changed functions
  • Create Surrogates
  • Use the lookup functions to look up status in a table
  • Use match pattern functions to compare input strings to patterns

    Using Data Integrator Scripting Language and Variables
  • Explain differences between global and local variables
  • create global variables and custom functions
  • Use strings and varibale in Data Integrator scripting language

    Capturing Changes in Data
  • Use changed data capture (CDC) with time-stamped sources
  • Create and initial and delta load job
  • Use history preserving transforms

    Handling Errors and Auditing
  • Recover a failed job
  • Create a manual, recoverable work flow
  • Define audit points, rules and actions on failures

    Supporting a Mulit-user Environment
  • Describe terminology and repository types in a multi-user environment
  • Create and activate the central repository
  • Work with objects in the central repository

    Migrating Projects
  • Create multiple configurations in a datastore
  • Work with Projects in the central repository
  • Ceate a secure central repository
  • Implement and modify group permissions

    Using the Administrator
  • Add a repository and user roles
  • Set the job status interval and log retention period
  • Execute schedule, and monitor batch jobs
  • Understand acrchitecture, load balance index, and job executions in server groups

    Profiling Data
  • Set up the Data Profiler and users
  • Submit a profiling task
  • Monitor profiling tasks in the Administrator

    Managing Metadata
  • Import and export metadata
  • Use Metadata Reports