BusinessObjects Data Integrator XI R2: Accelerated: Extracting, Transforming, and Loading Data
|
|
| Audience | This course is designed for individuals responsible for implementing ETL projects (batch-mode), administering and managing projects taht involve Data Integrator |
| Prerequisites | Experience with these products or technologies will be helpful:
- Knowledge of data warehousing and ETL concepts, SQL language
- Experience with Microsoft SQL Server
- Experience using functions, elementary procedural programming and flow-of-control statements, for example: If then Else and While Loop statements
- Data Warehouse Architecture and Modeling: There Are No Guarantees
- Architecture/Modeling: Advance Dimension Topics Surrogate Keys: It's Time for Time and Slowly Changing Dimensions
- Architecture/Modeling-Industry-and Applicatoin-Specific Issues: Think Globally, Act Locally
- Data Staging and Data Quality: Dealing with Dirty Data
It is recommended you review these articles prior to attending the course: http://www.rkimball.com/html/articles.html - Data Warehouse Fundamental: TCO Starts with the End User and Fact Tables and Dimension Tables
- Data Warehouse Architecture and Modeling: There Are No Guarantees
- Architecture/Modeling: Advance Dimension Topics Surrogate Keys: It's Time for Time and Slowly Changing Dimensions
- Architecture/Modeling- Industry-and Application-Specific Issues: Think Globally, Act Locally
- Data Staging and Data Quality: Dealing with Dirty Data
|
| Synopsis | BusinessObjects Data Integrator XI R2 Accelerated enables the implementation of ETL (Extract, Transform, and Load) projects from disparate data sources to deliver more timely and accurate data that end users in an organization can trust. In this four-day course, you will learn about batch data transformation jobs, techniques for capturing changes in data, handling errors, mulit-user environments tasks, administering seriver and migration basics, The course also shows you how to use tools to audit, profile data, and manage metadata to assist in the implementation of an ETL project. Activities in this course focus on the tools and features discussed in each lessons and enable you to create dimensions and a fact table, You will have a chance to apply different concepts learned from a few combined lesson in two one-hour comprehensive workshops. At the end of the course you will be able to put into practice data flow design concepts in a final two-three hour workshop. As a business benefit, by being able to create efficient ETL
projects, you can use the transformed data to help improve operational and supply chain effciences, enhance customer relationships creat new revenue opportunities, and optimize return on investment from enterprise applications. |
| Topics | Data Warehousing Concepts Desribe dimensional modeling Understanding Data Integrator Describe components, management tools, and the development process Explain object relationship Defining Source and Target Metadata Create a database datastore and import metadata Create a new file format and handle errors in file formats Validating, Tracing and Debugging Jobs Create a project, Job Work Flow and Data Flow with the Query transform Understand the Target Table Editor Creating a Batch Job Understand push-down operations Desribe descriptions and annotations Validate and trace Jobs Use the View Data and the Interactive Debugger Using Built-in Transforms and Nested Data Use the Case Merge, Validation, Data_Transfer, and Date_Generation transforms Import metadata from XML and DTD documents Udnerstand operations on nested data Using Built-in Functions Define built-in functions Use the Gen_Row_Num_BY_Group,Is_Group_Changed functions Create Surrogates Use the lookup functions to look up status in a table Use match pattern functions to compare input strings to patterns Using Data Integrator Scripting Language and Variables Explain differences between global and local variables create global variables and custom functions Use strings and varibale in Data Integrator scripting language Capturing Changes in Data Use changed data capture (CDC) with time-stamped sources Create and initial and delta load job Use history preserving transforms Handling Errors and Auditing Recover a failed job Create a manual, recoverable work flow Define audit points, rules and actions on failures Supporting a Mulit-user Environment Describe terminology and repository types in a multi-user environment Create and activate the central repository Work with objects in the central repository Migrating Projects Create multiple configurations in a datastore Work with Projects in the central repository Ceate a secure central repository Implement and modify group permissions Using the Administrator Add a repository and user roles Set the job status interval and log retention period Execute schedule, and monitor batch jobs Understand acrchitecture, load balance index, and job executions in server groups Profiling Data Set up the Data Profiler and users Submit a profiling task Monitor profiling tasks in the Administrator Managing Metadata Import and export metadata Use Metadata Reports |
|