Overview of Talend
Talend is an open-source data integration tool for big data. It delivers a single open platform for data integration across cloud and on-premises environments.
- This course will help you learn Talend schemas and metadata, data validation, data mapping with tMap, Java in Talend, context variables, database and file handling, debugging, and deployment. You will work on real-world projects.
Duration
3 Days
Prerequisite for Talend
An understanding of how data works and what it can do for an organization.
Course Outline for Talend
Introduction
- Installing the software
- Enabling tHashInput and tHashOutput
Metadata and Schemas
- Schema Metadata
- Schemas
- Repository Schemas
- Generic Schemas
- Shared Schemas
- Fixed Schemas and Columns
- Hand-cranking a built-in Schema
- Date Patterns
- Nullable Elements
- Propagating Schema Changes
- Creating Schemas from Metadata and Lists
Validating Data
- Enabling and Disabling Reject Flows
- Validating against the Schema
- Rejecting rows using tMap
- Checking a column
- Creating validation rules
- Creating binary error codes
- Decoding the Error Code
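
The binary error code topics above can be illustrated with a minimal plain-Java sketch. It assumes each validation rule is assigned one bit, so that a single integer column can record every rule a row failed. The class name, rule names, and thresholds are hypothetical and are not part of Talend.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not a Talend API: each validation rule owns one bit.
final class ErrorCodes {
    static final int MISSING_NAME = 1;      // bit 0
    static final int INVALID_AGE = 1 << 1;  // bit 1
    static final int BAD_DATE = 1 << 2;     // bit 2

    // Combine every failed rule for one row into a single integer.
    static int encode(String name, Integer age, boolean dateOk) {
        int code = 0;
        if (name == null || name.trim().isEmpty()) code |= MISSING_NAME;
        if (age == null || age < 0 || age > 150) code |= INVALID_AGE;
        if (!dateOk) code |= BAD_DATE;
        return code;
    }

    // Turn the integer back into one readable message per set bit.
    static List<String> decode(int code) {
        List<String> messages = new ArrayList<>();
        if ((code & MISSING_NAME) != 0) messages.add("name is missing");
        if ((code & INVALID_AGE) != 0) messages.add("age is invalid");
        if ((code & BAD_DATE) != 0) messages.add("date is invalid");
        return messages;
    }
}
```

Because each rule maps to its own bit, a row that fails several checks still produces one code, and the reject flow can carry that code in a single column.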
Mapping Data
- The tMap Component
- Single line of code
- Batch versus real time
- Creating tMap Expressions
- Testing Expressions
- Expression Editor
- Getting around the one line limitation
- Using the ternary operator
- Using intermediate variables
- Filtering input rows
- Splitting an input row
- Joining Data
- Hierarchical Joins
- Using Reload
- Loading the data into memory
- The globalMap Key
- The Where clause
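
As a rough illustration of the single-line and ternary operator topics above: a tMap expression is a single Java expression, so conditional logic is usually written with the ternary operator rather than an if statement. The sketch below wraps two such expressions in methods so they can be run outside Talend. The Row class and its fields are hypothetical stand-ins for a generated input row.

```java
// Minimal sketch; Row is a hypothetical stand-in for a generated input row.
final class TMapExpressionSketch {
    static final class Row {
        final String name;
        final Integer quantity;

        Row(String name, Integer quantity) {
            this.name = name;
            this.quantity = quantity;
        }
    }

    // One expression: guard against null, then trim and upper-case.
    static String cleanName(Row row1) {
        return row1.name == null ? "UNKNOWN" : row1.name.trim().toUpperCase();
    }

    // Chained ternaries replace an if / else-if / else ladder in one expression.
    static String sizeBand(Row row1) {
        return row1.quantity == null ? "NONE"
                : row1.quantity >= 100 ? "BULK"
                : "STANDARD";
    }
}
```

Only the expression after each return would go into a tMap output column. Longer chains tend to be easier to read when split across intermediate variables.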
Using Java in Talend
- Performing logic using tJava
- Setting the context using tJava
- Adding complex logic using tJavaRow
- Creating pseudo components using tJavaFlex
- Creating custom functions using code routines
- Importing JAR files to use external Java classes
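
A code routine is, in essence, a Java class of static methods that can be called from component expressions. The following is a minimal sketch of what such a routine might look like. The class and method names are hypothetical, and the package declaration that Talend Studio generates for routines is omitted so the example compiles on its own.

```java
// Hypothetical routine; not shipped with Talend.
final class MaskRoutines {
    // Replace all but the last `visible` characters with '*'.
    // Null input, a negative count, or a short value is returned unchanged.
    static String maskAllButLast(String value, int visible) {
        if (value == null || visible < 0 || value.length() <= visible) {
            return value;
        }
        int hidden = value.length() - visible;
        StringBuilder masked = new StringBuilder(value.length());
        for (int i = 0; i < hidden; i++) {
            masked.append('*');
        }
        masked.append(value.substring(hidden));
        return masked.toString();
    }
}
```

An expression could then call it as MaskRoutines.maskAllButLast(row1.cardNumber, 4), where row1.cardNumber is an assumed column name.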
Managing Context Variables
- Transportable Code
- Context Variables
- Common values in contexts
- Passing command line parameters
- Creating a context group
- Database Context Variables
- Creating a context group
- Context Types
- Variable values using tree mode
- Adding context group to the job
- Adding contexts to context group
- Using tContextLoad
- Print Operations
- Using implicit context
- Setting the context file location
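
The idea behind context files and command-line parameters can be sketched in plain Java: load default key=value pairs from a file, then let values passed at launch override them. This is only an illustration of the precedence, not how Talend implements it. The class name and the variable names db_host and db_port are assumptions. The --context_param flag is the one used by exported job launch scripts.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Illustrative only: file defaults first, command-line overrides second.
final class ContextSketch {
    static Properties load(String fileContents, String[] args) {
        Properties context = new Properties();
        try {
            // Parse key=value lines, as found in a simple context file.
            context.load(new StringReader(fileContents));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // Apply every "--context_param key=value" pair on top of the defaults.
        for (int i = 0; i + 1 < args.length; i++) {
            if ("--context_param".equals(args[i])) {
                String[] pair = args[i + 1].split("=", 2);
                if (pair.length == 2) {
                    context.setProperty(pair[0], pair[1]);
                }
                i++; // skip the value that was just consumed
            }
        }
        return context;
    }
}
```

Keeping environment-specific values outside the job in this way is what makes the same code transportable between development, test, and production.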
Working with Databases
- Setting up Database Connection
- Always create database connections
- Connection names
- Importing the table schemas
- Adding a simple calculated measure
- Reading from Database Files
- Multiple tables and complex queries
Managing Files
- Appending records to a file
- Concatenating Files
- Regular Expression
- Reading headers and trailers
- Validation subjob
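
As one possible illustration of the regular expression and header/trailer topics above, the sketch below checks that a trailer line's record count matches the number of detail lines in a file. The layout is an assumption made for the example: header lines start with H|, detail lines with D|, and the trailer has the form T|<count>.

```java
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Assumed layout: "H|..." header, "D|..." detail rows, "T|<count>" trailer.
final class TrailerCheck {
    private static final Pattern TRAILER = Pattern.compile("^T\\|(\\d{1,9})$");

    // True when the last line is a trailer whose count equals the detail rows.
    static boolean trailerMatches(List<String> lines) {
        if (lines.size() < 2) {
            return false;
        }
        Matcher m = TRAILER.matcher(lines.get(lines.size() - 1));
        if (!m.matches()) {
            return false;
        }
        long expected = Long.parseLong(m.group(1));
        long actual = lines.stream().filter(l -> l.startsWith("D|")).count();
        return expected == actual;
    }
}
```

A validation subjob could run a check of this kind before the main load, so that a truncated file is rejected early.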
Debugging, Logging, and Testing
- Finding the location of compilation/execution errors
- Using the Talend debugging mode
- Using the Java debugger
- Using tLogRow/tJavaRow
- Dumping the console output
- Creating test data (Simple/Complex/Random)
- Testing logic
- Killing a job
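
For the random test data topic, one useful property is repeatability: seeding the generator means a failing test can be rerun with identical input. The following is a small sketch in plain Java, assuming semicolon-delimited rows of id, name, and amount. The class name and sample values are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Generates repeatable pseudo-random rows: the same seed gives the same rows.
final class TestDataSketch {
    static List<String> generate(long seed, int rows) {
        Random random = new Random(seed);
        String[] names = {"Ada", "Grace", "Linus", "Edsger"};
        List<String> out = new ArrayList<>();
        for (int id = 1; id <= rows; id++) {
            String name = names[random.nextInt(names.length)];
            int amount = random.nextInt(1000); // 0 to 999 inclusive
            out.add(id + ";" + name + ";" + amount);
        }
        return out;
    }
}
```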
Deploying and Scheduling Talend Code
- Executable Code
- Managing job dependencies
- Using a different context
- Adding command line parameters
- Adding error checks
- Restartability
- Returning codes from a child job
- Executing non-Talend objects