Sunday, 23 February 2014

Talend Tutorial, A brief Idea

 Concepts for Talend Open Studio

What is a repository? A repository is the storage location for Talend Open Studio for Data Integration uses to gather data related to all of the technical items that you use either to describe business models or to design Jobs.

What is a project? Projects are structured collections of technical items and their associated metadata. All of the Jobs and business models you design are organized in Projects.

What is a workspace? A workspace is the directory where you store all your project folders. You need to have one workspace directory per connection (repository connection). Talend Open Studio for Data Integration enables to connect to different workspace directories, if you do not want to use the default one.


What is a Job? A Job is a graphical design, of one or more components connected together, that allows you to set up and run dataflow management processes. It translates business needs into code, routines and programs. Jobs address all of the different sources and targets that you need for data integration processes and all other related processes.

What is a component? A component is a preconfigured connector used to perform a specific data integration operation, no matter what data sources you are integrating: databases, applications, flat files, Web services, etc.

A component can minimize the amount of hand-coding required to work on data from multiple, heterogeneous sources.
Components are grouped in families according to their usage and displayed in the Palette of the Talend Open Studio for Data Integration main window.

What is an item? An item is the fundamental technical unit in a project. Items are grouped, according to their types, as: Job Design, Business model, Context, Code, Metadata, etc. One item can include other items. For example, the business models and the Jobs you design are items, metadata and routines you use inside your Jobs are items as well.