What Is SSIS
What Is SSIS
What Is SSIS
What is SSIS?
o SSIS stands for SQL Server Integration Services.
o It is a component available in the Microsoft SQL Server database software used to
perform a wide range of integration tasks.
o It is a data warehousing tool used for data extraction, loading the data into another
database, transformations such as cleaning, aggregating, merging data, etc.
o SSIS tool also contains the graphical tools and window wizards workflow functions
such as sending email messages, ftp operations, data sources.
o SSIS is used to perform a wide range of transformation and integration tasks. As a
whole, the SSIS tool is used in data migration.
o Data Modelling: In Data Modelling, you need first to create the data model and
perform operations on it.
o Data Profiling: Data Profiling is a process which is used to check the errors,
inconsistency, or variations in the available data. Data Profiling ensures the data
quality where data quality refers to the accuracy, consistency, and completeness of
data.
Why SSIS?
SSIS is used because of the following reasons:
o Data can be loaded in parallel to many varied destinations
SSIS is used to combine the data from multiple data sources to generate a single
structure in a unified view. Basically, it is responsible for collecting the data,
extracting the data from multiple data sources, and merging into a single data
source.
o Integration with other products
SSSIS tool provides tight integration with other products of Microsoft.
o Cheaper than other ETL tools
SSSIS tool is cheaper than most of the other tools. It can resist with other base
products, their manageability, business intelligence, etc.
o Complex error handling within dataflows
SSSIS allows you to handle the complex error within a dataflow. You can start and
stop the dataflow based on the severity of the error. You can even send an email to
admin when some error occurs. When an error is resolved, then you can pick the
path in between the workflow.
History of SIS
Prior, to SSIS, SQL Server, Data Transformation Services (DTS) was used, which
was part of SQL Server 7 and 2000
Version Detail
SQL Server 2012 It was the biggest release for SSIS. With this
version, the concept of the project
deployment model introduced. It allows
entire projects, and their packages are
deployed to a server, in place of specific
packages.
SQL Server 2014 In this version, not many changes are made
for SSIS. But new sources or transformations
were added which was done by separate
downloads through CodePlex or the SQL
Server Feature Pack.
Operational data
Operational data is a database used to integrate the data from multiple data sources to
perform additional operations on the data. It is the place where the data is housed for
current operation before sending to the data warehouse for storing, reporting, or archiving.
ETL
o ETL is the most important process in SSIS tool. ETL is used to Extract, Transform,
and Load the data into a data warehouse.
o ETL is a process responsible for pulling out the data multiple data sources,
transforming the data into useful data, and then storing the data into a data
warehouse. The data can be in any format xml file, flat file, or any database file.
o It also ensures that the data stored in the data warehouse is relevant, accurate, high
quality, and useful to the business users.
o It can be easily accessed so that the data warehouse can be used effectively and
efficiently.
o It also helps the organization to make data-driven decisions by retrieving the
structured and unstructured data from multiple data sources.
Follow the below steps to install the SQL Server Data tools:
Step 2: When you click on the above link, the screen appears shown below:
In the above screen, select the version of SSDT that you want to install.
Step 3: Once the downloading is completed, run the downloaded file. When you run the
downloaded file, the screen appears which is shown below:
Step 4: Click on the Next button.
Step 5: Select the visual studio instance and the tools that you want to install in the visual
studio 2017.
Step 6: Click on the Install button.
SSIS Architecture
Following are components of SSIS architecture:
1.Control Flow
Control flow is a brain of SSIS package. It helps you to arranges the order of
execution for all its components. The components contain containers and tasks which
are managed by precedence constraints.
2.Precedence Constraints
3.Task
4.Containers
The container is units for grouping tasks together into units of work. Apart from
offering visual consistency, it also allows you to declare variables and event handlers
A Sequence Container
A For Loop Container
Foreach Loop Container
Sequence Container: allows you to organize subsidiary tasks by grouping them, and
allows you to you apply transactions or assign logging to the container.
For each Loop Container: It also allows looping. But the difference that instead of
using a condition expression, loop s done over a set of objects, likes files in a folder.
5.Data Flow
The main use of the SSIS tool is to extract data into the server's memory, transform it,
and write it to another destination. If Control Flow is the brain, Data Flow is the heart
of SSIS
6.Packages
A package can help you to saves files onto a SQL Server, in the msdb or package
catalog database. It can save as a .dtsx file, which is a structured file very similar to
.rdl files are to Reporting Services.
7.Parameters
Parameters behave much like a variable but with a few main exceptions. It can be set
outside the package easily. It can be designated as values that must be passed in for
the package to start.
Data Flow Task This task can read data from one or more
sources. Transform the data when it is in the
memory and write it out against one or more
destinations.
Analysis Services Processing Task Use this task to process objects of a Tabular
model or as an SSAS cube.
Execute Package Task Use can use this SSIS task to execute other
packages from within the same project.
Execute Process Task With the help of this task, you can specify
command line parameters.
Script Task This is a blank task. You can write NET code
which performs any task; you want to
perform.
Send Mail Task You can send an email to notifying users that
your package has is finished, or some error
occurs.
Bulk Insert Task Use can loads data into a table by using the
bulk insert command.
WMI Event Watcher Task This task allows the SSIS package to wait for
and respond to certain WMI events.
Summary
SQL Server Integration Service (SSIS) is a component of the Microsoft
SSIS can be used to conduct a wide range of data integration tasks
SSIS tool helps you to merge data from various data stores
Important versions of SQL Server Integration Service are 2005, 2008, 2012,
2014 and 216
Studio Environments, Relevant data integration functions, and Effective
implementation speed are some important features of SSIS
Control Flow, Data Flow, Event Handler, Package Explorer, and Parameters
are essential SSIS architecture components
Execute SQL Task, Data Flow Task, Analysis Services Processing Task,
Execute Package Task, Execute Process Task, File System Task, FTP Tasks,
Send Mail Task, Web Service Task are some important
Broad documentation and support
The biggest drawback of SSIS is that it lacks support for alternative data
integration styles
SAP Data Services, SAS Data Management, Oracle Warehouse Builder
(OWB), PowerCenter Informatica, IBM Infosphere Information Server
SSIS is an in-memory pipeline. Therefore, it's essential to make sure that all
transformations occur in memory