EXTRACT
TRANSFORMATION LOAD
Extract Transformation Load (3) (1).pptx
ETL (extract, transform, load) is a data integration process that
combines, cleans and organizes data from multiple sources into a single,
consistent data set for storage in a data warehouse, data lake or other
target system.
ETL pipelines are often used by organizations to:
• Extract data from legacy systems
• Cleanse the data to improve data quality and establish consistency
• Load data into a target database
Functions of ETL
• Reporting and dashboards: share key performance indicators (KPIs)
with decision makers.
• Forecasting: project future sales, demand, and maintenance
requirements.
• Visualization: provide a visual way to interact with data and
uncover new insights.
Architecture
The ETL function lies at the core of business intelligence systems.
With ETL, enterprises can obtain historical, current, and predictive
views of real business data. Let’s look at some ETL features that are
necessary for business intelligence.
How does ETL work?
ETL systems are designed to accomplish three complex database
functions: extract, transform and load.
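As a rough illustration of how the three functions chain together, here is
a minimal sketch using only the Python standard library. The source data,
the sales table, and the function names are assumptions made up for this
example; they are not part of any particular ETL tool.

```python
import csv
import io
import sqlite3

# Hypothetical source data; a real pipeline would read files, APIs, or databases.
SOURCE_CSV = "id,name,amount\n1, alice ,10.5\n2,BOB,\n"

def extract(text):
    """Extract: read raw rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: trim and normalize names, default missing amounts to 0."""
    return [
        (int(r["id"]), r["name"].strip().title(), float(r["amount"] or 0))
        for r in rows
    ]

def load(conn, records):
    """Load: write the cleaned records into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(id INTEGER PRIMARY KEY, name TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()

# An in-memory SQLite database stands in for the warehouse.
conn = sqlite3.connect(":memory:")
load(conn, transform(extract(SOURCE_CSV)))
rows = conn.execute("SELECT id, name, amount FROM sales ORDER BY id").fetchall()
```

Each stage only depends on the output of the previous one, which is what
lets real pipelines swap sources or targets without rewriting the rest.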
1. Extraction
The extraction phase maps the data from different sources into a
unified format before processing. During extraction, ETL systems
typically take care of the following:
• Removing redundant (duplicate) or fragmented data
• Removing spam or unwanted data
• Reconciling records with source data
• Checking data types and key attributes
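The extraction checks above can be sketched as follows. This is only an
illustration under assumed names: the two sources, their field names, and
the unified fields customer_id and email are hypothetical.

```python
def to_unified(record, field_map):
    """Rename source-specific fields to the shared names in field_map."""
    return {target: record.get(source) for source, target in field_map.items()}

def extract(sources):
    """Map every source to the unified format, check types, drop duplicates."""
    seen, unified, rejected, duplicates = set(), [], [], 0
    for records, field_map in sources:
        for record in records:
            row = to_unified(record, field_map)
            # Check data types of the key attribute and the email field.
            if not isinstance(row["customer_id"], int) or not isinstance(row["email"], str):
                rejected.append(record)
                continue
            # Remove redundant (duplicate) records, identified by key.
            if row["customer_id"] in seen:
                duplicates += 1
                continue
            seen.add(row["customer_id"])
            unified.append(row)
    return unified, rejected, duplicates

# Two hypothetical sources that name the same attributes differently.
crm = [{"id": 1, "mail": "a@example.com"}, {"id": 2, "mail": "b@example.com"}]
shop = [
    {"cust": 2, "email_addr": "b@example.com"},
    {"cust": "x3", "email_addr": "c@example.com"},
]

unified, rejected, duplicates = extract([
    (crm, {"id": "customer_id", "mail": "email"}),
    (shop, {"cust": "customer_id", "email_addr": "email"}),
])

# Reconcile with the sources: every input record must be accounted for.
total_in = len(crm) + len(shop)
reconciled = len(unified) + len(rejected) + duplicates == total_in
```

Keeping rejected records instead of silently dropping them makes the
reconciliation step possible and leaves a trail for fixing the source.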
2. Transformation
This stage involves applying algorithms and modifying data according to
business-specific rules. Common operations in the transformation stage
are computation, concatenation, filtering, and string operations such
as currency, time, and data formatting. The stage also covers the
following:
• Data cleaning, such as replacing null values with ‘0’
• Threshold validation, such as checking that an age has no more than
two digits
• Data standardization according to the rules and lookup tables
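A small sketch of these transformation rules applied to a single record
follows. The record fields and the COUNTRY_LOOKUP table are hypothetical,
chosen only to show each rule once.

```python
# Hypothetical lookup table used to standardize country values.
COUNTRY_LOOKUP = {"usa": "US", "united states": "US", "u.k.": "GB", "uk": "GB"}

def transform(row):
    """Apply cleaning, threshold validation, and standardization to one record."""
    out = dict(row)
    # Data cleaning: replace a null amount with 0.
    if out.get("amount") is None:
        out["amount"] = 0
    # Threshold validation: an age may have at most two digits.
    age = out.get("age")
    if age is None or not 0 <= age <= 99:
        raise ValueError(f"age out of range: {age!r}")
    # Standardization: map free-text country names through the lookup table.
    key = out["country"].strip().lower()
    out["country"] = COUNTRY_LOOKUP.get(key, out["country"])
    # Concatenation: derive a full name from its parts.
    out["full_name"] = f'{out["first_name"]} {out["last_name"]}'
    return out

clean = transform({
    "first_name": "Ada",
    "last_name": "Lovelace",
    "age": 36,
    "amount": None,
    "country": " USA ",
})
```

Raising an error on a failed threshold check is one possible policy; a
pipeline could equally route such records to a reject table for review.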
3. Loading
Loading is the process of migrating structured data into the
warehouse. Usually, large volumes of data need to be loaded in a short
time. ETL applications play a crucial role in optimizing the load process
and provide efficient recovery mechanisms for loading failures.
A typical ETL process involves three types of loading functions:
• Initial load: populates the records in the data warehouse.
• Incremental load: applies changes (updates) periodically as
required.
• Full refresh: erases the old contents and reloads the warehouse with
fresh records.
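The three loading functions can be sketched with Python's built-in sqlite3
module standing in for the warehouse. The customers table and the function
names are assumptions for illustration. Each load runs inside a
"with conn:" block, which commits on success and rolls back if an
exception is raised, a simple form of recovery from a failed load.

```python
import sqlite3

def initial_load(conn, records):
    """Initial load: create the table and populate it for the first time."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers (id INTEGER PRIMARY KEY, name TEXT)"
    )
    conn.executemany("INSERT INTO customers (id, name) VALUES (?, ?)", records)

def incremental_load(conn, changes):
    """Incremental load: add new rows and overwrite rows whose key already exists."""
    conn.executemany(
        "INSERT OR REPLACE INTO customers (id, name) VALUES (?, ?)", changes
    )

def full_refresh(conn, records):
    """Full refresh: erase the old contents, then reload everything."""
    conn.execute("DELETE FROM customers")
    conn.executemany("INSERT INTO customers (id, name) VALUES (?, ?)", records)

conn = sqlite3.connect(":memory:")

with conn:  # commits on success, rolls back on error
    initial_load(conn, [(1, "Ann"), (2, "Bob")])

with conn:
    incremental_load(conn, [(2, "Robert"), (3, "Cy")])
after_incremental = conn.execute(
    "SELECT id, name FROM customers ORDER BY id"
).fetchall()

with conn:
    full_refresh(conn, [(10, "Dee")])
after_refresh = conn.execute(
    "SELECT id, name FROM customers ORDER BY id"
).fetchall()
```

Because the delete and the reload in the full refresh share one
transaction, a failure partway through leaves the previous contents intact.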
Why is ETL important?
Organizations today have both structured and unstructured data from
various sources.
By applying the process of extract, transform, and load (ETL), individual
raw datasets can be prepared in a format and structure that is more
consumable for analytics purposes, resulting in more meaningful
insights.
For example, online retailers can analyze data from points of sale to
forecast demand and manage inventory. Marketing teams can
integrate CRM data with customer feedback on social media to study
consumer behavior.
How does ETL benefit business intelligence?
Extract, transform, and load (ETL) improves business intelligence and
analytics by making the process more reliable, accurate, detailed, and
efficient.
Historical context
ETL gives deep historical context to the organization’s data. An
enterprise can combine legacy data with data from new platforms and
applications. You can view older datasets alongside more recent
information, which gives you a long-term view of data.
What is ELT?
Extract, load, and transform (ELT) is an extension of extract,
transform, and load (ETL) that reverses the order of the last two
operations. You can load data directly into the target system before
processing it. The intermediate staging area is not required because the
target data warehouse has data mapping capabilities within it.
ELT has become more popular with the adoption of cloud infrastructure,
which gives target databases the processing power they need for
transformations.
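To make the ELT ordering concrete, the sketch below loads raw values
unchanged and then transforms them with SQL inside the target. An
in-memory SQLite database stands in for a cloud warehouse, and the
raw_orders and orders tables are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Extract + Load: copy raw, untransformed values straight into the target.
raw_orders = [("1", " alice ", "10.50"), ("2", "BOB", ""), ("3", "cy", "7")]
conn.execute("CREATE TABLE raw_orders (id TEXT, customer TEXT, amount TEXT)")
conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?)", raw_orders)

# Transform: runs inside the target, using its own SQL engine.
conn.execute("""
    CREATE TABLE orders AS
    SELECT CAST(id AS INTEGER) AS id,
           LOWER(TRIM(customer)) AS customer,
           CAST(COALESCE(NULLIF(amount, ''), '0') AS REAL) AS amount
    FROM raw_orders
""")
conn.commit()

orders = conn.execute(
    "SELECT id, customer, amount FROM orders ORDER BY id"
).fetchall()
```

The raw table stays available after the transformation, so the same
loaded data can be re-transformed later under different rules.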
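ETL compared to ELT
The primary difference between ETL (Extract, Transform, Load) and ELT
(Extract, Load, Transform) is the order in which data is processed: ETL
transforms data before loading it into the target system, while ELT
loads it first and transforms it inside the target system.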