Posts

Showing posts from August, 2024

Data Analyst Vs Business Analyst Vs Data Engineer Vs BI Specialist

Image
  Data Science - Complete Track • Step 1 - Identify the business problem/value addition/question – ( Data Insights ) this has to be the starting point. • Step 2 - Data availability ( Data Governance ) - Have the structure of your data set defined – The real challenge starts here-                o Do we have the data?                o Do we have access to the required data? • Step 3 -Getting Data ( Data Mining / Data Pipeline ) – How to collect the data from different sources in the system • Step 4 - Data preparation (ETL / ELT) – Once you have data there will be lot of cleaning and preparation required, reduce/increase/combine/split the predictors, determine and eliminate outliers, populate missing values convert few categorical variables into numerical etc. • Step 5 –Exploratory Data Analysis (EDA) – In this step we do descriptive and diagnostic analysis of the existing data. We build multiple graphs that give us direction towards the next steps of predictive analytics. We migh

Data Analyst Vs Business Analyst Vs Data Engineer Vs BI Specialist

Image
  Data Science - Complete Track • Step 1 - Identify the business problem/value addition/question – ( Data Insights ) this has to be the starting point. • Step 2 - Data availability ( Data Governance ) - Have the structure of your data set defined – The real challenge starts here-                o Do we have the data?                o Do we have access to the required data? • Step 3 -Getting Data ( Data Mining / Data Pipeline ) – How to collect the data from different sources in the system • Step 4 - Data preparation (ETL / ELT) – Once you have data there will be lot of cleaning and preparation required, reduce/increase/combine/split the predictors, determine and eliminate outliers, populate missing values convert few categorical variables into numerical etc. • Step 5 –Exploratory Data Analysis (EDA) – In this step we do descriptive and diagnostic analysis of the existing data. We build multiple graphs that give us direction towards the next steps of predictive analytics. We migh