Data Analyst
Data Analyst
Data Analyst
Record eating food and how it relates to energy level on your body.
Think about where and how do you use data to make decisions in your life.
for example. What did you discover? What would you change?
Your analysis helps you find easier ways of doing things, identify patterns to save you time, and discover
new perspectives that can completely change the way you experience things.
1. Improve processes
2. Identify opportunities, themes, trends
3. Serve customers
4. Launch new products
5. Make thoughtful decisions
Capture wave of data, make sense of it and draw conclusions and make predictions
Turn data into insights. Those insights are shared with others, decisions are made and
businesses take action.
Data may lead them to a new product or unique service or find a way to deliver an incredible
customer experience.
Data Analysis
The collection, transformation, and organization of data in order to draw conclusions; make predictions
and drive informed decision making to drive positive change.
To put it another way, it is the art of extracting knowledge and insights from data.
Data Analyst
A person who is doing mentioned these.
To put it another way, A person who is an explorer, a detective and an artist all rolled into one.
Data Analytics
Data analytics in the simplest terms is the science of data. It's a very broad concept or umbrella term that
encompasses everything from the job of managing and using data to the tools and methods that data
workers use each and every day.
Data Science
A field of study that uses raw data to create new ways of modeling and understanding the unknown.
Data scientists create new questions using data, while analysts find answers to existing questions by
creating insights from data sources.
ASK: it is important to ask effective questions to management people to be able to define the
problems and the goals of the analysis
PREPARE: it is important to identify what data should be acquired. Develop timeline, questions,
data access rules, and potential data-related issues are discussed.
PROCESS: Data is collected and stored. The data cleaning process is involved. Respect and
protect people and their data.
ANALYZE: it is important to document exactly what they found in the analysis, no matter what the
result.
ACT: A data analytics team might validate insights, finalize a strategy, and put a plan into action.
Data Ecosystem
Data ecosystems are made up of various elements that interact with one another in order to produce,
manage, store, organize, analyze, and share data. These elements include hardware and software tools,
cloud, and the people who use them. It is like the jungle in a tropical rainforest.
For example, your retail store's database, which is an ecosystem filled with customer names,
addresses, previous purchases, and customer reviews. As a data analyst, you could use this information
to predict what these customers will buy in the future, and make sure the store has the products and
stock when they're needed.
Data analysts can use geological patterns in weather movement data to help farmers predict crop yields.
Institution of Oceanography, coral reefs all over the world are monitored digitally, so they can see how
organisms change over time, track their growth, and measure any increases or declines in individual
colonies.
So when you think about data, data analysis, and the data ecosystem, it's important to understand that
all of these things fit under the data analytics umbrella.
Data alone will never be as powerful as data combined with human experience, observation, and
intuition they have the ability to look at the results of data analysis and identify any inconsistencies,
make sense of gray areas, and eventually validate choices being made.
Detectives and data analysts have a lot in common. Both depend on facts and clues to make decisions.
Both collect and look at the evidence. Both talk to people who know part of the story. And both might
even follow some footprints to see where they lead. Whether you’re a detective or a data analyst, your
job is all about following steps to collect and understand facts.
When it comes to data driven decision using gut instinct can be problematic. If you ignore data by
preferring to make decisions based on your own experience, your decisions may be biased
the process of going from data to decision. Data goes through several phases as it gets created,
consumed, tested, processed, and reused
Key takeaway
From our journey to the pyramids and data in ancient Egypt to now, the way we analyze data has
evolved (and continues to do so). The data analysis process is like real life architecture, there are
different ways to do things but the same core ideas still appear in each model of the process. Whether
you use the structure of this Google Data Analytics Certificate or one of the many other iterations you
have learned about, we are here to help guide you as you continue on your data journey.
Week 2
Analytical skills are qualities and characteristics associated with solving problems using facts.
Context
The condition or information in which something exists or happens. The background information or
previous events that help to give meaning to something.
A technical mindset = The ability to break things into smaller steps or pieces and work with them in an
orderly and logical way
Data Strategy
The management of the people, processes and tools used in data analysis
Analytical thinking
Identifying and defining a problem and then solving it by using data in an organized, step by step
manner.
Why it is important to think in different way? Because solutions are almost never right in front of you.
Think critically to ask right question. Think creatively to get new and unexpected answers.
The way data analysts think and ask questions plays a big part in how businesses make decisions.
What is root cause of the problem? Ask why in five times to reveal root cause. (5 whys)
e.g. You want to make blueberry pie but could not buy any blueberry at store. What is root cause of
problem?
Gap Analysis = A method for examining and evaluating how a process works currently in order to get
where you want to be in the future. e.g. improve a product
What information or procedure might be missing from a process, so you can identify ways to make
better decisions and strategies moving forward.
Plan - Decide what kind of data is needed, how it will be managed, and who will be responsible
for it.
Capture/Acquire - Collect or bring in data from a variety of different sources.
Manage/Maintain – Data cleansing
Analyze/ Evaluate
Archive/Preserve - Keep relevant data stored for long-term and future reference.
Destroy/ Purge - Remove data from storage and delete any shared copies of the data.
Key Takeaway
Individual stages in the data life cycle will vary from company to company or by industry or sector.
Historical data is important to both the U.S. Fish and Wildlife Service and the USGS, so their data life
cycle focuses on archiving and backing up data. Harvard's interests are in research and teaching, so its
data life cycle includes visualization and interpretation and they don’t have a stage for purging or
destroying data. In contrast, the data life cycle for finance clearly identifies archive and purge stages. To
sum it up, although data life cycles vary, one data management principle is universal.
your data and results are objective and unbiased. Any decisions made from your analysis should always
be based on facts and be fair and impartial.
Process = Cleaning Data or Removing Errors or Transforming to useful format or combing two dataset
into one or Fix Typo/inconsistency/inaccurate data
Analysis = Use tools to draw conclusion, make preditions, and drive informed decision
Excel Function = think of a function as a simpler, more efficient way of doing something that would
normally take a lot of time
Tableau's simple drag-and-drop feature lets users create interactive graphs in dashboards and
worksheets
Looker communicates directly with a database, allowing you to connect your data right to the visual
tool you choose
What is the relationship between the Ask phase of the data analysis process and the
Plan phase of the data life cycle? How are they similar? How are they different?
While the data analysis process will drive your projects and help you reach your business goals, you
must understand the life cycle of your data in order to use that process. To analyze your data well, you
need to have a thorough understanding of it. Similarly, you can collect all the data you want, but the
data is only useful to you if you have a plan for analyzing it.
The Plan and Ask phases both involve planning and asking questions, but they tackle different subjects.
The Ask phase in the data analysis process focuses on big-picture strategic thinking about business goals.
However, the Plan phase focuses on the fundamentals of the project, such as what data you have access
to, what data you need, and where you’re going to get it.
Similarities:
Both phases involve identifying the purpose and objectives of the data
Differences:
Focus: The "Ask" phase is focused on determining the questions that need to be answered, while the
"Plan" phase is focused on identifying the data requirements for a specific project.
Excel
An observation includes all of the attributes for something contained in a row of a data table
You might have noticed a common theme across every example. They all have issues to explore,
questions to answer, or problems to solve
Issue
A question is designed to discover information and a problem is an obstacle or complication that needs
to be worked out.
Data-driven decision-making is when facts that have been discovered through data analysis
are used to guide business strategy.
The simplest way to think about decision-making is that it's a choice between consequences, good, bad,
or a combination of both.
What if they had made this decision without data? Let's say they just relied on observation and memory to
track the weather and make staffing schedules. Well, we already know that wouldn't have solve their
problem long-term.
When we make decisions based on just observation and gut feelings, we're only seeing part of the picture.
Data helps us see the whole thing. With data, we have a complete picture of the problem and its causes,
which lets us find new and surprising solutions we never would've been able to see before.
Fairness means ensuring that your analysis doesn't create or reinforce bias. In other words, as a data
analyst, you want to help create systems that are fair and inclusive to everyone. E.g. Male Employee
success why not female employee?
Every industry has specific data needs that have to be addressed differently by their data analysts
The same revenue data can be used in three different ways by data analysts in three different industries,
financial services, Telecom, and tech.
Business analyst — analyzes data to help businesses improve processes, products, or services
Data analytics consultant — analyzes the systems and models for using data
Data engineer — prepares and integrates data from different sources for analytical use
Data scientist — uses expert skills in technology and social science to find trends through data analysis
Data specialist — organizes or converts data for use in databases or software systems
Operations analyst — analyzes data to assess the performance of business operations and workflows
Structured thinking is the process of recognizing the current problem or situation, organizing available
information, revealing gaps and opportunities, and identifying options. In this process, you address a
vague, complex problem by breaking it down into smaller steps, and then those steps lead you to a
logical solution.
Step 1: Ask
It’s impossible to solve a problem if you don’t know what it is. These are some things to
consider:
Step 2: Prepare
You will decide what data you need to collect in order to answer your questions and how to
organize it so that it is useful. You might use your business task to decide:
Step 3: Process
Clean data is the best data and you will need to clean up your data to get rid of any possible
errors, inaccuracies, or inconsistencies. This might mean:
Step 4: Analyze
You will want to think analytically about your data. At this stage, you might sort and format
your data to make it easier to:
Perform calculations
Combine data from multiple sources
Create tables with your results
Step 5: Share
Everyone shares their results differently so be sure to summarize your results with clear and
enticing visuals of your analysis using data via tools like graphs or dashboards. This is your
chance to show the stakeholders you have solved their problems and how you got there.
Sharing will certainly help your team:
Step 6: Act
Now it’s time to act on your data. You will take everything you have learned from your data
analysis and put it to use. This could mean providing your stakeholders with
recommendations based on your findings so they can make data-driven decisions.
1. How can I use the feedback I received during the share phase (step 5) to actually meet
the stakeholder’s needs and expectations?
These six steps can help you to break the data analysis process into smaller, manageable
parts, which is called structured thinking. This process involves four basic activities:
Data-inspired decision-making explores different data sources to find out what they have in common.
Data is straightforward, facts collected together, values that describe something.
Individual data points become more useful when they're collected and structured, but they're still
somewhat meaningless by themselves. We need to interpret data to turn it into information
When we consume information, understand it, and apply it, that's when data is most useful.
As a data analyst, your own skills and knowledge will be the most important part of any
analysis project. It is important for you to keep a data-driven mindset, ask lots of questions,
experiment with many different possibilities, and use both logic and creativity along the way.
Making predictions -Using data to make informed decisions about how things may be
in the future.
Categorizing things - Grouping data based on common features.
Spotting something unusual - Identifying data that is different from the norm.
Identifying themes - Recognizing broader concepts and trends from categorized data.
Discovering connections - Identifying similar challenges across different entities—and
using data and insights to find common solutions.
Finding patterns -Using historical data about what happened in the past to
understand how likely it is to happen again.
Quantitative data is all about the specific and objective measures of numerical facts.
Qualitative data is great for helping us answer why questions. Add context to a problem.
A metric is a single, quantifiable type of data that can be used for measurement. Think of it this way. Data
starts as a collection of raw facts, until we organize them into individual metrics that represent a single
type of data.
Big Data: volume, variety, and velocity, Veracity (reliability)
Albert Einstein once said," If I were given one hour to save the planet, I would spend 59
minutes defining the problem and one minute resolving it. This is important because if you
define the problem clearly from the start, it'll be easier to solve, which saves a lot of time,
money, and resources.
Problem Domain
The specific area of analysis that encompasses every activity affecting or affected by the problem.
Structured thinking is the process of recognizing the current problem or situation, organizing available
information, revealing gaps and opportunities, and identifying the options. In other words, it's a way of
being super prepared.
In the business world, it's common for teams to spend hours of valuable time trying to solve an important
problem, only to end up back where they started. Not only is the initial problem not resolved,
but they've spent hours not resolving it. This outcome negatively affects you, your team, and the
organization as a whole. But it can usually be prevented. Many times, the situation is a result of not fully
understanding the issue.
The starting place for structured thinking is the problem domain, which you might have remembered from
earlier. Once you know the specific area of analysis, you can set your base and lay out
all your requirements and hypotheses before you start investigating.
A scope of work or SOW is an agreed- upon outline of the work you're going to perform on a project.
Deliverables: What work is being done, and what things are being created as a result of this project?
When the project is complete, what are you expected to deliver to the stakeholders? Be specific
here. Will you collect data for this project? How much, or for how long?
Milestones: This is closely related to your timeline. What are the major milestones for progress in
your project? How do you know when a given part of the project is considered complete?
Timeline: Your timeline will be closely tied to the milestones you create for your project. The timeline
is a way of mapping expectations for how long each step of the process should take. The timeline
should be specific enough to help all involved decide if a project is on schedule. When will the
deliverables be completed? How long do you expect the project will take to complete? If all goes as
planned, how long do you expect each component of the project will take? When can we expect to
reach each milestone?
Reports: Good SOWs also set boundaries for how and when you’ll give status updates to
stakeholders. How will you communicate progress with stakeholders and sponsors, and how often?
Will progress be reported weekly? Monthly? When milestones are completed? What information will
status reports contain?
ASK
What you will learn:
How data analysts solve problems with data
The use of analytics for making data-driven decisions
Spreadsheet formulas and functions
Dashboard basics, including an introduction to Tableau
Data reporting basics
Skill sets you will build:
Asking SMART and effective questions
Structuring how you think
Summarizing data
Putting things into context
Managing team and stakeholder expectations
Problem-solving and conflict-resolution
PREPARE
What you will learn:
How data is generated
Features of different data types, fields, and values
Database structures
The function of metadata in data analytics
Structured Query Language (SQL) functions
Skill sets you will build:
Ensuring ethical data analysis practices
Addressing issues of bias and credibility
Accessing databases and importing data
Writing simple queries
Organizing and protecting data
Connecting with the data community (optional)
PROCESS
SHARE
Like all good tales, your data story will be filled with characters, questions, challenges, conflict, and
hopefully a resolution. The trick is to avoid the conflict, overcome the challenges and answer the
questions
To put it simply, metadata is data about data. In essence, metadata tells the who, what, when, where,
which, how, and why of data.
When there is clean data and good alignment, you can get accurate insights and make
conclusions the data supports.
If there is good alignment but the data needs to be cleaned, clean the data before you
perform your analysis.
If the data only partially aligns with an objective, think about how you could modify
the objective, or use data constraints to make sure that the subset of data better
aligns with the business objective.
if a test is statistically significant, it means the results of the test are real and not an error
caused by random chance.
"Statistical power can be calculated and reported for a completed experiment to comment
on the confidence one might have in the conclusions drawn from the results of the study. It
can also be used as a tool to estimate the number of observations or sample size required in
order to detect an effect in an experiment."
In data analytics, compatibility describes how well two or more datasets are able to work
together.
Documentation which is the process of tracking changes, additions, deletions and errors involved in your
data cleaning effort.
Resume
Focus on your accomplishments first, and explain them using the formula “Accomplished X, as
measured by Y, by doing Z.”
Whenever possible, use numbers to explain your accomplishments. For example, “Increased
manufacturing productivity by 15% by improving shop floor employee engagement,” is better than
“Increased manufacturing productivity.”
Phrase your work experience and duties using Problem-Action-Result (PAR) statements.
For example, instead of saying “was responsible for two blogs a month,” phrase it as
“earned little-known website over 2,000 new clicks through strategic blogging.”
remember: the more you emphasize different things, the less that emphasis counts. The more you
emphasize one single thing, the more that counts.
A line chart is used to track changes over short and long periods of time
Column charts use size to contrast and compare two or more values, using height or lengths to
represent the specific values.
When smaller changes exist, line charts are better to use than bar graphs
The pie chart is a circular graph that is divided into segments representing proportions corresponding
to the quantity it represents, especially when dealing with parts of a whole.
Spotlighting
A data analyst scans the data to quickly identify the most important insights.
Hypothesis
Your initial hypothesis is a theory you're trying to prove or disprove with data.
A portfolio is a collection of case studies that you can share with potential employers.
Data analytics is a field of study that falls under the larger discipline of data science. The aim
of data analytics is to create methods to capture, process, and organize data to uncover
actionable insights for current problems. Analysts focus on processing the information stored
in existing datasets and establishing the best way to present this data. Data analysts rely on
statistics and data modeling to solve problems and offer recommendations that can lead to
immediate improvements.
Effective Communication
Purpose
Receiver/Audience
Sender