Data Science Resource Package!

Download as pdf or txt
Download as pdf or txt
You are on page 1of 14

Data Science Resource Package

1. Data Science Learning Syllabus

2. “Understanding Data Science Problems” template

3. General Mindset Book List

4. Data Science CV Templates

5. Links to Facebook and Telegram communities for Data Scientists

6. BONUS: Exclusive link to set up a free 1-1 career consultation

Brought to you by bitgrit Inc.


Learn Data Science

Introduction
The ultimate, step-by-step guide to learn data science from scratch and prepare for your
future job.

It is organised in a step-by-step way. Do not skip ahead. Do all the exercises that are
demanded. Whenever possible, write with pen and paper instead of on the computer only.
This will make you remember more of the material studied.

In general, prefer official documentation, if it is well-written, over external tutorials. Be very


selective about the educational materials you consume. If you learn the wrong stuff, it may be
worse than if you did not learn anything at all. It is harder to re-learn the correct
behaviour/skill/knowledge. Spend lots of time selecting what you study.

Skim through this repository and its content before deciding to go ahead with it. Finish what
you have started.

Do not expect that you will get a job just because you finished this course. Be realistic but also
optimistic.

Data Science is a multi-disciplinary field. Instead of going through a standardised curriculum,


such as Mathematics or Engineering, you have to be initially analytical in your approach to
learn separate things really well and then synthesise them together.

Always balance your study with practice. Make it at least 50% theoretical study and 50%
practice.

Examples of practice:

Web app that makes data science related visualisations


Open source project that has numerous reputable and active contributors
Internship or work related project that you can display publicly

Make sure that when you work for employers to discuss the possibility to be able to publish
your work as much as possible to the wider public. Not only does it serve the public more, you
will also be able to benefit from others seeing your work that you did and for which you get
paid. Of course, use common sense to avoid giving out confidential information.

A word of caution: study alone will get you nowhere. Data Science is a skill that you have to
practice everyday, much like playing the piano. If you do not practice Data Science, you lose
your skills gradually. That is also one reason why experienced and professional data scientists
are sought after. It is hard to maintain such a multi-disciplinary skill over a prolonged period
of time.

1 of 3
Try to make your study time more about recalling rather than reviewing. Test the key concepts
and contents of Data Science using flashcard such as Anki (https://apps.ankiweb.net) . Test more
than you learn initially. Go through the concepts, lectures, readings, books once and make
flashcards to test yourself over the next few days. No need to review. Recall.

When you learn coding, do not copy and paste. Do not. Instead, you read the code and re-write
them. Yes, it takes more time but you will remember more. Ultimately, you are looking for
being able to write simple codes out of memory without reference. You can test this by
making yourself offline and write a simple app from scratch. If you have to copy paste code
often, it is not a good sign. Programming languages are languages too. If you are fluent in a
language, you do not need to refer to dictionaries all the time.

Syllabus
1. Introduction to Data Science (https://cognitiveclass.ai/courses/data-science-101)

2. Data Science Tools (https://cognitiveclass.ai/courses/data-science-hands-open-source-tools-2)

3. Data Science Methodology (https://cognitiveclass.ai/courses/data-science-methodology-2)

4. Statistics 101 (https://cognitiveclass.ai/courses/statistics-101)

5. Predictive Modeling Fundamentals I (https://cognitiveclass.ai/courses/predictive-modeling-


fundamentals)

6. Python for Data Science (https://cognitiveclass.ai/courses/python-for-data-science)

7. Data Analysis with Python (https://cognitiveclass.ai/courses/data-analysis-python)

8. Data Visualization with Python (https://cognitiveclass.ai/courses/data-visualization-with-python)

9. Machine Learning with Python (https://cognitiveclass.ai/courses/machine-learning-with-python)

10. Deep Learning Fundamentals (https://cognitiveclass.ai/courses/introduction-deep-learning)

11. Deep Learning with TensorFlow (https://cognitiveclass.ai/courses/deep-learning-tensorflow)

Reference and Resources


bitgrit Data Science Community Resource Sheet
(https://docs.google.com/spreadsheets/d/1fmedmbXKLtXsEtYUiGlFDbRMnQWakr7CsZuS1V_nEPA)

bitgrit ebook: How to Succeed in Data Science


(https://docs.google.com/document/d/1fvxDOdCjPx0wS4aqSOME3NyATJGN7sASLeEyygIvcJA)

Curated AI/ML/DL resources, contributed for the community, by the community


(https://github.com/neomatrix369/awesome-ai-ml-dl/)

Curated Data relatd resources, contributed for the community, by the community
(https://github.com/neomatrix369/awesome-ai-ml-dl/blob/master/README-details.md#data)

2 of 3
Feedback and ideas
Google doc with draft list of feedback and idea - you can add your comments here, it is
READONLY
(https://docs.google.com/document/d/1GGMbKbxTfiBgGjYtyHaVOr1B0gTVvLjwsDz0Ub7VU5g/edit?
usp=sharing)

How to Contribute
You are welcome to make a PR to this repo. As you contribute more, you can also apply to
become an official contributor. Please go to our Facebook Group
(https://www.facebook.com/groups/datasciencebitgrit) and tag "John Wu" and ask for the permission to
become an official contributor. You can also reach me that way to get tech or usage support. I
am available during working hours for direct chats on Facebook Messenger.

3 of 3
Data Science is problem solving. But how do we
understand the problem?

First, let’s get some definitions out of the way.

Problem​: An undesirable situation attached to a certain person(s).


Solution​: A desirable alternative to an undesirable situation.

Data Science​: A multidisciplinary field of problem solving using data, involving business,
statistics, programming, and domain perspectives.

Cause​: A potential root of a problem.


Consequence​: A potential negative impact of a problem.

Return​: The potential positive impacts of a solution (in addressing the cause).
Opportunity Cost​: The potential cost of using one solution instead of another solution (different
consequences are addressed).

Interview​: Asking questions that are not loaded or leading to better understand a problem.
Contextual Inquiry​: Observing problems naturally, without bias or influence, to better
understand the context.

1. Conduct an Interview and a Contextual Inquiry to understand the problem.

Ask​: What is the problem?

Ask​: What are the known consequences of this problem? What negative impacts does it
create?

Ask​: Who are the stakeholders in this problem? (This lets you know who is attached to the
problem.)
Ask​: What are the potential causes of this problem?

Ask​: What attempts at a solution have been made?

Ask and observe​: What data do you have available that is related to the problem?

Ask​: How much of this data can be shared?

Observe​: How might the business look differently if the problem were solved?

2. List all the ways you can address the causes of the problem.

2a. How could you use data science, given the availability of data, to address each cause?
2b. How could you mitigate each consequence?

2c. How could you appease the stakeholders in other ways?

2d. How could you use existing attempts?

3. List the ​return ​and ​opportunity cost​ of each potential solution.

3a. What is the potential ​return ​of each answer in 2a-2d?

3b. What is the potential ​opportunity cost​ of each answer in 2a-2d?

4. Pick the solution with the greatest ratio of ​return : opportunity cost
5. Build the data science solution ​by collaborating with a group of data scientists​!

6. BONUS: Send this document to back to me (Frederik Bussler


linkedin.com/in/frederikbussler​) and let’s work on a solution!
Master Book List
Frederik Bussler

I don't agree with everything in all these books, so take the lessons and suggestions therein with
an open mind, and do not accept anything without challenge nor discount anything without
thought.

That being said, here’s a list of books I can recommend to you:

The Power of Now: Eckhart Tolle

The theme I got out of this book is this: When the past happened, you perceived it as "now," the
current moment is "now," and when the future comes, you'll perceive it as "now." You cannot be
in any moment except for right now. If you're stuck in worrying about the future or holding onto
the past, you'll build up anxiety and depression. The more you
focus only on the very moment, the better quality of life you'll have. It’s a classic eye-opener into
what it means to be “present.”

#AskGaryVee

Gary Vaynerchuck is a beast when it comes to building brands and businesses. If you ever want
to be a successful intrepreneur or entrepreneur, this is a must-read.

The Flinch

Most people live their lives in fear. This is a super quick read to help you blast through your own
emotions.

The Willpower Instinct

This is a great scientific read on maintaining extremely high levels of willpower. There’s no
success without hard work, and no hard work without willpower.

The 10X Rule


Ultimately a very simple premise, but something that needs massive repetition for you to accept
it and live it. Want better results? Put in more effort. Want 10X better results? Put in 10X the
effort.

The Alchemist

This fiction story is one of the most important books I’ve ever read. It reminds you that you’re on
this Earth because you have a purpose. Don’t lose sight of your path.

Extreme Ownership

This book was written by a Navy Seal who lead a team that was involved in a disastrous
operation resulting in fratricide. He teaches you to accept complete responsibility for everything.
If you live with any amount of awareness, you’ll see that everyone’s natural tendency is to skirt
responsibility. Carry the burden of responsibility and you’ll put yourself leagues ahead of your
competition.

The Meditations

A classic, short read on stoicism. It teaches you to face life’s problems as another part of the
journey, and to adopt a value system where you’re not drawn into petty problems.

Man’s Search for Meaning

Viktor Frankl was a survivor of multiple concentration camps during the Holocaust. This reading
gives massive perspective.

Homo Deus

This book fits the pieces of history together, massively expanding your understanding of the
world and our potential future. A great read to add more perspectives to your world-view.

Elon Musk

You’ve heard of Elon Musk and his relentless work ethic and success. He created SpaceX,
Solar City, Tesla, the Boring Company, and the original PayPal (x.com). How did he do it? This
book will give you insight into how Elon Musk became who he is today.
Other great reads:

https://ryanholiday.net/reading-list/

https://medium.com/the-mission/if-you-only-read-a-few-books-in-2018-read-these-89d03fc149c6
Frederik Bussler
Business Executive

Born in Germany, raised in


New York & currently living
in Tokyo, Japan.

ANALY
Worked as an Investment
Analyst at Maven 11 Capital,
Business Analyst at Hewlett
Packard Enterprise, Research

TICAL
Analyst at Rare Genomics
Institute, and Data Analyst at
Champlain College and
Salvation Farms.

ENTREP Founded Smart Contract


Auditing, co-founded EinFarm,
worked as a startup Chief Data

RENEUR Officer at HealthDex and


currently CEO at bitgrit.

Speaker at World Data Science

COMMU
Forum, Digital Media Asia,
Enterprise Ethereum Alliance,
Nikkei, IBM, and Slush Tokyo.

NICATOR
Fathered 3,000+ data science
community. Viral content
creator. Reaching 3 million+
audience. Closing deals such as
IBM partnership and clients.
Frederik Bussler
(+81) 80 5926 0071 | [email protected]
/in/FrederikBussler
Objective: Democratize AI

Professional Skills
Programming: Python, Pandas, NumPy, Matplotlib, Sklearn, SQL. MATLAB. Analysis: Excel, Tableau, Power BI,
SPSS. Communication: Public Speaking, visualization, Jupyter, Slack, Trello. Blockchain: Solidity, Smart Contract
Auditing, White Paper writing.

Work Experience
bitgrit Tokyo, Japan
CEO Dec 2018 – present
• Managing needs of users, clients, departments heads and the board of directors to ensure business success.
• Representing the company in engagements such as talks with the World Data Science Forum, NewsTV, Digital
Media Asia, Enterprise Ethereum Alliance, Nikkei, IBM, and Slush Tokyo.
• Growing a data science community from 30 to 3000+ and reaching 3 million+ audience with value materials such
as content, an educational AI bot, a Global Ambassador Program, and white paper.
• Closing partners such as IBM, SingularityNET, DAIA, datasaas, and Radiant Japan, and clients such as MacPaw.
Smart Contract Auditing New York
Founder and CEO Jun 2018 – Nov 2018
• Acquired clients like VibraVid through White Paper, smartcontractauditing.io, pitch deck, and joint ventures.
• Provided trust and responsibility to the blockchain space through proprietary audit framework combining manual
and automated analyses.
HealthDex Varied
Chief Data Officer May 2018 – Oct 2018
• Led own recruits to synergize AI and Blockchain to create a 2-way health data exchange MVP that democratizes
the health data sector.
• Authored White Paper and led PoC build in Federated Learning, Homomorphic Encryption, and Smart Contracts,
in compliance with GDPR.
Hewlett Packard Enterprise Colorado Springs, CO
Business Analyst May 2018 – Aug 2018
• Drove continuous improvements to GTM efficiency by submitting recommendations based on data mining.
• Led targeted initiatives to gain insights on content utilization and pinpoint why we lose and why we win.
• Increased global attach rates to maximize revenue with automated carrot-and-stick measures using Power BI,
SFDC, and Python.
Rare Genomics Institute Remote
Blockchain and ML Research Analyst May 2018 – Aug 2018
• Created partner outreach and development strategies to secure corporate support in genomics advancements.
• Identified prospective industry partners with aligning interests, specializing in ML and blockchain.
Champlain College Institutional Research Burlington, VT
Assistant Data Analyst Aug 2017 – May 2018
• Collaborated with Institutional Research to generate interactive, insight-driven analytics reports on student data.
• Visualized, cleaned, analyzed, and presented datasets using Tableau, Excel, Python, Pandas, NumPy, and Jupyter.
• Developed statistical skills by implementing testing, such as finding significant features in retention through chi-
square test for independence.
Links to exclusive Facebook and Telegram data science communities:

BONUS: Exclusive link to set up a free 1-1 career consultation. Please

do not share this outside of the community!

Brought to you by bitgrit Inc.

You might also like