Ebook, 830 pages, 4 hours

Mastering Scientific Computing with R

By Paul Gerrard and Radia M. Johnson

Rating: 3 out of 5 stars (1 rating)

About this ebook

About This Book

Perform publication-quality science using R
Use some of R’s most powerful and least-known features to solve complex scientific computing problems
Learn how to create visual illustrations of scientific results

Who This Book Is For

If you want to learn how to answer scientific questions quantitatively and for practical purposes, using the R language and the open source R tool ecosystem, this book is for you. It is best suited to scientists who understand scientific concepts, know a little R, and want to start applying R to empirical scientific questions. Some R exposure is helpful, but not compulsory.

Language: English

Publisher: Packt Publishing

Release date: Jan 31, 2015

ISBN: 9781783555260

Author

Paul Gerrard

Book preview

Mastering Scientific Computing with R - Paul Gerrard

Mastering Scientific Computing with R

Credits

About the Authors

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Downloading the color images of this book

Errata

Piracy

Questions

1. Programming with R

Data structures in R

Atomic vectors

Operations on vectors

Lists

Attributes

Factors

Multidimensional arrays

Matrices

Data frames

Loading data into R

Saving data frames

Basic plots and the ggplot2 package

Flow control

The for() loop

The apply() function

The if() statement

The while() loop

The repeat{} and break statement

Functions

General programming and debugging tools

Summary

2. Statistical Methods with R

Descriptive statistics

Data variability

Confidence intervals

Probability distributions

Fitting distributions

Higher order moments of a distribution

Other statistical tests to fit distributions

The propagate package

Hypothesis testing

Proportion tests

Two sample hypothesis tests

Unit root tests

Summary

3. Linear Models

An overview of statistical modeling

Model formulas

Explanatory variables interactions

Error terms

The intercept as parameter 1

Updating a model

Linear regression

Plotting a slope

Analysis of variance

Generalized linear models

Generalized additive models

Linear discriminant analysis

Principal component analysis

Clustering

Summary

4. Nonlinear Methods

Nonparametric and parametric models

The adsorption and body measures datasets

Theory-driven nonlinear regression

Visually exploring nonlinear relationships

Extending the linear framework

Polynomial regression

Performing a polynomial regression in R

Spline regression

Nonparametric nonlinear methods

Kernel regression

Kernel weighted local polynomial fitting

Optimal bandwidth selection

A practical scientific application of kernel regression

Locally weighted polynomial regression and the loess function

Nonparametric methods with the np package

Nonlinear quantile regression

Summary

5. Linear Algebra

Matrices and linear algebra

Matrices in R

Vectors in R

Matrix notation

The physical functioning dataset

Basic matrix operations

Element-wise matrix operations

Matrix subtraction

Matrix addition

Matrix sweep

Basic matrixwise operations

Transposition

Matrix multiplication

Multiplying square matrices for social networks

Outer products

Using sparse matrices in matrix multiplication

Matrix inversion

Solving systems of linear equations

Determinants

Triangular matrices

Matrix decomposition

QR decomposition

Eigenvalue decomposition

Lower upper decomposition

Cholesky decomposition

Singular value decomposition

Applications

Rasch analysis using linear algebra and a paired comparisons matrix

Calculating Cronbach's alpha

Image compression using discrete cosine transform

Importing an image into R

The compression technique

Creating the transformation and quantization matrices

Putting the matrices together for image compression

DCT in R

Summary

6. Principal Component Analysis and the Common Factor Model

A primer on correlation and covariance structures

Datasets used in this chapter

Principal component analysis and total variance

Understanding the basics of PCA

How does PCA relate to SVD?

Scaled versus unscaled PCA

PCA for dimension reduction

PCA to summarize wine properties

Choosing the number of principal components to retain

Formative constructs using PCA

Exploratory factor analysis and reflective constructs

Familiarizing yourself with the basic terms

Matrices of interest

Expressing factor analysis in a matrix model

Basic EFA and concepts of covariance algebra

Concepts of EFA estimation

The centroid method

Multiple factors

Direct factor extraction by principal axis factoring

Performing principal axis factoring in R

Other factor extraction methods

Factor rotation

Orthogonal factor rotation methods

Quartimax rotation

Varimax rotation

Oblique rotations

Oblimin rotation

Promax rotation

Factor rotation in R

Advanced EFA with the psych package

Summary

7. Structural Equation Modeling and Confirmatory Factor Analysis

Datasets

Political democracy

Physical functioning dataset

Holzinger-Swineford 1939 dataset

The basic ideas of SEM

Components of an SEM model

Path diagram

Matrix representation of SEM

The reticular action model (RAM)

An example of SEM specification

An example in R

SEM model fitting and estimation methods

Assessing SEM model fit

Using OpenMx and matrix specification of an SEM

Summarizing the OpenMx approach

Explaining an entire example

Specifying the model matrices

Fitting the model

Fitting SEM models using lavaan

The lavaan syntax

Comparing OpenMx to lavaan

Explaining an example in lavaan

Explaining an example in OpenMx

Summary

8. Simulations

Basic sample simulations in R

Pseudorandom numbers

The runif() function

Bernoulli random variables

Binomial random variables

Poisson random variables

Exponential random variables

Monte Carlo simulations

Central limit theorem

Using the mc2d package

One-dimensional Monte Carlo simulation

Two-dimensional Monte Carlo simulation

Additional mc2d functions

The mcprobtree() function

The cornode() function

The mcmodel() function

The evalmcmod() function

Data visualization

Multivariate nodes

Monte Carlo integration

Multiple integration

Other density functions

Rejection sampling

Importance sampling

Simulating physical systems

Summary

9. Optimization

One-dimensional optimization

The golden section search method

The optimize() function

The Newton-Raphson method

The Nelder-Mead simplex method

More optim() features

Linear programming

Integer-restricted optimization

Unrestricted variables

Quadratic programming

General non-linear optimization

Other optimization packages

Summary

10. Advanced Data Management

Cleaning datasets in R

String processing and pattern matching

Regular expressions

Floating point operations and numerical data types

Memory management in R

Basic R memory commands

Handling R objects in memory

Missing data

Computational aspects of missing data in R

Statistical considerations of missing data

Deletion methods

Listwise deletion or complete case analysis

Pairwise deletion

Visualizing missing data

An overview of multiple imputation

Imputation basic principles

Approaches to imputation

The Amelia package

Getting estimates from multiply imputed datasets

Extracting the mean

Extracting the standard error of the mean

The mice package

Imputation functions in mice

Summary

Index

Mastering Scientific Computing with R

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: January 2015

Production reference: 1270115

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-78355-525-3

www.packtpub.com

Cover image by Jason Dupuis Mayer (<[email protected]>)

Credits

Authors

Paul Gerrard

Radia M. Johnson

Reviewers

Laurent Drouet

Ratanlal Mahanta

Mzabalazo Z. Ngwenya

Donato Teutonico

Commissioning Editor

Kartikey Pandey

Acquisition Editor

Greg Wild

Content Development Editor

Akshay Nair

Technical Editors

Rosmy George

Ankita Thakur

Copy Editors

Shivangi Chaturvedi

Pranjali Chury

Puja Lalwani

Adithi Shetty

Project Coordinator

Mary Alex

Proofreaders

Simran Bhogal

Martin Diver

Ameesha Green

Paul Hindle

Bernadette Watkins

Indexer

Priya Subramani

Graphics

Sheetal Aute

Disha Haria

Abhinash Sahu

Production Coordinator

Conidon Miranda

Cover Work

Conidon Miranda

About the Authors

Paul Gerrard is a physician and healthcare researcher who is based out of Portland, Maine, where he currently serves as the medical director of the cardiopulmonary rehabilitation program at New England Rehabilitation Hospital of Portland. He studied business economics in college. After completing medical school, he did a residency in physical medicine and rehabilitation at Harvard Medical School and Spaulding Rehabilitation Hospital, where he served as chief resident and stayed on as faculty at Harvard before moving to Portland. He continues to collaborate on research projects with researchers at other academic institutions within the Boston area and around the country. He has published and presented research on a range of topics, including traumatic brain injury, burn rehabilitation, health outcomes, and the epidemiology of disabling medical conditions.

I would like to thank my beautiful wife, Deirdre, and my son, Patrick. My work on this book is dedicated to the loving memory of Fiona.

Radia M. Johnson has a doctorate in immunology and currently works as a research scientist at the Institute for Research in Immunology and Cancer at the Université de Montréal, where she uses genomics and bioinformatics to identify and characterize the molecular changes that contribute to cancer development. She routinely uses R and other computer programming languages to analyze large data sets from ongoing collaborative projects. Since obtaining her PhD at the University of Toronto, she has also worked as a research associate at the University of Cambridge in Hematology, where she gained experience using systems biology to study blood cancer.

I would like to thank Dr. Charlie Massie for teaching me to love programming in R and Dr. Phil Kousis for all his support through the years. You are both excellent mentors and wonderful friends!

About the Reviewers

Laurent Drouet holds a PhD in economics and social sciences from the University of Geneva, Switzerland, and a master's degree in applied mathematics from the Institute of Applied Mathematics of Angers, France. He was also a postdoctoral research fellow at the Research Lab of Economics and Environmental Management at the Ecole Polytechnique Federale de Lausanne (EPFL), Switzerland. He was also a researcher at the Public Research Center Tudor, Luxembourg. He is currently a senior researcher at Fondazione Eni Enrico Mattei (FEEM) and a research affiliate at Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC), Italy.

His main research is related to integrated assessment modeling and energy modeling. For more than a decade, he designed scientific tools to perform data analysis for this type of modeling. He also built optimization frameworks to couple models of many kinds (such as climate models, air quality models, and economy models). He created and developed the bottom-up techno-economic energy model ETEM to study optimal energy policies at urban or national levels.

I want to thank my wife for her support every day both in my private life and professional life.

Ratanlal Mahanta holds an MSc in computational finance. He is currently working at GPSK Investment Group as a senior quantitative analyst. He has 4 years of experience in quantitative trading and strategy development for sell-side and risk consulting firms. He is an expert in high-frequency and algorithmic trading. He has expertise in these areas: quantitative trading (FX, equities, futures and options, and engineering on derivatives); algorithms (partial differential equations, stochastic differential equations, the finite difference method, Monte Carlo, and machine learning); code (R programming, C++, MATLAB, HPC, and scientific computing); data analysis (Big Data analytics [EOD to TBT], Bloomberg, Quandl, and Quantopian); and strategies (vol-arbitrage, vanilla and exotic options modeling, trend following, mean reversion, co-integration, Monte Carlo simulations, Value at Risk, stress testing, buy-side trading strategies with high Sharpe ratio, credit risk modeling, and credit rating).

He has reviewed Mastering R for Quantitative Finance, Packt Publishing. He is currently reviewing two other books for Packt Publishing: Mastering Python for Data Science and Machine Learning with R Cookbook.

Mzabalazo Z. Ngwenya holds a postgraduate degree in mathematical statistics from the University of Cape Town. He has worked extensively in the field of statistical consulting, where he used a variety of statistical software, including R. His areas of interest are primarily centered on statistical computing. Previously, he was involved in reviewing Learning RStudio for R Statistical Computing, Mark P.J. van der Loo and Edwin de Jonge; R Statistical Application Development Example Beginner's Guide, Prabhanjan Narayanachar Tattar; Machine Learning with R, Brett Lantz; R Graph Essentials, David Alexander Lillis; and R Object-oriented Programming, Kelly Black, all by Packt Publishing. He currently works as a biometrician.

Donato Teutonico has several years of experience in modeling and the simulation of drug effects and clinical trials in industrial and academic settings. He received his PharmD degree from the University of Turin, Italy, specializing in chemical and pharmaceutical technology, and his PhD in pharmaceutical sciences from Paris-Sud University, France.

He is the author of two R packages for pharmacometrics, CTStemplate and panels-for-pharmacometrics, which are both available on Google Code. He is also the author of Instant R Starter, Packt Publishing.

www.PacktPub.com

Support files, eBooks, discount offers, and more

For support files and downloads related to your book, please visit www.PacktPub.com.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

https://www2.packtpub.com/books/subscription/packtlib

Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can search, access, and read Packt's entire library of books.

Why subscribe?

Fully searchable across every book published by Packt

Copy and paste, print, and bookmark content

On demand and accessible via a web browser

Free access for Packt account holders

If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view 9 entirely free books. Simply use your login credentials for immediate access.

Preface

As an open source computing environment, R is rapidly becoming the lingua franca of the statistical computing community. R's capable base functions, extensive statistical tools, open source nature, and avid user community have given it an expansive library of cutting-edge quantitative methods, many of which are not yet available to users of high-cost statistical programs.

With this book, you will learn not just about R, but how to use R to answer conceptual, scientific, and experimental questions.

Beginning with an overview of fundamental R concepts, including data types, R program flow, and basic coding techniques, you'll learn how R can be used to achieve the most commonly needed scientific data analysis tasks, including testing for statistically significant differences between groups and modeling relationships in data. You will also learn parametric and nonparametric techniques for both difference testing and relationship modeling.

You will delve into linear algebra and matrix operations with an emphasis not on the R syntax, but on how these operations can be used to address common computational or analytical needs. This book also covers the application of matrix operations for the purpose of finding a structure in high-dimensional data using the principal component, exploratory factor, and confirmatory factor analysis in addition to structural equation modeling. You will also master methods for simulation, learn about an advanced analytical method, and finish by going to the next level with advanced data management focused on dealing with messy and problematic datasets that serious analysts deal with daily.

By the end of this book, you will be able to undertake publication-quality data analysis in R.

What this book covers

Chapter 1, Programming with R, presents an overview of how data is stored and accessed in R. Then, we will go over how to load data into R using built-in functions and useful packages for easy import from Excel worksheets. We will also cover how to use flow control statements and functions to reduce complexity and help you program more efficiently.

Chapter 2, Statistical Methods with R, presents an overview of how to summarize your data and get useful statistical information for downstream analysis. We will show you how to plot and get statistical information from probability distributions and how to test the fit of your sample distribution to well-defined probability distributions.

Chapter 3, Linear Models, covers linear models, which are probably the most commonly used statistical methods to study the relationships between variables. The Generalized linear model section will delve into a bit more detail than typical R books, discussing the nature of link functions and canonical link functions.

Chapter 4, Nonlinear Methods, reviews applications of nonlinear methods in R using both parametric and nonparametric methods for both theory-driven and exploratory analysis.

Chapter 5, Linear Algebra, covers algebra techniques in R. We will also learn linear algebra operations including transposition, inversion, matrix multiplication, and a number of matrix transformations.

Chapter 6, Principal Component Analysis and the Common Factor Model, helps you understand the application of linear algebra to covariance and correlation matrices. We will cover how to use PCA to account for total variance in a set of variables and how to use EFA to model common variance among these variables in R.

Chapter 7, Structural Equation Modeling and Confirmatory Factor Analysis, covers the fundamental ideas underlying structural equation modeling, which are often overlooked in other books discussing SEM in R, and then delves into how SEM is done in R.

Chapter 8, Simulations, explains how to perform basic sample simulations and how to use simulations to answer statistical problems. We will also learn how to use R to generate random numbers, and how to simulate random variables from several common probability distributions.

Chapter 9, Optimization, explores a variety of methods and techniques to optimize a variety of functions. We will also cover how to use a wide range of R packages and functions to set up, solve, and visualize different optimization problems.

Chapter 10, Advanced Data Management, walks you through the basic techniques for data handling and some basic memory management considerations.

What you need for this book

The software that we require for this book is R Version 3.0.1 or higher, OpenMx Version 1.4, and RStudio.

Who this book is for

If you want to learn how to quantitatively answer scientific questions for practical purposes using the powerful R language and the open source R tool ecosystem, this book is ideal for you. It is ideally suited for scientists who understand scientific concepts, know a little R, and want to start applying R to be able to answer empirical scientific questions. Some R exposure is helpful, but not compulsory.

Conventions

In this book, you will find a number of text styles that distinguish between different kinds of information. Here are some examples of these styles and an explanation of their meaning.

Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: You can also retrieve additional information on the objects stored in your environment using the str() function.

A block of code is set as follows:

> integer_vector <- c(1L, 2L, 12L, 29L)

> integer_vector

[1] 1 2 12 29

New terms and important words are shown in bold. Words that you see on the screen, for example, in menus or dialog boxes, appear in the text like this: To install R on Windows, click on Download R for Windows, and then click on base for the download link and installation instructions.

Note

Warnings or important notes appear in a box like this.

Tip

Tips and tricks appear like this.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or disliked. Reader feedback is important for us as it helps us develop titles that you will really get the most out of.

To send us general feedback, simply e-mail <[email protected]>, and mention the book's title in the subject of your message.

If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide at www.packtpub.com/authors.

Customer support

Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.

Downloading the example code

You can download the example code files from your account at http://www.packtpub.com for all the Packt Publishing books you have purchased. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files e-mailed directly to you.

Downloading the color images of this book

We also provide you with a PDF file that has color images of the screenshots/diagrams used in this book. The color images will help you better understand the changes in the output.

You can download this file from: https://www.packtpub.com/sites/default/files/downloads/5253OS_ColoredImages.pdf.

Errata

Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books—maybe a mistake in the text or the code—we would be grateful if you could report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, please report them by visiting http://www.packtpub.com/submit-errata, selecting your book, clicking on the Errata Submission Form link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded to our website or added to any list of existing errata under the Errata section of that title.

To view the previously submitted errata, go to https://www.packtpub.com/books/content/support and enter the name of the book in the search field. The required information will appear under the Errata section.

Piracy

Piracy of copyrighted material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works in any form on the Internet, please provide us with the location address or website name immediately so that we can pursue a remedy.

Please contact us at <[email protected]> with a link to the suspected pirated material.

We appreciate your help in protecting our authors and our ability to bring you valuable content.

Questions

If you have a problem with any aspect of this book, you can contact us at <[email protected]>, and we will do our best to address the problem.

Chapter 1. Programming with R

Scientific computing is an informatics approach to problem solving using mathematical models and/or applying quantitative analysis techniques to interpret, visualize, and solve scientific problems. Generally speaking, scientists and data analysts are concerned with understanding certain phenomena or processes using observations from an experiment or through simulation. For example, a biologist may want to understand what changes in gene expression are required for a normal cell to become a cancerous cell, or a physicist may want to study the life cycle of galaxies through numerical simulations. In both cases, they will need to collect the data, and then manipulate and process it before it can be visualized and interpreted to answer their research question. Scientific computing is involved in all these steps.

R is an excellent open source language for scientific computing. It is widely used in industry and academia because it performs well and provides a cutting-edge software environment. It was initially designed as a software tool for statistical modeling but has since evolved into a powerful tool for data mining and analytics. In addition to its rich collection of classical numerical methods and basic operations, there are hundreds of R packages for a wide variety of scientific computing needs, such as state-of-the-art visualization methods, specialized data analysis tools, machine learning, and even packages such as Shiny to build interactive web applications. In this book, we will teach you how to use R and some of its packages to define and manipulate your data using a variety of methods for data exploration and visualization. This book will present state-of-the-art mathematical and statistical methods needed for scientific computing. We will also teach you how to use R to evaluate complex arithmetic expressions and perform statistical modeling. We will also cover how to deal with missing data and the steps needed to write your own functions tailored to your analysis requirements. By the end of this book, you will not only be comfortable using R and its many packages, but you will also be able to write your own code to solve your own scientific problems.

This first chapter will present an overview of how data is stored and accessed in R. Then, we will look at how to load your data into R using built-in functions and useful packages, in order to easily import data from Excel worksheets. We will also show you how to transform your data using the reshape2 package to make your data ready to graph by plotting functions such as those provided by the ggplot2 package. Next, you will learn how to use flow-control statements and functions to reduce complexity, and help you program more efficiently. Lastly, we will go over some of the debugging tools available in R to help you successfully run your programs in R.

The following is a list of the topics that we will cover in this chapter:

Atomic vectors

Lists

Object attributes

Factors

Matrices and arrays

Data frames

Plots

Flow control

Functions

General programming and debugging tools

Before we begin our overview of R data structures, if you haven't already installed R, you can download the most recent version from http://cran.r-project.org. R compiles and runs on Linux, Mac, and Windows, so you can download precompiled binaries to install it on your computer. For example, go to http://cran.r-project.org, click on Download R for Linux, and then click on ubuntu to get the most up-to-date instructions to install R on Ubuntu. To install R on Windows, click on Download R for Windows, and then click on base for the download link and installation instructions. For Mac OS users, click on Download R for (Mac) OS X for the download links and installation instructions.

In addition to the most recent version of R, you may also want to download RStudio, which is an integrated development environment that provides a powerful user interface that makes learning R easier and more fun. The main limitation of RStudio is that it has difficulty loading very large datasets. So if you are working with very large tables, you may want to run your analysis in R directly. That being said, RStudio is great for viewing the objects stored in your workspace at the click of a button. You can easily search help pages and packages by clicking on the appropriate tabs. Essentially, RStudio puts everything you need to analyze your data at your fingertips. The following screenshot is an example of the RStudio user interface running the code from this chapter:

You can download RStudio for all platforms at http://www.rstudio.com/products/rstudio/download/.

Finally, the font conventions used in this book are as follows. The code you should directly type into R is preceded by >, and any line preceded by # will be treated as a comment in R.

> The user will type this into R

This is the response from R

> # If the user types this, R will treat it as a comment

Note

Note that all the code written in this book was run with R Version 3.0.2.

Data structures in R

R objects can be grouped into two categories:

Homogeneous: This is when the content is of the same type of data

Heterogeneous: This is when the content contains different types of data

Atomic vectors, matrices, and arrays are data structures used to store homogeneous data, while lists and data frames are typically used to store heterogeneous data. R objects can also be organized based on the number of dimensions they contain. For example, atomic vectors and lists are one-dimensional objects, whereas matrices and data frames are two-dimensional objects. Arrays, however, are objects that can have any number of dimensions. Unlike other programming languages such as Perl, R does not have scalar or zero-dimensional objects. All single numbers and strings are stored in vectors of length one.
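
As a minimal sketch of the grouping described above, the following creates one object of each kind and inspects its dimensions. The object names (v, l, m, a, df, single) are illustrative and not taken from the book:

```r
# Illustrative objects; the names are hypothetical, chosen for this sketch
v  <- c(1.5, 2, 3)                              # atomic vector: homogeneous, one-dimensional
l  <- list(1, "a", TRUE)                        # list: heterogeneous, one-dimensional
m  <- matrix(1:6, nrow = 2)                     # matrix: homogeneous, two-dimensional
a  <- array(1:24, dim = c(2, 3, 4))             # array: homogeneous, any number of dimensions
df <- data.frame(id = 1:2, name = c("x", "y"))  # data frame: heterogeneous, two-dimensional

is.atomic(v)   # TRUE
is.list(l)     # TRUE
dim(v)         # NULL, because a plain vector carries no dim attribute
dim(m)         # 2 3
dim(a)         # 2 3 4
dim(df)        # 2 2

# There are no scalars: a single number is a vector of length one
single <- 42
is.vector(single)  # TRUE
length(single)     # 1
```

Note that dim() returns NULL for vectors and lists; length() is the function to use for one-dimensional objects.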

Atomic vectors

Vectors are the basic data structure in R and include atomic vectors and lists. Atomic vectors are flat and can be logical, numeric (double), integer, character, complex, or raw. To create a vector, we use the c() function, which means combine elements into a vector:

> x <- c(1, 2, 3)

To create an integer vector, append L to each number, as follows:

> integer_vector <- c(1L, 2L, 12L, 29L)

> integer_vector

[1] 1 2 12 29

To create a logical vector, add TRUE (T) and FALSE (F), as follows:

> logical_vector <- c(T, TRUE, F, FALSE)

> logical_vector

[1] TRUE TRUE FALSE FALSE

To create a vector containing strings, simply add the words/phrases in double quotes:

> character_vector <- c("Apple", "Pear", "Red", "Green", "These are my favorite fruits and colors")

> character_vector

[1] "Apple"

[2] "Pear"

[3] "Red"

[4] "Green"

[5] "These are my favorite fruits and colors"

To create a numeric vector, list the numbers, with or without decimal points:

> numeric_vector <- c(1, 3.4, 5, 10)

> numeric_vector

[1] 1.0 3.4 5.0 10.0
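
To check which type R actually used for a vector, typeof() reports the storage type. The sketch below is illustrative rather than from the book, and the names vectors and types are assumptions made for this example:

```r
# One small vector of each kind discussed above (hypothetical names)
vectors <- list(
  numeric   = c(1, 3.4, 5, 10),
  integer   = c(1L, 2L, 12L, 29L),
  logical   = c(TRUE, FALSE),
  character = c("Apple", "Pear")
)

# typeof() returns the storage type of each vector
types <- sapply(vectors, typeof)
types
# numeric: "double", integer: "integer", logical: "logical", character: "character"

# class() reports "numeric" for doubles, while typeof() reports "double"
class(vectors$numeric)   # "numeric"

# Without the L suffix, whole numbers are still stored as doubles
is.integer(2)            # FALSE
is.integer(2L)           # TRUE
```

One related point worth knowing: T and F are ordinary variables that can be reassigned, whereas TRUE and FALSE are reserved words, so spelling the values out is the safer habit in scripts.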

R also includes functions that allow you to create vectors containing repetitive elements with rep() or a sequence of numbers with seq():

> seq(1, 12, by=3)

[1] 1 4 7 10

> seq(1, 12) #note the default parameter for by is 1

[1] 1 2 3 4 5 6 7 8 9 10 11 12

Instead of using the seq() function, you can also use a colon, :, to indicate that you would like numbers 1 to 12 to be stored as a vector, as shown in the following example:

> y <- 1:12

> y

[1] 1 2 3 4 5 6 7 8 9 10 11 12

> z <- c(1:3, y)

> z

[1] 1 2 3 1 2 3 4 5 6 7 8 9 10 11 12
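
Beyond the by argument shown above, seq() can also be asked for a fixed number of evenly spaced values, and seq_len() is a common companion to the colon operator. This is a small illustrative sketch; the variable names are assumptions made for this example:

```r
# Ask for a fixed number of evenly spaced values instead of a step size
quarters <- seq(0, 1, length.out = 5)
quarters                      # 0.00 0.25 0.50 0.75 1.00

# A negative step counts down
countdown <- seq(10, 1, by = -3)
countdown                     # 10 7 4 1

# The colon operator also counts down when the left value is larger,
# which can be surprising when a length happens to be zero
n <- 0
1:n                           # 1 0
seq_len(n)                    # integer(0), a safer choice for loop indices
```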

To replicate elements of a vector, you can simply use the rep() function, as follows:

> x <- rep(3, 14)

> x

[1] 3 3 3 3 3 3 3 3 3 3 3 3 3 3

You can also replicate complex patterns as follows:

> rep(seq(1, 4), 3)

[1] 1 2 3 4 1 2 3 4 1 2 3 4
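
The rep() function also accepts the times, each, and length.out arguments, which control how the pattern is repeated. The following sketch is illustrative and uses hypothetical variable names:

```r
# Repeat the whole vector three times
whole <- rep(c("a", "b"), times = 3)
whole      # "a" "b" "a" "b" "a" "b"

# Repeat each element three times before moving to the next
grouped <- rep(c("a", "b"), each = 3)
grouped    # "a" "a" "a" "b" "b" "b"

# Give a separate repeat count for every element
stepped <- rep(1:3, times = c(3, 2, 1))
stepped    # 1 1 1 2 2 3

# Recycle the pattern until the result has exactly seven elements
padded <- rep(1:3, length.out = 7)
padded     # 1 2 3 1 2 3 1
```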

Atomic vectors can hold only one type, so if you mix types, the vector is coerced to the most flexible type present. From most to least flexible, the vector types are character, numeric, integer, and logical, as shown in the following diagram:

This means that if you mix numbers with strings, your vector will be coerced into a character vector, which is the more flexible type of the two. The two examples below show this coercion in practice. The first shows that when a character vector and a numeric vector are combined, the new object becomes a character vector because a character vector is more flexible than a numeric vector. Similarly, in the second example, the class of the new object x is numeric because a numeric vector is more flexible than an integer vector. The two examples are as follows:

Example 1:

> mixed_vector <- c(character_vector, numeric_vector)

> mixed_vector

[1] "Apple"

[2] "Pear"

[3] "Red"

[4]
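
The coercion order described above (character, numeric, integer, logical) can be checked directly with class(). This is a minimal sketch written for illustration, not the book's own example, and the variable names are assumptions:

```r
# Character beats numeric: 3.4 is stored as the string "3.4"
chr_num <- c("Apple", 3.4)
class(chr_num)   # "character"

# Numeric beats integer
num_int <- c(1L, 2.5)
class(num_int)   # "numeric"

# Integer beats logical: TRUE is stored as 1L
int_lgl <- c(2L, TRUE)
class(int_lgl)   # "integer"

# Logical-to-number coercion is what makes counting TRUE values work
n_true <- sum(c(TRUE, FALSE, TRUE))
n_true           # 2

# Going the other way is lossy: text that is not a number becomes NA.
# R normally emits a warning here; it is suppressed to keep the output clean.
back <- suppressWarnings(as.numeric(c("1", "3.4", "Red")))
back             # 1.0 3.4 NA
```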

