When you think about what data analysts and data scientists do on a day-to-day basis, you might have a general understanding of types of conclusions they make, but how do they arrive at those conclusions? The statistical programming language R is widely used in data science; understanding the basics of how it works can help you manipulate and visualize data in a quick, flexible manner, and it may improve your communication with data scientists on your team.

In this course, you will explore the basics of statistical programming and develop R skills. As you hone your ability to use commands in R, you will combine those basic skills to complete more complex tasks, such as data manipulation and visualization. Finally, you will examine how to repeat tasks in R, which makes it easier to manipulate large data sets. This course involves many hands-on coding exercises to help you gain confidence in your newfound programming skills.

System requirements: This course contains a virtual programming environment that does not support the use of Safari, Edge, tablets, or mobile devices. Please use Chrome, Firefox, or Internet Explorer on a computer for this course.

The real world is extremely complex, and revealing the patterns that underlie these complexities can be challenging. However, unlocking the power of a data set can provide you with remarkable insights and help guide decision-making. This course will prepare you to use summarization and visualization techniques to reveal patterns in real-world data, using examples from a variety of disciplines, including business and medicine.

In this course, Professor Basu will guide you as you begin to understand key data collection principles and how to make conclusions from data. Choosing which analyses to use depends on your question, so you will use a framework to help you choose which methods to use with your data. Then, you will use R to perform exploratory data analyses, which will allow you to identify key patterns and trends in a ready-to-analyze data set. You will also learn the importance of quantifying the uncertainty associated with your results, and how to measure variability in your data. This course involves many hands-on coding exercises in R to help you gain confidence in your programming skills.

System requirements: This course contains a virtual programming environment that does not support the use of Safari, Edge, tablets, or mobile devices. Please use Chrome, Firefox, or Internet Explorer on a computer for this course.

“Exploring Data Sets With R” must be completed prior to starting this course.

In this course, you will explore the steps associated with testing a hypothesis and use a variety of simulation methods to test hypotheses in R; these different methods will allow you to test hypotheses for various possible scenarios. As you perform hypothesis tests, you will discover how to assess the uncertainty associated with your data set and the test. You will also analyze the relationship between two or more variables using linear regression analysis and determine how to assess these relationships with simple diagnostic tools.

Throughout this course, you will perform hands-on coding exercises to practice simulations in R, which will help you gain confidence in both your programming and statistical skills. After completing this course, you will be able to test hypotheses that involve two or more variables in a ready-to-analyze data set using simulations in the programming language R. You will also understand the uncertainty associated with your hypothesis tests and how it impacts your conclusions.

System requirements: This course contains a virtual programming environment that does not support the use of Safari, Edge, tablets, or mobile devices. Please use Chrome, Firefox, or Internet Explorer on a computer for this course.

“Exploring Data Sets With R” and “Summarizing and Visualizing Data” must be completed prior to starting this course.

Data scientists use data collected from the real world to answer questions and solve problems that would otherwise be intractable. But since the world is complex, data collected to describe the world can also be complex, which makes it messy and difficult to work with. To successfully analyze data, data scientists need to spend time cleaning — or organizing and manipulating — their data to put it into a form that is easier to work with and understand.

In this course, you will delve into the world of data cleaning by presenting and manipulating your data with the Tidyverse in R. You will organize data by selecting only the variables you're interested in, creating new groups of data, and summarizing data in a way that makes sense for the questions you're trying to ask. You will also create high-quality plots to quickly summarize complex data. You will become familiar with the concept of tidy data and organize data sets in a way that allows for the most efficient analysis. Finally, you will work with data types of more complexity so that you can answer increasingly difficult questions as you take your new skills into your workplace. You will practice all these skills by working with four real-world, complex data sets. This course involves many hands-on coding exercises that will help you take your programming skills to the next level.

System requirements: This course contains a virtual programming environment that does not support the use of Safari, Edge, tablets, or mobile devices. Please use Chrome, Firefox, or Internet Explorer on a computer for this course.

“Exploring Data Sets With R” and “Measuring Relationships and Uncertainty” must be completed prior to starting this course.

How It Works

Request Information Now
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.