Python for Data ScienceCornell Certificate Program
Overview and Courses
Data science is one of today’s most in-demand functions — and Python is an essential skill in any data scientist’s toolbox. In this program, you will master the ability to analyze and visualize data in meaningful ways using Python to help solve complex business problems. Working with tools such as Jupyter Notebooks, NumPy, and Pandas, you will have the opportunity to analyze real-world datasets to identify patterns and relationships in data. You will gain experience using both built-in and custom-built data types to create expressive and computationally robust data science projects. Finally, you will build predictive machine learning models using Python and scikit-learn.
To be successful in this program you should have prior programming experience with a procedural language. Our Python Programming certificate program is a great option if you have less experience. The amount of time you spend in these courses will depend on your prior experience. These courses are designed to be taken in order as the concepts build upon each other throughout the program.
The courses in this certificate program are required to be completed in the order that they appear.Course list
This course introduces you to the different scenarios in which you will utilize built-in Python functions, classes, and data types as opposed to creating your own or using a combination of built-in and custom-built capabilities. You will gain experience working with both built-in and custom-built functions, classes, and data types. Through practice and application of these basic building blocks/tools, you will gain an in-depth understanding of how these aspects of Python interoperate to create useful programs.
You are required to have completed the following course or have equivalent experience before taking this course:
- Constructing Expressions in Python
Python is much more than a programming language. In this course, you will leverage the comprehensive Python ecosystem of libraries, frameworks, and tools to develop complex data science applications. Throughout this course, you will practice using the different Python tools appropriate to your dataset. You will leverage library resources for data acquisition and analysis as well as machine learning. Dataframes will be introduced as a means of manipulating structured data tables for advanced analysis. Additionally, you will practice basic routines for data visualization utilizing Jupyter Notebooks.
You are required to have completed the following courses or have equivalent experience before taking this course:
- Constructing Expressions in Python
- Writing Custom Python Functions, Classes, and Workflows
Decision-makers generally do not use raw data to make decisions; they prefer data be summarized in easily understood formats that facilitate efficient decision-making. This course introduces data manipulation and visualization, both critical components of any data science project. This course introduces two commonly used data manipulation tools in the Python ecosystem: NumPy and Pandas. In addition, the Python ecosystem also includes a variety of data plotting packages such as Matplotlib, Seaborn, and Bokeh — each of which specialize in particular aspects of data visualization. This course will give you experience integrating NumPy, Pandas, and the plotting packages to create rich, interactive data visualizations that help drive efficient decision-making.
You are required to have completed the following courses or have equivalent experience before taking this course:
- Constructing Expressions in Python
- Writing Custom Python Functions, Classes, and Workflows
- Developing Data Science Applications
Most data science projects that use Python will require you to access and integrate different types of data from a variety of external sources. This course will give you experience identifying and integrating data from spreadsheets, text files, websites, and databases. To prepare for downstream analyses, you first need to integrate any external data sources into your Python program. You will utilize existing packages and develop your own code to read data from a variety of sources. You will also practice using Python to prepare disorganized, unstructured, or unwieldy datasets for analysis by other stakeholders.
You are required to have completed the following courses or have equivalent experience before taking this course:
- Constructing Expressions in Python
- Writing Custom Python Functions, Classes, and Workflows
- Developing Data Science Applications
- Creating Data Arrays and Tables in Python
In order to be useful within a professional environment, data must be structured in a way that can be understood and applied to real-world scenarios. This course introduces using Python to perform statistical data analysis and create visualizations that uncover patterns in your data. Using the tools and workflows you developed in earlier courses, you will carry out analyses on real-world datasets to become familiar with recognizing and utilizing patterns. Finally, you will form and test hypotheses about your data which will become the foundation upon which data-driven decision-making is built.
You are required to have completed the following courses or have equivalent experience before taking this course:
- Constructing Expressions in Python
- Writing Custom Python Functions, Classes, and Workflows
- Developing Data Science Applications
- Creating Data Arrays and Tables in Python
- Organizing Data with Python
In this course, you will explore some of the machine learning tools you can use to magnify the analytical power of Python data science programs. You will use the scikit-learn package — a Python package developed for machine learning applications — to develop predictive machine learning models. You will then practice using these models to discover new relationships and patterns in your data. These capabilities allow you to unlock additional value in your data that will aid in making predictions and, in some cases, creating new data.
You are required to have completed the following courses or have equivalent experience before taking this course:
- Constructing Expressions in Python
- Writing Custom Python Functions, Classes, and Workflows
- Developing Data Science Applications
- Creating Data Arrays and Tables in Python
- Organizing Data with Python
- Analyzing and Visualizing Data with Python
How It Works
- View slide #1
- View slide #2
- View slide #3
- View slide #4
- View slide #5
- View slide #6
- View slide #7
- View slide #8
- View slide #9
Faculty Author
Key Course Takeaways
- Visualize data with Python
- Write custom functions and data classes in Python that can be stored for reuse
- Use key elements of Python control flow and iteration
- Use Jupyter Notebooks to integrate data analysis, visualization, and documentation
- Manipulate data arrays and tables using NumPy and Pandas
- Filter, integrate, and prepare data for analysis
- Perform statistical data analysis and visualization
- Explore datasets with machine learning
Download a Brochure
Not ready to enroll but want to learn more? Download the certificate brochure to review program details.What You'll Earn
- Python for Data Science Certificate from Cornell Center for Advanced Computing
- 105 Professional Development Hours (7 CEUs)
Watch the Video
Who Should Enroll
- Data analysts and business analysts
- Database managers
- Technical and systems analysts
- Programmers interested in data science
- Marketers
- Business managers
Request Information Now by completing the form below.
Python for Data Science
Select Payment Method | Cost |
---|---|
$3,900 | |