The demand for skilled data science practitioners in industry, academia, and government is growing rapidly. Microsoft consulted data scientists and the companies that employ them to identify the core skills they need to be successful. This informed the curriculum used to teach key functional and technical skills, which combines highly rated online courses with hands-on labs and culminates in a final capstone project.

The program gives you the knowledge base and practical skills needed to tackle real-world data analysis challenges. It covers concepts such as probability, inference, regression, and machine learning, and helps you develop an essential skill set that includes R programming, data wrangling with dplyr, data visualization with ggplot2, file organization with Unix/Linux, version control with git and GitHub, and reproducible document preparation with RStudio.

Throughout the program, we will use the R software environment. You will learn R, statistical concepts, and data analysis techniques simultaneously. We believe that you retain R knowledge better when you learn it while solving a specific problem. Furthermore, HarvardX has partnered with DataCamp for all assignments, which use code-checking technology that gives you hands-on practice during the courses.

  • Fundamental R programming skills
  • Statistical concepts such as probability, inference, and modeling, and how to apply them in practice
  • Experience with the tidyverse, including data visualization with ggplot2 and data wrangling with dplyr
  • Familiarity with essential tools for practicing data scientists, such as Unix/Linux, git and GitHub, and RStudio
  • Implementation of machine learning algorithms
  • In-depth knowledge of fundamental data science concepts through motivating real-world case studies

Course Number  Title                                                            Track Mapping
DAT101x        Microsoft Professional Orientation: Data Science                 1
DAT201x        Querying Data with Transact-SQL                                  2
DAT206x        Analyzing and Visualizing Data with Excel                        3.1
DAT207x        Analyzing and Visualizing Data with Power BI                     3.2
DAT222x        Essential Statistics for Data Analysis using Excel               4
DAT204x        Introduction to R for Data Science                               5.1
DAT208x        Introduction to Python for Data Science                          5.2
DAT203.1x      Data Science Essentials                                          6
DAT203.2x      Principles of Machine Learning                                   7
DAT209x        Programming with R for Data Science                              8.1
DAT210x        Programming with Python for Data Science                         9.1
DAT202.3x      Implementing Predictive Analytics with Spark in Azure HDInsight  9.2
DAT213x        Analyzing Big Data with Microsoft R                              10