Hello, and welcome to the DATA2020 Lab

Introduction to the Course

This course covers the major programming tools used in data science, including libraries in Python and the R programming language. The Python libraries covered are those used for data preprocessing, managing data tables and data pipelines, dataflow programming on GPUs, visualization, and scientific computing. Some R counterparts to these are also covered. Major components of the R development environment, including documentation and external integration, are taught as well.

Most of the labs are project-based applications, so that you can better understand how to apply these tools to real problems.

Submission Requirements (vary by lab)

<aside> 💡

For Python labs: submit the .ipynb notebook file

</aside>

<aside> 💡

For R labs: submit the .Rmd notebook file and the .pdf output generated with Knit

</aside>

Application Lab Week 1

Objectives

The Role of Programming in Data Science

Discussion Questions

Example: Uber ride dataset

https://www.kaggle.com/datasets/yashdevladdha/uber-ride-analytics-dashboard/data
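To get the discussion started, here is a minimal sketch of a first look at a downloaded CSV, using only the Python standard library. It counts the rows, lists the columns, and counts missing values per column. The function name `summarize_csv`, the set of missing-value markers, and the file name in the usage comment are assumptions for illustration; check the actual file name and column names after downloading the dataset from Kaggle.

```python
import csv
from collections import Counter

# Assumption: strings treated as "missing". Adjust after inspecting the real data.
MISSING_MARKERS = {"", "null", "NA", "NaN"}


def summarize_csv(path, max_rows=None):
    """Return (row_count, column_names, missing_counts) for a CSV file.

    missing_counts maps each column name to the number of rows whose
    value is absent or matches one of MISSING_MARKERS.
    """
    missing = Counter()
    row_count = 0
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        columns = list(reader.fieldnames or [])
        for row in reader:
            # Optionally stop early when only a quick peek is needed.
            if max_rows is not None and row_count >= max_rows:
                break
            row_count += 1
            for col in columns:
                value = row.get(col)
                if value is None or value.strip() in MISSING_MARKERS:
                    missing[col] += 1
    return row_count, columns, {col: missing[col] for col in columns}


# Usage (hypothetical file name; replace with the file you downloaded):
# rows, columns, missing = summarize_csv("uber_rides.csv")
# print(rows, columns, missing)
```

A quick summary like this is one way to frame the discussion: which columns are usable as they are, which need cleaning, and what questions the data could answer.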