Applied Data Science (ADS)

ADS 500A | PROBABILITY AND STATISTICS FOR DATA SCIENCE

Units: 3 Repeatability: No

This course is an introduction to probability and statistical concepts and their applications in solving real-world problems. This prerequisite course provides a solid background in the application of probability and statistics that will form the basis for advanced data science methods. Statistical concepts, probability theory, random and multivariate variables, data and sampling distributions, descriptive statistics, and hypothesis testing will be covered. The use of computer-based applications for the performance of basic statistics will be utilized. Covered topics include the numerical and graphical description of data, elements of probability, sampling distributions, probability distribution functions, estimation of population parameters, and hypothesis tests. This course will combine the learnings from texts, case studies, and standard organizational processes with practical problem-solving skills to present, structure, and plan the problem as it would be presented in large enterprises and execute the steps in a structured analytics process.

ADS 500B | DATA SCIENCE PROGRAMMING

Units: 3 Repeatability: No

This course is an introduction to fundamental concepts of programming and problem-solving techniques for data science. Python and R are the languages used to analyze and deliver insights from real-world datasets. Topics include the basics of Python and R, data acquisition, integration and transformation, problem understanding, data preparation, standardization, and exploratory data analysis. In addition, command line tools and editors are explored in UNIX, and methods to access and analyze RDBMS databases are examined. The course ends with introducing students to the basics of machine learning models.

ADS 501 | FOUNDATIONS OF DATA SCIENCE AND DATA ETHICS

Units: 3 Repeatability: No

Prerequisites: ADS 500A with a minimum grade of C- and ADS 500B with a minimum grade of C-

This course covers an introduction to the methods, concepts, and ethical considerations found and practiced in the field of professional data science. Topics include defining and structuring the problem, managing the business, the CRISP-DM and Agile processes, ensuring the science in data science using the scientific method, project management, managing ethical concerns and model bias, and the importance of performing exploratory data analysis. This course will combine the learnings from case studies, texts, and standard organizational processes with practical problem-solving skills to present, structure, plan, and present the problem as it would be done in large enterprises, including executing steps in the data science work-stream.

ADS 502 | APPLIED DATA MINING

Units: 3 Repeatability: No

Prerequisites: ADS 500A with a minimum grade of C- and ADS 500B with a minimum grade of C-

Data Mining is one of the most important topics in the data science field. This course discusses theoretical concepts and practical algorithms for both supervised and unsupervised learning techniques. The course provides data mining principles, methods, and applications with a variety of integrated theoretical and practical examples in classification, association analysis, cluster analysis, and anomaly detection. This course also includes applied examples associated with each topic in data mining using R and Python programming languages.

ADS 503 | APPLIED PREDICTIVE MODELING

Units: 3 Repeatability: No

Prerequisites: ADS 500A with a minimum grade of C- and ADS 500B with a minimum grade of C-

This course provides a working knowledge of applied predictive modeling. Students will obtain a broad understanding of model training, evaluations, and development procedures with a wide variety of applications to real-world problems. This course introduces best practices for managing data science projects and presenting analytical results to technical and non-technical audiences. Course topics include linear and non-linear regression modeling methods, linear and non-linear classification modeling methods, model selection, variable importance, variable selection and model applications, code, and R package management using RStudio.

ADS 504 | MACHINE LEARNING AND DEEP LEARNING FOR DATA SCIENCE

Units: 3 Repeatability: No

Prerequisites: ADS 500A with a minimum grade of C- and ADS 500B with a minimum grade of C-

This course covers the study of supervised and unsupervised algorithms in the Machine Learning context. Emphasis on formulating, choosing, applying, implementing, and evaluating machine learning models to capture key patterns exhibited in cross-sectional data and longitudinal data. This course also discusses the considerations of model complexity interpretations and implementation in real-world applications using Python and associated packages. An introduction to Deep Learning is provided in this course.

ADS 505 | APPLIED DATA SCIENCE FOR BUSINESS

Units: 3 Repeatability: No