The DataScience@Denver curriculum prepares students to design tools that collect, evaluate and interpret data to inform critical decisions. Through rigorous, mathematically based coursework, students master advanced concepts to tackle the world’s most important big-data challenges.
DataScience@Denver requires those without a computer science background to complete three 4-credit bridge courses. However, students may test out of these courses. Please note that these bridge courses do not count toward the 48 credits required for degree completion.
Designed to prepare students for success, bridge courses include:
Computer Science Programming Basics: Covers the basics of Python programming
Data Science Mathematics 1: Covers elements of calculus essential for data science
Data Science Mathematics 2: Covers elements of linear algebra and discrete math
The core courses listed below are required for all DataScience@Denver students, regardless of background or data science experience. Review our sample course schedule for an example of program progression and course sequence.
COMP 3006: Python Software Development (4 Credits)
This accelerated course covers advanced Python programming for data scientists. Course Objectives: name and demonstrate proficiency using advanced Python programming techniques for data science, analyze a programming task and create a development plan and high-level software design that accomplishes the task, relate common portions of the Python standard library to specific programming tasks, understand and apply aspects of the Python scientific programming ecosystem to achieve a data-science analysis goal, and collaborate with another data scientist to develop a software program that completes a given data-science task.
COMP 3421: Database Organization & Management I (4 Credits)
An introductory class explaining what a database is and how to use one. Topics include database design, ER modeling, database normalization, relational algebra, SQL and B-trees. Each student will design, load, query and update a nontrivial database using a relational database management system (RDBMS). An introduction to a NoSQL database will be included.
COMP 4334: Parallel and Distributed Computing (4 Credits)
Current techniques for effective use of parallel processing and large-scale distributed systems for data science. Programming assignments will give students experience in the use of these techniques. Specific topics will vary from year to year to incorporate recent developments.
COMP 4431: Data Mining (4 Credits)
Data mining is the process of extracting useful information implicitly hidden in large databases. Various techniques from statistics and artificial intelligence are used here to discover hidden patterns in massive collections of data. This course is an introduction to these techniques and their underlying mathematical principles. Topics covered include basic data analysis, frequent pattern mining, clustering, classification and model assessment.
COMP 4432: Machine Learning (4 Credits)
This course gives an overview of machine learning techniques, their strengths and weaknesses, and the problems they are designed to solve. It covers the broad differences between supervised, unsupervised and reinforcement learning, as well as associated learning problems such as classification and regression. Techniques covered, at the discretion of the instructor, may include linear and logistic regression, neural networks, support vector machines, kNN, decision trees, random forests, Naive Bayes, EM, k-means and PCA. After course completion, students will have a working knowledge of these approaches and experience applying them to learning problems.
COMP 4433: Data Visualization (4 Credits)
This course explores visualization techniques and theory. The course covers how to use visualization tools to effectively present data as part of quantitative statements within a publication or report and as an interactive system. Both design principles (color, layout, scale and psychology of vision) and technical visualization tools and languages will be covered.
COMP 4441: Introduction to Probability and Statistics for Data Science (4 Credits)
The course introduces fundamentals of probability for data science. Students will survey data visualization methods and summary statistics, develop models for data and apply statistical techniques to assess the validity of the models. The techniques will include parametric and non-parametric methods for parameter estimation and hypothesis testing for a single sample mean and two sample means, for proportions, and for simple linear regression. Students will acquire sound theoretical footing for the methods where practical, and will apply them to real-world data, primarily using R.
COMP 4442: Advanced Probability and Statistics for Data Science (4 Credits)
This course builds on material in COMP 4441: Introduction to Probability and Statistics for Data Science. Students will carry out model fitting and diagnostics for multiple regression, ANOVA, ANCOVA and generalized linear models. Dimension reduction techniques such as PCA and Lasso are introduced, as are techniques for handling dependent data. The course introduces the principles of resampling and Bayesian analysis. Students will acquire sound theoretical footing for the methods, where practical, and will apply them to real-world data, primarily using R.
COMP 4447: Data Science Tools I (4 Credits)
Organizations are using data science to extract actionable insight from data. To help students uncover hidden patterns in data, this course equips them with essential skills for data collection, cleanup, transformation, feature engineering, summarization and visualization. In this hands-on course, students complete assignments and a final project using Python libraries, Linux commands and various data sets.
COMP 4448: Data Science Tools II (4 Credits)
Building a successful predictive model is a multifaceted process. This course focuses on hypothesis testing and the development of predictive models. Students will also learn how to perform graph-based modeling and optimization. In this hands-on course, students complete assignments and a final project using Python libraries, Linux commands and various data sets.
COMP 4449: Data Science Capstone (4 Credits)
Students identify and fill a demand for an innovative data science product, such as a database tool, analytical software or domain-specific analysis. The product is defined, implemented, documented, tested and presented by the student or student team, with the instructor and other stakeholders acting as project supervisors to verify that goals are met throughout the 10-week development process.
COMP 4581: Algorithms for Data Science (4 Credits)
This course introduces the design and analysis of algorithms within the context of data science. Topics include data structures, asymptotic complexity and algorithm design techniques such as incremental design, divide and conquer, dynamic programming, randomization, greedy algorithms and advanced sorting techniques. Examples that illustrate these techniques are drawn from multi-dimensional clustering (k-means and probabilistic), regression, decision trees, order statistics, data mining using the Apriori algorithm, and algorithms for generating combinatorial objects.
Take on a Rewarding Challenge
Ready to develop the technical and analytical skills you need to advance your data science career? Request more information about the online MS in Data Science from the University of Denver.