You are reading the 2024/25 Academic Calendar. The 2023/24 version remains in effect until August 31, 2024 and is available here.

Data Science, Faculty of Science

DATA: Data Science


  1. DATA 101 (3) Making Predictions with Data

    Introduction to the techniques and software for handling real-world data. Topics include data cleaning, visualization, simulation, basic modelling, and prediction making. [3-1-0]

  2. DATA 301 (3) Introduction to Data Analytics

    Techniques for computation, analysis, and visualization of data using software. Manipulation of small and large data sets. Databases. Automation using scripting. Real-world applications from life sciences, physical sciences, economics, engineering, or psychology. No prior computing background is required. Cannot be used for credits toward a major in Computer Science, Data Science, Mathematics, or Statistics. Credit will be granted for only one of COSC 301, DATA 301 or DATA 501. [3-2-0]

    Prerequisite: Third-year standing.

  3. DATA 310 (3) Applied Regression Analysis

    Theory and application of simple and multiple linear regression models, estimation, inference (confidence intervals, prediction intervals and hypothesis testing), polynomial regression, ANOVA and ANCOVA, variable selection, model adequacy and residual diagnostics. [3-0-0]

    Prerequisite: MATH 221 and one of STAT 205, STAT 230.

  4. DATA 311 (3) Machine Learning

    Regression, classification, resampling, model selection and validation, fundamental properties of matrices, dimension reduction, tree-based methods, unsupervised learning. [3-2-0]

    Prerequisite: Either (a) one of STAT 205, STAT 230 or (b) a score more than 75% in one of APSC 254, BIOL 202, PSYO 373; and one of COSC 111, APSC 177.

  5. DATA 315 (3) Applied Time Series and Forecasting

    Trends, stationary and nonstationary time series models, forecasting, seasonal models. [3-0-0]

    Prerequisite: One of STAT 205, STAT 230.

  6. DATA 405 (3) Stochastic Modelling and Simulation

    Pseudorandom number generation and testing. Simulation and modelling of univariate and multivariate data; stochastic models, including Poisson processes and Markov chains; MCMC simulation, hidden Markov models, and queuing systems. Credit will be granted for only one of COSC 405, DATA 405, COSC 505, or DATA 505. [3-2-0]

    Prerequisite: One of STAT 205, STAT 230 [with 60% or above].

  7. DATA 407 (3) Sampling and Design

    Planning/practice of data collection. Pros/cons of both observational and experimental data. Survey samples: random sampling; bias and variance; unequal probability sampling; systematic, multistage, and stratified sampling; ratio and regression estimators. Experimental design: simple one-way comparisons; designs with randomization restrictions including blocking, split-plots, nested and repeated measures designs. Credit will be granted for only one of DATA 407 or STAT 507. [3-1-0]

    Prerequisite: One of STAT 205, STAT 230, PSYO 372, BIOL 202.

  8. DATA 410 (3) Regression and Generalized Linear Models

    Regression, linear models, generalized linear models, additive models, generalized additive models, mixed models, theory and numerical performance. Credit will be granted for only one of DATA 410 or STAT 538. [3-0-0]

    Prerequisite: DATA 310.

  9. DATA 419 (3-9) d Topics in Data Science

    Advanced or specialized topics in data science. Consult the department for the specific topic to be offered in any given year. This course may be taken more than once for credit with different topics. [3-0-0]

    Prerequisite: Fourth-year standing.

  10. DATA 448 (3/6) d Directed Studies in Data Science

    Investigation of a specific topic as agreed upon by the student and the faculty supervisor. Completion of a project and an oral presentation are required.

    Prerequisite: Third-year standing in the Data Science major or Honours, and permission of the department head.

  11. DATA 449 (6) Honours Thesis

    Students will undertake a research project as agreed upon by the student, supervising faculty member, and unit head. A written thesis and a public presentation (poster or seminar) are required. Restricted to students in the B.Sc. Data Science Honours Program.

    Prerequisite: Fourth-year standing and permission of the department head.

  12. DATA 500 (3) Communication and Consulting in Data Science

    Effective consulting practices, ethical considerations, methodology selection, data preparation, effective software development. Credit will be granted for only one of DATA 500 or STAT 400 when the subject matter is of the same nature.

  13. DATA 501 (3) Data Analytics

    Techniques for computation, analysis, and visualization of data using software. Manipulation of small and large data sets. Automation using scripting. Real-world applications from life sciences, physical sciences, engineering, or psychology. Credit will be granted for only one of COSC 301, DATA 301 or DATA 501.

  14. DATA 505 (3) Modelling and Simulation

    Simulation methodology: data collection, model design, output analysis, optimization, validation. Credit will be granted for only one of COSC 405, DATA 405, COSC 505, or DATA 505.

  15. DATA 530 (1) Computing Platforms for Data Science

    Introduction to software and tools for Data Science. Setup process. Restricted to students in the MDS program.

  16. DATA 531 (1) Programming for Data Science

    Programming including decisions, loops, functions, and using data structures and libraries. Restricted to students in the MDS program.

  17. DATA 532 (1) Algorithms and Data Structure

    Data structures including lists, queues, stacks, hash tables, trees and graphs. Recursion. Searching and sorting. Asymptotic complexity. Restricted to students in the MDS program.

  18. DATA 533 (1) Collaborative Software Development

    Software life cycle. Licensing. Packaging. Testing and quality control. Version control. Collaborative environments. Restricted to students in the MDS program.

    Prerequisite: DATA 532.

  19. DATA 534 (1) Web and Cloud Computing

    Parallel and cloud computing architectures and program deployment. Restricted to students in the MDS program.

  20. DATA 540 (1) Databases and Data Retrieval

    Using and querying relational and NoSQL databases for analysis. Experience with SQL, JSON, and programs that use databases. Restricted to students in the MDS program.

    Prerequisite: DATA 531.

  21. DATA 541 (1) Scripting and Reporting

    Scripting engines for data science. Reporting tools. Automation. Restricted to students in the MDS program.

  22. DATA 542 (1) Data Wrangling

    Manipulation of data using software tools. Data conversion, filtering, sorting, grouping, cleaning, parsing. Automation. Restricted to students in the MDS program.

    Prerequisite: All of DATA 532, DATA 540, DATA 541.

  23. DATA 543 (1) Data Collection

    Fundamental techniques in the collection of data. Focus will be devoted to understanding the effects of randomization, restrictions on randomization, repeated measures and blocking on the model fitting. Restricted to students in the MDS program.

    Prerequisite: All of DATA 540, DATA 570.

  24. DATA 550 (1) Dataviz I

    Data visualization to produce graphs and images. Advanced data analysis on spreadsheets. Restricted to students in the MDS program.

    Prerequisite: All of DATA 530, DATA 531.

  25. DATA 551 (1) Dataviz II

    Data visualization using business intelligence and data analysis software. Interactive visualization. Production of visualizations for mobile and web. Restricted to students in the MDS program.

    Prerequisite: All of DATA 534, DATA 543, DATA 550.

  26. DATA 552 (1) Communication and Argumentation

    Interpretation of data. Argumentation: hypothesis, claim, evidence and inference. Model limitations: bias, validity, reliability, sensitive analysis. Communication of recommendations to decision-makers. Restricted to students in the MDS program.

  27. DATA 553 (1) Privacy, Security and Professional Ethics

    Data privacy laws and expectations. Freedom of information. Ethics board. Licensing. Data security. Restricted to students in the MDS program.

  28. DATA 570 (1) Predictive Modelling

    Introduction to regression for Data Science. Simple linear regression, multiple linear regression, interactions, mixed variable types, model assessment, simple variable selection, k-nearest-neighbours regression. Restricted to students in the MDS program.

    Prerequisite: DATA 580.

  29. DATA 571 (1) Resampling and Regularization

    Resampling techniques and regularization for linear models. Bootstrap, jackknife, cross-validation, ridge regression, lasso, discussion of tuning parameters. Restricted to students in the MDS program.

    Prerequisite: DATA 570.

  30. DATA 572 (1) Supervised Learning

    Analysis of data with categorical responses. Logistic regression, k-nearest-neighbours classification, discriminant analysis, decision trees and random forests. Restricted to students in the MDS program.

    Prerequisite: DATA 571.

  31. DATA 573 (1) Unsupervised and Semi-supervised Learning

    Analyses for data with unknown responses. Distance measures, hierarchical clustering, k-means, mixture models. Restricted to students in the MDS program.

    Prerequisite: DATA 572.

  32. DATA 580 (1) Modelling and Simulation I

    Pseudorandom number generation, testing and transformation to other discrete and continuous data types. Introduction to Poisson processes and the simulation of data from predictive models, as well as temporal and spatial models. Restricted to students in the MDS program.

  33. DATA 581 (1) Modelling and Simulation II

    Markov chains and their applications, for example, queueing and Markov Chain Monte Carlo. Restricted to students in the MDS program.

    Prerequisite: DATA 580.

  34. DATA 582 (1) Bayesian Inference

    Introduction to Bayesian paradigm and tools for Data Science. Topics include Bayes theorem, prior, likelihood and posterior. A detailed analysis of the cases of binomial, normal samples, normal linear regression models. A significant focus will be on computational aspects of Bayesian problems using software packages. Restricted to students in the MDS program.

    Prerequisite: All of DATA 572, DATA 581.

  35. DATA 583 (1) Advanced Predictive Modelling

    Splines. Smoothing. Generalized linear models. Generalized additive models. An introduction to mixed models. Restricted to students in the MDS program.

    Prerequisite: All of DATA 572, DATA 581.

  36. DATA 585 (1) Optimization

    Modelling using mathematical programming. Fundamental continuous and discrete optimization algorithms. Optimization software for small to medium scale problems. Optimization algorithms for data science. Restricted to students in the MDS program.

    Prerequisite: DATA 580.

  37. DATA 586 (1) Advanced Machine Learning

    Neural networks, backpropagation, deep learning. Restricted to students in the MDS program.

    Prerequisite: DATA 580.

  38. DATA 589 (1) Special Topic

    Advanced or specialized topic in Data Science with applications to specific data sets. Restricted to students in the MDS program.

    Prerequisite: DATA 543.

  39. DATA 599 (6) Capstone

    A capstone design project designed to give students experience in performing data science on a complex multi-disciplinary project. Restricted to students in the MDS program.

    Prerequisite: All of DATA 583, DATA 586.


UBC Crest The official logo of the University of British Columbia. Urgent Message An exclamation mark in a speech bubble. Caret An arrowhead indicating direction. Arrow An arrow indicating direction. Arrow in Circle An arrow indicating direction. Arrow in Circle An arrow indicating direction. Chats Two speech clouds. Facebook The logo for the Facebook social media service. Information The letter 'i' in a circle. Instagram The logo for the Instagram social media service. Linkedin The logo for the LinkedIn social media service. Location Pin A map location pin. Mail An envelope. Menu Three horizontal lines indicating a menu. Minus A minus sign. Pencil A pencil indicating that this is editable. Telephone An antique telephone. Plus A plus symbol indicating more or the ability to add. Search A magnifying glass. Twitter The logo for the Twitter social media service. Youtube The logo for the YouTube video sharing service.