Data Science, Faculty of Science

DATA_O: Data Science


  1. DATA_O 101 (3) Making Predictions with Data

    Introduction to the techniques and software for handling real-world data. Topics include data cleaning, visualization, simulation, basic modelling, and prediction making. [3-1-0]

  2. DATA_O 301 (3) Introduction to Data Analytics

    Techniques for computation, analysis, and visualization of data using software. Manipulation of small and large data sets. Databases. Automation using scripting. Real-world applications from life sciences, physical sciences, economics, engineering, or psychology. No prior computing background is required. Cannot be used for credits toward a major in Computer Science, Data Science, Mathematics, or Statistics. Credit will be granted for only one of COSC 301, DATA 301 or DATA 501. [3-2-0] Prerequisite: Third-year standing.

  3. DATA_O 310 (3) Applied Regression Analysis

    Theory and application of simple and multiple linear regression models, estimation, inference (confidence intervals, prediction intervals and hypothesis testing), polynomial regression, ANOVA and ANCOVA, variable selection, model adequacy and residual diagnostics. [3-0-0] Prerequisite: MATH 221 and one of STAT 205, STAT 230.

  4. DATA_O 311 (3) Machine Learning

    Regression, classification, resampling, model selection and validation, fundamental properties of matrices, dimension reduction, tree-based methods, unsupervised learning. [3-2-0] Prerequisite: Either (a) one of STAT 205, STAT 230 or (b) a score more than 75% in one of APSC 254, BIOL 202, PSYO 373; and one of COSC 111, APSC 177.

  5. DATA_O 315 (3) Applied Time Series and Forecasting

    Trends, stationary and nonstationary time series models, forecasting, seasonal models. [3-0-0] Prerequisite: One of STAT 205, STAT 230.

  6. DATA_O 405 (3) Stochastic Modelling and Simulation

    Pseudorandom number generation and testing. Simulation and modelling of univariate and multivariate data; stochastic models, including Poisson processes and Markov chains; MCMC simulation, hidden Markov models, and queuing systems. Credit will be granted for only one of COSC 405, DATA 405, COSC 505, or DATA 505. [3-2-0] Prerequisite: One of STAT 205, STAT 230 [with 60% or above].

  7. DATA_O 407 (3) Sampling and Design

    Planning/practice of data collection. Pros/cons of both observational and experimental data. Survey samples: random sampling; bias and variance; unequal probability sampling; systematic, multistage, and stratified sampling; ratio and regression estimators. Experimental design: simple one-way comparisons; designs with randomization restrictions including blocking, split-plots, nested and repeated measures designs. Credit will be granted for only one of DATA 407 or STAT 507. [3-1-0] Prerequisite: One of STAT 205, STAT 230, PSYO 372, BIOL 202.

  8. DATA_O 410 (3) Regression and Generalized Linear Models

    Regression, linear models, generalized linear models, additive models, generalized additive models, mixed models, theory and numerical performance. Credit will be granted for only one of DATA 410 or STAT 538. [3-0-0] Prerequisite: DATA 310.

  9. DATA_O 419 (3-9) Topics in Data Science

    Advanced or specialized topics in data science. Consult the department for the specific topic to be offered in any given year. This course may be taken more than once for credit with different topics. [3-0-0] Prerequisite: Fourth-year standing.

  10. DATA_O 448 (3-6) Directed Studies in Data Science

    Investigation of a specific topic as agreed upon by the student and the faculty supervisor. Completion of a project and an oral presentation are required. Prerequisite: Third-year standing in the Data Science major or Honours, and permission of the department head.

  11. DATA_O 449 (6) Honours Thesis

    Students will undertake a research project as agreed upon by the student, supervising faculty member, and unit head. A written thesis and a public presentation (poster or seminar) are required. Restricted to students in the B.Sc. Data Science Honours Program. Prerequisite: Fourth-year standing and permission of the department head.

  12. DATA_O 500 (3) Communication and Consulting in Data Science

    Effective consulting practices, ethical considerations, methodology selection, data preparation, effective software development. Credit will be granted for only one of DATA 500 or STAT 400 when the subject matter is of the same nature.

  13. DATA_O 501 (3) Data Analytics

    Techniques for computation, analysis, and visualization of data using software. Manipulation of small and large data sets. Automation using scripting. Real-world applications from life sciences, physical sciences, engineering, or psychology. Credit will be granted for only one of COSC 301, DATA 301 or DATA 501.

  14. DATA_O 505 (3) Modelling and Simulation

    Simulation methodology: data collection, model design, output analysis, optimization, validation. Credit will be granted for only one of COSC 405, DATA 405, COSC 505, or DATA 505.

  15. DATA_O 530 (1) Computing Platforms for Data Science

    Introduction to software and tools for Data Science. Setup process. Restricted to students in the MDS program.

  16. DATA_O 531 (1) Programming for Data Science

    Programming including decisions, loops, functions, and using data structures and libraries. Restricted to students in the MDS program.

  17. DATA_O 532 (1) Algorithms and Data Structure

    Data structures including lists, queues, stacks, hash tables, trees and graphs. Recursion. Searching and sorting. Asymptotic complexity. Restricted to students in the MDS program.

  18. DATA_O 533 (1) Collaborative Software Development

    Software life cycle. Licensing. Packaging. Testing and quality control. Version control. Collaborative environments. Restricted to students in the MDS program. Prerequisite: DATA 532.

  19. DATA_O 534 (1) Web and Cloud Computing

    Parallel and cloud computing architectures and program deployment. Restricted to students in the MDS program.

  20. DATA_O 540 (1) Databases and Data Retrieval

    Using and querying relational and NoSQL databases for analysis. Experience with SQL, JSON, and programs that use databases. Restricted to students in the MDS program. Prerequisite: DATA 531.

  21. DATA_O 541 (1) Scripting and Reporting

    Scripting engines for data science. Reporting tools. Automation. Restricted to students in the MDS program.

  22. DATA_O 542 (1) Data Wrangling

    Manipulation of data using software tools. Data conversion, filtering, sorting, grouping, cleaning, parsing. Automation. Restricted to students in the MDS program. Prerequisite: All of DATA 532, DATA 540, DATA 541.

  23. DATA_O 543 (1) Data Collection

    Fundamental techniques in the collection of data. Focus will be devoted to understanding the effects of randomization, restrictions on randomization, repeated measures and blocking on the model fitting. Restricted to students in the MDS program. Prerequisite: All of DATA 540, DATA 570.

  24. DATA_O 550 (1) Dataviz I

    Data visualization to produce graphs and images. Advanced data analysis on spreadsheets. Restricted to students in the MDS program. Prerequisite: All of DATA 530, DATA 531.

  25. DATA_O 551 (1) Dataviz II

    Data visualization using business intelligence and data analysis software. Interactive visualization. Production of visualizations for mobile and web. Restricted to students in the MDS program. Prerequisite: All of DATA 534, DATA 543, DATA 550.

  26. DATA_O 552 (1) Communication and Argumentation

    Interpretation of data. Argumentation: hypothesis, claim, evidence and inference. Model limitations: bias, validity, reliability, sensitive analysis. Communication of recommendations to decision-makers. Restricted to students in the MDS program.

  27. DATA_O 553 (1) Privacy, Security and Professional Ethics

    Data privacy laws and expectations. Freedom of information. Ethics board. Licensing. Data security. Restricted to students in the MDS program.

  28. DATA_O 570 (1) Predictive Modelling

    Introduction to regression for Data Science. Simple linear regression, multiple linear regression, interactions, mixed variable types, model assessment, simple variable selection, k-nearest-neighbours regression. Restricted to students in the MDS program. Prerequisite: DATA 580.

  29. DATA_O 571 (1) Resampling and Regularization

    Resampling techniques and regularization for linear models. Bootstrap, jackknife, cross-validation, ridge regression, lasso, discussion of tuning parameters. Restricted to students in the MDS program. Prerequisite: DATA 570.

  30. DATA_O 572 (1) Supervised Learning

    Analysis of data with categorical responses. Logistic regression, k-nearest-neighbours classification, discriminant analysis, decision trees and random forests. Restricted to students in the MDS program. Prerequisite: DATA 571.

  31. DATA_O 573 (1) Unsupervised and Semi-supervised Learning

    Analyses for data with unknown responses. Distance measures, hierarchical clustering, k-means, mixture models. Restricted to students in the MDS program. Prerequisite: DATA 572.

  32. DATA_O 580 (1) Modelling and Simulation I

    Pseudorandom number generation, testing and transformation to other discrete and continuous data types. Introduction to Poisson processes and the simulation of data from predictive models, as well as temporal and spatial models. Restricted to students in the MDS program.

  33. DATA_O 581 (1) Modelling and Simulation II

    Markov chains and their applications, for example, queueing and Markov Chain Monte Carlo. Restricted to students in the MDS program. Prerequisite: DATA 580.

  34. DATA_O 582 (1) Bayesian Inference

    Introduction to Bayesian paradigm and tools for Data Science. Topics include Bayes theorem, prior, likelihood and posterior. A detailed analysis of the cases of binomial, normal samples, normal linear regression models. A significant focus will be on computational aspects of Bayesian problems using software packages. Restricted to students in the MDS program. Prerequisite: All of DATA 572, DATA 581.

  35. DATA_O 583 (1) Advanced Predictive Modelling

    Splines. Smoothing. Generalized linear models. Generalized additive models. An introduction to mixed models. Restricted to students in the MDS program. Prerequisite: All of DATA 572, DATA 581.

  36. DATA_O 585 (1) Optimization

    Modelling using mathematical programming. Fundamental continuous and discrete optimization algorithms. Optimization software for small to medium scale problems. Optimization algorithms for data science. Restricted to students in the MDS program. Prerequisite: DATA 580.

  37. DATA_O 586 (1) Advanced Machine Learning

    Neural networks, backpropagation, deep learning. Restricted to students in the MDS program. Prerequisite: DATA 580.

  38. DATA_O 589 (1) Special Topic

    Advanced or specialized topic in Data Science with applications to specific data sets. Restricted to students in the MDS program. Prerequisite: DATA 543.

  39. DATA_O 599 (6) Capstone

    A capstone design project designed to give students experience in performing data science on a complex multi-disciplinary project. Restricted to students in the MDS program. Prerequisite: All of DATA 583, DATA 586.


UBC Crest The official logo of the University of British Columbia. Urgent Message An exclamation mark in a speech bubble. Caret An arrowhead indicating direction. Arrow An arrow indicating direction. Arrow in Circle An arrow indicating direction. Arrow in Circle An arrow indicating direction. Chats Two speech clouds. Facebook The logo for the Facebook social media service. Information The letter 'i' in a circle. Instagram The logo for the Instagram social media service. Linkedin The logo for the LinkedIn social media service. Location Pin A map location pin. Mail An envelope. Menu Three horizontal lines indicating a menu. Minus A minus sign. Pencil A pencil indicating that this is editable. Telephone An antique telephone. Plus A plus symbol indicating more or the ability to add. Search A magnifying glass. Twitter The logo for the Twitter social media service. Youtube The logo for the YouTube video sharing service.