filmeu

Class Data Science

  • Presentation

    Presentation

    This module aims to introduce the foundational techniques and methodologies for analysing data from the interdisciplinary perspective of a Data Scientist. At the beginning of this module, students learn about the diverse nature of data and the symbolic power of different data structures. This foundational understanding naturally leads to a second stage in which students learn how to interrogate and extract information from data and justify their choices. During this stage, students learn about statistical inference, hypothesis testing, Frequentist vs Bayesian approaches to data, correlation, and causation. Finally, in the third part of the module, students learn the basics of machine learning through the theory and practice of regression, classification, and dimensionality reduction methods.
  • Code

    Code

    ULHT2531-22513
  • Syllabus

    Syllabus

    Python programming for Data Science Introduction to information visualisation Linear Algebra Basics of probability theory Statistical inference and hypothesis testing Introduction to machine learning Logistic regression Dimensionality reduction: MDS and PCA Dimensionality reduction: Non Negative Matrix Factorisation Current topics in Data Science
  • Objectives

    Objectives

    Learn the main Python programming techniques and methods used by Data Scientists through practice Get a solid introduction to information visualisation theory, approaches and implementation techniques Understand the nature of data from a Data Science perspective Perform exploratory data analyses with implementation in Python Perform statistical inference through hypothesis testing Learn the most common mistakes and fallacies in statistics and how to avoid them Understand how a statistician or decision scientist thinks about solving problems with data Understand what is Machine Learning, Supervised vs. Unsupervised methods Understand and implement regression and classification methods in Data Science Learn the basics of information retrieval metrics for evaluating the performance of a classifier Understand and implement various methods of dimensionality reduction with various applications
  • References

    References

      Grus, J. (2019). Data science from scratch: first principles with python. O'Reilly Media.  
SINGLE REGISTRATION
Lisboa 2020 Portugal 2020 Small financiado eu 2024 prr 2024 republica portuguesa 2024 Logo UE Financed Provedor do Estudante Livro de reclamaões Elogios