-
Presentation
Presentation
This module aims to introduce the foundational techniques and methodologies for analysing data from the interdisciplinary perspective of a Data Scientist. At the beginning of this module, students learn about the diverse nature of data and the symbolic power of different data structures. This foundational understanding naturally leads to a second stage in which students learn how to interrogate and extract information from data and justify their choices. During this stage, students learn about statistical inference, hypothesis testing, Frequentist vs Bayesian approaches to data, correlation, and causation. Finally, in the third part of the module, students learn the basics of machine learning through the theory and practice of regression, classification, and dimensionality reduction methods.
-
Class from course
Class from course
-
Degree | Semesters | ECTS
Degree | Semesters | ECTS
Bachelor | Semestral | 5
-
Year | Nature | Language
Year | Nature | Language
3 | Mandatory | Português
-
Code
Code
ULHT2531-22513
-
Prerequisites and corequisites
Prerequisites and corequisites
Not applicable
-
Professional Internship
Professional Internship
Não
-
Syllabus
Syllabus
Python programming for Data Science Introduction to information visualisation Linear Algebra Basics of probability theory Statistical inference and hypothesis testing Introduction to machine learning Logistic regression Dimensionality reduction: MDS and PCA Dimensionality reduction: Non Negative Matrix Factorisation Current topics in Data Science
-
Objectives
Objectives
Learn the main Python programming techniques and methods used by Data Scientists through practice Get a solid introduction to information visualisation theory, approaches and implementation techniques Understand the nature of data from a Data Science perspective Perform exploratory data analyses with implementation in Python Perform statistical inference through hypothesis testing Learn the most common mistakes and fallacies in statistics and how to avoid them Understand how a statistician or decision scientist thinks about solving problems with data Understand what is Machine Learning, Supervised vs. Unsupervised methods Understand and implement regression and classification methods in Data Science Learn the basics of information retrieval metrics for evaluating the performance of a classifier Understand and implement various methods of dimensionality reduction with various applications
-
Teaching methodologies and assessment
Teaching methodologies and assessment
Use of interactive class game dynamics through platforms like Kahoot.
-
References
References
Grus, J. (2019). Data science from scratch: first principles with python. O'Reilly Media.
-
Office Hours
Office Hours
-
Mobility
Mobility
No