Data Mining and Deep Learning DAT550

The purpose of this course is for students to gain knowledge and practical experience of data mining and deep learning techniques. The course gives students in-depth knowledge of the relevant technologies and prepares them to pre-process large-scale data for data mining, perform feature extraction and dimensionality reduction, and apply a range of data mining and deep learning methods to classification, regression and clustering tasks in order to extract actionable knowledge. Students will learn state-of-the-art data mining and deep learning algorithms and tools, and will get hands-on experience applying these tools to real data through lab assignments and a project.


Course description for study year 2021-2022. Please note that changes may occur.

Facts
Course code

DAT550

Version

1

Credits (ECTS)

10

Semester tuition start

Spring

Number of semesters

1

Exam semester

Spring

Language of instruction

English

Offered by

Faculty of Science and Technology, Department of Electrical Engineering and Computer Science

Learning outcome

Knowledge:

       Candidate:

  • has advanced knowledge of the field and specialized insight into the theory and practice of data preparation, selection and mining.
  • has in-depth knowledge of the scientific theory of the subject area and of methods for gaining insight from large data collections.
  • can apply knowledge in new areas of data mining and deep learning.
  • can analyze knowledge extraction issues on the basis of data mining and deep learning.

Skills:

Candidate can:

  • analyze and critically relate to various sources of information and use these to structure and formulate professional reasoning for various data mining tasks.
  • analyze existing theories, methods and interpretations within the area of data mining and deep learning and work independently on solving data mining and deep learning problems.
  • use relevant data mining methods such as clustering, classification, graph mining, stream mining, frequent pattern mining, association rule mining and deep learning for research and professional application development.
  • carry out an independent, limited research or development project under supervision and in accordance with current research ethics standards, which involves preparing data mining pipelines and evaluating and tuning the parameters of various data mining and deep learning models using state-of-the-art tools.

General competencies:

       Candidate:

  • can analyze relevant professional, research and ethical issues in data mining and deep learning
  • can apply their knowledge and skills in new areas to carry out advanced tasks and projects
  • can provide extensive independent work on data mining and deep learning issues
  • can communicate about data mining and deep learning issues, analyses and conclusions within the subject area, both with specialists and with the general public.
  • can contribute to research and innovation in data mining and deep learning.
  • can identify the theoretical and practical issues behind various data mining and deep learning techniques, can list and describe the strengths, limitations and trade-offs among them, and can choose appropriate techniques for solving data science problems in various applications.
Content
  • Data cleansing, transformation and preparation
  • Dimensionality reduction, SVD, PCA
  • Recommendation systems
  • Graph mining
  • Classification
  • Neural Networks and Deep learning
  • Clustering
  • Mining data streams
Required prerequisite knowledge
None
Recommended prerequisites
DAT110 Introduction to Programming, DAT240 Advanced programming, DAT540 Introduction to data science, STA500 Probability and Statistics 2
Python programming and Jupyter notebooks. Statistics and Probability.
Exam

Written exam and project report

Form of assessment | Weight | Duration | Marks | Aid
Written exam | 3/5 | 4 hours | A - F | 1)
Project report | 2/5 | - | A - F | -

1) Textbooks and Lecture notes
The final grade is based on project work and presentation (2/5) and a written exam (3/5). The project consists of one large assignment and is to be performed in a group. The grade for the project will be based on the submitted program code, the project report and a group oral examination on the submitted code and report. A student who fails the project has to retake this part the next time the course is taught. The written exam will be digital (Inspera). Both exam units must be passed in order to receive a final grade in the course.

Coursework requirements
Mandatory assignments

Three mandatory ungraded (Pass/Fail) exercises/programming assignments

All programming exercises must be passed in order to sit the written exam and to have the project approved. Mandatory lab assignments must be completed on time. Absence due to illness or other reasons must be communicated to the laboratory personnel as soon as possible. Students cannot expect provisions to be made for completing lab assignments at other times unless this has been agreed with the laboratory personnel in advance. Failure to complete the assigned labs on time, or to have them approved, will result in the student being barred from taking the exam.

Course teacher(s)
Course coordinator: Vinay Jayarama Setty
Course teacher: Mina Farmanbar
Head of Department: Tom Ryen
Method of work
4 hours of lectures/exercises and 2 hours of guided programming exercises and project work. The programming exercises require additional unguided work.
Open for
Admission to Single Courses at the Faculty of Science and Technology
Computer Science - Master's Degree Programme
Course assessment
Form and/or discussion.
Overlapping courses
Course | Reduction (SP)
Web Search and Data Mining (DAT630) | 5
Literature
Search for literature in Leganto