Knowledge and Data Integration
Welcome to the homepage of the fall 2021 edition of Knowledge and Data Integration, a course of the Data Science degree at the University of Trento.

 

 

News


Lecture cancelled Monday 20/09

September 16th, 2021

 

 

Register on the Moodle page (Knowledge and Data Integration 2021/22 [145919] - GIUNCHIGLIA) to receive the lecturers' communications.

September 13th, 2021

 

 

First lecture today September 13th, 17:30 at Room A215

September 13th, 2021

 

 

This class will start on Monday, Sept 13th. More details are in the Calendar and Material section.

August 16th, 2021

 

 

 

 

Last modification: September 16th, 2021

Instructions


The 2021 edition of KDI is taught in person, with both lecturers and students present in class. As in previous years, attendance is not a formal requirement but is strongly recommended, given that this is a hands-on lab course. Passing the exam amounts to developing a project, which will ultimately lead to the generation of a Knowledge Graph (and supporting documentation) starting from data that will have to be found on the Web. This goal will be reached under the continuous supervision of the lecturers, who will provide advice and support, and in collaboration with a colleague taking this course. There are no easy or cost-effective ways to achieve this goal without a continuous presence in class. Given the current Covid situation, if someone taking the course cannot be in class for a certain lecture, that lecture will be recorded and made available to him/her. The request for a recording should be made as early as possible before the beginning of the lecture (ideally 2-3 days before) and should be supported by a valid justification. Notice, however, that these recorded lectures are unlikely to have the same quality as the in-person lecture, in particular for those classes which consist of one-to-one interactions between the students and the lecturers. In most cases, an additional interaction with the lecturers during the Q&A lectures will give a better result (see below).

The lectures will take place following the schedule indicated in the Calendar and Material section. The course material includes slides, demo videos, support resources and links, all provided on the web site under the Calendar and Material section. For those interested, it is possible to consult the recordings of the lectures of the A.Y. 2020/2021 (here). This might occasionally be useful, but with two points of attention: (1) while very similar in spirit to last year's, the course this year presents some substantial differences, all exploiting the fact that the lectures are held in person, and (2) in most cases we suggest you ask the lecturers for feedback or suggestions. In particular, to help students, after the end of each phase of the methodology taught in the KDI course there will be a Q&A lecture in which the students can ask questions about all their open problems and doubts.

At the end of the course, students will be asked to fill in an online questionnaire about the overall process and methodology they have learned. This feedback is very important to us, as it is the basis for the continuous evolution and improvement of the methodology being taught. To this end, students are strongly encouraged to raise doubts, ask questions, and discuss any concerns they have about the methodology itself during the Q&A lectures.

Syllabus


Course Objectives and Outcomes

The Knowledge and Data Integration course aims to provide motivations, definitions, theorems and techniques for a concrete and effective understanding of what is meant, in the context of computer science, by knowledge and data integration. It also provides techniques for analyzing and modelling knowledge and data, as well as techniques for data and knowledge integration. Finally, it aims to stimulate students to continue their careers with a stronger interest in data and knowledge representation in their own field of expertise, and to produce computer-processable solutions to relevant problems.

 

General Description

This course will cover the following topics:
  • the main issues which can be addressed when data and knowledge resources have to be integrated.
  • a general methodology (iTelos) for knowledge and data analysis, modeling and integration.
  • an analysis of the state-of-the-art tools and methodologies for data analysis, modeling and integration.
  • an introduction to ontologies, Extended ER models and linguistic resources.
This is a hands-on, lab- and experiment-based course. Students will be given a data analysis/modelling/integration problem that they will have to solve, possibly while taking the class. During the experiment, students will have to apply the notions introduced in class to the problem. The students will be split into teams, and each team will solve an integration problem by adopting the methodology taught during the lectures.

Teachers


Fausto Giunchiglia
Simone Bocca
Mayukh Bagchi
Daqian Shi
Alessio Zamboni
fausto.giunchiglia@unitn.it
simone.bocca@unitn.it
mayukh.bagchi@studenti.unitn.it
daqian.shi@unitn.it
alessio.zamboni@unitn.it

Calendar and Material


The course runs from Sep 13, 2021 to Dec 13, 2021 with the following schedule:

     

  • Monday, 17:30-19:00, Room A209
  • Wednesday, 9:30-11:00, Room A224

 

You might want to read the Instructions to understand how to take the course.

 

Note also that the titles and structure of the lessons yet to be delivered might change slightly. The rule of thumb is: if there are links to materials, things won’t change; if there are no links to materials, titles and content are just suggestions.

 

| Lesson Number | Date | Time | Material | Content of Material | Lecturer(s) | External resources | Phase documentation deadline |
|---|---|---|---|---|---|---|---|
| 0 | Mon 13 Sep, 2021 | 17:30 | Slides | Introduction & Representation Diversity. | F. Giunchiglia, S. Bocca, M. Bagchi, D. Shi | | |
| 1 | Wed 15 Sep, 2021 | 9:30 | Slides | Solving Representation Diversity. | F. Giunchiglia, S. Bocca | | |
| 2 | Wed 22 Sep, 2021 | 9:30 | | Solving Representation Diversity | F. Giunchiglia, S. Bocca | | |
| 3 | Mon 27 Sep, 2021 | 17:30 | Slides | iTelos - methodology principles | F. Giunchiglia, S. Bocca | | |
| 4 | Wed 29 Sep, 2021 | 9:30 | Slides | iTelos - methodology structure | F. Giunchiglia, S. Bocca | | |
| 5 | Mon 4 Oct, 2021 | 17:30 | Project Organization, Project Proposals | KDI Projects - organization & developments | F. Giunchiglia, S. Bocca | Project Example | |
| 6 | Wed 6 Oct, 2021 | 9:30 | Slides | Metadata | F. Giunchiglia, S. Bocca | LiveSchema catalog, OpenDatTrentino catalog, SHAPEness | |
| 7 | Mon 11 Oct, 2021 | 17:30 | Slides | Inception phase - theory | F. Giunchiglia, S. Bocca | Resources | |
| 8 | Wed 13 Oct, 2021 | 9:30 | Slides | Inception phase - practice | F. Giunchiglia, S. Bocca | InceptionSheet, ProjectReportTemplate | |
| 9 | Mon 18 Oct, 2021 | 17:30 | Video, Slides, KOS | Data management Libs.; Project support tool (KOS); Q&A | F. Giunchiglia, S. Bocca | | |
| DELETED | Wed 20 Oct, 2021 | 9:30 | | Inception phase - Q&A | F. Giunchiglia, S. Bocca | | |
| 11 | Mon 25 Oct, 2021 | 17:30 | Slide | Informal Modeling phase - Teleologies | F. Giunchiglia, S. Bocca | | |
| 12 | Wed 27 Oct, 2021 | 9:30 | Slides-1, Slides-2 | Informal Modeling phase - ETG model building | F. Giunchiglia, S. Bocca | ModelingSheet | |
| 13 | Wed 3 Nov, 2021 | 9:30 | Slides-1, Slides-2 | Inception & Informal Modeling phases - Evaluation | F. Giunchiglia, S. Bocca | | Project report - Inception |
| 14 | Mon 8 Nov, 2021 | 17:30 | | Informal Modeling phase - Q&A | F. Giunchiglia, S. Bocca | | |
| 15 | Wed 10 Nov, 2021 | 9:30 | Slides-1, Slides-2, Slides-3 | Formal Modeling phases - theory | F. Giunchiglia, S. Bocca | | |
| 16 | Mon 15 Nov, 2021 | 17:30 | Slides | Formal Modeling phases - data management | F. Giunchiglia, S. Bocca | | |
| 17 | Wed 17 Nov, 2021 | 9:30 | Slides, Protege | Formal Modeling & Data Integration phases - evaluation | F. Giunchiglia, S. Bocca | Protege-guidelines, Base-schema-structure | |
| 18 | Mon 22 Nov, 2021 | 17:30 | | Formal Modeling phases - practice + Q&A | F. Giunchiglia, S. Bocca | | Project report - Informal modeling |
| 19 | Wed 24 Nov, 2021 | 9:30 | | Formal Modeling phase - Q&A | F. Giunchiglia, S. Bocca | | |
| 20 | Mon 29 Nov, 2021 | 17:30 | Slides | Data Integration phases - theory | F. Giunchiglia, S. Bocca | | |
| 21 | Wed 1 Dec, 2021 | 9:30 | Karmalinker | Data Integration phases - practice | F. Giunchiglia, S. Bocca | | |
| 22 | Mon 6 Dec, 2021 | 17:30 | | Data Integration phases - Q&A | F. Giunchiglia, S. Bocca | | Project report - Formal modeling |
| 23 | Mon 13 Dec, 2021 | 17:30 | KG exploitation | KG Exploitation (demo preparation) | F. Giunchiglia, S. Bocca | GraphDB, SPARQL-book, SPARQL-W3C | |
| 24 | Wed 15 Dec, 2021 | 9:30 | | General Q&A | F. Giunchiglia, S. Bocca | | |

Exam


The exam will consist of two parts. The first, and most important, will be a presentation of the work developed, given in front of the lecturers and the other students. This will consist of a slide presentation plus a demo. Details about how this will have to be done will be provided in class. The second and last part will consist of a written exam where students will be asked to describe and discuss the work done by the colleague in their group. In fact, the project will be done, with some exceptions, by groups of two, where each person will be in charge of a different task (one in charge of developing the schema of the Knowledge Graph, the other of developing the data populating it). The goal of the written exam is to make sure that each student has a full understanding of the work done by the partner. This should come for free for groups that work together and continuously discuss the main issues which arise, while each member remains in charge of his/her own part.

Collaboration Opportunities


Multiple positions are available as 150h activities and internships. They should be considered as the first part of a research project and thesis with the Knowdive group. The general activities of the group are listed on the website (http://knowdive.disi.unitn.it/), while activities already scheduled and available now can be found at http://knowdive.disi.unitn.it/work-with-us/. The 150h activities have variable length and are strictly related to software development: for this reason, knowledge of software development with at least one programming language is a must. All the activities can also be carried out remotely.

 

Anyone interested in these opportunities can send an email to knowdive-positions@disi.unitn.it, indicating any preferences in terms of topics or activities (if known). For 150h activities it is important to list the programming languages known, each with the corresponding level: a value in the range [1 - 5], where 1 = basic knowledge and 5 = advanced knowledge.

 

Applications to the “150 ore” program can be submitted at the link:
https://www.unitn.it/servizi/224/collaborazioni-studenti-150-ore
Notice that the deadline for applications for the A.Y. 2021-2022 is September 30, 2021.