Information extraction

Advanced lecture, 6 ECTS credits, winter semester 2019–20

Basic Information

Type: Advanced lecture
Teacher: Simon Razniewski (lecturer), Cuong Xuan Chu (lab)
Time: Tuesday 10:00-12:00 (lecture), Tuesday 16:00-18:00 (lab)
Place: E1 4 room 024 (lecture), room 021 (lab)
Credits: 6 ECTS credits
Mailing list for discussion and announcements: https://groups.google.com/d/forum/ie1920
Please sign up for the mailing list if you plan to take the course (link now fixed as of October 4)
Course evaluation results

This advanced lecture focuses on how to construct knowledge bases using information extraction techniques. Topics include automated information extraction using patterns, supervised extractors, and open information extraction; infobox crawling; entity disambiguation and normalization; learning over knowledge bases; and the use of knowledge bases in question answering. We will also touch upon crowdsourced KB construction, evaluation measures, and some state-of-the-art knowledge bases. In the labs, participants will implement core components of information extraction step by step, using Wikipedia and wikis from the Wikia fan community site as sources.

Schedule

	Date	Lecture	Lab
1	15.10.	Introduction (pdf)	Dataset familiarization (pdf)
2	22.10.	Knowledge representation (pdf)	Domain modelling (pdf) (sample solution)
3	29.10.	Crawling and Scraping (pdf)	Scraping (pdf)
4	12.11.*	Entity typing (pdf)	Entity typing from Wikipedia first sentence (pdf, files)
5	19.11.	Taxonomy induction, coreference and disambiguation (pdf)	Taxonomy induction (pdf)
6	26.11.	Relation extraction (pdf)	Relation extraction (pdf, files)
7	3.12.	Relation extraction II (pdf)	OpenIE coding (pdf, files)
8	10.12.	Knowledge consolidation (pdf)	Rule mining (pdf, file)
9	17.12.	Applications (pdf)	Exam preparation
	(7.1.2020)	(Backup slot)
	14.+15.1.2020	Oral exam (E1 4 room 433, schedule)
	24.3.2020	Reexam (online, schedule)

* Attention: No lecture/lab on 5.11.

Rules and Grading

Assignments

There will be 8 weekly assignments
Each assignment submission receives a binary pass/fail score
To be admitted to take the final exam, at least 6 assignments have to be passed.
Weekly timeline:
- Assignments are posted on Tuesday morning
- The lab on Tuesday afternoon is intended for getting started on the assignments
- Assignments are due Saturday in the same week, at 23:59
- Assessments are available the following Tuesday morning
Assignment results (link)

Exam

Oral exam
Covering the topics of lecture and assignments
E1 4 room 433
Exam schedule
Reexam schedule