TRACER TUTORIAL: TEXT REUSE DETECTION INTRODUCTION TO HISTORICAL TEXT REUSE DETECTION
Marco Büchler, Emily Franzini and Greta Franzini
TABLE OF CONTENTS
1. Who am I?
2. What is text reuse?
3. Aspects of text reuse
4. ACID for the Digital
company;
analysis;
Question: Why is text reuse so relevant for the Humanities and Computer Science?
Premise: The amount of digitally available data is growing exponentially (Big Data).
conditions.
ACID for the Digital Humanities:
How to be accepted by humanists if text mining is a black box we can’t look into?
Transparency: How to provide user-friendly insights into complex mining techniques and machine learning?
Ulrike Rieß (Big Data bestimmt die IT-Welt, "Big Data dominates the IT world"):
manually;
warehouse systems;
Information overload = large amounts of data (Big Data).
Information poverty = noisy, missing, fragmentary, oral data (Humanities Data).
COMPLEXITY
Question: The distribution of Reuse Types and Reuse Styles is often unknown: which model(s) should be chosen?
Team: Marco Büchler, Greta Franzini and Emily Franzini.
Visit us: http://www.etrap.eu
Contact: contact@etrap.eu
The theme this presentation is based on is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Changes to the theme are the work of eTRAP.