Can we rely on Wikitext to get the links on a Wikipedia page? - - PowerPoint PPT Presentation

can we rely on wikitext to get the links on a wikipedia
SMART_READER_LITE
LIVE PREVIEW

Can we rely on Wikitext to get the links on a Wikipedia page? - - PowerPoint PPT Presentation

WikiHist.html : English Wikipedias Full Revision History in HTML Format Blagoj Mitrevski, Tiziano Piccardi, Robert West Can we rely on Wikitext to get the links on a Wikipedia page? '''Niue''' ({{lang-niu|Niue}}) is an [[island country]].


slide-1
SLIDE 1

WikiHist.html: English Wikipedia’s Full Revision History in HTML Format

Blagoj Mitrevski, Tiziano Piccardi, Robert West

Can we rely on Wikitext to get the links on a Wikipedia page?

slide-2
SLIDE 2

Wikitext Full history HTML Full history

'''Niue''' ({{lang-niu|Niu¯e}}) is an [[island country]].

<b>Niue</b> (<a href="/wiki/Niuean_language" title="Niuean language">Niuean</a>: <i lang="niu">Niu¯e</i>) is an <a href="/wiki/Island_country" title="Island country">island country</a>.

Multiple Docker instances Page rendered by using the correct template revision

Problem Implement.

Modules not available in the dump (LUA)

slide-3
SLIDE 3

Full dataset (7TB) available at:

https://zenodo.org/record/3605388

Clickstream shows transitions that are not possible on the Wikitext links network In average in 2019 a page contains 3 times the number of links visible in Wikitext

What did we discover?