SOLR & XAPIAN IN INVENIO


SLIDE 1

SOLR & XAPIAN IN INVENIO

Patrick Glauner¹, Jan Iwaszkiewicz¹, Jean-Yves Le Meur¹, Tibor Simko¹, Nikos Kasioumis²

1/Author - 2/Speaker

Open Repositories 2013

Repository Island, Charlottetown, PEI, Canada

SLIDE 2
SLIDE 3

DOCUMENTS COMMUNITY USE CASES MODULAR MATURE CERN OSS INVENIO-SOFTWARE.ORG

SLIDE 4
SLIDE 5
SLIDE 6
SLIDE 7
SLIDE 8
SLIDE 9
SLIDE 10
SLIDE 11

ARCHITECTURE

SLIDE 12
SLIDE 13
SLIDE 14
SLIDE 15
SLIDE 16
SLIDE 17
SLIDE 18
SLIDE 19
SLIDE 20

INGESTION

SLIDE 21
SLIDE 22
SLIDE 23
SLIDE 24
SLIDE 25
SLIDE 26
SLIDE 27
SLIDE 28

SEARCHING

SLIDE 29
SLIDE 30
SLIDE 31
SLIDE 32
SLIDE 33
SLIDE 34
SLIDE 35
SLIDE 36

FAST SEARCHING VS. SLOW INDEXING
CUSTOM RANKING METHODS
FULLTEXT INDEXING LIMITS
ADVANCED CAPABILITIES

SLIDE 37
SLIDE 38

SOLR: JAVA, HTTP / XML, VECTOR SPACE, TF-IDF
XAPIAN: C++, LOCAL, PROBABILISTIC, BM25
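The two ranking schemes named on this slide, TF-IDF and BM25, differ mainly in how they treat repeated terms and document length. The following is a minimal Python sketch of that difference, not the formula used by either engine or by Invenio: the toy corpus, the function names, the IDF variant, and the parameter defaults k1=1.2 and b=0.75 are all assumptions chosen for illustration.

```python
import math
from collections import Counter

# Toy corpus (illustrative only, not data from the talk).
DOCS = [
    "higgs boson search at the lhc".split(),
    "boson boson boson".split(),
    "open repositories and digital libraries".split(),
]


def idf(term, docs):
    """Smoothed inverse document frequency (one common variant)."""
    n = sum(1 for d in docs if term in d)
    return math.log(1 + (len(docs) - n + 0.5) / (n + 0.5))


def tfidf_score(query, doc, docs):
    """Simplified TF-IDF: raw term frequency times IDF, summed over query terms."""
    tf = Counter(doc)
    return sum(tf[t] * idf(t, docs) for t in query)


def bm25_score(query, doc, docs, k1=1.2, b=0.75):
    """BM25: term frequency saturates and is normalized by document length."""
    tf = Counter(doc)
    avgdl = sum(len(d) for d in docs) / len(docs)
    score = 0.0
    for t in query:
        f = tf[t]
        norm = f + k1 * (1 - b + b * len(doc) / avgdl)
        score += idf(t, docs) * f * (k1 + 1) / norm
    return score


if __name__ == "__main__":
    query = ["boson"]
    for i, doc in enumerate(DOCS):
        print(i, tfidf_score(query, doc, DOCS), bm25_score(query, doc, DOCS))
```

In this sketch the TF-IDF score grows linearly with the number of occurrences of a term, while the BM25 contribution of a single term can never exceed (k1 + 1) times its IDF, so a document that merely repeats a query term gains less under BM25.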

SLIDE 39

INGESTION

SLIDE 40
SLIDE 41
SLIDE 42
SLIDE 43
SLIDE 44

SEARCHING

SLIDE 45
SLIDE 46
SLIDE 47
SLIDE 48
SLIDE 49

SCALABILITY

SLIDE 50
SLIDE 51
SLIDE 52

COMPARISON

FULLTEXT:"HIGGS BOSON"
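As a rough illustration of how a phrase query like this one might be sent to a search server over HTTP, the sketch below only builds a request URL and does not contact any server. The base URL, the helper name build_solr_query_url, and the returned field list "id,score" are assumptions for illustration; q, rows, fl and wt are standard Solr query parameters, but the talk does not show the request Invenio actually issues.

```python
from urllib.parse import urlencode


def build_solr_query_url(base_url, query, rows=10):
    """Build a Solr select URL for a query string (hypothetical helper)."""
    params = {
        "q": query,        # query in Solr field:"phrase" syntax
        "rows": rows,      # number of hits to return
        "fl": "id,score",  # assumed field list: document id and relevance score
        "wt": "xml",       # response format
    }
    return f"{base_url}/select?{urlencode(params)}"


# Assumed local Solr instance; adjust to the actual deployment.
url = build_solr_query_url("http://localhost:8983/solr", 'fulltext:"higgs boson"')
print(url)
```

Raising rows in this sketch corresponds to asking for more ranked hits in a single request.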

SLIDE 53

RESULTS     16,903    15,945
SEARCH        0.06      0.56
RANK 10       0.06      0.40
RANK 100      0.12      0.42
RANK 1K       0.22      0.45
RANK 10K      1.46      0.70

SLIDE 54
SLIDE 55
SLIDE 56

CONCLUSION

OPTIMIZED INDEXING LIMITATIONS USE & REUSE USE CASES GENERIC BRIDGE FUTURE

SLIDE 57

QUESTIONS

invenio-software.org info@invenio-software.org