Intuitive and machine understandable representation of the - - PowerPoint PPT Presentation

intuitive and machine understandable representation of
SMART_READER_LITE
LIVE PREVIEW

Intuitive and machine understandable representation of the - - PowerPoint PPT Presentation

Intuitive and machine understandable representation of the bioinformatics domain and of related resources with Resourceomes Nicola Cannata, Flavio Corradini, Sergio Gabrielli, Luana Leoni, Emanuela Merelli, Francesca Piersigilli, Leonardo Vito


slide-1
SLIDE 1

Intuitive and machine understandable representation of the bioinformatics domain and

  • f related resources with Resourceomes

Nicola Cannata, Flavio Corradini, Sergio Gabrielli, Luana Leoni, Emanuela Merelli, Francesca Piersigilli, Leonardo Vito

  • Mathematic and Computer Science Department,

University of Camerino, Italy

slide-2
SLIDE 2
slide-3
SLIDE 3

How many minutes (hours) do we spend every day in Google and bibliographic searches?

slide-4
SLIDE 4

In life sciences and “-omics” disciplines we are becoming used to deluges and overflows…

slide-5
SLIDE 5

The data overflow…

slide-6
SLIDE 6
slide-7
SLIDE 7

Besides data overflow we are experimenting also resources overflow. E.g. academic Articles,…

slide-8
SLIDE 8

Now science is becoming e-science…

slide-9
SLIDE 9
slide-10
SLIDE 10
slide-11
SLIDE 11

Will e-science become g-science…?

slide-12
SLIDE 12

Databases are essential resources for bioinformatics

slide-13
SLIDE 13
slide-14
SLIDE 14

201 86 1999 226 95 2000 281 73 2001 335 94 2002 386 95 2003 548 142 2004 719 137 2005 858 164 2006 968 174 2007 DB listed Articles Year

Database special issue Nucleic Acids Research (NAR) - Oxford Journals

slide-15
SLIDE 15

Web Servers are common tools for bioinformaticians

slide-16
SLIDE 16
slide-17
SLIDE 17

Articles published “ONLY” from main bioinformatics journals

BMC Journal of PLoS Computational Briefings IEEE Trans. Applied Int.J.of Bioinf. Computational Computational Biology in

  • Comp. Biol. And

Res.& Appl. Biology Biology and Chemistry Bioinformatics Bioinformatics (IJBRA) 2005 900 414 83 61 58 47 34 32 32 2004 627 209 69 53 40 22 24 2003 534 66 61 71 35 2002 365 40 52 77 41 2001 245 9 39 66 31 2000 178 1 52 72 33 Year Bioinformatics Bioinformatics Bioinformatics

100 200 300 400 500 600 700 800 900 2005 2004 2003 2002 2001 2000 Year Articles

Bioinformatics BMC Bioinformatics Journal of Computational Biology Computational Biology and Chemistry Briefings in Bioinformatics IEEE Trans. Comp. Biol. And Bioinformatics Applied Bioinformatics PLoS Computational Biology Int.J.of Bioinf. Res.& Appl. (IJBRA)

slide-18
SLIDE 18

Bioinformatics is evolving

“IN SILICO EXPERIMENTS” From command line… (till 90s) to web interfaces and (perl) scripts… (with the advent of the WWW) To Web Services and workflows (now)

slide-19
SLIDE 19

In this new bioinformatics Web Services play a key

  • role. Programs interact with programs on the web…
slide-20
SLIDE 20

Why to search for resources?

  • you have to develop a program, a database,

possibly avoiding to re-invent the wheel…

  • interdisciplinarity…

introduction to a new domain. Obtain a fast overview

  • f a (new, for you) scientific domain (preferably in a

visual fashion)

slide-21
SLIDE 21

So, where to search for (bioinformatics) resources

  • that of course you are not aware of - ?
slide-22
SLIDE 22

In search engines?… Good luck!

slide-23
SLIDE 23

In specialized web sites? … First, find them!

slide-24
SLIDE 24

In SIG web sites? … Are they updated?

slide-25
SLIDE 25
slide-26
SLIDE 26
slide-27
SLIDE 27
slide-28
SLIDE 28

In the literature? … When do you need them?

slide-29
SLIDE 29

And also when articles are detected, maybe the presented resources are not there anymore!

slide-30
SLIDE 30

In the age of WWW resources appear… and disappear

slide-31
SLIDE 31

If you are lucky you can find some nice and intuitive reviews

slide-32
SLIDE 32

What’s with the “staying updated”?

  • TOC e-mail services
  • Subject alert services
slide-33
SLIDE 33

Resource directories are good place to start…

slide-34
SLIDE 34
slide-35
SLIDE 35
slide-36
SLIDE 36

But the amount of resources and their variety require that directories would be machine understandable

slide-37
SLIDE 37

Intelligent software agents will then be able to “reason”

  • n the resources and to easily find them for you
slide-38
SLIDE 38
slide-39
SLIDE 39

Two “orthogonal” classifications

  • resources should be classified according their nature

(a program, a database, a paper, a person…) and according what they refer to, what they are for

slide-40
SLIDE 40

Computer science subjects can be classified…

slide-41
SLIDE 41
slide-42
SLIDE 42

As well as Mathematics subjects

slide-43
SLIDE 43
slide-44
SLIDE 44

But for Bioinformatics (and Life Sciences in general) is not existing any shared classification schema of the domain. And life scientists like taxonomies…

slide-45
SLIDE 45

BioInformatics SystemsBiology Structural Bioinformatics Genome Analysis Sequence Analysis Phylogenetics Genetics Population Analysis Databases And Ontologies Gene Expression Data And Text Mining IsA IsA IsA IsA IsA IsA IsA IsA IsA

The classification introduced for articles of “Oxford’s Bioinformatics” in 2005

slide-46
SLIDE 46

Resource Ontology

Concerns Concerns

C

  • n

c e r n s

Domain Ontology

Our proposal: Resourceomes

A Resourceomes permits to arrange in an intuitive (for humans) and machine-understandable (for SW) manner the perceived structure of a domain and to “stick” resources (with their semantic relationships) to concepts of the domain

slide-47
SLIDE 47
slide-48
SLIDE 48

This is just our first prototype of resource ontology

slide-49
SLIDE 49
slide-50
SLIDE 50
slide-51
SLIDE 51

Actor Literature resource Artifact

Publishes Creates CollaboratesWith

Literature resource Resource

Describes Cites

Examples of semantic relationships between resources

slide-52
SLIDE 52

Better this representation…

slide-53
SLIDE 53

… or this one?

slide-54
SLIDE 54

A web-based semantic browser for Resourceomes

slide-55
SLIDE 55

The first prototype of our browser

slide-56
SLIDE 56
slide-57
SLIDE 57
slide-58
SLIDE 58
slide-59
SLIDE 59
slide-60
SLIDE 60

Passing with the mouse over the icons you can see a description and the URI of the resource

slide-61
SLIDE 61
slide-62
SLIDE 62

Many open issues:

  • Annotation of resources (manual, semi-automatic,

automatic)

  • Representation of the domain (ontology, concept

maps, topic maps, SKOS?)

  • Ranking of resources (page rank, judgments )
  • Status of resources (agents checking them)
  • Graph visualization? (GRAPPA – Graphviz)
  • Not only browser but also visual editor (GrOWL?)
slide-63
SLIDE 63

Acknowledgment

www.litbio.org

This work is supported by the Italian Investment Funds for Basic Research (FIRB) project “Laboratory of Interdisciplinary Technologies in Bioinformatics” (LITBIO).

slide-64
SLIDE 64

Hoping that the bioinformatics community could soon say

Thank you for your attention!