Born Digital the Art of Archiving Phouos with Script & Batch - - PowerPoint PPT Presentation

born digital
SMART_READER_LITE
LIVE PREVIEW

Born Digital the Art of Archiving Phouos with Script & Batch - - PowerPoint PPT Presentation

Born Digital the Art of Archiving Phouos with Script & Batch Processing Our team The Librarian The Programmer The Archivist Rachel Evans is the Metadata Leslie Grove is the most senior Sharon Bradley , certified Services Librarian at UGA


slide-1
SLIDE 1

Born Digital

the Art of Archiving Phouos with Script & Batch Processing

slide-2
SLIDE 2

Leslie Grove is the most senior Web Developer at UGA School

  • f Law in the Information

Technology Services

  • Department. In addition to

coding, she designs art-jewelry!

Our team

The Librarian

Rachel Evans is the Metadata Services Librarian at UGA Law Library, and was previously a web developer at UGA Law School. Her undergrads are in Art & Music technology.

The Programmer The Archivist

Sharon Bradley, certified archivist, is the former Special Collections Librarian at UGA Law, and is now the Digital & Scholarly Resources Librarian at Mercer Law Library.

slide-3
SLIDE 3

A Brief History

Phouographs in our Repository

  • est. 2006
slide-4
SLIDE 4

Impetus & Inspiration

A Looming Web Upgrade & Credit to 2 CALICon 2019 Sessions

Beatty, John, “Automating Processing and Intake in the Institutional Repository with Python” (2019). Available
  • nline: http://2019.calicon.org/node/1/sessions/automating-processing-and-intake-institutional-repository-python
Bowman, Jesse and Martone, Stephan, “From Concept to Concrete: Teaching Law Students about AI” (2019). Available online: http://2019.calicon.org/node/1/sessions/concept-concrete-teaching-law-students-about-ai
slide-5
SLIDE 5

Google Vision Tests

  • Objects
  • Labels
  • Texu
slide-6
SLIDE 6

Other Uses for Google Vision: Incoming Class Phouos

slide-7
SLIDE 7

Project timeline

08.2019

Programmer & Librarian discuss in Web team the updated timeline for moving from Drupal 7 to 8; ask “what to do with all those old web-based galleries”

10.2019

Librarian & Archivist begin working closely with Office of Communications & Public Relations for project categories; create shared Google Sheet

01.2020

Librarian coordinates with bePress representative at Digital Commons to create new galleries and make some structural changes in the “archives” and within that “photographs” series

  • f the repository

04.2020

Programmer runs all three scripts during the month of April and delivers 3 lengthy spreadsheets of data to the Librarian who begins data cleanup, sorting and batch loading

06.2020

When we expect to be finished with this project And make images available in our Digital Commons

06.2019

CALICon ‘19 inspirational sessions

11.2019

Archivist changes institutions

03.2020

COVID-19 Closure provides more time for this project

08.2020

When I.T. plans to sunset all of the old galleries (drupal 7, flash, and html-based pages fade to black)

slide-8
SLIDE 8

Outside

Our existing series (we would restructure and expand some to make room for the new images)

Inside

The basic metadata fields underneath = Less is more! We used “batch upload/revise”, “group” and “collect”.

1. Title [title] 2. Date [publication_date] 3. Abstract [abstract] 4. URL [fulltext_url]

slide-9
SLIDE 9

sheet for “keep?” or nou sheet for “scope” guides

slide-10
SLIDE 10

Blue Pages

2015 Drupal-based

Red Pages

2013 - 2014 Flash-based

Yellow, Green & Orange

2001 - 2013 Various HTML-based

slide-11
SLIDE 11

Scripting to harvest

Script 1

2015 1 version Drupal-based Total Galleries = 50 Total Photos = 793

Script 2

2013 - 2014 1 version Flash-based Total Galleries = 88 Total Photos = 1,273

Script 3

2001 - 2013 3 versions of HTML-based Total Galleries = 465 Total Photos = 10,014

slide-12
SLIDE 12

The Script

Our existing series needed a little restructuring and some expanding to make room for the new images.

The Harvest

The basic metadata fields underneath = Less is more! We used “batch upload/revise”, “group” and “collect”.

1. Title [title] 2. Date [publication_date] 3. Abstract [abstract] 4. URL [fulltext_url]

slide-13
SLIDE 13

The Results

Toual Phouographs = 12,080 Toual Galleries = 603

slide-14
SLIDE 14

Questions?

Sources 1. Donovan, James M. and Watson, Carol A., "White Paper: Behind a Law School's Decision to Implement an Institutional Repository" (2008). Available online: https://digitalcommons.law.uga.edu/law_lib_artchop/15 2. Beatty, John, “Automating Processing and Intake in the Institutional Repository with Python” (2019). Available online: http://2019.calicon.org/node/1/sessions/automating-processing-and-intake-institutional-repository-python 3. Bowman, Jesse and Martone, Stephan, “From Concept to Concrete: Teaching Law Students about AI” (2019). Available online: http://2019.calicon.org/node/1/sessions/concept-concrete-teaching-law-students-about-ai

The Librarian rsevans@uga.edu The Programmer lgrove@uga.edu The Archivist bradley_s@law.mercer.edu