Platforms for Data Sharing, Data Analytics, and Data Visualization - - PowerPoint PPT Presentation

platforms for data sharing data analytics and data
SMART_READER_LITE
LIVE PREVIEW

Platforms for Data Sharing, Data Analytics, and Data Visualization - - PowerPoint PPT Presentation

Platforms for Data Sharing, Data Analytics, and Data Visualization Richard Glassco Noblis Consultant to FHWA Office of Operations: Research & Development Agenda Overview of the Safety Pilot Model Deployment Size-Related


slide-1
SLIDE 1

Platforms for Data Sharing, Data Analytics, and Data Visualization

Richard Glassco Noblis Consultant to FHWA Office of Operations: Research & Development

slide-2
SLIDE 2

2

U.S. Department of Transportation ITS Joint Program Office

Agenda

  • Overview of the Safety Pilot Model Deployment
  • Size-Related Considerations
  • Demonstration of the RDE
  • Other CDS Program Tool development activities
slide-3
SLIDE 3

3

U.S. Department of Transportation ITS Joint Program Office

RESEARCH DATA EXCHANGE

slide-4
SLIDE 4

4

U.S. Department of Transportation ITS Joint Program Office

Goals of the Safety Pilot Model Deployment

  • 1. Test connected vehicle operations in real-world

conditions

  • 2. Understand how regular drivers use connected vehicle

technologies

  • 3. Determine the safety benefits of a connected vehicle
slide-5
SLIDE 5

5

U.S. Department of Transportation ITS Joint Program Office

Ann Arbor, Michigan, site of SPMD

slide-6
SLIDE 6

6

U.S. Department of Transportation ITS Joint Program Office

SPMD TYPES OF MESSAGES

  • Basic Safety Messages (BSMs)
  • Signal Phase and Timing (SPaT) Messages
  • MAP messages
  • Traveler Information Messages (TIMs)

Plus contextual data:

  • Weather
  • Traffic counts

Plus documentation

slide-7
SLIDE 7

7

U.S. Department of Transportation ITS Joint Program Office

SPMD Statistics

  • 2836 vehicles driving on 73

miles of roadways (freeways and arterials)

  • Each vehicle generates Basic

Safety Messages (position, speed, acceleration, etc.) every 10th of a second

  • 2 Months of data (4 billion

BSMs) are on the RDE

  • The SPMD data environment is

structured in 6 data sets, with a total sanitized volume of approximately 24 GB

  • The original un-sanitized data

set was approximately 50GB

slide-8
SLIDE 8

8

U.S. Department of Transportation ITS Joint Program Office

SPMD Data in the RDE

slide-9
SLIDE 9

9

U.S. Department of Transportation ITS Joint Program Office

SPMD Contained Data that Could be Shared

  • Some Data may not be shared, for example:
  • It contains PII that cannot be removed
  • It contains commercially sensitive information
  • It contains intellectual property rights
  • There are legal reasons the data cannot be released to the

public

  • Such data may be stored in the FHWA’s Saxton Transportation

Operations Laboratory. Access to data in the Saxton Lab is restricted to researchers with appropriate qualifications and training.

slide-10
SLIDE 10

10

U.S. Department of Transportation ITS Joint Program Office

Requirements for Data Submitted to the RDE

  • PII is data that by itself or in combination with other data could be used to

determine the identity of individual private vehicles or persons from who driving data is obtained.

  • Examples of PII include driver age, gender, social security number, or

address, and vehicle VIN number, license plate number, exact length and width, or weight, or other identifying characteristics.

  • PII may also be derived from vehicle trajectories that reveal the origin or

destination of personal trips, or intermediate stops of personal nature such as a school, workplace, or frequent errand.

  • Public transit vehicles and public vehicles such as maintenance trucks are not

subject to this PII restriction

slide-11
SLIDE 11

11

U.S. Department of Transportation ITS Joint Program Office

PII had to be removed from the SPMD data while maintaining meaningfulness of the data PII had to be removed from the SPMD data while maintaining meaningfulness of the data

  • To protect participants’ identity the RDE

team rid all data files of data elements that contain PII

  • Data elements that could be paired with
  • ther publicly available data were also

deleted

  • Vehicle trajectories, with points collected

at 10Hz, revealed the identity of participants, therefore

□ Sanitization algorithms were

developed to truncate trajectories to mask trip origins and destinations

□ The algorithms were also applied to

dependent / related data elements

PII had to be removed from the SPMD data while maintaining meaningfulness of the data

  • To protect participants’ identity the RDE

team rid all data files of data elements that contain PII

  • Data elements that could be paired with
  • ther publicly available data were also

deleted

  • Vehicle trajectories, with points collected

at 10Hz, revealed the identity of participants, therefore

□ Sanitization algorithms were

developed to truncate trajectories to mask trip origins and destinations

□ The algorithms were also applied to

dependent / related data elements

slide-12
SLIDE 12

12

U.S. Department of Transportation ITS Joint Program Office

Because of the Size of the Data Environment …

  • Moved to storage on the Cloud rather than contractor's

server

  • Developed subsetting tool
slide-13
SLIDE 13

13

U.S. Department of Transportation ITS Joint Program Office

Full Year of SPMD Data

  • USDOT has reserved the entire year of data for future

study

  • Unsanitized, so it is not posted on the RDE – it will be

stored in the Saxton Lab

  • Restricted access
  • Special restrictions for video or audio data
slide-14
SLIDE 14

14

U.S. Department of Transportation ITS Joint Program Office

LIVE DEMONSTRATION OF RDE

slide-15
SLIDE 15

15

U.S. Department of Transportation ITS Joint Program Office

OTHER CDS TOOL DEVELOPMENT ACTIVITIES

slide-16
SLIDE 16

16

U.S. Department of Transportation ITS Joint Program Office

Data Visualization Project

  • Data visualization is #1 request of RDE stakeholders
  • Uses data on the RDE to develop data visualization

tools

  • Map and non-map based, interactive
  • Applicable to multiple data environments
slide-17
SLIDE 17

17

U.S. Department of Transportation ITS Joint Program Office

Samples from Data Visualization Project

slide-18
SLIDE 18

18

U.S. Department of Transportation ITS Joint Program Office

Dynamic Interrogative Data Capture (DIDC)

slide-19
SLIDE 19

19

U.S. Department of Transportation ITS Joint Program Office

BSM Emulator

slide-20
SLIDE 20

20

U.S. Department of Transportation ITS Joint Program Office

OSADP - Companion Website for Software

  • www.itsforge.net
slide-21
SLIDE 21

21

U.S. Department of Transportation ITS Joint Program Office

References and Contact Info

  • Research Data Exchange (RDE) website

https://www.its-rde.net/

  • Open Source Application Development

Portal (OSADP): www.itsforge.net

Jon Obenberger, PhD, P.E. Senior Transportation Research Engineer, FHWA Jon.Obenberger@dot.gov