We suggest the following itineraries:

  • If your main interest is in moving data, you should attend 1.1 and 1.2
  • If your main interest is metadata, you should attend 1.1 and 2.2
  • If your main interest is in technical details of PIDs and how they are integrated with EUDAT’s other services, you should attend 2.1 and 1.2
  • If your main interest is in the organisation of data and you’re interested in registration and metadata, you should attend 2.1 and 2.2.
TRAINING TRACK 1.1: DATA STAGING, REPLICATION AND STORAGE: INTEGRATING WITH EUDAT'S BUILDING BLOCKS
   Abstract        
A key benefit of a collaborative data infrastructure is the opportunities it offers for moving (replicating, uploading) data more easily for a variety of purposes such as preservation, access optimisation, to improve sharing, and to prepare data for processing. Data is precious: particularly that which corresponds to observations of real-life systems (such as seismic activity, ocean temperatures, observations about species and languages/cultures) that may not be reproducible.  In these cases, having reliable, off-site copies of your data can be invaluable. Also, sharing of data between scientists, organisations and disciplines is becoming more common, and provides possibilities of new approaches to problems by bringing together data from multiple sources. Finally, in order to have data readily available for computation or analysis it needs to be moved from where it is stored, to where it is processed (and back again). All of these use cases require that data can be moved efficiently and reliably. This session covers some of the technologies and techniques that can be used to do this, and will also discuss these in the context of EUDAT. After attending this session, you will know which technologies uses to provide its Data staging, Safe replication and SimpleStore services and how they are used. You will know the benefits of using these services, and have learned what the underlying technologies provide to help with moving of data. You will know how these EUDAT building blocks could be integrated with existing services.
 

AGENDA

09:00-09:15

Arrival & Registration

09:15-10:00

Introduction [slides]

10:00-10:45

Data Replication & Data Staging Services [slides]

10:45-11:15

Break

11:15-12:00

SimpleStore: Introduction [slides]

12:00-12:45

iRODS: What it is, what it can do [slides]

 
TRAINING TRACK 1.2: IMPLEMENTATION OF STAGING, REPLICATION AND STORAGE: SERVICES AND TOOLS
   Abstract        
In this session we will go into more depth regarding the technologies used in EUDAT's data services. We'll show you how iRODS has been set up and configured, and in particular how it has been integrated with services like EPIC to obtain PIDs. We will describe how you can use GridFTP to stage your data on to the EUDAT infrastructure or between EUDAT and HPC resources and we'll show you how EUDAT's data staging script can be used to help you make use of services like Globus Online. We'll talk about how repositories like those based on Fedora Commons can be integrated with the Safe Replication service and show a concrete example of how this has been done.

AGENDA

14:00-14:45

iRODS Hands On [slides]

14:45-15:30

Safe Replication Hands On [slides]

15:30-16:00

Safe Replication & Repositories

16:00-16:30 Break

16:30-16:50

Introduction to GridFTP [slides]

16:50-17:30 Demonstration of data staging service (including GridFTP)

17:30

Close

 
TRAINING TRACK 2.1: PERSISTENT IDENTIFIERS, HANDLES, TYPE REGISTRIES, EPIC
   Abstract        
In this session we’ll review how PIDs can be used, and we will discuss how the handle system can be used with a type registry to provide functionality beyond basic handle resolution. The session will also include a hands-on tutorial on the use of the EPIC API which can be used to construct or interact with the EPIC Handle service.

AGENDA

09:00-09:15

Arrival & Registration

09:15-09:45

Very brief introduction to PIDs. [slides]

09:45-10:45

How the handle system can be used with a type registry to provide
functionality beyond basic handle resolution. [slides]

10:45-11:15

Coffee Break

11:15-12:00

EPIC and Handles in EUDAT [slides]

12:00-12:45

EPIC/iRODS practical

[slides]

 
TRAINING TRACK 2.2: METADATA
   Abstract        
Metadata is going to serve increasingly more functions. Traditionally it is used for finding useful data and tools. In future it will be increasingly often used for scientific purposes and for orchestrating scientific workflows. It is therefore not surprising that metadata remains the focus of discussions and deservedly affords the attention of the Research Data Alliance. Data is really only of use in research when its context and provenance is understood. Context such as when it was created, who it was created by, to what it relates, how it was created and how it can be used is even more vital to understand once data has been stored or shared. Metadata provides this context and is key to being able to make the most of data by making it findable, understandable and reusable. This session covers the principles of Metadata, introduces important metadata efforts such as Dublin Core, ISO standards and in particular community based solutions to foster science and compares approaches such as the use of fixed schema and meta models. It will also discuss what the intentions and architectures of major initiatives such as EUDAT, DataONE, Europeana are, how the work in this area is carried out, how one can participate in the EUDAT services and how MD services can be used. We’ll also show how Metadata is used in EUDAT’s SimpleStore and how this has been built on Invenio.

AGENDA

14:00-14:40

Metadata Standards and Interoperability

14:50-15:45

How YOU can publish your metadata

15:45-16:15

Coffee Break

16:15-17:00

Metadata and SimpleStore / Invenio

17:00-17:30

Cancelled: Metadata in EUDAT / DataONE and Europeana
Metadata in DataONE and Europeana will now be covered in the Metadata track in the main conference.