AUMC - FDP implementation in myDRE environment

Creator: Priyanka Ojha

Source: GitHub - AmsterdamUMC/FDP_AMS at myDRE

ย 

ย 

Introduction

I had the task to try out the reference implementation of FDP and also try to
import data into it. This involved working on 2 projects namely FDP and SeMPyRO. I initially started with reproducing FDP on my Local Laptop. My environment consisted of myDRE which is a managed service with Azure under the hood. I used a ubuntu22 machine as a base and installed docker on it
followed steps mentioned below in that order :

  1. To install docker on DRE , I followed these instructions :

    1. Installing docker on Ubuntu 22

    2. Install using the apt repository

    3. Install Docker Engine on Ubuntu

The Ubuntu release was Jammy in my case.

  1. Then I installed Docker compose via this command :

sudo apt-get install docker-compose-plugin
  1. Check the version to ensure Docker compose in installed successfully

docker-compose version
image-20241030-140346.png
  1. Create a folder fdp in root directory

sudo mkdir fdp
  1. Give the read write and execute rights to it

  1. There I copied 2 files (docker-compose.yml and application.yml) I had created

    with the help of Local Deployment โ€” FAIR Data Point 1.16 documentation in the fdp

    folder.

  2. Go to the FDP folder via this command

  1. Give this command to start the docker container

  1. To check if the containers are started correctly : you can use the following
    commands to check it :

It lists all the containers that are started. There should be 4 services running like below:

ย 

image-20241030-140808.png
  1. If any of the services is not started, there might be an issue. Most probably, its
    related to port 80 being occupied. So You can also check for it before starting
    the docker container and kill the process and then can resume again from step
    8 above.

    1. Command to check if port 80 is occupied is:

    2. Command to kill process occumpying prot 80 is:

    3. You can also check for similar issue here

11Dww

  1. If all the services are running correctly, you should be able to access the FDP
    via this command.

And see the following output below:

ย 

  1. If a different message is shown, you might have to wait a bit as some services
    take a bit of time and try again . If that doesnโ€™t help, then something is wrong
    and you need to check the logs via this command.

  2. You can now via a browser from a windows machine in the same workspace
    access the URL via the IP address of the Host PC.

    1. Make sure to white list the IP address of the Linux machine.

    2. On the Windows machine in Azure workspace, disable the Proxy by
      following these steps:

Select the Start button, then select Settings > Network & Internet > Proxy . Turn all off. Once that is done, you should be able to access the FDP via the browser.

ย 

  1. Open the Edge or Chrome based browser, It doesnโ€™t work on Internet Explorer. Type the IP address of Linux PC and the FDP website should be accessible.

ย 

  1. Now, you can also import the Shacl File by following steps mentioned here.

    1. Make sure you are logged in as an admin user.

    2. Click on your user icon in the top right of the UI, and select the "Metadata schemas" option.

    3. Here you see a list of schemas. Click on the schema you want to update (for example Catalog).

    4. The last field on this page shows the Form definition. Here you can paste the shape definitions as found in this folder.

    5. Click Save and release on the bottom of the page.

    6. Select a version number

    7. Click Release.

  2. Now you can go to this project and try to run the notebook mentioned here
    and try importing the data to your FDP.

ย 

List of all sources cited, including original project documentation

  1. FAIR Data Point | FAIR Data Points can be used to describe your data sets in a FAIR way, using
    standard metadata and make them available through simple WWW protocols.

  2. FAIRDataTeam/FAIRDataPoint (http://github.com )

  3. FAIR Data Point Reference Implementation Documentation โ€” FAIR Data Point 1.16
    documentation

  4. Health-RI/SeMPyRO: pydantic data models for DCAT-AP v3 (http://github.com )

  5. AmsterdamUMC/FDP_AMS at myDRE (http://github.com