Skip to content
This repository has been archived by the owner on Sep 20, 2024. It is now read-only.

Latest commit

 

History

History
29 lines (16 loc) · 1.19 KB

ARCHITECTURE.md

File metadata and controls

29 lines (16 loc) · 1.19 KB

Architectural Design for the Scraping Toolkit

This document provides a summary of the design approach/philosophy behind the scraping toolkit. It gives you an overview of how all components in this kit fit together.

Process diagram

The diagram below outlines the process flow of the 4 manual steps needed to initiate a crawler to getting the output into CKAN.

process diagram for scraping toolkit

process diagram for scraping toolkit

Architecture Diagram

The diagram below showcases the architectural overview of the scraping toolkit. Also represented is an example of a system that can utilise the final outputted file(s) from the scraping toolkit (i.e. a CKAN portal via the CKAN harvester)

architecture diagram for scraping toolkit

architecture diagram for scraping toolkit

Docker Container Architecture Design

The diagram below represents the scraping kit architecture from a Docker container perspective

architecture diagram for scraping toolkit (docker container perspective)

architecture diagram for scraping toolkit (docker container perspective)