Starting from the end-user requirements highlighted in deliverable D7.1, this document defines the architecture of the integrated fast and Big Data ecosystem, which represents the central data management component of the EUBra-BIGSEA platform.
The proposed architecture integrates multiple classes of big data systems. Two distinct aspects underpin it:
- A comprehensive evaluation and assessment of the big data tools available in the general landscape, from the standpoints of data storage, access, analytics and mining.
- An in-depth analysis of the data sources in terms of data model, formats, volume, metadata, and functional needs.
This document also highlights the following key features: the integration of different classes of big/fast data tools to address multifaceted use-case requirements, the dynamicity and elasticity of the environment, and a secure-by-design ecosystem.
The proposed architecture combines all these elements in a cloud environment, aiming to provide, to some extent, a general approach for dealing with use cases and scenarios of high social impact, such as the one proposed in the project.