Administrators & cloud service providers
Data scientists & domain researchers
Data Analytic development framework
Toolbox of descriptive and predictive models
Lemonade (Live Exploration and Mining Of a Non-trivial Amount of Data from Everywhere) is an analytics platform that supports intuitive definition of tasks for knowledge discovery, mining, and learning from large amounts of data that come from a wide spectrum of scenarios. The platform interface is a web application in which users may define analytics workflows visually by dragging and dropping operations and data sources, and connecting them. Lemonade is being developed by UFMG as part of the EUBra-BIGSEA project and supports the creation of a processing workflow, import, export or management of datasets, executing and managing workflows and the data visualisation.
LEMONADE was designed for a wide variety of users from areas such as Mathematics, Statistics, Business Administration, as well as Data Science practitioners from any knowledge area who are not familiar with programming languages but need to develop analytics workflows.
Features available successfully cover
Three different user roles are supported in Lemonade: a system administrator, a data scientist and a data explorer. System administrator will be responsible for keeping Lemonade running, adding new users, setting permissions and security, and managing data sets. Data scientists must know about Lemonade operations in order to create processing workflows and data being processed, their characteristics and how his/her results can be applied in a real scenario. Data explorers are the users of existing models.
LEMONADE can be downloaded from https://github.com/eubr-bigsea/lemonade
To be kept up and running, Lemonade requires a cluster of processing computers and data storages. The size and capacity of the cluster depends on the number of users, data volume and complexity of workflow/tasks. LEMONADE depends on Apache Mesos (standalone mode) or a distributed processing technology (Apache Spark, BSC COMPSs or CMCC Ophidia), Oracle MySQL database server and a Linux operating system distribution.
LEMONADE provides a rich web interface, which is both accessible to learners and powerful to experts. Lemonade scope plan comprises more than 30 different operations of data mining, machine learning and extraction, transformation and loading of data. The platform is also capable of processing massive amounts of data (“Big Data”), since it is being built on top of three scalable processing and storage technologies: Apache Spark, CMCC Ophidia and BSC COMPSs
Lemonade is an open-source solution. All dependencies (operating system, processing frameworks, infrastructure technologies) are also open source, so there are no licensing costs.
For more information get in touch at firstname.lastname@example.org
View related publications
--> A. Alic et al., BIGSEA: A Big Data analytics platform for public transportation information, Future Generation Computer Systems, Elsevier (2018)