Documentation with deployment examples: https://github.com/CogStack/CogStack-NiFi/tree/devel/deploy
Documentation on available services: https://github.com/CogStack/CogStack-NiFi/tree/devel/services
DockerHub: https://cloud.docker.com/repository/docker/cogstacksystems/cogstack-nifi
The official website: https://nifi.apache.org/
The official documentation: https://nifi.apache.org/docs.html
-| | |
-|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------|
-| CogStack-NiFi is the re-architected version of CogStack-Pipeline that replaces the fixed Spring Batch-based pipeline engine with [Apache NiFi](https://nifi.apache.org/). It focuses on fully configurable and scalable data flows with the data processing engine that is easy to use, deploy and tailor to any site-specific data flow requirements. Apache NiFi also comes in with build-in monitoring, data provenance and security features that puts the operations in better control and reliability. Documentation with deployment examples: https://github.com/CogStack/CogStack-NiFi/tree/devel/deploy Documentation on available services: https://github.com/CogStack/CogStack-NiFi/tree/devel/services DockerHub: https://cloud.docker.com/repository/docker/cogstacksystems/cogstack-nifi The official website: https://nifi.apache.org/ The official documentation: https://nifi.apache.org/docs.html
**CogStack-NiFi useful links:**
**Apache NiFi resources:**
|  |
+
-:::{IMPORTANT}
-Please note that CogStack-NiFi project is still under active development with the newest version **0.1.0**.
-:::
### Apache NiFi – overview
@@ -74,11 +46,8 @@ Each ingestion job that is being run by CogStack-Pipeline also requires a separa
Moreover, one of the main limitations of CogStack pipeline has been support only for a document-centric data model for performing ingestion where each ingested record could only contain one document to be processed. Apache NiFi does not enforce document-centric data model and provides flexibility on defining custom data flows and data schemas. Handling multiple documents in a single record or using a patient-centric data model is a matter of tailoring the pipeline and defining or tailoring appropriate schema.
-Moreover, fixed ETL operations (implemented as modules in CogStack-Pipeline) can be included as custom ETL scripts or application modules inside a defined Apache NiFi data flow. For example, the text extraction done by [Apache Tika](https://tika.apache.org/) and NLP functionality (such as running [MedCAT](https://github.com/CogStack/MedCATservice) or [GATE NLP](https://github.com/CogStack/gate-nlp-service) applications was implemented as external micro-services exposing that expose a REST API and hence can be used directly in the data flow. All the third-party application dependencies are handled by the external services that further allows for separating the responsibilities.
+Moreover, fixed ETL operations (implemented as modules in CogStack-Pipeline) can be included as custom ETL scripts or application modules inside a defined Apache NiFi data flow. For example, NLP functionality, such as running [MedCAT](https://github.com/CogStack/MedCATservice) was implemented as external micro-services exposing that expose a REST API and hence can be used directly in the data flow. All the third-party application dependencies are handled by the external services that further allows for separating the responsibilities.
-:::{IMPORTANT}
-Please note that the recommended minimal resources requirements for running Apache NiFi will be higher than for CogStack-Pipeline and these will depend on the actual use-case.
-:::
### Example deployment and services
diff --git a/docs/overview/Elasticsearch.md b/docs/overview/Elasticsearch.md
index 113665b..58a3750 100644
--- a/docs/overview/Elasticsearch.md
+++ b/docs/overview/Elasticsearch.md
@@ -1,7 +1,7 @@
-# Elasticsearch
+# Elasticsearch / OpenSearch
## Introduction
diff --git a/docs/overview/Natural Language Processing.md b/docs/overview/Natural Language Processing.md
index 21af7f8..794e35b 100644
--- a/docs/overview/Natural Language Processing.md
+++ b/docs/overview/Natural Language Processing.md
@@ -64,52 +64,6 @@ Key resources:
Please note that there is available public MedCAT model trained on MedMentions corpus that can be used to play with.
:::
-## GATE NLP applications
-
-### Overview of GATE NLP suite
-
-[GATE NLP suite](https://gate.ac.uk/) is a well established and rich set of open-source technologies implementing full-lifecycle solution for text processing. The GATE ecosystem is very broad and outside of the scope of this documentation – here we will only focus on two applications:
-
-- [GATE Developer](https://gate.ac.uk/family/developer.html),
-- [GATE Embedded](https://gate.ac.uk/family/embedded.html).
-
-GATE Developer is a development environment that provides a large set of graphical interactive tools for the creation, measurement and maintenance of software components for natural language processing. It allows to design, create and run NLP applications using an intuitive user interface. These applications can be later exported as a custom *gapp* or *xgapp* application with the used resources.
-
-GATE Embedded, on the other hand, is an object-oriented framework (or class library) implemented in Java. It is used in all GATE-based systems, and forms the core (non-visual) elements of GATE Developer. In principle, it implements the runtime for executing GATE applications. It allows to run the *gapp* and *xgapp* applications that have been previously created in GATE Developer.
-
-
-:::{IMPORTANT}
-When deploying GATE applications within CogStack one may be interested in defining and tailoring custom GATE applications directly by using GATE Developer. Such prepared application can be in the next step provided into CogStack **GATE NLP Runner Service** that uses GATE Embedded to execute GATE applications. This way, provided NLP application can be deployed as a service and used in the data pipeline.
-:::
-
-Although there have been developed and published many applications in GATE NLP suite, in this page we only briefly cover Bio-YODIE.
-
-### Bio-YODIE
-
-Bio-YODIE is a named entity linking system derived from GATE YODIE system. It links mentions in biomedical text to their referents in the UMLS. It defines a broad set of types such as `Disease` , `Drug`, `Observation` and many more all of the types belonging to `Bio` group – for detailed information please refer to [the official documentation](https://gate.ac.uk/applications/bio-yodie.html).
-
-Bio-YODIE can be run either within GATE Developer application or as a service within CogStack (based on GATE Embedded and running as a Service). Here we primarily focus on the latter and refer the reader to the official Bio-YODIE website.
-
-**Key resources:**
-
-- The official website: