A JavaScript/TypeScript implementation of an RDF-based orchestrator for managing and executing data processing pipelines.
Install with `npm install @rdfc/orchestrator-js`.

## Table of Contents
- Features
- Usage
  - CLI
  - Programmatic API
- Configuration
- Architecture
- Development
  - Prerequisites
  - Building
  - Testing
  - Linting & Formatting
- Contributing
- License
## Features

- **Pipeline Management**: Define and manage data processing pipelines using RDF
- **TypeScript Support**: Built with TypeScript for better developer experience
- **Modular Architecture**: Easily extensible with custom processors and runners
- **Test Coverage**: Comprehensive test suite with Vitest
- **Developer Tools**: ESLint and Prettier for code quality
## Usage

### CLI

The orchestrator can be run using the provided CLI:

```bash
# Install the orchestrator
npm install @rdfc/orchestrator-js

# Run with a pipeline configuration
npx rdfc path/to/your/pipeline.ttl
```
The CLI tool loads the RDF pipeline configuration, starts the gRPC server, spawns the configured runners, initializes processors, and manages the entire pipeline lifecycle.
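If you want to trigger a pipeline from your own Node.js code, one simple option is to wrap the same CLI invocation. The snippet below is only a sketch: it assumes the package is installed locally and uses a placeholder pipeline path.

```typescript
import { spawn } from "node:child_process";

// Run `npx rdfc <pipeline>` from a script; orchestrator logs are streamed
// to this terminal via stdio inheritance.
const child = spawn("npx", ["rdfc", "path/to/your/pipeline.ttl"], {
  stdio: "inherit",
});

child.on("exit", (code) => {
  console.log(`Orchestrator exited with code ${code}`);
});
```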
## Configuration

Pipeline configurations are defined using RDF/Turtle format. Here's an example configuration:

```turtle
@prefix rdfc: <https://w3id.org/rdf-connect#>.
@prefix owl: <http://www.w3.org/2002/07/owl#>.

rdfc:reader
    rdfc:level "info";
    rdfc:label "test".
```
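Because the configuration is plain RDF, it can be inspected with ordinary RDF tooling. The sketch below uses the `n3` library and a hypothetical pipeline path; this is not necessarily how the orchestrator loads configurations internally.

```typescript
import { readFile } from "node:fs/promises";
import { Parser } from "n3";

// Parse the Turtle pipeline description into RDF quads.
const turtle = await readFile("path/to/your/pipeline.ttl", "utf8");
const quads = new Parser().parse(turtle);

// List every statement; a real tool would look up the processors,
// runners and channels that the orchestrator expects to find here.
for (const quad of quads) {
  console.log(quad.subject.value, quad.predicate.value, quad.object.value);
}
```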
## Development

### Prerequisites

- Node.js 16+
- npm 7+ or yarn
- TypeScript 4.7+
### Building

```bash
# Install dependencies
npm install
```

### Testing
```bash
# Run tests
npm test

# Run tests with coverage
npm test -- --coverage

# Run specific test file
npm test path/to/test/file.test.ts
```

### Linting & Formatting
```bash
# Run linter
npm run lint

# Fix linting issues
npm run lint -- --fix

# Format code
npm run format
```

### Project Structure
```
orchestrator-js/
├── bin/                      # Executable scripts
│   └── orchestrator.js       # Main CLI entry point and pipeline executor
├── lib/                      # Compiled JavaScript output
├── src/                      # TypeScript source files
│   ├── index.ts              # Main export file
│   ├── instantiator.ts       # Runner instantiation logic
│   ├── jsonld.ts             # JSON-LD utilities and RDF processing
│   ├── jsonld.ttl            # JSON-LD processor definitions
│   ├── logUtil.ts            # Logging utilities
│   ├── model.ts              # Data models and types
│   ├── model.ttl             # RDF model definitions
│   ├── orchestrator.ts       # Core orchestrator logic
│   ├── server.ts             # gRPC server implementation
│   ├── util.ts               # Utility functions
│   ├── pipeline.ttl          # Pipeline configuration schema
│   └── minimal.ttl           # Minimal example configuration
├── __tests__/                # Test files
│   ├── orchestrator.test.ts
│   ├── jsonld_derive.test.ts
│   ├── config.ttl
│   └── ...
├── .github/                  # GitHub workflows and templates
├── .husky/                   # Git hooks
├── package.json              # Project configuration and dependencies
├── tsconfig.json             # TypeScript configuration
├── jest.config.js            # Jest test configuration
├── eslint.config.mjs         # ESLint configuration
├── .prettierrc               # Prettier configuration
├── .editorconfig             # Editor configuration
└── README.md                 # This file
```
## Contributing
Contributions are welcome! Please follow these steps:
1. Fork the repository
2. Create a feature branch (`git checkout -b feature/AmazingFeature`)
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request
We follow Conventional Commits for commit messages:
- `feat`: New feature
- `fix`: Bug fix
- `docs`: Documentation changes
- `style`: Code style changes (formatting, etc.)
- `refactor`: Code refactoring
- `test`: Adding or modifying tests
- `chore`: Build process or auxiliary tool changes

## License
This project is licensed under the MIT License - see the LICENSE file for details.
---
## Architecture
The system follows a modular architecture with the following main components:
- Orchestrator: Manages the overall pipeline execution
- Runners: Handle the execution of processing tasks
- Processors: Individual processing units that transform or analyze data
- Server/Client: Communication layer between components
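The split of responsibilities can be summarised with a few rough types. These are purely illustrative and are not the actual definitions in `src/model.ts`:

```typescript
// Illustrative only: rough shape of the main components and how they relate.
interface ProcessorRef {
  uri: string;                        // identifier of the processor in the RDF pipeline
  arguments: Record<string, unknown>; // arguments derived from the configuration
}

interface RunnerRef {
  uri: string;
  processors: ProcessorRef[];         // processors hosted by this runner
  send(message: unknown): void;       // messages travel over the gRPC mainStream
}

interface OrchestratorState {
  runners: RunnerRef[];
  // Routing table: which runner consumes which channel
  // (the diagrams below call this channelToInstantiator).
  channelToInstantiator: Map<string, RunnerRef>;
}
```
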
#### Initialization Sequence
Initialization sequence diagram
```mermaid
sequenceDiagram
    autonumber
    participant O as Orchestrator
    participant R as Runner
    participant P as Processor
    Note over O: Initialize gRPC server on port 50051 (by default)<br/>Load and parse RDF pipeline configuration
    O->>O: startInstantiators(addr, pipeline)
    loop For each instantiator in pipeline
        O->>O: expectRunner(instantiator)
        Note over O: Create promise to wait for runner connection
        O->>R: Spawn runner process with address
        rect rgba(255, 0, 0, .1)
            R->>O: stub.connect() as mainStream
        end
        rect rgba(0, 0, 255, .1)
            R->>O: mainStream(FromRunner{identify: RunnerIdentify{ uri }})
        end
        O->>O: Resolve runner connection promise
        rect rgba(0, 0, 255, .1)
            O->>R: Send pipeline configuration<br/>mainStream(ToRunner{ pipeline })
        end
    end
    Note over O,P: Initialize all processors
    loop For each processor in each runner
        O->>O: expectProcessor(instantiator)
        Note over O: Generate JSON-LD configuration for processor
        rect rgba(0, 0, 255, .1)
            O->>R: Start processor with configuration<br/>mainStream(ToRunner{proc: Processor{ uri, config, arguments }})
        end
        R->>P: Initialize processor
        P->>R: Processor ready
        rect rgba(0, 0, 255, .1)
            R->>O: Initialized message with processor URI<br/>mainStream(FromRunner{initialized: ProcessorInitialized{ uri, error? }})
        end
        O->>O: Resolve processor startup promise
    end
    Note over O,P: Start all runners
    loop For each runner
        rect rgba(0, 0, 255, .1)
            O->>R: Processors can start<br/>mainStream(ToRunner{ start })
        end
        loop For each processor in runner
            R->>P: Start processor execution
        end
    end
```
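Steps such as "create promise to wait for runner connection" and "resolve runner connection promise" describe a deferred-promise pattern. Here is a minimal sketch of that pattern; it is illustrative only and not the actual code in `src/orchestrator.ts`.

```typescript
// Illustrative deferred-promise pattern: register a pending promise per runner URI
// and resolve it once that runner's identify message arrives on the mainStream.
const pendingRunners = new Map<string, () => void>();

function expectRunner(uri: string): Promise<void> {
  return new Promise((resolve) => pendingRunners.set(uri, resolve));
}

// Called by the gRPC handler when FromRunner{ identify } comes in.
function onRunnerIdentified(uri: string): void {
  pendingRunners.get(uri)?.();
  pendingRunners.delete(uri);
}

// Spawn the runner process, then wait until it has connected and identified itself.
async function startRunner(uri: string, spawnProcess: () => void): Promise<void> {
  const connected = expectRunner(uri);
  spawnProcess();
  await connected;
}
```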
Message sequence diagram
```mermaid
sequenceDiagram
    autonumber
    participant P1 as Processor 1
    participant R1 as Runner 1
    participant O as Orchestrator
    participant R2 as Runner 2
    participant P2 as Processor 2
    Note over P1: Processor generates message for a channel
    P1->>R1: Message with data
    rect rgba(0, 0, 255, .1)
        R1->>O: Send message to orchestrator<br/>mainStream(FromRunner{msg: SendingMessage { localSequenceNumber, channel, data }})
    end
    Note over O: Orchestrator routes message to target instantiator
    O->>O: Look up channelToInstantiator[channel]<br/>Translate localSequenceNumber to globalSequenceNumber
    rect rgba(0, 0, 255, .1)
        O->>R2: Forward message to receiving runner<br/>mainStream(ToRunner{msg: ReceivingMessage{ globalSequenceNumber, channel, data }})
    end
    R2->>P2: Runner forwards message to target processor
    P2->>P2: Process message
    P2->>R2: Message processed
    rect rgba(0, 0, 255, .1)
        R2->>O: mainStream(FromRunner{processed: GlobalAck{ globalSequenceNumber, channel }})
    end
    rect rgba(0, 0, 255, .1)
        O->>R1: mainStream(ToRunner{processed: LocalAck{ localSequenceNumber, channel }})
    end
    Note over P1: Processor is allowed to send a new message
```
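The routing step shown above boils down to a channel lookup plus some sequence-number bookkeeping. The sketch below is illustrative only; the real logic lives in `src/orchestrator.ts` and may differ.

```typescript
// Illustrative routing sketch: translate the sender's local sequence number into
// a global one, remember where the ack must go back to, and forward the message.
interface RunnerStream {
  send(msg: unknown): void;
}

const channelToRunner = new Map<string, RunnerStream>(); // who consumes a channel
const ackRouting = new Map<
  number,
  { runner: RunnerStream; localSequenceNumber: number }
>();
let nextGlobalSequenceNumber = 0;

function routeMessage(
  from: RunnerStream,
  localSequenceNumber: number,
  channel: string,
  data: Uint8Array,
): void {
  const target = channelToRunner.get(channel);
  if (!target) throw new Error(`No runner registered for channel ${channel}`);

  const globalSequenceNumber = nextGlobalSequenceNumber++;
  ackRouting.set(globalSequenceNumber, { runner: from, localSequenceNumber });
  target.send({ msg: { globalSequenceNumber, channel, data } });
}

// When the receiving runner reports a GlobalAck, relay a LocalAck to the sender
// so its processor may emit the next message.
function onProcessed(globalSequenceNumber: number, channel: string): void {
  const origin = ackRouting.get(globalSequenceNumber);
  if (!origin) return;
  ackRouting.delete(globalSequenceNumber);
  origin.runner.send({
    processed: { localSequenceNumber: origin.localSequenceNumber, channel },
  });
}
```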
Streaming message sequence diagram
```mermaid
sequenceDiagram
    autonumber
    participant P1 as Processor 1
    participant R1 as Runner 1
    participant O as Orchestrator
    participant R2 as Runner 2
    participant P2 as Processor 2
    P1->>R1: Start streaming message
    rect rgba(255, 0, 0, .1)
        R1->>O: Initiate sending stream<br/>stub.sendStreamMessage() as sendingStream
    end
    R1->>O: Send identify message<br/>sendingStream(StreamChunk{id: StreamIdentify{ localSequenceNumber, channel, runner }})
    rect rgba(0, 0, 255, .1)
        O->>R2: Notify receiving runner of incoming stream message<br/>mainStream(ToRunner{streamMsg: ReceivingStreamMessage{ globalSequenceNumber, channel }})
    end
    rect rgba(255, 0, 0, .1)
        R2->>O: Initiate receiving stream<br/>stub.receiveStreamMessage() as receivingStream
    end
    R2->>O: Send identify message<br/>receivingStream(SendingStreamControl{ globalSequenceNumber })
    O->>R1: Send stream control message, indicating that the stream is ready to accept data<br/>sendingStream(ReceivingStreamControl{ streamSequenceNumber })
    Note over P1: Begin streaming data
    loop For Each Chunk
        P1->>R1: Send a chunk of data
        R1->>O: Send a chunk<br/>sendingStream(StreamChunk{data: DataChunk{ data }})
        O->>R2: Receive a chunk<br/>receivingStream(DataChunk{ data })
        R2->>P2: Forward chunks to processor
        P2->>P2: Handle chunk
        P2->>R2: Chunk handled
        R2->>O: Sequence number of the chunk in the stream<br/>receivingStream(SendingStreamControl{ streamSequenceNumber })
        O->>R1: sendingStream(ReceivingStreamControl{ streamSequenceNumber })
        Note over P1: Processor is allowed to send a new chunk
    end
    P1->>R1: End of stream
    R1->>O: sendingStream closed
    O->>R2: receivingStream closed
    rect rgba(0, 0, 255, .1)
        R2->>O: mainStream(FromRunner{processed: GlobalAck{ globalSequenceNumber, channel }})
    end
    rect rgba(0, 0, 255, .1)
        O->>R1: mainStream(ToRunner{processed: LocalAck{ localSequenceNumber, channel }})
    end
    Note over P1: Processor is allowed to send a new message
```
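The per-chunk control messages give the stream simple backpressure: the sending side only emits the next chunk once the previous one has been acknowledged end to end. A minimal sketch of that loop on the sending side, illustrative rather than the runner's actual implementation:

```typescript
// Illustrative backpressure loop: each chunk is sent only after the previous
// chunk's stream control acknowledgement has come back.
class ChunkSender {
  private pending = new Map<number, () => void>();
  private nextSeq = 0;

  constructor(private sendChunk: (seq: number, data: Uint8Array) => void) {}

  // Call this when ReceivingStreamControl{ streamSequenceNumber } arrives.
  onControl(streamSequenceNumber: number): void {
    this.pending.get(streamSequenceNumber)?.();
    this.pending.delete(streamSequenceNumber);
  }

  // Send chunks one at a time, waiting for the control message in between.
  async sendAll(chunks: Iterable<Uint8Array>): Promise<void> {
    for (const chunk of chunks) {
      const seq = this.nextSeq++;
      const acked = new Promise<void>((resolve) => this.pending.set(seq, resolve));
      this.sendChunk(seq, chunk);
      await acked; // block until this chunk is acknowledged
    }
  }
}
```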