Skip to main content

Overview

RocketRide is a high-performance data processing engine built on a C++ core with a Python-extensible node system. With 50+ pipeline nodes, native AI/ML support, and SDKs for TypeScript, Python, and MCP, it lets you process, transform, and analyze data at scale — entirely on your own infrastructure.

Key Capabilities

  • High-performance C++ engine — Native multithreading purpose-built for throughput, not prototypes.
  • Visual pipeline builder — Build, debug, test, and scale AI and data workloads from your IDE with an intuitive visual canvas. No browser required.
  • 50+ pipeline nodes — Python-extensible, with 13 LLM providers, 8 vector databases, OCR, NER, PII anonymization, and more.
  • Multi-agent workflows — Orchestrate and scale agents with built-in support for CrewAI and LangChain.
  • TypeScript, Python & MCP SDKs — Integrate pipelines into native applications or expose them as tools for AI assistants.
  • One-click deploy — Run on Docker, on-prem, or RocketRide Cloud.

Core Concepts

Pipelines

A pipeline is a directed graph of nodes that processes data from input to output. Pipelines are defined as .pipe files (JSON format) and rendered visually in the IDE extension. You can run, monitor, and debug pipelines directly from the canvas.

Nodes

Nodes are the building blocks of every pipeline. Each node performs a specific operation — calling an LLM, embedding text, querying a vector store, transforming data, and more. Nodes are organized into categories by function:

CategoryNodesDescription
Source15Where data enters the pipeline (webhook, chat, dropper)
LLM13Language model providers (OpenAI, Anthropic, Google, and more)
Store9Vector database integrations (Pinecone, Qdrant, Weaviate, and more)
Text7Text analysis and transformation (NER, PII, sentiment, and more)
Agentic4Agent framework orchestration (CrewAI, LangChain)
Other4Utilities and routing
Embedding3Generate vector representations
Image3Image processing and OCR
Preprocessor2Chunking and code processing
Audio2Transcription and playback
Data2Document parsing
Infrastructure2Output and export
Video1Frame extraction
Database1Direct database access

For a full breakdown, see the Nodes Overview.

Lanes

Lanes are the connections between nodes. Each node has typed input lanes and output lanes that define what data it accepts and produces. You wire nodes together by connecting an output lane of one node to a compatible input lane of another. Some nodes (like agents or LLMs) can also be invoked as tools by a parent node.

Source Types

Every pipeline begins with a source node that defines how data enters:

  • Webhook — Receives data via HTTP requests
  • Chat — Interactive conversational interface
  • Dropper — File-based input via drag-and-drop

Where to Go Next

  • Quickstart — Go from zero to a running pipeline in minutes.
  • Guides — Explore pre-built pipelines and common use cases.
  • Nodes Overview — Browse all 68 nodes by category.