Estuary

Estuary allows you to build real-time ETL/ELT data pipelines between various platforms supported by an array of connectors.

A video walkthrough can be found below:

Prerequisites

  1. WarpStream account - get access to WarpStream by registering here.

  2. Estuary account - get access to Estuary by registering here.

  3. Serverless WarpStream cluster up and running.

Step 1: Create a topic in your WarpStream cluster

You will need a WarpStream cluster, credentials, and at least one Topic with data available. If you do not have an active topic, steps 1 and 2 of this guide will provide a simple example to test with.

Obtain the Bootstrap Broker from the WarpStream console by navigating to your cluster and clicking the Connect tab. If you don't have SASL credentials yet, you can also create a set of credentials from the console.

Store these values for easy reference; they will be needed in Upsolver. If you are going to produce records to your topic from the command line, then export them as environment variables in a terminal window:

export BOOTSTRAP_HOST=<YOUR_BOOTSTRAP_BROKER> \
SASL_USERNAME=<YOUR_SASL_USERNAME> \
SASL_PASSWORD=<YOUR_SASL_PASSWORD>;

Then, create a topic in the WarpStream console if you don't already have one.

Step 2: Produce some records

You can use the WarpStream CLI to produce messages to your topic, in this case estuary_demo:

warpstream kcmd -bootstrap-host $BOOTSTRAP_HOST -tls -username $SASL_USERNAME -password $SASL_PASSWORD -type produce -topic estuary_demo --records '{"action": "click", "user_id": "user_0", "page_id": "home"},,{"action": "hover", "user_id": "user_0", "page_id": "home"},,{"action": "scroll", "user_id": "user_0", "page_id": "home"},,{"action": "click", "user_id": "user_1", "page_id": "home"},,{"action": "click", "user_id": "user_1", "page_id": "home"},,{"action": "click", "user_id": "user_2", "page_id": "home"}'

Note that the WarpStream CLI uses double commas (,,) as a delimiter between JSON records.

Step 3: Connect Estuary to WarpStream

In the Estuary Web Console, select Sources from the left-hand navigation. Then click "+ NEW CAPTURE." Search/select "Kafka":

Next, fill in your WarpStream endpoint configuration in Estuary, as illustrated below:

That's everything that you will need from WarpStream.

Next Steps

With WarpStream configured, all that's left is to choose your Destination in Estuary and configure it as needed.

Next, check out the WarpStream docs for configuring the WarpStream Agent, or review the Estuary docs to learn more about what is possible with WarpStream and Estuary!

Last updated