Serverless deployments

Note: GenSX Cloud is currently in developer preview.

Deploy your GenSX workflows as serverless APIs with support for both synchronous and asynchronous execution, as well as long-running operations.

Deploy with the CLI

Projects are a collection of workflows and environment variables that deploy together into an environment that you configure.

Each project has a gensx.yaml file at the root and a workflows.ts file that exports all of your deployable workflows.

Run gensx deploy from the root of your project to deploy it:


# Deploy the workflow file with default settings
npx gensx deploy src/workflows.ts
 
# Deploy with environment variables
npx gensx deploy src/workflows.ts -e OPENAI_API_KEY

Environment variables are encrypted with per-project encryption keys.

Deploying to different environments

GenSX supports multiple environments within a project (such as development, staging, and production) to help manage your deployment lifecycle.


# Deploy to a specific environment
npx gensx deploy src/workflows.ts --env production
 
# Deploy to staging with environment-specific variables
npx gensx deploy src/workflows.ts --env staging -e OPENAI_API_KEY -e LOG_LEVEL=debug

Each environment can have its own configuration and environment variables, allowing you to test in isolation before promoting changes to production.

When you deploy a workflow, GenSX:

Builds your TypeScript code for production
Bundles your dependencies
Uploads the package to GenSX Cloud
Configures serverless infrastructure
Creates API endpoints for each exported workflow
Encrypts and sets up environment variables
Activates the deployment

The entire process typically takes 15 seconds.

Running workflows from the CLI

Once deployed, you can execute workflows directly from the CLI:


# Run a workflow synchronously with input data
npx gensx run MyWorkflow --input '{"prompt":"Generate a business name"}' --project my-app
 
# Run and save the output to a file
npx gensx run MyWorkflow --input '{"prompt":"Generate a business name"}' --output results.json
 
# Run asynchronously (start the workflow but don't wait for completion)
npx gensx run MyWorkflow --input '{"prompt":"Generate a business name"}' --project my-app

CLI run options

Option	Description
`--input`	JSON string with input data
`--no-wait`	Do not wait for workflow to finish
`--output`	Save results to a file
`--project`	Specify the project name
`--env`	Specify the environment name

API endpoints

Each workflow is exposed as an API endpoint:


https://api.gensx.com/org/{org}/projects/{project}/environments/{environment}/workflows/{workflow}

{org} - Your organization ID
{project} - Your project name
{environment} - The environment (defaults to “default”)
{workflow} - The name of your workflow

For example, if you have a workflow named BlogWriter in project content-tools, the endpoint would be:


https://api.gensx.com/org/your-org/projects/content-tools/environments/default/workflows/BlogWriter

Authentication

All GenSX Cloud API endpoints require authentication using your GenSX API key as a bearer token:


curl -X POST https://api.gensx.com/org/your-org/projects/your-project/environments/default/workflows/YourWorkflow \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Tell me about GenSX"}'

Obtaining an API Key

To generate or manage API keys:

Log in to the GenSX Cloud console
Navigate to Settings > API Keys
Create a new key

Execution modes

Synchronous Execution

By default, API calls execute synchronously, returning the result when the workflow completes:


curl -X POST https://api.gensx.com/org/your-org/projects/your-project/environments/default/workflows/YourWorkflow \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Tell me about GenSX"}'

Asynchronous execution

For longer-running workflows, use asynchronous execution by calling the /start endpoint:


# Request asynchronous execution
curl -X POST https://api.gensx.com/org/your-org/projects/your-project/environments/default/workflows/YourWorkflow/start \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Tell me about GenSX"}'
 
# Response includes an execution ID
# { "executionId": "exec_123abc" }
 
# Check status later
curl -X GET https://api.gensx.com/executions/exec_123abc \
  -H "Authorization: Bearer your-api-key"

Streaming responses

For workflows that support streaming, you can receive tokens as they’re generated:


curl -X POST https://api.gensx.com/org/your-org/projects/your-project/environments/default/workflows/YourWorkflow \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Tell me about GenSX", "stream": true }'

The response is delivered as a stream of server-sent events (SSE).

Execution time limits

GenSX Cloud is optimized for long-running workflows and agents, with generous execution time limits:

Plan	Maximum Execution Time
Free Tier	Up to 5 minutes
Pro Tier	Up to 60 minutes
Enterprise	Custom limits available

These extended timeouts make GenSX ideal for complex AI workflows that might involve:

Multiple LLM calls in sequence
Real-time agent tool use
Complex data processing
Extensive RAG operations

Cold starts and performance

The GenSX Cloud serverless architecture is designed to minimize cold starts:

Fast cold starts: Cold starts typically range from ~100ms
Warm execution: Subsequent executions of recently used workflows start in 1-5ms
Auto-scaling: Infrastructure automatically scales with workloads

Managing deployments in the console

GenSX Cloud provides a console to run, debug, and view all of your workflows.

Viewing workflows

View projects in the console

Log in to app.gensx.com
Navigate to your project and environment
The workflows tab shows all deployed workflows with status information
Click on a workflow to view its details, including schema, recent executions, and performance metrics

The workflow page includes API documentation and code snippets that you can copy/paste to run your workflow from within another app:

View projects in the console

Running workflows manually

You can test workflows directly from the console:

Navigate to the workflow detail page
Click the “Run” button
Enter JSON input in the provided editor
Choose execution mode (sync, async, or streaming)
View results directly in the console

Run workflows in the console

Viewing execution history

Each workflow execution generates a trace you can review:

Navigate to the “Executions” tab in your project
Browse the list of recent executions
Click on any execution to see detailed traces
Explore the component tree, inputs/outputs, and execution timeline

Serverless deployments

Deploy with the CLI

Deploying to different environments

Running workflows from the CLI

CLI run options

API endpoints

Authentication

Obtaining an API Key

Execution modes

Synchronous Execution

Asynchronous execution

Streaming responses

Execution time limits

Cold starts and performance

Managing deployments in the console

Viewing workflows

Running workflows manually

Viewing execution history

Next steps

Serverless deployments

Deploy with the CLI

Deploying to different environments

Running workflows from the CLI

CLI run options

API endpoints

Authentication

Obtaining an API Key

Execution modes

Synchronous Execution

Asynchronous execution

Streaming responses

Execution time limits

Cold starts and performance

Managing deployments in the console

Viewing workflows

Running workflows manually

Viewing execution history

Next steps