provision
This guide shows you how to provision a Datafye Foundry deployment in Amazon Web Services.
Prerequisites
Before you begin, ensure:
The Datafye CLI is installed (see Installation)
AWS credentials are configured (see AWS Setup)
You have created the required descriptor files (see Deployment Descriptors)
Provision Command
Use the following command to provision an AWS Foundry deployment:
datafye foundry aws provision [options]Parameters
Required Parameters
--name <name>
Deployment name (must be unique per AWS profile)
--profile <profile>
AWS CLI profile to use (configured in AWS setup)
Descriptor Parameters
You must provide descriptor files using one of the following approaches:
Option 1: Combined descriptor file
--descriptor <path>
Path to a combined descriptor file containing data (and optionally algo) configuration
Option 2: Separate descriptor files
--data-descriptor <path>
Path to data descriptor file (required)
--algo-descriptor <path>
Path to algo descriptor file (optional - only for Full Stack scenario)
Optional Parameters
--region <region>
AWS region to provision in
us-east-1
--cidr <cidr>
CIDR block for VPC (format: x.y.z.w/nn)
10.0.0.0/16
Foundry Scenarios
AWS Foundry deployments support two scenarios:
Data Cloud Only
Provisions only the Data Cloud, allowing you to use your own algo containers.
Required descriptors:
Data descriptor only
Example:
Full Stack
Provisions Data Cloud, Algo Container runtime, Backtesting Engine, and MCP Server.
Required descriptors:
Data descriptor
Algo descriptor
Example:
Or using a combined descriptor:
Provisioning Process
During provisioning, the CLI will:
Validate descriptors - Check schema, semantics, and dependencies
Create VPC and networking - Provision Virtual Private Cloud, subnets, security groups
Provision EC2 instances - Create virtual machines for Datafye services
Deploy services - Install Data Cloud (and Algo Container/Backtesting/MCP if Full Stack)
Configure connectivity - Connect to data providers using your credentials
Run health checks - Verify all services are running correctly
Display access information - Show URLs and connection details
AWS provisioning typically takes 10-20 minutes. Do not interrupt the provisioning process.
After Provisioning
Once provisioning completes successfully, the CLI will display:
Deployment ID and status
API base URLs for accessing Datafye services
WebSocket endpoint for real-time data streams
MCP server endpoint (Full Stack only)
SSH access information (if needed for troubleshooting)
Accessing Your Deployment
The AWS deployment will be accessible via the URLs provided in the output:
Data Cloud REST API:
http://<deployment-endpoint>/datafye-api/v1/Data Cloud WebSocket:
ws://<deployment-endpoint>/datafye-wsMCP Server (Full Stack only):
http://<mcp-endpoint>:3000
Validate Connectivity
Test your deployment with:
Your AWS Foundry deployment is now running and ready to use!
Next Steps
After provisioning:
Data Cloud Only: Connect your own algo containers to the Data Cloud APIs (see Building Your First Algo - Using Own Container)
Full Stack: Start developing your algo using the SDK (see Building Your First Algo - Using Datafye Container)
Run backtests: Test your strategy with historical data (see Backtesting Your Algo)
Troubleshooting
AWS Credential Errors
If you see authentication errors:
Verify your AWS profile is configured:
aws sts get-caller-identity --profile datafyeEnsure your credentials have the necessary permissions (see AWS Setup)
Check that you're using the correct profile name
Deployment Name Conflicts
If you receive an error about the deployment name already existing:
Choose a different deployment name with
--nameOr deprovision the existing deployment:
datafye foundry aws deprovision --name <existing-name> --profile <profile>Retry the provision command
Resource Limit Errors
If you hit AWS resource limits:
Check your AWS account limits in the EC2 dashboard
Request limit increases if needed through AWS Support
Or deprovision unused deployments to free resources
VPC CIDR Conflicts
If you encounter VPC address conflicts:
Try a different CIDR block with
--cidr(e.g.,10.1.0.0/16,10.2.0.0/16)Ensure the CIDR doesn't conflict with existing VPCs in your account
Check existing VPCs:
aws ec2 describe-vpcs --profile <profile>
Region Unavailability
If services are not available in your chosen region:
Verify the region supports required AWS services (EC2, VPC, Route53)
Try a different region with
--region(e.g.,us-east-1,us-west-2)Contact Datafye support if you require a specific region
Descriptor Validation Errors
If descriptor validation fails:
Review the error message for specific field issues
Check descriptor format matches the schema (see Deployment Descriptors)
Verify referenced resources exist (container images, symbols)
Ensure credentials are provided via environment variables if using
${VAR}syntax
Data Provider Connection Errors
If data provider connections fail:
Verify your data provider credentials are correct
Check environment variables are set if using variable substitution
Ensure your data provider account is active and has necessary permissions
Review data provider status (some providers may have API outages)
Related Commands
Start an AWS Foundry - Start a stopped deployment
Stop an AWS Foundry - Stop a running deployment
Deprovision an AWS Foundry - Remove the deployment
Last updated: 2025-10-22
Last updated

