AWS
This quickstart will walk through creating a few simple features, labels, and a training set using Postgres and Redis. We will use a transaction fraud training set.
Step 1: Install Featureform client
Requirements
- Python 3.7+
Install the Featureform SDK via Pip.
Step 2: Deploy EKS
You can follow our Minikube or Kubernetes deployment guide. This will walk through a simple AWS deployment of Featureform with our quick start Helm chart containing Postgres and Redis.
Install the AWS CLI and eksctl then run the following command to create an EKS cluster.
Newer versions of eksctl
require you to separately add a Container Storage Interface (CSI) driver to support Persistent Volume Claims. For complete details on adding the Amazon EBS CSI driver to your EKS cluster, see Managing the Amazon EBS CSI driver as an Amazon EKS add-on; however, the below examples should allow for a simple deployment.
Create an Amazon EBS CSI Driver IAM Role
Create the Amazon EBS CSI Add-On
To easily find the account ID you used to create the cluster, run:
Then, to add the Amazon EBS CSI add-on, run:
Step 3: Install Helm charts
We’ll be installing three Helm Charts: Featureform, the Quickstart Demo, and Certificate Manager.
First we need to add the Helm repositories.
Prior to installing the Helm charts, export your FEATUREFORM_HOST
value:
Now we can install the Helm charts.
Step 4: Register providers
The Quickstart helm chart creates a Postgres instance with preloaded data, as well as an empty Redis standalone instance. Now that they are deployed, we can write a config file in Python.
Once we create our config file, we can apply it to our Featureform deployment.
Step 6: Define our resources
We will create a user profile for us, and set it as the default owner for all the following resource definitions.
Now we’ll register our user fraud dataset in Featureform.
Next, we’ll define a SQL transformation on our dataset.
Next, we’ll register a passenger entity to associate with a feature and label.
Finally, we’ll join together the feature and label into a training set.
Now that our definitions are complete, we can apply it to our Featureform instance.
Step 7: Serve features for training and inference
Once we have our training set and features registered, we can train our model.
We can serve features in production once we deploy our trained model as well.