Experimentation: Define and Interact with Resources
Once your data infrastructure is connected, you can begin the experimentation phase. In this phase, you register primary data sets and define transformations, features, and training sets. Offline stores offer methods likeregister_table
, register_file
, and register_directory
for registering initial data sets. Subsequently, you define transformations that build derivative training sets. Transformations involve decorating a method with metadata, specifying where it runs, inputs, and other relevant details.
client.dataframe
call.
client.dataframe
accepts either a tuple specifying the name and variant or the function object or data set object itself:
Production: Serve Features and Training Sets
In the production phase, you register features and labels based on data sets defined in the experimentation phase. This enables the creation of training sets and real-time feature serving. Features and labels are associated with entities, which function as primary keys.ff.register_training_set
method to create training sets by joining labels and features based on entity keys.
client.features
or client.training_set
methods.