Chapter 5. Hopsworks Feature Store
In this chapter, we will look in depth at the Hopsworks feature store. Hopsworks is a platform for the development and operation of batch, real-time, and LLM AI systems at scale. It can be installed on as little as one server or as many as hundreds of servers. Hopsworks includes a feature store as well as a complete MLOps and compute platform, but we will focus on the feature store in this chapter. We will show how to implement the data model for our credit card fraud model from Chapter 4 in Hopsworks. We will also see how the feature store concepts from the previous chapter are represented in Hopsworks using code snippets in Python. We will start with projects in Hopsworks—a secure, collaborative space for storing your feature data, training data, and models.
Hopsworks Projects
A Hopsworks cluster is organized into projects, where each project has a unique name. Hopsworks projects are secure spaces for teams to collaborate and manage data and models for AI. Similar to a repository in GitHub, a project has team members (with role-based access control), but instead of storing source code, Hopsworks projects store data for AI. Each project has its own feature store, a model registry, model deployments, and datasets for general-purpose file storage.
The following code snippet shows how to get a reference to a project object when you log in to Hopsworks. If you do not enter the name of the project, Hopsworks will return a reference to your main ...