Introduction to Apicurio Registry
This chapter introduces Apicurio Registry concepts and features and provides details on the supported artifact types that are stored in the registry:
What is Apicurio Registry?
Apicurio Registry is a datastore for sharing standard event schemas and API designs across event-driven and API architectures. You can use Apicurio Registry to decouple the structure of your data from your client applications, and to share and manage your data types and API descriptions at runtime using a REST interface.
Client applications can dynamically push or pull the latest schema updates to or from Apicurio Registry at runtime without needing to redeploy. Developer teams can query Apicurio Registry for existing schemas required for services already deployed in production, and can register new schemas required for new services in development.
You can enable client applications to use schemas and API designs stored in Apicurio Registry by specifying the Apicurio Registry URL in your client application code. Apicurio Registry can store schemas used to serialize and deserialize messages, which are referenced from your client applications to ensure that the messages that they send and receive are compatible with those schemas.
Using Apicurio Registry to decouple your data structure from your applications reduces costs by decreasing overall message size, and creates efficiencies by increasing consistent reuse of schemas and API designs across your organization. Apicurio Registry provides a web console to make it easy for developers and administrators to manage registry content.
You can configure optional rules to govern the evolution of your Apicurio Registry content. These include rules to ensure that uploaded content is valid, or is compatible with other versions. Any configured rules must pass before new versions can be uploaded to Apicurio Registry, which ensures that time is not wasted on invalid or incompatible schemas or API designs.
Apicurio Registry capabilities
-
Multiple payload formats for standard event schema and API specifications such as Apache Avro, JSON Schema, Google Protobuf, AsyncAPI, OpenAPI, and more.
-
Pluggable Apicurio Registry storage options for storing content in Apache Kafka or PostgreSQL database.
-
Rules for content validation, compatibility, and integrity to govern how Apicurio Registry content evolves over time.
-
Apicurio Registry content management using web console, REST API, command line, Maven plug-in, or language SDKs.
-
Full Apache Kafka schema registry support, including integration with Kafka Connect for external systems.
-
Kafka client serializers/deserializers (SerDes) to validate message types at runtime.
-
Compatibility with existing Confluent schema registry client applications.
-
Cloud-native Quarkus Java runtime for low memory footprint and fast deployment times.
-
Operator-based installation of Apicurio Registry on OpenShift/Kubernetes.
-
OpenID Connect (OIDC) authentication using Keycloak.
Schema and API artifacts in Apicurio Registry
The items stored in Apicurio Registry, such as event schemas and API designs, are known as registry artifacts. The following shows an example of an Apache Avro schema artifact in JSON format for a simple share price application:
{
"type": "record",
"name": "price",
"namespace": "com.example",
"fields": [
{
"name": "symbol",
"type": "string"
},
{
"name": "price",
"type": "string"
}
]
}
When a schema or API design is added as an artifact in Apicurio Registry, client applications can then use that schema or API design to validate that the client messages conform to the correct data structure at runtime.
Artifacts have metadata, both generated and editable. Standard metadata for an artifact includes (but may not be limited to):
Editable properties
-
name
-
description
-
labels
-
owner
Artifact Versions
Every artifact is composed of zero or more _artifact version_s. Only artifact versions have actual content (as well as metadata). These versions represent the evolution of the content of an artifact, and are immutable. You can think of an Artifact as an ordered sequence of Versions, typically with the most recent version representing the "current" schema or API design content.
Artifact Versions have metadata, both generated and editable. Standard metadata for an artifact version includes (but may not be limited to):
Generated or immutable properties
-
groupId
-
artifactId
-
version
-
globalId
-
contentId
-
owner
-
createdOn
-
modifiedBy
-
modifiedOn
Editable properties
-
name
-
description
-
labels
-
state
Groups of schemas and APIs
An artifact group is an optional named collection of schema or API artifacts. Each group contains a logically related set of schemas or API designs, typically managed by a single entity, belonging to a particular application or organization.
You can create optional artifact groups when adding your schemas and API designs to organize them in Apicurio Registry. For example, you could create groups to match your development
and production
application environments, or your sales
and engineering
organizations.
Schema and API groups can contain multiple artifact types. For example, you could have Protobuf, Avro, JSON Schema, OpenAPI, or AsyncAPI artifacts all in the same group.
Groups have metadata, both generated and editable. Standard metadata for a group includes (but may not be limited to):
Editable properties
-
description
-
labels
You can create schema and API artifacts and groups using the Apicurio Registry web console, REST API, command line, Maven plug-in, or Java client application.
Specifying a group is optional when using the Apicurio Registry web console, and the default group is used automatically. When using the REST API or Maven plug-in, specify the default group in the API path if you do not want to create a unique group.
|
-
For information on supported artifact types, see Apicurio Registry artifact reference.
-
For information on the Core Registry API, see the Apicurio Registry REST API documentation.
References to other schemas and APIs
Some Apicurio Registry artifact types can include artifact references from one artifact file to another. You can create efficiencies by defining reusable schema or API components, and then referencing them from multiple locations. For example, you can specify a reference in JSON Schema or OpenAPI using a $ref
statement, or in Google Protobuf using an import
statement, or in Apache Avro using a nested namespace.
The following example shows a simple Avro schema named TradeKey
that includes a reference to another schema named Exchange
using a nested namespace:
{
"namespace": "com.kubetrade.schema.trade",
"type": "record",
"name": "TradeKey",
"fields": [
{
"name": "exchange",
"type": "com.kubetrade.schema.common.Exchange"
},
{
"name": "key",
"type": "string"
}
]
}
{
"namespace": "com.kubetrade.schema.common",
"type": "enum",
"name": "Exchange",
"symbols" : ["GEMINI"]
}
An artifact reference is stored in Apicurio Registry as a collection of artifact metadata that maps from an artifact type-specific reference to an internal Apicurio Registry reference. Each artifact reference in Apicurio Registry is composed of the following:
-
Group ID
-
Artifact ID
-
Artifact version
-
Artifact reference name
You can manage artifact references using the Apicurio Registry core REST API, Maven plug-in, and Java serializers/deserializers (SerDes). Apicurio Registry stores the artifact references along with the artifact content. Apicurio Registry also maintains a collection of all artifact references so you can search them or list all references for a specific artifact.
Supported artifact types
Apicurio Registry currently supports artifact references for the following artifact types only:
-
Avro
-
Protobuf
-
JSON Schema
-
OpenAPI
-
AsyncAPI
-
For details on managing artifact references, see:
-
For a Java example, see the Apicurio Registry SerDes with references demonstration.
Manage content using the Apicurio Registry web console
You can use the Apicurio Registry web console to browse and search the schema and API artifacts and optional groups stored in the registry, and to add new schema and API artifacts, groups, and versions. You can search for artifacts by label, name, group, and description. You can view an artifact’s content or its available versions, or download an artifact file locally.
You can also configure optional rules for registry content, globally, for groups, and for each schema and API artifact. These optional rules for content validation and compatibility are applied when new schema and API artifacts or versions are uploaded to the registry.
For more details, see Apicurio Registry rule reference.
The Apicurio Registry web console is available from http://MY_REGISTRY_UI_URL/
.
Apicurio Registry REST API for clients
Client applications can use the Core Registry API v3 to manage the schema and API artifacts in Apicurio Registry. This API provides operations for the following features:
- Admin
-
Export or import Apicurio Registry data in a
.zip
file, and manage logging levels for the Apicurio Registry at runtime. - Groups
-
Manage groups of artifacts in Apicurio Registry. You can create groups to better organize your artifacts.
- Group Rules
-
Configure rules to govern the content evolution of schemas or API artifacts within a specific group to prevent invalid or incompatible content from being added to Apicurio Registry. Group rules override any global rules configured.
- Artifacts
-
Manage schema and API artifacts stored in Apicurio Registry.
- Artifact metadata
-
Manage details about a schema or API artifact. You can edit details such as artifact name, description, or labels. Details such as artifact group, and when the artifact was created or modified are read-only.
- Artifact rules
-
Configure rules to govern the content evolution of a specific schema or API artifact to prevent invalid or incompatible content from being added to Apicurio Registry. Artifact rules override any group and/or global rules configured.
- Artifact versions
-
Manage the sequence of versions that make up the content of a schema or API artifact. You can also manage the lifecycle state of an artifact version: enabled, disabled, or deprecated.
- Global rules
-
Configure rules to govern the content evolution of all schema and API artifacts to prevent invalid or incompatible content from being added to Apicurio Registry. Global rules are applied only if an artifact and its group do not have rules configured.
- Search
-
Browse or search for schema and API artifacts and versions, for example, by name, group, description, or label.
- System
-
Get the Apicurio Registry version and the limits on resources for the Apicurio Registry server.
- Users
-
Get the current Apicurio Registry user.
Compatibility with other schema registry REST APIs
Apicurio Registry also provides compatibility with the following schema registries by including implementations of their respective REST APIs:
-
Apicurio Registry Core Registry API v2
-
Confluent Schema Registry API v7
Applications using Confluent client libraries can use Apicurio Registry as a drop-in replacement.
-
For more information on the Core Registry API v3, see the Apicurio Registry REST API documentation.
-
For API documentation on the Core Registry API v3 and all compatible APIs, browse to the
/apis
endpoint of your Apicurio Registry, for example,http://MY-REGISTRY-URL/apis
.
Apicurio Registry storage options
Apicurio Registry provides the following options for the underlying storage of registry data:
Storage option | Description |
---|---|
In-memory |
The in-memory storage option is suitable for a development environment only. All data is lost when restarting Apicurio Registry with this storage. The PostgreSQL or Kafka storage option is recommended for a production environment. |
PostgreSQL database |
PostgreSQL is the recommended data storage option for performance, stability, and data management (backup/restore, and so on) in a production environment. |
Apache Kafka |
Kafka storage is provided for production environments where database management expertise is not available, or where storage in Kafka is a specific requirement. |
Validate Kafka messages using schemas and Java client serializers/deserializers
Kafka producer applications can use serializers to encode messages that conform to a specific event schema. Kafka consumer applications can then use deserializers to validate that messages have been serialized using the correct schema, based on a specific schema ID.
Apicurio Registry provides Kafka client serializers/deserializers (SerDes) to validate the following message types at runtime:
-
Apache Avro
-
Google Protobuf
-
JSON Schema
The Apicurio Registry Maven repository and source code distributions include the Kafka SerDes implementations for these message types, which Kafka client application developers can use to integrate with Apicurio Registry.
These implementations include custom Java classes for each supported message type, for example, io.apicurio.registry.serde.avro
, which client applications can use to pull schemas from Apicurio Registry at runtime for validation.
Stream data to external systems with Kafka Connect converters
You can use Apicurio Registry with Apache Kafka Connect to stream data between Kafka and external systems. Using Kafka Connect, you can define connectors for different systems to move large volumes of data into and out of Kafka-based systems.
Apicurio Registry provides the following features for Kafka Connect:
-
Storage for Kafka Connect schemas
-
Kafka Connect converters for Apache Avro and JSON Schema
-
Core Registry API to manage schemas
You can use the Avro and JSON Schema converters to map Kafka Connect schemas into Avro or JSON schemas. These schemas can then serialize message keys and values into the compact Avro binary format or human-readable JSON format. The converted JSON is less verbose because the messages do not contain the schema information, only the schema ID.
Apicurio Registry can manage and track the Avro and JSON schemas used in the Kafka topics. Because the schemas are stored in Apicurio Registry and decoupled from the message content, each message must only include a tiny schema identifier. For an I/O bound system like Kafka, this means more total throughput for producers and consumers.
The Avro and JSON Schema serializers and deserializers (SerDes) provided by Apicurio Registry are used by Kafka producers and consumers in this use case. Kafka consumer applications that you write to consume change events can use the Avro or JSON SerDes to deserialize these events. You can install the Apicurio Registry SerDes in any Kafka-based system and use them along with Kafka Connect, or with a Kafka Connect-based system such as Debezium.
Apicurio Registry demonstration examples
Apicurio Registry provides open source example applications that demonstrate how to use Apicurio Registry in different use case scenarios. For example, these include storing schemas used by Kafka serializer and deserializer (SerDes) Java classes. These classes fetch the schema from Apicurio Registry for use when producing or consuming operations to serialize, deserialize, or validate the Kafka message payload.
These applications demonstrate use cases such as the following examples:
-
Apache Avro Kafka SerDes
-
Apache Avro Maven plug-in
-
Apache Camel Quarkus and Kafka
-
CloudEvents
-
Confluent Kafka SerDes
-
Custom ID strategy
-
Event-driven architecture with Debezium
-
Google Protobuf Kafka SerDes
-
JSON Schema Kafka SerDes
-
REST clients
-
For more details, see https://github.com/Apicurio/apicurio-registry/tree/main/examples
Apicurio Registry available distributions
Apicurio Registry provides the following components as part of its distribution.
Component | Container Image |
---|---|
Back-end |
|
User Interface |
|
Apicurio Registry Operator |
https://hub.docker.com/r/apicurio/apicurio-registry-operator |
-
For details on building from source code and running Docker images, see https://github.com/Apicurio/apicurio-registry
-
For details on using the Operator to deploy, see https://github.com/Apicurio/apicurio-registry/tree/main/operator