Data Providers

As a verified data provider partner, you can share and monetize your datasets through the Earthmover Marketplace. List free datasets to make your data more accessible to the community, or offer paid subscriptions on your own terms.

Your data lives in Icechunk repositories within your Arraylake organization. When you create a listing, subscribers from other organizations can discover your dataset on the Marketplace and subscribe to access it, including any updates you publish.

Becoming a Data Provider

Only verified organizations can publish listings to the Marketplace. If you're interested in becoming a data provider, contact sales@earthmover.io to get started.

Managing Listings

Listings are how you expose your data to the Marketplace. You can create, edit, and publish listings from the Marketplace tab in your organization settings.

[Figure: Marketplace listings management page]

Creating a Listing

To create a new listing:

  1. Navigate to the Marketplace tab in your organization settings
  2. Click + Create Listing
  3. Fill in the listing details:
Field | Description
--- | ---
Repository | Select the repo you want to list. You can leave this blank while drafting and add it before publishing. If you publish your listing without a repo attached, it will be marked "Coming soon" in the Marketplace and potential users will be able to contact you to register their interest.
Listing Name | A clear, descriptive name for your dataset.
Description | A brief summary that appears in Marketplace search results.
Thumbnail URL | An image URL to visually represent your dataset. These thumbnails are prominently displayed in the Marketplace.
Status | Set to Unpublished while drafting. This lets you preview and iterate before making it public.
Pricing Model | Choose Free or Paid. Free datasets can be subscribed to instantly by anyone with an Arraylake account. Paid datasets require you to finalize terms with each subscriber before they gain access.
README Content | Detailed documentation for your dataset. Click "Use Template" for a starting point. Include variables, coordinates, update frequency, and anything that helps users work with the data.
License | Define what subscribers can do with your data. You can select a Creative Commons license, link out to an existing license, or add custom terms.
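
As an illustration of what the README Content field might cover, the following sketch assembles a plain-text README skeleton from a few facts about a dataset. It does not reproduce the Marketplace's "Use Template" output; the function name `build_readme`, its parameters, and the section headings are all assumptions chosen for this example.

```python
def build_readme(name, variables, coordinates, update_frequency, notes=""):
    """Assemble a plain-text README skeleton for a dataset listing.

    Hypothetical helper: the section names are illustrative and do not
    reproduce the Marketplace's own template.
    """
    lines = [name, "", "Variables"]
    # One line per variable: name, units, and a short description.
    for var_name, (units, description) in variables.items():
        lines.append(f"  - {var_name} [{units}]: {description}")
    lines += ["", "Coordinates"]
    for coord_name, description in coordinates.items():
        lines.append(f"  - {coord_name}: {description}")
    lines += ["", "Update frequency", f"  {update_frequency}"]
    if notes:
        lines += ["", "Notes", f"  {notes}"]
    return "\n".join(lines)


readme = build_readme(
    name="Example Surface Temperature",  # made-up dataset
    variables={"t2m": ("K", "2 m air temperature")},
    coordinates={"time": "hourly, UTC", "lat": "degrees north", "lon": "degrees east"},
    update_frequency="Daily",
)
print(readme)
```

The resulting text can be pasted into the README Content field and expanded by hand.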

Once you're happy with your listing, set the status to Published to make it live on the Marketplace!
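
The way the Repository, Status, and Pricing Model fields combine can be summarized in a few lines of code. This is only a model of the rules described above, not Arraylake code: `ListingDraft` and `marketplace_appearance` are hypothetical names, and the returned strings are descriptive labels invented for the example (apart from "Coming soon").

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ListingDraft:
    """Hypothetical stand-in for a listing; not an Arraylake type."""
    name: str
    status: str = "Unpublished"      # "Unpublished" or "Published"
    pricing_model: str = "Free"      # "Free" or "Paid"
    repository: Optional[str] = None  # may stay empty while drafting


def marketplace_appearance(listing: ListingDraft) -> str:
    """Describe how a listing would appear, following the field rules above."""
    if listing.status != "Published":
        return "not public"
    if listing.repository is None:
        # Published without a repo attached.
        return "Coming soon"
    if listing.pricing_model == "Free":
        return "live, instant subscription"
    return "live, terms finalized with each subscriber"


draft = ListingDraft(name="Example Dataset")
print(marketplace_appearance(draft))
```

Walking a draft through these states before publishing is a quick way to confirm that the listing will appear the way you intend.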

How Subscriptions Work

When someone subscribes to your listing, a read-only mirror of your Arraylake repo appears in their organization. Subscribers read data directly from your object store; no data is copied. Subscribers can also view your repo's history and commit messages.
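
On the subscriber side, browsing that history might look like the following sketch. It assumes that the Arraylake Python client's `Client.get_repo` returns an Icechunk `Repository` for the mirrored repo and that `Repository.ancestry` yields snapshots with `id` and `message` attributes, newest first; check both against the client version you have installed. The helper names and the repo name in the usage note are placeholders.

```python
from itertools import islice


def recent_commits(repo, branch="main", limit=5):
    """Return (snapshot id, message) pairs for the newest commits on a branch.

    Assumes `repo` exposes an Icechunk-style `ancestry(branch=...)` iterator
    whose items carry `id` and `message` attributes.
    """
    snapshots = islice(repo.ancestry(branch=branch), limit)
    return [(snap.id, snap.message) for snap in snapshots]


def open_subscribed_repo(name):
    """Open a subscribed repo by its "org/repo" name.

    Requires the arraylake package and valid credentials; imported lazily so
    that recent_commits can be used and tested without them.
    """
    from arraylake import Client

    client = Client()
    return client.get_repo(name)
```

A subscriber would call `open_subscribed_repo("my-org/subscribed-dataset")` with their own repo name and pass the result to `recent_commits`. Because the mirror is read-only, this kind of inspection cannot modify the provider's data.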

Egress Considerations

Because subscribers read data directly from your object store bucket, you may incur egress costs when they access it. To minimize these costs, we recommend hosting your data on storage with free or reduced egress pricing.

Keep this in mind for free, public datasets in particular: anyone with an Arraylake account can subscribe, so read access is unbounded.
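
A rough way to reason about this exposure is to multiply expected read volume by your storage provider's egress rate. The sketch below does that arithmetic; the function name, the free-allowance parameter, and the numbers in the usage example are placeholders, not real prices or Marketplace figures.

```python
def estimate_monthly_egress_cost(
    gib_read_per_subscriber: float,
    subscribers: int,
    price_per_gib: float,
    free_gib: float = 0.0,
) -> float:
    """Estimate a provider's monthly egress bill for one listing.

    All inputs are assumptions you supply: average GiB each subscriber reads
    per month, number of subscribers, your storage provider's egress price
    per GiB, and any free monthly allowance.
    """
    total_gib = gib_read_per_subscriber * subscribers
    # Only traffic beyond the free allowance is billed.
    billable_gib = max(0.0, total_gib - free_gib)
    return billable_gib * price_per_gib


# Placeholder numbers: 20 subscribers reading 50 GiB each at 0.09 per GiB,
# with the first 100 GiB free.
cost = estimate_monthly_egress_cost(50, 20, 0.09, free_gib=100)
print(f"Estimated monthly egress cost: {cost:.2f}")
```

Setting `price_per_gib` to zero models storage with free egress, in which case the estimate is zero regardless of how many subscribers read the data.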

Coming Soon

We're working on an option for subscribers to materialize data into their own object store. This will allow subscribers to copy data locally for faster access, shifting egress costs to a one-time transfer.

Listing Metrics

Each listing includes metrics so you can understand how your data is being used. From the listing details page, you can view:

  • Total subscriptions — How many organizations have subscribed
  • Total access — Aggregate read activity across all subscribers
  • Unique viewers — Number of distinct users accessing your data
  • Subscribed organizations — A list of organizations currently subscribed to your dataset

[Figure: Listing metrics dashboard]

You can also view aggregated metrics across all your listings on your organization dashboard. These metrics help you understand the reach and impact of your data.