Skip to main content
Ingesting an API means pulling data from a SaaS platform’s API endpoint, transforming the responses into data streams, and then storing each stream as a table in NexusOne’s Iceberg lakehouse. This guide walks you through how to ingest an API.

Prerequisites

Select an API source

Selecting an API source is equivalent to selecting the SaaS platform you’ll be ingesting data from.
  1. Log in to NexusOne.
  2. From the NexusOne homepage, navigate to Ingest > API
  3. In the Source section, select a source connector from the drop-down list.
  4. Depending on the connector you select, enter its authentication configuration details.
  5. Optional: In the Streams section, enter a comma-separated list of stream names you want to sync.
  6. In the Destination section, enter a job name.
  7. Enter a target schema name.
  8. Optional: Set how often the ingestion job should run. Schedule options include Run Once, Every 3 hours, Daily, Weekly, Monthly, and Quarterly.
  9. Select a mode for how to store incoming records at the destination table.
    • Append: Add new records to the existing dataset.
    • Merge: Add or update existing records where applicable.
    • Overwrite: Replace all existing records.
  10. Optional: Select a DataHub domain. For example, Company, Product, Sales. This is only applicable if you have one previously created on DataHub via the Govern feature on NexusOne.
  11. Optional: Select or create one or more tags to label this job.
  12. Optional: Column transformations:
    • Click Add Transformation, then select any of the following column transformation types:
      • Cast: Converts a column’s data type during ingestion. To use this transformation type, enter the column name in the Column field, and then select a target type in the Target Type.
      • Drop: Removes a column from the dataset. To use this transformation type, enter the column name in the Column field.
      • Encrypt: Makes the data unreadable using an encryption key. To use this transformation type, enter the column name in the Column field, and then optionally enter a key name in the Key Name field.
      • Rename: Changes a column’s name. To use this transformation type, enter the column name in the Column field, and then enter a new column name in the New Name field.
    • Repeat until you have added all the transformations necessary for your use case.
  13. After configuring all fields, click Ingest to submit the job.

Monitor, trigger, or delete a job

When you schedule an ingested API, it runs as a job in Apache Airflow. You can monitor, trigger, or delete the job.
  1. When you create a job, a success message and a View Jobs button appear.
  2. Track the job status by clicking View Jobs or navigating to the NexusOne homepage and clicking Monitor.
  3. Use the three dots ... menu to trigger or delete a job.
  4. If you clicked Trigger job, then click the job’s name to open its DAG details in Airflow’s portal.

Additional resources