Prerequisites
- Appropriate permission: nx1_engineer
- A previously ingested dataset
Select a dataset and describe a transformation rule
The catalog contains metadata about the schema and table.
The schema describes the data structure, such as the table name, column names, and data types. The table contains the data.
You describe a rule using natural language so NexusOne can generate an SQL query.
- Log in to NexusOne.
- On the NexusOne homepage, click Engineer.
- Click Federated > Select catalogs and choose at least two catalogs.
- Select schemas from each catalog.
- Select tables from each catalog.
Selecting Iceberg as a catalog is equivalent to using a Lakehouse data transformation rule.
- Enter a transformation job name.
- Enter a natural language transformation prompt.
- Optional: Select the checkbox to preview your data after the query generated from your prompt runs.
- Click Transform. If you selected the preview checkbox in the previous step, also click Finalize to proceed to the next step.
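To make the prompt-to-query idea concrete, here is a minimal sketch of the kind of SQL a prompt such as "Total the order amount per customer" might lead to when two catalogs are selected. It is an illustration only: SQLite's attached databases stand in for catalogs, the names crm, sales, customers, and orders are hypothetical, and the query NexusOne actually generates may look different.

```python
import sqlite3

# Each attached in-memory database plays the role of one catalog.
# All catalog, table, and column names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS crm")
conn.execute("ATTACH DATABASE ':memory:' AS sales")

conn.execute("CREATE TABLE crm.customers (customer_id INTEGER, name TEXT)")
conn.execute(
    "CREATE TABLE sales.orders (order_id INTEGER, customer_id INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO crm.customers VALUES (?, ?)",
    [(1, "Ada"), (2, "Grace")],
)
conn.executemany(
    "INSERT INTO sales.orders VALUES (?, ?, ?)",
    [(10, 1, 25.0), (11, 1, 15.0), (12, 2, 40.0)],
)

# Assumed output of the prompt "Total the order amount per customer":
# a join across the two stand-in catalogs followed by an aggregation.
GENERATED_SQL = """
SELECT c.name, SUM(o.amount) AS total_amount
FROM crm.customers AS c
JOIN sales.orders AS o ON o.customer_id = c.customer_id
GROUP BY c.name
ORDER BY c.name
"""

rows = conn.execute(GENERATED_SQL).fetchall()
print(rows)
```

The point of the sketch is the shape of the query: tables from different catalogs are addressed by qualified names and joined in a single statement.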
Schedule a transformation rule
Scheduling a rule allows you to run the rule at specific time intervals using Apache Airflow.
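For orientation, the portal's schedule options (None, Every 3 hours, Daily, Weekly, Monthly, and Quarterly) could correspond to Airflow schedule expressions roughly as sketched here. The cron string and the `@daily`-style presets are standard Airflow schedule values, but the mapping itself is an assumption, and `SCHEDULE_EXPRESSIONS` and `schedule_for` are hypothetical names rather than part of NexusOne.

```python
from typing import Dict, Optional

# Assumed mapping from portal schedule options to Airflow schedule values.
# The option labels come from this page; the right-hand side is a guess at
# an equivalent Airflow expression, not documented NexusOne behavior.
SCHEDULE_EXPRESSIONS: Dict[str, Optional[str]] = {
    "None": None,                    # no recurring schedule
    "Every 3 hours": "0 */3 * * *",  # cron: minute 0 of every third hour
    "Daily": "@daily",
    "Weekly": "@weekly",
    "Monthly": "@monthly",
    "Quarterly": "@quarterly",
}


def schedule_for(option: str) -> Optional[str]:
    """Return the assumed Airflow schedule value for a portal option."""
    if option not in SCHEDULE_EXPRESSIONS:
        raise ValueError(f"Unknown schedule option: {option!r}")
    return SCHEDULE_EXPRESSIONS[option]


print(schedule_for("Every 3 hours"))
```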
- Select a destination schema.
- Select a destination table.
- Optional: Select a DataHub domain.
- Optional: Select or create one or more tags to label this rule.
- Select a time interval for how often the transformation rule should run.
Schedule options include None, Every 3 hours, Daily, Weekly, Monthly, and Quarterly. When you first schedule the rule, the job/DAG runs automatically in Apache Airflow. Recurring runs depend on your selected schedule option.
- Select a mode for how to store incoming records at the destination table.
- Append: Add new records.
- Merge: Add or update existing records where applicable.
- Overwrite: Replace all existing records.
- Click Schedule.
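The difference between the Append, Merge, and Overwrite modes can be sketched with plain Python lists. This is a simplified model, not NexusOne's implementation: `store_records` is a hypothetical helper, records are matched on an assumed `id` key, and a merge is assumed to replace the whole matching record.

```python
from typing import Dict, List

Record = Dict[str, object]


def store_records(
    existing: List[Record],
    incoming: List[Record],
    mode: str,
    key: str = "id",  # assumed matching column for merge
) -> List[Record]:
    """Return the destination table contents after one run."""
    if mode == "append":
        # Add new records; existing records are left untouched.
        return existing + incoming
    if mode == "overwrite":
        # Replace all existing records with the incoming ones.
        return list(incoming)
    if mode == "merge":
        # Update records whose key already exists, add the rest.
        merged = {row[key]: row for row in existing}
        for row in incoming:
            merged[row[key]] = row
        return list(merged.values())
    raise ValueError(f"Unknown mode: {mode!r}")


existing = [{"id": 1, "city": "Oslo"}, {"id": 2, "city": "Rome"}]
incoming = [{"id": 2, "city": "Milan"}, {"id": 3, "city": "Lima"}]

for mode in ("append", "merge", "overwrite"):
    print(mode, store_records(existing, incoming, mode))
```

In this model, Append can leave two records with the same key, while Merge keeps one record per key, which is usually the deciding factor when a rule runs repeatedly over overlapping data.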
Monitor, trigger, or delete a job
When you schedule a rule, it runs as a job in Apache Airflow.
You can monitor, trigger, or delete the job.
- When you schedule a rule, a View your job button appears.
- Track the job status by clicking View your job, or by navigating to the NexusOne homepage and clicking Monitor.
- Use the three dots (...) menu to trigger or delete a job.
- If you clicked Trigger job, click the job’s name to open its DAG details in the Airflow portal.
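Because the job is an Airflow DAG, the same monitor, trigger, and delete actions have counterparts in Airflow's own stable REST API (Airflow 2.x, under `/api/v1`). The sketch below only builds the requests and does not send them. It assumes direct access to the Airflow API, which this page does not confirm; the base URL and the DAG ID `my_transformation_job` are hypothetical, authentication is omitted, and this is not the NexusOne REST API.

```python
import json
from urllib.request import Request

# Hypothetical Airflow endpoint; replace with your deployment's URL.
AIRFLOW_BASE_URL = "http://localhost:8080/api/v1"


def list_runs_request(dag_id: str) -> Request:
    """Monitor: list the runs of a DAG."""
    return Request(f"{AIRFLOW_BASE_URL}/dags/{dag_id}/dagRuns", method="GET")


def trigger_request(dag_id: str) -> Request:
    """Trigger: start a new run of a DAG with an empty configuration."""
    body = json.dumps({"conf": {}}).encode("utf-8")
    return Request(
        f"{AIRFLOW_BASE_URL}/dags/{dag_id}/dagRuns",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def delete_request(dag_id: str) -> Request:
    """Delete: remove a DAG and its metadata from Airflow."""
    return Request(f"{AIRFLOW_BASE_URL}/dags/{dag_id}", method="DELETE")


# Build (but do not send) the three requests for a hypothetical job.
for req in (
    list_runs_request("my_transformation_job"),
    trigger_request("my_transformation_job"),
    delete_request("my_transformation_job"),
):
    print(req.get_method(), req.full_url)
```

To actually send a request, pass it to `urllib.request.urlopen` after adding whatever authentication header your Airflow deployment requires.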
Additional resources
- For general instructions about how to ingest a file in NexusOne, refer to How to ingest a file.
- For more information about the monitoring feature, refer to Monitor Overview.
- For more information about roles or permissions, refer to Govern Overview.