Prerequisite
Appropriate permissions: nx1_ingest, nx1_monitor, nx1_s3_admin, airflow_user, superset_user, spark_sql, and trino_admin
Add the public URL to the portal
- Log in to NexusOne.
- On the NexusOne homepage, navigate to Ingest > File.
- In the File Details section, click Public File URL.
- In File URL, enter the following URL to a Parquet file:
File URL box.
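Before pasting a URL into the portal, it can help to confirm that it really serves a Parquet file. The following is a minimal sketch, not part of NexusOne: it relies on the fact that a standard Parquet file begins and ends with the 4-byte marker `PAR1`. The function names are hypothetical, and the sketch assumes the URL is publicly readable without authentication.

```python
import urllib.request

PARQUET_MAGIC = b"PAR1"  # 4-byte marker at both the start and the end of a Parquet file


def looks_like_parquet(head: bytes, tail: bytes) -> bool:
    """Return True if the leading and trailing 4 bytes match the Parquet marker."""
    return head[:4] == PARQUET_MAGIC and tail[-4:] == PARQUET_MAGIC


def fetch_range(url: str, byte_range: str) -> bytes:
    """Fetch a byte range from a public URL.

    Assumes the server honors HTTP Range requests. If it ignores them and
    returns the whole file, the slicing in looks_like_parquet still works,
    at the cost of a full download.
    """
    request = urllib.request.Request(url, headers={"Range": f"bytes={byte_range}"})
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read()


def check_public_parquet_url(url: str) -> bool:
    """Hypothetical helper: check the first and last 4 bytes behind a URL."""
    head = fetch_range(url, "0-3")  # first 4 bytes
    tail = fetch_range(url, "-4")   # last 4 bytes (suffix range)
    return looks_like_parquet(head, tail)
```

Calling `check_public_parquet_url` with your own URL returns `True` when both markers are present. A `False` result usually means the link points to an HTML page or a different file format.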
Add ingest details
Add the following information to the fields:
- Name: parquet_url
- Schema: parquet_url_schema
- Table: parquet_url_table
- Schedule: None
- Mode: append
- Tags: Don’t add any tags
A Schedule of None means that the DAG on Apache Airflow won’t run on a schedule; it runs only when you trigger it.
After adding these details, click Ingest. Wait a few minutes until a success message appears.
Monitor and trigger job
- Ingesting the file creates an Airflow job, and a success message with a View Jobs button appears.
- Track the job status by clicking View Jobs, or by navigating to the NexusOne homepage and clicking Monitor.
- You should see your job name, parquet, in the list, along with its current status. Use the three-dot (...) menu to trigger the job.
- If you clicked Trigger job, click the job’s name to open its DAG details in Airflow’s portal.
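For readers who prefer scripting, a manual trigger can also be sketched against Airflow's stable REST API. This is an assumption-heavy illustration, not a documented NexusOne feature: it assumes the deployment exposes the Airflow 2 endpoint `POST /api/v1/dags/{dag_id}/dagRuns`, that the DAG ID matches the job name, and that bearer-token authentication is enabled. The base URL and token below are placeholders.

```python
import json
import urllib.request
from urllib.parse import quote


def build_trigger_request(base_url: str, dag_id: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a request that asks Airflow to start a DAG run.

    Assumes the Airflow 2 stable REST API and bearer-token auth; your
    deployment may use a different auth backend or not expose the API at all.
    """
    url = f"{base_url.rstrip('/')}/api/v1/dags/{quote(dag_id, safe='')}/dagRuns"
    body = json.dumps({"conf": {}}).encode("utf-8")  # empty run configuration
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )


# Placeholder values; replace them with your own Airflow URL, DAG ID, and token.
request = build_trigger_request("https://airflow.example.com", "parquet", "<token>")
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` would send it. The sketch stops short of that so it can be reviewed before anything runs.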
Visualize your dataset
On the NexusOne homepage, navigate to Discover > New > SQL query. Then execute the following command:
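As an illustration of the kind of query you might run, here is a minimal sketch that builds a preview statement from the schema and table names entered during ingestion. It assumes the table is addressable as `schema.table` in the SQL editor; your environment may also require a catalog prefix. The helper name `preview_query` is hypothetical.

```python
def preview_query(schema: str, table: str, limit: int = 10) -> str:
    """Build a SELECT statement that previews the first rows of schema.table."""
    for identifier in (schema, table):
        # Conservative check: allow only letters, digits, and underscores.
        if not identifier.replace("_", "").isalnum():
            raise ValueError(f"Unexpected characters in identifier: {identifier!r}")
    return f"SELECT * FROM {schema}.{table} LIMIT {int(limit)}"


# Schema and table names as entered in the ingest details.
print(preview_query("parquet_url_schema", "parquet_url_table"))
# prints: SELECT * FROM parquet_url_schema.parquet_url_table LIMIT 10
```

Paste the printed statement into the SQL query editor and run it to confirm that the ingested rows are present.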
Additional resources
- To get an overview of what file ingestion is, refer to Data ingestion overview.
- For general instructions about how to ingest a file in NexusOne, refer to Ingest a file.
- For more information about roles or permissions, refer to Govern Overview.