This tutorial shows you how to ingest an ORC file from a public URL into the NexusOne platform.

Prerequisites

  • Access to the NexusOne portal
  • Appropriate permission: nx1_ingest

Add the public URL to the portal

  1. Log in to NexusOne.
  2. On the NexusOne homepage, navigate to Ingest > File > Public File URL.
  3. In File URL, enter the following URL to an ORC file:
https://rapid-csv-upload.s3.us-east-1.amazonaws.com/customers.orc
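Before you paste a URL into the portal, it can help to confirm that it actually points to an ORC file. The following is a minimal sketch, not part of NexusOne: the function names are hypothetical, and it uses only the Python standard library. It assumes that the file is served over HTTP(S) with a path ending in .orc, and it relies on the fact that ORC files begin with the three bytes "ORC".

```python
from urllib.parse import urlparse
from urllib.request import Request, urlopen

# ORC files start with this three-byte magic header.
ORC_MAGIC = b"ORC"


def is_plausible_orc_url(url: str) -> bool:
    """Offline check: http(s) scheme, a host, and a path ending in .orc."""
    parts = urlparse(url)
    return (
        parts.scheme in ("http", "https")
        and bool(parts.netloc)
        and parts.path.lower().endswith(".orc")
    )


def has_orc_magic(first_bytes: bytes) -> bool:
    """Return True if the given bytes start with the ORC magic header."""
    return first_bytes[:3] == ORC_MAGIC


def fetch_first_bytes(url: str, count: int = 3, timeout: float = 10.0) -> bytes:
    """Download only the first `count` bytes of the file (needs network access)."""
    # Ask the server for a byte range; if it ignores the header,
    # read(count) still stops after `count` bytes.
    request = Request(url, headers={"Range": f"bytes=0-{count - 1}"})
    with urlopen(request, timeout=timeout) as response:
        return response.read(count)
```

To run the full check against the URL in this tutorial, call has_orc_magic(fetch_first_bytes(url)) after is_plausible_orc_url(url) returns True. The network call is kept in its own function so that the two offline checks work without connectivity.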

Add ingest details

Add the following information to the fields:
  • Name: orc_url
  • Schema: orc_url_schema
  • Table: orc_url_table
  • Schedule: None
  • Mode: append
Leave the other fields at their default values. For the Schedule field, None means that the Apache Airflow DAG won't run again after its first run. After adding these details, click Ingest, then wait a few minutes for a success message to appear.
The first DAG run starts automatically. Recurring runs depend on the schedule option you select.

Monitor job

On the NexusOne homepage, click Monitor or View Jobs. Your job, orc_url, appears in the list along with its current status.

Visualize your dataset

On the NexusOne homepage, navigate to Discover > New > SQL query. Then run the following query:
SELECT * FROM orc_url_schema.orc_url_table

Additional resources