Skip to main content
This tutorial walks you through how you can add a public URL to an ORC file in the NexusOne platform.

Prerequisites

  • Access to the NexusOne portal
  • Appropriate permission: nx1_ingest

Add the public URL to the portal

  1. Log in to NexusOne.
  2. On the NexusOne homepage, navigate to Ingest > File > Public File URL.
  3. In File URL, enter the following URL to an ORC file:
https://rapid-file-tutorial.s3.us-east-1.amazonaws.com/customers.orc

Add ingest details

Add the following information to the fields:
  • Name: orc_url
  • Schema: orc_url_schema
  • Table: orc_url_table
  • Schedule: Run Once
  • Mode: append
  • Tags: Don’t add any tags
After adding these details, click Ingest. Wait for a few minutes until you see a success message appear.

Monitor job

When you ingest the file, this creates an Airflow job. To monitor the status of the job, use the following steps:
  1. Click View Jobs or navigate to the NexusOne homepage and click Monitor.
  2. Find your job name, orc_url, in the list, and watch its current status.
  3. Wait for a few minutes and refresh your browser until the status changes to Completed.

Visualize your dataset

Use the following steps to visualize your dataset:
  1. On the NexusOne homepage, click Discover to launch Superset.
  2. Hover your mouse over SQL, and then select SQL Lab.
  3. Enter the following command in the query box:
SELECT * FROM orc_url_schema.orc_url_table
visualize-orc-url-dataset-v4-1-2

Visualize your dataset

Additional resources