The ingest feature lets you ingest a file, database, or lakehouse into the NexusOne platform. It also lets you mirror your database into the platform in real time. This page describes each of these sub-features.

How file ingestion works

The file ingest feature allows you to ingest structured data into the platform. NexusOne supports two file ingestion options:
  • Upload file: A file stored on your local machine.
  • Public file URL: A public URL to a file you’d like to ingest. For example, you might store this file in an S3 bucket and expose it over HTTPS.

Supported file formats

NexusOne currently supports these file formats:
  • CSV
  • Parquet
  • ORC
  • XML
  • XLS/XLSX
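
Before uploading a file or submitting a public URL, it can help to check that the file matches one of the formats listed above. The following is a minimal sketch that assumes the format can be inferred from the file extension; the names `SUPPORTED_FORMATS` and `detect_format` are hypothetical and not part of NexusOne, which may detect formats differently.

```python
from pathlib import PurePosixPath
from typing import Optional
from urllib.parse import urlparse

# Extensions mapped to the formats listed above.
# This mapping is an assumption made for illustration only.
SUPPORTED_FORMATS = {
    ".csv": "CSV",
    ".parquet": "Parquet",
    ".orc": "ORC",
    ".xml": "XML",
    ".xls": "XLS/XLSX",
    ".xlsx": "XLS/XLSX",
}


def detect_format(source: str) -> Optional[str]:
    """Return the supported format for a local path or public URL, or None."""
    parsed = urlparse(source)
    # For HTTP(S) URLs, inspect only the path so query strings are ignored.
    path = parsed.path if parsed.scheme in ("http", "https") else source
    suffix = PurePosixPath(path).suffix.lower()
    return SUPPORTED_FORMATS.get(suffix)


if __name__ == "__main__":
    # Hypothetical inputs: one local file and one public URL.
    for candidate in ("grades.xlsx", "https://example.com/feeds/trades.parquet"):
        print(candidate, "->", detect_format(candidate))
```

A result of `None` means the extension is not in the list, so the file would likely need converting before ingestion.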
NexusOne already provides a catalog, so when you ingest a file, you only have to define your schema and table. After you ingest the file, Apache Airflow schedules a DAG based on your configuration, and this DAG uses Spark to process the file. To query and visualize the dataset, NexusOne uses Metabase. When you run a query, Metabase communicates with Trino, which then retrieves the dataset using the catalog.
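
Trino addresses tables by a three-part name, catalog.schema.table, so the schema and table you define at ingestion combine with the platform catalog to form the name you query. The sketch below builds such a query as a string. The helper names and the catalog, schema, and table values (`lakehouse`, `finance`, `market_data`) are placeholders, not actual NexusOne names.

```python
def quote_identifier(name: str) -> str:
    """Quote a SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def preview_query(catalog: str, schema: str, table: str, limit: int = 10) -> str:
    """Build a SELECT that previews rows from catalog.schema.table."""
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    qualified = ".".join(quote_identifier(part) for part in (catalog, schema, table))
    return f"SELECT * FROM {qualified} LIMIT {limit}"


if __name__ == "__main__":
    # Placeholder names; substitute the catalog, schema, and table you defined.
    print(preview_query("lakehouse", "finance", "market_data"))
    # prints: SELECT * FROM "lakehouse"."finance"."market_data" LIMIT 10
```

A query of this shape is what you would run against the ingested table from Metabase or any other Trino client.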

Use cases

These examples show how different industries can use NexusOne’s file ingestion and query capabilities:
  • Financial services: Ingest Parquet-formatted market data feeds into NexusOne to monitor portfolio risk and run analytics on a single, secure platform without having to manage custom pipelines.
  • Education: Ingest Excel-formatted grade books into NexusOne to centralize student records and analyze student performance trends.

Additional resources

For full instructions on ingesting a file in NexusOne, refer to How to ingest a file.