- Self-service analytics across curated and raw datasets
- OpenID Connect (OIDC) for unified authentication and authorization
- Federated SQL queries across Iceberg, Hive, PostgreSQL, and more
- BI dashboards powered by Superset
- DataOps workflows for validating and inspecting catalog data
- AI-assisted SQL generation
Environment configuration
This section describes how the NexusOne team configured Trino, and how different client tools connect to the service.Accessing the Trino UI
Trino provides that displays the details about the cluster. You can access the Trino UI on the NexusOne cluster using the following URL:When you purchase NexusOne, you receive a client name.
Replace
client with your assigned client name.Trino client setup
Within NexusOne, Trino supports OAuth 2.0 authenticated connections for JDBC, Python, and the command-line tool clients.Python
Example of connecting to Trino using Python.user and password.
JDBC
Example of connecting to Trino using JDBC.Command-line tool
Example of connecting to Trino using a command-line tool.user.
Iceberg/Hive integration
Trino integrates directly with the NexusOne data platform through preconfigured Iceberg and Hive catalogs. These catalogs use S3 as the underlying storage layer for all datasets and a Hive Metastore for providing centralized table and schema metadata.Superset integration
Superset connects to Trino using OAuth 2.0. Each Trino catalog appears as a database, and each schema inside a catalog is available for querying in SQL Lab, Superset’s interactive SQL editor. Connection example:Types of Trino queries
Trino supports different types of queries depending on the purpose and data sources. Understanding these types helps you choose the right approach when analyzing data.Analytical queries
Analytical queries interact with data within a single catalog or schema. One example is counting employees per job role in a single table,employees1,
and calculating their average salary.
Federated queries
Federated queries combine data across multiple catalogs or schemas. One example is aggregating employee data by department, combining information from an Iceberg table,employees1, and a Hive table, departments, in a single query.
Query execution management
Trino provides system-level mechanisms to keep query execution efficient in a multi-tenant environment such as NexusOne. This includes the following:- Parallel query execution across Kubernetes nodes
- Configurable query management properties that control query behavior and resource
usage, such as:
retry-policy: Defines how many times and how often to retry a failed queryquery_max_run_time: Maximum time a query can run before Trino stops itquery_max_memory: Maximum memory a query can use during executionquery_max_execution_time: Maximum CPU execution time a query can consume
Additional resources
- To learn about best practices when using Trino, refer to the Trino best practices page.
- To learn practical ways to use Trino in the NexusOne environment, refer to the Trino hands-on examples page.
- For more details about Trino, refer to the Trino official documentation.
- If you are using the NexusOne portal and want to learn how to launch Trino, refer to the Launch a hosted app page.