Getting started with iiDrak Data Platform

A modern Data Lakehouse solution with Open and Unified data processing platform for Data Lake and Data warehouse.

Getting started with iiDrak Data Platform

Overview of the iiDrak Data Platform

The iiDrak is a unified solution that combines the best features of data lakes and data warehouses, providing a single platform for all your data management, processing, and AI/ML needs. It is an On-premise first solution but extensive support for cross cloud integration, cloud-native and cloud-agnostic. Data and executors are totally decoupled. BYOS (Bring your own storage) & BYOC (Bring your own compute)

iiDrak Data platform supports 3 modes of executor:

  1. Clustered executors - Execute queries using Apache Spark cluster. Useful for querying large datasets
  2. Single Node executor - Leverage DuckDB on single node to execute lightning fast queries against tables holding medium size data (Ex: 1GB - 100GB)
  3. Serverless executor - Use browsers WASM capabilities to execute queries inside the browser for running queries against small data sets (Ex: < 10GB data). This could run exploratory queries against CSV files in object storage etc.,

iiDrak Data platform enhances this architecture with:

Key Benefits

System Requirements

Cloud Deployment

Supported Cloud Platforms:

Minimum Storage: 1TB for system storage Network: High-bandwidth internet connection (minimum 1 Gbps)

On-Premise Deployment

CPU: 8+ cores (recommended: Intel Xeon or AMD EPYC) RAM: 32GB minimum (recommended: 64GB) Storage:

System: 200GB SSD Data: Based on requirements (recommended: starts at 2TB)

Network: 10 Gbps network interface

Software Requirements

Operating System: Linux (Ubuntu 20.04 LTS or later) RedHat Enterprise Linux 8.x or later

Get Started - Installer

# Download the installer

curl -O https://nexaris.com/installer/iidrak_setup.sh

 

# Run installation script

./installer