Interactive Data Notebooks

Explore your data
at Spark speed.

Skeyenote brings the interactive notebook experience to Apache Spark — letting data teams run exploratory analysis, iterate fast, and scale effortlessly across massive parallel workloads.

Launch a Notebook See how it works

10×

faster iteration vs
batch pipelines

built-in interpreter
types

∞

horizontal scale
via Spark clusters

[1]

// Load sales data from data lake
val df = spark.read
  .parquet("s3a://datalake/sales/2024/")
  .filter("region = 'EMEA'")
val summary = df
  .groupBy("product_line")
  .agg(sum("revenue").as("total_rev"),
     avg("margin_pct").as("avg_margin"))
  .orderBy(desc("total_rev"))

✓ Executed in 1.4s — 2.1M rows processed across 24 executors

product_line	total_rev	avg_margin	Δ yoy
Enterprise SaaS	€ 4.2M	38.4%	+12.1%
Analytics Suite	€ 2.8M	31.7%	+6.3%
Data Connectors	€ 1.1M	18.2%	-2.4%

📂

Connect your data

Point Skeyenote at any JDBC source, Hive metastore, S3-compatible lake, or local files. No data movement required.

⚡

Write, run, iterate

Cells execute on your Spark cluster in real time. Inspect DataFrames, chart distributions, and refine logic cell by cell.

🔬

Share findings

Export annotated notebooks as live reports. Markdown cells let you embed context alongside your code and outputs.

What you get

Everything a data team
actually needs

Skeyenote keeps the full power of Spark within reach — without infrastructure headaches or context-switching between tools.

⚡

Massive Parallel Execution

Every cell dispatches jobs to your Spark cluster. Shuffle hundreds of millions of rows while you read the partial results streaming into the output pane.

🔗

JDBC Connectivity

Query Postgres, MySQL, Oracle, Hive, Trino, or any JDBC-compliant store without leaving the notebook. Schema introspection included.

📝

Markdown Cells

Narrate your analysis inline. Toggle between code and markdown to build self-documenting notebooks that communicate as well as they compute.

🐚

Shell Interpreter

Run sh cells for data downloads, file management, or CLI tools — all from the same interface, without switching terminals.

🔍

Variable Inspector

Inspect DataFrame schemas, row counts, and column statistics without extra code. The variable panel updates live as each cell completes.

🤝

Team Notebooks

Share notebooks with your data team, leave comments, and revisit checkpointed cell states — exploratory work that doesn't disappear after the session ends.

20 built-in interpreters

One notebook,
any data source.

Switch interpreter per cell. Query Spark, relational databases, NoSQL stores, search engines, statistical tools, and more — all within a single notebook session, without leaving the interface.

spark Scala / Spark

jdbc SQL / JDBC

mongo MongoDB

es Elasticsearch

r R / ggplot2

py Python

sh Shell

md Markdown

+12 more

jdbc customer_cohorts.snb · cell [4]

-- Monthly active user cohorts

SELECT

DATE_TRUNC('month', created_at) AS cohort,

COUNT(DISTINCT user_id) AS users,

ROUND(AVG(ltv), 2) AS avg_ltv

FROM dw.customers

WHERE plan != 'free'

GROUP BY 1

ORDER BY 1 DESC

LIMIT 12;

✓ 12 rows · 48ms

            2024-06   4,218 users   avg_ltv: €182.40

            2024-05   3,891 users   avg_ltv: €179.10

            …

Apache Spark at the core

Scale from laptop
to data center

Skeyenote talks directly to your Spark cluster. As your data grows, add executors — your notebooks stay the same. No rewrites, no migration.

Data scale Spark handles natively

100+

Concurrent executor nodes supported

<2s

Typical cell execution feedback loop

Data copies needed — query in place

Explore your dataat Spark speed.