What does a DeltaForge demo include?

Each demo is self-contained: a setup script creates tables and loads seed data, a query script runs SQL with assertions against expected values computed before the run, and a teardown drops everything.

Which categories of demos are available?

The repository covers open table formats (Delta Lake, Apache Iceberg), analytics and intelligence (property graphs and Cypher, geospatial), industry formats (FHIR, HL7, EDI), and common file formats.

How do I run the demos?

Open the Use Case Gallery in the DeltaForge desktop application, browse by category, run setup, run the query script, and inspect the assertion results. Teardown drops everything so the next demo starts clean.

Where do the expected values come from?

Expected values are derived outside the engine under test, so no demo passes by self-validation. Every assertion publishes the query, the expected value, and the actual value for inspection.

Delta Lake SQL Examples: Runnable Demo Gallery

1

Open the Use Case Gallery

Launch the DeltaForge GUI and browse use cases by category

2

Run the Setup

One click creates tables and loads the seed data

3

Query and Verify

Assertions validate results against expected values computed before the test ran

4

Clean Up

Teardown drops everything so you can move to the next use case

Open Table Formats

ACID transactions, time travel, schema evolution, and table maintenance on your own storage

Delta Lake

CRUD operations, MERGE patterns (SCD2, dedup, soft delete), time travel, change data feeds, partitioning, VACUUM, OPTIMIZE, Z-ORDER, deletion vectors, schema evolution, column mapping, and GDPR erasure patterns.

Time Travel MERGE CDC Schema Evolution

Apache Iceberg

V1, V2, and V3 table specs with schema evolution, partition transforms, bloom filters, CRUD operations, snapshot isolation, and UniForm interoperability between Iceberg and Delta Lake.

V1 / V2 / V3 UniForm Partition Transforms

Analytics and Algorithms

Graph traversal, geospatial indexing, and advanced SQL patterns

Graph Analytics

Cypher pattern matching, PageRank, community detection (Louvain), betweenness and closeness centrality, shortest paths, BFS, DFS, triangle counting, and KNN similarity. Tested on real-world public datasets.

PageRank Cypher Community Detection

Geospatial

Uber H3 hexagonal spatial indexing, WKT polygon operations, point-in-polygon queries, fleet tracking, delivery routing, and coverage analysis across multiple resolution levels.

H3 Indexing WKT Spatial Joins

Industry Formats

Parse healthcare, supply chain, and logistics standards directly in SQL

Healthcare

FHIR R4/R5 clinical resources, HL7 v2 patient and lab workflows, pseudonymisation, and PII lifecycle management patterns.

FHIR R4/R5 HL7 v2 Pseudonymisation

EDI and Supply Chain

HIPAA claims (837/835), eligibility (270/271), claim status (276/277), X12 purchase orders, EDIFACT international trade, TRADACOMS UK retail, and EANCOM supply chain messaging. Several of these include independent Python proof scripts that verify results without using DeltaForge.

HIPAA X12 EDIFACT TRADACOMS

File Formats

Read and query structured, semi-structured, and binary formats

Parquet and ORC

Columnar analytics with recursive directory scanning, predicate pushdown, file-level filtering, mixed compression codecs, schema evolution, and row group statistics.

Predicate Pushdown Schema Evolution

JSON and XML

Deep path extraction, namespace handling, repeating element strategies, schema evolution across files, subtree capture for audit trails, and hierarchical data flattening.

Deep Nesting XPath Schema Evolution

Avro and Protobuf

Binary format deserialization with logical types, nullable unions, nested message flattening, repeated fields, enum decoding, and mixed compression.

Logical Types Proto3

CSV and Excel

Delimiter options, quoting modes, header detection, sheet and range selection, data cleansing, multi-file joins, and cross-table analytics.

Custom Delimiters Multi-Sheet

Self-verifying design

Every use case is designed so you do not have to trust the output.

Assert-validated queries

Every query includes ASSERT statements with pre-calculated expected values. Row counts, specific cell values, and aggregates are verified automatically.

Self-contained

Each use case runs independently. Setup creates tables, queries exercise features, teardown drops everything.

Deterministic seed data

All seed data uses fixed values. Results are reproducible across platforms and runs. Several EDI use cases include independent Python proof scripts that verify expected values without using DeltaForge at all.

Does it really work?
Run it and find out

Open the Use Case Gallery

Run the Setup

Query and Verify

Clean Up

Open Table Formats

Delta Lake

Apache Iceberg

Analytics and Algorithms

Graph Analytics

Geospatial

Industry Formats

Healthcare

EDI and Supply Chain

File Formats

Parquet and ORC

JSON and XML

Avro and Protobuf

CSV and Excel

Self-verifying design

Assert-validated queries

Self-contained

Deterministic seed data

Further reading

SCD Type 2 on Delta Lake in Pure SQL

Query EDI Files with SQL

Run Cypher on Parquet and Delta Tables Without Neo4j

Pick a use case and run it

Does it really work?Run it and find out

Open the Use Case Gallery

Run the Setup

Query and Verify

Clean Up

Open Table Formats

Delta Lake

Apache Iceberg

Analytics and Algorithms

Graph Analytics

Geospatial

Industry Formats

Healthcare

EDI and Supply Chain

File Formats

Parquet and ORC

JSON and XML

Avro and Protobuf

CSV and Excel

Self-verifying design

Assert-validated queries

Self-contained

Deterministic seed data

Further reading

SCD Type 2 on Delta Lake in Pure SQL

Query EDI Files with SQL

Run Cypher on Parquet and Delta Tables Without Neo4j

Pick a use case and run it

Does it really work?
Run it and find out