Your first query

In this tutorial, you will define a model, register a connection, build a query, and read the results. By the end, you will have a working Semolina query you can adapt for your own semantic views.

Prerequisites: Semolina installed (Installation).

1. Define a model

A model maps to a semantic view in your warehouse. Create a file called demo.py and add this code:

from semolina import (
    SemanticView,
    Metric,
    Dimension,
)


class Sales(SemanticView, view="sales"):
    revenue = Metric()
    cost = Metric()
    country = Dimension()
    region = Dimension()

view="sales" is the name of the semantic view in your warehouse. Metric fields are aggregatable measures (revenue, cost). Dimension fields are categories for grouping (country, region).

In your warehouse, this model maps to a definition like:

CREATE OR REPLACE SEMANTIC VIEW sales
  TABLES (
    s AS source_table PRIMARY KEY (id)
  )
  DIMENSIONS (
    s.country AS country,
    s.region AS region
  )
  METRICS (
    s.revenue AS SUM(revenue),
    s.cost AS SUM(cost)
  )
;
CREATE OR REPLACE VIEW sales
  WITH METRICS
  LANGUAGE YAML
  AS $$
    version: 1.1
    source: source_table
    dimensions:
      - name: country
        expr: country
      - name: region
        expr: region
    measures:
      - name: revenue
        expr: SUM(revenue)
      - name: cost
        expr: SUM(cost)
  $$;

2. Register a connection pool

Semolina needs a connection pool to talk to your warehouse. Register one before running any queries:

from semolina import register, pool_from_config

pool, dialect = pool_from_config()  # reads .semolina.toml
register("default", pool, dialect=dialect)
from semolina import register, pool_from_config

pool, dialect = pool_from_config()  # reads .semolina.toml
register("default", pool, dialect=dialect)

The same Python code works for both backends. The type field in your .semolina.toml determines which warehouse to connect to.

See How to choose and configure a backend for full connection details and TOML configuration.

Tip

No warehouse? Use MockPool

If you want to follow along without a warehouse connection, use MockPool with fixture data:

from semolina import MockPool, register

pool = MockPool()
pool.load(
    "sales",
    [
        {
            "revenue": 1000,
            "cost": 100,
            "country": "US",
            "region": "West",
        },
        {
            "revenue": 2000,
            "cost": 200,
            "country": "CA",
            "region": "West",
        },
        {
            "revenue": 500,
            "cost": 50,
            "country": "US",
            "region": "East",
        },
    ],
)
register("default", pool, dialect="mock")

3. Build and run a query

Use Model.query() to start building. Chain .metrics() and .dimensions() to select the fields you want, then call .execute():

cursor = (
    Sales.query()
    .metrics(Sales.revenue)
    .dimensions(Sales.country)
    .execute()
)

Each chained method returns a new query object, so queries are immutable and reusable.

You can also pass metrics and dimensions directly to query():

cursor = Sales.query(
    metrics=[Sales.revenue],
    dimensions=[Sales.country],
).execute()

4. Read the results

.execute() returns a SemolinaCursor. Call .fetchall_rows() to get Row objects that support both attribute and dict-style access:

rows = cursor.fetchall_rows()
for row in rows:
    print(row.country, row.revenue)  # attribute access
    print(row["country"])  # dict-style access

You should see output like:

US 1000
US
CA 2000
CA
US 500
US

Complete example

Here is a self-contained demo using MockPool. To run against a real warehouse, replace the pool registration with your connection (see step 2).

Paste it into demo.py and run python demo.py:

from semolina import (
    SemanticView,
    Metric,
    Dimension,
    MockPool,
    register,
)


# 1. Define model
class Sales(SemanticView, view="sales"):
    revenue = Metric()
    cost = Metric()
    country = Dimension()
    region = Dimension()


# 2. Register pool with fixture data
pool = MockPool()
pool.load(
    "sales",
    [
        {
            "revenue": 1000,
            "cost": 100,
            "country": "US",
            "region": "West",
        },
        {
            "revenue": 2000,
            "cost": 200,
            "country": "CA",
            "region": "West",
        },
        {
            "revenue": 500,
            "cost": 50,
            "country": "US",
            "region": "East",
        },
    ],
)
register("default", pool, dialect="mock")

# 3. Build and execute query
cursor = (
    Sales.query()
    .metrics(Sales.revenue)
    .dimensions(Sales.country)
    .execute()
)

# 4. Use results
for row in cursor.fetchall_rows():
    print(row.country, row.revenue)

You should see:

US 1000
CA 2000
US 500

See also

Defining Models

Field types, SemanticView parameters, immutability.

How to define models
Building Queries

All query methods with examples.

How to build queries
Filtering

Field operators, named methods, AND/OR/NOT composition.

How to filter queries