Book 6 · Chapter Two — Your First API Request

Part One

The requests Library

Python's standard library can make HTTP requests, but the third-party requests library is simpler and is what almost every Python developer uses in practice. It handles encoding, redirects, headers, and timeouts cleanly, without the verbosity of the built-in alternatives.

If you are working in a Jupyter notebook or a local Python environment, install it once:

pip install requests

Or inside a notebook cell:

!pip install requests

Then import it at the top of your script:

Python · Copy to your notebook

import requests

url = "https://diavgeia.gov.gr/luminapi/api/search"
params = {
    "q": 'subject:"κλιματισμός"',
    "size": 5,
    "page": 0,
    "sort": "recent",
}

resp = requests.get(url, params=params, headers={"Accept": "application/json"}, timeout=30)
resp.raise_for_status()
data = resp.json()

print("Total decisions found:", data["info"]["total"])
print("Decisions on this page:", len(data["decisions"]))

Why these cells don't run here. The browser version of the course cannot make live requests to external servers. Copy any cell with a Copy button into a Jupyter notebook and run it there. The interactive Run cells later in this chapter use hardcoded responses so you can practice in the browser.

Part Two

The Request, Line by Line

Every call to the Diavgeia API follows the same four-line pattern. Let's go through each line.

The URL

The search endpoint never changes. You store it in a variable so you don't have to type the full address every time:

url = "https://diavgeia.gov.gr/luminapi/api/search"

The params dictionary

Instead of building a long URL string by hand, you pass a dictionary. The requests library URL-encodes the values automatically and appends them after the ?:

Python · Copy to your notebook

params = {
    "q": 'subject:"κλιματισμός"',
    "size": 10,
    "page": 0,
    "sort": "recent",
}

The four most common params in a Diavgeia search:

Parameter	Example value	What it does
`q`	`subject:"κλιματισμός"`	The search expression. Always includes a field name before the colon.
`size`	`10`	How many results to return per page. Maximum is 100.
`page`	`0`	Which page to return. Zero-based: first page is 0, second is 1.
`sort`	`recent`	Newest first. Use `relative` when you care about text relevance.

The q parameter mini-syntax

The value of q is not a plain search term — it uses a small query language. You always name the field you want to search, followed by a colon and the term in quotes:

Pattern	Searches in
`subject:"κλιματισμός"`	The decision title / subject field
`q:"κλιματισμός"`	Full text of the document
`ada:"6ΛΩΖ7ΛΞ-ΦΨΥ"`	A specific ADA code
`q:["κλιματισμός", "ψύξη"]`	Multiple terms (any match)

The get call

resp = requests.get(url, params=params, headers={"Accept": "application/json"}, timeout=30)

requests.get() sends the HTTP GET request and returns a response object. The two keyword arguments you should always include:

headers={"Accept": "application/json"} — tells the server you want JSON back, not HTML.
timeout=30 — if the server takes more than 30 seconds to respond, Python raises an error instead of waiting forever.

raise_for_status and .json()

resp.raise_for_status()
data = resp.json()

raise_for_status() checks the HTTP status code. If the server returned an error — 404 Not Found, 500 Server Error — it raises a Python exception immediately, so your code fails loudly rather than silently processing an empty or broken response.

resp.json() parses the response body as JSON and returns a Python dictionary. It is equivalent to writing json.loads(resp.text), but shorter.

Part Three

Working with the Response

The cell below simulates a realistic Diavgeia response — the same structure your code would receive after the resp.json() call. Run it and examine the output, then try changing the print statements.

Python · Try it

import json

# This is what resp.json() returns after a real API call
raw = """
{
  "info": {"total": 312},
  "decisions": [
    {
      "ada": "ΨΦΔΔ46ΜΤΛΡ-ΑΩΣ",
      "subject": "Ανάληψη υποχρέωσης για την προμήθεια κλιματιστικών μονάδων",
      "issueDate": "2025-06-10",
      "submissionTimestamp": "2025-06-11T09:15:00.000+0300",
      "documentUrl": "https://diavgeia.gov.gr/doc/ΨΦΔΔ46ΜΤΛΡ-ΑΩΣ",
      "documentType": "pdf",
      "organization": {"label": "ΔΗΜΟΣ ΑΘΗΝΑΙΩΝ"},
      "decisionType": {"label": "Ανάληψη Υποχρέωσης"}
    },
    {
      "ada": "6ΛΩΖ7ΛΞ-ΦΨΥ",
      "subject": "Σύμβαση προμήθειας και εγκατάστασης κλιματιστικών",
      "issueDate": "2025-06-08",
      "submissionTimestamp": "2025-06-09T11:30:00.000+0300",
      "documentUrl": "https://diavgeia.gov.gr/doc/6ΛΩΖ7ΛΞ-ΦΨΥ",
      "documentType": "pdf",
      "organization": {"label": "ΔΗΜΟΣ ΘΕΣΣΑΛΟΝΙΚΗΣ"},
      "decisionType": {"label": "Σύμβαση"}
    },
    {
      "ada": "ΩΒΚ746ΜΦΩΡ-ΠΔΓ",
      "subject": "Συντήρηση κλιματισμού δημοτικών κτιρίων",
      "issueDate": "2025-06-05",
      "submissionTimestamp": "2025-06-06T14:20:00.000+0300",
      "documentUrl": "https://diavgeia.gov.gr/doc/ΩΒΚ746ΜΦΩΡ-ΠΔΓ",
      "documentType": "pdf",
      "organization": {"label": "ΔΗΜΟΣ ΠΕΙΡΑΙΩΣ"},
      "decisionType": {"label": "Ανάληψη Υποχρέωσης"}
    }
  ]
}
"""

data = json.loads(raw)

print("Total decisions in database:", data["info"]["total"])
print("Decisions on this page:", len(data["decisions"]))
print()

for decision in data["decisions"]:
    print(decision["ada"])
    print("  Subject  :", decision["subject"])
    print("  Date     :", decision["issueDate"])
    print("  Org      :", decision["organization"]["label"])
    print("  Type     :", decision["decisionType"]["label"])
    print("  PDF      :", decision["documentUrl"])
    print()

Notice that data["info"]["total"] is 312 — that is the full count of matching decisions in the database — while len(data["decisions"]) is only 3, because that is what one page returns. Pagination is how you collect all 312, and you will do exactly that in a later chapter.

Part Four

The Safe Extraction Pattern

Real API responses are not always complete. A field that exists for one decision may be null — Python's None — for another. If you access decision["organization"]["label"] and organization is None, Python raises a TypeError and your script stops.

The standard defence is two steps. First, use .get() with a fallback. Second, guard the nested access with or {}:

Python · Try it

import json

# One of these decisions is missing the organization field entirely
raw = """
{
  "info": {"total": 2},
  "decisions": [
    {
      "ada": "ΨΦΔΔ46ΜΤΛΡ-ΑΩΣ",
      "subject": "Ανάληψη υποχρέωσης για κλιματιστικά",
      "issueDate": "2025-06-10",
      "organization": {"label": "ΔΗΜΟΣ ΑΘΗΝΑΙΩΝ"},
      "decisionType": {"label": "Ανάληψη Υποχρέωσης"},
      "documentUrl": "https://diavgeia.gov.gr/doc/ΨΦΔΔ46ΜΤΛΡ-ΑΩΣ"
    },
    {
      "ada": "6ΛΩΖ7ΛΞ-ΦΨΥ",
      "subject": "Σύμβαση εγκατάστασης κλιματιστικών",
      "issueDate": "2025-06-08",
      "organization": null,
      "decisionType": null,
      "documentUrl": null
    }
  ]
}
"""

data = json.loads(raw)

for decision in data["decisions"]:
    organization = decision.get("organization") or {}
    decision_type = decision.get("decisionType") or {}

row = {
        "ada":          decision.get("ada", ""),
        "subject":      decision.get("subject", ""),
        "issueDate":    decision.get("issueDate", ""),
        "documentUrl":  decision.get("documentUrl", ""),
        "organization": organization.get("label", ""),
        "decisionType": decision_type.get("label", ""),
    }
    print(row)

The expression decision.get("organization") or {} reads as: "get the value of organization, and if it is None or missing, use an empty dictionary instead." Then {}.get("label", "") safely returns an empty string. Your loop never crashes, and every row in your dataset has consistent keys.

Build a dict for each decision. Once you have a row dictionary like the one above, you can collect all of them in a list and pass that list directly to pandas.DataFrame(). That step — API → list of dicts → DataFrame — is the full pipeline you will build in the next chapter.

Part Five

Filtering with fq

The q parameter searches across all decisions. The fq parameter filters the results — it narrows the search to a specific organisation, decision type, or date range without affecting the relevance ranking.

To filter by organisation, you need the organisation's numeric UID. You can find it by searching Diavgeia's web interface and reading the URL, or by looking it up in the discovery endpoints listed in the cheat sheet. Once you have it, add it as a second key in your params dict:

Python · Copy to your notebook

import requests

url = "https://diavgeia.gov.gr/luminapi/api/search"
params = {
    "q": 'subject:"κλιματισμός"',
    "fq": 'organizationUid:"6167"',   # limit to one organisation
    "sort": "recent",
    "page": 0,
    "size": 20,
}

resp = requests.get(url, params=params, headers={"Accept": "application/json"}, timeout=30)
resp.raise_for_status()
data = resp.json()

for decision in data.get("decisions", []):
    org = (decision.get("organization") or {}).get("label", "")
    print(decision.get("ada"), "|", org)
    print(" ", decision.get("subject"))
    print()

You can also filter by date range. Diavgeia uses its own timestamp syntax inside the fq value:

Python · Copy to your notebook

import requests

url = "https://diavgeia.gov.gr/luminapi/api/search"
params = {
    "q": 'subject:"κλιματισμός"',
    "fq": "issueDate:[DT(2024-01-01T00:00:00) TO DT(2024-12-31T23:59:59)]",
    "sort": "recent",
    "page": 0,
    "size": 20,
}

resp = requests.get(url, params=params, headers={"Accept": "application/json"}, timeout=30)
resp.raise_for_status()
data = resp.json()

print("Decisions in 2024:", data["info"]["total"])

fq can be stacked. If you need to filter by both organisation and date, you can pass a list as the value: "fq": ['organizationUid:"6167"', "issueDate:[DT(2024-01-01T00:00:00) TO DT(2024-12-31T23:59:59)]"]. The requests library will send both as separate fq parameters.

Part Six

Your Turn — Extract and Organise

The cell below contains a response with four decisions. Your task is to build a list of dictionaries using the safe extraction pattern, then print the total number of decisions and each row. Use .get() everywhere and guard nested access with or {}.

Python · Your turn

import json

raw = """
{
  "info": {"total": 4},
  "decisions": [
    {
      "ada": "ΨΦΔΔ46ΜΤΛΡ-ΑΩΣ",
      "subject": "Ανάληψη υποχρέωσης για κλιματιστικά",
      "issueDate": "2025-06-10",
      "documentUrl": "https://diavgeia.gov.gr/doc/ΨΦΔΔ46ΜΤΛΡ-ΑΩΣ",
      "documentType": "pdf",
      "organization": {"label": "ΔΗΜΟΣ ΑΘΗΝΑΙΩΝ"},
      "decisionType": {"label": "Ανάληψη Υποχρέωσης"}
    },
    {
      "ada": "6ΛΩΖ7ΛΞ-ΦΨΥ",
      "subject": "Σύμβαση εγκατάστασης κλιματιστικών",
      "issueDate": "2025-06-08",
      "documentUrl": "https://diavgeia.gov.gr/doc/6ΛΩΖ7ΛΞ-ΦΨΥ",
      "documentType": "pdf",
      "organization": {"label": "ΔΗΜΟΣ ΘΕΣΣΑΛΟΝΙΚΗΣ"},
      "decisionType": {"label": "Σύμβαση"}
    },
    {
      "ada": "ΩΒΚ746ΜΦΩΡ-ΠΔΓ",
      "subject": "Συντήρηση κλιματισμού δημοτικών κτιρίων",
      "issueDate": "2025-06-05",
      "documentUrl": null,
      "documentType": null,
      "organization": {"label": "ΔΗΜΟΣ ΠΕΙΡΑΙΩΣ"},
      "decisionType": null
    },
    {
      "ada": "ΡΨΞ946ΜΤΛΡ-ΒΩΔ",
      "subject": "Προμήθεια και τοποθέτηση κλιματιστικών μονάδων",
      "issueDate": "2025-06-01",
      "documentUrl": "https://diavgeia.gov.gr/doc/ΡΨΞ946ΜΤΛΡ-ΒΩΔ",
      "documentType": "pdf",
      "organization": null,
      "decisionType": {"label": "Σύμβαση"}
    }
  ]
}
"""

data = json.loads(raw)

rows = []
for decision in data["decisions"]:
    organization = decision.get("organization") or {}
    decision_type = decision.get("decisionType") or {}
    rows.append({
        "ada":          decision.get("ada", ""),
        "subject":      decision.get("subject", ""),
        "issueDate":    decision.get("issueDate", ""),
        "documentUrl":  decision.get("documentUrl", ""),
        "organization": organization.get("label", ""),
        "decisionType": decision_type.get("label", ""),
    })

print("Rows collected:", len(rows))
print()
for row in rows:
    print(row)

What you learned in this chapter: how to call the Diavgeia search endpoint with requests.get(), how to use the params dictionary to build the query, what raise_for_status() and resp.json() do, and how to extract nested fields safely with .get() and or {}. In the next chapter you will pass this list of rows to pandas and turn it into a DataFrame.

Chapter Navigation

Move between chapters.

Previous: Chapter 1 — What is an API?