Changelog:

Date	Description
07/09/2023	Add chapter “Accessing `undefined` value in negated expression”

This post summarizes my journey of learning Rego. While much of the information overlaps with the official Rego reference, it is structured like a “getting started” guide for Rego newbies. The emphasis is put on the programming paradigms (logic versus procedural) which is helpful for programmers used to imperative languages such as Python or Java.

Normal links (e.g. Wikipedia) lead to further information while links in square brackets (e.g. [1]) are sources of statements made in this post.

All code snippets in this post have been run with OPA v0.54.0.

Thanks to Jasper Van der Jeugt reading a draft of this blog post and suggesting improvements.

Introduction to Rego

The Open Policy Agent (“OPA”) project gained a lot of attention since its acceptance as a graduate project of the CNCF [0]. OPA is a general-purpose policy engine used to enforce authorization frameworks. Its policies are defined in a query language called Rego which is in the focus of this post.

Rego is based on Datalog, a declarative logic query language. It provides constructs for pattern matching, filtering and iteration. Rego is comparable to SQL as both are general-purpose query languages. However, SQL is designed to work with tabular data and Rego operates on JSON-formatted data.
Being a declarative logic query language gives Rego some properties atypical to other programming languages. These properties are essential to understand Rego.

Declarative Programming

Declarative programming expresses the logic and rules of a problem without explicitly describing the steps to solve it. It focuses on the what, rather than the how. In contrast, a more popular programming paradigm is called “imperative” where instructions change a program’s state to produce a result. Common examples of declarative languages are SQL ([1], p. 79) or HTML [2]. Infrastructure-as-code languages, such as HCL, typically use the declarative approach as well, although often mixed with imperative elements [3]. In fact, many programming languages allow to mix both paradigms, but typically lean more towards imperative programming, such as Python [4].

An imperative programming example (Python):

def is_even(x):
    remainder = x % 2

    if remainder == 0:
        return True

    return False

A declarative logic programming example (Rego):

is_even {
      remainder == 0
      remainder = input % 2
}


`==`	the comparison operator, used to compare variable values
`:=`	the assignment operator, used to assign values to variables
`=`	the unification operator, used to combine assignment and comparison


`array`	then `var1` will the iterated, `var2` is assigned each index and `var3` the value (assuming `var2` hasn’t been assigned a value before)
`object`	then `var2` must be a key and `var3` is the value


`array`	then `var1` will be iterated and `var2` contains the index
`object`	then it will be checked if key `var2` exists in `var1`
`set`	then `var1` will be iterated and `var2` contains the each item

Introduction to Rego#

Declarative Programming#

Logic Programming#

Existential Quantification#

Entrypoint#

Optimization#

Equality#

Policies, Rules & Functions#

Policies#

Rules#

Existential Quantification#

Functions#

Control Flow#

Basics#

Loops#

Syntax Ambiguity#

Pitfalls#

Exit-Early Optimization#

undefined values#

JSON “true”#

Testing#

Debugging#

Common Errors#

Complete rules must not produce multiple outputs#

Accessing undefined value in negated expression#

OPA for Critical Paths#

Introduction to Rego

Declarative Programming

Logic Programming

Existential Quantification

Entrypoint

Optimization

Equality

Policies, Rules & Functions

Policies

Rules

Existential Quantification

Functions

Control Flow

Basics

Loops

Syntax Ambiguity

Pitfalls

Exit-Early Optimization

`undefined` values

JSON “true”

Testing

Debugging

Common Errors

`Complete rules must not produce multiple outputs`

Accessing `undefined` value in negated expression

OPA for Critical Paths