Document Unfolding and Cycle Detection Reference

TerminusDB provides automatic document unfolding for linked documents through two mechanisms: class-level @unfoldable and field-level @unfold annotations. This reference guide explains how unfolding works, how cycle detection prevents infinite recursion, and what the performance characteristics of the implementation are.

Valid as of the 11.2 release.

What is Document Unfolding?

Document unfolding is the process of automatically expanding referenced documents when retrieving data through the Document API, GraphQL, or WOQL.

There are two ways to enable unfolding:

Class-level @unfoldable: Mark a class with @unfoldable: [] to automatically expand all references to documents of that class
Field-level @unfold: Add @unfold: true to individual properties to selectively enable expansion for specific relationships

Example Schema

Example: JSON

{
  "@type": "Class",
  "@id": "Person",
  "@unfoldable": [],
  "name": "xsd:string",
  "friend": {
    "@type": "Set",
    "@class": "Person"
  }
}

Unfolded vs Non-Unfolded Results

Without @unfoldable (Reference Only):

Example: JSON

{
  "@id": "Person/Alice",
  "@type": "Person",
  "name": "Alice",
  "friend": ["Person/Bob"]  // Just ID strings
}

With @unfoldable (Automatically Expanded):

Example: JSON

{
  "@id": "Person/Alice",
  "@type": "Person",
  "name": "Alice",
  "friend": [
    {
      "@id": "Person/Bob",
      "@type": "Person",
      "name": "Bob"
    }
  ]
}

Field-Level @unfold

In addition to class-level @unfoldable, you can enable unfolding on individual properties using @unfold: true. This provides fine-grained control when you need different unfolding behavior for different relationships to the same class.

Field-Level @unfold Example

Example: JSON

{
  "@type": "Class",
  "@id": "Order",
  "orderNumber": "xsd:string",
  "customer": {
    "@type": "Optional",
    "@class": "Customer",
    "@unfold": true
  },
  "product": {
    "@type": "Optional",
    "@class": "Product"
  }
}

Result when retrieving an Order:

Example: JSON

{
  "@id": "Order/123",
  "@type": "Order",
  "orderNumber": "ORD-123",
  "customer": {
    "@id": "Customer/alice",
    "@type": "Customer",
    "name": "Alice",
    "email": "alice@example.com"
  },
  "product": "Product/widget"
}

The customer is unfolded inline while product remains as an ID reference.

When to Use Field-Level vs Class-Level

Scenario	Recommendation
All references to a class should unfold	Use class-level `@unfoldable`
Different properties need different behavior	Use field-level `@unfold`
Target class is external/unmodifiable	Use field-level `@unfold`
Mixed use cases for same class	Use field-level `@unfold` on specific properties

Interaction Between @unfoldable and @unfold

Class `@unfoldable`	Property `@unfold`	Behavior
No	No	Return ID reference
No	Yes	Unfold inline
Yes	No	Unfold inline
Yes	Yes	Unfold inline

The property-level @unfold: true acts as an override to enable unfolding for properties pointing to non-unfoldable classes.
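
The interaction table reduces to a logical OR of the two annotations. A minimal Python sketch of that rule follows; the function name `should_unfold` is hypothetical and not part of TerminusDB, and cycle detection is deliberately left out.

```python
def should_unfold(class_unfoldable: bool, property_unfold: bool) -> bool:
    """Return True when a reference would be expanded inline.

    Hypothetical helper mirroring the interaction table: a reference is
    unfolded if the target class is @unfoldable OR the property carries
    @unfold: true. Cycle detection is not modelled here.
    """
    return class_unfoldable or property_unfold


# Reproduce the four rows of the table.
for cls in (False, True):
    for prop in (False, True):
        behavior = "Unfold inline" if should_unfold(cls, prop) else "Return ID reference"
        print(f"class={cls!s:5} property={prop!s:5} -> {behavior}")
```

Only the first row (neither annotation present) yields an ID reference, which is why the property-level annotation can be read as an opt-in for otherwise non-unfoldable targets.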

Supported Property Types

The @unfold annotation works with all property type families:

Property Type	Example Syntax
Optional	`{ "@type": "Optional", "@class": "Customer", "@unfold": true }`
Set	`{ "@type": "Set", "@class": "Customer", "@unfold": true }`
Array	`{ "@type": "Array", "@class": "Customer", "@unfold": true }`
List	`{ "@type": "List", "@class": "Customer", "@unfold": true }`
Cardinality	`{ "@type": "Cardinality", "@class": "Customer", "@min_cardinality": 1, "@max_cardinality": 5, "@unfold": true }`

Mixed Unfold Behavior Example

Example: JSON

[
  {
    "@type": "Class",
    "@id": "UnfoldableClass",
    "@unfoldable": [],
    "data": "xsd:string"
  },
  {
    "@type": "Class",
    "@id": "RegularClass",
    "value": "xsd:string"
  },
  {
    "@type": "Class",
    "@id": "TestClass",
    "unfoldableRef": {
      "@type": "Optional",
      "@class": "UnfoldableClass"
    },
    "regularWithUnfold": {
      "@type": "Optional",
      "@class": "RegularClass",
      "@unfold": true
    },
    "regularWithoutUnfold": {
      "@type": "Optional",
      "@class": "RegularClass"
    }
  }
]

When retrieving a TestClass document with unfold=true:

Example: JSON

{
  "@id": "TestClass/test1",
  "@type": "TestClass",
  "unfoldableRef": {
    "@id": "UnfoldableClass/u1",
    "@type": "UnfoldableClass",
    "data": "unfoldable data"
  },
  "regularWithUnfold": {
    "@id": "RegularClass/r1",
    "@type": "RegularClass",
    "value": "regular value 1"
  },
  "regularWithoutUnfold": "RegularClass/r2"
}

unfoldableRef is unfolded because UnfoldableClass has @unfoldable: []
regularWithUnfold is unfolded because the property has @unfold: true
regularWithoutUnfold returns just the ID because neither condition is met

Best Practices for Field-Level @unfold

1. Use for Context-Specific Expansion:

Example: JSON

{
  "@type": "Class",
  "@id": "Invoice",
  "billingAddress": {
    "@type": "Optional",
    "@class": "Address",
    "@unfold": true
  },
  "shippingAddress": {
    "@type": "Optional",
    "@class": "Address"
  }
}

Billing address is always needed inline, but shipping address can be fetched separately.

2. Combine with @unfoldable for Default Behavior: Use @unfoldable for classes that should always be expanded, and @unfold for context-specific overrides.

3. Consider Performance: Each unfolded property adds to the response size. Only use @unfold for properties that are frequently accessed together with the parent document.

4. Avoid Deep Unfold Chains: Be cautious with nested @unfold chains that create 4+ levels of automatic expansion.

Cycle Detection

When documents reference themselves directly or indirectly, TerminusDB's cycle detection mechanism prevents infinite recursion while ensuring all nodes are properly rendered.

How Cycle Detection Works

The unfolding implementation uses a path stack to track the current traversal from root to the current node. When a document ID is encountered that's already in the current path, a cycle is detected:

Path Stack Maintained: As traversal descends into children, document IDs are pushed onto the stack
Cycle Check: Before expanding a document, check if its ID is already in the current path
ID Reference Returned: If a cycle is detected, return just the @id string instead of expanding
Backtrack: When returning from a child, pop its ID from the stack
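
The four steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not TerminusDB's implementation: the in-memory `DOCS` store and the `unfold` function are hypothetical, and every reference is treated as unfoldable.

```python
import json
from typing import Any, Dict, List, Optional

# Toy in-memory store: document ID -> document (assumption for illustration).
DOCS: Dict[str, Dict[str, Any]] = {
    "Node/A": {"@id": "Node/A", "@type": "Node", "name": "Node A", "next": "Node/B"},
    "Node/B": {"@id": "Node/B", "@type": "Node", "name": "Node B", "next": "Node/A"},
}


def unfold(doc_id: str, path: Optional[List[str]] = None) -> Any:
    """Expand doc_id; return the bare ID if it is already on the current path."""
    if path is None:
        path = []
    # Cycle check: is this ID an ancestor on the root-to-node path?
    if doc_id in path:
        return doc_id
    path.append(doc_id)  # descend: push onto the path stack
    result: Dict[str, Any] = {}
    for key, value in DOCS[doc_id].items():
        if key != "@id" and isinstance(value, str) and value in DOCS:
            result[key] = unfold(value, path)  # reference to another document
        else:
            result[key] = value  # plain value, copied as-is
    path.pop()  # backtrack: pop when returning from this node
    return result


print(json.dumps(unfold("Node/A"), indent=2))
```

Because the stack is popped on the way back up, a document that appears on two different branches is still expanded on each branch; only a repeat along a single root-to-node path is cut short.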

Cycle Detection Behavior Examples

Direct Self-Reference

Schema:

Example: JSON

{
  "@type": "Class",
  "@id": "LinguisticObject",
  "@unfoldable": [],
  "name": "xsd:string",
  "partOf": {
    "@type": "Set",
    "@class": "LinguisticObject"
  }
}

Data:

Example: JSON

{
  "@id": "LinguisticObject/self",
  "@type": "LinguisticObject",
  "name": "Self Referencing",
  "partOf": ["LinguisticObject/self"]  // Points to itself
}

Result:

Example: JSON

{
  "@id": "LinguisticObject/self",
  "@type": "LinguisticObject",
  "name": "Self Referencing",
  "partOf": ["LinguisticObject/self"]  // ID string, not expanded
}

Circular Reference Chain (A→B→A)

Data:

Example: JSON

[
  {
    "@id": "Node/A",
    "@type": "Node",
    "name": "Node A",
    "next": "Node/B"
  },
  {
    "@id": "Node/B",
    "@type": "Node",
    "name": "Node B",
    "next": "Node/A"  // Back to A
  }
]

Result (retrieving Node/A):

Example: JSON

{
  "@id": "Node/A",
  "@type": "Node",
  "name": "Node A",
  "next": {
    "@id": "Node/B",
    "@type": "Node",
    "name": "Node B",
    "next": "Node/A"  // Cycle detected, ID string returned
  }
}

Multiple Circular Paths

For complex graphs with multiple interconnected cycles, each path is tracked independently. Nodes are expanded until they appear again in the current traversal path.

Graph:

Example: Text

A → B → C → A (cycle)
A → D → A (cycle)
B → D

The cycle detection ensures no node is expanded more than once per path, preventing infinite recursion while rendering all reachable nodes.
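
To make the per-path behavior concrete, here is a small Python sketch that expands the graph above starting from A. The `EDGES` table, the `links` field name, and the `expand` function are assumptions made for this illustration; an immutable tuple stands in for the push/pop path stack.

```python
from typing import Any, Dict, List, Tuple

# Adjacency list for the example graph: A->B->C->A, A->D->A, B->D.
EDGES: Dict[str, List[str]] = {
    "A": ["B", "D"],
    "B": ["C", "D"],
    "C": ["A"],
    "D": ["A"],
}


def expand(node: str, path: Tuple[str, ...] = ()) -> Any:
    """Expand node unless it already appears on the current path."""
    if node in path:
        return node  # cycle on this path: return the ID only
    # Each child receives its own extended path, so sibling branches
    # never see each other's ancestors.
    return {"@id": node, "links": [expand(child, path + (node,)) for child in EDGES[node]]}


tree = expand("A")
print(tree)
```

In this sketch D is expanded twice, once beneath B and once directly beneath A, because the two occurrences lie on different paths. Every reference back to A collapses to an ID string, since A is an ancestor on every path.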

Deep Nested Structures

For long chains (e.g., 100+ nodes without cycles), TerminusDB traverses the entire structure:

Example: JSON

{
  "@id": "ChainNode/0",
  "value": 0,
  "next": {
    "@id": "ChainNode/1",
    "value": 1,
    "next": {
      "@id": "ChainNode/2",
      "value": 2,
      // ... continues for all 100 nodes
    }
  }
}

Work Limit Protection

To prevent excessive resource consumption during document unfolding, TerminusDB implements a work limit that caps the total number of operations during traversal.

Configuration

Environment Variable: TERMINUSDB_DOC_WORK_LIMIT

Default: 500,000 operations

Setting Custom Limit:

Example: Bash

# Linux/macOS
export TERMINUSDB_DOC_WORK_LIMIT=1000000

# Docker
docker run -e TERMINUSDB_DOC_WORK_LIMIT=1000000 terminusdb/terminusdb-server:latest

# Kubernetes (env entry in a container spec, YAML)
env:
  - name: TERMINUSDB_DOC_WORK_LIMIT
    value: "1000000"

When Work Limit is Exceeded

If document traversal exceeds the work limit:

Traversal Terminates: Document retrieval stops
Error Returned: Returns DocRetrievalError::LimitExceeded
Partial Results: No partial data is returned
Document IRI Included: Error message includes the document IRI that triggered the limit
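
The behavior listed above can be approximated with a simple counter. The Python sketch below is not TerminusDB's code: `LimitExceeded`, `WorkBudget`, and `unfold_chain` are hypothetical names, and charging one operation per visited document is an assumption, since this guide does not define what counts as an operation.

```python
import os
from typing import Any, Dict


class LimitExceeded(Exception):
    """Hypothetical stand-in for DocRetrievalError::LimitExceeded."""


def read_work_limit(default: int = 500_000) -> int:
    """Read TERMINUSDB_DOC_WORK_LIMIT, falling back to the documented default."""
    return int(os.environ.get("TERMINUSDB_DOC_WORK_LIMIT", default))


class WorkBudget:
    """Counts operations and aborts the traversal once the limit is passed."""

    def __init__(self, limit: int) -> None:
        self.limit = limit
        self.used = 0

    def charge(self, doc_id: str) -> None:
        # Assumption: one unit of work per document visited.
        self.used += 1
        if self.used > self.limit:
            raise LimitExceeded(f"work limit {self.limit} exceeded at {doc_id}")


def unfold_chain(index: int, length: int, budget: WorkBudget) -> Dict[str, Any]:
    """Unfold a synthetic chain ChainNode/index -> ... -> ChainNode/length-1."""
    doc_id = f"ChainNode/{index}"
    budget.charge(doc_id)
    doc: Dict[str, Any] = {"@id": doc_id, "value": index}
    if index + 1 < length:
        doc["next"] = unfold_chain(index + 1, length, budget)
    return doc


# A deliberately tiny limit: the exception propagates to the caller,
# so no partially unfolded document is ever returned.
try:
    unfold_chain(0, 100, WorkBudget(limit=10))
except LimitExceeded as err:
    print(f"retrieval failed: {err}")
```

Raising an exception rather than returning a truncated document matches the "no partial data" rule, and the message carries the ID of the document being processed when the budget ran out.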

Recommended Limits by Use Case:

Use Case	Recommended Limit	Rationale
Simple documents	100,000	Sufficient for most simple use cases
Complex hierarchies	500,000 (default)	Balanced performance/safety
Large knowledge graphs	1,000,000 - 5,000,000	Deep traversals needed
Real-time APIs	50,000 - 100,000	Prioritize response time

Performance Characteristics

Path Stack Implementation

TerminusDB uses a Vec-based path stack for cycle detection, which is optimal for this use case:

Why Vec (not HashSet):

Path stack semantics: The visited collection tracks the current DFS path, not all visited nodes
Small size: Path depth is typically 10-50 nodes, not thousands
Cache-friendly: Sequential access pattern
Stack mirroring: Push/pop operations naturally mirror traversal stack

Benchmarks show the Vec-based path stack running at roughly twice the speed of a HashSet across both small and large documents.

Empirical Results:

For path depth < 100: Vec is faster than HashSet (no hash overhead)
For path depth > 100: Difference is negligible in practice
Real-world path depths: typically 10-50 nodes

Schema Design Recommendations

1. Limit Depth:

Example: JSON

{
  "@type": "Class",
  "@id": "Category",
  "@unfoldable": [],
  "name": "xsd:string",
  "parent": {
    "@type": "Optional",
    "@class": "Category"  // Parent-child hierarchy
  },
  "subcategories": {
    "@type": "Set",
    "@class": "SubCategory"  // Use different class for children
  }
}

2. Separate Unfoldable and Non-Unfoldable Relationships:

Example: JSON

{
  "@type": "Class",
  "@id": "Person",
  "@unfoldable": [],
  "name": "xsd:string",
  "profile": {
    "@type": "Optional",
    "@class": "Profile"  // Profile is @unfoldable
  },
  "posts": {
    "@type": "Set",
    "@class": "Post"  // Post is NOT @unfoldable (too many)
  }
}

3. Use Optional or Set/Cardinality for Potentially Circular References:

Example: JSON

{
  "@type": "Class",
  "@id": "Node",
  "@unfoldable": [],
  "next": {
    "@type": "Optional",  // Allows termination, similar to Set/Cardinality
    "@class": "Node"
  }
}

Troubleshooting

Document Retrieval Returns Just IDs

Symptom: Expected nested objects, got ID strings

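Cause: A cycle was detected, or neither the target class is marked @unfoldable nor the property is marked @unfold: true

Solution:

Verify the target class has the @unfoldable: [] annotation, or the property has @unfold: true
Check whether a circular reference exists (an ID string is the expected result in that case)
Review the schema for missing or misplaced unfold annotations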

Work Limit Exceeded Errors

Symptom: DocRetrievalError::LimitExceeded during retrieval

Cause: Document graph too large or deeply nested

Solutions:

Increase limit: Set TERMINUSDB_DOC_WORK_LIMIT environment variable
Reduce unfoldable depth: Mark fewer classes as @unfoldable
Break circular references: Ensure proper data structure
Use pagination: Fetch large collections separately

Performance Degradation

Symptom: Slow document retrieval

Cause: Large unfoldable graphs

Solutions:

Profile query: Check path depth and node count
Reduce unfoldable scope: Only unfold necessary relationships

API Examples

Document API

Example: Bash

# Retrieve with automatic unfolding (default)
curl -X GET "http://localhost:6363/api/document/admin/mydb" \
  -H "Authorization: Basic YWRtaW46cm9vdA==" \
  -H "Content-Type: application/json" \
  -d '{"graph_type": "instance", "id": "Person/Alice", "as_list": true}'
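
For scripted access, the same request can be prepared with Python's standard library. This sketch mirrors the curl call above (same host, database path, JSON body, and the default admin:root credentials) and only builds the request object. The helper name `build_get_request` is hypothetical, and setting Content-Type to application/json is an assumption about what the server expects for a JSON body.

```python
import base64
import json
import urllib.request

BASE_URL = "http://localhost:6363/api/document/admin/mydb"  # from the curl example


def build_get_request(doc_id: str, user: str = "admin", password: str = "root") -> urllib.request.Request:
    """Build a GET request with a JSON body, mirroring the curl example."""
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    body = json.dumps({"graph_type": "instance", "id": doc_id, "as_list": True}).encode()
    return urllib.request.Request(
        BASE_URL,
        data=body,
        headers={
            "Authorization": f"Basic {token}",
            "Content-Type": "application/json",  # assumption: body is sent as JSON
        },
        method="GET",
    )


req = build_get_request("Person/Alice")
print(req.get_method(), req.full_url)

# Sending requires a running server, so it is left commented out:
# with urllib.request.urlopen(req) as resp:
#     documents = json.load(resp)
```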

GraphQL

Example: GraphQL

# Unfolding happens automatically for @unfoldable classes
query {
  Person {
    name
    friend {  # Automatically expanded
      name
      friend {  # Nested expansion
        name
      }
    }
  }
}

WOQL

Example: JavaScript

// Using WOQL to read documents with unfolding
WOQL.read_document("Person/Alice", "v:Doc")

Related Documentation

Schema Reference Guide - Complete schema annotation reference
Document API Reference - HTTP API for documents
GraphQL Reference - GraphQL query syntax
Path Queries - Advanced path traversal

Summary

Key Takeaways:

@unfoldable (class-level) automatically expands all references to a class
@unfold: true (field-level) selectively enables expansion for specific properties
Cycle detection prevents infinite recursion using ancestor path tracking
Vec-based implementation is optimal for path-bounded traversal
TERMINUSDB_DOC_WORK_LIMIT protects against excessive operations
ID references returned when cycles detected (not an error)
Path depth typically 10-50 nodes (not total document count)

Performance Notes:

Vec path stack: O(d) lookup where d = depth (typically < 50)
Work limit default: 500,000 operations
Memory overhead: 8 bytes per path depth level
Cache-friendly sequential access pattern

Last Updated: October 31, 2025
Applies to: TerminusDB 11.2+
