ZON Specification

Zero Overhead Notation - Formal Specification

Version: 1.3.0

Date: 2025-12-01

Status: Stable Release

Authors: ZON Format Contributors

License: MIT

Abstract

Zero Overhead Notation (ZON) is a compact, line-oriented text format that encodes the JSON data model with minimal redundancy optimized for large language model token efficiency. ZON achieves up to 23.8% token reduction compared to JSON through single-character primitives (T, F), null as null, explicit table markers (@), colon-less nested structures, and intelligent quoting rules. Arrays of uniform objects use tabular encoding with column headers declared once; metadata uses flat key-value pairs. This specification defines ZON's concrete syntax, canonical value formatting, encoding/decoding behavior, conformance requirements, and strict validation rules. ZON provides deterministic, lossless representation achieving 100% LLM retrieval accuracy in benchmarks.

Status of This Document

This document is a Stable Release v1.1.0 and defines normative behavior for ZON encoders, decoders, and validators. Implementation feedback should be reported at https://github.com/ZON-Format/zon-TS.

Backward compatibility is maintained across v1.0.x releases. Major versions (v2.x) may introduce breaking changes.

Normative References

[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, March 1997.
https://www.rfc-editor.org/rfc/rfc2119

[RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words", BCP 14, RFC 8174, May 2017.
https://www.rfc-editor.org/rfc/rfc8174

[RFC8259] Bray, T., "The JavaScript Object Notation (JSON) Data Interchange Format", STD 90, RFC 8259, December 2017.
https://www.rfc-editor.org/rfc/rfc8259

Informative References

[RFC4180] Shafranovich, Y., "Common Format and MIME Type for Comma-Separated Values (CSV) Files", RFC 4180, October 2005.
https://www.rfc-editor.org/rfc/rfc4180

[ISO8601] ISO 8601:2019, "Date and time — Representations for information interchange".

[UNICODE] The Unicode Consortium, "The Unicode Standard", Version 15.1, September 2023.

Introduction
Terminology and Conventions
Data Model
Encoding Normalization
Decoding Interpretation
Concrete Syntax
Primitives
Strings and Keys
Objects
Arrays
Table Format
Quoting and Escaping
Whitespace
Conformance
Strict Mode Errors
Security
Internationalization
Interoperability
Media Type
Error Handling
Appendices

Introduction (Informative)

Purpose

ZON addresses token bloat in JSON while maintaining structural fidelity. By declaring column headers once, using single-character tokens, and eliminating redundant punctuation, ZON achieves optimal compression for LLM contexts.

Design Goals

Minimize tokens - Every character counts in LLM context windows
Preserve structure - 100% lossless round-trip conversion
Human readable - Debuggable, understandable format
LLM friendly - Explicit markers aid comprehension
Deterministic - Same input → same output
Deep Nesting - Efficiently handles complex, recursive structures

Use Cases

✅ Use ZON for:

LLM prompt contexts (RAG, few-shot examples)
Log storage and analysis
Configuration files
Browser storage (localStorage)
Tabular data interchange
Complex nested data structures (ZON excels here)

❌ Don't use ZON for:

Public REST APIs (use JSON for compatibility)
Real-time streaming protocols (not yet supported)
Files requiring comments (use YAML/JSONC)

Example

JSON (118 chars):

{"users":[{"id":1,"name":"Alice","active":true},{"id":2,"name":"Bob","active":false}]}

ZON (64 chars, 46% reduction):

users:@(2):active,id,name
T,1,Alice
F,2,Bob

1. Terminology and Conventions

1.1 RFC2119 Keywords

The keywords MUST, MUST NOT, REQUIRED, SHALL, SHALL NOT, SHOULD, SHOULD NOT, RECOMMENDED, MAY, and OPTIONAL are interpreted per [RFC2119] and [RFC8174].

1.2 Definitions

ZON document - UTF-8 text conforming to this specification

Line - Character sequence terminated by LF (\n)

Key-value pair - Line pattern: key:value

Table - Array of uniform objects with header + data rows

Table header - Pattern: key:@(N):columns or @(N):columns

Meta separator - Colon (:) separating keys/values

Table marker - At-sign (@) indicating table structure

Primitive - Boolean, null, number, or string (not object/array)

Uniform array - All elements are objects with identical keys

Strict mode - Validation enforcing row/column counts

2. Data Model

2.1 JSON Compatibility

ZON encodes the JSON data model:

Primitives: string | number | boolean | null
Objects: { [string]: JsonValue }
Arrays: JsonValue[]

2.2 Ordering

Arrays: Order MUST be preserved exactly
Objects: Key order MUST be preserved
- Encoders SHOULD sort keys alphabetically
- Decoders MUST preserve document order

2.3 Canonical Numbers

Requirements for ENCODER:

No leading zeros: 007 → invalid
No trailing zeros: 3.14000 → 3.14
No unnecessary decimals: Integer 5 stays 5, not 5.0
No scientific notation: 1e6 → 1000000, 1e-3 → 0.001
Special values map to null:
- NaN → null
- Infinity → null
- -Infinity → null

Implementation:

Integers: Use standard string representation
Floats: Ensure decimal point present, convert exponents to fixed-point
Special values: Normalized to null before encoding

Examples:

1000000      ✓ (not 1e6 or 1e+6)
0.001        ✓ (not 1e-3)
3.14         ✓ (not 3.140000)
42           ✓ (integer, no decimal)
null         ✓ (was NaN or Infinity)

Scientific notation:

1e6     ⚠️  Decoders MUST accept, encoders SHOULD avoid (prefer 1000000)
2.5E-3  ⚠️  Decoders MUST accept, encoders SHOULD avoid (prefer 0.0025)

Requirements:

Encoders MUST ensure decode(encode(x)) === x (round-trip fidelity)
No trailing zeros in fractional part (except .0 for float clarity)
No leading zeros (except standalone 0)
-0 normalizes to 0

2.4 Special Values

NaN → null
Infinity → null
-Infinity → null

3. Encoding Normalization

3.1 Host Type Mapping

Encoders MUST normalize non-JSON types before encoding:

JavaScript/TypeScript:

Input	ZON Output	Notes
`undefined`	`null`	Null
`Symbol()`	`null`	Not serializable
`function() {}`	`null`	Not serializable
`new Date()`	`"2025-11-28T10:00:00Z"`	ISO 8601 string
`new Set([1,2])`	`"[1,2]"`	Convert to array
`new Map([[k,v]])`	`"{k:v}"`	Convert to object
`BigInt(999)`	`"999"`	String if outside safe range

Python:

Input	ZON Output	Notes
`None`	`null`	Null
`datetime.now()`	`"2025-11-28T10:00:00Z"`	ISO 8601
`set([1,2])`	`"[1,2]"`	Convert to list
`Decimal('3.14')`	`3.14` or `"3.14"`	Number if no precision loss
`bytes(b'\x00')`	`"<base64>"`	Base64 encode

Implementations MUST document their normalization policy.

4. Decoding Interpretation

4.1 Type Inference

Unquoted tokens:

T           → true (boolean)
F           → false (boolean)
null        → null
42          → 42 (integer)
3.14        → 3.14 (float)
1e6         → 1000000 (number)
05          → "05" (string, leading zero)
hello       → "hello" (string)

Quoted tokens:

"T"         → "T" (string, not boolean)
"123"       → "123" (string, not number)
"hello"     → "hello" (string)
""          → "" (empty string)

4.2 Escape Sequences

Only these escapes are valid:

\\ → \
\" → "
\n → newline
\r → carriage return
\t → tab

Invalid escapes MUST error:

"\x41"      ❌ Invalid
"\u0041"    ❌ Invalid (use literal UTF-8)
"\b"        ❌ Invalid

4.3 Leading Zeros

Numbers with leading zeros are strings:

05          → "05" (string)
007         → "007" (string)
0           → 0 (number)

5. Concrete Syntax

5.1 Line Structure

ZON documents are line-oriented:

Lines end with LF (\n)
Empty lines are whitespace-only
Blank lines separate metadata from tables

5.2 Root Form

Determined by first non-empty line:

Root table:

@(2):id,name
1,Alice
2,Bob

Root object:

name:Alice
age:30

Root primitive:

5.3 ABNF Grammar

document     = object-form / table-form / primitive-form
object-form  = *(key-value / table-section)
table-form   = table-header 1*data-row
primitive-form = value

key-value    = key ":" value LF
table-header = [key ":"] "@" "(" count ")" ":" column-list LF
table-section = table-header 1*data-row
data-row     = value *("," value) LF

key          = unquoted-string / quoted-string
value        = primitive / quoted-compound
primitive    = "T" / "F" / "null" / number / unquoted-string
quoted-compound = quoted-string  ; Contains JSON-like notation

column-list  = column *("," column)
column       = key
count        = 1*DIGIT
number       = ["-"] 1*DIGIT ["." 1*DIGIT] [("e"/"E") ["+"/"-"] 1*DIGIT]

6. Primitives

6.1 Booleans

Encoding:

true → T
false → F

Decoding:

T (case-sensitive) → true
F (case-sensitive) → false

Rationale: 75% character reduction

6.2 Null

Encoding:

null → null (4-character literal)

Decoding:

null → null
Also accepts (case-insensitive): none, nil

Rationale: Clarity and readability over minimal compression

6.3 Numbers

Examples:

age:30
price:19.99
score:-42
temp:98.6
large:1000000

Rules:

Integers without decimal: 42
Floats with decimal: 3.14
Negatives with - prefix: -17
No thousands separators
Decimal separator is . (period)

7. Strings and Keys

7.1 Safe Strings (Unquoted)

Pattern: ^[a-zA-Z0-9_\-\.]+$

Examples:

name:Alice
full_name:Alice Smith
user_id:u123
version:v1.1.0
api-key:sk_test_key

7.2 Required Quoting

Quote strings if they:

Contain structural chars: ,, :, [, ], {, }, "
Match literal keywords: T, F, true, false, null, none, nil
Look like PURE numbers: 123, 3.14, 1e6 (Complex patterns like 192.168.1.1 or v1.1.0 do NOT need quoting)
Have whitespace: Leading/trailing spaces (MUST quote to preserve). Internal spaces are allowed in unquoted strings.
Are empty: "" (MUST quote to distinguish from null)
Contain escapes: Newlines, tabs, quotes (MUST quote to prevent structure breakage)

Examples:

message:"Hello, world"
path:"C:\Users\file"
empty:""
quoted:"true"
number:"123"
spaces:" padded "

7.3 ISO Date Optimization

ISO 8601 dates MAY be unquoted:

created:2025-11-28
timestamp:2025-11-28T10:00:00Z
time:10:30:00

Decoders interpret these as strings (not parsed as Date objects unless application logic does so).

8. Objects

8.1 Flat Objects

active:T
age:30
name:Alice

Decodes to:

{"active": true, "age": 30, "name": "Alice"}

8.2 Nested Objects

Grouped object notation (preferred):

config{database{host:localhost,port:5432},cache{ttl:3600}}

Quoted compound notation (legacy/alternative):

config:"{database:{host:localhost,port:5432},cache:{ttl:3600}}"

Alternatively using JSON string:

config:"{"database":{"host":"localhost","port":5432}}"

8.3 Empty Objects

metadata:"{}"

9. Arrays

9.1 Format Selection

Decision algorithm:

All elements are objects with same keys? → Table format
Otherwise → Inline quoted format

9.2 Inline Arrays

Primitive arrays:

tags:"[nodejs,typescript,llm]"
numbers:"[1,2,3,4,5]"
flags:"[T,F,T]"
mixed:"[hello,123,T,null]"

Empty:

items:"[]"

9.3 Irregularity Threshold

Uniform detection:

Calculate irregularity score:

For each pair of objects (i, j):
  similarity = shared_keys / (keys_i + keys_j - shared_keys)  # Jaccard
Avg_similarity = mean(all_similarities)
Irregularity = 1 - avg_similarity

Threshold:

If irregularity > 0.6 → Use inline format
If irregularity ≤ 0.6 → Use table format

10. Table Format

10.1 Header Syntax

With key:

users:@(2):active,id,name

Root array:

@(2):active,id,name

Components:

users - Array key (optional for root)
@ - Table marker (REQUIRED)
(2) - Row count (REQUIRED for strict mode)
: - Separator (REQUIRED)
active,id,name - Columns, comma-separated (REQUIRED)

10.2 Column Order

Columns SHOULD be sorted alphabetically:

users:@(2):active,id,name,role
T,1,Alice,admin
F,2,Bob,user

10.3 Data Rows

Each row is comma-separated values:

T,1,Alice,admin

Rules:

One row per line
Values encoded as primitives (§6-7)
Field count MUST equal column count (strict mode)
Missing values encode as null

10.4 Sparse Tables (v2.0)

Optional fields append as key:value:

users:@(3):id,name
1,Alice
2,Bob,role:admin,score:98
3,Carol

Row 2 decodes to:

{"id": 2, "name": "Bob", "role": "admin", "score": 98}

10.6 Hierarchical Sparse Encoding (v1.1.0)

Nested objects in tables are flattened using dot notation for sparse columns.

users:@(2):id
1,meta.role:admin
2,meta.role:user,meta.active:T

11. Quoting and Escaping

11.1 CSV Quoting (RFC 4180)

For table values containing commas:

messages:@(1):id,text
1,"He said ""hello"" to me"

Rules:

Wrap in double quotes: "value"
Escape internal quotes by doubling: " → ""

11.2 Escape Sequences

multiline:"Line 1\nLine 2"
tab:"Col1\tCol2"
quote:"She said \"Hi\""
backslash:"C:\\path\\file"

Valid escapes:

\\ → \
\" → "
\n → newline
\r → CR
\t → tab

11.3 Unicode

Use literal UTF-8 (no \uXXXX escapes):

chinese:王小明
emoji:✅
arabic:مرحبا

12. Whitespace and Line Endings

12.1 Encoding Rules

Encoders MUST:

Use LF (\n) line endings
NOT emit trailing whitespace on lines
NOT emit trailing newline at EOF (RECOMMENDED)
MAY emit one blank line between metadata and table

12.2 Decoding Rules

Decoders SHOULD:

Accept LF or CRLF (normalize to LF)
Ignore trailing whitespace per line
Treat multiple blank lines as single separator

13. Conformance and Options

13.1 Encoder Checklist

✅ A conforming encoder MUST:

13.2 Decoder Checklist

✅ A conforming decoder MUST:

13.3 Strict Mode

Enabled by default in reference implementation.

Enforces:

Table row count = declared (N)
Each row field count = column count
No malformed headers
No invalid escapes
No unterminated strings

Non-strict mode MAY tolerate count mismatches.

14. Schema Validation (LLM Evals)

ZON includes a runtime schema validation library designed for LLM guardrails. It allows defining expected structures and validating LLM outputs against them.

14.1 Schema Definition

import { zon } from 'zon-format';

const UserSchema = zon.object({
  name: zon.string().describe("Full name"),
  age: zon.number(),
  role: zon.enum(['admin', 'user']),
  tags: zon.array(zon.string()).optional()
});

14.2 Prompt Generation

Schemas can generate system prompts to guide LLMs:

const prompt = UserSchema.toPrompt();
// Output:
// object:
//   - name: string - Full name
//   - age: number
//   - role: enum(admin, user)
//   - tags: array of [string] (optional)

14.3 Validation

import { validate } from 'zon-format';

const result = validate(llmOutputString, UserSchema);

if (result.success) {
  console.log(result.data); // Typed data
} else {
  console.error(result.error); // "Expected number at age, got string"
}

15. Strict Mode Errors

15.1 Table Errors

Code	Error	Example
E001	Row count mismatch	`@(2)` but 3 rows
E002	Field count mismatch	3 columns, row has 2 values
E003	Malformed header	Missing `@`, `(N)`, or `:`
E004	Invalid column name	Unescaped special chars

14.2 Syntax Errors

Code	Error	Example
E101	Invalid escape	`"\x41"` instead of `"A"`
E102	Unterminated string	`"hello` (no closing quote)
E103	Missing colon	`name Alice` → `name:Alice`
E104	Empty key	`:value`

14.3 Format Errors

Code	Error	Example
E201	Trailing whitespace	Line ends with spaces
E202	CRLF line ending	`\r\n` instead of `\n`
E203	Multiple blank lines	More than one consecutive
E204	Trailing newline	Document ends with `\n`

17. Security Considerations

15.1 Resource Limits

Implementations SHOULD limit:

Document size: 100 MB
Line length: 1 MB
Nesting depth: 100 levels
Array length: 1,000,000
Object keys: 100,000

Prevents denial-of-service attacks.

15.2 Validation

Validate UTF-8 strictly
Error on invalid escapes
Reject malformed numbers
Limit recursion depth

15.3 Injection Prevention

ZON does not execute code. Applications MUST sanitize before:

SQL queries
Shell commands
HTML rendering

18. Internationalization

16.1 Character Encoding

REQUIRED: UTF-8 without BOM

Decoders MUST:

Reject invalid UTF-8
Reject BOM (U+FEFF) at start

16.2 Unicode

Full Unicode support:

Emoji: ✅, 🚀
CJK: 王小明, 日本語
RTL: مرحبا, שלום

16.3 Locale Independence

Decimal separator: . (period)
No thousands separators
ISO 8601 dates for internationalization

19. Interoperability

17.1 JSON

ZON → JSON: Lossless
JSON → ZON: Lossless, with 35-50% compression for tabular data

Example:

{"users": [{"id": 1, "name": "Alice"}]}

↓ ZON (42% smaller)

users:@(1):id,name
1,Alice

17.2 CSV

CSV → ZON: Add type awareness ZON → CSV: Table rows export cleanly

Advantages over CSV:

Type preservation
Metadata support
Nesting capability

17.3 TOON

Comparison:

ZON: Flat, @(N), T/F/null → Better compression
TOON: Indented, [N]{fields}:, true/false → Better readability Both are LLM-optimized; choose based on data shape.

20. Media Type & File Extension

18.1 File Extension

Extension: .zonf

ZON files use the .zonf extension (ZON Format) for all file operations.

Examples:

data.zonf
users.zonf
config.zonf

18.2 Media Type

Status: Provisional (not yet registered with IANA) Media type: text/zonf

Charset: UTF-8 (always)

ZON documents are always UTF-8 encoded. The charset=utf-8 parameter may be specified but defaults to UTF-8 when omitted.

20.1 HTTP Headers

Content-Type: text/zonf
Content-Type: text/zonf; charset=utf-8  # Explicit (optional)

20.2 Nginx Configuration

types {
    text/zonf zonf;
}

server {
    default_type text/zonf;
}

20.3 Apache Configuration

AddType text/zonf .zonf

20.4 File Extension

Files containing ZON data should use the .zonf extension.

20.5 MIME Sniffing

If the Content-Type header is missing, clients may sniff the content. ZON files typically start with a header row or a ZON node.

HTTP/1.1 200 OK
Content-Type: text/zonf; charset=utf-8
Content-Length: 1234

users:@(2):id,name
1,Alice
2,Bob

18.4 Character Encoding

Normative requirement: ZON files MUST be UTF-8 encoded.

Rationale:

Universal support across programming languages
Compatible with JSON (RFC 8259)
No byte-order mark (BOM) required
Supports full Unicode character set

Encoding declaration: Not required (always UTF-8)

18.5 IANA Registration

Current status: Not registered

Future work: Formal registration with IANA is planned for v2.0.

Template for registration:

Type name: text
Subtype name: zon
Required parameters: None
Optional parameters: charset (default: utf-8)
Encoding considerations: Always UTF-8
Security considerations: See §15
Interoperability considerations: None known
Published specification: This document
Applications that use this media type: Data serialization for LLMs
Fragment identifier considerations: N/A
Additional information:
  File extension: .zonf
  Macintosh file type code: TEXT
  Uniform Type Identifier: public.zon
Person & email address: See repository
Intended usage: COMMON
Restrictions on usage: None

18.2 IANA Status

Provisional (not yet IANA-registered). May pursue formal registration at v2.0.

Appendices

Appendix A: Examples

A.1 Simple Object

active:T
age:30
name:Alice

A.2 Table

users:@(2):active,id,name
T,1,Alice
F,2,Bob

A.3 Mixed

tags:"[api,auth]"
version:1.0
users:@(1):id,name
1,Alice

A.4 Root Array

@(2):id,name
1,Alice
2,Bob

Appendix B: Test Suite

Coverage:

✅ 28/28 unit tests
✅ 27/27 roundtrip tests
✅ 100% data integrity

Test categories:

Primitives (T, F, null, numbers, strings)
Tables (uniform arrays)
Quoting, escaping
Round-trip fidelity
Edge cases, errors

Appendix C: Changelog

v1.1.0 (2025-12-01)

Sparse Tables (key:value in rows)
Hierarchical Sparse Encoding (deep flattening)
Metadata Optimization (grouped objects)
Type Coercion (opt-in)
LLM Optimizations (encodeLLM)

v1.0.4 (2025-11-29)

Disabled sequential column omission
100% LLM accuracy achieved
All columns explicit

v1.0.2 (2025-11-27)

Irregularity threshold tuning
ISO date detection
Sparse table encoding v1.0.1 (2025-11-26)
License: MIT
Documentation updates

v1.0.0 (2025-11-26)

Initial stable release
Single-character primitives
Table format
Lossless round-trip

Appendix D: Parsing Algorithm

Decoder flow:

1. Split by lines (LF/CRLF)
2. Detect root form (table / object / primitive)
3. If table:
   a. Parse header: @(N):columns
   b. Read N rows
   c. Split by comma (CSV-aware)
   d. Map to objects
4. If object:
   a. Parse key:value pairs
   b. Build object
5. Return decoded value

CSV-aware row splitting:

function parseRow(line, columns) {
  const values = [];
  let current = '';
  let inQuotes = false;
  
  for (let i = 0; i < line.length; i++) {
    const char = line[i];
    
    if (char === '"' && !inQuotes) {
      inQuotes = true;
    } else if (char === '"' && inQuotes) {
      if (line[i+1] === '"') {  // Escaped quote
        current += '"';
        i++;
      } else {
        inQuotes = false;
      }
    } else if (char === ',' && !inQuotes) {
      values.push(parseValue(current.trim()));
      current = '';
    } else {
      current += char;
    }
  }
  
  values.push(parseValue(current.trim()));
  return values;
}

Appendix E: License

MIT License

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

End of Specification

FilesExpand file tree

SPEC.md

Latest commit

History

SPEC.md

File metadata and controls

ZON Specification

Zero Overhead Notation - Formal Specification

Abstract

Status of This Document

Normative References

Informative References

Table of Contents

Introduction (Informative)

Purpose

Design Goals

Use Cases

Example

1. Terminology and Conventions

1.1 RFC2119 Keywords

1.2 Definitions

2. Data Model

2.1 JSON Compatibility

2.2 Ordering

2.3 Canonical Numbers

2.4 Special Values

3. Encoding Normalization

3.1 Host Type Mapping

4. Decoding Interpretation

4.1 Type Inference

4.2 Escape Sequences

4.3 Leading Zeros

5. Concrete Syntax

5.1 Line Structure

5.2 Root Form

5.3 ABNF Grammar

6. Primitives

6.1 Booleans

6.2 Null

6.3 Numbers

7. Strings and Keys

7.1 Safe Strings (Unquoted)

7.2 Required Quoting

7.3 ISO Date Optimization

8. Objects

8.1 Flat Objects

8.2 Nested Objects

8.3 Empty Objects

9. Arrays

9.1 Format Selection

9.2 Inline Arrays

9.3 Irregularity Threshold

10. Table Format

10.1 Header Syntax

10.2 Column Order

10.3 Data Rows

10.4 Sparse Tables (v2.0)

10.6 Hierarchical Sparse Encoding (v1.1.0)

11. Quoting and Escaping

11.1 CSV Quoting (RFC 4180)

11.2 Escape Sequences

11.3 Unicode

12. Whitespace and Line Endings

12.1 Encoding Rules

12.2 Decoding Rules

13. Conformance and Options

13.1 Encoder Checklist

13.2 Decoder Checklist

13.3 Strict Mode

14. Schema Validation (LLM Evals)

14.1 Schema Definition

14.2 Prompt Generation

14.3 Validation

15. Strict Mode Errors

15.1 Table Errors

14.2 Syntax Errors

14.3 Format Errors

17. Security Considerations

15.1 Resource Limits

15.2 Validation