Collateral for working on "DSLX", the DSL from XLS, via LLMs

Repository for LLM prompts and Q&A samples for the XLS Domain Specific Language (DSL) called DSLX.

Structure of this project

prompt.md: "prelude" for querying an LLM for some synthesizable hardware computation. Paste this into your LLM interface first, before a sample prompt.
samples/*.md: sample prompts, each with acceptance tests that decide whether an LLM response to that prompt is accepted or rejected.
test_prompt.py: a pytest-based test file that extracts all the DSLX code blocks from the prompt and runs them against an interpreter binary to check that they all pass. This helps ensure the prompt shows correct and complete examples as we add to and expand its content.
eval.py: feeds the system prompt and sample prompt(s) to a model API and tests them against the acceptance tests, feeding back any errors that occur up to a max retry count. Emits a scorecard at the end as a success indicator.
proc_eval.py: proc-oriented analogue of eval.py, using proc-specific prompt material and proc samples under proc_eval/.
proc_eval/: proc-specific prompt collateral, samples, and tests.
docs/openrouter.md: how to run the harness against arbitrary hosted models through OpenRouter.
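
As an illustration of the extraction step described for test_prompt.py, here is a minimal Python sketch. It is not the repository's actual implementation: the function name `extract_dslx_blocks` is hypothetical, and it assumes the DSLX examples in prompt.md are fenced with a `dslx` info string.

```python
import re

# Built programmatically so this sketch can itself live inside a fenced block.
FENCE = "`" * 3

# Assumption: DSLX examples are fenced with a `dslx` info string.
BLOCK_RE = re.compile(
    r"^" + FENCE + r"dslx[ \t]*\n(.*?)^" + FENCE + r"[ \t]*$",
    re.DOTALL | re.MULTILINE,
)


def extract_dslx_blocks(markdown_text: str) -> list[str]:
    """Return the bodies of all dslx-tagged fenced blocks, in document order."""
    return [match.group(1) for match in BLOCK_RE.finditer(markdown_text)]


if __name__ == "__main__":
    sample = "\n".join([
        "Some prose.",
        FENCE + "dslx",
        "fn id(x: u8) -> u8 { x }",
        FENCE,
        FENCE + "python",
        "print('not dslx')",
        FENCE,
    ])
    # Only the dslx-tagged block is returned; the python block is ignored.
    print(extract_dslx_blocks(sample))
```

Each extracted body could then be written to a temporary .x file and handed to the interpreter binary.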

To run eval on a specific sample:

export OPENAI_API_KEY=""  # replace with your key
export XLSYNTH_TOOLS=""  # replace with xlsynth tools dir via github.com/xlsynth/xlsynth/releases
python eval.py --model gpt-3.5-turbo --sample saturating_addsub --max-retries 5
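
The retry-with-feedback loop that --max-retries controls can be pictured roughly as follows. This is a sketch under assumptions, not the code in eval.py: `generate_with_retries`, `ask_model`, and `run_tests` are hypothetical names, and it assumes one initial attempt followed by up to `max_retries` retries.

```python
from typing import Callable, Optional


def generate_with_retries(
    ask_model: Callable[[list[str]], str],
    run_tests: Callable[[str], Optional[str]],
    prompt: str,
    max_retries: int,
) -> tuple[Optional[str], int]:
    """Return (accepted_code, attempts_used); accepted_code is None on failure.

    ask_model maps the conversation so far to a candidate answer.
    run_tests returns None on success, or an error message on failure.
    """
    transcript = [prompt]
    # Assumption: one initial attempt plus up to max_retries retries.
    for attempt in range(1, max_retries + 2):
        candidate = ask_model(transcript)
        error = run_tests(candidate)
        if error is None:
            return candidate, attempt
        # Feed the failure back so the next attempt can correct it.
        transcript += [candidate, "The tests failed with:\n" + error]
    return None, max_retries + 1


if __name__ == "__main__":
    # Stub model that gets it right on the second attempt.
    answers = iter(["fn f() -> u8 { u8:0 }", "fn f() -> u8 { u8:1 }"])
    fake_model = lambda transcript: next(answers)
    fake_tests = lambda code: None if "u8:1" in code else "expected u8:1"
    print(generate_with_retries(fake_model, fake_tests, "write f", max_retries=5))
```

A scorecard would then be a tally of these per-sample results.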

To list or run proc samples:

python proc_eval.py --list
python proc_eval.py --model gpt-4o-mini --sample counter --max-retries 5

Making the case for DSLX over Verilog

Some arguments in favor of LLMs targeting DSLX over the underlying Verilog:

XLS as platform: The DSL (DSLX) is a fairly lightweight layer on top of XLS IR. XLS IR provides a platform for analysis and transformation that is fully open source, with fully defined semantics and equal representational capabilities. XLS lives "underneath" and completely understands the hardware computation descriptions, and is capable of simulating them at native speed.

We can write and slot in new analysis tools easily, and our understanding of the platform is complete. This is a major challenge for RTL toolchains and for Verilog/SystemVerilog semantics in general: the toolchains used in practice are often proprietary, and SystemVerilog support in fully open toolchains is partial at best.
Transfer learning from Rust and software: DSLX mimics Rust and so presumably gets a good amount of transferred understanding from LLM knowledge of function/program construction in the software domain.
Function-oriented for easy retargeting/composition: DSLX functions are written as:
- largely pure functions, with limited side effects and immutable values
- with transparent dataflow semantics
- and no undefined behavior,
and so can be verified easily and then retimed as pipelines (via the XLS scheduler) or lifted into a recurrence in time (via procs, for stateful evolution, i.e. generating a state machine), similar to how we reason about loops in Turing-complete languages.
Not deep-inductive, tricky-state-space oriented: Knowledge of how to write functions as effective building blocks from software programming languages and associated program synthesis presumably side-steps some of the difficulty in learning Verilog semantics and challenges of structural and temporal composition.
- Verilog semantically offers a flat program that is inherently operating in time, operationally mutating deep state spaces, with undefined behavior. Type promotion and matching in expressions is difficult even for expert humans to understand and use correctly, in addition to a slew of "best practices" required to avoid traps around 4-value semantics.
- Verilog tools often have behavior that is implementation-defined but undefined by the specification and inconsistent across implementations, i.e. behavior that the specification does not fully define but that happens to have some semantics in a given tool. Such behavior grows more common the less fully specified a language is.
  
  Overfitting for the observed behavior of a single tool and believing it is specified due to empirical observations is a natural trap. "Transfer learning" from the observed semantics of one tool to expectations of what another tool will do is tempting, but will likely lead to incorrect results due to well-defined vs implementation-defined semantics confusion.
Straightforwardly composable primitives (i.e. libraries work): the DSL has a standard library of functions that can be composed without fear of correctness errors due to the latency-insensitive nature of the design descriptions. As more standard library functions are built/offered, and more "batteries included" modules are created, LLMs will be able to leverage straightforward notions of composition from a more powerful basis. Composition is more challenging in a timed and transition-based programming model.
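
To make the function-oriented argument above concrete, here is a small Python sketch (not DSLX, and not taken from this repository). It shows one pure function reused in two ways: applied independently to each input, as a pipeline would, and folded over time as next-state logic, as a proc would. The function names are made up for illustration.

```python
from itertools import accumulate


def sat_add_u8(a: int, b: int) -> int:
    """Pure function: 8-bit unsigned saturating add, with no hidden state."""
    return min(a + b, 0xFF)


def as_stateless_stream(pairs):
    """Pipeline-style use: each output depends only on its own input pair."""
    return [sat_add_u8(a, b) for a, b in pairs]


def as_recurrence(inputs, initial_state=0):
    """Proc-style use: the same function becomes the next-state logic,
    state[t+1] = sat_add_u8(state[t], input[t])."""
    # accumulate yields the initial state first; drop it to get one state per input.
    return list(accumulate(inputs, sat_add_u8, initial=initial_state))[1:]


if __name__ == "__main__":
    print(as_stateless_stream([(1, 2), (200, 100)]))  # [3, 255]
    print(as_recurrence([100, 100, 100]))             # [100, 200, 255]
```

Because `sat_add_u8` is pure, it can be tested once in isolation and then trusted in both settings.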

In summary, targeting a slightly higher-level and better-defined set of constructs that are semantically closer to software should aid in the construction of correct, robust hardware computation.

By analogy to high-level languages in software: we don't try to get LLMs to emit correct assembly from natural-language descriptions, because LLMs get an uplift from higher-level semantics much as humans get a productivity, reasoning, and correctness uplift from the higher-level languages we use.

Developer Tips

To test the samples in the prompt Markdown:

DSLX_STDLIB_PATH=$HOME/opt/xlsynth/latest/xls/dslx/stdlib/ pytest test_prompt.py

Ideas not yet added

Various hashers and PRNGs, e.g. xoshiro256** and similar.
More arbiters: LRU, round robin, hierarchical round robin (via composition).
Iterative shift-add multiplier.
Dot product / matmul.

Ideas that are too simple

Parity: this is simply a call to the std::popcount function https://google.github.io/xls/dslx_std/#stdpopcount
Bit Reversal: this is simply a call to the rev built-in function https://google.github.io/xls/dslx_std/#rev
One-Hot Encoder: this is simply a call to the encode built-in function https://google.github.io/xls/dslx_std/#encode
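
For reference, the three operations above are small enough to model in a few lines of Python. These are illustrative models with hypothetical names, not the DSLX library code: parity is the low bit of the population count, and the encoder model only covers inputs with exactly one bit set.

```python
def popcount(x: int) -> int:
    """Number of set bits in a non-negative integer."""
    return bin(x).count("1")


def parity(x: int) -> int:
    """1 if an odd number of bits are set, else 0 (low bit of popcount)."""
    return popcount(x) & 1


def bit_reverse(x: int, width: int) -> int:
    """Reverse the low `width` bits of x."""
    result = 0
    for i in range(width):
        # Shift bits out of x from the LSB end and into result from the LSB end.
        result = (result << 1) | ((x >> i) & 1)
    return result


def one_hot_to_index(x: int) -> int:
    """Index of the single set bit. This model is defined only for
    inputs with exactly one bit set; other inputs are out of scope here."""
    if popcount(x) != 1:
        raise ValueError("input is not one-hot")
    return x.bit_length() - 1


if __name__ == "__main__":
    print(parity(0b1011))                # 1
    print(bin(bit_reverse(0b1101, 4)))   # 0b1011
    print(one_hot_to_index(0b0100))      # 2
```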

`fp_sqrt` sample

The floating-point square root sample requires additional tests, which can be generated with the gen_float_tests.py script based on testfloat_gen output.

python gen_float_tests.py --output-file ./tests/fp_sqrt.x --only-numbers --tested-func test_fp16_sqrt --function sqrt

It generates a minimal number of test cases for the square root of a 16-bit floating-point number; each test calls test_fp16_sqrt, which is defined in fp_sqrt.md. The --only-numbers flag skips all tests that include NaN. With the tests generated, the sample can be run as follows:

python eval.py --model $MODEL --sample fp_sqrt --max-retries 5 --save-to generated.x --test-file ./tests/fp_sqrt.x --reduce-test-errors 6

The produced code may not cover all edge cases, but it provides the base logic of the operation.
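
The NaN filtering that --only-numbers is described as doing can be pictured with the sketch below. How gen_float_tests.py actually implements it is not shown here, so treat this as an assumption: the helper names are hypothetical, and test vectors are assumed to be tuples of raw 16-bit patterns. An IEEE 754 binary16 value is NaN when its 5 exponent bits are all ones and its 10 fraction bits are not all zero.

```python
import math
import struct


def is_fp16_nan(bits: int) -> bool:
    """True if the 16-bit pattern encodes a NaN in IEEE 754 binary16."""
    exponent = (bits >> 10) & 0x1F
    fraction = bits & 0x3FF
    return exponent == 0x1F and fraction != 0


def decode_fp16(bits: int) -> float:
    """Decode a 16-bit pattern to a Python float (used only for cross-checking)."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]


def keep_only_numbers(vectors):
    """Drop every test vector in which any operand or result is NaN."""
    return [v for v in vectors if not any(is_fp16_nan(b) for b in v)]


if __name__ == "__main__":
    # Hypothetical (input, expected) pairs for sqrt, as raw fp16 bit patterns.
    vectors = [
        (0x4400, 0x4000),  # sqrt(4.0) = 2.0
        (0xBC00, 0xFE00),  # sqrt(-1.0) is NaN, so this vector is dropped
        (0x7C00, 0x7C00),  # sqrt(+inf) = +inf is kept: infinity is not NaN
    ]
    print([tuple(hex(b) for b in v) for v in keep_only_numbers(vectors)])
```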
