Logging and Sampling `outlines.processors` by lapp0 · Pull Request #35 · lapp0/outlines

lapp0 · 2024-06-25T02:16:06Z

No description provided.

rlouf · 2024-07-20T16:22:08Z

The idea is interesting. Do we really want to increase the surface area of the library? There's a risk of spreading ourselves too thinly by adding extra code to maintain.

just-cameron

A few comments here and there. Was able to get this to work with a modified example:

import outlines
import outlines.processors as processors
model = outlines.models.transformers(
    "openaccess-ai-collective/tiny-mistral",
)
# Create a chained logits processor
logits_processor = (
    processors.sequence_logging(model.tokenizer) |  # Log the generated sequence
	processors.logits_logging(model.tokenizer) |  # Log the raw logits
	processors.regex(r"[0-9]*", model.tokenizer) |  # Restrict the logits to match the pattern
	processors.temperature(0.5) |  # Set temperature to 0.5
	processors.logits_logging(model.tokenizer)  # Log the restricted, temperature-augmentent, sampled logits
)
generator = outlines.generate.base(model, logits_processor)
generator("What is your favorite number? ")

We'll also need to export the base generator method, i.e. __init__.py should include this line:

from .api import SequenceGenerator
from .base import base
from .cfg import cfg
from .choice import choice
from .format import format
from .fsm import fsm
from .json import json
from .regex import regex
from .text import text

just-cameron · 2024-09-09T23:34:38Z

+	processors.logits_logging(model.tokenizer)  # Log the restricted, temperature-augmentent, sampled logits
+)
+
+generator = outlines.generate.base(model, logits_process)


Suggested change

generator = outlines.generate.base(model, logits_process)

generator = outlines.generate.text(model, logits_processor)

should be logits_processor

Is base defined here? I haven't been able to find it (yet)

https://github.com/lapp0/outlines/pull/35/files#diff-6ac112829a30b104639758077458b24d55ab12343a6fa1d54caf548f59075f3b

No it's not documented (yet)

just-cameron · 2024-09-09T23:49:40Z

+model = outlines.models.llamacpp(
+    repo_id="M4-ai/TinyMistral-248M-v2-Instruct-GGUF",
+    filename="TinyMistral-248M-v2-Instruct.Q4_K_M.gguf"
+)


This doesn't seem to work with llamacpp, but it does work with transformers:

import outlines import outlines.processors as processors model = outlines.models.transformers( "openaccess-ai-collective/tiny-mistral", ) # Create a chained logits processor logits_processor = ( processors.sequence_logging(model.tokenizer) | # Log the generated sequence processors.logits_logging(model.tokenizer) | # Log the raw logits processors.regex(r"[0-9]*", model.tokenizer) | # Restrict the logits to match the pattern processors.temperature(0.5) | # Set temperature to 0.5 processors.logits_logging(model.tokenizer) # Log the restricted, temperature-augmentent, sampled logits ) generator = outlines.generate.base(model, logits_processor) generator("What is your favorite number? ")

The error was

Traceback (most recent call last): File "/home/cameron/dottxt/outlines/demo-logging.py", line 16, in <module> generator("What is your favorite number? ") File "/home/cameron/dottxt/outlines/outlines/generate/api.py", line 503, in __call__ completions = self.model.generate( File "/home/cameron/dottxt/outlines/outlines/models/llamacpp.py", line 288, in generate completion = self.model(prompts, **llama_cpp_params) File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/llama_cpp/llama.py", line 1799, in __call__ return self.create_completion( File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/llama_cpp/llama.py", line 1732, in create_completion completion: Completion = next(completion_or_chunks) # type: ignore File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/llama_cpp/llama.py", line 1216, in _create_completion for token in self.generate( File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/llama_cpp/llama.py", line 810, in generate token = self.sample( File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/llama_cpp/llama.py", line 704, in sample else logits_processor(self._input_ids[: idx + 1], logits) File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/llama_cpp/llama.py", line 2250, in __call__ scores = processor(input_ids, scores) File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context return func(*args, **kwargs) File "/home/cameron/dottxt/outlines/outlines/processors/base_logits_processor.py", line 82, in __call__ processed_logits = self.process_logits( File "/home/cameron/dottxt/outlines/outlines/processors/base_logits_processor.py", line 168, in process_logits result = processor.process_logits(input_ids, result) File "/home/cameron/dottxt/outlines/outlines/processors/logging.py", line 63, in process_logits self.logger.info(self.tokenizer.decode(input_ids)) File "/home/cameron/dottxt/outlines/outlines/models/llamacpp.py", line 56, in decode decoded_bytes = self.tokenizer.detokenize(token_ids) File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/llama_cpp/llama_tokenizer.py", line 52, in detokenize return self._model.detokenize(tokens) File "/home/cameron/dottxt/outlines/.venv/lib/python3.10/site-packages/llama_cpp/_internals.py", line 224, in detokenize self.model, llama_cpp.llama_token(token), buffer, size, 0, special TypeError: 'list' object cannot be interpreted as an integer

just-cameron · 2024-09-09T23:50:41Z

+            self.logger = logger
+        else:
+            self.logger = logging.getLogger("logits_logger")
+            self.logger.setLevel(logging.info)


I believe this should be

Suggested change

self.logger.setLevel(logging.info)

self.logger.setLevel(logging.INFO)

just-cameron · 2024-09-09T23:50:53Z

+            self.logger = logger
+        else:
+            self.logger = logging.getLogger("sequence_logger")
+            self.logger.setLevel(logging.info)


Suggested change

self.logger.setLevel(logging.info)

self.logger.setLevel(logging.INFO)

just-cameron · 2024-09-09T23:54:54Z

Oh, also, the transformers output I think is the token indices, maybe

['What is your favorite number? ']
[{4130: tensor(0.0001), 2: tensor(1.8356e-05), 22315: tensor(0.0002), 26286: tensor(0.0002), 21646: tensor(0.0002), 22835: tensor(0.0002), 8473: tensor(0.0002), 16349: tensor(0.0001), 20126: tensor(0.0002)}]
[{4130: tensor(0.0001), 2: tensor(1.8356e-05), 22315: tensor(0.0002), 26286: tensor(0.0002), 21646: tensor(0.0002), 22835: tensor(0.0002), 8473: tensor(0.0002), 16349: tensor(0.0001), 20126: tensor(0.0002)}]
[{28770: tensor(0.1335), 2: tensor(0.0203), 51: tensor(0.0793), 28787: tensor(0.0984), 54: tensor(0.0466), 57: tensor(0.0612), 58: tensor(0.1677), 59: tensor(0.0338), 28734: tensor(0.1308)}]
[{28770: tensor(0.1335), 2: tensor(0.0203), 51: tensor(0.0793), 28787: tensor(0.0984), 54: tensor(0.0466), 57: tensor(0.0612), 58: tensor(0.1677), 59: tensor(0.0338), 28734: tensor(0.1308)}]
['What is your favorite number? 2']
[{2: tensor(2.8889e-05), 11651: tensor(0.0001), 27974: tensor(0.0002), 4650: tensor(0.0002), 13322: tensor(0.0001), 16362: tensor(0.0001), 12301: tensor(0.0001), 18383: tensor(0.0001), 16753: tensor(0.0001)}]
[{2: tensor(2.8889e-05), 11651: tensor(0.0001), 27974: tensor(0.0002), 4650: tensor(0.0002), 13322: tensor(0.0001), 16362: tensor(0.0001), 12301: tensor(0.0001), 18383: tensor(0.0001), 16753: tensor(0.0001)}]
[{2: tensor(0.0412), 28740: tensor(0.0566), 28750: tensor(0.1070), 28787: tensor(0.1236), 51: tensor(0.0709), 52: tensor(0.0612), 53: tensor(0.0479), 59: tensor(0.1091), 28734: tensor(0.1091)}]
[{2: tensor(0.0412), 28740: tensor(0.0566), 28750: tensor(0.1070), 28787: tensor(0.1236), 51: tensor(0.0709), 52: tensor(0.0612), 53: tensor(0.0479), 59: tensor(0.1091), 28734: tensor(0.1091)}]
['What is your favorite number? 26']
[{2: tensor(2.2782e-05), 24968: tensor(0.0001), 7438: tensor(0.0001), 29904: tensor(0.0002), 11856: tensor(0.0001), 2484: tensor(0.0001), 12731: tensor(0.0001), 1629: tensor(0.0001), 19135: tensor(0.0001)}]
[{2: tensor(2.2782e-05), 24968: tensor(0.0001), 7438: tensor(0.0001), 29904: tensor(0.0002), 11856: tensor(0.0001), 2484: tensor(0.0001), 12731: tensor(0.0001), 1629: tensor(0.0001), 19135: tensor(0.0001)}]
[{28770: tensor(0.0952), 2: tensor(0.0239), 28740: tensor(0.0407), 28784: tensor(0.0665), 28787: tensor(0.1372), 53: tensor(0.0650), 58: tensor(0.1083), 60: tensor(0.0717), 28734: tensor(0.1166)}]
[{28770: tensor(0.0952), 2: tensor(0.0239), 28740: tensor(0.0407), 28784: tensor(0.0665), 28787: tensor(0.1372), 53: tensor(0.0650), 58: tensor(0.1083), 60: tensor(0.0717), 28734: tensor(0.1166)}]
['What is your favorite number? 260']
[{2: tensor(1.9624e-05), 14182: tensor(0.0002), 1514: tensor(0.0001), 3024: tensor(0.0001), 2867: tensor(0.0001), 10966: tensor(0.0002), 6423: tensor(0.0001), 12028: tensor(0.0002), 12799: tensor(0.0001)}]
[{2: tensor(1.9624e-05), 14182: tensor(0.0002), 1514: tensor(0.0001), 3024: tensor(0.0001), 2867: tensor(0.0001), 10966: tensor(0.0002), 6423: tensor(0.0001), 12028: tensor(0.0002), 12799: tensor(0.0001)}]
[{28770: tensor(0.0558), 2: tensor(0.0160), 28782: tensor(0.1015), 28784: tensor(0.2751), 28787: tensor(0.0615), 52: tensor(0.0761), 57: tensor(0.0305), 60: tensor(0.0684), 28734: tensor(0.1016)}]
[{28770: tensor(0.0558), 2: tensor(0.0160), 28782: tensor(0.1015), 28784: tensor(0.2751), 28787: tensor(0.0615), 52: tensor(0.0761), 57: tensor(0.0305), 60: tensor(0.0684), 28734: tensor(0.1016)}]
['What is your favorite number? 2607']
[{2: tensor(1.3555e-05), 9032: tensor(0.0001), 12556: tensor(0.0001), 8974: tensor(0.0001), 8911: tensor(0.0001), 918: tensor(0.0001), 11287: tensor(0.0002), 26521: tensor(0.0002), 25274: tensor(0.0002)}]
[{2: tensor(1.3555e-05), 9032: tensor(0.0001), 12556: tensor(0.0001), 8974: tensor(0.0001), 8911: tensor(0.0001), 918: tensor(0.0001), 11287: tensor(0.0002), 26521: tensor(0.0002), 25274: tensor(0.0002)}]
[{28770: tensor(0.0809), 2: tensor(0.0092), 28740: tensor(0.0617), 28781: tensor(0.0459), 28782: tensor(0.0592), 28787: tensor(0.1464), 52: tensor(0.0830), 57: tensor(0.1384), 58: tensor(0.0557)}]
[{28770: tensor(0.0809), 2: tensor(0.0092), 28740: tensor(0.0617), 28781: tensor(0.0459), 28782: tensor(0.0592), 28787: tensor(0.1464), 52: tensor(0.0830), 57: tensor(0.1384), 58: tensor(0.0557)}]
['What is your favorite number? 26076']

lapp0 · 2024-09-10T00:47:18Z

Thanks for the review @cpfiffer ! It's a good refresher since I haven't looked at this in a while.

I'll see if I have some free time this weekend and can get this ready for review in outlines-dev

And indeed, it is token indices. Decoded tokens would be better.

lapp0 changed the base branch from main to logits-processor-integrations-fix June 25, 2024 02:17

lapp0 force-pushed the logging-processor branch from 6d45e36 to 4d0b9bd Compare June 25, 2024 12:24

lapp0 force-pushed the logits-processor-integrations-fix branch 2 times, most recently from c07de55 to c3e8673 Compare June 29, 2024 13:14

lapp0 mentioned this pull request Jul 20, 2024

Suite of outlines.processors for Sampling Techniques and Debug Logging dottxt-ai/outlines#1055

Open

lapp0 changed the title ~~Logging processor~~ Logging and Sampling outlines.processors Jul 20, 2024

lapp0 force-pushed the logging-processor branch from 4d0b9bd to d49fd23 Compare September 9, 2024 21:11

lapp0 changed the base branch from logits-processor-integrations-fix to main September 9, 2024 21:13

lapp0 force-pushed the logging-processor branch from d49fd23 to 31a485a Compare September 9, 2024 21:16

WIP logging logits processor

6e69272

lapp0 force-pushed the logging-processor branch from 31a485a to 6e69272 Compare September 9, 2024 21:29

just-cameron reviewed Sep 9, 2024

View reviewed changes

just-cameron mentioned this pull request Sep 9, 2024

Log Logits dottxt-ai/outlines#616

Closed

3 tasks

just-cameron mentioned this pull request Sep 25, 2024

Allow Debug Logging of Logits dottxt-ai/outlines#614

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Logging and Sampling `outlines.processors`#35

Logging and Sampling `outlines.processors`#35
lapp0 wants to merge 1 commit into
mainfrom
logging-processor

lapp0 commented Jun 25, 2024

Uh oh!

rlouf commented Jul 20, 2024

Uh oh!

just-cameron left a comment •

edited

Loading

Uh oh!

just-cameron Sep 9, 2024

Uh oh!

lapp0 Sep 10, 2024

Uh oh!

just-cameron Sep 9, 2024

Uh oh!

just-cameron Sep 9, 2024

Uh oh!

just-cameron Sep 9, 2024

Uh oh!

just-cameron commented Sep 9, 2024

Uh oh!

lapp0 commented Sep 10, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	generator = outlines.generate.base(model, logits_process)
	generator = outlines.generate.text(model, logits_processor)

	self.logger.setLevel(logging.info)
	self.logger.setLevel(logging.INFO)

Conversation

lapp0 commented Jun 25, 2024

Uh oh!

rlouf commented Jul 20, 2024

Uh oh!

just-cameron left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

just-cameron Sep 9, 2024

Choose a reason for hiding this comment

Uh oh!

lapp0 Sep 10, 2024

Choose a reason for hiding this comment

Uh oh!

just-cameron Sep 9, 2024

Choose a reason for hiding this comment

Uh oh!

just-cameron Sep 9, 2024

Choose a reason for hiding this comment

Uh oh!

just-cameron Sep 9, 2024

Choose a reason for hiding this comment

Uh oh!

just-cameron commented Sep 9, 2024

Uh oh!

lapp0 commented Sep 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

just-cameron left a comment •

edited

Loading

lapp0 commented Sep 10, 2024 •

edited

Loading