enhance #10 : avoid allocations on `parseUUID` by ashwingopalsamy · Pull Request #12 · ash3in/uuidv8

ashwingopalsamy · 2024-12-24T02:49:50Z

Changes

With this PR, we rewrite parseUUID to process the input directly, avoiding unnecessary string allocations when handling UUIDs that contain dashes. The function now copies characters into a pre-allocated byte slice and skips dashes inline, with a few additional validation checks.

This should address the issue raised in #10. Let me know your thoughts! @vtolstov
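
For readers without the diff open, here is a minimal sketch of the approach described above. It is not the merged code: the name `parseUUIDSketch` is hypothetical, and it skips the dashes by position into a fixed-size array, whereas the diff in this PR skips any `-` character while copying into a `make([]byte, 32)` slice.

```go
package main

import (
	"encoding/hex"
	"errors"
	"fmt"
)

// parseUUIDSketch is a hypothetical stand-in for parseUUID, not the merged code.
// It accepts 32 hex characters, or 36 characters with dashes at 8, 13, 18 and 23.
func parseUUIDSketch(uuid string) ([]byte, error) {
	switch len(uuid) {
	case 32:
		// No dashes to strip: decode directly.
		return hex.DecodeString(uuid)
	case 36:
		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
			return nil, errors.New("invalid UUID format")
		}
	default:
		return nil, errors.New("invalid UUID length")
	}

	// Copy the 32 remaining characters into a fixed-size buffer, skipping the
	// four dashes by position. A stray dash anywhere else is copied as-is and
	// then rejected by hex.Decode as an invalid byte.
	var compact [32]byte
	j := 0
	for i := 0; i < len(uuid); i++ {
		if i == 8 || i == 13 || i == 18 || i == 23 {
			continue
		}
		compact[j] = uuid[i]
		j++
	}

	out := make([]byte, 16)
	if _, err := hex.Decode(out, compact[:]); err != nil {
		return nil, err
	}
	return out, nil
}

func main() {
	b, err := parseUUIDSketch("123e4567-e89b-12d3-a456-426614174000")
	fmt.Printf("%x %v\n", b, err)
}
```

Skipping by position means exactly 32 characters are always copied for a 36-character input, so no bounds check on `j` is needed in this variant.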

codecov · 2024-12-24T02:50:43Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Flag	Coverage Δ
unittests	`90.67% <100.00%> (+0.72%)`	⬆️

Flags with carried forward coverage won't be shown.

Files with missing lines	Coverage Δ
helper.go	`100.00% <100.00%> (ø)`

ccoVeille · 2024-12-24T16:16:56Z

+	if len(uuid) == 32 {
+		// Fast path for UUIDs without dashes
+		return hex.DecodeString(uuid)
+	} else if len(uuid) == 36 {
+		// Validate dash positions
+		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
+			return nil, errors.New("invalid UUID format")
+		}
+	} else {
 		return nil, errors.New("invalid UUID length")
 	}


Suggested change

-	if len(uuid) == 32 {
-		// Fast path for UUIDs without dashes
-		return hex.DecodeString(uuid)
-	} else if len(uuid) == 36 {
-		// Validate dash positions
-		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
-			return nil, errors.New("invalid UUID format")
-		}
-	} else {
-		return nil, errors.New("invalid UUID length")
-	}
+	switch len(uuid) {
+	case 32:
+		// Fast path for UUIDs without dashes
+		return hex.DecodeString(uuid)
+	case 36:
+		// Validate dash positions
+		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
+			return nil, errors.New("invalid UUID format")
+		}
+	default:
+		return nil, errors.New("invalid UUID length")
+	}

Also, if you consider this

#12 (comment)

Suggested change

-	if len(uuid) == 32 {
-		// Fast path for UUIDs without dashes
-		return hex.DecodeString(uuid)
-	} else if len(uuid) == 36 {
-		// Validate dash positions
-		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
-			return nil, errors.New("invalid UUID format")
-		}
-	} else {
-		return nil, errors.New("invalid UUID length")
-	}
+	switch len(uuid) {
+	case 32:
+		// Fast path for UUIDs without dashes
+		return hex.DecodeString(uuid)
+	case 36:
+		// Validate dash positions
+		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
+			return nil, errors.New("invalid UUID format")
+		}
+		return hex.DecodeString(strings.ReplaceAll(uuid, "-", ""))
+	default:
+		return nil, errors.New("invalid UUID length")
+	}

Or this

Suggested change

-	if len(uuid) == 32 {
-		// Fast path for UUIDs without dashes
-		return hex.DecodeString(uuid)
-	} else if len(uuid) == 36 {
-		// Validate dash positions
-		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
-			return nil, errors.New("invalid UUID format")
-		}
-	} else {
-		return nil, errors.New("invalid UUID length")
-	}
+	if len(uuid) == 36 {
+		// Validate dash positions
+		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
+			return nil, errors.New("invalid UUID format")
+		}
+		uuid = strings.ReplaceAll(uuid, "-", "")
+	}
+	if len(uuid) != 32 {
+		return nil, errors.New("invalid UUID length")
+	}
+	// Decode the 32 remaining hex characters
+	return hex.DecodeString(uuid)

ccoVeille · 2024-12-24T16:22:57Z

+	// Remove dashes while copying characters
+	result := make([]byte, 32)
+	j := 0
+	for i := 0; i < len(uuid); i++ {
+		if uuid[i] == '-' {
+			continue
+		}
+		result[j] = uuid[i]
+		j++
+	}


Why not a simple strings.ReplaceAll?

Good point! The primary reason was to address #10. When I benchmarked both, the current implementation was faster and avoided the extra memory allocations.

Here are the benchmark results comparing the two:

Implementation	Time per Operation (ns/op)	Speed
`strings.ReplaceAll()`	146.2 ns/op	Baseline
Current Approach	68.17 ns/op	~2.14x faster

How?

This approach processes the UUID in one pass, copying characters directly into a pre-allocated byte slice. That avoids creating the intermediate string that strings.ReplaceAll returns, which keeps things lean.
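
A comparison like the one in the table can be reproduced with a small standalone program. The sketch below is an assumption about how such a measurement could be set up, not the benchmark that produced the numbers above: `decodeWithReplaceAll`, `decodeWithCopy`, and `allocsPerCall` are hypothetical names, and results will vary by machine and Go version.

```go
package main

import (
	"encoding/hex"
	"fmt"
	"strings"
	"testing"
)

const sample = "123e4567-e89b-12d3-a456-426614174000"

// sink keeps results alive so the compiler cannot discard the calls.
var sink []byte

// decodeWithReplaceAll strips dashes by building a new string first.
func decodeWithReplaceAll(uuid string) ([]byte, error) {
	return hex.DecodeString(strings.ReplaceAll(uuid, "-", ""))
}

// decodeWithCopy strips dashes in one pass into a fixed-size buffer.
// It assumes the caller already validated the dash positions.
func decodeWithCopy(uuid string) ([]byte, error) {
	var compact [32]byte
	j := 0
	for i := 0; i < len(uuid); i++ {
		if uuid[i] == '-' {
			continue
		}
		if j == len(compact) {
			return nil, fmt.Errorf("too many characters")
		}
		compact[j] = uuid[i]
		j++
	}
	if j != len(compact) {
		return nil, fmt.Errorf("too few characters")
	}
	out := make([]byte, 16)
	_, err := hex.Decode(out, compact[:])
	return out, err
}

// allocsPerCall reports the average number of heap allocations per call.
func allocsPerCall(f func(string) ([]byte, error)) float64 {
	return testing.AllocsPerRun(1000, func() {
		sink, _ = f(sample)
	})
}

func main() {
	candidates := []struct {
		name string
		f    func(string) ([]byte, error)
	}{
		{"strings.ReplaceAll", decodeWithReplaceAll},
		{"single-pass copy", decodeWithCopy},
	}
	for _, c := range candidates {
		r := testing.Benchmark(func(b *testing.B) {
			for i := 0; i < b.N; i++ {
				sink, _ = c.f(sample)
			}
		})
		fmt.Printf("%-20s %d ns/op, %.0f allocs/op\n", c.name, r.NsPerOp(), allocsPerCall(c.f))
	}
}
```

Inside the repository, the same two functions could instead be wrapped in ordinary `BenchmarkXxx` functions and run with `go test -bench . -benchmem`.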

ccoVeille · 2024-12-24T17:20:15Z

+		"123",                                   // Too short
+		"123e4567e89b12d3a4564266141740000000",  // Too long
+		"123e4567e89b12d3a45642661417400g",      // Invalid character
+		"123e-4567-e89b-12d3-a456-426614174000", // Misplaced dashes


Please add:

a valid UUID with the dashes in the right places

an invalid UUID with the dashes in the right places plus one randomly placed dash, or simply 36 dashes

Sure thing! Added in the latest commit.
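
The requested cases can be written as a small table. The sketch below is illustrative only and does not reproduce the tests added in the commit: `parseUUIDSketch`, `uuidCase`, `reviewCases`, and `failures` are hypothetical names, and the parser is a stand-in that skips the four dashes by position.

```go
package main

import (
	"encoding/hex"
	"errors"
	"fmt"
	"strings"
)

// parseUUIDSketch is a hypothetical stand-in for parseUUID, not the merged code.
func parseUUIDSketch(uuid string) ([]byte, error) {
	switch len(uuid) {
	case 32:
		return hex.DecodeString(uuid)
	case 36:
		if uuid[8] != '-' || uuid[13] != '-' || uuid[18] != '-' || uuid[23] != '-' {
			return nil, errors.New("invalid UUID format")
		}
	default:
		return nil, errors.New("invalid UUID length")
	}
	var compact [32]byte
	j := 0
	for i := 0; i < len(uuid); i++ {
		if i == 8 || i == 13 || i == 18 || i == 23 {
			continue // expected dash position
		}
		compact[j] = uuid[i]
		j++
	}
	out := make([]byte, 16)
	if _, err := hex.Decode(out, compact[:]); err != nil {
		return nil, err // stray dashes and non-hex characters end up here
	}
	return out, nil
}

type uuidCase struct {
	name  string
	input string
	valid bool
}

// reviewCases lists the inputs requested in the review comment above.
func reviewCases() []uuidCase {
	return []uuidCase{
		{"dashes in the right places", "123e4567-e89b-12d3-a456-426614174000", true},
		{"right dashes plus a stray one", "123e4567-e89b-12d3-a456-4266-4174000", false},
		{"36 dashes", strings.Repeat("-", 36), false},
	}
}

// failures describes every case whose outcome differs from the expectation.
func failures(parse func(string) ([]byte, error)) []string {
	var out []string
	for _, c := range reviewCases() {
		_, err := parse(c.input)
		if (err == nil) != c.valid {
			out = append(out, fmt.Sprintf("%s: input %q, err = %v", c.name, c.input, err))
		}
	}
	return out
}

func main() {
	fmt.Println(failures(parseUUIDSketch))
}
```

The stray-dash and all-dash inputs are both 36 characters long with dashes at positions 8, 13, 18, and 23, so they pass the length and position checks and are only caught when the remaining characters are decoded as hex.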

vtolstov · 2024-12-26T06:31:09Z

Will some of these changes also go into the google/uuid repo PR?

ashwingopalsamy · 2024-12-26T06:34:59Z

Let me see what I can do there. I had the flexibility to write this repo in my preferred style and structure.

With google/uuid, I had to follow what was already in place.

ashwingopalsamy · 2024-12-26T06:43:14Z

Sadly, people aren't reviewing the proposal there. It'd be great if the PR gets some attention.

Update helper.go

446522c

ashwingopalsamy added the enhancement (Improvements to existing feature or configuration) label Dec 24, 2024

ashwingopalsamy self-assigned this Dec 24, 2024

ashwingopalsamy mentioned this pull request Dec 24, 2024

avoid allocations on parseUUID #10

Closed

ashwingopalsamy changed the title ~~enhance #10 :~~ enhance #10 : avoid allocations on parseUUID Dec 24, 2024

tests: added unit tests for updated parseUUID

4023410

ccoVeille reviewed Dec 24, 2024


ashwingopalsamy added 3 commits December 26, 2024 11:33

fix: merge with main

0f275bc

tests: add more testcases for invalid uuid inputs

7b52e05

enhance: parseUUID() logic handling

4c42601

ashwingopalsamy merged commit 4ccd19c into main Dec 26, 2024

ashwingopalsamy merged 5 commits into main from fix/issue-10/avoid-allocations-on-parseUUID-helper


3 participants
