fix(dsl): prevent lexer panic on string ending in trailing backslash by ManthanNimodiya · Pull Request #3007 · Permify/permify

ManthanNimodiya · 2026-06-19T01:12:29Z

Bug

The schema DSL lexer panics with a slice-out-of-range error when lexing an unterminated string that ends in a trailing backslash.
Minimal repro:

NewParser(""\").Parse()
panic: runtime error: slice bounds out of range [:3] with length 2
pkg/dsl/lexer/lexer.go:231

Reached from the SchemaWrite API via Parse(); the gRPC recovery interceptor catches it, so it errors the request instead of crashing the server, but it's still an uncontrolled panic on untrusted input.

Root cause

In lexString, when a \ is the last character of the input, l.readChar() (skipping the backslash) sets l.ch = 0 but the code still executes position = l.position + 1, advancing position one past len(input).
The loop then exits on l.ch == 0, and the final str += l.input[position:l.position] slices with a start index greater than the string length, panicking.

Fix

Detect l.ch == 0 immediately after skipping the backslash and break early with position = l.position, treating it as an unterminated string instead of continuing to advance past the input bounds.

Testing

Added a regression test in lexer_test.go asserting the lexer doesn't panic on ""\", and verified the exact reporter NewParser(""\").Parse()) now returns cleanly. Full pkg/dsl/... suite still passes.

Closes #3004

Summary by CodeRabbit

Bug Fixes
- Fixed a crash in the DSL parser when processing string literals with unterminated escape sequences.
Tests
- Added test coverage to verify the parser handles malformed strings gracefully without panicking.

coderabbitai · 2026-06-19T01:12:47Z

📝 Walkthrough

Walkthrough

The fix adds an end-of-input guard inside lexString in the DSL lexer: when a backslash is encountered and l.ch is 0 (EOF), the lexer updates its position and exits the escape loop instead of attempting to parse a non-existent escape character. A Ginkgo regression test confirms no panic occurs on the two-byte input "\.

Changes

Lexer panic fix for trailing backslash

Layer / File(s)	Summary
`lexString` EOF guard and regression test `pkg/dsl/lexer/lexer.go`, `pkg/dsl/lexer/lexer_test.go`	`lexString` checks `l.ch == 0` after a `\` and breaks out of the escape loop instead of slicing past end-of-input; a Ginkgo test verifies the fix by iterating tokens on `"\` until EOF and asserting no panic.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~5 minutes

Poem

A backslash sat at the end of the line,
No char came after — a panic by design!
But the rabbit hopped in with a nil-check so neat,
Now the lexer just shrugs and accepts the defeat.
No more slice-out-of-range, no more fright — 🐇
Unterminated strings handled just right!

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'fix(dsl): prevent lexer panic on string ending in trailing backslash' clearly and specifically describes the main change: preventing a panic in the DSL lexer for strings with trailing backslashes.
Linked Issues check	✅ Passed	The changes fully address issue `#3004`: the fix detects unterminated strings ending with a backslash and breaks early to prevent out-of-bounds slicing, while the regression test verifies the lexer no longer panics on the problematic input.
Out of Scope Changes check	✅ Passed	All changes are directly scoped to fixing the reported bug: modifications to lexString function logic and addition of a regression test, with no extraneous changes detected.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

ManthanNimodiya · 2026-06-19T01:13:24Z

@omer-topal @ucatbas please have a look and lmk for any required changes

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

pkg/dsl/lexer/lexer_test.go (1)

1193-1207: 💤 Low value

Consider asserting on token properties for more comprehensive validation.

The test correctly verifies that no panic occurs, which addresses the primary objective. However, it could be enhanced by also asserting:

The token type (expected: token.STRING)
The literal value (expected: empty string "")
The number of tokens before EOF (expected: 1)

These additions would document the expected behavior more explicitly and catch any regression in how unterminated strings are tokenized.

📋 Example enhancement

 	It("Case 11 - Unterminated string ending in a trailing backslash should not panic", func() {
 		// Regression test: a string literal that ends right after a backslash
 		// (no character left to escape) used to overshoot the input bounds
 		// and panic with a slice-out-of-range error.
 		l := NewLexer("\"\\")
 
-		Expect(func() {
-			for {
-				tok := l.NextToken()
-				if tok.Type == token.EOF {
-					break
-				}
-			}
-		}).ShouldNot(Panic())
+		var tokens []token.Token
+		Expect(func() {
+			for {
+				tok := l.NextToken()
+				tokens = append(tokens, tok)
+				if tok.Type == token.EOF {
+					break
+				}
+			}
+		}).ShouldNot(Panic())
+		
+		Expect(tokens).To(HaveLen(2)) // STRING token + EOF token
+		Expect(tokens[0].Type).To(Equal(token.STRING))
+		Expect(tokens[0].Literal).To(Equal("")) // Unterminated string yields empty literal
 	})

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/dsl/lexer/lexer_test.go` around lines 1193 - 1207, The test for the
unterminated string with trailing backslash currently only validates that no
panic occurs, but should also verify the correct tokenization behavior. Add
assertions within the token iteration loop to capture and verify: the token type
should be token.STRING before reaching token.EOF, the token literal value should
be an empty string, and there should be exactly one token before the EOF token.
Store the tokens in a slice or counter variable during the loop, then assert
these properties after the loop completes to document the expected behavior and
catch any regression in how unterminated strings are tokenized by the Lexer.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@pkg/dsl/lexer/lexer_test.go`:
- Around line 1193-1207: Add additional regression test cases to the test file
following the same pattern as the existing "Case 11" test. Create separate test
cases for valid escape sequences at string end (such as "\n and "\t), invalid
escape sequences at string end (such as "\x and "\z), and multiple backslashes
at string end (such as "\\ and "\\\). Each new test case should follow the same
structure as Case 11: create a NewLexer with the unterminated string input,
iterate through tokens with NextToken() until reaching token.EOF, and verify
that the operation does not panic. This ensures comprehensive regression
coverage for all unterminated string patterns with escape sequences.

---

Nitpick comments:
In `@pkg/dsl/lexer/lexer_test.go`:
- Around line 1193-1207: The test for the unterminated string with trailing
backslash currently only validates that no panic occurs, but should also verify
the correct tokenization behavior. Add assertions within the token iteration
loop to capture and verify: the token type should be token.STRING before
reaching token.EOF, the token literal value should be an empty string, and there
should be exactly one token before the EOF token. Store the tokens in a slice or
counter variable during the loop, then assert these properties after the loop
completes to document the expected behavior and catch any regression in how
unterminated strings are tokenized by the Lexer.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 5a2c45ea-e34e-4ca5-a6ac-619f66a5facb

📥 Commits

Reviewing files that changed from the base of the PR and between aa3a7c6 and 08a3dd0.

📒 Files selected for processing (2)

pkg/dsl/lexer/lexer.go
pkg/dsl/lexer/lexer_test.go

coderabbitai · 2026-06-19T01:16:40Z

+	It("Case 11 - Unterminated string ending in a trailing backslash should not panic", func() {
+		// Regression test: a string literal that ends right after a backslash
+		// (no character left to escape) used to overshoot the input bounds
+		// and panic with a slice-out-of-range error.
+		l := NewLexer("\"\\")
+
+		Expect(func() {
+			for {
+				tok := l.NextToken()
+				if tok.Type == token.EOF {
+					break
+				}
+			}
+		}).ShouldNot(Panic())
+	})


⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash # Description: Search for existing tests covering escape sequences and unterminated strings # Search for test cases involving escape sequences in strings rg -n -A 5 -B 2 'NewLexer.*\\\\' pkg/dsl/lexer/lexer_test.go # Search for test cases with unterminated strings rg -n -A 5 -B 2 'unterminated|Unterminated' pkg/dsl/lexer/lexer_test.go

Repository: Permify/permify

Length of output: 680

🏁 Script executed:

# Get all test cases in the file to see the scope rg -n 'It\(' pkg/dsl/lexer/lexer_test.go | head -20

Repository: Permify/permify

Length of output: 615

🏁 Script executed:

# Search for tests with actual escape sequences in test inputs rg -n '\\\\n|\\\\t|\\\\x|\\\\z|\\\\\\\\' pkg/dsl/lexer/lexer_test.go | head -20

Repository: Permify/permify

Length of output: 41

🏁 Script executed:

# Get context around Case 11 and nearby test cases to see what else is tested sed -n '1150,1250p' pkg/dsl/lexer/lexer_test.go

Repository: Permify/permify

Length of output: 1448

🏁 Script executed:

# Find and examine the lexer implementation to understand escape sequence handling fd lexer.go pkg/dsl/lexer/ --type f

Repository: Permify/permify

Length of output: 82

🏁 Script executed:

# Search for escape sequence handling in the lexer rg -n -B 3 -A 10 'escape|backslash' pkg/dsl/lexer/lexer.go | head -40

Repository: Permify/permify

Length of output: 823

🏁 Script executed:

# Get more of the lexString function to see complete escape handling sed -n '205,280p' pkg/dsl/lexer/lexer.go

Repository: Permify/permify

Length of output: 1836

Add regression tests for other escape-sequence edge cases in unterminated strings.

The trailing backslash case (Case 11) is properly tested, but similar edge cases with escape sequences lack coverage. The lexer implementation has a safe if l.ch == 0 check that prevents panics, but the following scenarios should be explicitly tested:

Valid escape sequences at string end: "\n, "\t

Invalid escape sequences at string end: "\x, "\z

Multiple backslashes at string end: "\\, "\\\

Adding these tests ensures regression protection and documents the expected behavior for all unterminated string patterns.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@pkg/dsl/lexer/lexer_test.go` around lines 1193 - 1207, Add additional regression test cases to the test file following the same pattern as the existing "Case 11" test. Create separate test cases for valid escape sequences at string end (such as "\n and "\t), invalid escape sequences at string end (such as "\x and "\z), and multiple backslashes at string end (such as "\\ and "\\\). Each new test case should follow the same structure as Case 11: create a NewLexer with the unterminated string input, iterate through tokens with NextToken() until reaching token.EOF, and verify that the operation does not panic. This ensures comprehensive regression coverage for all unterminated string patterns with escape sequences.

codecov · 2026-06-19T01:21:59Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 74.69%. Comparing base (70b1110) to head (08a3dd0).
⚠️ Report is 26 commits behind head on master.

❌ Your project check has failed because the head coverage (74.69%) is below the target coverage (75.00%). You can increase the head coverage or adjust the target coverage.

❗ There is a different number of reports uploaded between BASE (70b1110) and HEAD (08a3dd0). Click for more details.

HEAD has 2 uploads less than BASE

Flag BASE (70b1110) HEAD (08a3dd0)

3 1

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #3007      +/-   ##
==========================================
- Coverage   82.58%   74.69%   -7.89%     
==========================================
  Files          74       83       +9     
  Lines        8300     9215     +915     
==========================================
+ Hits         6854     6882      +28     
- Misses        910     1798     +888     
+ Partials      536      535       -1

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

fix(dsl): prevent lexer panic on string ending in trailing backslash

08a3dd0

coderabbitai Bot reviewed Jun 19, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(dsl): prevent lexer panic on string ending in trailing backslash#3007

fix(dsl): prevent lexer panic on string ending in trailing backslash#3007
ManthanNimodiya wants to merge 1 commit into
Permify:masterfrom
ManthanNimodiya:fix/lexer-string-panic-on-trailing-backslash

ManthanNimodiya commented Jun 19, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Jun 19, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

ManthanNimodiya commented Jun 19, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Jun 19, 2026

Uh oh!

codecov Bot commented Jun 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ManthanNimodiya commented Jun 19, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bug

Root cause

Fix

Testing

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Jun 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

ManthanNimodiya commented Jun 19, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

codecov Bot commented Jun 19, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ManthanNimodiya commented Jun 19, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 19, 2026 •

edited

Loading