Context: skip missing @file refs (reduce prompt noise) #244

zikajk · 2025-12-21T19:28:19Z

When users include @some-file in prompts (or inside AGENTS.md), missing/unreadable files used to inject placeholder strings like “File not found” into the model context, creating pure noise and risking downstream issues.

This change makes missing/unreadable file contexts a no-op:

llm-api/refine-file-context logs warnings and returns nil instead of placeholder text
context collection skips nil contexts (including recursive AGENTS.md parsing and directory expansions), preventing nil/“File not found” from reaching the prompt
prompt metadata clarifies AGENTS.md as “Primary System Directives & Coding Standards”
I added a entry in changelog under unreleased section.

Copilot

Pull request overview

This PR improves context handling by skipping missing or unreadable file references instead of injecting placeholder text like "File not found" into the model context. When users include @file references in prompts or AGENTS.md, missing files now log warnings and are excluded from the context, reducing noise in the LLM prompt.

Key changes:

refine-file-context returns nil for missing/unreadable files instead of error strings
Context collection functions filter out nil contexts using if-let, when-let, and keep
AGENTS.md description updated to "Primary System Directives & Coding Standards"

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
src/eca/llm_api.clj	Modified `refine-file-context` to log warnings and return nil for missing/unreadable files instead of returning error strings
src/eca/features/context.clj	Updated `parse-agents-file`, `file->refined-context`, and `raw-contexts->refined` to skip nil contexts; refactored `contexts-str-from-prompt` structure
src/eca/features/prompt.clj	Updated AGENTS.md description text in context metadata
test/eca/features/context_test.clj	Added test for missing file handling in AGENTS.md; updated existing tests to mock `fs/readable?` for consistency
test/eca/features/chat_test.clj	Added test verifying missing file references don't add context noise; updated existing test structure with proper `with-redefs` wrapping
CHANGELOG.md	Added entry documenting the change

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/eca/llm_api.clj

src/eca/features/context.clj

Copilot · 2025-12-21T19:34:19Z

src/eca/features/context.clj

+             at-mentions (extract-at-mentions root-content)
+             parent-dir (str (fs/parent path))
+             resolved-paths (map (fn [mention]
+                                   (cond
+                                     ;; Absolute path
+                                     (string/starts-with? mention "/")
+                                     (str (fs/canonicalize (fs/file mention)))

-                                   ;; Relative path (./... or ../...)
-                                   (or (string/starts-with? mention "./")
-                                       (string/starts-with? mention "../"))
-                                   (str (fs/canonicalize (fs/file parent-dir mention)))
+                                     ;; Relative path (./... or ../...)
+                                     (or (string/starts-with? mention "./")
+                                         (string/starts-with? mention "../"))
+                                     (str (fs/canonicalize (fs/file parent-dir mention)))

-                                   ;; Simple filename, relative to current file's directory
-                                   :else
-                                   (str (fs/canonicalize (fs/file parent-dir mention)))))
-                               at-mentions)
-           ;; Deduplicate resolved paths
-           unique-paths (distinct resolved-paths)
-           ;; Recursively parse all mentioned files
-           nested-results (mapcat #(parse-agents-file % visited') unique-paths)]
-       (concat [{:type :agents-file
-                 :path path
-                 :content root-content}]
-               nested-results)))))
+                                     ;; Simple filename, relative to current file's directory
+                                     :else
+                                     (str (fs/canonicalize (fs/file parent-dir mention)))))


parse-agents-file resolves @ mentions in AGENTS.md to absolute paths using fs/canonicalize without constraining them to the workspace or a safe base directory, so a malicious AGENTS.md in an untrusted repo can reference paths like ../../.ssh/id_rsa and cause the client to read arbitrary files outside the project. Those file contents are then included in :agents-file contexts and sent to the LLM provider, allowing silent exfiltration of sensitive local data whenever the workspace is opened or a chat is started. Restrict @ mentions to stay within the workspace/config roots (e.g., reject or strip ../absolute paths, or enforce that canonicalized targets are under allowed directories) before reading and including them in contexts.

This can be a problem when user has an auto-approval and some malicious project (libraries and nested AGENTS.md).
It is outside of scope of this PR but it is maybe a good idea to make various permissions configurable? WDYT @ericdallo?

I dunno, it's something to discuss in other issue I guess, there are multiple ways to harm in a AGENTS.md in a project which we don't have control, so not sure there s much to do

When users include @some-file in prompts (or inside AGENTS.md), missing/unreadable files used to inject placeholder strings like “File not found” into the model context, creating pure noise and risking downstream issues. This change makes missing/unreadable file contexts a no-op: - llm-api/refine-file-context logs warnings and returns nil instead of placeholder text - context collection skips nil contexts (including recursive AGENTS.md parsing and directory expansions), preventing nil/“File not found” from reaching the prompt - prompt metadata clarifies AGENTS.md as “Primary System Directives & Coding Standards”

ericdallo

Looks good!

zikajk requested review from Copilot and ericdallo December 21, 2025 19:28

Copilot started reviewing on behalf of zikajk December 21, 2025 19:28 View session

Copilot AI reviewed Dec 21, 2025

View reviewed changes

zikajk removed the request for review from ericdallo December 21, 2025 20:15

zikajk marked this pull request as draft December 21, 2025 20:16

zikajk force-pushed the improve-context-handling branch from c36db6e to 59d7a43 Compare December 21, 2025 20:21

zikajk marked this pull request as ready for review December 21, 2025 20:24

zikajk requested a review from ericdallo December 21, 2025 20:24

zikajk force-pushed the improve-context-handling branch from 59d7a43 to 9b4fc9d Compare December 22, 2025 07:39

ericdallo approved these changes Dec 22, 2025

View reviewed changes

ericdallo merged commit 396d7fb into master Dec 22, 2025
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Context: skip missing @file refs (reduce prompt noise) #244

Context: skip missing @file refs (reduce prompt noise) #244

Uh oh!

zikajk commented Dec 21, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI Dec 21, 2025

Uh oh!

zikajk Dec 21, 2025

Uh oh!

ericdallo Dec 22, 2025

Uh oh!

ericdallo left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Context: skip missing @file refs (reduce prompt noise) #244

Context: skip missing @file refs (reduce prompt noise) #244

Uh oh!

Conversation

zikajk commented Dec 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI Dec 21, 2025

Choose a reason for hiding this comment

Uh oh!

zikajk Dec 21, 2025

Choose a reason for hiding this comment

Uh oh!

ericdallo Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

ericdallo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zikajk commented Dec 21, 2025 •

edited

Loading