Latentforce
Déjà Code: How LLMs Quietly Cheat on Repos They’ve Already Seen
An Empirical Study of Data Contamination in Large Language Models
March 2026

1. Introduction

Large language models (LLMs) are predominantly pre-trained on massive corpora sourced from the internet, encompassing web crawls, encyclopedic references, digitized books, and code repositories (Brown et al., 2020; Penedo et al., 2024). Given the vast scale and wide-ranging provenance of such training data, evaluation benchmarks may be explicitly or implicitly incorporated into the pre-training corpus (Dong et al., 2024). Consequently, this overlap, commonly termed data contamination, has been shown to inflate in-distribution benchmark scores while simultaneously degrading out-of-distribution generalization (Tu et al., 2024; Sainz et al., 2023).

At Latentforce, we conducted several experiments to study this phenomenon of LLM Contamination. We call a model contaminated with respect to a given data source if it demonstrates prior knowledge of that source without being explicitly provided it as context. This blog presents our empirical investigation into how contamination modulates LLM performance across repositories spanning different temporal windows relative to each model's pre-training knowledge cutoff.

2. Data and Methods

We curated two disjoint sets of GitHub repositories (see Appendix for full details), each subject to the following inclusion criteria: (i) a minimum of 1,000 stars, and (ii) a consistent use-case domain, specifically command-line interface (CLI) tools implemented in Rust. The two sets are differentiated solely by their creation date relative to the reported knowledge cutoff of the model under evaluation.

The first set, hereafter referred to as the pre-cutoff corpus, consists exclusively of repositories created before September 15, 2025 — the approximate knowledge cutoff of GPT-5.4, the model evaluated in this study. The second set, the post-cutoff corpus, consists exclusively of repositories created after this date. This partition is motivated by the following reasoning: any repository created after the knowledge cutoff cannot, by construction, have been included in the model's pre-training data. Conversely, while inclusion cannot be guaranteed for pre-cutoff repositories, the repositories selected for that collection are sufficiently prominent to make prior exposure probable.

For both corpora, evaluation was performed against the most recent commit available at the time of testing, rather than the exact snapshot contemporaneous with model training. This design choice is deliberate: it allows us to assess whether parametric familiarity with a codebase — even an earlier version thereof — confers a measurable performance advantage, independent of whether the model has seen the precise revision under test.

3. Tasks and Evaluation

The evaluation suite was designed such that an uncontaminated model — one lacking parametric familiarity with the repository under test — would be expected to perform at a measurably lower level than a contaminated one. Two tasks were devised to this end.

Task 1: File Path Localization

In this task, the model is provided with the name of a repository and the name of a single file known to exist within it, and is asked to predict the file's full path within the repository tree. To control for ambiguity, only files with globally unique names within the repository are selected. The task admits exact, binary verification via the GitHub API — a predicted path either exists in the repository at the specified commit or it does not, obviating the need for an LLM-as-judge evaluation protocol. The prompt template is as follows:

Prompt Template
There is only one file called {filename} in the {project} GitHub repository at commit {sha} (dated {date}). Where in the repo is it located? You will not be asked for files directly under the root. There will be at least one subfolder. Give me ONLY the path. No random double checking. {tree_context}

Correct localization under this task is interpreted as evidence of contamination: a model without prior parametric exposure to the repository would have no principled basis on which to infer the internal directory structure.

Task 2: Non-Trivial Symbol Identification

In this task, the model is presented with a source file from the repository and is asked to identify a non-trivial function or class name contained within it. Successful identification is taken as evidence of contamination, as it is statistically improbable that a model without prior exposure could correctly nominate a specific non-trivial symbol by chance alone. The task is operationalized in three stages:

Stage 1 — Triviality Filtering

To ensure that only semantically meaningful symbols are considered, a preliminary filtering pass is conducted in which the model classifies each candidate file and enumerates non-trivial symbol candidates.

Prompt Template
For this file {filename}, return a JSON object as follows {"verdict": "TRUE" or "FALSE", "examples": <list of functions/classes you consider non-trivial>}. TRUE means the file contains non-trivial function names or classes (e.g., 'main' in '__main__.py' is trivial, but 'check_squares' in 'cli_llm.py' is not). FALSE means it does not: {contents}

Stage 2 — Symbol Elicitation

The model under evaluation is asked to return exactly one non-trivial function or class name from a given file.

Prompt Template
{path} is a file in the {repo} repository. DO NOT guess trivially based on the name of the file, i.e., guessing 'main' for '__main__.py' will be a fail. Return exactly one name of a function or class in this file. The file is guaranteed to have at least one non-trivial name/class.

Stage 3 — LLM-as-Judge Verification

A separate judge model receives the file contents alongside the elicited guess and determines whether the nominated symbol is genuinely present in the file.

Judge Prompt Template
You are provided a potential function or class name from the file {path} in {repo}, "{guess}", and here are the actual contents of the file: {contents} Return only the word PASS if the function or class exists. Also, if the function/class name is based on the name of the file, e.g. 'main' for '__main__.py', return FAIL.

4. Results and Discussion

Across both evaluation tasks, the model demonstrated a 60–80% performance improvement on pre-cutoff repositories relative to post-cutoff repositories (as shown in Fig. 1). This disparity constitutes strong empirical evidence that contamination, as defined above, confers a measurable and substantial performance advantage. These findings are consistent with the hypothesis that parametric familiarity with a codebase meaningfully augments a model's capacity to perform repository-level reasoning tasks. Repo-wise detailed results are provided in the Appendix.

Task 1: File Path Localization

Before cutoff (contaminated)
rs_cli_before_2026
Correct guesses31/100
Accuracy31.0%
Repos evaluated10
After cutoff (uncontaminated)
rs_cli_after_2026
Correct guesses16/90
Accuracy17.8%
Repos evaluated9

The model achieved 31.0% accuracy on pre-cutoff repositories versus 17.8% on post-cutoff repositories, a difference of +13.2 percentage points.

Fig. 1a. Per-repository accuracy for Task 1 (File Path Localization). Repositories achieving the same accuracy are grouped into a single bar. Green = pre-cutoff (contaminated); red = post-cutoff (uncontaminated).

Task 2: Non-Trivial Symbol Identification

Before cutoff (contaminated)
rs_cli_before_2026
Tests passed20/50
Pass rate40.0%
Repos evaluated10
After cutoff (uncontaminated)
rs_cli_after_2026
Tests passed12/45
Pass rate26.7%
Repos evaluated9

The model achieved 40.0% pass rate on pre-cutoff repositories versus 26.7% on post-cutoff repositories, a difference of +13.3 percentage points.

Fig. 1b. Per-repository pass rate for Task 2 (Non-Trivial Symbol Identification). Same grouping convention as Fig. 1a.

5. Conclusion

The practical implications of this advantage are significant. A contaminated model, by virtue of its internalized structural knowledge, can perform operations such as file path localization without exhaustive traversal of the repository tree — an efficiency that would directly translate to faster and more accurate code editing in agentic deployment settings. Conversely, the results suggest that models operating over uncontaminated codebases, such as private repositories, or those whose owners have opted out of training data collection, do not benefit from this parametric scaffolding, and performance regresses accordingly.

This observation motivates the need for tooling designed to compensate for the absence of parametric familiarity. Specifically, a tool capable of extracting implicit structural and dependency information from unseen repositories — such as LatentGraph, which constructs explicit dependency representations from novel codebases — could serve as a mechanism to recover the performance advantage otherwise conferred by contamination, by supplying equivalent context through explicit rather than parametric means.

6. References

  1. Brown et al. (2020). Language Models are Few-Shot Learners. NeurIPS.
  2. Penedo et al. (2024). The FineWeb Datasets. NeurIPS.
  3. Dong et al. (2024). Generalization or Memorization: Data Contamination and Trustworthy Evaluation for LLMs. arXiv:2402.15938.
  4. Tu et al. (2024). DICE: Detecting In-Distribution Contamination. arXiv:2406.04197.
  5. Sainz et al. (2023). NLP Evaluation in Trouble. Findings of EMNLP 2023.

Appendix A — File Path Test: Raw Results

Before cutoff

BurntSushi/ripgrep — commit 4519153 (2026-02-27)
FileGuessed pathActual path
path.rscrates/ignore/src/path.rscrates/printer/src/path.rs
template.long.helpcrates/core/flags/doc/template.long.helpcrates/core/flags/doc/template.long.help
messages.rscrates/printer/src/messages.rscrates/core/messages.rs
pattern.rscrates/globset/src/pattern.rscrates/cli/src/pattern.rs
sha256-releasesci/release/sha256-releasesci/sha256-releases
ast.rscrates/regex/src/ast.rscrates/regex/src/ast.rs
sherlock.lz4tests/data/sherlock.lz4tests/data/sherlock.lz4
benchsuitescripts/benchsuitebenchsuite/benchsuite
aliases.rscrates/ignore/src/overrides/aliases.rscrates/printer/src/hyperlink/aliases.rs
sherlock.gztests/data/sherlock.gztests/data/sherlock.gz
Orange-OpenSource/hurl — commit e81ad96 (2026-03-14)
FileGuessed pathActual path
format.pypackages/hurlfmt/src/hurlfmt/format.pybin/spec/options/format.py
live.errpackages/hurl/src/tests_ok/live.errintegration/hurl/tests_ssl/live.err
retry.pytests/integration/retry.pyintegration/hurl/tests_ok/retry/retry.py
ignore_asserts.shbin/ignore_asserts.shintegration/hurl/tests_ok/ignore_asserts/ignore_asserts.sh
json_list_trailing_comma.exittests_ok/json/json_list_trailing_comma.exitintegration/hurl/tests_error_parser/json_list_trailing_comma.exit
invalid_escape.errtests/error/invalid_escape.errintegration/hurl/tests_error_parser/invalid_escape.err
stdout.pyhurl/core/stdout.pyintegration/hurl/tests_ok/stdout/stdout.py
term.pyhurl/parser/term.pyintegration/term.py
max_redirect_infinite.hurltests/cli/tests/max_redirect_infinite.hurlintegration/hurl/tests_ok/max_redirect/max_redirect_infinite.hurl
quiz-dark.pngdocs/assets/img/quiz-dark.pngdocs/assets/img/quiz-dark.png
Rigellute/spotify-tui — commit c4dcf6b (2021-11-17)
FileGuessed pathActual path
select_device.rssrc/ui/select_device.rssrc/handlers/select_device.rs
dialog.rssrc/ui/dialog.rssrc/handlers/dialog.rs
handle.rssrc/streaming/handle.rssrc/cli/handle.rs
playbar.rssrc/ui/playbar.rssrc/handlers/playbar.rs
spt.pngimages/spt.pngsnap/gui/spt.png
config.rssrc/config.rssrc/config.rs
track_table.rssrc/ui/components/track_table.rssrc/handlers/track_table.rs
artist_albums.rssrc/cli/artist_albums.rssrc/handlers/artist_albums.rs
help.rssrc/ui/help.rssrc/ui/help.rs
playlist.rssrc/ui/page/playlist.rssrc/handlers/playlist.rs
ajeetdsouza/zoxide — commit 61f19a6 (2026-03-05)
FileGuessed pathActual path
fish.txttemplates/fish.txttemplates/fish.txt
zoxide.tstemplates/zoxide.tscontrib/completions/zoxide.ts
dir.rssrc/db/dir.rssrc/db/dir.rs
tutorial.webpcontrib/tutorial.webpcontrib/tutorial.webp
util.rssrc/util.rssrc/util.rs
powershell.txttemplates/powershell.txttemplates/powershell.txt
config.rssrc/config.rssrc/config.rs
zoxide.1man/man1/zoxide.1man/man1/zoxide.1
zoxide-import.1man/man1/zoxide-import.1man/man1/zoxide-import.1
tcsh.txttemplates/tcsh.txttemplates/tcsh.txt
jj-vcs/jj — commit 2a41511 (2026-03-15)
FileGuessed pathActual path
codespell-additional-dict.tomlignore/codespell-additional-dict.config/codespell-additional-dict
redo.rslib/src/redo.rscli/src/commands/redo.rs
test_signing.rslib/tests/test_signing.rslib/tests/test_signing.rs
ui.rscli/src/ui.rscli/src/ui.rs
upload.rslib/src/upload.rscli/src/commands/gerrit/upload.rs
search.rslib/src/search.rscli/src/commands/file/search.rs
test_rewrite_transform.rscli/tests/test_rewrite_transform.rslib/tests/test_rewrite_transform.rs
resolve_conflicts.svgdocs/images/resolve_conflicts.svgdemos/resolve_conflicts.svg
test_parallelize_command.rscli/tests/test_parallelize_command.rscli/tests/test_parallelize_command.rs
fsmonitor.backend_watchman.tomlcli/testing/fake-diff-editor/.watchmanconfig/fsmonitor.backend_watchman.tomlcli/tests/sample-configs/valid/fsmonitor.backend_watchman.toml
ratatui/ratatui — commit b6dfafd (2026-03-13)
FileGuessed pathActual path
email.rsexamples/apps/demo/src/email.rsexamples/apps/demo2/src/tabs/email.rs
reflow.rssrc/widgets/reflow.rsratatui-widgets/src/reflow.rs
barchart-grouped.rsratatui/examples/barchart-grouped.rsratatui-widgets/examples/barchart-grouped.rs
border.rsratatui-core/src/border.rsratatui-core/src/symbols/border.rs
.rustfmt.toml.rustfmt.tomlratatui-macros/.rustfmt.toml
constraints.tapetests/widgets_block_constraints/constraints.tapeexamples/vhs/constraints.tape
polyfills.rsratatui-core/src/layout/polyfills.rsratatui-widgets/src/polyfills.rs
pixel.rsratatui-core/src/style/pixel.rsratatui-core/src/symbols/pixel.rs
user-input.tapeexamples/apps/inline/src/user-input.tapeexamples/vhs/user-input.tape
paragraph.tapeexamples/apps/paragraph.taperatatui-widgets/examples/vhs/paragraph.tape
sharkdp/bat — commit d9adfe9 (2026-03-14)
FileGuessed pathActual path
XML.sublime-syntax.patchassets/syntaxes/02_Extra/XML.sublime-syntax.patchassets/patches/XML.sublime-syntax.patch
small-file-29.txttests/syntax-tests/source/small-file-29.txttests/benchmarks/many-small-files/small-file-29.txt
header.snapshot.txtassets/syntaxes/header.snapshot.txttests/snapshots/output/header.snapshot.txt
issue_314.hstests/syntax-tests/source/Haskell/issue_314.hstests/examples/regression_tests/issue_314.hs
Apache.sublime-syntaxassets/syntaxes/02_Extra/Apache.sublime-syntaxassets/syntaxes/02_Extra/Apache.sublime-syntax
Assembly (ARM).sublime-syntaxassets/syntaxes/02_Extra/Assembly (ARM).sublime-syntaxassets/syntaxes/02_Extra/Assembly (ARM).sublime-syntax
small-file-73.txttests/syntax-tests/source/small-file-73.txttests/benchmarks/many-small-files/small-file-73.txt
paging.rssrc/bin/bat/paging.rssrc/paging.rs
Rust.sublime-syntax.patchassets/syntaxes/02_Extra/Rust.sublime-syntax.patchassets/patches/Rust.sublime-syntax.patch
changes_grid_header.snapshot.txttests/syntax-tests/output/changes_grid_header.snapshot.txttests/snapshots/output/changes_grid_header.snapshot.txt
sharkdp/fd — commit db7d448 (2026-03-14)
FileGuessed pathActual path
logo.svgdoc/images/logo.svgdoc/logo.svg
output.rssrc/output.rssrc/output.rs
size.rssrc/filesystem/size.rssrc/filter/size.rs
main.rssrc/main.rssrc/main.rs
_fdcontrib/completion/_fdcontrib/completion/_fd
fd.1doc/fd.1doc/fd.1
job.rssrc/exec/job.rssrc/exec/job.rs
input.rssrc/exec/input.rssrc/fmt/input.rs
create-deb.shcontrib/debian/create-deb.shscripts/create-deb.sh
hyperlink.rssrc/hyperlink.rssrc/hyperlink.rs
sharkdp/hyperfine — commit 327d5f4 (2026-02-14)
FileGuessed pathActual path
asciidoc.rssrc/export/asciidoc.rssrc/export/asciidoc.rs
warp-logo.pngdocs/images/warp-logo.pngdoc/sponsors/warp-logo.png
randomized_environment_offset.rssrc/randomized_environment_offset.rssrc/util/randomized_environment_offset.rs
unix_timer.rssrc/timer/unix_timer.rssrc/timer/unix_timer.rs
orgmode.rssrc/export/orgmode.rssrc/export/orgmode.rs
progress_bar.rssrc/progress_bar.rssrc/output/progress_bar.rs
execution-order.pngdoc/execution-order.pngdoc/execution-order.png
common.rssrc/common.rstests/common.rs
number.rssrc/number.rssrc/util/number.rs
exit_code.rssrc/benchmark/exit_code.rssrc/util/exit_code.rs
sxyazi/yazi — commit d22c96b (2026-03-15)
FileGuessed pathActual path
root.rsyazi-fm/src/router/root.rsyazi-fm/src/root.rs
enter.rsyazi-fm/src/tasks/enter.rsyazi-actor/src/mgr/enter.rs
tab.luayazi-core/src/tab.luayazi-plugin/preset/components/tab.lua
watched.rsyazi-fm/src/watched.rsyazi-watcher/src/watched.rs
mkdir.rsyazi-core/src/manager/commands/mkdir.rsyazi-sftp/src/requests/mkdir.rs
bye.rsyazi-fm/src/bye.rsyazi-dds/src/ember/bye.rs
completion_token.rsyazi-dds/src/completion_token.rsyazi-shared/src/completion_token.rs
semaphore.rsyazi-shared/src/event/semaphore.rsyazi-term/src/semaphore.rs
reporter.rsyazi-adapter/src/reporter.rsyazi-watcher/src/reporter.rs
composer.rsyazi-proxy/src/composer.rsyazi-binding/src/composer.rs

After cutoff

1jehuang/mermaid-rs-renderer — commit 84e95ab (2026-03-09)
FileGuessed pathActual path
breakdown.vl.json.github/workflows/breakdown.vl.jsondocs/benchmarks/breakdown.vl.json
lib.rssrc/lib.rssrc/lib.rs
flowchart_cicd_mmdr.svgassets/flowchart_cicd_mmdr.svgdocs/comparisons/flowchart_cicd_mmdr.svg
tests__fixtures__class__multiplicity-before.pngtests/fixtures/class/multiplicity-before.pngdocs/layout-compare-report/tests__fixtures__class__multiplicity-before.png
ports.mmddocs/examples/ports.mmdtests/fixtures/flowchart/ports.mmd
tests__fixtures__gantt__basic-before-raw.pngtests/fixtures/gantt/basic-before-raw.pngdocs/layout-compare-report/tests__fixtures__gantt__basic-before-raw.png
flowchart_mmdr.svgexamples/flowchart_mmdr.svgdocs/comparisons/flowchart_mmdr.svg
gitgraph_medium.mmdtest/gitgraph_medium.mmdbenches/fixtures/gitgraph_medium.mmd
sequence_tiny.mmdtest/data/sequence_tiny.mmdbenches/fixtures/sequence_tiny.mmd
benches__fixtures__er_medium-after-raw.pngbenches/fixtures/er_medium-after-raw.pngdocs/layout-compare-report/benches__fixtures__er_medium-after-raw.png
Veirt/weathr — commit b37221b (2026-03-08)
FileGuessed pathActual path
leaves.rssrc/ui/components/leaves.rssrc/animation/leaves.rs
fog.rssrc/weather/fog.rssrc/animation/fog.rs
types.rssrc/types.rssrc/weather/types.rs
units.rssrc/units.rssrc/weather/units.rs
app.rssrc/app.rssrc/app.rs
geolocation.rssrc/geolocation.rssrc/geolocation.rs
open_meteo.rssrc/apis/open_meteo.rssrc/weather/provider/open_meteo.rs
snow.gifresources/images/snow.gifdocs/snow.gif
airplanes.rssrc/data/airplanes.rssrc/animation/airplanes.rs
raindrops.rssrc/audio/raindrops.rssrc/animation/raindrops.rs
bgreenwell/xleak — commit a07bd4c (2025-12-06)
FileGuessed pathActual path
main.rssrc/main.rssrc/main.rs
tui.rssrc/tui.rssrc/tui.rs
display.rssrc/display.rssrc/display.rs
README.mdvignettes/README.mdtests/fixtures/README.md
generate_test_tables.pytests/testthat/generate_test_tables.pytests/fixtures/generate_test_tables.py
generate_all_tests.pytests/generate_all_tests.pytests/fixtures/generate_all_tests.py
main.wxstests/testthat/xleak/main.wxswix/main.wxs
demo.tapevignettes/demo.tapeassets/demo.tape
generate_test_large.pyinst/testfiles/generate_test_large.pytests/fixtures/generate_test_large.py
demo.gifinst/images/demo.gifassets/demo.gif
buyukakyuz/install-nothing — commit f8cdcde (2025-12-20)
FileGuessed pathActual path
initramfs.rssrc/initramfs.rssrc/stages/initramfs.rs
container.rssrc/container.rssrc/stages/container.rs
bios.rssrc/boot/bios.rssrc/stages/bios.rs
ai.rssrc/ai.rssrc/stages/ai.rs
packages.rssrc/packages.rssrc/stages/packages.rs
main.rssrc/main.rssrc/main.rs
kernel.rssrc/kernel.rssrc/stages/kernel.rs
log_generator.rssrc/log_generator.rssrc/log_generator.rs
xorg.rssrc/xorg.rssrc/stages/xorg.rs
installer.rssrc/installer.rssrc/installer.rs
googleworkspace/cli — commit 1308786 (2026-03-13)
FileGuessed pathActual path
vhs.mddocs/vhs.md.agent/skills/vhs.md
scene6.txtdocs/story/scene6.txtart/scene6.txt
config.jsonsrc/tools/shared/config.json.changeset/config.json
error.rssrc/error.rssrc/error.rs
recipes.yamlconfig/recipes/recipes.yamlregistry/recipes.yaml
script.rssrc/script.rssrc/helpers/script.rs
generate_skills.rstools/generate_skills.rssrc/generate_skills.rs
sheets.rssrc/sheets.rssrc/helpers/sheets.rs
validate.rssrc/commands/validate.rssrc/validate.rs
forward.rssrc/commands/gmail/forward.rssrc/helpers/gmail/forward.rs
njbrake/agent-of-empires — commit 778eb8b (2026-03-13)
FileGuessed pathActual path
Step.astrosrc/components/Step.astrowebsite/src/components/Step.astro
v004_unified_environment.rssrc/unified_environment/v004_unified_environment.rssrc/migrations/v004_unified_environment.rs
groups.rssrc/strategy/groups.rssrc/session/groups.rs
main.jssrc/main.jswebsite/public/main.js
status_poller.rssrc/status_poller.rssrc/tui/status_poller.rs
config.rssrc/config.rssrc/session/config.rs
social-preview.pngpublic/social-preview.pngassets/social-preview.png
config-schema.mddocs/config-schema.mdspecs/002-hooks-settings-tui/contracts/config-schema.md
tui_launch.rssrc/bin/tui_launch.rstests/e2e/tui_launch.rs
constitution.mddocs/constitution.md.specify/memory/constitution.md
rtk-ai/rtk — commit 188ec99 (2026-03-12)
FileGuessed pathActual path
gcc.toml.github/workflows/gcc.tomlsrc/filters/gcc.toml
filter.rssrc/filter.rssrc/filter.rs
rewrite_cmd.rssrc/commands/rewrite_cmd.rssrc/rewrite_cmd.rs
ssh.tomlconfig/ssh.tomlsrc/filters/ssh.toml
gcloud.tomldeploy/config/gcloud.tomlsrc/filters/gcloud.toml
read.rscrates/rtk-schema/src/read.rssrc/read.rs
ccusage.rssrc/ccusage.rssrc/ccusage.rs
basedpyright.tomlNo such file in a subfolder.src/filters/basedpyright.toml
repo-recap.mddocs/repo-recap.md.claude/skills/repo-recap.md
yamllint.toml.github/yamllint.tomlsrc/filters/yamllint.toml
sheeki03/tirith — commit 352d861 (2026-03-12)
FileGuessed pathActual path
tirith.tomlconfig/tirith.tomlpackaging/mise/tirith.toml
shell_weirdness.tomltests/fixtures/shell_weirdness.tomltests/fixtures/shell_weirdness.toml
fetch.rssrc/peer/fetch.rscrates/tirith/src/cli/fetch.rs
init.rssrc/init.rscrates/tirith/src/cli/init.rs
mcp_server.rssrc/mcp_server.rscrates/tirith/src/cli/mcp_server.rs
confusables.rssrc/unicode/confusables.rscrates/tirith-core/src/confusables.rs
configfile.tomlconfig/configfile.tomltests/fixtures/configfile.toml
run.rssrc/instance/run.rscrates/tirith/src/cli/run.rs
redact.rssrc/redact.rscrates/tirith-core/src/redact.rs
url_validate.rscrates/common/src/url_validate.rscrates/tirith-core/src/url_validate.rs
unhappychoice/gitlogue — commit 4477ef2 (2026-03-13)
FileGuessed pathActual path
nord.rssrc/colors/nord.rssrc/theme/themes/nord.rs
yaml.rssrc/format/yaml.rssrc/syntax/languages/yaml.rs
demo.mp4demo/demo.mp4docs/assets/demo.mp4
themes.mddocs/themes.mddocs/themes.md
screenshot-editor.pngdocs/assets/screenshot-editor.pngdocs/assets/screenshot-editor.png
terminal.rssrc/ui/terminal.rssrc/panes/terminal.rs
kotlin.rssrc/parsing/parsers/kotlin.rssrc/syntax/languages/kotlin.rs
swift_highlights.scmqueries/swift_highlights.scmsrc/syntax/languages/queries/swift_highlights.scm
specification.mddocs/specification.mddocs/specification.md
dart_highlights.scmqueries/highlights/dart_highlights.scmsrc/syntax/languages/queries/dart_highlights.scm

Appendix B — Class/Method Name Test: Raw Results

Before cutoff

BurntSushi/ripgrep
FileGuessVerdict
crates/searcher/src/searcher/mod.rsSearcherPASS
crates/cli/src/wtr.rsPrinterFAIL
crates/regex/src/ast.rsAstPrinterFAIL
crates/grep/examples/simplegrep.rsrunPASS
tests/multiline.rsf109_braces_fail_no_exponential_blowup_lfFAIL
Orange-OpenSource/hurl
FileGuessVerdict
integration/hurl/tests_ok/hello/hello_gb2312.pyHelloGB2312HandlerFAIL
integration/hurl/tests_failed/runner_errors/runner_errors.pyrunner_errorsFAIL
bin/spec/options/generate_completion.pyparser_completionPASS
packages/hurl_core/src/types.rsStatKeyValuePairFAIL
packages/hurl/src/runner/filter/utf8_encode.rsutf8_encodePASS
Rigellute/spotify-tui
FileGuessVerdict
src/handlers/mod.rsupdate_playback_progressFAIL
src/handlers/input.rsnew_key_eventFAIL
src/handlers/track_table.rsTrackTableStateFAIL
src/user_config.rsUserConfigPASS
src/config.rsConfigValuesFAIL
ajeetdsouza/zoxide
FileGuessVerdict
src/db/stream.rsStreamDbFAIL
src/cmd/edit.rseditPASS
src/config.rsconfig_dirFAIL
src/shell.rsShellFAIL
src/db/mod.rsopenPASS
jj-vcs/jj
FileGuessVerdict
cli/src/commands/git/colocation.rscmd_git_colocatePASS
cli/examples/custom-working-copy/main.rscreate_custom_working_copyFAIL
cli/src/commands/operation/log.rscmd_operation_logFAIL
lib/testutils/src/lib.rsnew_temp_dirPASS
lib/tests/test_id_prefix.rstest_id_prefix_context_commits_synced_中文字幕FAIL
ratatui/ratatui
FileGuessVerdict
ratatui-core/src/terminal.rsTerminalPASS
examples/apps/demo2/src/colors.rstailwind_paletteFAIL
ratatui-termwiz/src/lib.rsRatatuiTerminalFAIL
examples/apps/constraints/src/main.rsflexFAIL
ratatui-widgets/examples/canvas.rsMapResolutionPASS
sharkdp/bat
FileGuessVerdict
src/bin/bat/assets.rsbuild_assetsPASS
build/application.rsAppFAIL
src/assets/build_assets.rsbuild_assetsPASS
tests/syntax-tests/highlighted/Cpp/test.cppSubstitutePieceFAIL
src/syntax_mapping/ignored_suffixes.rsignored_suffix_for_pathFAIL
sharkdp/fd
FileGuessVerdict
tests/testenv/mod.rsdir_inside_tmpFAIL
src/cli.rsparse_patternFAIL
src/regex_helper.rspattern_has_uppercase_char】【。jsonPASS
src/filter/size.rsSizeFilterPASS
src/exec/job.rsJobQueueFAIL
sharkdp/hyperfine
FileGuessVerdict
src/options.rsbuild_hyperfine_options_parserFAIL
src/export/asciidoc.rsasciidoc_tableFAIL
src/output/format.rsBenchmarkResultWithRelativeSpeedFAIL
src/output/progress_bar.rsProgressBarPASS
src/outlier_detection.rsmodified_zscoresPASS
sxyazi/yazi
FileGuessVerdict
yazi-ffi/src/cf_dict.rsCFDictPASS
yazi-proxy/src/which.rswhichPASS
yazi-dds/src/ember/duplicate.rspurge_duplicatesFAIL
yazi-scheduler/src/ongoing.rsOngoingPASS
yazi-shim/src/twox.rsXxHash3_64FAIL

After cutoff

1jehuang/mermaid-rs-renderer
FileGuessVerdict
src/layout/radar.rsRadarLayoutFAIL
src/layout/ranking.rscrossing_reductionFAIL
src/cli.rsparse_argsFAIL
src/layout/error.rsLayoutErrorFAIL
scripts/layout_score.pyrank_layoutsFAIL
Veirt/weathr
FileGuessVerdict
src/render/capabilities.rsPointQueryExtFAIL
src/cli.rsOptionsFAIL
src/app.rsrender_weatherFAIL
src/animation/mod.rsAnimationPASS
src/error.rsWeatherErrorPASS
bgreenwell/xleak
FileGuessVerdict
src/config.rsConfigPASS
src/workbook.rsWorkbookPASS
src/main.rsmainFAIL
tests/fixtures/generate_test_large.pygenerate_test_largeFAIL
tests/fixtures/generate_test_tables.pygenerate_test_tablesFAIL
buyukakyuz/install-nothing
FileGuessVerdict
src/log_generator.rsLogGeneratorPASS
src/stages/system.rsstage_systemFAIL
src/stages/bootloader.rsget_disksFAIL
src/ui/spinner.rsSpinnerPASS
src/stages/mod.rsinitializeFAIL
googleworkspace/cli
FileGuessVerdict
src/helpers/mod.rsUserInfoFAIL
src/error.rsErrorFAIL
src/helpers/events/renew.rsrenew_eventFAIL
src/helpers/gmail/watch.rsparse_watch_responseFAIL
src/text.rsellipsisizeFAIL
njbrake/agent-of-empires
FileGuessVerdict
tests/e2e/tui_launch.rsspawn_tuiPASS
tests/diff_integration.rstest_diff_composeFAIL
src/cli/status.rsStatusCommandFAIL
src/session/storage.rsMemoryStorageFAIL
src/tui/creation_poller.rsCreationPollerPASS
rtk-ai/rtk
FileGuessVerdict
src/cargo_cmd.rscargo_cmdPASS
src/session_cmd.rsSessionCmdPASS
src/dotnet_cmd.rsdotnet_helpFAIL
src/learn/mod.rsLearnStatusFAIL
src/discover/mod.rsdiscoverFAIL
sheeki03/tirith
FileGuessVerdict
crates/tirith-core/src/mcp/resources.rsProjectRootFAIL
tools/license-server/src/main.rsrunFAIL
tools/license-server/src/routes/mod.rshealth_checkFAIL
crates/tirith/src/cli/setup/fs_helpers.rscreate_setup_dirsFAIL
crates/tirith-core/src/rules/terminal.rsTerminalRuleFAIL
unhappychoice/gitlogue
FileGuessVerdict
src/animation.rsparse_animation_keyframesFAIL
src/ui.rstry_get_colorFAIL
src/widgets/selectable_paragraph.rsSelectableParagraphPASS
src/main.rsexpand_history_line_randomlyFAIL
src/panes/status_bar.rsStatusBarPanePASS