Only call doCache one per request by cwperks · Pull Request #6001 · opensearch-project/security

cwperks · 2026-03-11T02:02:09Z

Description

We recently saw this method in the hotthreads output of a cluster that had many aliases pointing to the same concrete index.

   100.4% (502.1ms out of 500ms) cpu usage by thread 'opensearch[...][search][T#11]'
     3/10 snapshots sharing following 49 elements
       org.opensearch.security.privileges.dlsfls.AbstractRuleBasedPrivileges.hasRestrictedRulesExplicit(AbstractRuleBasedPrivileges.java:286)
       org.opensearch.security.privileges.dlsfls.AbstractRuleBasedPrivileges.isUnrestricted(AbstractRuleBasedPrivileges.java:212)
       org.opensearch.security.privileges.dlsfls.FieldMasking.isUnrestricted(FieldMasking.java:53)
       org.opensearch.security.configuration.DlsFlsValveImpl.hasFlsOrFieldMasking(DlsFlsValveImpl.java:519)
       org.opensearch.security.OpenSearchSecurityPlugin$4.doCache(OpenSearchSecurityPlugin.java:839)
       app//org.apache.lucene.search.IndexSearcher.createWeight(IndexSearcher.java:981)
       app//org.opensearch.search.internal.ContextIndexSearcher.createWeight(ContextIndexSearcher.java:239)
       app//org.apache.lucene.search.BooleanWeight.<init>(BooleanWeight.java:58)
       app//org.apache.lucene.search.BooleanQuery.createWeight(BooleanQuery.java:265)
       app//org.apache.lucene.search.IndexSearcher.createWeight(IndexSearcher.java:979)
       app//org.opensearch.search.internal.ContextIndexSearcher.createWeight(ContextIndexSearcher.java:239)
       app//org.apache.lucene.search.BooleanWeight.<init>(BooleanWeight.java:58)
       app//org.apache.lucene.search.BooleanQuery.createWeight(BooleanQuery.java:265)
       app//org.apache.lucene.search.IndexSearcher.createWeight(IndexSearcher.java:979)
       app//org.opensearch.search.internal.ContextIndexSearcher.createWeight(ContextIndexSearcher.java:239)
       app//org.apache.lucene.search.BooleanWeight.<init>(BooleanWeight.java:58)
       app//org.apache.lucene.search.BooleanQuery.createWeight(BooleanQuery.java:265)
       app//org.apache.lucene.search.IndexSearcher.createWeight(IndexSearcher.java:979)
       app//org.opensearch.search.internal.ContextIndexSearcher.createWeight(ContextIndexSearcher.java:239)

On further investigation, I found that doCache can sometimes be called many times per query when I would expect it to run once per request. The changes in this PR ensure that its called once per request per shard, stores the result in the threadcontext and uses that for subsequent clauses.

For instance, in the query below doCache would be called 5x per shard

{
  "query": {
    "bool": {
      "filter": [
        { "term": { "field_b": "other_0" } },
        { "term": { "field_b": "other_1" } },
        { "term": { "field_b": "other_2" } },
        { "term": { "field_b": "other_3" } },
        { "term": { "field_b": "other_4" } }
      ]
    }
  }
}

Category (Enhancement, New feature, Bug fix, Test fix, Refactoring, Maintenance, Documentation)

Bug fix

Issues Resolved

To be filed

Check List

New functionality includes testing
New functionality has been documented
New Roles/Permissions have a corresponding security dashboards plugin PR
API changes companion pull request created
Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Craig Perkins <cwperx@amazon.com>

codecov · 2026-03-11T02:28:03Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 73.81%. Comparing base (1d77e46) to head (0e8791a).
⚠️ Report is 5 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6001      +/-   ##
==========================================
- Coverage   73.82%   73.81%   -0.02%     
==========================================
  Files         439      439              
  Lines       27087    27104      +17     
  Branches     4018     4022       +4     
==========================================
+ Hits        19998    20007       +9     
- Misses       5180     5189       +9     
+ Partials     1909     1908       -1

Files with missing lines	Coverage Δ
.../opensearch/security/OpenSearchSecurityPlugin.java	`85.19% <100.00%> (+0.12%)`	⬆️

... and 10 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

nibix · 2026-03-12T17:43:29Z

src/main/java/org/opensearch/security/OpenSearchSecurityPlugin.java


                @Override
                public Weight doCache(Weight weight, QueryCachingPolicy policy) {
+                    if (!indexSettings.getValue(IndicesRequestCache.INDEX_CACHE_REQUEST_ENABLED_SETTING)) {


I am not entirely sure if this setting is the correct one here. The method is a about the query cache, but this setting is index cache specific.

Maybe we can approach the issue by adding a cache to hasRestrictedRulesExplicit() to avoid recurring rule calculations. I can look into this.

yea, I closed it because I'm not confident that I understand the different levels of caching deeply enough yet. I will re-open once I can take a deeper dive if it make sense.

In my testing, I'm finding that doCache can be called multiple times per query. For the example below it would be called 5x for each clause:

{ "query": { "bool": { "filter": [ { "term": { "field_b": "other_0" } }, { "term": { "field_b": "other_1" } }, { "term": { "field_b": "other_2" } }, { "term": { "field_b": "other_3" } }, { "term": { "field_b": "other_4" } } ] } } }

I think we can do some caching of the first result and short-circuit accordingly.

yea, i can take care of that.

Signed-off-by: Craig Perkins <cwperx@amazon.com>

Skip logic in doCache if index request cache is disabled

0672dad

Signed-off-by: Craig Perkins <cwperx@amazon.com>

cwperks requested review from DarshitChanpura, RyanL1997, derek-ho, nibix, reta, shikharj05 and willyborankin as code owners March 11, 2026 02:02

Add to CHANGELOG

f845f1c

Signed-off-by: Craig Perkins <cwperx@amazon.com>

cwperks closed this Mar 12, 2026

nibix reviewed Mar 12, 2026

View reviewed changes

Ensure that doCache is called once per request

df3b963

Signed-off-by: Craig Perkins <cwperx@amazon.com>

cwperks reopened this Mar 12, 2026

cwperks changed the title ~~Skip logic in doCache if index request cache is disabled~~ Only call doCache one per request Mar 12, 2026

cwperks closed this Mar 12, 2026

Use LogsRules for assertion

0e8791a

Signed-off-by: Craig Perkins <cwperx@amazon.com>

cwperks reopened this Mar 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only call doCache one per request#6001

Only call doCache one per request#6001
cwperks wants to merge 4 commits intoopensearch-project:mainfrom
cwperks:skip-do-cache

cwperks commented Mar 11, 2026 •

edited

Loading

Uh oh!

codecov bot commented Mar 11, 2026 •

edited

Loading

Uh oh!

nibix Mar 12, 2026

Uh oh!

cwperks Mar 12, 2026

Uh oh!

cwperks Mar 12, 2026 •

edited

Loading

Uh oh!

nibix Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cwperks commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Issues Resolved

Check List

Uh oh!

codecov bot commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

nibix Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

cwperks Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

cwperks Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nibix Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cwperks commented Mar 11, 2026 •

edited

Loading

codecov bot commented Mar 11, 2026 •

edited

Loading

cwperks Mar 12, 2026 •

edited

Loading