Reduce Microsoft Graph API calls during large parallel applies using a shared list cache#101
Conversation
Reduces GET /collection calls during large parallel applies by sharing a single cached response across concurrent resource operations.
Replaces individual GET /resource/{id} calls with shared collection fetches. Removes WaitForDeletion and WaitForUpdate after PATCH/PUT.
@microsoft-github-policy-service agree company="Die Schweizerische Post AG"
Would this now call e.g. "GET /users" and potentially load thousands of users or more, depending on the size of the directory, even if only a few users are managed by Terraform? Fetching entire collections without the possibility to set filters doesn't seem like a good solution to me.
Yes, I see your concerns and appreciate the thorough review. The fundamental problem is the hard limit of 1500 requests per hour imposed by the Graph API. The current main branch implementation makes a minimum of 5 API calls per resource per apply, because WaitForUpdate and WaitForDeletion both use ContinuousTargetOccurence: 3 in the consistency polling logic. For example, creating 200 resources already costs 200 × POST /collection = 200 calls before any polling begins. I can confirm that in practice I was unable to apply a set of 200 resources using the main branch version, especially when several resources were already deployed. I see your point regarding filtering; that would be a worthwhile improvement as a next step to address the large-tenant case.
Problem
When running terraform apply with many resources (e.g. 200+ group members or detection rules), the provider issues one GET /resource/{id} request per resource during Create, Update, Read, and the post-create consistency polling loop. This quickly exhausts Microsoft Graph API rate limits and results in long apply times or throttling errors.
Solution
Instead of fetching each resource individually, the provider now calls GET /{collection} once and finds the item by ID in the response. A short-lived (10 s) in-memory cache ensures that concurrent resource operations share a single collection fetch per TTL window rather than each firing their own request.
To prevent a thundering-herd when many resources miss the cache simultaneously, an in-flight deduplication mechanism ensures only one goroutine issues the GET /{collection} request — all others wait on that result.
Changes
MSGraphClient gains a cachedList method (cache + in-flight dedup), ReadFromList (single fetch, find by ID), and ReadFromListWithWait (polls until item appears, used after Create).
All write operations (Create, Update, Delete, Action) invalidate the relevant cache entry immediately after success so the next read always sees fresh data.
Create uses ReadFromListWithWait to confirm visibility without per-resource polling loops.
Update uses a plain ReadFromList for the final state read — no polling needed since the item already exists.
Read uses ReadFromList instead of a direct GET /resource/{id}.
Delete no longer calls WaitForDeletion — a successful 204 No Content response is sufficient.
Update no longer calls WaitForUpdate — the cache invalidation + single collection fetch is enough.
Testing
Validated with a real-tenant apply of 700+ resources. API call volume dropped significantly; deployments of that size had not been possible before.
Remarks
This approach significantly reduces Graph API calls in large deployments. I'm happy to adapt the implementation if maintainers prefer a different caching strategy.