Optimize artifact lookup in DefaultProjectDependencyAnalyzer by sanjana2505006 · Pull Request #270 · apache/maven-dependency-analyzer

sanjana2505006 · 2026-02-02T18:34:08Z

While reviewing the core analysis logic, I noticed a performance bottleneck in DefaultProjectDependencyAnalyzer. \n\nPreviously, it used a linear search to find which artifact a class belongs to ($O(N)$ for every class usage), which could be a bottleneck in large projects with many dependencies. These changes switch to a Map-based lookup ($O(1)$) and modernize the code using computeIfAbsent and more robust scope checks.\n\nThis significantly improves the efficiency of the analysis process for projects with large dependency trees.

…pendencyAnalyzer

sanjana2505006 · 2026-02-02T18:38:27Z

Hello @slawekjaranowski, I’ve opened a PR with a small improvement I noticed.
Whenever you have time, I’d appreciate your thoughts on whether this approach makes sense.
Thank you!

sanjana2505006 · 2026-02-24T12:03:15Z

Hello @slawekjaranowski, just a gentle follow-up on this PR. Whenever you have time, could you please share your feedback or let me know if any further changes are needed from my side? Happy to update anything required. Thank you

Copilot

Pull request overview

This PR optimizes core dependency analysis by replacing per-class linear artifact lookup with a precomputed class-to-artifact map, improving performance for projects with large dependency graphs.

Changes:

Build a Map<String, Artifact> index (buildClassToArtifactMap) for O(1) artifact lookup by class name.
Update used-artifact computation to use the new map and modernize collection population via computeIfAbsent.
Minor robustness/style updates (scope constant, formatting).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-26T20:52:35Z

src/main/java/org/apache/maven/shared/dependency/analyzer/DefaultProjectDependencyAnalyzer.java

-                return entry.getKey();
+            Artifact artifact = entry.getKey();
+            for (String className : entry.getValue()) {
+                classToArtifactMap.put(className, artifact);


buildClassToArtifactMap overwrites any existing mapping when the same class name appears in multiple artifacts. Previously the lookup returned the first matching artifact encountered in artifactClassMap iteration order; with put(...) the last artifact wins, which can change results (e.g., shaded/duplicate classes) and cause incorrect used/unused dependency reporting. Preserve the previous behavior (e.g., putIfAbsent to keep the first mapping) or explicitly detect duplicates and handle/report them deterministically.

Suggested change

classToArtifactMap.put(className, artifact);

classToArtifactMap.putIfAbsent(className, artifact);

@sanjana2505006 please look for this one

I've updated it to putIfAbsent in the latest commit to preserve the original first-match-wins semantics. Thank you for flagging this, @slawekjaranowski!

Copilot · 2026-03-26T20:52:36Z

src/main/java/org/apache/maven/shared/dependency/analyzer/DefaultProjectDependencyAnalyzer.java

    private static Map<Artifact, Set<DependencyUsage>> buildUsedArtifacts(
-            Map<Artifact, Set<String>> artifactClassMap, Set<DependencyUsage> dependencyClasses) {
+            Map<String, Artifact> classToArtifactMap, Set<DependencyUsage> dependencyClasses) {
        Map<Artifact, Set<DependencyUsage>> usedArtifacts = new HashMap<>();

        for (DependencyUsage classUsage : dependencyClasses) {
-            Artifact artifact = findArtifactForClassName(artifactClassMap, classUsage.getDependencyClass());
+            Artifact artifact = classToArtifactMap.get(classUsage.getDependencyClass());

            if (artifact != null && !includedInJDK(artifact)) {
-                Set<DependencyUsage> classesFromArtifact = usedArtifacts.get(artifact);
-                if (classesFromArtifact == null) {
-                    classesFromArtifact = new HashSet<>();
-                    usedArtifacts.put(artifact, classesFromArtifact);
-                }
-                classesFromArtifact.add(classUsage);
+                usedArtifacts.computeIfAbsent(artifact, k -> new HashSet<>()).add(classUsage);
            }
        }


There are no unit tests covering the new class-to-artifact resolution path (including duplicate-class scenarios and ensuring the chosen artifact is deterministic). Adding focused tests around buildUsedArtifacts/buildClassToArtifactMap would help prevent regressions in dependency classification as this is core analysis logic.

That's a fair point. I'll add focused unit tests covering the new class-to-artifact resolution path, including duplicate-class scenarios, to ensure the chosen artifact is deterministic. Will update it.

…mantics The original findArtifactForClassName returned the first matching artifact during iteration. Using put() overwrites previous entries, making the last artifact win instead. putIfAbsent preserves the first-match-wins behavior, preventing incorrect used/unused dependency reporting for shaded/duplicate classes.

Optimize artifact lookup and improve code quality in DefaultProjectDe…

90cdf73

…pendencyAnalyzer

slawekjaranowski requested review from Copilot and slawekjaranowski March 20, 2026 22:02

Copilot started reviewing on behalf of slawekjaranowski March 26, 2026 20:50 View session

slawekjaranowski added the enhancement New feature or request label Mar 26, 2026

Copilot AI reviewed Mar 26, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize artifact lookup in DefaultProjectDependencyAnalyzer#270

Optimize artifact lookup in DefaultProjectDependencyAnalyzer#270
sanjana2505006 wants to merge 2 commits intoapache:masterfrom
sanjana2505006:perf/optimize-artifact-lookup

sanjana2505006 commented Feb 2, 2026 •

edited

Loading

Uh oh!

sanjana2505006 commented Feb 2, 2026

Uh oh!

sanjana2505006 commented Feb 24, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 26, 2026

Uh oh!

slawekjaranowski Mar 26, 2026

Uh oh!

sanjana2505006 Mar 27, 2026

Uh oh!

Copilot AI Mar 26, 2026

Uh oh!

sanjana2505006 Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	classToArtifactMap.put(className, artifact);
	classToArtifactMap.putIfAbsent(className, artifact);

Conversation

sanjana2505006 commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sanjana2505006 commented Feb 2, 2026

Uh oh!

sanjana2505006 commented Feb 24, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

slawekjaranowski Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

sanjana2505006 Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

sanjana2505006 Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sanjana2505006 commented Feb 2, 2026 •

edited

Loading