HADOOP-1593. [ABFS] Add vectored read support in ABFS driver by anmolanmol1234 · Pull Request #8400 · apache/hadoop

anmolanmol1234 · 2026-04-02T17:41:52Z

This PR introduces vectored read support in the Azure Blob File System (ABFS) driver to improve read performance for workloads that issue multiple small, non-contiguous read requests.

Vectored reads batch multiple read ranges into fewer network calls, reducing request overhead and improving throughput. This is especially beneficial for analytics engines such as Spark.

The current ABFS read implementation performs a sequential, independent read operation for each requested range. This leads to:
- An increased number of network calls
- Higher latency for small or random reads
- Inefficient use of bandwidth

Vectored I/O addresses these issues by coalescing multiple read requests into one or a few backend calls.
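To make the coalescing idea concrete, here is a minimal, self-contained sketch of how nearby byte ranges could be merged before issuing backend calls. It is not the code in this PR: the class `RangeCoalescer`, the `Range` type, and the `maxGap` and `maxMergedSize` parameters are hypothetical names chosen for illustration, and the real driver may use different thresholds and data structures.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical sketch: merges nearby byte ranges so one backend call can serve several reads. */
final class RangeCoalescer {

  /** A half-open byte range [offset, offset + length). */
  static final class Range {
    final long offset;
    final int length;

    Range(long offset, int length) {
      this.offset = offset;
      this.length = length;
    }

    long end() {
      return offset + length;
    }

    @Override
    public String toString() {
      return "[" + offset + ", " + end() + ")";
    }
  }

  /**
   * Sorts the ranges by offset and merges neighbours whose gap is at most
   * maxGap bytes, as long as the merged range stays within maxMergedSize bytes.
   * Both limits are assumptions made for this sketch.
   */
  static List<Range> coalesce(List<Range> ranges, long maxGap, int maxMergedSize) {
    List<Range> sorted = new ArrayList<>(ranges);
    sorted.sort(Comparator.comparingLong((Range r) -> r.offset));

    List<Range> merged = new ArrayList<>();
    Range current = null;
    for (Range next : sorted) {
      if (current == null) {
        current = next;
        continue;
      }
      long gap = next.offset - current.end();
      long mergedEnd = Math.max(current.end(), next.end());
      long mergedLength = mergedEnd - current.offset;
      if (gap <= maxGap && mergedLength <= maxMergedSize) {
        // Close enough: extend the current range to cover the next one.
        current = new Range(current.offset, (int) mergedLength);
      } else {
        // Too far apart or too large: emit the current range and start a new one.
        merged.add(current);
        current = next;
      }
    }
    if (current != null) {
      merged.add(current);
    }
    return merged;
  }

  public static void main(String[] args) {
    List<Range> requested = List.of(
        new Range(0, 100), new Range(150, 50), new Range(10_000, 200), new Range(220, 30));
    // Four requested ranges collapse into two backend reads.
    System.out.println(coalesce(requested, 64, 4096));
  }
}
```

With a 64-byte gap limit, the three ranges near the start of the file merge into one read and the distant range stays separate, so four requests become two calls. A caller would then slice each originally requested range out of the merged buffer; the cost is a few extra bytes read in the gaps.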


hadoop-yetus · 2026-04-02T19:57:37Z

🎊 +1 overall

Vote	Subsystem	Runtime	Logfile	Comment
+0 🆗	reexec	0m 53s		Docker mode activated.
			_ Prechecks _
+1 💚	dupname	0m 0s		No case conflicting files found.
+0 🆗	codespell	0m 1s		codespell was not available.
+0 🆗	detsecrets	0m 1s		detect-secrets was not available.
+1 💚	@author	0m 0s		The patch does not contain any @author tags.
+1 💚	test4tests	0m 0s		The patch appears to include 2 new or modified test files.
			_ trunk Compile Tests _
+1 💚	mvninstall	48m 13s		trunk passed
+1 💚	compile	1m 1s		trunk passed with JDK Ubuntu-21.0.10+7-Ubuntu-124.04
+1 💚	compile	1m 4s		trunk passed with JDK Ubuntu-17.0.18+8-Ubuntu-124.04.1
+1 💚	checkstyle	0m 58s		trunk passed
+1 💚	mvnsite	1m 7s		trunk passed
+1 💚	javadoc	0m 59s		trunk passed with JDK Ubuntu-21.0.10+7-Ubuntu-124.04
+1 💚	javadoc	0m 58s		trunk passed with JDK Ubuntu-17.0.18+8-Ubuntu-124.04.1
+1 💚	spotbugs	1m 35s		trunk passed
+1 💚	shadedclient	34m 39s		branch has no errors when building and testing our client artifacts.
-0 ⚠️	patch	35m 11s		Used diff version of patch file. Binary files and potentially other changes not applied. Please rebase and squash commits if necessary.
			_ Patch Compile Tests _
+1 💚	mvninstall	0m 37s		the patch passed
+1 💚	compile	0m 32s		the patch passed with JDK Ubuntu-21.0.10+7-Ubuntu-124.04
+1 💚	javac	0m 32s		the patch passed
+1 💚	compile	0m 35s		the patch passed with JDK Ubuntu-17.0.18+8-Ubuntu-124.04.1
+1 💚	javac	0m 35s		the patch passed
+1 💚	blanks	0m 0s		The patch has no blanks issues.
-0 ⚠️	checkstyle	0m 25s	/results-checkstyle-hadoop-tools_hadoop-azure.txt	hadoop-tools/hadoop-azure: The patch generated 10 new + 5 unchanged - 0 fixed = 15 total (was 5)
+1 💚	mvnsite	0m 39s		the patch passed
+1 💚	javadoc	0m 29s		the patch passed with JDK Ubuntu-21.0.10+7-Ubuntu-124.04
+1 💚	javadoc	0m 29s		the patch passed with JDK Ubuntu-17.0.18+8-Ubuntu-124.04.1
+1 💚	spotbugs	1m 17s		the patch passed
+1 💚	shadedclient	32m 56s		patch has no errors when building and testing our client artifacts.
			_ Other Tests _
+1 💚	unit	2m 10s		hadoop-azure in the patch passed.
+1 💚	asflicense	0m 35s		The patch does not generate ASF License warnings.
		134m 18s

Subsystem	Report/Notes
Docker	ClientAPI=1.54 ServerAPI=1.54 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-8400/1/artifact/out/Dockerfile
GITHUB PR	#8400
Optional Tests	dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient spotbugs checkstyle codespell detsecrets
uname	Linux bec554755465 5.15.0-173-generic #183-Ubuntu SMP Fri Mar 6 13:29:34 UTC 2026 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	dev-support/bin/hadoop.sh
git revision	trunk / `437ffc8`
Default Java	Ubuntu-17.0.18+8-Ubuntu-124.04.1
Multi-JDK versions	/usr/lib/jvm/java-21-openjdk-amd64:Ubuntu-21.0.10+7-Ubuntu-124.04 /usr/lib/jvm/java-17-openjdk-amd64:Ubuntu-17.0.18+8-Ubuntu-124.04.1
Test Results	https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-8400/1/testReport/
Max. process+thread count	574 (vs. ulimit of 10000)
modules	C: hadoop-tools/hadoop-azure U: hadoop-tools/hadoop-azure
Console output	https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-8400/1/console
versions	git=2.43.0 maven=3.9.11 spotbugs=4.9.7
Powered by	Apache Yetus 0.14.1 https://yetus.apache.org

This message was automatically generated.

hadoop-yetus · 2026-04-02T22:32:54Z

🎊 +1 overall

Vote	Subsystem	Runtime	Logfile	Comment
+0 🆗	reexec	0m 52s		Docker mode activated.
			_ Prechecks _
+1 💚	dupname	0m 1s		No case conflicting files found.
+0 🆗	codespell	0m 0s		codespell was not available.
+0 🆗	detsecrets	0m 0s		detect-secrets was not available.
+1 💚	@author	0m 0s		The patch does not contain any @author tags.
+1 💚	test4tests	0m 0s		The patch appears to include 2 new or modified test files.
			_ trunk Compile Tests _
+1 💚	mvninstall	47m 14s		trunk passed
+1 💚	compile	1m 1s		trunk passed with JDK Ubuntu-21.0.10+7-Ubuntu-124.04
+1 💚	compile	1m 3s		trunk passed with JDK Ubuntu-17.0.18+8-Ubuntu-124.04.1
+1 💚	checkstyle	0m 56s		trunk passed
+1 💚	mvnsite	1m 7s		trunk passed
+1 💚	javadoc	0m 59s		trunk passed with JDK Ubuntu-21.0.10+7-Ubuntu-124.04
+1 💚	javadoc	0m 57s		trunk passed with JDK Ubuntu-17.0.18+8-Ubuntu-124.04.1
+1 💚	spotbugs	1m 36s		trunk passed
+1 💚	shadedclient	34m 1s		branch has no errors when building and testing our client artifacts.
-0 ⚠️	patch	34m 33s		Used diff version of patch file. Binary files and potentially other changes not applied. Please rebase and squash commits if necessary.
			_ Patch Compile Tests _
+1 💚	mvninstall	0m 37s		the patch passed
+1 💚	compile	0m 32s		the patch passed with JDK Ubuntu-21.0.10+7-Ubuntu-124.04
+1 💚	javac	0m 32s		the patch passed
+1 💚	compile	0m 34s		the patch passed with JDK Ubuntu-17.0.18+8-Ubuntu-124.04.1
+1 💚	javac	0m 34s		the patch passed
+1 💚	blanks	0m 0s		The patch has no blanks issues.
-0 ⚠️	checkstyle	0m 25s	/results-checkstyle-hadoop-tools_hadoop-azure.txt	hadoop-tools/hadoop-azure: The patch generated 1 new + 5 unchanged - 0 fixed = 6 total (was 5)
+1 💚	mvnsite	0m 39s		the patch passed
+1 💚	javadoc	0m 28s		the patch passed with JDK Ubuntu-21.0.10+7-Ubuntu-124.04
+1 💚	javadoc	0m 29s		the patch passed with JDK Ubuntu-17.0.18+8-Ubuntu-124.04.1
+1 💚	spotbugs	1m 16s		the patch passed
+1 💚	shadedclient	33m 33s		patch has no errors when building and testing our client artifacts.
			_ Other Tests _
+1 💚	unit	2m 8s		hadoop-azure in the patch passed.
+1 💚	asflicense	0m 35s		The patch does not generate ASF License warnings.
		133m 20s

Subsystem	Report/Notes
Docker	ClientAPI=1.54 ServerAPI=1.54 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-8400/2/artifact/out/Dockerfile
GITHUB PR	#8400
Optional Tests	dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient spotbugs checkstyle codespell detsecrets
uname	Linux ad47bd66149a 5.15.0-173-generic #183-Ubuntu SMP Fri Mar 6 13:29:34 UTC 2026 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	dev-support/bin/hadoop.sh
git revision	trunk / `975bf73`
Default Java	Ubuntu-17.0.18+8-Ubuntu-124.04.1
Multi-JDK versions	/usr/lib/jvm/java-21-openjdk-amd64:Ubuntu-21.0.10+7-Ubuntu-124.04 /usr/lib/jvm/java-17-openjdk-amd64:Ubuntu-17.0.18+8-Ubuntu-124.04.1
Test Results	https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-8400/2/testReport/
Max. process+thread count	586 (vs. ulimit of 10000)
modules	C: hadoop-tools/hadoop-azure U: hadoop-tools/hadoop-azure
Console output	https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-8400/2/console
versions	git=2.43.0 maven=3.9.11 spotbugs=4.9.7
Powered by	Apache Yetus 0.14.1 https://yetus.apache.org

This message was automatically generated.

anmolanmol1234 added 22 commits January 16, 2026 04:30

vectored read config changes

8c28b18

Vectored read code

08617b7

Fix tests

95cbb73

Made changes for inprogress list

d03d3cd

Merge branch 'trunk' of https://github.com/anmolanmol1234/hadoop into…

1e62df3

… HADOOP-15963_poc

Merge conflicts

3f36997

Checkstyle fix

c4313e9

Merge branch 'trunk' of https://github.com/anmolanmol1234/hadoop into…

5043c90

… HADOOP-15963_poc

Fix checkstyle

0106fc1

Fix checkstyle

1430b4a

Add explanations

0927ca1

Add debug log statements

08ca94a

Checkstyle fixes

02834ef

Merge branch 'trunk' of https://github.com/anmolanmol1234/hadoop into…

64e5da6

… HADOOP-15963_poc

Merge branch 'trunk' of https://github.com/anmolanmol1234/hadoop into…

eb52c29

… HADOOP-15963_poc

fix null issue

1ea571e

Fix vectored read

279e7f4

Merge branch 'trunk' of https://github.com/anmolanmol1234/hadoop into…

7de0740

… HADOOP-15963_poc

Merge branch 'trunk' of https://github.com/anmolanmol1234/hadoop into…

7c46352

… HADOOP-15963_poc

range validation fixes

25fa821

fix issues

5b2632a

Fix checkstyle

437ffc8

github-actions bot added trunk TOOLS ABFS labels Apr 2, 2026

fix checkstyle

975bf73

mukund-thakur self-requested a review April 2, 2026 21:20

anmolanmol1234 wants to merge 23 commits into apache:trunk from anmolanmol1234:HADOOP-15963

2 participants
