-
Notifications
You must be signed in to change notification settings - Fork 4
Docs updates #114
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Docs updates #114
Changes from all commits
Commits
Show all changes
10 commits
Select commit
Hold shift + click to select a range
ad63216
#15 - Add Spark socket wire benchmark notes
cliffburdick 33e52a9
#15 - Guard TCP socket connection reuse
cliffburdick beb699d
#15 - Update documentation graphics and benchmarking
cliffburdick a96a65e
Group benchmarking pages under docs/benchmarks/ and surface them as a
RamyaGuru 8a2f8a8
#15 - Repoint stale benchmarking_examples.md links after merge
RamyaGuru 1766e78
#15 - Drop Benchmarking Overview tile from landing page
RamyaGuru eecd4b7
#15 - Surface Bare-Metal CMake Build in top-nav Tutorials dropdowns
RamyaGuru 4ed1f5a
#15 - Refine landing graphic data path animation
cliffburdick 1db6a4e
#15 - Refine landing page graphics
cliffburdick 9fe967a
#15 - Remove generated PCIe schematic artifact
cliffburdick File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,49 @@ | ||
| # Benchmarking | ||
|
|
||
| DAQIRI ships with several backends to handle different types of incoming and outgoing streams. Choosing the stream type depends on the type of sensor being used and its capabilities. The `stream_type` is decided from the decision tree below: | ||
|
|
||
|  | ||
|
|
||
| ## Choose a backend | ||
|
|
||
| | Use case | DAQIRI config | Benchmark | Start here | | ||
| |---|---|---|---| | ||
| | Ingest from or egress to a programmable PCIe sensor, such as an FPGA on the PCIe bus. | `stream_type: "pcie"` | Coming soon | PCIe benchmarking docs are coming soon. | | ||
| | Compare against normal Linux networking, run on a non-NVIDIA NIC, or test a peer that speaks TCP/UDP sockets. | `stream_type: "socket"` with `protocol: "tcp"` or `protocol: "udp"` | `daqiri_bench_socket` | [Socket and RDMA Benchmarking](socket_benchmarking.md) | | ||
| | Test a peer that already implements RDMA verbs over RoCE. | `stream_type: "socket"` with `protocol: "roce"` | `daqiri_bench_rdma` | [Socket and RDMA Benchmarking](socket_benchmarking.md#run-the-rdma-roce-benchmark) | | ||
| | Drive raw Ethernet packets directly from an NVIDIA NIC under DAQIRI control. | `stream_type: "raw"` | `daqiri_bench_raw_gpudirect` and the other `raw_*` benches | [Raw Ethernet Benchmarking](raw_benchmarking.md) | | ||
|
|
||
| !!! note "PCIe backend status" | ||
|
|
||
| The PCIe programmable-sensor path is under development. Once completed it will allow 3rd party PCIe devices | ||
| to read from and write to the GPU's BAR1 memory. | ||
|
|
||
| !!! note "Why RDMA is listed under socket" | ||
|
|
||
| The RoCE benchmark uses the connection-oriented socket/RDMA configuration model. The executable is named `daqiri_bench_rdma` to show the RDMA-specific API calls. | ||
|
|
||
| ## Common benchmark workflow | ||
|
|
||
| 1. Build the examples with the backend you plan to test. The default container build enables all three: | ||
|
|
||
| ```bash | ||
| BASE_TARGET=dpdk DAQIRI_MGR="dpdk socket rdma" scripts/build-container.sh | ||
| ``` | ||
|
|
||
| 2. Pick the physical pair or host pair that should carry the traffic. For same-host Spark wire tests, prefer a client namespace and a server namespace so the route cannot silently fall back to loopback. | ||
|
cliffburdick marked this conversation as resolved.
|
||
|
|
||
| 3. Prove the direction with hardware counters before trusting bandwidth numbers. For one-way client-to-server tests, the important counters are the client-side `tx_packets_phy` / `tx_bytes_phy` and the server-side `rx_packets_phy` / `rx_bytes_phy`. | ||
|
|
||
| 4. Run the DAQIRI benchmark and a known baseline such as `iperf3` or `ib_send_bw` with the same namespace, interface, and message-size assumptions. | ||
|
|
||
| 5. Monitor line rate with NIC counters or `mlnx_perf`; application-side byte counts are useful, but hardware counters answer whether packets actually reached the physical path. | ||
|
|
||
| ## Page map | ||
|
|
||
| - [Socket and RDMA Benchmarking](socket_benchmarking.md) covers Linux TCP/UDP and RoCE/RDMA runs with matching client/server namespace setup. | ||
| - [Raw Ethernet Benchmarking](raw_benchmarking.md) covers the DPDK/raw Ethernet examples, hugepage sizing, physical loopback configuration, and raw benchmark troubleshooting. | ||
| - [Understanding the Configuration File](../tutorials/configuration-walkthrough.md) explains the YAML fields once you have selected the backend and example config. | ||
|
|
||
| --- | ||
| **Previous:** [System Configuration](../tutorials/system_configuration.md)<br> | ||
| **Next:** [Socket and RDMA Benchmarking](socket_benchmarking.md) | ||
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am not sure I can reach that page
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can get it from my local copy