Commit bab7cfd

update Comparison
1 parent 406083a commit bab7cfd

9 files changed

Lines changed: 164 additions & 45 deletions

index.md
@@ -16,13 +16,14 @@ affiliations: |
 <sup>2</sup>Shanghai Jiao Tong University<br>
 <sup>3</sup>Key Laboratory of Target Cognition and Application Technology, Aerospace Information Research Institute, Chinese Academy of Sciences<br>
 <sup>4</sup>University of Science and Technology of China
-paper: https://aaa
-# video: https://aaa
+paper: https://spatialretrievalad.github.io/
 code: https://github.com/SpatialRetrievalAD/
 data: https://huggingface.co/datasets/SpatialRetrievalAD/nuScenes-Geography-Data
 
 ---
 
+<br>
+
 <div class="columns is-centered has-text-centered video-container">
 <div class="column is-four-fifths">
 <!-- <h2>SpatialRetrievalAD Summary Video</h2> -->
@@ -145,7 +146,11 @@ data: https://huggingface.co/datasets/SpatialRetrievalAD/nuScenes-Geography-Data
 
 
 
-<!-- Data Example -->
+
+
+
+
+<!-- Dataset Visualization -->
 <div class="columns is-centered has-text-centered">
 <div class="column is-four-fifths">
 <h2>Dataset Visualization</h2>
@@ -181,6 +186,162 @@ data: https://huggingface.co/datasets/SpatialRetrievalAD/nuScenes-Geography-Data
 </div>
 </div>
 
+
+
+
+
+
+<!-- Quantitative Comparison -->
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<h2>Comparison</h2>
+<div class="content has-text-justified">
+</div>
+</div>
+</div>
+
+
+
+<!-- Generative World Model Results -->
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<div class="content has-text-justified">
+<p><strong>Generative World Model Results.</strong> Conditioning UniMLVG and MagicDriveDiT on geographic images leads to lower FVD and FID, effectively preventing scene drift and preserving geometric consistency during rollouts. This demonstrates that spatial retrieval provides a structural scaffold for coherent world modeling.</p>
+</div>
+</div>
+</div>
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<div class="content">
+<img src="./static/image/result/result_generative.jpg" alt="generative" style="width: 65%;">
+</div>
+</div>
+</div>
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 3rem;">
+<div class="column is-four-fifths">
+<div class="content">
+<img src="./static/image/visualization/visualization_generative.jpg" alt="visualization_generative" style="width: 100%;">
+</div>
+</div>
+</div>
+
+
+
+
+
+<!-- Online Mapping Results -->
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<div class="content has-text-justified">
+<p><strong>Online Mapping Results.</strong> Integrating geographic priors into MapTR and MapTRv2 substantially improves online mapping. The extra background information enables recovery of occluded lanes.</p>
+</div>
+</div>
+</div>
+
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<div class="content">
+<img src="./static/image/result/result_mapping.jpg" alt="mapping" style="width: 100%;">
+</div>
+</div>
+</div>
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 3rem;">
+<div class="column is-four-fifths">
+<div class="content">
+<img src="./static/image/visualization/visualization_mapping.jpg" alt="visualization_mapping" style="width: 100%;">
+</div>
+</div>
+</div>
+
+
+
+<!-- Occupancy Results -->
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<div class="content has-text-justified">
+<p><strong>Occupancy Results.</strong> Extending FB-OCC and FlashOCC yields consistent mIoU improvements, particularly on static categories. The incorporation of geographic priors further boosts mIoU on static terrain, as they provide additional background information.</p>
+</div>
+</div>
+</div>
+
+
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<div class="content">
+<img src="./static/image/result/result_occupancy.jpg" alt="occupancy" style="width: 100%;">
+</div>
+</div>
+</div>
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 3rem;">
+<div class="column is-four-fifths">
+<div class="content">
+<img src="./static/image/visualization/visualization_occupancy.jpg" alt="visualization_occupancy" style="width: 100%;">
+</div>
+</div>
+</div>
+
+
+
+
+<!-- End-to-end Planning Results -->
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<div class="content has-text-justified">
+<p><strong>End-to-end Planning Results.</strong> We evaluate how spatial retrieval improves safe planning with VAD. Geographic priors provide stable road-layout information, compensating for sensing failures under occlusion or low light. With similar trajectory accuracy, our method achieves better safety margins, reducing the collision rate from 0.55% to 0.48% in challenging night scenes.</p>
+</div>
+</div>
+</div>
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 0rem;">
+<div class="column is-four-fifths">
+<div class="content">
+<img src="./static/image/result/result_planning.jpg" alt="planning" style="width: 100%;">
+</div>
+</div>
+</div>
+
+<div class="columns is-centered has-text-centered" style="margin-bottom: 3rem;">
+<div class="column is-four-fifths">
+<div class="content">
+<img src="./static/image/visualization/visualization_planning.jpg" alt="visualization_planning" style="width: 70%;">
+</div>
+</div>
+</div>
+
+
+
+
+
+
+
+
+<!-- Conclusion -->
+<div class="columns is-centered has-text-centered">
+<div class="column is-four-fifths">
+<h2>Conclusion</h2>
+<div class="content has-text-justified">
+&emsp;&emsp;In this work, we present the spatial retrieval paradigm for AD, introducing geographic data as an additional input. We extend nuScenes with geographic data by Google Maps APIs and evaluate five key AD tasks on the extended <a href="https://huggingface.co/datasets/SpatialRetrievalAD/nuScenes-Geography-Data" target="_blank">nuScenes-Geography</a> dataset. We propose a general plug-and-play Spatial Retrieval Adapter module as an intuitive baseline to incorporate geographic data. We propose Reliability Estimation to adaptively fuse geographic information based on the reliability of the retrieved data. Extensive experiments show that the proposed paradigm can enhance the performance of multiple AD tasks, demonstrating the substantial potential of the new paradigm.
+</div>
+</div>
+</div>
+
+
+
+
+
+
+
+
 <!-- Citation Section -->
 <div class="columns is-centered has-text-centered" style="margin-bottom: 3rem;">
 <div class="column is-four-fifths">
@@ -224,45 +385,3 @@ data: https://huggingface.co/datasets/SpatialRetrievalAD/nuScenes-Geography-Data
 
 
 
-
-
-
-
-<!-- > Note: This is an example of a Jekyll-based project website template: [Github link](https://github.com/shunzh/project_website).\
-> The following content is generated by ChatGPT. The figure is manually added. -->
-
-<!-- ## Background
-The paper "On Computable Numbers, with an Application to the Entscheidungsproblem" was published by Alan Turing in 1936. In this groundbreaking paper, Turing introduced the concept of a universal computing machine, now known as the Turing machine.
-
-## Objective
-Turing's main objective in this paper was to investigate the notion of computability and its relation to the Entscheidungsproblem (the decision problem), which is concerned with determining whether a given mathematical statement is provable or not.
-
-
-## Key Ideas
-1. Turing first presented the concept of a "computable number," which refers to a number that can be computed by an algorithm or a definite step-by-step process.
-2. He introduced the notion of a Turing machine, an abstract computational device consisting of an infinite tape divided into cells and a read-write head. The machine can read and write symbols on the tape, move the head left or right, and transition between states based on a set of rules.
-3. Turing demonstrated that the set of computable numbers is enumerable, meaning it can be listed in a systematic way, even though it is not necessarily countable.
-4. He proved the existence of non-computable numbers, which cannot be computed by any Turing machine.
-5. Turing showed that the Entscheidungsproblem is undecidable, meaning there is no algorithm that can determine, for any given mathematical statement, whether it is provable or not.
-
-![Turing Machine](/static/image/Turing_machine.png)
-
-*Figure 1: A representation of a Turing Machine. Source: [Wiki](https://en.wikipedia.org/wiki/Turing_machine).*
-
-## Table: Comparison of Computable and Non-Computable Numbers
-
-| Computable Numbers | Non-Computable Numbers |
-|-------------------|-----------------------|
-| Rational numbers, e.g., 1/2, 3/4 | Transcendental numbers, e.g., π, e |
-| Algebraic numbers, e.g., √2, ∛3 | Non-algebraic numbers, e.g., √2 + √3 |
-| Numbers with finite decimal representations | Numbers with infinite, non-repeating decimal representations |
-
-He used the concept of a universal Turing machine to prove that the set of computable functions is recursively enumerable, meaning it can be listed by an algorithm.
-
-## Significance
-Turing's paper laid the foundation for the theory of computation and had a profound impact on the development of computer science. The Turing machine became a fundamental concept in theoretical computer science, serving as a theoretical model for studying the limits and capabilities of computation. Turing's work also influenced the development of programming languages, algorithms, and the design of modern computers.
-
-## Citation
-```
-
-``` -->