Code challenge solution #374

nurtoltor · 2026-01-26T20:00:25Z

Solution

I built a small carousel scraper using Nokogiri and regex. It finds the best Knowledge Graph carousel container by looking for data-attrid sections and links that include stick=, then extracts fields for each item. I prioritised semantic HTML (role, aria-label, alt, title) over class names. It also skips "show more" items and only includes images already present in the HTML (data:image, encrypted-tbn, or knowledgecard icons).

The output is a hash where the key matches the search results selected tab (e.g., artworks, cast, albums). If no tab is selected, it defaults to results.

Structure

lib/carousel_scraper.rb: Orchestrates the extraction and chooses the correct carousel scope.
lib/carousel_item_extractor.rb: Extracts name, extensions, link, and image from a single item link.

I tested against 3 other result pages to find common patterns:

"David Bowie albums" search: files/david-bowie-albums.html
"George Orwell books" search: files/george-orwell-books.html
"Lord of the Rings cast" search: files/lord-of-the-rings-cast.html

How to run

Install dependencies:

bundle install

Run with the default Van Gogh paintings HTML (outputs to files/van-gogh-paintings-expected-array.json):

ruby main.rb

Run with a specific HTML file (outputs JSON to the same directory):

ruby main.rb files/david-bowie-albums.html
ruby main.rb files/george-orwell-books.html
ruby main.rb files/lord-of-the-rings-cast.html

Run the tests:

bundle exec rspec

Initial setup: add basic gems and html carrousel examples

a0e2173

nurtoltor changed the title ~~Initial setup: add basic gems and html carrousel examples~~ Code challenge solution Jan 26, 2026

nurtoltor added 4 commits January 26, 2026 23:21

Add carousel scraper and script to run it

d17e935

Add carousel scraper tests

896ccff

Update README with my solution and steps to run the program

a3df2a7

Write output to JSON file instead of printing to the console

7112cf4

nurtoltor marked this pull request as ready for review January 26, 2026 23:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code challenge solution #374

Code challenge solution #374

Uh oh!

nurtoltor commented Jan 26, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Code challenge solution #374

Are you sure you want to change the base?

Code challenge solution #374

Uh oh!

Conversation

nurtoltor commented Jan 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Solution

Structure

How to run

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

nurtoltor commented Jan 26, 2026 •

edited

Loading