Desent Solutions – Developer Assessment

Website Archiver Tool

Build a tool that archives any website in its current state — including all assets and server-side API responses — so the archived version works even when the original site is completely offline, similar to the Wayback Machine.

Get Started

Download the test website package. It includes a Docker-based sample site with static content, images, and API endpoints you can spin up and tear down to verify your archiver works.

Download Test Website (ZIP)

Contains: Docker setup, Express server, HTML/CSS/JS, and image generator script

Requirements

Must Have (Core)

  • Accept a URL as input and crawl the full page
  • Download all assets: HTML, CSS, images, JS, fonts, media
  • Intercept and cache API calls — capture XHR/fetch requests server-side and replay their responses in the archive
  • Serve the archived version via a local HTTP server
  • The archived site must work 100% offline
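The last two core requirements can be met with a single small server: static assets are served from the archive directory, and any request matching a captured API route is answered from the cache instead. A minimal Python sketch, assuming captured responses were saved as JSON files keyed by a sanitized request path (the directory names `archive/` and `api_cache/` are illustrative, not part of the spec):

```python
# Sketch: serve an archived site fully offline. Static files come from
# ARCHIVE_DIR; captured API responses are replayed from API_CACHE.
# Query strings would also need sanitizing in a real implementation.
import http.server
from pathlib import Path

ARCHIVE_DIR = Path("archive")    # downloaded HTML/CSS/JS/images (assumption)
API_CACHE = Path("api_cache")    # captured API responses (assumption)

def cache_key(path: str) -> str:
    """Turn a request path like /api/reviews/3 into a safe filename."""
    return path.strip("/").replace("/", "__") + ".json"

class ArchiveHandler(http.server.SimpleHTTPRequestHandler):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, directory=str(ARCHIVE_DIR), **kwargs)

    def do_GET(self):
        cached = API_CACHE / cache_key(self.path)
        if cached.is_file():
            body = cached.read_bytes()
            self.send_response(200)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)
        else:
            super().do_GET()  # fall through to archived static files

# To serve the archive locally:
# http.server.ThreadingHTTPServer(("localhost", 8080), ArchiveHandler).serve_forever()
```

Because the handler checks the API cache before the file system, parameterized routes like /api/reviews/3 replay correctly as long as each captured URL maps to its own cache file.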

Should Have

  • Multi-page archiving (crawl depth ≥ 1)
  • Simple web UI or CLI to trigger archiving and browse archives
  • Metadata storage (archive date, original URL, page title)

Nice to Have (Bonus)

  • Client-side rendering support (SPA/React/Vue) via headless browser
  • Scheduled re-archiving
  • Diff view between archive versions
  • Export as .warc or .zip
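For the .zip export bonus, the standard library already does most of the work; a WARC export would need a dedicated library or manual record writing. A minimal sketch (all paths below are examples):

```python
# Sketch: pack a snapshot directory into a .zip for the export bonus.
import shutil
import tempfile
from pathlib import Path

def export_zip(archive_dir: str, out_base: str) -> str:
    """Pack archive_dir into <out_base>.zip and return the zip's path."""
    return shutil.make_archive(out_base, "zip", root_dir=archive_dir)

# Demo with a throwaway directory standing in for a real archive:
with tempfile.TemporaryDirectory() as tmp:
    snap = Path(tmp) / "snapshot"
    snap.mkdir()
    (snap / "index.html").write_text("<h1>archived</h1>")
    zip_path = export_zip(str(snap), str(Path(tmp) / "snapshot-export"))
    print(Path(zip_path).name)  # snapshot-export.zip
```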

How to Test

# 1. Unzip and start the test website

cd archiver-test-website

node create-images.js

docker-compose up -d

# 2. Archive the site while it's running

your-tool archive http://localhost:3777

# 3. Shut down the original

docker-compose down

# 4. Serve the archive and verify everything works

your-tool serve

What the Test Site Covers

Feature                      | What to Verify
Static HTML + CSS            | Layout and styles render correctly
Multiple images (SVG)        | All images load from the archive
Google Fonts (external CDN)  | Fonts display correctly offline
Fetch API → /api/products    | Product list appears without the server
Fetch API → /api/stats       | Stats bar shows data offline
Fetch API → /api/reviews/:id | Parameterized API responses are cached
JavaScript interactivity     | Click handlers work in the archive
CSS background-image url()   | Background pattern renders
Responsive design            | Media queries still apply

Tech Stack

Free choice — but justify your decisions. Some suggestions:

Crawling: Puppeteer / Playwright (headless browser) or wget/httrack

API interception: Puppeteer request interception, mitmproxy, or custom proxy

Storage: File system or SQLite

Serving: Express.js, FastAPI, or similar lightweight server

Language: Node.js, Python, or Go
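To make the crawling step concrete: in Python (one of the suggested languages), asset discovery can start with nothing but the standard library. A rough sketch — a real crawler would also handle `srcset`, CSS `url()` references, and nested pages:

```python
# Sketch: collect absolute URLs of assets referenced by a page,
# using only the Python standard library.
from html.parser import HTMLParser
from urllib.parse import urljoin

class AssetExtractor(HTMLParser):
    """Collect absolute URLs of assets referenced by a page."""
    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.assets = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        # src covers <img>/<script>/<audio>/<video>; href covers <link>
        url = attrs.get("src") or (attrs.get("href") if tag == "link" else None)
        if url:
            self.assets.append(urljoin(self.base_url, url))

page = '<img src="/img/logo.svg"><link rel="stylesheet" href="style.css">'
parser = AssetExtractor("http://localhost:3777/")
parser.feed(page)
print(parser.assets)
# → ['http://localhost:3777/img/logo.svg', 'http://localhost:3777/style.css']
```

Note that this only sees server-rendered markup; for SPAs you would need the headless-browser route mentioned above.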

Evaluation Criteria

Completeness (30%) – Does the archived site work fully offline?

API Interception (25%) – Are server-side responses captured and replayed?

Code Quality (20%) – Clean, readable, well-structured code

Architecture (15%) – Sensible tech choices, documented trade-offs

Bonus Features (10%) – SPA support, UI, export, etc.

Deliverables

1. Source code in a Git repository

2. README with setup instructions

3. Architecture document — what approach you chose and why

4. Demo — screen recording or live demo showing: archive → shutdown → offline browsing

Time Budget

5–7 working days

Tips

• Start with a simple wget --mirror approach to understand the baseline, then improve from there

• Puppeteer/Playwright with request interception is probably the most powerful approach for API capture

• Don't forget relative vs. absolute URL rewriting — this is where most archivers break

• Look at how the Wayback Machine / SingleFile / HTTrack solve similar problems

• Edge case: inline <style> blocks with url() references
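The URL-rewriting tip deserves emphasis, since it is where most archivers break. One sketch of the core rule, assuming same-origin assets are saved under their URL path so root-relative links keep working (function and parameter names are illustrative):

```python
# Sketch: rewrite a reference found in an archived page so it resolves
# inside the archive. Same-origin URLs become root-relative paths the
# local server can serve; external URLs are left untouched here.
from urllib.parse import urljoin, urlparse

def to_local_path(page_url: str, ref: str, origin: str) -> str:
    """Resolve ref against page_url; keep external URLs as-is."""
    absolute = urljoin(page_url, ref)
    parsed = urlparse(absolute)
    if f"{parsed.scheme}://{parsed.netloc}" == origin:
        return parsed.path or "/"
    return absolute  # external URL: left as-is (or mirrored separately)

page = "http://localhost:3777/products/index.html"
print(to_local_path(page, "../img/logo.svg", "http://localhost:3777"))
# → /img/logo.svg
```

The same resolution logic applies to `url()` references inside CSS and inline `<style>` blocks, except those are relative to the stylesheet's URL rather than the page's.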

Ready to Submit?

Once your archiver is complete, submit your work for review.

Submit Your Work →

Happy hacking and good luck!

Questions? Reach out to your contact Ramsey or Lukas.