Go HTTP Health Checker
A fast and efficient web crawler that performs deep health checks on websites. This tool crawls through internal links of a website and reports their HTTP status, content types, and any errors encountered.
Features
- 🚀 Concurrent crawling with configurable concurrency limits (see the sketch after this list)
- 🔄 Rate limiting per host to prevent overwhelming servers
- 📊 Detailed status reporting for each URL
- 🔍 Content-type detection
- 🌐 Internal link detection and filtering
- ⚡ Efficient memory usage with synchronized map handling
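
The two headline features, a global concurrency cap and per-host rate limiting, are commonly combined in Go with a buffered-channel semaphore and a mutex-guarded map of limiters. The sketch below shows one plausible shape using golang.org/x/time/rate and illustrative names; it is an assumption about the design, not this repository's actual code.

```go
package main

import (
	"context"
	"fmt"
	"net/http"
	"net/url"
	"sync"

	"golang.org/x/time/rate"
)

// hostLimiters hands out one rate limiter per host, creating each on first
// use. The mutex-guarded map mirrors the "synchronized map handling" above.
type hostLimiters struct {
	mu       sync.Mutex
	limiters map[string]*rate.Limiter
	perHost  rate.Limit // average allowed requests per second per host
	burst    int
}

func (h *hostLimiters) get(host string) *rate.Limiter {
	h.mu.Lock()
	defer h.mu.Unlock()
	if l, ok := h.limiters[host]; ok {
		return l
	}
	l := rate.NewLimiter(h.perHost, h.burst)
	h.limiters[host] = l
	return l
}

func main() {
	urls := []string{"https://example.com", "https://example.com/about"}

	limiters := &hostLimiters{
		limiters: map[string]*rate.Limiter{},
		perHost:  rate.Limit(1), // ~30 requests per 30s on average
		burst:    5,
	}
	sem := make(chan struct{}, 10) // concurrency cap: 10 in-flight requests

	var wg sync.WaitGroup
	for _, raw := range urls {
		wg.Add(1)
		go func(raw string) {
			defer wg.Done()
			sem <- struct{}{}        // acquire a concurrency slot
			defer func() { <-sem }() // release it when done

			u, err := url.Parse(raw)
			if err != nil {
				fmt.Println("invalid URL:", raw)
				return
			}
			// Block until this host's limiter allows another request.
			if err := limiters.get(u.Host).Wait(context.Background()); err != nil {
				return
			}
			resp, err := http.Get(raw)
			if err != nil {
				fmt.Println("error:", err)
				return
			}
			resp.Body.Close()
			fmt.Println(raw, resp.Status)
		}(raw)
	}
	wg.Wait()
}
```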
Installation
Prerequisites
- Go 1.18 or higher
- Git
Steps
- Clone the repository:
git clone https://github.com/OsamaNagi/http-health-checker.git
cd http-health-checker
- Install dependencies:
go mod tidy
- Build the binary:
go build -o http-health-checker
Usage
Basic Command
./http-health-checker status <url> [maxConcurrency] [requestsPerHost] [rateInterval]
Parameters
- url: The website URL to crawl (required)
- maxConcurrency: Maximum number of concurrent requests (default: 10)
- requestsPerHost: Maximum requests per host within the rate interval (default: 30)
- rateInterval: Time interval for rate limiting (default: 30s)
Examples
- Basic usage:
./http-health-checker status example.com
- With custom concurrency:
./http-health-checker status example.com 20
- With custom rate limiting:
./http-health-checker status example.com 10 50 1m
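
The rate interval in the last example is written as a Go duration string. Assuming the CLI parses it with Go's time.ParseDuration (an assumption; the README does not say), values such as 500ms, 30s, and 1m are all accepted:

```go
package main

import (
	"fmt"
	"time"
)

func main() {
	// The same formats a time.ParseDuration-based flag would accept.
	for _, s := range []string{"500ms", "30s", "1m"} {
		d, err := time.ParseDuration(s)
		if err != nil {
			fmt.Println("invalid duration:", s)
			continue
		}
		fmt.Printf("%s parses to %v\n", s, d)
	}
}
```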
Output Format
The tool provides a detailed health status report:
Starting deep health check of example.com
This may take a while depending on the site size...
Health Status Report for example.com
=====================================
✓ https://example.com Status: 200 OK
✓ https://example.com/about Status: 200 OK
✗ https://example.com/missing Status: 404 Not Found
Symbols:
- ✓ : Successful response (Status code < 400)
- ✗ : Error or failed response (Status code >= 400)
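
In Go, that rule is a one-liner; the function name here is illustrative:

```go
// healthy reports whether a response counts as a ✓ in the report:
// any status code below 400 (2xx success, 3xx redirect) passes.
func healthy(statusCode int) bool {
	return statusCode < 400
}
```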
Configuration
The crawler uses sensible defaults but can be customized:
- Default max concurrency: 10 concurrent requests
- Default requests per host: 30 requests
- Default rate interval: 30 seconds
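Taken together, the defaults allow at most 30 requests to any single host per 30-second window, an average of one request per second per host, while never keeping more than 10 requests in flight overall.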
Error Handling
The crawler handles various types of errors:
- Network errors
- Invalid URLs
- Timeout errors
- Non-HTML content types
- HTTP error status codes
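
One plausible way to separate these cases in Go is sketched below; the classify helper is hypothetical and not taken from this repository:

```go
package main

import (
	"errors"
	"fmt"
	"net"
	"net/http"
	"net/url"
	"strings"
)

// classify maps a fetch attempt to one of the error categories above.
func classify(rawURL string) string {
	u, err := url.Parse(rawURL)
	if err != nil || u.Scheme == "" || u.Host == "" {
		return "invalid URL"
	}

	resp, err := http.Get(rawURL)
	if err != nil {
		var netErr net.Error
		if errors.As(err, &netErr) && netErr.Timeout() {
			return "timeout error"
		}
		return "network error"
	}
	defer resp.Body.Close()

	if resp.StatusCode >= 400 {
		return "HTTP error status: " + resp.Status
	}
	if ct := resp.Header.Get("Content-Type"); !strings.HasPrefix(ct, "text/html") {
		return "non-HTML content type: " + ct
	}
	return "OK"
}

func main() {
	fmt.Println(classify("https://example.com"))
}
```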
Contributing
We welcome contributions to improve the web crawler! Here's how you can help:
Development Setup
- Fork the repository
- Create your feature branch:
git checkout -b feature/amazing-feature
- Set up your local development environment:
go mod tidy
go mod verify
Coding Standards
- Follow Go best practices and idioms
- Use meaningful variable and function names
- Add comments for complex logic
- Write tests for new features
- Run go fmt before committing
Testing
Please ensure your code passes all tests:
go test ./...
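Because the crawler is heavily concurrent, it is also worth running the suite under Go's race detector:
go test -race ./...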
Pull Request Process
- Update the README.md with details of changes if applicable
- Update any relevant documentation
- Make sure your code lints and tests pass
- Create a Pull Request with a clear title and description
- Link any relevant issues in your PR description
Bug Reports
When filing an issue, please include:
- Your Go version (go version)
- Operating system and version
- Steps to reproduce the issue
- Expected behavior
- Actual behavior
- Any relevant logs or error messages
License
This project is licensed under the MIT License - see the LICENSE file for details.