Magento Website Detector Scraper identifies whether a website is running Magento 1.x or Magento 2.x with high accuracy. It helps businesses, agencies, and researchers quickly classify e-commerce sites and generate actionable insights for outreach and analysis.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
This project scans websites to determine if they are powered by Magento and identifies the major version in use. It solves the challenge of manually inspecting sites at scale and guessing their underlying e-commerce stack. It is built for marketers, sales teams, analysts, and developers who need reliable platform detection.
- Detects Magento 1.x and 2.x installations automatically
- Processes single URLs or large URL lists in bulk
- Uses multiple verification signals to improve accuracy
- Designed for fast analysis and repeatable workflows
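
As a rough illustration of bulk processing, the sketch below reads a newline-separated URL list and runs a per-URL check with a bounded number of tasks in flight. The names `parseUrlList`, `mapWithConcurrency`, and the `worker` callback are hypothetical and not taken from this project; the one-URL-per-line input format is also an assumption.

```javascript
// Hypothetical helper: parse a newline-separated URL list,
// skipping blank lines and exact duplicates (order is preserved).
function parseUrlList(text) {
  const seen = new Set();
  for (const line of text.split(/\r?\n/)) {
    const url = line.trim();
    if (url) seen.add(url);
  }
  return [...seen];
}

// Hypothetical helper: run `worker` over `items` with at most `limit`
// calls in flight. A failing item is recorded instead of aborting the run.
async function mapWithConcurrency(items, limit, worker) {
  const results = new Array(items.length);
  let next = 0;

  async function run() {
    while (next < items.length) {
      const index = next++;
      try {
        results[index] = await worker(items[index]);
      } catch (error) {
        results[index] = { input: items[index], error: String(error.message || error) };
      }
    }
  }

  // Start up to `limit` runners that pull from the shared index.
  const runners = Array.from({ length: Math.min(limit, items.length) }, run);
  await Promise.all(runners);
  return results;
}
```

A call such as `await mapWithConcurrency(urls, 10, checkUrl)` would then process the list ten URLs at a time, where `checkUrl` stands in for whatever per-URL detection function is used. Capping concurrency keeps memory and open sockets predictable on large lists.
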
| Feature | Description |
|---|---|
| Magento Version Detection | Identifies Magento 1.x or Magento 2.x with high confidence. |
| Bulk URL Processing | Analyze hundreds or thousands of domains in one run. |
| High Accuracy Signals | Combines headers, assets, and page structure checks. |
| Lightweight Execution | Minimal resource usage for large-scale scans. |
| Clean Structured Output | Results are easy to integrate into CRM or analytics tools. |
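
To make the idea of combining headers, assets, and page structure concrete, here is a minimal sketch of weighted signal scoring. The signal list, the weights, and the function name `detectMagento` are assumptions for illustration and do not describe this project's actual checks. The fingerprints used (`X-Magento-*` response headers, `text/x-magento-init` scripts, and `/static/version…/` asset paths for 2.x; `/skin/frontend/` asset paths and `Mage.Cookies` for 1.x) are commonly cited markers, and any of them can be absent on a hardened or customized store. The 0 to 1 confidence scale is also an assumption.

```javascript
// Illustrative signals only; the weights are arbitrary placeholders.
const SIGNALS = [
  { name: 'header:x-magento-*', version: '2.x', weight: 0.4,
    test: ({ headers }) => Object.keys(headers).some((h) => h.toLowerCase().startsWith('x-magento-')) },
  { name: 'dom:text/x-magento-init', version: '2.x', weight: 0.4,
    test: ({ html }) => html.includes('text/x-magento-init') },
  { name: 'asset:/static/version', version: '2.x', weight: 0.3,
    test: ({ html }) => /\/static\/version\d+\//.test(html) },
  { name: 'asset:/skin/frontend/', version: '1.x', weight: 0.4,
    test: ({ html }) => html.includes('/skin/frontend/') },
  { name: 'dom:Mage.Cookies', version: '1.x', weight: 0.4,
    test: ({ html }) => html.includes('Mage.Cookies') },
];

// `page` is { headers, html } from an already-fetched response,
// so this function does no network I/O itself.
function detectMagento(url, page) {
  const scores = { '1.x': 0, '2.x': 0 };
  const detectionSignals = [];

  for (const signal of SIGNALS) {
    if (signal.test(page)) {
      scores[signal.version] += signal.weight;
      detectionSignals.push(signal.name);
    }
  }

  // Pick the version with the higher score and cap confidence at 1.
  const best = scores['2.x'] >= scores['1.x'] ? '2.x' : '1.x';
  const confidenceScore = Number(Math.min(1, scores[best]).toFixed(2));
  const isMagento = confidenceScore > 0;

  return {
    url,
    isMagento,
    magentoVersion: isMagento ? best : null,
    detectionSignals,
    confidenceScore,
    checkedAt: new Date().toISOString(),
  };
}
```

Scoring several independent signals means a store that hides one marker (for example, stripped response headers) can still be classified from the others, at a lower confidence.
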
| Field Name | Field Description |
|---|---|
| url | The website URL that was analyzed. |
| isMagento | Boolean indicating if Magento was detected. |
| magentoVersion | Detected Magento version (1.x or 2.x), if available. |
| detectionSignals | Technical indicators used to confirm detection. |
| confidenceScore | Confidence level of the detection result. |
| checkedAt | Timestamp of when the URL was processed. |
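
For CRM or spreadsheet import, records with these fields can be flattened to CSV. The sketch below is one possible way to do that; `toCsv` and `csvCell` are hypothetical helpers, and it assumes `detectionSignals` is an array of strings, which the field table does not specify.

```javascript
// Column order follows the field table above.
const FIELDS = ['url', 'isMagento', 'magentoVersion', 'detectionSignals', 'confidenceScore', 'checkedAt'];

// Convert one value to a CSV cell: arrays are joined with ';',
// null/undefined become empty, and cells containing a comma, quote,
// or newline are quoted with inner quotes doubled.
function csvCell(value) {
  const text = Array.isArray(value) ? value.join(';') : String(value ?? '');
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Build a CSV string with a header row followed by one row per record.
function toCsv(records) {
  const rows = records.map((record) => FIELDS.map((field) => csvCell(record[field])).join(','));
  return [FIELDS.join(','), ...rows].join('\n');
}
```

Joining signals with `;` keeps each record on a single row; a tool that needs one row per signal would have to expand them instead.
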
Magento Website Detector/
├── src/
│   ├── index.js
│   ├── detector/
│   │   ├── headersCheck.js
│   │   ├── domCheck.js
│   │   └── assetCheck.js
│   ├── utils/
│   │   ├── httpClient.js
│   │   └── logger.js
│   └── config/
│       └── defaults.json
├── data/
│   ├── input.sample.txt
│   └── output.sample.json
├── package.json
└── README.md
- Lead generation teams use it to identify Magento stores so they can target relevant e-commerce prospects.
- Agencies analyze competitor platforms to refine sales and migration strategies.
- Market researchers classify large datasets of online stores for ecosystem analysis.
- Developers audit legacy Magento 1.x sites to plan upgrades or migrations.
Does this tool detect custom or heavily modified Magento sites? Yes, it relies on multiple technical signals rather than a single indicator, improving detection even on customized installations.
Can it handle large URL lists reliably? It is designed for bulk processing and performs consistently on large datasets with minimal resource usage.
Does it detect non-Magento platforms? No. The focus is Magento detection, so non-Magento sites are flagged as unsupported platforms rather than identified by name.
Is Magento 1.x still relevant for detection? Yes, many legacy stores still run Magento 1.x, making identification valuable for upgrade and migration efforts.
Primary Metric: Average detection accuracy above 90% across mixed e-commerce datasets.
Reliability Metric: Stable processing with consistent results across repeated scans of the same domains.
Efficiency Metric: Capable of analyzing hundreds of URLs per minute on standard configurations.
Quality Metric: High precision in distinguishing Magento from other popular e-commerce platforms.
