GILES - General Information Location and Extraction System v3.0

UK Computer Reseller Scraping Jobs

ID Source URL Status Scraping Progress Actions

Last Process Start: Initializing...

Data Quality Workflow: UK Reseller Feed

SOURCE DATA
Raw Feed Input
6 UK Resellers, 250K Rows
TRANSFORMATION & CLEANING
Deduplication Engine
Match: SKU, Name (80% confidence)
Map Inconsistent Brands
Normalize "HP Inc" to "HP" (12 rules)
Standardize Unit Values
Convert "G" to "GB", "Inch" to '"'
Field Cleaning (Regex)
Remove HTML tags and special characters from descriptions.
DATA DESTINATION
Export to Data Warehouse
Target: Snowflake DB (CLEAN_PRICING_UK)
Notification: Success
Email Analyst Team upon completion.

Hover over rules to manage or click "Add New Rule."

Automated ETL and Enrichment Jobs

ID Pipeline Purpose Schedule Status Progress Actions
P-101 **Pricing Enrichment & Margin Calculation** Hourly Running
65%
P-102 **Competitor Geo-Tagging Service** Daily (2 AM) Finished
100%
P-103 **Final Data Load to Reporting DB** On Success of P-101 Pending
0%
P-104 **Cost Data API Synchronization** Hourly Failed
20% (Timeout)
**Pipeline Health:** P-101 is currently running. Note that **P-103 is dependent on P-101 completion** and will start automatically.

User-Defined Market Analysis Reports

Report Name Status Last Generated Controls & Actions
**Component Price Volatility (Q3)** Running 15 minutes ago
**Currys vs. Scan UK Price Index (Weekly)** Finished 2 hours ago
**Competitor Stock-Out Rate by Category** Paused 1 day ago

Running reports consume processing pipeline resources.