antanst
349968d019
Improve error handling and add duplicate snapshot cleanup
2025-06-29 22:38:38 +03:00
antanst
2357135d5a
Fix snapshot overwrite logic to preserve successful responses
- Prevent overwriting snapshots that have valid response codes
- Ensure URL is removed from queue when snapshot update is skipped
- Add last_crawled timestamp tracking for better crawl scheduling
- Remove SkipIdenticalContent flag, simplify content deduplication logic
- Update database schema with last_crawled column and indexes
2025-06-29 22:38:38 +03:00
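
The overwrite rule described in commit 2357135d5a can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the `Snapshot` and `Store` types, the `SaveSnapshot` method, the in-memory maps standing in for the database and queue, and the convention that a response code of 0 means "no valid response" are all assumptions made for the example. Updating `LastCrawled` even when the overwrite is skipped is likewise an assumption.

```go
package main

import (
	"fmt"
	"time"
)

// Snapshot is a hypothetical record of one crawl result.
type Snapshot struct {
	URL          string
	ResponseCode int // assumption: 0 means no valid response was recorded
	Body         string
	LastCrawled  time.Time
}

// hasValidResponse treats any non-zero code as valid (an assumption).
func (s Snapshot) hasValidResponse() bool { return s.ResponseCode != 0 }

// Store is an in-memory stand-in for the snapshot table and the crawl queue.
type Store struct {
	Snapshots map[string]Snapshot
	Queue     map[string]bool
}

// NewStore returns an empty Store.
func NewStore() *Store {
	return &Store{Snapshots: map[string]Snapshot{}, Queue: map[string]bool{}}
}

// SaveSnapshot stores incoming unless that would replace a snapshot holding a
// valid response with one that has none. It reports whether the stored
// snapshot was replaced. In both cases the URL is removed from the queue and
// the last-crawled timestamp is refreshed.
func (s *Store) SaveSnapshot(incoming Snapshot) bool {
	now := time.Now()

	// Dequeue first so a skipped update cannot leave the URL stuck in the queue.
	delete(s.Queue, incoming.URL)

	existing, found := s.Snapshots[incoming.URL]
	if found && existing.hasValidResponse() && !incoming.hasValidResponse() {
		// Keep the good snapshot; only record that a crawl was attempted.
		existing.LastCrawled = now
		s.Snapshots[incoming.URL] = existing
		return false
	}

	incoming.LastCrawled = now
	s.Snapshots[incoming.URL] = incoming
	return true
}

func main() {
	store := NewStore()
	store.Queue["example.org/page"] = true

	// 20 is just a sample non-zero response code.
	store.SaveSnapshot(Snapshot{URL: "example.org/page", ResponseCode: 20, Body: "hello"})

	// A later failed crawl (no response code) must not clobber the good snapshot.
	store.Queue["example.org/page"] = true
	replaced := store.SaveSnapshot(Snapshot{URL: "example.org/page"})

	fmt.Println("replaced:", replaced)
	fmt.Println("body kept:", store.Snapshots["example.org/page"].Body)
	fmt.Println("still queued:", store.Queue["example.org/page"])
}
```

Removing the URL from the queue before the overwrite check keeps the two concerns independent: whether or not the snapshot is replaced, the URL does not get re-crawled immediately.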
antanst
8588414b14
Enhance crawler with seed list and SQL utilities
Add a seedList module for URL initialization and comprehensive SQL utilities for database analysis, and update the project configuration.
2025-06-29 22:38:38 +03:00
4ef3f70f1f
Implement structured logging with slog
- Replace zerolog with Go's standard slog package
- Add ColorHandler for terminal color output
- Add context-aware logging system
- Format attributes on the same line as log messages
- Use green color for INFO level logs
- Set up context value extraction helpers
2025-06-29 22:38:38 +03:00
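
A handler along the lines described in commit 4ef3f70f1f might look like the sketch below. It uses only the standard `log/slog` API, but everything else is assumed for illustration: the fields of `ColorHandler`, the `WithWorker` and `workerFrom` context helpers, the `worker` key, the output layout, and the `demoLine` helper are hypothetical and not taken from the repository. Groups are ignored to keep the example short.

```go
package main

import (
	"context"
	"fmt"
	"io"
	"log/slog"
	"strings"
	"sync"
)

type ctxKey string

// workerKey is a hypothetical context key carrying a worker ID.
const workerKey ctxKey = "worker"

// WithWorker stores a worker ID in the context (illustrative helper).
func WithWorker(ctx context.Context, id string) context.Context {
	return context.WithValue(ctx, workerKey, id)
}

// workerFrom extracts the worker ID, if one was set.
func workerFrom(ctx context.Context) (string, bool) {
	id, ok := ctx.Value(workerKey).(string)
	return id, ok
}

const (
	green = "\033[32m"
	reset = "\033[0m"
)

// ColorHandler is a minimal slog.Handler that writes one line per record,
// colors the INFO level green, and appends attributes on the same line.
type ColorHandler struct {
	mu    *sync.Mutex
	out   io.Writer
	level slog.Leveler
	attrs []slog.Attr
}

func NewColorHandler(out io.Writer, level slog.Leveler) *ColorHandler {
	return &ColorHandler{mu: &sync.Mutex{}, out: out, level: level}
}

func (h *ColorHandler) Enabled(_ context.Context, l slog.Level) bool {
	return l >= h.level.Level()
}

func (h *ColorHandler) Handle(ctx context.Context, r slog.Record) error {
	var b strings.Builder
	level := r.Level.String()
	if r.Level == slog.LevelInfo {
		level = green + level + reset
	}
	fmt.Fprintf(&b, "%s %s", level, r.Message)
	// Context-derived values come first, then handler and record attributes.
	if id, ok := workerFrom(ctx); ok {
		fmt.Fprintf(&b, " worker=%s", id)
	}
	for _, a := range h.attrs {
		fmt.Fprintf(&b, " %s=%v", a.Key, a.Value)
	}
	r.Attrs(func(a slog.Attr) bool {
		fmt.Fprintf(&b, " %s=%v", a.Key, a.Value)
		return true
	})
	b.WriteByte('\n')

	h.mu.Lock()
	defer h.mu.Unlock()
	_, err := io.WriteString(h.out, b.String())
	return err
}

func (h *ColorHandler) WithAttrs(attrs []slog.Attr) slog.Handler {
	merged := append(append([]slog.Attr{}, h.attrs...), attrs...)
	return &ColorHandler{mu: h.mu, out: h.out, level: h.level, attrs: merged}
}

// WithGroup is not supported in this sketch; the group name is ignored.
func (h *ColorHandler) WithGroup(string) slog.Handler { return h }

// demoLine logs one INFO record through a ColorHandler and returns the output.
func demoLine(worker, msg string, args ...any) string {
	var buf strings.Builder
	logger := slog.New(NewColorHandler(&buf, slog.LevelInfo))
	logger.InfoContext(WithWorker(context.Background(), worker), msg, args...)
	return buf.String()
}

func main() {
	fmt.Print(demoLine("3", "snapshot saved", "url", "example.org/page"))
}
```

The mutex is shared between a handler and the copies returned by `WithAttrs`, so loggers derived from the same root do not interleave their writes.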