scrapyrt
https://github.com/scrapinghub/scrapyrt
Python
HTTP API for Scrapy spiders
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
1 Subscribers
Help out
- Issues
- override scrapyRT default settings with project settings on app startup
- Unable to override settings file
- ScrapyRT Port Unreachable in Kubernetes Docker Container Pod
- Add package support and support for launching via `python -m scrapyrt`
- Saving scraped items in a feed
- Concurrent scrapyrt requests
- document deployment
- scrapyrt becomes unresponsive
- Scrapyrt scrape multiple spiders asynchronously at once instead of overwhelming the server with request
- [WIP] Add max_items
- Docs
- Python not yet supported