Skip to content

HTTPError should be catched. #1

@EltonChou

Description

@EltonChou

HTTPError should be catched.

The sitechecksum which caused error should be skipped to prevent blocking others behind.

@app.task
def check_new_release():
scheduled_jobs = []
site_checksums: list[Type[SiteChecksum]] = [
AlterChecksum,
GSCChecksum,
NativeChecksum
]
with pgsql_session():
scrapy_util = ScrapydUtil(
os.getenv("SCRAPYD_URL", "http://127.0.0.1:6800"), "product_crawler"
)
for site_checksum in site_checksums:
checksum = site_checksum(scrapyd_util=scrapy_util)
if checksum.is_changed:
spider_jobs = checksum.trigger_crawler()
scheduled_jobs.extend(spider_jobs)
checksum.update()
return scheduled_jobs

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions