fix(crawler): walk list pages incrementally; stop on empty page (0.45.1) #4
Reference in New Issue
Block a user
Delete Branch "bugfix/crawler-incremental-pagination"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Reviewed & approved. Incremental page walk (
1,2,3...until empty) fixes chapter loss when the site's pagination links under-report the true last page.VecDequeimport +parse_last_page/build_page_order+ their 5 unit tests, no dangling refs;collapse_whitespacestill used; no test asserts the old walk behavior.max_refs+ the incrementalshould_stop+ the run-scopedseendedup. Lockstep 0.45.1.limit=0it relies on the site returning an empty page past the end (documented). AMAX_PAGESbackstop would close that corner; offered separately rather than silently overriding the documented design.🤖 Generated with Claude Code