allowed with permission
User-owned websites
Allowed when the user confirms ownership or authorization and rate limits are respected.
Safe Web Research
This is a Crawl4AI-style adapter plan for future user-authorized and public research intake. Live crawling is disabled until dependencies, hosting, queueing, robots handling, storage, and legal review are complete.
allowed with permission
Allowed when the user confirms ownership or authorization and rate limits are respected.
allowed with permission
Allowed for research summaries when robots.txt, license terms, and provider terms allow access.
allowed with permission
Allowed for market research notes when collection is transparent and rate limited.
blocked
Blocked. Do not bypass paywalls, logins, platform rules, or access controls.
blocked
Blocked. Use official APIs, public datasets, or user-provided exports where permitted.
needs review
Review source terms, attribution, freshness, and permitted use before collection.
URL review
User-provided URL required
Permission checkbox
Owner confirms permission before collection
Crawl depth
1-2 levels until reviewed
Rate limit
slow, respectful, and source-specific
Storage
summary placeholders only until database/RLS review
Live crawling
disabled
Default: allowed with permission
company homepage, owned landing page, owned help docs
Default: allowed with permission
official documentation, public README, release notes
Default: allowed with permission
pricing page, feature page, public changelog
Default: blocked
paywalled pages, private accounts, login-required dashboards
Default: needs review
government data, public statistics, open datasets