Where your signals come from¶
The StockClaw pipeline crawls nine source types on your behalf, roughly every 20 minutes, filters the output to items relevant to your watchlist, and stores matching items as signals. Claw Chat only ever reads from this set — if a topic isn't covered by any source below, Claw won't have it.
This page enumerates every source currently wired in, so you can tell what Claw can see and what it can't.
At a glance¶
| Source type | What it gives you | Ticker-filtered? |
|---|---|---|
| News RSS (editorial) | Third-party journalism on earnings, M&A, macro | ✓ |
| Yahoo Finance (per-ticker) | Ticker-scoped news feeds | ✓ |
| SEC EDGAR | Regulatory filings (8-K / Form 4 / 13D / 13G) | ✓ |
| YouTube transcripts | Video commentary from major finance channels | ✓ |
| Retail discussion across 4 financial subreddits | ✓ | |
| Stocktwits | Retail message-board chatter | ✓ |
| Central bank / gov | Fed / ECB / BOJ / BOE / BIS press & speeches | ✗ (macro exempt) |
| Company newswires | First-party press releases (10-30s ahead of editorial) | ✓ |
| Economic calendar | Macro indicator news (CPI / NFP / FOMC) | ✗ (macro exempt) |
"Ticker-filtered" means the watchlist alias filter runs on the item — sources marked ✗ bypass it because macro events affect all tickers and shouldn't require a name-match to reach you.
Detailed breakdown¶
📰 News RSS (editorial)¶
Third-party journalism — articles written about companies, not by them.
| Feed | Region | Notes |
|---|---|---|
| CNBC | US | Multi-category (top news + finance + tech) |
| Yahoo Finance | US | Main RSS feed |
| MarketWatch | US | Top stories + MarketPulse |
| WSJ | US | RSS teasers only — paywalled bodies |
| Financial Times | Global | RSS teasers only — paywalled bodies |
| Bloomberg | Global | RSS only, not terminal data |
| Seeking Alpha | US | Analyst-style pieces |
| Benzinga | US | News + analyst actions |
| The Motley Fool | US | Investor commentary |
| Zacks | US | Analyst / ratings focus |
| BBC Business | Global | Macro / international |
| Investopedia | US | Explainers / education |
| Sina Finance | China | 中文 news on A-shares and Chinese economy |
| Caixin | China | 中文 policy and business coverage |
Typically drives: SignalScope, SectorScan.
📈 Yahoo Finance (per-ticker news)¶
yfinance returns a per-ticker news feed whose publisher varies (Reuters, Barron's, Investor's Business Daily, etc. — whatever yfinance aggregates for that ticker). Runs once per watchlist ticker per cycle.
Typically drives: SignalScope.
🏛️ SEC EDGAR (US regulatory filings)¶
Pulls four form types from SEC's EDGAR submissions API, plus their /A amendments:
| Form | What it is | Classification labels on the pill |
|---|---|---|
| 8-K | Material events filed within 4 days | 8-K: earnings / M&A / bankruptcy / exec change / delisting / … (item-code derived) |
| Form 4 | Insider transactions (director / officer buys & sells) | Form 4 |
| SC 13D | Activist stakes (>5%, active intent) | 13D, 13D/A for amendments |
| SC 13G | Passive stakes (>5%, passive holder) | 13G, 13G/A |
8-K items are ranked by severity — a filing carrying both bankruptcy and Reg-FD items surfaces as "bankruptcy". 10-K / 10-Q annual reports are intentionally excluded from the default cadence (too bulky, news wires already cover them).
Typically drives: SignalScope, SectorScan.
🎥 YouTube transcripts¶
Titles + descriptions (transcripts are fetched lazily) from the following channels:
- CNBC (@CNBC)
- Bloomberg (@markets)
- Yahoo Finance (@YahooFinance)
Typically drives: SentimentPulse.
💬 Reddit (retail discussion)¶
Ticker-keyed search across four subreddits, filtered to posts with a minimum upvote score:
- r/wallstreetbets
- r/stocks
- r/investing
- r/StockMarket
Typically drives: SentimentPulse.
💬 Stocktwits¶
Public message-board stream, ticker-filtered. Used as a proxy for real-time retail sentiment (our Twitter/X path is not wired — see "What's not covered" below).
Typically drives: SentimentPulse.
🏦 Central bank / government feeds¶
Rate decisions, FOMC minutes, speeches, press releases. Bypasses the watchlist ticker filter (see ADR-0005) — macro signals affect every ticker.
- Federal Reserve — all press releases / speeches / monetary-policy specific
- European Central Bank — press releases
- Bank of Japan — Japanese-language feed (the English feed was retired; our translation layer handles the language)
- Bank of England — news
- Bank for International Settlements — press releases
Typically drives: PolicyRadar, MacroMind.
📢 Company newswires¶
First-party press releases — the originals that editorial outlets like CNBC paraphrase 10-30 seconds later. Earnings beats/misses, M&A, material contracts, buyback authorizations.
- PR Newswire — all news releases + financial-services subfeed
- GlobeNewswire — Public Companies feed (specifically public-company announcements, best signal-to-noise)
Typically drives: SignalScope, SectorScan.
📊 Economic calendar (macro indicators)¶
Retrospective coverage of macro releases (CPI, NFP, GDP, PMI, FOMC decisions) and forex commentary. Bypasses the watchlist ticker filter — same rationale as gov feeds.
- Investing.com Economic Indicators News (news_95)
- Investing.com Economy News (news_14)
- Investing.com Forex News (news_1)
Typically drives: MacroMind, PolicyRadar.
Note: this is news about macro releases, not a structured schedule of upcoming events. Forward-looking event scheduling would need FRED / BLS APIs — not currently wired.
What's not covered¶
Honest inventory of blind spots, so you know what Claw can't answer no matter how you phrase the question:
| Gap | Why | Workaround |
|---|---|---|
| Bloomberg Terminal / Refinitiv commentary | Paid-only, no public feed | — |
| WSJ / FT paywalled article bodies | Only RSS teasers are public | Subscribe separately; Claw sees the teaser |
| A-share 官方公告 (PBoC, CSRC, MOF, State Council RSS) | All Chinese ministry RSS endpoints 404'd after site redesigns | Partial coverage via Sina / Caixin editorial — original disclosures are missing |
| HKEX / cninfo.com.cn disclosures | Not wired yet — needs HTML scraping, planned but not implemented | — |
| Twitter / X | Public API paywalled ($200+/mo); the twitter_crawler.py stub is intentional. Retail chatter comes from Reddit + Stocktwits instead. |
Use Stocktwits / Reddit for retail sentiment |
| Real-time quotes / tick data | yfinance is delayed ~15 min; we don't pay for real-time feeds | Dashboard index bar shows delayed quotes |
| Japanese domestic financial news (Nikkei, Reuters Japan) | Not wired — BOJ only covers central bank, not market commentary | — |
| Korean financial news | Not wired | — |
| US Treasury RSS | Treasury retired their public RSS; no discoverable replacement | Fed feeds cover most of the policy surface |
| Earnings call transcripts | Not scraped yet (Seeking Alpha / Motley Fool transcripts are future work) | 8-K earnings releases give the headline numbers |
| Clinical trial results | Not wired — ClinicalTrials.gov RSS is a future addition for biotech coverage | — |
| Forex Factory economic calendar | Blocks all non-browser clients (403) | Investing.com calendar RSS covers the same events retrospectively |
If a source above turns out to be something you need, flag it — expanding coverage is typically a few hours of work per source.
How signals end up in Claw's retrieval¶
Every signal that passes the ticker filter (or bypasses it, for macro sources) gets:
- Persisted to the
signalstable in Postgres, scoped to youruser_idvia Row Level Security - Tagged with the source origin — the dashboard pill renders this as
RSS(CNBC),SEC(Form 4),GOV(Fed), etc. - Available in both BM25 (Postgres full-text) and ChromaDB (dense vector) retrieval when you ask Claw a question
See Claw Chat for the user-facing behaviour and Claw Chat (technical) for the retrieval mechanics.