Data Discovery
Lomita's primary capability is finding data you didn't know existed. The 104 pre-built sources are a starting point — the real product is the discovery engine.
How discovery works
When you create a research hypothesis, the Discovery Agent follows a three-step strategy:
Step 1: Search the catalog
The agent searches 104 pre-built data sources across 8 domains:
- Macro & Economics (FRED, BLS, Census, Treasury)
- Energy (EIA, ERCOT, Baker Hughes)
- Sentiment (Reddit, Polymarket)
- Commodities (CFTC, USDA, World Bank)
- Real Estate (Zillow, MTA)
- Transportation (TomTom, FAA)
- Weather (NOAA, NWS)
- Frontier (DeFi, crypto)
If the catalog has what you need, the agent connects the sources immediately.
Step 2: Search the internet
If the catalog doesn't cover your topic, the agent searches the internet using Exa for:
- Public APIs that provide the data
- Government datasets and open data portals
- Free data services with API access
When the agent finds a potential source, it creates a custom integration and the Integration Engineer builds a pipeline automatically.
Step 3: Connect custom / internal sources
For private data sources (your CRM, ERP, internal APIs):
- Tell the chat: "Connect my HubSpot" or "Connect our internal sales API"
- Provide API credentials when asked
- The Integration Engineer reads the API docs and builds the pipeline
- Credentials are encrypted with AES-256-GCM and stored securely
Intelligent substitution
When exact data isn't available, the Discovery Agent finds proxies:
- No Uber ride data? → Uses TomTom traffic congestion as a mobility proxy
- No direct competitor pricing? → Uses industry benchmark data from public filings
- Historical data unavailable? → Documents the gap and works with what's available
The agent always documents its reasoning — you can see why it chose specific sources and what gaps exist.
Your data stays yours
- Each tenant has an isolated data lake (MinIO S3)
- Scoped credentials prevent cross-tenant data access
- Custom integrations are encrypted at rest
- Data you connect is only visible to your team