Data Discovery

Lomita's primary capability is finding data you didn't know existed. The 104 pre-built sources are a starting point — the real product is the discovery engine.

How discovery works

When you create a research hypothesis, the Discovery Agent follows a three-step strategy:

Step 1: Search the catalog

The agent searches 104 pre-built data sources across 8 domains:

  • Macro & Economics (FRED, BLS, Census, Treasury)
  • Energy (EIA, ERCOT, Baker Hughes)
  • Sentiment (Reddit, Polymarket)
  • Commodities (CFTC, USDA, World Bank)
  • Real Estate (Zillow, MTA)
  • Transportation (TomTom, FAA)
  • Weather (NOAA, NWS)
  • Frontier (DeFi, crypto)

If the catalog has what you need, the agent connects the sources immediately.

Step 2: Search the internet

If the catalog doesn't cover your topic, the agent searches the internet using Exa for:

  • Public APIs that provide the data
  • Government datasets and open data portals
  • Free data services with API access

When the agent finds a potential source, it creates a custom integration and the Integration Engineer builds a pipeline automatically.

Step 3: Connect custom / internal sources

For private data sources (your CRM, ERP, internal APIs):

  • Tell the chat: "Connect my HubSpot" or "Connect our internal sales API"
  • Provide API credentials when asked
  • The Integration Engineer reads the API docs and builds the pipeline
  • Credentials are encrypted with AES-256-GCM and stored securely

Intelligent substitution

When exact data isn't available, the Discovery Agent finds proxies:

  • No Uber ride data? → Uses TomTom traffic congestion as a mobility proxy
  • No direct competitor pricing? → Uses industry benchmark data from public filings
  • Historical data unavailable? → Documents the gap and works with what's available

The agent always documents its reasoning — you can see why it chose specific sources and what gaps exist.

Your data stays yours

  • Each tenant has an isolated data lake (MinIO S3)
  • Scoped credentials prevent cross-tenant data access
  • Custom integrations are encrypted at rest
  • Data you connect is only visible to your team