Data Sources Overview

Lomita includes 104 pre-built data sources across 8 domains, but the catalog is only a starting point. AI agents can also search the internet for sources that are not in the catalog and connect to your private data (CRM, ERP, APIs). See Connecting Your Own Data.

Categories

Macro & Economics (27 sources)

FRED series covering rates, yields, spreads, monetary policy, and economic indicators.

| Source | What it measures | Update frequency |
| --- | --- | --- |
| fred-dgs10 | 10-Year Treasury yield | Daily |
| fred-fedfunds | Federal funds rate | Daily |
| fred-sofr | Secured Overnight Financing Rate | Daily |
| fred-umcsent | University of Michigan Consumer Sentiment | Monthly |
| fred-vixcls | CBOE Volatility Index (VIX) | Daily |
| fred-dcoilwtico | Crude oil price (WTI) | Daily |
| fred-m2sl | M2 money supply | Weekly |
| fred-walcl | Fed balance sheet | Weekly |
| treasury-tga-balance | Treasury General Account | Daily |
| treasury-auction-results | Treasury bond auctions | As scheduled |
| bls-qcew-wages | Quarterly wages | Quarterly |
| census-retail-sales | Retail sales | Monthly |

Full macro source list →

Sentiment & Markets (10 sources)

Reddit, Polymarket, and market sentiment indicators.

| Source | What it measures |
| --- | --- |
| reddit-wallstreetbets | WSB post scores, engagement |
| reddit-retail-sentiment | Aggregated retail investor sentiment |
| reddit-stocks | r/stocks activity |
| reddit-investing | r/investing activity |
| polymarket-markets | Prediction market odds |
| alphavantage-sentiment | Market news sentiment |
| deribit-options | BTC/ETH options data |

Energy (8 sources)

EIA, ERCOT, and energy market data.

| Source | What it measures |
| --- | --- |
| eia-electricity-generation | US electricity generation |
| eia-petroleum-supply | Petroleum supply/demand |
| eia-steo-forecasts | Short-term energy outlook |
| ercot-settlement-prices | Texas electricity prices |
| ercot-rt-lmps | Real-time locational marginal prices |

Frontier / DeFi (5 sources)

Crypto, DeFi, and emerging data.

| Source | What it measures |
| --- | --- |
| defillama-tvl | Total value locked in DeFi |
| defillama-stablecoins | Stablecoin supply |
| coingecko-derivatives | Crypto derivatives data |

Transportation (4 sources)

Traffic, flights, and border crossings.

Commodities (3 sources)

CFTC, USDA, and World Bank data.

Weather (4 sources)

NOAA, NWS, and weather data.

Real Estate (3 sources)

Google Places, MTA, and property data.

How discovery works

Your agent calls discover_sources("your keywords") to search the catalog:

Agent: discover_sources("consumer sentiment")
→ Returns: fred-umcsent, reddit-retail-sentiment, alphavantage-sentiment, ...

Each source includes:

  • Name and description
  • Data domain and tags
  • Update frequency
  • Whether credentials are required
  • Sample fields
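
To make the fields above concrete, here is a minimal sketch of a catalog entry and a naive keyword search over it. The `SourceInfo` class, its field names, and the matching logic are assumptions for illustration only; the actual `discover_sources` implementation and response shape are not described on this page. The entries reuse source names from the catalog, but their tags, credential flags, and sample fields are made up.

```python
from dataclasses import dataclass, field


@dataclass
class SourceInfo:
    """Hypothetical shape of one catalog entry (field names are assumptions)."""
    name: str
    description: str
    domain: str
    tags: list[str] = field(default_factory=list)
    update_frequency: str = "unknown"
    requires_credentials: bool = False
    sample_fields: list[str] = field(default_factory=list)


# Illustrative entries; tags, credential flags, and sample fields are invented.
CATALOG = [
    SourceInfo("fred-umcsent", "University of Michigan Consumer Sentiment",
               "macro", ["consumer", "sentiment"], "Monthly", False, ["date", "value"]),
    SourceInfo("reddit-retail-sentiment", "Aggregated retail investor sentiment",
               "sentiment", ["retail", "sentiment"]),
    SourceInfo("fred-dgs10", "10-Year Treasury yield",
               "macro", ["rates", "treasury"], "Daily", False, ["date", "value"]),
]


def search_catalog(keywords: str, catalog: list[SourceInfo]) -> list[SourceInfo]:
    """Return entries whose name, description, or tags contain any keyword."""
    terms = keywords.lower().split()
    matches = []
    for source in catalog:
        haystack = " ".join([source.name, source.description, *source.tags]).lower()
        if any(term in haystack for term in terms):
            matches.append(source)
    return matches


names = [s.name for s in search_catalog("consumer sentiment", CATALOG)]
print(names)
```

A real catalog search would likely rank results rather than return every substring match; this sketch only shows how the listed metadata could drive discovery.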

Data format

  • Raw data is stored as JSON in s3://plantos-raw/{source}/{date}/
  • Transformed data is stored as Parquet in s3://plantos-analytics/marts/
  • Query both using analyze_data with SQL
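
The raw path template above can be expanded with a small helper. This is a sketch under stated assumptions: the format of `{date}` is not specified here, so the code assumes ISO `YYYY-MM-DD`, and the table and column names in the sample SQL are hypothetical, as is the exact way `analyze_data` accepts a query.

```python
from datetime import date

RAW_BUCKET = "s3://plantos-raw"
MARTS_PREFIX = "s3://plantos-analytics/marts/"


def raw_prefix(source: str, day: date) -> str:
    """Build the raw JSON prefix for one source and day.

    Assumption: {date} is rendered as ISO YYYY-MM-DD.
    """
    return f"{RAW_BUCKET}/{source}/{day.isoformat()}/"


# Hypothetical SQL one might pass to analyze_data; table and columns are assumed.
EXAMPLE_SQL = """
SELECT date, value
FROM fred_dgs10
ORDER BY date DESC
LIMIT 30
"""

print(raw_prefix("fred-dgs10", date(2024, 1, 15)))
```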

Custom sources

You can upload your own data:

"Upload my portfolio CSV to the data lake"
→ upload_data("portfolio.csv")
→ Returns a presigned URL for upload
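
Once `upload_data` returns a presigned URL, the file still has to be sent to it. A minimal sketch using only the Python standard library, assuming the URL is an S3-style presigned URL that accepts an HTTP PUT (this page does not say whether it expects PUT or a POST form). The function names and the placeholder URL are hypothetical.

```python
import urllib.request


def build_upload_request(presigned_url: str, payload: bytes) -> urllib.request.Request:
    """Prepare an HTTP PUT carrying the file bytes to a presigned URL."""
    return urllib.request.Request(
        presigned_url,
        data=payload,
        method="PUT",
        # If the URL was signed with a Content-Type, this value must match it.
        headers={"Content-Type": "text/csv"},
    )


def upload_file(presigned_url: str, path: str) -> int:
    """Upload the file at `path` and return the HTTP status code."""
    with open(path, "rb") as f:
        request = build_upload_request(presigned_url, f.read())
    with urllib.request.urlopen(request) as response:
        return response.status


# Usage (placeholder URL; a real one comes from upload_data):
# upload_file("https://example.com/presigned-url", "portfolio.csv")
```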

Or create custom Dagster assets for new API sources:

"Create a custom asset to scrape gold prices from the LBMA API"
→ propose_asset("gold_lbma_price", python_code, description)
→ Opens a PR for review
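
For orientation, the `python_code` argument might contain something like the sketch below. It is illustrative only: the endpoint URL is a placeholder, the response keys (`date`, `price`) are assumed because the LBMA API's actual schema is not given here, and the conventions Lomita expects for proposed assets may differ. Only Dagster's `@asset` decorator is a real library call.

```python
import json
import urllib.request

try:
    from dagster import asset
except ImportError:
    # Fallback only so this sketch runs where Dagster is not installed.
    def asset(fn):
        return fn

# Placeholder endpoint: the real LBMA API URL is not given on this page.
GOLD_PRICE_URL = "https://example.com/gold-prices.json"


def normalize_prices(payload: list[dict]) -> list[dict]:
    """Keep rows that have a date and a numeric price (assumed keys)."""
    rows = []
    for item in payload:
        if "date" in item and isinstance(item.get("price"), (int, float)):
            rows.append({"date": item["date"], "price": float(item["price"])})
    return rows


@asset
def gold_lbma_price():
    """Fetch and normalize gold prices (endpoint and schema are assumptions)."""
    with urllib.request.urlopen(GOLD_PRICE_URL) as response:
        payload = json.load(response)
    return normalize_prices(payload)
```

Keeping the parsing in a plain function such as `normalize_prices` makes the asset easy to test without network access, which helps when the proposed code goes through PR review.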