Data Sources Overview
Lomita includes 104 pre-built data sources across 8 domains — but this is just the starting point. AI agents can also search the internet for sources not in the catalog and connect your private data (CRM, ERP, APIs). See Connecting Your Own Data.
Categories
Macro & Economics (27 sources)
FRED series covering rates, yields, spreads, monetary policy, and economic indicators.
| Source | What it measures | Update frequency |
|---|---|---|
| fred-dgs10 | 10-Year Treasury yield | Daily |
| fred-fedfunds | Federal funds rate | Daily |
| fred-sofr | Secured Overnight Financing Rate | Daily |
| fred-umcsent | University of Michigan Consumer Sentiment | Monthly |
| fred-vixcls | CBOE Volatility Index (VIX) | Daily |
| fred-dcoilwtico | Crude oil price (WTI) | Daily |
| fred-m2sl | M2 money supply | Weekly |
| fred-walcl | Fed balance sheet | Weekly |
| treasury-tga-balance | Treasury General Account | Daily |
| treasury-auction-results | Treasury bond auctions | As scheduled |
| bls-qcew-wages | Quarterly wages | Quarterly |
| census-retail-sales | Retail sales | Monthly |
Sentiment & Markets (10 sources)
Reddit, Polymarket, and market sentiment indicators.
| Source | What it measures |
|---|---|
| reddit-wallstreetbets | WSB post scores, engagement |
| reddit-retail-sentiment | Aggregated retail investor sentiment |
| reddit-stocks | r/stocks activity |
| reddit-investing | r/investing activity |
| polymarket-markets | Prediction market odds |
| alphavantage-sentiment | Market news sentiment |
| deribit-options | BTC/ETH options data |
Energy (8 sources)
EIA, ERCOT, and energy market data.
| Source | What it measures |
|---|---|
| eia-electricity-generation | US electricity generation |
| eia-petroleum-supply | Petroleum supply/demand |
| eia-steo-forecasts | Short-term energy outlook |
| ercot-settlement-prices | Texas electricity prices |
| ercot-rt-lmps | Real-time locational marginal prices |
Frontier / DeFi (5 sources)
Crypto, DeFi, and emerging data.
| Source | What it measures |
|---|---|
| defillama-tvl | Total value locked in DeFi |
| defillama-stablecoins | Stablecoin supply |
| coingecko-derivatives | Crypto derivatives data |
Transportation (4 sources)
Traffic, flights, and border crossings.
Commodities (3 sources)
CFTC, USDA, and World Bank data.
Weather (4 sources)
NOAA, NWS, and weather data.
Real Estate (3 sources)
Google Places, MTA, and property data.
How discovery works
Your agent calls discover_sources("your keywords") to search the catalog:
Agent: discover_sources("consumer sentiment")
→ Returns: fred-umcsent, reddit-retail-sentiment, alphavantage-sentiment, ...
Each source includes:
- Name and description
- Data domain and tags
- Update frequency
- Whether credentials are required
- Sample fields
Data format
- Raw data is stored as JSON in
s3://plantos-raw/{source}/{date}/ - Transformed data is stored as Parquet in
s3://plantos-analytics/marts/ - Query both using
analyze_datawith SQL
Custom sources
You can upload your own data:
"Upload my portfolio CSV to the data lake"
→ upload_data("portfolio.csv")
→ Returns a presigned URL for upload
Or create custom Dagster assets for new API sources:
"Create a custom asset to scrape gold prices from the LBMA API"
→ propose_asset("gold_lbma_price", python_code, description)
→ Opens a PR for review