Lomita Documentation
Lomita is a decision framework. Ask any research question and get a statistically rigorous, data-backed answer — across internal and external data, automatically.
What Lomita does
- Finds the data you need — 104+ pre-built sources, plus AI agents that search the internet for APIs and datasets you don't know exist
- Builds the pipeline automatically — agents read API documentation, handle authentication, and start ingesting data without manual setup
- Runs rigorous analysis — statistical tests with confidence intervals, not guesses. Correlations, regressions, Granger causality, regime analysis.
- Delivers a verdict — SUPPORTED, REFUTED, or INCONCLUSIVE — with the methodology to prove it
- Builds your knowledge graph — every hypothesis, correlation, and data source compounds into an organizational decision map
Two ways to use Lomita
From your browser — sign in at lomita.io, go to the Explore page, and ask questions in the built-in chat. The knowledge graph grows as you research.
From your AI agent — connect Claude Code, Claude Desktop, or any MCP-compatible agent for deeper research, scripted workflows, and programmatic access.
Quick links
- Quickstart — sign up to first research in 5 minutes
- Using the Chat — the primary research interface
- Data Discovery — how agents find and connect new data
- Connecting Your Own Data — bring your CRM, ERP, or any API
- Tools Reference — every tool explained
Who it's for
Decision-makers who need data-backed answers but don't have a data science team. CEOs validating market hypotheses. CMOs measuring campaign effectiveness against real benchmarks. Founders making resource allocation decisions with evidence, not gut feel.
You don't need to know SQL, Python, or data engineering. You need a question worth answering.
Pricing
- Standard ($640/mo) — 3 team members, 104+ data sources, autonomous research agents, email + webhook delivery
- Pro ($990/mo) — 10 team members, priority agent execution, advanced monitoring
Each instance is a dedicated container. No shared resources. Cancel anytime.
Lomita is a product of Plantos Technologies, Inc.
What is Lomita?
Lomita is a decision framework that finds, integrates, and analyzes data so you can validate assumptions before committing resources.
The problem
Every company makes decisions on incomplete data. You validate a market hypothesis by asking an analyst who queries one database. You measure campaign ROI against a single attribution model. You optimize operations based on internal metrics without external context.
Each decision is made within a silo — one or two dimensions of data — because finding, integrating, and analyzing additional sources is prohibitively expensive and slow.
The result: decisions backed by gut feel dressed up as data. Internal data tells you what happened. It rarely tells you why.
The solution
State a hypothesis. Lomita finds the data, runs the analysis, and delivers a verdict.
The product is not a pre-built data warehouse. The 104 pre-built sources are a starting point. The actual product is the ability to:
- Discover data that doesn't exist in your system yet — AI agents search the internet for relevant APIs, datasets, and services
- Integrate any data source automatically — agents read API documentation, build data pipelines, handle authentication, and start ingesting
- Analyze across all dimensions — internal data (your CRM, your ERP) combined with external data (macro indicators, sentiment, weather, competitors)
- Validate or refute with statistical rigor — not dashboards, not gut feel. Statistical tests with confidence intervals and methodology you can audit.
- Build a persistent knowledge graph — every hypothesis, every correlation, every data connection compounds. The system gets smarter with every question.
How it works
You: "Does ERCOT energy demand correlate with Dallas weather extremes?"
Lomita:
1. Discovery Agent searches 104+ sources + the internet for ERCOT and weather data
2. Integration Engineer builds data pipelines for ERCOT, NOAA, EIA
3. Quant Analyst runs Pearson correlations, Granger causality, regime analysis
4. Research Narrator compiles an executive report with verdict
Result: SUPPORTED (r = 0.67, n = 8,760, Granger p < 0.01)
Report auto-delivered to your email and Slack.
Two ways to use Lomita
Browser chat (easiest) — Sign in, go to Explore, type a question. The knowledge graph grows in real time as agents work. Click any hypothesis to see connected data, correlations, and reports.
AI agent via MCP — Connect Claude Code, Claude Desktop, or any MCP-compatible agent for deeper research, scripted workflows, and programmatic access. One config snippet, no API key setup required.
Who it's for
Decision-makers who understand that decisions should be backed by data — but don't have the technical skills or headcount to do rigorous analysis themselves.
- CEOs — "Is this market actually growing, or are we riding our own momentum?"
- CMOs — "Is our spend working, or is the market lifting everyone?"
- Founders — "Should we expand into this vertical? What does the data say?"
- Operations leaders — "Is this supply chain issue specific to us, or industry-wide?"
- PE/VC analysts — "Is this company's growth organic, or riding macro tailwinds?"
At $640/month, Lomita replaces the $150K/year hire you can't justify.
Pricing
| Tier | Price | Team | What you get |
|---|---|---|---|
| Standard | $640/mo | 3 members | 104+ data sources, autonomous agents, email + webhook delivery |
| Pro | $990/mo | 10 members | Everything in Standard + priority execution, advanced monitoring |
Each instance is a dedicated container. No shared resources, no noisy neighbors. Cancel anytime.
Quickstart
Sign up, ask a question, get a data-backed answer. Under 5 minutes.
1. Create an account
Go to lomita.io and sign in with Google, GitHub, Apple, or email.
2. Launch an instance
Click New Instance, pick your tier (Standard $640/mo or Pro $990/mo), and complete payment. Your dedicated container provisions automatically — about 3 minutes.
The dashboard shows "Provisioning..." with a spinner. When ready, the status changes to Active.
3. Start researching
You have two options:
Option A: Use the built-in chat (easiest)
Click Explore in the sidebar. Type your question in the chat at the bottom:
Research whether oil prices predict inflation
That's it. The agent team starts working immediately:
- Discovery Agent finds relevant data sources
- Integration Engineer builds data pipelines
- Quant Analyst runs statistical analysis
- Research Narrator writes an executive report
Watch the Hypotheses panel on the right — your research appears with a pulsing dot while agents work. When it turns green, yellow, or red, the verdict is in.
Option B: Connect your AI agent (for power users)
Go to Connect in the sidebar. Copy the config for your framework:
Claude Code:
claude mcp add --transport http lomita https://YOUR-INSTANCE-mcp.lomita.io/mcp
Your browser opens for authentication — no API key needed. Then ask:
Research whether consumer sentiment predicts stock market volatility
Claude Desktop: Download the config from the Connect page and import it.
Other frameworks: Any MCP-compatible agent works. Copy the JSON config from the Connect page.
4. Get your report
When research completes:
- On the Explore page — click "View Report" next to the hypothesis
- By email — tell the chat: "Deliver this report to [email protected]"
- To Slack — tell the chat: "Deliver this to our Slack channel" and provide the webhook URL
- Via your AI agent —
deliver(email: "[email protected]")
Reports include: executive summary, key findings with statistics, data sources used, methodology, limitations, and a clear verdict.
5. Keep going
- Ask follow-up questions with the hypothesis selected
- Click nodes on the knowledge graph to explore connections
- Connect your own data sources (CRM, ERP, any API) from the Sources page
- Set up continuous monitoring: "Monitor this weekly and alert me if anything changes"
No LLM setup required
Lomita includes a bundled AI model for all agent operations. You don't need to bring your own API key — research works immediately after connecting.
Advanced users can optionally override the default model with their own provider (Anthropic, OpenAI, Google Gemini) via the set_provider tool.
Using the Chat
The Explore page is the primary interface. It combines a knowledge graph, a hypothesis panel, and a chat — all in one view.
The layout
- Knowledge graph — the main canvas. Every data source, entity, and hypothesis is a node. Correlations and connections are edges. The graph grows as you research.
- Hypothesis panel (right side) — all your research questions with status. Click one to filter the graph to its connected data.
- Chat (bottom) — your research assistant. Ask questions, create hypotheses, request deliveries, analyze data.
How to use the chat
Type any question or command:
| What you say | What happens |
|---|---|
| "Research whether oil prices predict inflation" | Creates a hypothesis, agents start discovering data and running analysis |
| "What data do you have about energy?" | Searches the 104+ source catalog |
| "Deliver this report to [email protected]" | Emails the report for the selected hypothesis |
| "Why are these two nodes connected?" | Explains the statistical relationship using the graph data |
| "Connect my HubSpot" | Starts a custom integration — agents build the data pipeline |
Hypothesis panel
When you click a hypothesis in the right panel:
- The graph fades unrelated nodes and highlights connected data
- The chat gains context — it knows which hypothesis you're investigating
- The chat placeholder changes to "Ask about this hypothesis..."
- You can say "deliver this" or "what did you find?" without specifying which research
Click Show all to restore the full graph view.
Status indicators
| Dot color | Meaning |
|---|---|
| Blue (pulsing) | Agents are working — discovery, integration, analysis, or narration |
| Green | Hypothesis supported by data |
| Yellow | Inconclusive — some evidence but not definitive |
| Red | Hypothesis refuted by data |
Tips
- Shift+click nodes on the graph to multi-select and ask about relationships
- Search the graph using the search bar to find specific entities
- Click View Report on any completed hypothesis to see the full analysis
- The graph polls every 15 seconds — new nodes appear automatically as agents discover data
Connecting an AI Agent
The built-in chat on the Explore page handles most research needs. Connecting an external AI agent gives you additional power: scripted workflows, batch operations, and integration with your existing development tools.
Claude Code
Copy the command from the Connect page in your dashboard:
claude mcp add --transport http lomita https://YOUR-INSTANCE-mcp.lomita.io/mcp
Your browser opens for sign-in — no API key copy-paste needed. Start a new Claude Code session and Lomita tools are available.
Claude Desktop
Download the pre-filled config from the Connect page, or add manually:
Settings > Developer > Edit Config:
{
"mcpServers": {
"lomita": {
"type": "http",
"url": "https://YOUR-INSTANCE-mcp.lomita.io/mcp",
"headers": {
"Authorization": "Bearer YOUR_API_KEY"
}
}
}
}
Get your API key from Settings in the dashboard.
OpenCode / Other MCP frameworks
Use the JSON config from the Connect page. Any framework that supports HTTP MCP transport works:
{
"type": "http",
"url": "https://YOUR-INSTANCE-mcp.lomita.io/mcp",
"headers": {
"Authorization": "Bearer YOUR_API_KEY"
}
}
What your agent can do
Once connected, your agent has access to these tools:
| Tool | What it does |
|---|---|
research | Ask a question — agents find data and produce a report |
follow_up | Dig deeper into a previous finding |
status | Check progress on active research |
sources | Search available data sources |
connect_integration | Connect your own API, CRM, or database |
deliver | Send a report to email, Slack, or webhook |
monitor | Set up continuous monitoring with alerts |
upload | Add your own CSV, JSON, or Parquet data |
No LLM setup required
The research agent team uses a bundled AI model. You don't need to configure a provider — research works immediately.
If you want to use your own model (Anthropic, OpenAI, Gemini), you can optionally override with set_provider. This is not required.
Asking Research Questions
State a hypothesis. Lomita finds the data, runs the analysis, and delivers a verdict.
What happens when you ask a question
When you type "Research whether ERCOT energy demand correlates with Dallas weather" — either in the chat or via your AI agent — here's what happens behind the scenes:
1. Discovery (30-60 seconds)
The Discovery Agent searches for relevant data:
- Scans the 104-source catalog for matches (ERCOT, NOAA weather, EIA energy)
- Searches the internet via Exa for APIs not in the catalog
- Connects the best sources to your hypothesis
- Documents any gaps (e.g., "Uber ride data not publicly available — using TomTom traffic as proxy")
2. Integration (1-2 minutes)
The Integration Engineer builds data pipelines:
- Reads API documentation for each source
- Writes and deploys ingestion scripts (Dagster assets)
- Handles authentication and scheduling
- Verifies data is flowing into the data lake
3. Analysis (1-2 minutes)
The Quant Analyst runs statistical tests:
- Pearson and Spearman correlations
- Granger causality (does X predict Y?)
- Lag analysis (how many days/weeks of lead time?)
- Regime analysis (does the relationship change in different conditions?)
- OLS regression with confidence intervals
4. Report (30-60 seconds)
The Research Narrator compiles findings:
- Executive summary with a clear verdict
- Key findings table with statistics
- Data sources used with coverage dates
- Methodology description
- Limitations and caveats
- Recommendation
Total time: 3-10 minutes depending on complexity.
Verdicts
| Verdict | What it means |
|---|---|
| Supported (green) | Statistical evidence supports your hypothesis |
| Inconclusive (yellow) | Some evidence, but not enough for a definitive answer |
| Refuted (red) | Data contradicts your hypothesis |
Every verdict comes with the methodology to prove it. You can audit the statistics, check the data sources, and verify the reasoning.
Follow-up research
After a hypothesis completes, you can:
- Follow up — "Break this down by season" creates a child hypothesis that branches from the original
- Deliver — Send the report to email, Slack, or any webhook
- Monitor — "Watch this weekly and alert me if the correlation shifts"
- Connect more data — "Connect my internal sales data and re-analyze"
Tips for good questions
Good questions (specific, testable):
- "Does consumer sentiment predict retail sales with a lag?"
- "Are ERCOT energy prices affected by Dallas weather extremes?"
- "Does VIX volatility correlate with S&P 500 drawdowns?"
Weaker questions (too broad):
- "Tell me about the economy" — too vague, no testable hypothesis
- "What should I invest in?" — Lomita analyzes data, it doesn't give investment advice
The system works best with questions that have a clear independent variable and dependent variable.
Data Discovery
Lomita's primary capability is finding data you didn't know existed. The 104 pre-built sources are a starting point — the real product is the discovery engine.
How discovery works
When you create a research hypothesis, the Discovery Agent follows a three-step strategy:
Step 1: Search the catalog
The agent searches 104 pre-built data sources across 8 domains:
- Macro & Economics (FRED, BLS, Census, Treasury)
- Energy (EIA, ERCOT, Baker Hughes)
- Sentiment (Reddit, Polymarket)
- Commodities (CFTC, USDA, World Bank)
- Real Estate (Zillow, MTA)
- Transportation (TomTom, FAA)
- Weather (NOAA, NWS)
- Frontier (DeFi, crypto)
If the catalog has what you need, the agent connects the sources immediately.
Step 2: Search the internet
If the catalog doesn't cover your topic, the agent searches the internet using Exa for:
- Public APIs that provide the data
- Government datasets and open data portals
- Free data services with API access
When the agent finds a potential source, it creates a custom integration and the Integration Engineer builds a pipeline automatically.
Step 3: Connect custom / internal sources
For private data sources (your CRM, ERP, internal APIs):
- Tell the chat: "Connect my HubSpot" or "Connect our internal sales API"
- Provide API credentials when asked
- The Integration Engineer reads the API docs and builds the pipeline
- Credentials are encrypted with AES-256-GCM and stored securely
Intelligent substitution
When exact data isn't available, the Discovery Agent finds proxies:
- No Uber ride data? → Uses TomTom traffic congestion as a mobility proxy
- No direct competitor pricing? → Uses industry benchmark data from public filings
- Historical data unavailable? → Documents the gap and works with what's available
The agent always documents its reasoning — you can see why it chose specific sources and what gaps exist.
Your data stays yours
- Each tenant has an isolated data lake (MinIO S3)
- Scoped credentials prevent cross-tenant data access
- Custom integrations are encrypted at rest
- Data you connect is only visible to your team
The Knowledge Graph
Every hypothesis, correlation, and data source builds a persistent knowledge graph. Over time, this becomes your organization's decision map — a living record of what you've investigated and what the data says.
What's in the graph
| Node type | What it represents | Example |
|---|---|---|
| Hypothesis | A research question with a verdict | "Does ERCOT demand correlate with Dallas weather?" (Green) |
| Entity | A concept extracted from research | "Dallas Temperature", "ERCOT Load", "VIX Volatility" |
| Source | A data source used in research | "NOAA CDO Weather", "EIA Electricity Generation" |
| Edge type | What it means |
|---|---|
| CORRELATES_WITH | Statistical correlation between entities (includes r-value, p-value, method) |
| INVESTIGATED | A hypothesis examined this entity |
| USED_SOURCE | A hypothesis used data from this source |
| MENTIONED_IN | An entity appears in data from this source |
How it grows
The graph starts with 104 data source nodes (the catalog). As you research:
- Ask a question → a Hypothesis node appears
- Agents discover data → Source nodes connect to the hypothesis
- Analysis runs → Entity nodes are created from correlations
- Correlations found → CORRELATES_WITH edges with r-values connect entities
- More research → the graph densifies. Connections between past and current research emerge.
Why it matters
A CEO asked about interest rates and mortgage applications 3 months ago. Today, someone asks about housing starts and lumber prices. The graph already knows these are connected — interest rates link both investigations. The system doesn't re-discover what it already knows.
This is the anti-silo architecture. Marketing's research connects to Operations' research connects to Finance's research — through shared entities and data sources, not departmental boundaries.
Using the graph
- Click a hypothesis in the right panel to filter the graph to its connected nodes
- Shift+click nodes to multi-select and ask about relationships
- Search to find specific entities by name
- Hover to see node labels and types
- The graph polls every 15 seconds — new nodes appear as agents work
Report Delivery
Research is only useful if it reaches the people who make decisions. Lomita delivers reports to email, Slack, and any webhook endpoint.
Delivering a report
From the chat
With a hypothesis selected, say:
Deliver this report to [email protected]
Send this to our Slack channel
(Provide the Slack webhook URL when asked)
From your AI agent
deliver(email: "[email protected]", hypothesis_id: "abc-123")
deliver(webhook_url: "https://hooks.slack.com/services/...", hypothesis_id: "abc-123")
Without specifying a hypothesis
If you don't specify a hypothesis ID, Lomita delivers the most recently completed report.
Supported channels
| Channel | How to use | Format |
|---|---|---|
| Provide email address(es) | HTML email with executive summary and "View Full Report" link | |
| Slack | Provide webhook URL | Block Kit message with findings, status badge, and report link |
| Discord | Provide webhook URL | Rich embed with color-coded status |
| Microsoft Teams | Provide webhook URL | MessageCard with action button |
| Generic webhook | Provide any URL | JSON POST with X-Lomita-Signature (HMAC-SHA256) |
Auto-delivery
When research completes, the Research Narrator automatically delivers the report to the account email on file. You don't need to ask — it arrives in your inbox.
Viewing reports
Reports are also stored in your research repository:
https://YOUR-INSTANCE-git.lomita.io/lomita/research
Click View Report on any hypothesis in the dashboard or explore page to go directly to the report.
Report format
Every report follows the same structure:
- Question — the original hypothesis in plain language
- Answer — 1-2 sentence verdict with key numbers
- Key Findings — table of results with confidence levels
- Data Sources — what data was used, date ranges, observation counts
- Methodology — statistical methods, sample sizes, confidence intervals
- Limitations — caveats and data quality notes
- Recommendation — actionable next step based on findings
Continuous Monitoring
Set a hypothesis to re-analyze on a schedule. Get alerts when correlations shift, trends reverse, or your data signals change.
Setting up monitoring
From the chat
Monitor this weekly and send updates to [email protected]
From your AI agent
monitor(hypothesis_id: "abc-123", frequency: "weekly")
How it works
- You enable monitoring on a completed hypothesis
- On schedule (daily, weekly, or monthly), the Pipeline Monitor agent checks for due hypotheses
- The agent team re-runs the analysis against the latest data
- A delta report is generated: what changed, what stayed the same
- The report is delivered to all configured channels (email, Slack, webhook)
What you receive
When nothing changed
MONITORING UPDATE: STABLE
Your hypothesis "Do interest rates affect mortgage applications?"
remains SUPPORTED. No significant changes detected since last check.
Key metrics:
- Correlation: r = 0.67 (was 0.67) — unchanged
- Trend direction: still negative
The system always reports — even stable findings. Absence of change is a valuable signal.
When something changed
MONITORING UPDATE: STATUS CHANGED (Green → Yellow)
Your hypothesis "Do interest rates affect mortgage applications?"
has shifted from SUPPORTED to INCONCLUSIVE.
What changed:
- Correlation weakened: r = 0.67 → r = 0.34
- New data from Q2 shows divergence in the relationship
Status changes trigger an immediate alert outside the regular schedule.
Frequencies
| Frequency | When it runs |
|---|---|
| Daily | Every 24 hours |
| Weekly | Every 7 days |
| Monthly | Every 30 days |
Disabling monitoring
Stop monitoring this hypothesis
Or via your agent:
monitor(hypothesis_id: "abc-123", enabled: false)
Previous reports are preserved — only future checks stop.
Data Sources Overview
Lomita includes 104 pre-built data sources across 8 domains — but this is just the starting point. AI agents can also search the internet for sources not in the catalog and connect your private data (CRM, ERP, APIs). See Connecting Your Own Data.
Categories
Macro & Economics (27 sources)
FRED series covering rates, yields, spreads, monetary policy, and economic indicators.
| Source | What it measures | Update frequency |
|---|---|---|
| fred-dgs10 | 10-Year Treasury yield | Daily |
| fred-fedfunds | Federal funds rate | Daily |
| fred-sofr | Secured Overnight Financing Rate | Daily |
| fred-umcsent | University of Michigan Consumer Sentiment | Monthly |
| fred-vixcls | CBOE Volatility Index (VIX) | Daily |
| fred-dcoilwtico | Crude oil price (WTI) | Daily |
| fred-m2sl | M2 money supply | Weekly |
| fred-walcl | Fed balance sheet | Weekly |
| treasury-tga-balance | Treasury General Account | Daily |
| treasury-auction-results | Treasury bond auctions | As scheduled |
| bls-qcew-wages | Quarterly wages | Quarterly |
| census-retail-sales | Retail sales | Monthly |
Sentiment & Markets (10 sources)
Reddit, Polymarket, and market sentiment indicators.
| Source | What it measures |
|---|---|
| reddit-wallstreetbets | WSB post scores, engagement |
| reddit-retail-sentiment | Aggregated retail investor sentiment |
| reddit-stocks | r/stocks activity |
| reddit-investing | r/investing activity |
| polymarket-markets | Prediction market odds |
| alphavantage-sentiment | Market news sentiment |
| deribit-options | BTC/ETH options data |
Energy (8 sources)
EIA, ERCOT, and energy market data.
| Source | What it measures |
|---|---|
| eia-electricity-generation | US electricity generation |
| eia-petroleum-supply | Petroleum supply/demand |
| eia-steo-forecasts | Short-term energy outlook |
| ercot-settlement-prices | Texas electricity prices |
| ercot-rt-lmps | Real-time locational marginal prices |
Frontier / DeFi (5 sources)
Crypto, DeFi, and emerging data.
| Source | What it measures |
|---|---|
| defillama-tvl | Total value locked in DeFi |
| defillama-stablecoins | Stablecoin supply |
| coingecko-derivatives | Crypto derivatives data |
Transportation (4 sources)
Traffic, flights, and border crossings.
Commodities (3 sources)
CFTC, USDA, and World Bank data.
Weather (4 sources)
NOAA, NWS, and weather data.
Real Estate (3 sources)
Google Places, MTA, and property data.
How discovery works
Your agent calls discover_sources("your keywords") to search the catalog:
Agent: discover_sources("consumer sentiment")
→ Returns: fred-umcsent, reddit-retail-sentiment, alphavantage-sentiment, ...
Each source includes:
- Name and description
- Data domain and tags
- Update frequency
- Whether credentials are required
- Sample fields
Data format
- Raw data is stored as JSON in
s3://plantos-raw/{source}/{date}/ - Transformed data is stored as Parquet in
s3://plantos-analytics/marts/ - Query both using
analyze_datawith SQL
Custom sources
You can upload your own data:
"Upload my portfolio CSV to the data lake"
→ upload_data("portfolio.csv")
→ Returns a presigned URL for upload
Or create custom Dagster assets for new API sources:
"Create a custom asset to scrape gold prices from the LBMA API"
→ propose_asset("gold_lbma_price", python_code, description)
→ Opens a PR for review
Connecting Your Own Data
Lomita works with any data source — not just the 104 in the catalog. Connect your CRM, ERP, internal APIs, databases, or upload files. AI agents handle the integration automatically.
From the chat
Tell the chat what you want to connect:
Connect my HubSpot — I need deal pipeline and close dates
Connect our internal sales API at api.company.com
The Discovery Agent will ask for:
- API documentation URL (helps the agent understand endpoints)
- Authentication type (API key, OAuth, bearer token)
- Credentials (encrypted with AES-256-GCM, stored securely)
- What specific data you need
The Integration Engineer then reads the docs, builds a Dagster data pipeline, and starts ingesting data — all automatically.
From your AI agent
connect_integration(
service: "hubspot",
credentials: { api_key: "pat-na1-..." },
config: {
goal: "Pull deal pipeline data — amounts, stages, close dates",
docs_url: "https://developers.hubspot.com/docs/api"
}
)
From the Sources page
Click Connect via Form on the Sources page to fill in:
- Service name
- API credentials (key-value pairs)
- What data you need (plain English)
- API documentation URL (optional, helps the agent)
- Sync schedule (hourly, daily, weekly)
What happens next
- Credentials are encrypted and stored
- The Integration Engineer receives a task
- The agent reads the API documentation
- A Dagster data pipeline is created and deployed
- Data starts flowing into your private data lake
- The source becomes available for all future research
Uploading files
For one-off data (CSV, JSON, Parquet):
upload(filename: "sales-q1.csv", content: "...")
Or use the Sources page upload feature. Uploaded data is queryable immediately via the analytics engine.
Security
- Credentials encrypted with AES-256-GCM at rest
- Decrypted only at pipeline runtime by the Integration Engineer
- Stored in your tenant's isolated PostgreSQL database
- Never shared across tenants
- API key rotation supported via the Sources page
Example: ERCOT Energy vs Dallas Weather
A real research investigation run on Lomita, demonstrating data discovery, agent chain execution, and statistical analysis.
The question
"Does ERCOT energy demand correlate with Dallas weather extremes?"
What happened
Time: ~5 minutes from question to completed report
Discovery Agent (30 seconds)
Searched the catalog and found:
ercot-public-api— ERCOT locational marginal pricing and grid load datanoaa-cdo-weather— Historical temperature and weather for Dallaseia-electricity-generation— Real-time electricity generation by fuel typeeia-steo-forecasts— Short-term energy outlook
Integration Engineer (2 minutes)
Built data pipelines for each source. Three Integration Engineers ran in parallel — each building a separate pipeline.
Quant Analyst (2 minutes)
Ran statistical analysis:
- Pearson correlation between temperature and ERCOT load
- Granger causality tests (does temperature predict demand?)
- Lag analysis (how many hours of lead time?)
- Regime analysis (does the relationship differ in summer vs winter?)
Research Narrator (1 minute)
Compiled findings into an executive report.
The verdict
SUPPORTED (Green)
Key findings:
- Strong positive correlation between temperature extremes and ERCOT load (r = 0.67)
- Bidirectional Granger causality (p < 0.01) — temperature predicts demand
- Strongest effect at 1-hour lag in summer (AC load)
- Winter heating demand also significant but weaker
What the graph shows
After this research completed, the knowledge graph showed:
- Hypothesis node (green) connected to 4 source nodes
- Entity nodes: "Dallas Temperature", "ERCOT Load", "ERCOT LZ_NORTH Price"
- CORRELATES_WITH edges with r-values between entities
- Cross-domain connections (weather → energy) visible in the graph layout
Delivery
The report was auto-delivered to email and manually delivered to Slack — both via the chat:
Deliver this report to [email protected]
Try it yourself
This exact research can be reproduced on any Lomita instance. Go to the Explore page and type:
Research whether ERCOT energy demand correlates with Dallas weather extremes
Example: Treasury Yields vs VIX Volatility
This example shows a real research run on the Lomita platform. The entire process — from question to published report — ran autonomously.
The question
"Do rising 10-year Treasury yields predict VIX volatility changes?"
What happened
research("Does rising 10-year Treasury yields predict VIX volatility changes?")
The research team executed automatically:
-
Found FRED DGS10 (10-Year Treasury) and FRED VIXCLS (CBOE Volatility Index) — 9,084 daily observations from 1990 to 2026.
-
Verified data freshness and confirmed both series were aligned.
-
Ran 7 statistical tests:
- Pearson correlation (levels and changes)
- 2-week lag analysis (the critical test)
- Linear regression
- Regime analysis (high-yield vs low-yield environments)
- Structural break test (pre-2008 vs post-2008)
-
Compiled the findings into an executive report.
The result
REFUTED. The 2-week lag correlation is r = +0.009 — effectively zero.
| Finding | Result | Confidence |
|---|---|---|
| Pearson correlation (levels) | r = -0.085, R² = 0.007 | HIGH |
| Change-on-change | r = -0.157, R² = 0.025 | HIGH |
| 2-week lag (critical test) | r = +0.009 | HIGH |
| Regime: high-yield vs low-yield | Sign reversal (+0.13 vs -0.02) | MEDIUM |
| Structural break: pre vs post 2008 | Correlation weakened (-0.13 → -0.05) | MEDIUM |
Key insight
Even the strongest observed relationship (change-on-change r = -0.157) explains only 2.5% of variance. The regime analysis found an interesting sign reversal — in high-yield environments the correlation flips positive — but it's not reliable enough to be actionable.
Time to complete
The full analysis ran across several cycles (approximately 15-20 minutes from question to published report).
Report location
The full report with analysis artifacts was published to:
git.YOUR-INSTANCE.lomita.io/lomita/research/dgs10-vix-statistical-analysis/report.md
Try it yourself
After connecting your agent:
Set provider to zen with key YOUR-ZEN-KEY
Research whether consumer sentiment predicts retail sales with a 1-month lag
Tools Reference
When you connect your AI agent to Lomita, you see 9 tools. These are all you need — your research team handles everything else behind the scenes.
Your Tools
| Tool | What it does |
|---|---|
research | Ask a research question and get a data-backed answer |
follow_up | Dig deeper into a previous finding |
status | Check on your active research |
sources | Search 104+ data sources |
upload | Add your own data (CSV, JSON, Parquet) |
monitor | Watch a research question over time with scheduled re-analysis and alerts |
deliver | Send reports to email, Slack, Discord, Teams, Zulip, or any webhook |
set_provider | Configure your LLM provider |
list_providers | See which providers are configured |
pause_agents | Pause all research agents to save LLM tokens |
resume_agents | Resume paused agents |
How to use them
research
The primary tool. Ask any research question and the research team does the rest.
"Research whether consumer sentiment predicts retail sales growth"
What happens behind the scenes:
- Formalizes your research question
- Finds relevant data from 104+ sources
- Prepares and connects the data
- Runs statistical tests (correlation, lag analysis, regression, regime analysis)
- Compiles an executive report
- Report is published to your research repository
You get back a Research ID. Use status to track progress.
follow_up
Branch from an existing investigation to explore a related question.
"Follow up on research abc-123: break this down by pre-2020 vs post-2020"
Creates a follow-up investigation linked to the original, then runs the same research process.
status
Check progress on your research. Call with no arguments to see everything, or with a Research ID for details.
"Check the status of my research"
"What's the status of research abc-123?"
Returns: research status, connected data sources, recent activity, and a link to the report when complete.
sources
Search the data catalog to see what's available before starting research.
"What data sources do you have for energy markets?"
"Search for sentiment data"
Returns matching sources with descriptions, domains, and whether authentication is required.
upload
Add your own proprietary data to the data lake. Returns a presigned upload URL.
"Upload my-sales-data.csv"
Your data is stored privately and becomes available for analysis alongside the 104 public sources.
monitor
Set up continuous monitoring on a research question so it is automatically re-analyzed on a schedule.
"Monitor research abc-123 weekly and send results to my email and Slack"
Parameters:
| Parameter | Description |
|---|---|
hypothesis_id | Research ID — the question to monitor |
frequency | How often to re-analyze: daily, weekly, or monthly |
channels | Where to deliver results: email, webhook, or both |
enabled | Set to false to disable monitoring |
How monitoring works:
- On each scheduled cycle, the research team re-runs the full analysis against the latest data.
- Reports are always delivered — even when findings are stable. Absence of change is valuable information.
- If a research status flips (e.g., SUPPORTED to INCONCLUSIVE), an immediate alert is sent outside the normal schedule.
- Updated reports are committed to your research repository as new versions. Use
git diffto see exactly what changed between cycles.
To disable monitoring:
"Stop monitoring research abc-123"
See Continuous Monitoring for a full walkthrough.
deliver
Send a completed report to email, a webhook, or both. Auto-detects the platform from the webhook URL.
"Deliver report abc-123 to [email protected]"
"Deliver report abc-123 to https://hooks.slack.com/services/T00/B00/xxx"
"Deliver report abc-123 to [email protected] and https://discord.com/api/webhooks/123/abc"
Supported destinations:
| Destination | How to specify |
|---|---|
| Provide an email address | |
| Slack | Provide a Slack incoming webhook URL |
| Discord | Provide a Discord webhook URL |
| Microsoft Teams | Provide a Teams incoming webhook URL |
| Zulip | Provide a Zulip webhook URL |
| Generic webhook | Provide any HTTPS URL |
The platform is auto-detected from the URL — no extra configuration needed. You can provide both an email and a webhook URL in a single call.
set_provider
Configure the LLM that powers your research agents. This is typically the first thing you do after connecting.
"Set provider to zen with key sk-abc123..."
"Configure anthropic with my API key"
Supported providers: Anthropic, OpenAI, Google (Gemini), DeepSeek, Zen (OpenCode), Groq, Mistral, xAI, OpenRouter, Cerebras.
Zen tiers:
| Tier | What you get |
|---|---|
zen | Standard pay-per-token via OpenCode |
zen-go | Unlimited subscription — uses Kimi K2.6 and DeepSeek V4 Pro with no per-token charges |
To use the unlimited tier:
"Set provider to zen-go with my subscription key"
When you set a provider, your research team is automatically configured and ready to work.
list_providers
See which LLM providers you've configured.
"List my configured providers"
pause_agents
Pause all research agents to stop LLM token consumption. Useful when you're done for the day or want to review results before continuing.
"Pause all agents"
Any in-progress work is suspended gracefully. Agents can be resumed at any time.
resume_agents
Resume previously paused agents to continue research.
"Resume agents"
Agents pick up where they left off.
What you don't see
Behind your 9 tools, there are 44+ internal tools that the research team uses. These handle data connections, SQL queries, code review, task management, and coordination. You never need to touch them — the research tool orchestrates everything automatically.
Reports
When research completes, the report is published to your research repository:
https://YOUR-INSTANCE-git.lomita.io/lomita/research
Each investigation gets its own directory with:
report.md— the executive report (rendered as formatted markdown)analysis/— SQL queries and Python scripts useddata-sources.md— what data was used
The repository's README shows an auto-generated index of all your research with status and links.
FAQ
How much does it cost?
| Tier | Price | Team | What's included |
|---|---|---|---|
| Standard | $640/mo | 3 members | Dedicated container, 104+ data sources, autonomous agents, email + webhook delivery |
| Pro | $990/mo | 10 members | Everything in Standard + priority execution, advanced monitoring |
No additional API key or LLM costs — the research agent team's AI model is included.
Do I need to set up an AI provider?
No. Lomita includes a bundled AI model for all agent operations. Research works immediately after connecting — no API key setup required.
Advanced users can optionally override the default model with their own provider (Anthropic, OpenAI, Google Gemini) via the set_provider tool.
Do I need an AI agent to use Lomita?
No. The built-in chat on the Explore page is the primary interface. You can ask research questions, connect data sources, and receive reports entirely from your browser.
Connecting an external AI agent (Claude Code, Claude Desktop, etc.) is optional — it gives you programmatic access, scripted workflows, and deeper integration.
Can Lomita find data I don't have?
Yes. This is the primary product capability. When you ask a question, AI agents:
- Search the 104-source catalog
- Search the internet for APIs and datasets not in the catalog
- Connect to your private data sources (CRM, ERP, APIs) if you provide credentials
The agents build data pipelines automatically — you don't need to know where the data lives.
Can I connect my own data?
Yes. Tell the chat "Connect my HubSpot" or go to the Sources page. Provide credentials and describe what data you need. The Integration Engineer reads the API docs and builds the pipeline.
Supported: any REST API, CRM, database, or file upload (CSV, JSON, Parquet). Credentials are encrypted with AES-256-GCM.
Can I deliver reports to Slack, email, or Teams?
Yes. From the chat:
Deliver this report to [email protected]
Send this to our Slack channel (provide webhook URL)
Supported: Email (Resend), Slack, Discord, Microsoft Teams, Zulip, and any webhook endpoint.
Reports are also auto-delivered to your email when research completes.
How does monitoring work?
Enable monitoring on any completed hypothesis:
Monitor this weekly and alert me if anything changes
Lomita re-runs the analysis on schedule (daily, weekly, monthly) and delivers delta reports. You always get a report — even when nothing changed. If the verdict flips (e.g., SUPPORTED to INCONCLUSIVE), you get an immediate alert.
Where are my reports?
Reports are accessible from:
- Overview dashboard — click "View Report" on any completed hypothesis
- Explore page — click "View Report" in the hypothesis panel
- Research repository —
https://YOUR-INSTANCE-git.lomita.io/lomita/research - Email — auto-delivered on completion
Can my team access the same instance?
Yes. Standard supports 3 team members, Pro supports 10. Invite teammates from the Team page — they get their own API key and can access the same research, data sources, and knowledge graph.
What happens if I cancel?
You keep access through the end of your billing period. Your data, reports, and research history are preserved. You can resubscribe anytime.