Skip to content

[kit] [mission-possible]Add EDA Copilot — AI-powered Exploratory Data Analysis agent [agentkit-challenge]#138

Open
A-b-h-a-y-0-2 wants to merge 3 commits intoLamatic:mainfrom
A-b-h-a-y-0-2:main
Open

[kit] [mission-possible]Add EDA Copilot — AI-powered Exploratory Data Analysis agent [agentkit-challenge]#138
A-b-h-a-y-0-2 wants to merge 3 commits intoLamatic:mainfrom
A-b-h-a-y-0-2:main

Conversation

@A-b-h-a-y-0-2
Copy link
Copy Markdown

@A-b-h-a-y-0-2 A-b-h-a-y-0-2 commented Apr 14, 2026

Contribution Type

  • Kit (kits/agentic/eda-copilot/)

What is EDA Copilot?

Upload any CSV. Get instant AI-powered Exploratory Data Analysis in seconds — schema detection, statistical profiling, correlation analysis, outlier flagging, and ML readiness scoring — all powered by a 3-stage Lamatic.ai flow pipeline.

Problem Statement: Data scientists spend 60–80% of project time on EDA before any modeling begins. EDA Copilot automates the entire process: from raw CSV to actionable ML-ready insights, in one click.


How It Works

Three sequential Lamatic.ai flows form the agent pipeline:

Stage Flow What it does
1 eda-schema-analysis Detects column types, missing data, scores data quality 0–100
2 eda-statistical-insights Interprets distributions, correlations, outliers, class imbalances
3 eda-ml-readiness ML readiness score, preprocessing checklist, pitfall warnings

Key architectural decision: All heavy computation (mean, std, quartiles, Pearson correlation, skewness) runs client-side in the browser using a pure TypeScript EDA engine. Only a compact JSON summary is sent to Lamatic flows — keeping token usage low and making the app fast even on 100k+ row datasets.


Features

  • Drag-and-drop CSV upload
  • Automatic schema detection and data type inference (numeric, categorical, datetime, boolean)
  • Statistical summary table (mean, std, min/max, quartiles, skewness)
  • Missing data bar chart with severity colour coding
  • Top correlation cards with strength labels
  • AI-generated schema analysis, statistical narrative, and ML readiness report
  • Downloadable .md report
  • One-click Vercel deploy

File Structure

kits/agentic/eda-copilot/
├── app/
│ ├── api/analyze/route.ts # Lamatic SDK gateway (3 sequential flows)
│ ├── page.tsx # Main UI — upload, progress, results
│ ├── layout.tsx
│ └── globals.css
├── actions/
│ └── orchestrate.js # Flow definitions & SDK config
├── components/
│ ├── FileUpload.tsx # Drag-and-drop CSV uploader
│ ├── DataPreview.tsx # Column chips + sample table
│ ├── AnalysisProgress.tsx # Live 3-step pipeline tracker
│ └── AnalysisResults.tsx # Charts + AI markdown panels
├── lib/
│ └── utils.ts # Client-side EDA engine
├── flows/
│ ├── eda-schema-analysis/ # config.json, inputs.json, meta.json, README.md
│ ├── eda-statistical-insights/
│ └── eda-ml-readiness/
├── config.json
├── .env.example
└── README.md


PR Checklist

  • Kit (kits/agentic/eda-copilot/)
  • PR is for one project only (no unrelated changes)
  • No secrets, API keys, or real credentials committed — .env.local is gitignored
  • Folder name uses kebab-case: eda-copilot
  • All changes documented in README.md (purpose, setup, env vars, usage, architecture)
  • config.json present with valid metadata (name, description, tags, steps, author, env keys)
  • All 3 flows in flows/<flow-name>/ each include config.json, inputs.json, meta.json, README.md
  • .env.example with placeholder values only
  • npm install && npm run dev works locally ✅ (Next.js build passes with 0 errors)
  • PR title is clear with [kit] prefix

Tech Stack

  • Framework: Next.js 15, TypeScript
  • AI SDK: lamatic npm package (executeFlow method)
  • CSV Parsing: PapaParse (client-side)
  • Charts: Recharts
  • Styling: Tailwind CSS

Environment Variables

Variable Description
LAMATIC_API_URL Your Lamatic project endpoint (e.g. https://org-project.lamatic.dev)
LAMATIC_PROJECT_ID Your Lamatic project ID
LAMATIC_API_KEY Your Lamatic API key
EDA_SCHEMA_ANALYSIS_FLOW_ID Flow ID for schema analysis (Flow 1)
EDA_STATISTICAL_INSIGHTS_FLOW_ID Flow ID for statistical insights (Flow 2)
EDA_ML_READINESS_FLOW_ID Flow ID for ML readiness (Flow 3)

A-b-h-a-y-0-2 and others added 3 commits April 14, 2026 22:17
- Replace raw fetch() with lamatic npm SDK (executeFlow method)
- Lazy-init Lamatic client to fix build-time env var error
- Update package.json to include lamatic@^0.3.2
- Update .env.example with correct endpoint URL format
- Update orchestrate.js to reflect SDK-based flow definitions
Each flow now has its own directory containing all 4 required files:
- config.json  (Lamatic flow node graph)
- inputs.json  (input schema definition)
- meta.json    (name, description, author, tags, test inputs)
- README.md    (setup instructions and I/O docs)

Flows restructured:
- flows/eda-schema-analysis/
- flows/eda-statistical-insights/
- flows/eda-ml-readiness/

Also: update tsconfig.json with ES2017 target
@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai bot commented Apr 14, 2026

Important

Review skipped

Auto reviews are limited based on label configuration.

🏷️ Required labels (at least one) (1)
  • agentkit-challenge

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Repository UI (base), Organization UI (inherited)

Review profile: ASSERTIVE

Plan: Pro

Run ID: 82dc0eb6-971d-4681-9131-ea10c36365b6

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

  • 🔍 Trigger review
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@A-b-h-a-y-0-2 A-b-h-a-y-0-2 changed the title [kit] []Add EDA Copilot — AI-powered Exploratory Data Analysis agent [agentkit-challenge] [kit] [mission-possible]Add EDA Copilot — AI-powered Exploratory Data Analysis agent [agentkit-challenge] Apr 14, 2026
@A-b-h-a-y-0-2
Copy link
Copy Markdown
Author

I can't add a label!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant