Data Profiling

Data profiling is the first step in understanding your tables and identifying any anomalies. Cocoon uses LLMs to bring semantic insights to your profile.
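
To make the idea concrete, here is a minimal sketch of the purely statistical side of profiling: the fraction of missing values and the number of distinct values per column. It is not Cocoon's implementation and does not include the LLM-based semantic insights. The function name profile_rows and the sample data are hypothetical, chosen only for illustration.

```python
import csv
import io


def profile_rows(rows):
    """Compute the missing fraction and distinct-value count for each column.

    rows is a list of dicts (one per record); empty strings count as missing.
    """
    if not rows:
        return {}
    profile = {}
    for name in rows[0].keys():
        values = [row[name] for row in rows]
        present = [v for v in values if v not in ("", None)]
        profile[name] = {
            "missing_fraction": 1 - len(present) / len(values),
            "distinct_values": len(set(present)),
        }
    return profile


# Hypothetical sample data, for illustration only.
SAMPLE_CSV = """city,pm25
Paris,12.0
Lima,35.5
,20.1
Lima,
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
profile = profile_rows(rows)
for column, stats in profile.items():
    print(column, stats)
```

A high missing fraction or an unexpectedly low distinct count is the kind of anomaly a profile is meant to surface; interpreting what a column means is where a semantic layer would come in.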

Table Profiling GIF

Get Started

Online Service

Quick Trial

Drop in your CSV file, and we will generate the profile in about 10 minutes.

Limits: 1 MB file size and 40 columns

Open Source

Full Features

Cocoon is open source. Try it out in Google Colab.

This requires access to an LLM API (e.g., GPT-4, Claude 3, Gemini Ultra, or a local LLM) but offers an interactive experience with no size or column limits. It also supports databases (e.g., Snowflake and DuckDB).

Need support or have questions? Contact Us

Gallery

More example profiles, generated from Kaggle datasets

Air Quality

The table lists air quality information for different cities in various countries.

Profile

Property Sales

The table contains information about residential property sales.

Profile

Animal Shelter Cats

The table is about cats in an animal shelter.

Profile

Book Details

The table lists books with details.

Profile

Breast Cancer Diagnosis

The table is about breast cancer diagnosis.

Profile

Divorced Individuals

The table contains records of divorced individuals.

Profile

Credit Risk Assessment

The table is about credit risk assessment.

Profile

Korean TV Series

The table contains information about Korean TV series.

Profile

Used Cars

The table contains information about used cars.

Profile