NEW: AI DQ Rule suggestions, now built in

You don't need expensive tools
to understand your data.

Know exactly where your data is broken. Get instant data quality insights in your browser — no setup, no code, no months-long onboarding.

No credit card · Free forever for solo · Data stays in your browser
app.sohovi.com/dashboard
Live
customer_data.csv
128,447 rows · 23 columns · Profiled 3s ago
DQ Score 87/100
Overall Health
Dimensions
Completeness: 94
Validity: 88
Uniqueness: 72
Accuracy: 95
Consistency: 81
Integrity: 66
8-week trend
⚠ Null rate spike
phone col · 12.4%
🔒 PII detected
email, phone columns

Built for data-sensitive teams in

Finance & Banking · Healthcare · E-commerce · Logistics & Supply Chain · Marketing Agencies · SaaS & Tech · Consulting · Non-profit
10
ISO-standard quality dimensions
<0 sec
From upload to first score
0 bytes
Data sent to any server
$0
To get started

The Problem

Most data quality tools weren't built for your team.

The traditional approach to data quality is slow, expensive, and assumes you have a dedicated IT department. It also requires your data to travel to external servers before you've run a single check.

Months to first value

Traditional tools require scoping, procurement, consultants, and months of implementation before a single dataset is scored.

Data uploaded to third-party servers

Most DQ platforms require you to connect data sources directly or upload files to their cloud — before you have established trust.

Pricing designed for 1,000-person orgs

Annual contracts, per-seat licensing, and module add-ons make professional data quality inaccessible to teams that just need to trust their spreadsheets.

There's a different way.

The Sohovi Way
$0/month to get started

Pro plans from $29/month when you need more

  • Works in 30 seconds — no setup, no onboarding call
  • All processing runs inside your browser. No data ever sent.
  • Full 10-dimension DQ scoring from day one
  • Transparent pricing — what you see is what you pay
  • Built for teams of 1 to 50
Start Free — No Credit Card

Platform Capabilities

Everything your team needs to measure and improve data quality.

Instant Understanding

See everything about your data before you write a single rule.

Drop any CSV or Excel file. Sohovi reads every column — null rates, type distributions, unique value counts, outliers, and PII flags — computed entirely client-side, delivered in seconds.

  • Null rates, value distributions, and top-N values per column
  • Automatic type inference: email, phone, date, ID, numeric
  • PII detection — flags emails, phone numbers, SSNs before you share data
  • Statistical outlier detection using ±3σ analysis
CSV · Excel · 100K+ rows · Instant
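
As a rough illustration of the profiling metrics listed above, here is a minimal Python sketch of per-column null rate, unique rate, and ±3σ outlier detection. The function names are hypothetical and this is not Sohovi's actual implementation (which runs in the browser). It assumes empty strings count as nulls and uses the population standard deviation.

```python
from statistics import mean, pstdev


def profile_column(values):
    """Return null rate and unique rate (both in percent) for one column.

    Assumption: None and the empty string are both treated as null.
    """
    total = len(values)
    non_null = [v for v in values if v not in (None, "")]
    null_rate = 100 * (total - len(non_null)) / total if total else 0.0
    unique_rate = 100 * len(set(non_null)) / len(non_null) if non_null else 0.0
    return {"null_rate": round(null_rate, 1), "unique_rate": round(unique_rate, 1)}


def three_sigma_outliers(numbers):
    """Return the values lying more than 3 standard deviations from the mean."""
    mu = mean(numbers)
    sigma = pstdev(numbers)  # population standard deviation (an assumption)
    if sigma == 0:
        return []  # all values identical, so nothing can be an outlier
    return [x for x in numbers if abs(x - mu) > 3 * sigma]


email_profile = profile_column(["a@x.com", "", "b@x.com", "a@x.com"])
age_outliers = three_sigma_outliers([30] * 20 + [400])
```

One caveat of the ±3σ approach is that a single extreme value also inflates the standard deviation, so very small columns may never flag anything.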

customer_data.csv — profiling results

email (Email, PII): Null 2.3%, Unique 98.1%
customer_id (ID): Null 0%, Unique 100%
phone (Phone, PII): Null 8.7%, Unique 91.2%

Measurable Quality

A score you can show your team — and explain in plain English.

Every DQ score is built from 10 ISO-standard dimensions. You see exactly which rules passed or failed, how many rows are affected, and why. No black-box output.

  • 10 dimensions: Completeness, Validity, Uniqueness, Accuracy, Consistency, and more
  • Column-level, dataset-level, and catalog-level scores
  • Drill into any failing rule to see the exact affected rows
  • Score history tracked across every run
10 Dimensions · Transparent · Drillable · Historical

Live DQ Score

DQ Score
Completeness: 94
Validity: 88
Uniqueness: 72
Accuracy: 95

Intelligent Rules

Rules that already know what kind of column they're looking at.

Sohovi's built-in ML engine reads your column names and data patterns to suggest the right quality rules before you ask. Accept, edit, extend, and reuse across any dataset.

  • Auto-detects: email → regex rule, ID → uniqueness rule, date → format rule
  • No external API calls — ML runs entirely in your browser
  • Save rules as reusable templates across datasets
  • Custom rules: regex, ranges, lookups, cross-column checks
Auto-Suggest · Custom Rules · 10 Rule Types · Reusable
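
To make the rule types above concrete, the following Python sketch shows one way not-null, regex, uniqueness, and range rules could be evaluated so that each returns the indices of the failing rows. All names here, including the deliberately loose `EMAIL_RE` pattern, are assumptions for illustration and not Sohovi's real rule engine.

```python
import re

# Deliberately loose email pattern; a hypothetical stand-in, not Sohovi's regex.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def not_null(values):
    """Indices of rows whose value is missing (None or empty string)."""
    return [i for i, v in enumerate(values) if v in (None, "")]


def regex_match(values, pattern):
    """Indices of non-null rows that do not match the compiled pattern."""
    return [
        i for i, v in enumerate(values)
        if v not in (None, "") and not pattern.match(str(v))
    ]


def unique_column(values):
    """Indices of rows that repeat a value seen earlier in the column."""
    seen, failing = set(), []
    for i, v in enumerate(values):
        if v in seen:
            failing.append(i)
        seen.add(v)
    return failing


def range_check(values, low, high):
    """Indices of non-null rows outside the inclusive range [low, high]."""
    return [i for i, v in enumerate(values) if v is not None and not (low <= v <= high)]


bad_emails = regex_match(["a@x.com", "bad-email", ""], EMAIL_RE)
duplicate_ids = unique_column([1, 2, 2, 3, 1])
bad_ages = range_check([34, -1, 120, 130, None], 0, 120)
```

Returning row indices rather than a bare pass/fail flag is what makes a drill-down into the exact affected rows possible.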

Suggested Rules

ML SUGGESTED
email
not_null — HIGH confidence
regex_match — email format
customer_id
unique_column — no duplicates
age
range_check — 0 ≤ age ≤ 120
not_null
Apply All Suggestions →

Continuous Monitoring

Track how your data quality changes — before it breaks production.

Run Sohovi weekly against the same dataset. Watch your DQ score improve over time, catch regressions early, and get notified the moment quality drops below your threshold.

  • Historical trend charts across every run
  • Schema drift detection — new columns, removed columns, renames
  • Anomaly alerts — null spikes, pattern changes, value range shifts
  • Set per-asset score thresholds
Trend Charts · Schema Drift · Anomaly Detection · Threshold Alerts
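
The monitoring checks above can be sketched in a few lines of Python. This is a simplified illustration with hypothetical function names and a hypothetical `tolerance` parameter: it compares column sets between two runs and flags a null-rate jump against the average of earlier runs. Rename detection is not covered; in this sketch a renamed column simply shows up as one removed and one added column.

```python
def schema_drift(previous_columns, current_columns):
    """Report columns added or removed between two runs of the same dataset."""
    prev, curr = set(previous_columns), set(current_columns)
    return {"added": sorted(curr - prev), "removed": sorted(prev - curr)}


def null_rate_spike(history, latest, tolerance=5.0):
    """True when the latest null rate (percent) exceeds the average of
    earlier runs by more than `tolerance` percentage points."""
    if not history:
        return False  # no baseline yet, so nothing to compare against
    baseline = sum(history) / len(history)
    return latest - baseline > tolerance


drift = schema_drift(
    ["email", "phone", "age"],
    ["email", "phone_number", "age", "country"],
)
phone_spike = null_rate_spike([8.1, 8.7, 8.4], 12.4, tolerance=3.0)
```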

DQ Score — Last 8 Weeks

[Chart: DQ score per week, W1 through W8]

Anomaly Detected

null_rate spike in column “phone” — 2 days ago

How It Works

From file to insight in four steps.

No integration setup. No account configuration. Just your file and your answers.

01 · 5 sec

Upload your file

Drop a CSV or Excel file. Sohovi reads it locally — nothing is sent anywhere.

02 · 10–30 sec

Profile your columns

Automatic deep profiling of every column. Null rates, types, PII flags, outliers.

03 · 1–2 min

Set your quality rules

Accept ML suggestions or write your own. Regex, ranges, uniqueness, lookups.

04 · Ongoing

Score and track

Run your rules. Get a DQ score. Come back next week and watch it improve.

Use Cases

Built for every data team.

From marketing to finance, here's how teams use Sohovi to improve data quality, catch errors before they spread, and deliver cleaner data — without writing a single line of code.

Marketing & Revenue Ops

Clean CRM exports, lead lists, and email files before campaigns. Catch invalid emails, duplicate contacts, and missing fields that tank deliverability and waste ad spend.

Analytics & BI Teams

Validate datasets before they feed your dashboards. Score every file for completeness, uniqueness, and consistency — so your reports and KPIs are built on clean, trustworthy data.

E-commerce & Product

Audit product catalog exports for missing descriptions, invalid prices, and duplicate SKUs. Find issues before they reach customers or break your PIM or ERP sync.

HR & People Operations

Verify HRIS exports and applicant data are complete, consistently formatted, and free of duplicates before syncing to payroll, ATS, or benefits platforms.

Finance & Compliance

Validate transaction records, expense reports, and financial exports for accuracy and completeness. Generate explainable DQ scores to support audits and regulatory reporting.

Freelancers & Consultants

Deliver clean data to clients as a standard part of every engagement. Run a data quality audit in minutes, export a score report, and add credibility to every deliverable.

Privacy Architecture

Your data never leaves your browser. That's the architecture.

This isn't a marketing tagline — it's how the product is engineered. All file reading, profiling, and scoring runs inside Web Workers in your browser tab.

How it actually works

  1. You select a file from your device
  2. Sohovi reads it using the browser File API, with no network request
  3. All processing runs in a Web Worker, isolated and off the main thread
  4. Only DQ scores, rule metadata, and aggregated summaries are ever stored

Don't take our word for it. Open DevTools → Network tab while running an analysis. You'll see zero outbound requests for your data.

No Server Upload

Raw data never leaves your machine. File reading happens entirely via the browser File API.

Client-Side Processing

All computation — profiling, scoring, rule evaluation — runs in Web Workers inside your browser tab.

No Cloud Storage

Your rows are never written to any database. Only DQ scores and rule metadata are stored.

GDPR-Friendly by Design

No PII is transmitted. No consent is required for the processing layer.

Zero Breach Risk

None of your raw rows are on our servers to breach. Your data exists only in your browser session.

PII Detection Built In

Identify sensitive columns — emails, phones, SSNs — before you share results with anyone.

Customer Stories

Teams that ship cleaner data with Sohovi.

From freelance consultants to 50-person data teams.

Cut manual data checking from 3 hours to 5 minutes

We used to spend Monday mornings manually reviewing CRM exports before any reporting could start. Sohovi reduced that to a five-minute check. The PII detection flag alone prevented a compliance incident we never saw coming.

Priya Sharma
Data Analyst · FinTech Startup
Caught a format inconsistency clients hadn't noticed for months

The AI DQ Rule suggestions are genuinely impressive. It flagged that my postal_code column had mixed formats — a problem that had been silently affecting address matching for months. My clients now ask for a Sohovi DQ report with every delivery.

Marcus Webb
Freelance Data Consultant · Independent
Legal approved it in one day — zero compliance concerns

The deciding factor was knowing our customer data stays on our machine. Legal reviewed the architecture and approved it in a single day. The DQ score trend chart has become part of our weekly ops review.

Kavita Nair
Operations Lead · E-commerce

Pricing

Simple, transparent pricing.

Start free. Scale when you need to. No hidden fees.

Monthly · Annual

Free

$0/forever

Perfect for individuals exploring data quality.

Get Started Free
  • 5 data assets
  • Unlimited profiling runs
  • 5 DQ rules per asset
  • Basic scoring (column + dataset level)
  • CSV and Excel support
  • 7-day run history
Most Popular

Pro

$29 per month

For freelancers and small teams who need more power.

  • Unlimited data assets
  • Unlimited DQ rules
  • Full 10-dimension scoring
  • AI DQ Rule suggestions
  • Historical trend charts
  • Workflows & automation
  • Alerts & anomaly detection
  • PDF/Excel report export
  • 90-day run history
  • PII detection

Business

$59 per month

For data teams managing multiple catalogs and business units.

7-day free trial · no credit card required

  • Everything in Pro
  • Unlimited business units
  • Catalog-level DQ scoring
  • Cross-column validations
  • Remediation + export cleaned file
  • Rule testing sandbox
  • Ownership & stewardship fields
  • Lineage & context metadata
  • Priority support
  • Unlimited run history

All plans include 100% client-side processing — your data never leaves your browser.

FAQ

Questions we hear most often.

Still have questions? Email us.

This deserves a direct answer. When you drop a file into Sohovi, it's read using the browser's File API, so no network request is made. The actual analysis runs inside a Web Worker, a background browser thread that keeps the work off the main UI thread. Web Workers are technically able to make network requests, so the guarantee comes from Sohovi's code, which never sends file contents, rather than from the worker itself. You can verify this yourself: open DevTools → Network tab → drop a file in Sohovi. You'll see zero requests containing your file data. The only network calls Sohovi makes are to save your DQ scores and rule definitions, never your actual rows.
Most established DQ platforms are built for enterprise IT departments — they require months of implementation, dedicated ops teams, and significant annual budgets. Sohovi is built for data analysts and small teams who need to trust their data now. You can be scoring your first dataset in under 10 minutes with zero setup. We cover 90% of real-world DQ use cases at a fraction of the cost, and your data never has to leave your environment.
Excel can do basic checks, but it doesn't give you a structured quality score you can track over time, automatically suggest rules based on your column types, or detect PII and schema drift. Sohovi gives you a reproducible, comparable DQ score across runs — so you can say 'our data quality improved from 74 to 89 over 8 weeks' and back it up with a report. Rules also follow your column names automatically when they change.
Sohovi accepts CSV and Excel (.xlsx, .xls) files up to 200 MB. For very large files, Sohovi automatically samples up to 100,000 rows for profiling while still applying rules across all rows. JSON and Parquet support are on the roadmap. You can also connect live data sources (Google Sheets, Airtable, REST APIs) on the Business plan.
The Free plan is genuinely useful for individuals. You get 5 data assets, unlimited profiling runs, 5 DQ rules per asset, and 7-day run history. It's not a 14-day trial — it's free forever for solo use. When you need unlimited assets, AI DQ Rule suggestions, PDF reports, and full trend history, that's when Pro ($29/mo) makes sense.
Yes — Business includes a 7-day free trial with no credit card required. You get full access to all Business features: unlimited business units, team collaboration, cross-column validations, remediation tools, and priority support. If you don't upgrade by the end of the trial, you move to the Free plan automatically — no charges.
Each of the 10 DQ dimensions has a defined formula. Completeness = (non-null rows / total rows) × 100. Validity = (rows matching your rule / total rows) × 100. Every score shows exactly which rule was applied and how many rows failed — there are no proprietary algorithms or opaque weighting. The overall score is a weighted average of all dimension scores. It's transparent math you can verify.
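
The formulas in the answer above translate directly into code. The Python sketch below implements the stated Completeness and Validity formulas and a weighted average for the overall score. The function names and the example weights are hypothetical, since the page does not publish the actual weights.

```python
def completeness(values):
    """Completeness = (non-null rows / total rows) x 100."""
    total = len(values)
    non_null = sum(1 for v in values if v not in (None, ""))
    return 100 * non_null / total if total else 0.0


def validity(values, rule):
    """Validity = (rows matching the rule / total rows) x 100.

    `rule` is any function that returns True for a valid value.
    """
    total = len(values)
    passing = sum(1 for v in values if rule(v))
    return 100 * passing / total if total else 0.0


def overall_score(dimension_scores, weights):
    """Weighted average of dimension scores; the weights are illustrative only."""
    total_weight = sum(weights[d] for d in dimension_scores)
    weighted_sum = sum(dimension_scores[d] * weights[d] for d in dimension_scores)
    return weighted_sum / total_weight


scores = {
    "completeness": completeness(["a", None, "b", "c"]),       # 3 of 4 non-null
    "validity": validity([5, 150, 30, -2], lambda v: 0 <= v <= 120),  # 2 of 4 in range
}
equal_weighted = overall_score(scores, {"completeness": 1, "validity": 1})
```

With equal weights the overall score is the plain mean of the dimension scores; raising one weight pulls the overall score toward that dimension.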
Yes — and this is Sohovi's strongest compliance story. Because your raw data never leaves your browser, there is no data transfer to our servers, which means Sohovi doesn't become a data processor under GDPR for your customer data. We only store DQ metadata (scores, rule configs). Legal teams in healthcare, finance, and HR typically approve Sohovi in 24 hours because there's simply nothing to audit on our end.
You can export your rules as JSON anytime. Your run history and score data are available in your account until you delete them. We don't hold your data hostage — there's an export button in every section.

Start measuring your data quality today.

Free forever for individual use. Pro plans from $29/month when you need more.

No credit card required  ·  100% browser-based  ·  Cancel anytime