Skip to main content
Platform-Specific Data Quality

How to Maintain Data Quality in Google Sheets

Google Sheets is powerful and flexible — which means it's easy for data quality to slip without any guardrails. Here's how to add structure and quality controls to your spreadsheet-based data.

Google Sheets doesn't enforce data quality by default. Anyone with edit access can type anything into any cell. Dates can be stored as text or as dates, inconsistently. Categorical values can be entered as "Active," "active," "ACTIVE," or "Actv." Over time, a Sheets-based database becomes a data quality maintenance headache.

The good news: Sheets has enough built-in features to implement meaningful data quality controls — if you know where to look.

Data Validation in Google Sheets

Google Sheets' Data Validation feature restricts what values can be entered in a range of cells. Access it under Data → Data Validation.

Dropdown lists: Restrict categorical fields to an approved list. Create a "Status" column that only allows "Active," "Inactive," or "Pending" — no free-text entry. Create the approved list on a separate sheet and reference it in the validation.

Number ranges: Restrict numeric fields to plausible ranges. A "Discount %" column that only accepts values between 0 and 100. A "Price" column that rejects negative values.

Date constraints: Restrict date fields to a date range — only dates after 2020-01-01 for a "Contract Start Date" column.

Custom formulas: For complex validation, use a formula. ISNUMBER(SEARCH("@",A1)) validates that a field contains "@" as a basic email check.

Configure validation to either reject invalid entries or warn the user — rejection is stricter; warning allows overrides with acknowledgment.

COUNTIF for Duplicate Detection

The most common data quality check in Sheets: =COUNTIF($A:$A,A2) in a helper column counts how many times the value in A2 appears in column A. Add conditional formatting to highlight cells where this count exceeds 1 — instant duplicate visualization.

Conditional Formatting for Quality Issues

Apply conditional formatting to highlight quality problems visually:

  • Highlight blank cells in required columns red
  • Highlight dates more than 12 months in the past orange
  • Highlight cells where a REGEXMATCH formula fails (invalid format)
  • Highlight cells where COUNTIF > 1 (potential duplicate)

This creates a visual quality dashboard in your sheet — quality problems are visible at a glance.

[IMAGE: A Google Sheets spreadsheet with conditional formatting applied — blank required fields in red, duplicates in orange, invalid emails highlighted]

Frequently Asked Questions

Q: How do I add a dropdown list to a Google Sheets column? Select the column or range, go to Data → Data Validation, choose "List from a range" (to reference a list on another sheet) or "List of items" (to enter the options directly). Set "On invalid data" to "Reject input" for strict enforcement.

Q: How do I find duplicates in Google Sheets? Add a helper column with the formula =COUNTIF($A:$A,A2) where A is your key column. Filter for values greater than 1 to see all rows with duplicates. Apply conditional formatting to highlight cells where the COUNTIF exceeds 1 for visual identification.

Q: Can Google Sheets validate email addresses? With a custom formula in Data Validation: =ISNUMBER(SEARCH("@",A1)) checks for "@" presence (basic). A more complete regex validation using REGEXMATCH is also possible but more complex. For production email validation, these Sheets-based checks should be considered minimum viable — a dedicated validation service is more thorough.

Q: What is the IMPORTRANGE function and how does it affect data quality? IMPORTRANGE pulls data from another Google Sheet into the current one. Data imported via IMPORTRANGE doesn't inherit the validation rules of the source sheet, so quality problems in the source appear unfiltered. Always validate imported data separately.

Q: How do I protect a Google Sheets range so only certain users can edit it? Right-click the range → Protect range → add editors who should have edit access. Other users can view but not modify the protected range. This prevents accidental or unauthorized changes to validated data or reference tables.

Q: What is a Google Sheets data validation "List from a range"? Instead of hardcoding the dropdown options in the validation rule, you reference a range of cells on another sheet. When the approved list changes, update the reference range — all dropdown menus using that range update automatically. More maintainable than hardcoded lists for frequently changing categorical values.

Q: Can I run a data quality audit on a Google Sheet? Yes, manually using COUNTIF (for duplicates), COUNTBLANK (for missing values), and REGEXMATCH formulas (for format validation). For a more automated approach, export as CSV and run through a dedicated data quality tool.

Q: What are Apps Script and how can they improve Sheets data quality? Google Apps Script lets you write JavaScript-based automation for Sheets. You can write scripts that: run quality checks on a schedule, automatically format incoming data, send email alerts when quality thresholds are breached, or build custom validation more sophisticated than what's possible in native Data Validation.

Q: Is Google Sheets appropriate for managing business-critical data? For small datasets and small teams, Sheets works well with proper validation and access controls. For larger datasets, higher-stakes data (financial records, PII), or data shared across many users, a proper database or CRM is more appropriate. Sheets lacks audit logs, row-level access control, and the performance characteristics needed for large-scale data management.

Q: What is the most important data quality feature to configure in a new Google Sheet being used as a database? Data Validation dropdown lists for all categorical fields — they prevent free-text entry inconsistencies from the start. Combined with COUNTIF-based duplicate detection, these two features catch the most common Sheets data quality problems before they accumulate.


Google Sheets can be a reliable data management tool with the right controls in place. Data Validation, COUNTIF-based duplicate detection, and conditional formatting together provide meaningful data quality guardrails without any coding.

[INTERNAL LINK: Can Excel Really Handle Your Data Quality Needs?] [INTERNAL LINK: Data Quality for Small Business: Budget-Friendly Options]

Sohovi Team

Data quality, for people who ship

The Sohovi team writes practical guides on data quality, profiling, and governance to help teams ship better data.

Start for free

Stop guessing. Start knowing your data quality.

Sohovi profiles your datasets in minutes — surfacing completeness gaps, type mismatches, and duplicate patterns before they reach production.

No credit card required · Free forever plan