CSV File Diff
Overview
The CSV File Diff feature lets you compare data across CSV or other table-structured files (such as TSV, Excel, etc.). This is useful for validating file exports, ETL processes, or migrations.
Video Demonstration
How to Use SmartDiff CSV File Diff
Step 1: Access SmartDiff
- Select Smart Diff from the left panel
- Click Workflow to view workflow history
Step 2: Create New File Diff Workflow
- Click CREATE DIFF
Step 3: Select File Diff Option
- Choose FILE DIFF (default is DATA SOURCE DIFF)
- Enables diff operations for local CSV or other supported files
Step 4: Upload Files
- Upload two CSV (or supported) files for comparison Example: movies_tv_data_source.csv vs movies_tv_data_target.csv
- Click Next
Step 5: Review Sample Data
- Review sample data from both files
- Customize headers, add/remove columns, etc.
- Click Next
Step 6: Configure Column Mapping
- Review the column list for both files
- Adjust by dragging columns, adding/removing as needed
- Set multiple key columns if required
- Click Next
Step 7: View Results
Diff Overview
Shows high-level differences between files:
- Diff Columns: Columns with differences
- Diff Rows: Rows with differences
- Same Rows: Identical rows
- Rows in Source / Rows in Target
- Missing Rows in Target
- New Rows in Target
Summary
- ALL DATA: All columns with counts
- ONLY DIFF: Only differing columns
- BY SOURCE / BY TARGET: Frequency-based comparisons
- GRAPHS:
- Numeric: Range-based counts
- Date: Monthly frequencies
- String/Other: Item frequency
- Export Results: CSV/PDF
Meta Diff
- Shows metadata differences per column (name, datatype, etc.)
- Highlights differences with color coding
- Export Results available
Data Diff
- Groups differences into clusters
- ONLY DIFF / ALL DATA views
- Side by Side and Inline display modes
- Hide non-relevant columns
- Export Results: CSV/PDF
Related Guide
- For XML comparisons, see XML File Diff