Key Facts
- Category
- Data Processing
- Input Types
- textarea, select, checkbox
- Output Type
- text
- Sample Coverage
- 4
- API Ready
- Yes
Overview
The Data Range Limiter is a powerful utility designed to enforce numerical constraints on your datasets by clipping, filtering, or flagging values that fall outside defined boundaries. It ensures data consistency and quality, making it an essential tool for cleaning sensor outputs, preparing machine learning inputs, and enforcing strict business rules across your CSV files.
When to Use
- •When you need to clean sensor data by removing or capping extreme outliers.
- •When preparing datasets for machine learning models that require strictly normalized input ranges.
- •When enforcing business constraints, such as ensuring age, salary, or temperature values remain within logical limits.
How It Works
- •Upload your CSV data and define your range constraints using a simple JSON configuration.
- •Select a handling strategy: 'Clip' to force values to boundaries, 'Filter' to remove invalid rows, or 'Mark' to flag discrepancies.
- •Optionally enable auto-detection to identify numeric columns and generate statistical reports on the modifications performed.
- •Download your processed, validated dataset with optional modification flags for easy auditing.
Use Cases
Examples
1. Sensor Data Cleaning
Data Engineer- Background
- A set of temperature sensor logs contains occasional spikes due to electrical interference, with values reading above 100°C in a room-temperature environment.
- Problem
- The spikes are skewing the average temperature calculations and need to be capped at a realistic maximum.
- How to Use
- Upload the CSV, set the strategy to 'Clip', and define the range for the 'temperature' column.
- Example Config
-
{"temperature": {"min": 15, "max": 40}} - Outcome
- All temperature readings above 40°C are automatically set to 40°C, ensuring the dataset remains usable for analysis without extreme outliers.
2. Enforcing Business Rules
HR Analyst- Background
- An employee database contains age entries that are clearly erroneous, such as values below 18 or above 100.
- Problem
- These invalid entries need to be removed from the report to ensure accurate demographic analysis.
- How to Use
- Upload the employee CSV, select the 'Filter' strategy, and configure the age range.
- Example Config
-
{"age": {"min": 18, "max": 100}} - Outcome
- Any row containing an age outside the 18-100 range is removed, resulting in a clean dataset ready for HR reporting.
Try with Samples
json, csv, videoRelated Hubs
FAQ
What is the difference between clipping and filtering?
Clipping adjusts out-of-bounds values to the nearest allowed minimum or maximum, while filtering removes the entire row containing the invalid value.
Can I apply different ranges to different columns?
Yes, the JSON configuration allows you to specify unique min and max values for each individual column in your dataset.
What happens if I don't specify target columns?
The tool will automatically detect all numeric columns in your CSV and apply the range constraints to them.
Can I keep my original data for comparison?
Yes, by enabling 'Preserve Original Columns', the tool will retain your original values with an '_original' suffix in the output.
Does this tool support non-numeric data?
The tool is designed for numeric range enforcement; non-numeric columns are ignored during the range processing phase.