Whitespace Normalization Samples
Sample text files with various whitespace issues for testing normalization tools
Key Facts
- Category: Text Processing
- Items: 5
- Format Families: text
Sample Overview
Sample text files with various whitespace issues for testing normalization tools. This sample set belongs to the Text Processing category and can be used to test related workflows inside Elysia Tools.
📄 Multiple Consecutive Spaces
Text with irregular spacing between words and multiple consecutive spaces that need normalization
File size: 528 B
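As a rough illustration of the normalization this sample is meant to exercise, the sketch below collapses runs of spaces into a single space. The function name `collapse_spaces` and the sample string are assumptions made for this example; they are not taken from the sample file or from any Elysia Tools API.

```python
import re


def collapse_spaces(text: str) -> str:
    """Replace each run of two or more spaces with a single space.

    Hypothetical helper for illustration only. Tabs and newlines are
    left untouched so that line structure is preserved.
    """
    return re.sub(r" {2,}", " ", text)


# Made-up input; the real sample file's contents are not shown here.
sample = "The  quick   brown    fox"
print(collapse_spaces(sample))
```

Restricting the pattern to the space character rather than `\s` keeps line breaks intact, which matters when the input spans multiple lines.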
📄 Leading and Trailing Whitespace
Lines with excessive whitespace at the beginning and end that require trimming
File size: 856 B
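One possible way to trim a file like this is to strip each line individually. The helper name `trim_lines` and the input below are hypothetical; this is a minimal sketch, not the behavior of any particular normalization tool.

```python
def trim_lines(text: str) -> str:
    """Strip leading and trailing whitespace from every line.

    Hypothetical helper for illustration only. Lines are rejoined with
    "\n", so a trailing newline in the input is not preserved.
    """
    return "\n".join(line.strip() for line in text.splitlines())


# Made-up input; the real sample file's contents are not shown here.
sample = "   padded line   \n\tindented with a tab\t "
print(trim_lines(sample))
```

Note that this also removes indentation, so it suits prose rather than source code.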
📄 Mixed Tabs and Spaces
Text containing both tabs and spaces with inconsistent indentation patterns
File size: 672 B
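A common way to make mixed indentation consistent is to expand tabs to spaces in the leading whitespace only. The sketch below assumes a tab width of 4, which is a guess and not something the sample specifies; `expand_leading_tabs` is a hypothetical name.

```python
def expand_leading_tabs(text: str, tab_width: int = 4) -> str:
    """Convert tabs in each line's indentation to spaces.

    Hypothetical helper for illustration only. Tabs that appear after
    the first non-whitespace character are left as they are.
    """
    result = []
    for line in text.splitlines():
        body = line.lstrip(" \t")
        indent = line[: len(line) - len(body)]
        # str.expandtabs advances each tab to the next multiple of tab_width.
        result.append(indent.expandtabs(tab_width) + body)
    return "\n".join(result)


# Made-up input; the real sample file's contents are not shown here.
sample = "\tfirst\n  \tsecond"
print(expand_leading_tabs(sample))
```

Because `expandtabs` works on tab stops, two spaces followed by a tab produce four columns of indentation here, not six.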
📄 Dense Text with Extra Spaces
Paragraphs with irregular spacing, extra spaces between words, and formatting inconsistencies
File size: 2.1 KB
📄 Code with Messy Indentation
Programming code with inconsistent indentation using mixed tabs and spaces
File size: 1.4 KB
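Before re-indenting code, it can help to locate the offending lines first. The sketch below reports the 1-based numbers of lines whose indentation mixes tabs and spaces. The name `find_mixed_indent` and the input are assumptions for illustration; the sample file's actual contents are not shown here.

```python
def find_mixed_indent(text: str) -> list[int]:
    """Return 1-based numbers of lines whose indentation mixes tabs and spaces.

    Hypothetical helper for illustration only.
    """
    mixed = []
    for number, line in enumerate(text.splitlines(), start=1):
        body = line.lstrip(" \t")
        indent = line[: len(line) - len(body)]
        if " " in indent and "\t" in indent:
            mixed.append(number)
    return mixed


# Made-up input: only the second line mixes a tab with spaces.
sample = "def f():\n\t  return 1\n    pass\n\tx = 2"
print(find_mixed_indent(sample))
```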