Elysia Tools
Duplicate Line Samples
Sample files containing various kinds of duplicate lines, for testing duplicate-removal tools
Samples
Entries inside this sample collection
Log File with Duplicates
Server log file with repeated log entries showing common patterns of duplicate messages
Difficulty: 3/10
Estimated time: 2 min
Tags: logs, duplicates, server, timestamps, adjacent, global
log-file-with-duplicates.txt
TXT · 1.7 KB
- fileSize: 1.7 KB
- encoding: UTF-8
- lineCount: 27
- characterCount: 1693
- hasAdjacentDuplicates: true
- hasGlobalDuplicates: true
- hasTimestamps: true
- hasLogLevels: true
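The log sample is flagged as having both adjacent and global duplicates, and the two call for different removal strategies. Below is a minimal sketch of the difference, assuming exact string comparison of whole lines. The function names and the log lines are hypothetical and are not taken from the sample file.

```python
from itertools import groupby


def dedupe_adjacent(lines):
    """Collapse runs of identical consecutive lines (similar to Unix `uniq`)."""
    return [line for line, _ in groupby(lines)]


def dedupe_global(lines):
    """Keep only the first occurrence of each line, preserving order."""
    seen = set()
    result = []
    for line in lines:
        if line not in seen:
            seen.add(line)
            result.append(line)
    return result


# Hypothetical log lines for illustration only.
sample = [
    "INFO server started",
    "WARN disk usage high",
    "WARN disk usage high",
    "INFO request handled",
    "WARN disk usage high",
]

adjacent_result = dedupe_adjacent(sample)  # removes only the back-to-back repeat
global_result = dedupe_global(sample)      # removes every later repeat
```

Because both functions compare whole lines, two entries that carry the same message but different timestamps are treated as distinct. Removing those would need a comparison key that ignores the timestamp, which depends on the log's timestamp format.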
Email List with Duplicates
Mailing list containing duplicate email addresses with variations in case and spacing
Difficulty: 2/10
Estimated time: 2 min
Tags: emails, duplicates, mailing-list, case-variations, spacing
email-list-with-duplicates.txt
TXT · 973 B
- fileSize: 973 B
- encoding: UTF-8
- lineCount: 41
- characterCount: 973
- hasExactDuplicates: true
- hasCaseVariations: true
- hasSpacingVariations: true
- totalEmails: 38
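Case and spacing variations mean that exact line comparison will miss some duplicate addresses. One possible approach, sketched here under the assumption that the spacing variations are leading or trailing whitespace, is to compare a normalized key while keeping the first-seen spelling. The function name and the addresses are hypothetical.

```python
def dedupe_emails(lines):
    """Drop addresses that differ only in letter case or surrounding whitespace."""
    seen = set()
    unique = []
    for line in lines:
        cleaned = line.strip()
        if not cleaned:
            continue  # skip blank lines
        key = cleaned.lower()  # comparison key; the original spelling is kept
        if key not in seen:
            seen.add(key)
            unique.append(cleaned)
    return unique


# Hypothetical addresses for illustration only.
emails = [
    "alice@example.com",
    "Alice@Example.com",
    "  alice@example.com  ",
    "",
    "bob@example.com",
]

unique_emails = dedupe_emails(emails)
```

Lowercasing the whole address is a common practical choice, although RFC 5321 allows the part before the `@` to be case-sensitive, so a stricter tool might lowercase only the domain.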
Config File Duplicates
Configuration file with repeated settings and adjacent duplicate configuration lines
Difficulty: 3/10
Estimated time: 2 min
Tags: config, duplicates, settings, adjacent, key-value
config-file-duplicates.txt
TXT · 1.6 KB
- fileSize: 1.6 KB
- encoding: UTF-8
- lineCount: 74
- characterCount: 1627
- hasAdjacentDuplicates: true
- hasComments: true
- hasKeyValuePairs: true
- configSections: 7
Data Export with Duplicates
CSV-formatted data export containing fully duplicated records and records that match on only some fields
Difficulty: 4/10
Estimated time: 3 min
Tags: csv, data, duplicates, records, export
data-export-with-duplicates.txt
TXT · 1.2 KB
- fileSize: 1.2 KB
- encoding: UTF-8
- lineCount: 22
- characterCount: 1217
- hasFullRowDuplicates: true
- hasPartialDuplicates: true
- format: CSV
- totalRecords: 15
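Full-row and partial duplicates in CSV data can be told apart by which fields are compared. The sketch below assumes a header row and uses hypothetical column names (`id`, `name`, `email`) and hypothetical records, since the actual columns of the sample are not listed here.

```python
import csv
import io


def dedupe_rows(rows, key_fields=None):
    """Keep the first row for each key.

    With key_fields=None the whole row is the key (full-row duplicates).
    Otherwise only the named fields are compared (partial duplicates).
    """
    seen = set()
    unique = []
    for row in rows:
        if key_fields is None:
            key = tuple(row.items())
        else:
            key = tuple(row[field] for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical CSV content for illustration only.
CSV_TEXT = """id,name,email
1,Ada,ada@example.com
1,Ada,ada@example.com
2,Ada L.,ada@example.com
3,Grace,grace@example.com
"""

rows = list(csv.DictReader(io.StringIO(CSV_TEXT)))
full_unique = dedupe_rows(rows)                           # drops the exact repeat of row 1
partial_unique = dedupe_rows(rows, key_fields=["email"])  # also drops row 2 (same email)
```

Which fields count as the key is a judgment call per dataset. Matching on too few fields can merge records that are genuinely different.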
Mixed Content Duplicates
File containing various types of content with duplicates: code, text, configs, and comments
Difficulty: 5/10
Estimated time: 3 min
Tags: mixed, duplicates, code, text, config, comments
mixed-content-duplicates.txt
TXT · 1.3 KB
- fileSize: 1.3 KB
- encoding: UTF-8
- lineCount: 79
- characterCount: 1342
- hasCode: true
- hasComments: true
- hasConfig: true
- hasText: true