Text Cleaning

Features

Text Cleaning Types

Not all applications support every type of text cleaning. The table below shows which cleaning types each application supports.
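
| Text cleaning type | CleanHaven | Anagram | Change Case | Data Cleansing | DeDuplicate | Find & Replace | Inside Text | Its About Time | Sort It Out |
|---|---|---|---|---|---|---|---|---|---|
| TEXT SOURCE | | | | | | | | | |
| Pasted Text | ✓ | ✓ | ✓ | | ✓ | ✓ | ✓ | ✓ | ✓ |
| SQL Database | | | | ✓ | | | | | |
| CASE | | | | | | | | | |
| iNVERT cASE | ✓ | | ✓ | ✓ | | | | | |
| lowercase | ✓ | | ✓ | ✓ | | | | | |
| RaNdOM cAsE | ✓ | | ✓ | ✓ | | | | | |
| Title Case | ✓ | | ✓ | ✓ | | | | | |
| Sentence case | ✓ | | ✓ | ✓ | | | | | |
| UPPERCASE | ✓ | | ✓ | ✓ | | | | | |
| SORT | | | | | | | | | |
| Ascending | ✓ | | | | | | | | ✓ |
| Case-Sensitive | ✓ | | | | | | | | ✓ |
| Descending | ✓ | | | | | | | | ✓ |
| Numerical Value | ✓ | | | | | | | | ✓ |
| Random | ✓ | | | | | | | | ✓ |
| Reverse | ✓ | | | | | | | | ✓ |
| Words Within Paragraph | ✓ | | | | | | | | ✓ |
| DUPLICATES | | | | | | | | | |
| Remove Duplicates | ✓ | | | | ✓ | | | | |
| Show Only Duplicates | ✓ | | | | ✓ | | | | |
| Show Only Unique | ✓ | | | | ✓ | | | | |
| CONVERT | | | | | | | | | |
| Add Time to Date | ✓ | | | ✓ | | | | ✓ | |
| Alphagram by Paragraph | ✓ | ✓ | | | | | ✓ | | |
| Alphagram by Word | ✓ | ✓ | | | | | ✓ | | |
| Anagram by Paragraph | ✓ | ✓ | | | | | ✓ | | |
| Anagram by Word | ✓ | ✓ | | | | | | | |
| Bible Verse to Passage | ✓ | | | | | | ✓ | | |
| Bullets to Hyphens | ✓ | | | ✓ | | | ✓ | | |
| Butchers Talk | ✓ | ✓ | | | | | | | |
| Date Format | ✓ | | | ✓ | | | | ✓ | |
| Day of the Week | ✓ | | | ✓ | | | | ✓ | |
| Day of the Year | ✓ | | | ✓ | | | | ✓ | |
| Days between two Dates | ✓ | | | | | | | | |
| Days left in the Year | ✓ | | | ✓ | | | | ✓ | |
| Days left until Christmas | ✓ | | | | | | | ✓ | |
| Diacritical Characters to non-diacritical | ✓ | | ✓ | | | | | | |
| Double Metaphone | ✓ | | | | | | ✓ | | |
| Em/En-dash to Hyphen | ✓ | | | ✓ | | | ✓ | | |
| Encrypt | ✓ | | | | | | | | |
| Hash Value | ✓ | | | ✓ | | | ✓ | | |
| Hyphen to Em-dash | ✓ | | | ✓ | | | ✓ | | |
| Hyphen to En-dash | ✓ | | | ✓ | | | ✓ | | |
| L33T Speak | ✓ | ✓ | | ✓ | | | | | |
| LineFeeds to Returns | ✓ | | | ✓ | | | ✓ | | |
| Morse Code | ✓ | | | | | | | | |
| Non-breaking Spaces to normal Space | ✓ | | | ✓ | | | ✓ | | |
| Normal Spaces to non-breaking Space | ✓ | | | ✓ | | | ✓ | | |
| Number Format | ✓ | | | ✓ | | | ✓ | | |
| Number of Letters | ✓ | | | ✓ | | | ✓ | | |
| Number of Words | ✓ | | | ✓ | | | ✓ | | |
| Number to Words | ✓ | | | ✓ | | | ✓ | | |
| Palindrome | ✓ | ✓ | | | | | | | |
| Phone Keypad Number | ✓ | ✓ | | | | | ✓ | | |
| Pig Latin | ✓ | ✓ | | | | | | | |
| “Quotes” | ✓ | | | ✓ | | | ✓ | | |
| Random Password | ✓ | | | | | | | | |
| Returns to LineFeeds | ✓ | | | ✓ | | | ✓ | | |
| Revise Negative Verbs | ✓ | | | | | | ✓ | | |
| Shift ASCII value | ✓ | ✓ | | ✓ | | | | | |
| Soundex Code | ✓ | | | ✓ | | | ✓ | | |
| SQL compliant | ✓ | | | | | | ✓ | | |
| Symbols | ✓ | | | ✓ | | | ✓ | | |
| Tabs to Spaces | ✓ | | | ✓ | | | ✓ | | |
| Time Difference between two Dates | ✓ | | | | | | | ✓ | |
| Transpose Rows and Columns | ✓ | | | | | | ✓ | | |
| USA to British period placement | ✓ | | | | | | ✓ | | |
| Value | ✓ | | | ✓ | | | ✓ | | |
| Week of the Year | ✓ | | | ✓ | | | | ✓ | |
| Wrapped Lines | ✓ | | | | | | ✓ | | |
| REMOVE | | | | | | | | | |
| All ASCII | ✓ | | | ✓ | | | ✓ | | |
| ASCII Control characters | ✓ | | | ✓ | | | ✓ | | |
| Email Addresses | ✓ | | | | | | | | |
| Excess Returns | ✓ | | | ✓ | | | ✓ | | |
| Excess Spaces | ✓ | | | ✓ | | | ✓ | | |
| Excess Tabs | ✓ | | | ✓ | | | ✓ | | |
| LineFeeds | ✓ | | | ✓ | | | ✓ | | |
| Non ASCII | ✓ | | | ✓ | | | ✓ | | |
| Non Letters | ✓ | | | ✓ | | | ✓ | | |
| Non Numbers | ✓ | | | ✓ | | | ✓ | | |
| Periods | ✓ | | | ✓ | | | ✓ | | |
| Punctuation | ✓ | | | ✓ | | | ✓ | | |
| Returns | ✓ | | | ✓ | | | ✓ | | |
| Spaces | ✓ | | | ✓ | | | ✓ | | |
| Tabs | ✓ | | | ✓ | | | ✓ | | |
| Web Addresses | ✓ | | | | | | | | |
| PERSONAL | | | | | | | | | |
| Address Lookup on Web | ✓ | | | | | | | | |
| Combine Columns | ✓ | | | | | | ✓ | | |
| Combine Paragraphs | ✓ | | | | | | ✓ | | |
| Common First Names | ✓ | | | | | | | | |
| Common Surnames | ✓ | | | | | | | | |
| Distance between two Postcodes | ✓ | | | | | | | | |
| Distance between two Latitude/Longitudes | ✓ | | | | | | | | |
| Email to Web | ✓ | | | ✓ | | | | | |
| Name to Salutation/First/Last | ✓ | | | ✓ | | | | | |
| Phone Number | ✓ | | | ✓ | | | | | |
| Postcode to Region | ✓ | | | | | | | | |
| Salutation | ✓ | | | | | | | | |
| INFO | | | | | | | | | |
| Change Text Encoding | ✓ | | | | | | | | |
| Frequency | ✓ | | | | ✓ | | ✓ | | |
| Frequency Within | ✓ | | | | | | ✓ | | |
| Spelling | ✓ | | | | | | ✓ | | |
| Text Information | ✓ | | | | | | ✓ | | |
| View As ASCII | ✓ | ✓ | | | | | ✓ | | |
| View As Binary | ✓ | ✓ | | | | | ✓ | | |
| View As Description | ✓ | ✓ | | | | | ✓ | | |
| View As Hexadecimal | ✓ | ✓ | | | | | ✓ | | |
| View As Octal | ✓ | ✓ | | | | | ✓ | | |
| View Text Differences | ✓ | | | | | | | | |
| View Text Encoding | ✓ | | | | | | ✓ | | |
| INTERNET | | | | | | | | | |
| Add HTML Script | ✓ | | | | | | | | |
| HTML Name to Text | ✓ | | | | | | | | |
| HTML Number to Text | ✓ | | | | | | | | |
| Show HTML Conversion Table | ✓ | | | | | | | | |
| Text from HTML (All) | ✓ | | | | | | | | |
| Text from HTML (Visible) | ✓ | | | | | | | | |
| URL Encoding to Text | ✓ | | | | | | | | |
| View as HTML Number | ✓ | | | | | | | | |
| View as URL Encoding | ✓ | | | | | | | | |
| View as Web | ✓ | | | | | | | | |
| REPLACE | | | | | | | | | |
| Anywhere | ✓ | | | | | ✓ | | | |
| At Start | ✓ | | | | | ✓ | | | |
| At End | ✓ | | | | | ✓ | | | |
| Between Texts | ✓ | | | | | ✓ | | | |
| From Find Text to End | ✓ | | | | | ✓ | | | |
| Insert from Start | ✓ | | | | | ✓ | | | |
| Insert from End | ✓ | | | | | ✓ | | | |
| Up until Find Text | ✓ | | | | | ✓ | | | |
| Characters from Start | ✓ | | | | | ✓ | | | |
| Characters in Middle | ✓ | | | | | ✓ | | | |
| Characters from End | ✓ | | | | | ✓ | | | |
| Regular Expression | ✓ | | | | | ✓ | | | |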
 
 
Text Cleaning Applications
CleanHaven
Paste your text into the Source field. Your cleaned text appears in the Destination. Choose from dozens of different text cleaning types, including most of the conversions in the other applications below.
Anagram
Anagram is a utility to jumble or convert a list of words or paragraphs, including:
  • Alphagram — Jumbles the letters within each word (or paragraph) into alphabetical order. Useful for creating puzzles. Hyphenated words are jumbled separately on either side of the hyphen.
  • Anagram — Jumbles the letters within each word (or paragraph) into a random order. Useful for creating puzzles. Hyphenated words are jumbled separately on either side of the hyphen. Clicking the arrow to face Up will jumble the Converted Text!
  • Butchers Talk — Reverses the letters of each word within a sentence. This is a common technique that Australian butchers use to point out when a good-looking girl walks in or to direct an assistant to sell someone an alternative product, all without the customers being aware. Clicking the arrow to face Up will return the Converted Text to its original order.
  • L33T Speak — Replaces certain letters within a sentence with a numerical or symbolic alternative. This is useful for creating passwords that are easy to remember, look like the original text and are more secure (A=4, B=8, C=(, E=3, G=6, H=#, I=1, L=7, O=0, P=9, S=5, T=+ and Z=2). Clicking the arrow to face Up will return the Converted Text to its original letters.
  • Palindrome — Reverses the order of the letters of the whole text. If you want to reverse the letters by word then choose Butchers Talk.
  • Phone Keypad Number — Converts the text into the numbers and symbols found on most telephones and mobile phones.
  • Pig Latin — Converts the text into Pig Latin. This is often used by children to speak to a friend openly but hide the meaning from other listeners. Each word starting with a consonant has that consonant moved to the end of the word with the sound 'ay' added to it. Words starting with a vowel just have '-way' appended. Certain grouped consonants such as 'qu' are moved together. You can also reverse the Pig Latin, but there is an ambiguity with words that originally started with a 'w'. Clicking the arrow to face Up will return the Converted Text to the original text.
  • Shift ASCII value — Moves the ASCII value of every character up or down by the selected amount. If you shift up by 1 then 'A' becomes 'B'; if you shift down by 2 then 'G' becomes 'E', etc. Printable text characters are ASCII 32 to 126. Shifting any character below ASCII 32 or above ASCII 126 may result in unreadable and unrecoverable text. Clicking the arrow to face Up will return the Converted Text to its original characters.
  • View as ASCII, Binary, Description, Hexadecimal and Octal — Converts the text into its ASCII, binary, description, hexadecimal or octal representation. You can surround the values with brackets to keep them separate.
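A few of the conversions above are simple enough to sketch in code. The following Python is only an illustration of the described behaviour, not the application's own implementation; the function names are hypothetical, and treating lowercase letters the same as uppercase in L33T Speak is an assumption.

```python
# Illustrative sketches only; all names here are hypothetical.

def alphagram_word(word: str) -> str:
    """Sort the letters of a word alphabetically, handling each side of a hyphen separately."""
    parts = word.split("-")
    return "-".join("".join(sorted(part, key=str.lower)) for part in parts)


def butchers_talk(sentence: str) -> str:
    """Reverse the letters of each word while keeping the words in order."""
    return " ".join(word[::-1] for word in sentence.split(" "))


# Letter substitutions quoted in the L33T Speak description.
_LEET_FROM = "ABCEGHILOPSTZ"
_LEET_TO = "48(36#17095+2"
# Assumption: lowercase letters are substituted the same way as uppercase ones.
LEET_TABLE = str.maketrans(_LEET_FROM + _LEET_FROM.lower(), _LEET_TO * 2)


def leet_speak(text: str) -> str:
    """Replace the listed letters with their numeric or symbolic alternatives."""
    return text.translate(LEET_TABLE)


def shift_ascii(text: str, amount: int) -> str:
    """Move every character's code up (positive) or down (negative) by `amount`."""
    return "".join(chr(ord(ch) + amount) for ch in text)


print(alphagram_word("well-known"))
print(butchers_talk("good morning"))
print(leet_speak("Hello"))
print(shift_ascii("A", 1), shift_ascii("G", -2))
```

Shifting by the opposite amount restores the text only while every shifted character stays within the valid character range, which mirrors the warning in the description.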
Change Case
Change Case is a utility to translate plain text into different cases, including:
  • Diacritical characters — all accents (e.g. acutes, graves and umlauts) are removed from the characters, leaving their plain counterparts.
  • iNVERT cASE — what was UPPERCASE is now lowercase, and vice versa.
  • lowercase — whatever the case was before, it is now lowercase.
  • RaNdOM cAsE — the case of each letter within each word is random. Click OK repeatedly to change again.
  • Title Case — normally this means every word starts with an UPPERCASE letter with the rest of the word being lowercase. But you have the option of keeping conjunctions all lowercase.
  • Sentence case — the whole of each sentence is in lowercase, except the first letter of the first word of each sentence is UPPERCASE.
  • UPPERCASE — whatever the case was before, it is now UPPERCASE.
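As a rough illustration of the case rules above, here is a minimal Python sketch. The function names and the list of conjunctions are assumptions made for the example; the application's actual word list is not given in this document.

```python
import re

# Assumption: a small illustrative set of conjunctions to keep lowercase.
CONJUNCTIONS = {"and", "but", "or", "nor", "for", "so", "yet"}


def invert_case(text: str) -> str:
    """UPPERCASE letters become lowercase and vice versa."""
    return text.swapcase()


def title_case(text: str, keep_conjunctions_lower: bool = False) -> str:
    """Capitalise the first letter of every word, optionally leaving conjunctions lowercase."""
    result = []
    for index, word in enumerate(text.lower().split(" ")):
        if keep_conjunctions_lower and index > 0 and word in CONJUNCTIONS:
            result.append(word)
        else:
            result.append(word[:1].upper() + word[1:])
    return " ".join(result)


def sentence_case(text: str) -> str:
    """Lowercase everything, then capitalise the first letter of each sentence."""
    lowered = text.lower()
    return re.sub(
        r"(^\s*|[.!?]\s+)([a-z])",
        lambda m: m.group(1) + m.group(2).upper(),
        lowered,
    )


print(invert_case("Hello World"))
print(title_case("the cat and the hat", keep_conjunctions_lower=True))
print(sentence_case("hELLO world. this IS fine."))
```

The sentence detection here only looks for a full stop, exclamation mark or question mark followed by whitespace, so abbreviations such as "e.g." would need extra handling.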
Data Cleansing
Data Cleansing is a utility to clean up the field values in any MySQL database. Clean your Text, Number and Date field values with 50 different cleansing types, including:
  • Case — iNVERT cASE, lowercase, RaNdOM cAsE, Title Case, Sentence case and UPPERCASE.
  • Text Cleaning — Add Time to Date, Bullets to Hyphens, Date Format, Day of the Week, Day of the Year, Days left in the Year, Em/En-dash to Hyphen, Hash Value, Hyphen to Em-dash, Hyphen to En-dash, L33T Speak, LineFeeds to Returns, Non-breaking Spaces to normal Space, Normal Spaces to non-breaking Space, Number Format, Number of Letters, Number of Words, Number to Words, “Quotes”, Returns to LineFeeds, Shift ASCII value, Soundex Code, Symbols, Tabs to Spaces, Value and Week of the Year.
  • Remove Text — Remove All, Remove All ASCII, Remove ASCII Control characters, Remove Excess Returns, Remove Excess Spaces, Remove Excess Tabs, Remove LineFeeds, Remove Non ASCII, Remove Non Letters, Remove Non Numbers, Remove Periods, Remove Punctuation, Remove Returns, Remove Spaces and Remove Tabs.
  • Personal Data — Email to Web, Name to Salutation/First/Last and Phone Number.
DeDuplicate
DeDuplicate is a utility to identify and remove duplicate items from a list of text, including:
  • Remove Duplicates — sorts the list then removes the excess occurrences. Unique items will always be included, but only one occurrence of each duplicate will be included.
  • Show only Duplicates — sorts the list then shows only those items that occur two or more times.
  • Show only Unique — sorts the list then shows only those items that occur once.
  • Frequency — lists all the items in descending order of frequency.
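The four list operations above can be sketched with Python's standard `collections.Counter`. This is a minimal illustration under the assumption that items are compared exactly (case-sensitively); the function names are hypothetical.

```python
from collections import Counter


def remove_duplicates(items):
    """Sort the list and keep one occurrence of every item."""
    return sorted(set(items))


def show_only_duplicates(items):
    """Sort the list and keep only items that occur two or more times."""
    counts = Counter(items)
    return sorted(item for item, count in counts.items() if count >= 2)


def show_only_unique(items):
    """Sort the list and keep only items that occur exactly once."""
    counts = Counter(items)
    return sorted(item for item, count in counts.items() if count == 1)


def frequency(items):
    """Return (item, count) pairs in descending order of frequency."""
    return Counter(items).most_common()


fruit = ["pear", "apple", "pear", "fig", "apple", "pear"]
print(remove_duplicates(fruit))
print(show_only_duplicates(fruit))
print(show_only_unique(fruit))
print(frequency(fruit))
```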
Find And Replace
Find And Replace is a utility to find and replace text within a list of paragraphs, including:
  • Anywhere — Replaces all occurrences of the Find Text with the Replace Text. Change the 'Clean By' field in the Preferences to search the text as a whole or by paragraph or by word. Keep the Replace text empty to delete the Find text. Click the Special popups to find or replace invisible characters. Click the Next button to highlight the next occurrence of the Find text in the main text window. Click the Swap button to swap the Find and Replace fields. If the Find Text includes a «CR» then the Source Text will be treated as a whole rather than as separate paragraphs.
  • At Start — Replaces the Find Text with the Replace Text only if it occurs at the start of the paragraph (or whole text). Leaving the Find text blank allows you to prefix every paragraph with the Replace text.
  • At End — Replaces the Find Text with the Replace Text only if it occurs at the end of the paragraph (or whole text). Leaving the Find text blank allows you to suffix every paragraph with the Replace text.
  • Between Texts — Replaces the first occurrence of the Text contained within the 'Between' text limits, with the Replace Text. You can decide to also replace the 'Between' text limits themselves or leave them intact. If the Find Text contains text it will do a Find and Replace within 'Between' text limits.
  • From Find Text to End — Replaces all text beginning with the first occurrence of the Find Text all the way to the end of the paragraph (or whole text) with the Replace text. You can decide to replace the Find text itself or leave it intact.
  • Insert from Start — Inserts the Replace text 'From …' characters from the left of the text. None of the text will be deleted. The paragraph must be at least as long as the number of characters, otherwise it will be ignored.
  • Insert from End — Inserts the Replace text 'for …' characters from the right of the text. None of the text will be deleted. The paragraph must be at least as long as the number of characters, otherwise it will be ignored.
  • Up until Find Text — Replaces all text from the beginning of the paragraph (or whole text) to the first occurrence of the Find Text with the Replace text.
  • Characters from Start — Replaces the first 'From …' characters of every paragraph (or the whole text) with the Replace Text.
  • Characters in Middle — Replaces all characters of every paragraph (or the whole text) beginning with the 'From …' character position for a total of 'for …' characters with the Replace Text. The paragraph must be longer than the 'From …' value. If the 'for …' value is zero it lets you substitute text inside the paragraph without removing anything.
  • Characters from End — Replaces the last 'for …' characters of every paragraph (or the whole text) with the Replace Text.
  • Regular Expression — Searches for patterns in text and either extracts those patterns or replaces them with another RegEx pattern.

Note that:
  • clicking the Special buttons will insert a representation of an invisible character for Carriage Returns («CR»), LineFeeds («LF»), Tab («Tab») or Escape («Esc»).
  • clicking the Next button will repeatedly search for the next occurrence of the Find Text in the main Text field.
  • clicking the Swap button will replace the Find text with the Replace text and vice versa.
  • clicking the Source button will replace the main Text field with the Converted text.
  • clicking the Clipboard button will place the Converted text onto the clipboard.
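Some of the replace types described above are easy to express as small string functions. The Python below is a sketch under stated assumptions, not the application's code: the function names are hypothetical, paragraphs are assumed to be separated by a newline, and the 'From …' position is assumed to be 1-based.

```python
def replace_at_start(paragraph: str, find: str, replace: str) -> str:
    """Replace `find` only at the start; an empty `find` prefixes the paragraph."""
    if paragraph.startswith(find):
        return replace + paragraph[len(find):]
    return paragraph


def replace_at_end(paragraph: str, find: str, replace: str) -> str:
    """Replace `find` only at the end; an empty `find` suffixes the paragraph."""
    if paragraph.endswith(find):
        return paragraph[:len(paragraph) - len(find)] + replace
    return paragraph


def replace_characters_in_middle(paragraph: str, start: int, count: int, replace: str) -> str:
    """Replace `count` characters beginning at position `start` (assumed 1-based).

    A `count` of zero inserts `replace` without removing anything.
    """
    if len(paragraph) <= start:  # the paragraph must be longer than the 'From' value
        return paragraph
    index = start - 1
    return paragraph[:index] + replace + paragraph[index + count:]


def clean_by_paragraph(text: str, func, *args) -> str:
    """Apply one of the functions above to every paragraph of `text`."""
    return "\n".join(func(paragraph, *args) for paragraph in text.split("\n"))


print(replace_at_start("Mr Smith", "Mr", "Dr"))
print(replace_at_end("report.txt", ".txt", ".md"))
print(replace_characters_in_middle("abcdef", 2, 3, "X"))
print(clean_by_paragraph("one\ntwo", replace_at_start, "", "> "))
```

Leaving the find text empty in `replace_at_start` or `replace_at_end` gives the prefix and suffix behaviour mentioned in the At Start and At End descriptions.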
Inside Text
Inside Text is a utility to extract information about text. It changes, combines, converts and removes text, as well as examining it for trends and more detail, including:
  • Combine Columns, Combine Paragraphs
  • Convert Bible Verse to Passage, Convert Bullets to Hyphens, Convert Double Metaphone, Convert Em/En-dash to Hyphen, Convert Hash Value, Convert Hyphen to Em-dash, Convert Hyphen to En-dash, Convert LineFeeds to Returns, Convert Non-breaking Spaces to normal Space, Convert Normal Spaces to non-breaking Space, Convert Number Format, Convert Number of Letters, Convert Number of Words, Convert Number to Words, Convert Phone Keypad Number, Convert Returns to LineFeeds, Convert Revise Negative Verbs, Convert Soundex Code, Convert SQL compliant, Convert Symbols, Convert Tabs to Spaces, Convert Transpose Rows and Columns, Convert USA to British period placement, Convert Value, Convert Wrapped Lines, Convert “Quotes”
  • Info re Frequency Within, Info re Frequency, Info re Spelling, Info re Text Information
  • Remove All ASCII, Remove ASCII Control characters, Remove Excess Returns, Remove Excess Spaces, Remove Excess Tabs, Remove LineFeeds, Remove Non ASCII, Remove Non Letters, Remove Non Numbers, Remove Periods, Remove Punctuation, Remove Returns, Remove Spaces, Remove Tabs
  • Save As SQLite
  • View As ASCII, View As Binary, View As Hexadecimal, View As Octal, View As Description, View Text Encoding
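To make the View As conversions concrete, here is a minimal Python sketch that shows each character's code in decimal (ASCII), binary, hexadecimal or octal, with optional brackets around each value. The function name, the zero-padding widths and the space separator are assumptions; the application's exact output format is not documented here.

```python
# Assumed format specifications for each view; padding widths are illustrative.
VIEW_FORMATS = {
    "ascii": "d",
    "binary": "08b",
    "hexadecimal": "02X",
    "octal": "03o",
}


def view_as(text: str, view: str, brackets: bool = False) -> str:
    """Show every character's code in the chosen number base."""
    spec = VIEW_FORMATS[view]
    values = [format(ord(ch), spec) for ch in text]
    if brackets:
        # Brackets keep adjacent values visibly separate.
        return "".join(f"[{value}]" for value in values)
    return " ".join(values)


print(view_as("Hi", "ascii"))
print(view_as("Hi", "binary"))
print(view_as("Hi", "hexadecimal", brackets=True))
print(view_as("Hi", "octal"))
```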
Its About Time
Its About Time is a utility to manipulate dates or tell you information about a list of dates, including:
  • Add Time to Date — Add any amount of years, months, days, hours, minutes or seconds to a date to reveal the new date and time. It is possible for the change values to be negative to go back in time. The start date must be in a format as defined by the computer's Date and Time preferences.
  • Date Format — Converts date (and time) text into other date formats. For correct parsing the date being converted needs to match the settings defined in the computer's Date and Time format settings, such as DD/MM/YY or MM/DD/YY. If a date cannot be parsed the text will remain unchanged.
  • Day of the Week — Returns the name of the week day on which the date occurs such as 'Sunday' through 'Saturday'.
  • Day of the Year — Returns the number of the day in the Year on which the date occurs. The first of January, whatever the year, will therefore be day one.
  • Days left in the Year — Returns the number of days from the date until 31 December of the same year.
  • Days left until Christmas — Returns the number of days from the date until Christmas (25 December) of the same year.
  • Time Difference between two Dates — Returns the amount of time between two tab-delimited dates. The dates must be in the format set in the computer's International Date and Time format. Any times included (in HH:MM:SS format) will be added to the dates. If the second date is missing it will compare the first date with the computer's current date and time, so you can easily find out a person's age (given their birthday) or the time that has passed since an event.
  • Week of the Year — Returns the Week Number within the Year on which the date occurs. The first week is numbered 1. The first of January of any year is therefore in week one. The first week may be incomplete. If January 1 falls on a Saturday, then the next day is in week 2.

Note that:
  • You can subtract time by using negative values in Add Time to Date.
  • The Date format returned by Add Time to Date is set in the Date Format option.
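The date calculations above can be sketched with Python's standard `datetime` module. This is an illustration only: the function names are hypothetical, and the Sunday week start in `week_of_year` is inferred from the statement that when January 1 falls on a Saturday the next day is in week 2.

```python
from datetime import date, timedelta

DAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday", "Friday", "Saturday", "Sunday"]


def day_of_week(d: date) -> str:
    """Name of the weekday on which the date occurs."""
    return DAY_NAMES[d.weekday()]  # date.weekday() returns 0 for Monday


def day_of_year(d: date) -> int:
    """1 January is day one, whatever the year."""
    return d.timetuple().tm_yday


def days_left_in_year(d: date) -> int:
    """Days from the date until 31 December of the same year."""
    return (date(d.year, 12, 31) - d).days


def days_until_christmas(d: date) -> int:
    """Days from the date until 25 December of the same year (negative after Christmas)."""
    return (date(d.year, 12, 25) - d).days


def week_of_year(d: date) -> int:
    """Week number where 1 January is always in week 1 and weeks start on Sunday (inferred)."""
    jan1 = date(d.year, 1, 1)
    offset = (jan1.weekday() + 1) % 7  # Sunday = 0 ... Saturday = 6
    return (day_of_year(d) + offset - 1) // 7 + 1


# 1 January 2022 was a Saturday, so 2 January 2022 falls in week 2.
print(day_of_week(date(2022, 1, 1)), week_of_year(date(2022, 1, 1)), week_of_year(date(2022, 1, 2)))
# Negative values go back in time, as with Add Time to Date.
print(date(2024, 3, 1) + timedelta(days=-1))
```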
Sort It Out
Sort It Out is a utility to re-sort the order of a set of words or paragraphs, including:
  • Ascending — sorts items by increasing alphabetical order. Uppercase and lowercase values appear together.
  • Case-Sensitive — sorts items by increasing ASCII order. All uppercase letters come before any lowercase letters, so A-Z comes before a-z. Symbols are also in ASCII order.
  • Descending — sorts items by descending alphabetical order. It is the opposite of ascending order. Uppercase and lowercase values appear together.
  • Numerical Value — sorts items by increasing numerical value order. It takes negatives and the decimal point into account. Spaces at the front of a number need to be removed, otherwise the result is zero. All text has a value of zero. 'INF' represents infinity.
  • Random Sort — sorts items by random order, so it is different each time you sort.
  • Reverse — sorts items in the opposite order to the one in which they currently appear.
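The sort orders above differ mainly in the key used for comparison, which a short Python sketch can show. The function names are hypothetical, and the number-parsing rule is an approximation of the behaviour described for Numerical Value (leading spaces and plain text both give zero, and 'INF' is infinity).

```python
import re

# A leading optional minus sign followed by digits with an optional decimal point.
_NUMBER = re.compile(r"-?(\d+\.?\d*|\.\d+)")


def sort_ascending(items):
    """Alphabetical order with uppercase and lowercase appearing together."""
    return sorted(items, key=str.lower)


def sort_case_sensitive(items):
    """Character-code order, so A-Z comes before a-z."""
    return sorted(items)


def sort_descending(items):
    """The opposite of ascending order."""
    return sorted(items, key=str.lower, reverse=True)


def numerical_value(item: str) -> float:
    """Value used for numerical sorting; text and space-prefixed numbers count as zero."""
    if item == "INF":
        return float("inf")
    match = _NUMBER.match(item)  # match() anchors at the first character
    return float(match.group()) if match else 0.0


def sort_numerical(items):
    """Increasing numerical value order."""
    return sorted(items, key=numerical_value)


def sort_reverse(items):
    """The opposite of the current order, without any comparison."""
    return list(reversed(items))


print(sort_ascending(["pear", "Apple", "banana"]))
print(sort_case_sensitive(["pear", "Apple", "banana", "Zebra"]))
print(sort_numerical(["10", "-2", "3.5", " 7", "abc", "INF"]))
```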