Introduction: Word Cloud From Google Chats

I created a word cloud from all of my google chats between my wife and I over 1 year. I used wordle .net to generate the cloud, getting the chats to a single text file was the tricky part. If you have any other tips or tricks to make this easier please let me know...


...after going through this again, you might be able to skip steps 7 & 8 if you create a label in gmail that is all chats from a single person. then you can use the thunderbird export tool to put the entire folder into one text file:  "Tools"-> "ImportExportTools" -> "Export all messages in the folder" -> "Plain Text Format (one file)".  I haven't tested this, but it might work.

Step 1: Enable IMAP in GMail

In Gmail settings, under "Forwarding and POP/IMAP", select "Enable IMAP"

Step 2: Set Chats to Be Shown in IMAP

In gmail settings, find "Chats" and select "Show in IMAP"

Step 3: Install Thunderbird

Install Thunderbird, set it up to you gmail account, make sure it is configured as IMAP and not POP

Step 4: Subscribe to "chats" in Thunderbird

Make sure you are subscribed to "Chats" in the folder tree on the left side of Thunderbird (right click to change)

Step 5: Install Export Tools Into Thunderbird

Install Thunderbird add-on ImportExportTools

Step 6: Export All Chats to a Directory

In thunderbird with the "Chats" folder highlighted, go to "Tools"-> "ImportExportTools" -> "Export all messages in the folder" -> "Plain Text Format" and select a folder to save it to.

Step 7: Prepare Text Files From a Single Person Into a Single Directory.

Copy only chats from desired person into their on directory. Search that directory for a particular persons name and copy those files only to a simple directory like "C:\Chats"

Run batch renaming file to remove spaces from file names and shorten them. Save the "rename.bat" file in that same directory.. [i got it here]. In the command prompt go to C:\Chats and type in:  rename.bat -files " " "_" C:\Chats in order to get rid of the spaces. You may also want to replace the name of the person in the file with initials to make it smaller. rename.bat -files " Bob Dole " "bd" C:\Chats

Step 8: Combine All Chats Into a Single Text File.

From the start menu -> run -> type in "cmd" and enter. 
Navigate to the correct folder using "cd .." and cd "Chats", etc...
Use DOS command:  copy /a *.txt allchats.txt
This combines all of the text files into a single file called "allchats.txt"

Step 9: Edit Chat Text File to Get Rid of Some Common Words/expressions

Open "allchats.txt" in Notepad++ (or similar) and:
 - use find/Replace to remove strings like " me ", " AM ", " AM:", " PM ", " PM:", " minutes ", etc...notice the spaces and colons in there. I ended up going back to this step several times, otherwise some words are really big. You can also remove words from the wordle site.
 - optional: select all and right click to change all to upper or lower case.

Step 10: Run Text Through Wordle.net Website.

copy all text into wordle.net website.

After doing this you can set different coler/text options. You can also remove words by right clicking them.