
Download all messages from a Google group

I'm developing a Google Apps migration/archive system, and at this point I'm trying to come up with a way to download all messages in all the groups that my domain's users have created. I know that I can set up forwarding filters so that new messages are archived to an email address, but this doesn't help with older messages.

Is there a way to download these messages from a Google Group, and if so, is there a way in the Admin API to get a list of all groups that users have created?

LexNix asked May 07 '14 15:05




2 Answers

If you don't mind using Bash, you could try a tool I wrote:

https://github.com/icy/google-group-crawler

It can download all mbox files from a Google Group. If you have a cookie file, you can also download all files from a private Google Group and/or see the original emails. It can also read RSS feeds and fetch the latest posts, which is useful for a daily mirror.

An example result is at http://l.archlinuxvn.org/archlinuxvn/. MHonArc is used there to convert the mbox files into HTML.
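Once the mbox files are on disk, they can be inspected without converting them to HTML. The following is a minimal sketch using Python's standard `mailbox` module, assuming the downloaded files are in standard mbox format; the function name `summarize_mbox` and the idea of summarizing by date, sender, and subject are illustrative choices, not part of the crawler.

```python
import mailbox


def summarize_mbox(path):
    """Return a (date, sender, subject) tuple for each message in an mbox file.

    `path` is assumed to point at an existing file in standard mbox format.
    Missing headers are reported as empty strings.
    """
    # create=False makes a wrong path raise an error instead of creating an empty file.
    box = mailbox.mbox(str(path), create=False)
    try:
        return [
            (msg.get("Date", ""), msg.get("From", ""), msg.get("Subject", ""))
            for msg in box
        ]
    finally:
        box.close()
```

Calling `summarize_mbox` on one of the downloaded files (the path depends on where the crawler stored its output) gives a quick way to check that a mirror is complete before archiving it.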

Anh Huynh answered Sep 30 '22 07:09



Ultimately I ended up using the gdata Python library to get a list of all groups along with their respective URLs. From there I used Selenium to scrape each group for its messages and all replies. It's probably not the best solution, but it works for what I need.
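Listing every group in a domain usually means walking a paginated API. The sketch below is not the gdata code used above; it only illustrates the pagination loop, assuming responses shaped like the Admin SDK Directory API's group listing (a `groups` list plus an optional `nextPageToken`). The `fetch_page` callable is hypothetical and stands in for whatever authenticated client call you use.

```python
def list_all_groups(fetch_page):
    """Collect group records across all pages of a paginated listing.

    `fetch_page(page_token)` is a hypothetical callable supplied by the caller.
    It is assumed to return a dict with an optional "groups" list and an
    optional "nextPageToken" string; the first call receives None as the token.
    """
    groups = []
    token = None
    while True:
        page = fetch_page(token)
        groups.extend(page.get("groups", []))
        token = page.get("nextPageToken")
        if not token:
            # No further pages: return everything collected so far.
            return groups
```

Keeping the HTTP call outside the loop makes the pagination logic easy to test with canned responses, and the resulting list of groups can then be handed to whichever scraper or crawler downloads the messages.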

LexNix answered Sep 30 '22 07:09
