Joel Pfeiffer
Joseph J. Pfeiffer, III
jpfeiffer at purdue dot edu

Lawson 2149 #20
Purdue University
Department of Computer Science
305 North University Street
West Lafayette, IN 47907-2066

Enron MySQL 5.5 Dump File Overview

Shetty and Adibi took the Enron dataset that was released to Cohen by the Federal Energy Regulatory Commission of the Enron email records and did a substantial amount of cleaning in order to create a nice, structured, MySQL database from the initial dump of emails -- the history and their MySQL dump file can be found on Adibi's site. However, the syntax from MySQL 4.0 (their dump file) to MySQL 5.5 (what I tried to load it in) has changed, making importing the file through MySQL workbench not straightforward, namely:

  • USE '%dbname%';
    Missing from the dump file (presumably could be used as a command line option, but Workbench doesn't seem to support it). This will need to be changed for whatever database you want to load it into.
  • ENGINE=MyISAM;
    MySQL 4.0 used the keyword TYPE=MyISAM, which 5.5 won't accept.

I've gone ahead and modified the file (below) to make it straightforward to load into 5 using the MySQL Workbench (or command line). Instructions for loading via MySQL Workbench are can be found here, while instructions for loading from the command line can be found here. Whichever you use, make sure to alter the USE command to be for a schema already created (from the command line you can specify it and take it out of the file). Additionally, it is likely this works with other versions of MySQL 5.*, but I haven't tried it.

Files/Links