Here are the code and data needed to re-create the arXiv abstract submission predictor. I promise I'll write more this weekend.
Legal
The data aren't ours, but we did write a lot of the code. Everything (code, data, and so on) is provided without any guarantee about accuracy, quality, or the impact factor of the journal you can publish it in. Please cite the appropriate paper for each data set and for any code used internally (e.g., any R libraries you use).
License
GPL (>= 3.0), where applicable.