-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME
61 lines (34 loc) · 1.76 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
# grab_gamefaqs
.
A simple, quick & dirty script, written in old-school Python, to grab the FAQs
of a particular game off of GameFAQs.
.
REQUIREMENTS:
- Python 2.7, but older versions of Python will probably work as well. (The
current version of the script will NOT work with Python 3.x.)
HOW TO USE:
.(1). Go to the FAQs page for a particular game, and copy the URL.
For example: [http://www.gamefaqs.com/pc/564920-loom/faqs]
.(2). Feed this URL to grab_gamefaqs.py, and specify an output directory:
$ python grab_gamefaqs.py http://www.gamefaqs.com/pc/564920-loom/faqs loom
.(3). That's it! All the FAQs found on the given page should now be written to
the output directory, retaining their original names.
Note that some FAQs are *BIG*, and may require a minute or two to download,
especially on slower connections.
Also: kindly do not abuse this script. It is meant as a handy program to
quickly get all of a game's FAQs at once, not as a tool to scrape the whole
GameFAQs site.
UPDATE (version 1.2, 2015-05-24):
It seems this script no longer worked properly. I fixed the detection and
extraction of plain text files and images, and added support for custom
user-agents.
UPDATE (version 1.2.1, 2015-10-15):
Added -d switch which displays some debugging info.
Apparently GameFAQs changed their format again; files that used to be
accessible as plain text (ASCII) are now rendered as HTML pretending to be
ASCII. The original filename seems to be gone. As a result, these files are
now downloaded as HTML files with a meaningless name. :-/
The same happened to images, but I fixed this (since the original image name
is still available... for now anyway).
I am thinking of abandoning this script and writing one for NeoSeeker instead.
:-/