-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy path03-06a-load.qmd
50 lines (40 loc) · 1.87 KB
/
03-06a-load.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
---
title: "Google Load"
---
The free cloud instance of Neo4j (Aura) requires that csv files are stored in [public cloud storage](https://neo4j.com/docs/aura/aurads/importing-data/load-csv/) like Google Drive or Dropbox.
Therefore, my project requires an intermediary step.
```{dot}
digraph google_drive_storage {
// top to bottom
rankdir = TD;
nodesep = 1.0;
splines = false;
// node styles
node [shape=box, style="filled,rounded", fillcolor="#f0f0f0", fontname="Arial"];
edge [fontname="Arial"];
// nodes
source_files [label="Local Files\n(./{hostkeys}/process)", shape=folder, fillcolor="#ffdead"];
gdrive_api [label="Google Drive API", shape=component, fillcolor="#ffffcc"];
get_folder [label="Get Folder Details"];
create_folder [label="Create Folder if Needed"];
upload_nodes [label="Upload Files\nMatching Node Pattern\nto './{hostkeys}/node'", shape=folder, fillcolor="#ffdead"];
upload_relationships [label="Upload Files\nMatching Relationship Pattern\nto './{hostkeys}/relationships'", shape=folder, fillcolor="#ffdead"];
process_done [label="Process Complete", shape=ellipse, fillcolor="#e0ffe0"];
// edges and labels
source_files -> gdrive_api [label="Connect via API"];
gdrive_api -> get_folder;
get_folder -> create_folder;
create_folder -> upload_nodes;
create_folder -> upload_relationships;
upload_nodes -> process_done;
upload_relationships -> process_done;
}
```
File storage and directories are controlled via Config. I created a publicly shared folder in Google drive which contains all project csvs:
```
- root: Google Drive folder
- hostkeys (automatically created, unless override)
- nodes
- relationships
```
data:image/s3,"s3://crabby-images/d3746/d374618bd0570a27496e46177d0d919f80d292f1" alt="Screenshot of Google Drive folder and files for MSc Data Science (INB112): note that files were created by Google API"