-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy path07-filtering-flowdata.qmd
159 lines (80 loc) · 6.59 KB
/
07-filtering-flowdata.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
# Filtering flowdata
This chapter concerns the filtering procedures for flowdata in order to reduce the graphic complexity of the flow map - the so-called *spaghetti effect*.
Filtering procedures are also important to select useful information: links and/or nodes to obtain an interesting flowmap.
Without filtering, the flowmap patterns reveals a *spaghetti-effect*. See example
*EXAMPLE:* RICardo international trade flows like a dish of spaghetti.
![](images/RICardo_spaghetti.png){fig-align="left"}
The selected information shown is described in percent on the bottom of the right panel. Here, all the information is represented :100% of the links,100% of the nodes and 100% of the total information).
Filtering can be performed in *Arabesque* on all variables describing the nodes and/or links.
## Available filtering procedures
### Filtering possibilities
Filtering is available on numerical, temporal and categorical variables.
#### Numerical variables
Numerical filtering applies to quantitative (absolute, continuous or pseudo-continuous) dat is done either visually with a brush on an interactive histogram or bar chart. It is also possible to indicate a threshold.
#### Temporal variables
The filtering possibilities of the temporal matrices are currently only available for links. They concern the choice of a date or a period, in a numerical or graphical form (slider). Three date formats are available (string, hours, timestamp).
#### Categorical variables
Categorical filtering is applied to nominal qualitative data available in node and/or link datasets. Three possibilities are offered, graphically and numerically:
-- **Selection of multiple variables**: the user chooses one or more of the available categorical variables (e.g., prefectures and sub-prefectures if a variable specifying the administrative profile of French cities is available) or the type of links (e.g., import or export for trade data);
-- **Selection of a single variable**: the user chooses a single variable (for the nodes: for example, the flows from and to France for mobility data; the import type flows for trade data);
-- **Selection by deletion of a single variable**: the user chooses one or more variables relating to the nodes and links to be deleted. If the deletion selection concerns nodes, all the corresponding links (for example, the Ile-de-France region for an analysis at the national level) are deleted.
### Implementing filtering
The *Arabesque* filtering possibilities depend on the type of variable/procedure of filtering, as well as the corresponding graph for visual implementation.
Three main possibilities are proposed
-- (1) visual filtering using modular\* **selection area (brush)** applied on an histogram or an bar chart (see above) ;
-- (2) **numerical thresholding** by indicating a threshold ;
![](images/schema_histogram.png)
![](images/schema_BarChart.png)
-- (3) visual filtering **using a slider** (hereby, on links)
![](images/schema_Slider_temp.png)
## Filtering by links
![](images/icon_links_filtering.png) Icon indicating the existence of a filtering possibility applied on the links.
### Add filter
![](images/Add_filter.PNG)
First, it is to click on Add filter on the bottom of the right panel, then on Link in the opened window.
![](images/Add_filter_links1.png)
The most common filtering consists in thresholding on the flow values (the volume variable declared when importing the data)
![](images/Add_filter_links_volume.png)
For links, the possibility of filtering according to the distance traveled is also proposed.
It is possible to choose the variable - or declare/type its format in order to draw an histogram for example.
### Choose the format of the variable to be filtered
You must declare the type of data so that the corresponding graph can be displayed to allow visual thresholding.
![](images/Add_filter_links_volume_continuous.png)
**Categorial** is for qualitative categorical variables
**Continuous** is for filtering quantitative variable with the slider on a measure or on an interval of values.
**Discrete** is for selecting one or more modality/measure relating to links (and the corresponding nodes, and reversly) from be flowmap.
**Categorical** is for filtering on a single/several modality of a qualitative variable
Above, the volume is declare as numeral for plotting the resulting map with RICardo data set:
![](images/Add_links_filter_volume_histogramme.png)
![](images/RICardo_filter_links_continuous.png){fig-align="left"}
The selected information shown is described in percent on the bottom of the panel. Here only 2,6% of the links and 15% of the nodes represents 56% of the total volume of trade flows.
The minimum and maximum values are automatically shown on the histogram. It is possible to change these values by brushing on the histogram.
### Multi variables filtering
Several filter can be applied on links. Hereby, we add a second filter on the distance variable.
![](images/Add_filter_links_dist.png)
The two filter will then be available in the right panel (hereby, volume and distance)
![](images/Add_links_filter_vol_distance.png)
Only flows that have traveled less than 4900 km are depicted.
### Local filter of links: by origin or destination
It is also possible to filter the links by origin or by destination.
![](images/Add_links_filter_origin.png)
Then, it is a matter of specifying a type, here categorical, to display the flows issued from a country.
![](images/Add_filter_links_origin_categorial.png)
You must then choose the country in the list or enter the name of the selected one.
![](images/Add_filter_node_category_select.png){fig-align="left"}
Here we entered France.
![](images/Add_filter_links_origin_categorial_SelectFrance.png){fig-align="left"}
In order to display all flows from France, the (possible) other active filters have been set to zero.
![](images/Add_links_filter_origin_France.png)
*NOTE*: in order to be able to see all the flows from one place of origin (or destination), it is necessary to set the previous filters to zero (but it is not necessary to delete them).
### Filter by a categorical variable
It is possible to select several categorical variables (ie several countries of origin).
![](images/Add_filter_links_category.png)
### Filter by a temporal variable
The filtering of temporal data considers different field formats: date, hours and time stamp.
![](images/Add_filter_links_temporal.png)
### Filter by a Remove
## Filtering by nodes
First, it is to click on Add filter, then on Node.
![](images/icon_nodes_filtering.png) Icon indicating the existence of a filtering possibility applied on the nodes.
![](images/Add_filter_nodes1.png)