---
output: rmarkdown::github_document
---
# cdx
Query Web Archive Crawl Indexes ('CDX')
## Description
Methods are provided to retrieve web archive crawl index ('CDX') metadata and directly query the 'CDX' 'API' endpoint to retrieve mementos for a given set of parameters.
## What's Inside The Tin
The following functions are implemented:
- `cdx_query`: Query a CDX index endpoint
- `fetch_collections_index`: Fetch collections index
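
For orientation, a CDX index query is ultimately an HTTP GET against an endpoint URL with the lookup pattern passed as a query parameter. The sketch below shows roughly what such a request URL looks like. It is not how `cdx_query` is implemented: `build_cdx_url` is a hypothetical helper, the endpoint is a placeholder, and the `url` and `output` parameter names are assumptions based on common CDX server conventions.

```r
# Hypothetical helper (NOT part of the cdx package): assembles a CDX-style
# query URL from an endpoint and a URL pattern.
# Assumption: the endpoint accepts `url` and `output` query parameters.
build_cdx_url <- function(endpoint, url_pattern, output = "json") {
  params <- list(url = url_pattern, output = output)

  # Percent-encode each value so wildcards such as "*" are safe in a query string
  encoded <- vapply(
    params,
    function(p) utils::URLencode(as.character(p), reserved = TRUE),
    character(1)
  )

  # Join as key=value pairs separated by "&"
  query <- paste0(names(params), "=", encoded, collapse = "&")
  paste0(endpoint, "?", query)
}

# Placeholder endpoint; in practice one would come from the collections index
build_cdx_url("https://example.org/placeholder-index", "*.r-project.org")
```

In normal use there should be no need to build URLs by hand; the package functions listed above handle the request and response.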
## Installation
```{r eval=FALSE}
devtools::install_github("hrbrmstr/cdx")
```
```{r message=FALSE, warning=FALSE, error=FALSE, include=FALSE}
options(width=120)
```
## Usage
```{r message=FALSE, warning=FALSE, error=FALSE}
library(cdx)
library(tidyverse)
# current version
packageVersion("cdx")
```
### Example
```{r message=FALSE, warning=FALSE, error=FALSE}
cidx <- fetch_collections_index()
rprj <- cdx_query(cidx$cdx_api[1], "*.r-project.org")
rprj
```