>')
```
Next, you can run `eppo_database_download` function that will download (by default to your **working directory**, which can be override with `filepath` argument) archive with the *SQLite* file. If you are on **Linux** operating system, file will be extracted into your working directory (or other directory provided in `filepath` argument). On **Windows** you will be asked to extract the database file manually.
```{r eval = FALSE}
eppo_database_download()
```
Last step of setup is to connect to database file, which can be easily done with `eppo_database_connect` function.
```{r eval = F}
eppo_SQLite <- eppo_database_connect()
```
With this three short steps you are ready to go.
## Extracting tables with scientific, common and synonym names
Currently searching for pest names supports scientific names, synonyms and common names. By default search will work with partial names -- e.g. when you query for *Cydia packardi* you get information related to this species, while when you query for *Cydia* you get information on whole genus. Likewise, when you search for *Droso* you will get information on all species that contain *Droso* in their names. Moreover you can pass whole vector of terms in one shot, e.g. `c('Xylella', 'Cydia packardi', 'Droso')`.
```{r eval = FALSE}
### Create vector of names that you are looking for
pests_query <- c('Cydia', "Triticum aestivum", "abcdef", "cadang")
```
Than you should start with querying for names and assigning your results to a variable. This variable will contain `eppocodes` that are used by other functions to extract data from **EPPO REST API**. `eppo_names_tables` takes two arguments: first is a vector of names to query the database, second is variable with connection to *SQLite* database.
```{r eval = FALSE}
pest_names <- eppo_names_tables(pests_query, eppo_SQLite)
names(pest_names)
### names that exist in database
head(pest_example[[1]], 5)
### names that were not found
head(pest_example[[2]], 5)
### preferred names for eppocodes from first table
head(pest_example[[3]], 5)
### all names that are associated to eppocodes from first data frame
head(pest_example[[4]], 5)
```
```{r eval = TRUE, echo = FALSE}
names_example <- readRDS("vignette_mock_names.RDS")
names(names_example)
names_example[[1]] %>%
head(5) %>%
knitr::kable(caption = "Names that exist in database", format = "html",
table.attr = "style='width:80%;'")
names_example[[3]] %>%
head(5) %>%
knitr::kable(caption = 'Preferred names for eppocodes from first table',format = "html",
table.attr = "style='width:80%;'")
names_example[[4]] %>%
head(5) %>%
knitr::kable(caption = 'All names that are associated to eppocodes from first data frame',format = "html",
table.attr = "style='width:80%;'")
```
As a result you will get `list` containing 3 `data.frames` and `vector`:
* `exist_in_DB` -- `data.frame` with names that are present in **EPPO Data Services**;
* `not_in_DB` -- `vector` with names that are not present in database;
* `pref_names` -- `data.frame` with preferred names and `eppocodes`,
* `all_associated_names` -- `data.frame` with all associated names to `eppocodes` from third `data.frame`.
**REMEMBER:** Other `eppo_tabletools_` functions use results of this function or *raw* eppocodes to access data from **EPPO Global Database** and **EPPO Data Services**.
## `eppo_tabletools_` functions to extract categorization, hosts, taxonomy, distribution and pests
This functions works separately from each other, thus there is no need to use all of them. There is no need to use them in any particular order. Functions for categorization, hosts taxonomy and pests takes two arguments:
* `names_table` -- variable containing result of `eppo_names_tables`;
* `token` -- variable created with `create_eppo_token`
**OR** three arguments:
* `token` -- same as above;
* `raw_eppocodes` -- character vector of eppocodes (e.q. `c("XYLEFA", "ABIAL"));
* `use_raw_codes` -- logical set to TRUE.
### Categorization of pests
As result `eppo_tabletools_cat` you will get `list` with two elements:
* `data.frame` with categorization tables
* `data.frame` with categorization for each eppocode condensed to single cell.
```{r eval = FALSE}
pests_cat <- eppo_tabletools_cat(pest_names, eppo_token)
### long format table
head(pests_cat[[1]], 5)
### comapct table with information for each eppocode condensed into one cell
head(pests_cat[[2]],5)
```
```{r eval = TRUE, echo = FALSE}
cat_example <- readRDS("vignette_mock_cat.RDS")
cat_example[[1]] %>%
head(5) %>%
knitr::kable(caption = "Long format table with pests categorization")
cat_example[[2]] %>%
head(5) %>%
knitr::kable(caption = "Compact table with condensed information on categorization")
```
If you will to limit the data received from **EPPO Data Services**, *and you are confident that you know exactly what you are looking for*, you can use eppocodes directly.
```{r eval = FALSE}
pests_cat <- eppo_tabletools_cat(token = eppo_token,
raw_eppocodes = c("LASPPA", "TRZAX", "CCCVD0"),
use_raw_codes = TRUE)
pest_cat[[2]]
```
```{r eval = TRUE, echo = FALSE}
cat_raw_example <- readRDS("vignette_raw_cat.RDS")
cat_raw_example[[2]] %>%
knitr::kable(caption = "Limited results of using eppocodes LASPPA, TRZAX, CCCVD0")
```
### Hosts of pests
`eppo_tabletools_hosts` as a result returns a `list` of two `data.frame`:
* long table format with all the data, for all pests combined;
* with hosts are combined into single cell for each eppocode.
```{r eval = FALSE}
pests_hosts <- eppo_tabletools_hosts(pest_names, eppo_token)
head(pests_hosts[[1]], 5)
head(pests_hosts[[2]], 5)
```
```{r eval = TRUE, echo = FALSE}
hosts_example <- readRDS("vignette_mock_hosts.RDS")
hosts_example[[1]] %>%
head(5) %>%
knitr::kable(caption = "Long format table with pests hosts")
hosts_example[[2]] %>%
head(5) %>%
knitr::kable(caption = "Compact table with condensed information on hosts")
```
### Taxonomy of Pests and hosts
`eppo_tabletools_taxo` as other functions from this family returns a `list` with two `data.frame`:
* fist is a long format taxonomy table;
* second is table with *main category* of each eppocode.
Suppose, that from previous name query we are interested only in viroids and viruses. As they usually have a *viroid* or *virus* phrase in their name, we can simply limit the query to certain eppocodes.
```{r eval = FALSE}
virs_eppocodes <- pest_names$all_associated_names %>%
dplyr::filter(grepl("viroid", fullname) | grepl("virus", fullname)) %>%
.[,5] %>% ## eppocodes are in 5th column
unique()
```
We can now pass `virs_eppocodes` as `raw_eppocodes` argument, and in consequence receive taxonomy of viroids and viruses only.
```{r eval = FALSE}
virs_taxonomy <- eppo_tabletools_taxo(token = eppo_token,
raw_eppocodes = virs_eppocodes,
use_raw_codes = TRUE)
virs_taxonomy$long_table ## you can also access list elements by their names
virs_taxonomy$compact_table
```
```{r eval = TRUE, echo = FALSE}
example_taxo <- readRDS("vignette_viroid_taxo.RDS")
example_taxo$long_table %>%
knitr::kable(caption = "Long table of viruses and viroids taxonomy")
example_taxo$compact_table %>%
knitr::kable(caption = "Compact table of viruses and viroids taxonomy")
```
### Pests of hosts
It is possible to obtain data on pests of particular hosts with function `eppo_tabletools_pests`. Lets say we want to know all the pests associated with *Abies alba* (eppocode: **ABIAL**).
```{r eval = FALSE}
abies_pests <- eppo_tabletools_pests(token = eppo_token,
raw_eppocodes = "ABIAL",
use_raw_codes = TRUE)
head(abies_pests[[1]], 5)
head(abies_pests[[2]], 5)
```
```{r eval = TRUE, echo = FALSE}
example_pests <- readRDS("vignette_mock_pests.RDS")
example_pests$long_table %>%
head(5) %>%
knitr::kable(caption = "Long table of Abies alba pests")
example_pests$compact_table %>%
head(5) %>%
knitr::kable(caption = "Compact table of Abies alba pests")
```
### Distribution of pests
`eppo_tabletools_distri` does not connect to REST API, but it downloads information from *csv* files directly from **EPPO Global Database**. As a consequence there is no `token` argument (since it does not need the EPPO token) -- a variable containing result of `eppo_names_tables`.
The function returns a two element `list`:
* first one contains `dataframe` with distribution for organism/virus, including *invalid records and eradicated status*;
* second contains single cell with condensed distribution for each eppocode, however including only *present* status.
```{r eval = FALSE}
pest_distri <- eppo_tabletools_distri(pest_names)
head(pestr_distri[[1]], 5)
head(pestr_distri[[2]], 5)
```
```{r evel = TRUE, echo = FALSE}
example_distri <- readRDS("vignette_mock_distri.RDS")
example_distri$long_table %>%
head(5) %>%
knitr::kable(caption = "Long table with distribution of pests")
example_distri$compact_table %>%
head(5) %>%
knitr::kable(caption = "Compact table with distribution of pests")
```
## Whole condensed table in one shot (currently does not include `eppo_tabletools_pests`):
Last, but not least, package offers a simple wrapper over above mentioned functions. If you want to make one table with all the informations: names, categorization, hosts, distribution and taxonomy -- condensed to **one cell per pest**, please use `eppo_table_full` function that takes arguments:
* `names vector` -- a character vector of pests/hosts names;
* `sqlConnection` -- a variable for *SQLite* connection (result of `eppo_database_connect`);
* `token` -- an variable storing *EPPO token* (`eppo_token`).
```{r eval = FALSE}
eppo_fulltable <- eppo_table_full(c("Meloidogyne ethiopica", "Crataegus mexicana"),
eppo_SQLite,
eppo_token)
eppo_fulltable
```
```{r eval = TRUE, echo = FALSE}
example_full_table <- readRDS("vignette_mock_full_table.RDS")
example_full_table %>%
knitr::kable(caption = "Full table with names, hosts, distribution, taxonomy and categorizatio of pests")
```