post: izw joins openapc
cbroschinski committed Jan 17, 2024
1 parent 5c872b6 commit 3a7d206
Showing 4 changed files with 147 additions and 0 deletions.
97 changes: 97 additions & 0 deletions Rmd/2024-01-17-izw.Rmd
@@ -0,0 +1,97 @@
---
layout: post
author: Christoph Broschinski
author_lnk: https://github.com/cbroschinski
title: IZW joins OpenAPC
date: 2024-01-17 07:00:00
summary:
categories: [general, openAPC]
comments: true
---


```{r, echo = FALSE}
knitr::opts_knit$set(base.url = "/")
knitr::opts_chunk$set(
comment = "#>",
collapse = TRUE,
warning = FALSE,
message = FALSE,
echo = FALSE,
fig.width = 9,
fig.height = 6
)
options(scipen = 1, digits = 2)
knitr::knit_hooks$set(inline = function(x) {
prettyNum(x, big.mark=",")
})
```
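
The inline hook set in the chunk above decides how every inline `r` expression in the post is rendered. A minimal sketch of its effect, assuming a made-up total of 43228.4 (the name `fmt_inline` and the input value are illustrative, not part of the OpenAPC code):

```r
# Same formatter the setup chunk registers as the knitr inline hook.
fmt_inline <- function(x) prettyNum(x, big.mark = ",")

# Apply the same global options as the setup chunk, remembering the old ones.
old <- options(scipen = 1, digits = 2)

# Hypothetical euro amount; with digits = 2 the decimals are dropped
# before the thousands separator is inserted.
total <- fmt_inline(43228.4)
print(total)  # "43,228"

# Restore the previous options so the sketch has no side effects.
options(old)
```

Because `digits = 2` only sets a minimum number of significant digits, large amounts are shown as whole euros rather than with two decimals.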

The Forschungszentrum Jülich collects publication cost data from German institutions as part of the DFG programme "[Open Access Publication Funding](https://www.fz-juelich.de/en/zb/open-science/open-access/monitoring-dfg-oa-publication-funding)".

When reporting its publication costs, the [Leibniz-Institut für Zoo- und Wildtierforschung (IZW)](https://www.izw-berlin.de/de/start.html) (Leibniz Institute for Zoo and Wildlife Research) agreed to share the data with OpenAPC as well. This data has now been transferred and integrated, marking the institution's first participation in OpenAPC.


## Cost data

```{r, cache.lazy = TRUE}
#' Download the OpenAPC spreadsheet from GitHub (requires curl to be installed)
download_apc <- function(path = NULL, dir = "tmp", file = "apc_de.csv"){
if(is.null(path)) {
path <- "https://raw.githubusercontent.com/OpenAPC/openapc-de/master/data/apc_de.csv"
}
# Do not warn if the target directory already exists
dir.create(dir, showWarnings = FALSE)
destfile <- file.path(dir, file)
download.file(url = path, destfile = destfile, method = "curl")
read.csv(destfile, header = TRUE, sep = ",")
}
my.apc <- download_apc()
my.apc <- my.apc[my.apc$institution == "Leibniz-IZW",]
my.apc <- droplevels(my.apc)
```

The initial data set provided by the IZW covers publication fees for `r format(nrow(my.apc), big.mark =",")` articles. Total expenditure amounts to `r sum(my.apc$euro)`€ and the average fee is `r sum(my.apc$euro)/nrow(my.apc)`€.

```{r}
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
my.apc$publisher <- factor(my.apc$publisher, levels = d_frame$Publisher)
df.summary <-cbind(tapply(my.apc$euro, my.apc$publisher, length),
tapply(my.apc$euro, my.apc$publisher, sum),
tapply(my.apc$euro, my.apc$publisher, mean))
colnames(df.summary) <- c("Articles", "Fees paid in EURO", "Mean Fee paid")
knitr::kable(as.data.frame(df.summary), digits = 2)
```

## Overview

A detailed analysis of the contributed data provides the following overview:

### Fees paid per publisher (in EURO)

```{r tree_izw_2024_01_17_full}
tt <- aggregate(my.apc$euro, by = list(my.apc$publisher), sum)
colnames(tt) <- c("Publisher", "Euro")
treemap::treemap(tt, index = c("Publisher"), vSize = "Euro", palette = "Paired")
```

### Average costs per publisher (in EURO)

```{r box_izw_2024_01_17_publisher_full, echo = FALSE, message = FALSE}
library(ggplot2)
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
# Keep only publishers with at least one article
publishers = as.character(d_frame$Publisher[d_frame$Freq > 0])
my.apc_reduced = my.apc[my.apc$publisher %in% publishers,]
q <- ggplot(my.apc_reduced, aes(publisher, euro)) + geom_boxplot() + geom_point()
q <- q + theme_bw(base_size = 18) + coord_flip()
q + xlab("Publisher") + ylab("APC")
```
50 changes: 50 additions & 0 deletions _posts/2024-01-17-izw.md
@@ -0,0 +1,50 @@
---
layout: post
author: Christoph Broschinski
author_lnk: https://github.com/cbroschinski
title: IZW joins OpenAPC
date: 2024-01-17 07:00:00
summary:
categories: [general, openAPC]
comments: true
---




The Forschungszentrum Jülich collects publication cost data from German institutions as part of the DFG programme "[Open Access Publication Funding](https://www.fz-juelich.de/en/zb/open-science/open-access/monitoring-dfg-oa-publication-funding)".

When reporting its publication costs, the [Leibniz-Institut für Zoo- und Wildtierforschung (IZW)](https://www.izw-berlin.de/de/start.html) (Leibniz Institute for Zoo and Wildlife Research) agreed to share the data with OpenAPC as well. This data has now been transferred and integrated, marking the institution's first participation in OpenAPC.


## Cost data



The initial data set provided by the IZW covers publication fees for 23 articles. Total expenditure amounts to 43,228€ and the average fee is 1,879€.



| | Articles| Fees paid in EURO| Mean Fee paid|
|:-----------------------------|--------:|-----------------:|-------------:|
|MDPI AG | 5| 6946| 1389|
|Wiley-Blackwell | 5| 7673| 1535|
|Elsevier BV | 4| 8874| 2218|
|Frontiers Media SA | 4| 8320| 2080|
|Springer Nature | 3| 9581| 3194|
|Oxford University Press (OUP) | 1| 560| 560|
|The Royal Society | 1| 1275| 1275|
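
As a quick cross-check, the table above can be re-entered by hand and summed. This is only a sketch: the data frame `izw` and its column names are illustrative and not part of the OpenAPC code, and the values are the rounded figures printed in the table, not the underlying raw data.

```r
# Values copied by hand from the rendered table (whole euros as printed).
izw <- data.frame(
  publisher = c("MDPI AG", "Wiley-Blackwell", "Elsevier BV",
                "Frontiers Media SA", "Springer Nature",
                "Oxford University Press (OUP)", "The Royal Society"),
  articles  = c(5, 5, 4, 4, 3, 1, 1),
  euro      = c(6946, 7673, 8874, 8320, 9581, 560, 1275)
)

# Totals across all publishers.
total_articles <- sum(izw$articles)
total_euro     <- sum(izw$euro)

# Average fee per article, computed from the rounded totals.
mean_fee <- total_euro / total_articles

print(total_articles)  # 23
print(total_euro)      # 43229
print(mean_fee)        # about 1879.5
```

The rounded per-publisher sums add up to 43,229€, one euro above the reported total of 43,228€. A gap of that size is what one would expect from rounding each row to whole euros.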



## Overview

A detailed analysis of the contributed data provides the following overview:

### Fees paid per publisher (in EURO)

![plot of chunk tree_izw_2024_01_17_full](/figure/tree_izw_2024_01_17_full-1.png)

### Average costs per publisher (in EURO)

![plot of chunk box_izw_2024_01_17_publisher_full](/figure/box_izw_2024_01_17_publisher_full-1.png)
Binary file added figure/box_izw_2024_01_17_publisher_full-1.png
Binary file added figure/tree_izw_2024_01_17_full-1.png
