Skip to content

Commit

Permalink
Add configuration for parsing Wehe data (#421)
Browse files Browse the repository at this point in the history
* Add wehe parsing for scamper1 & annotation types
* Restore public archive source in staging
  • Loading branch information
stephen-soltesz committed Apr 6, 2023
1 parent 27ccc01 commit df863f9
Show file tree
Hide file tree
Showing 2 changed files with 26 additions and 3 deletions.
5 changes: 2 additions & 3 deletions apply-cluster.sh
Original file line number Diff line number Diff line change
Expand Up @@ -18,10 +18,9 @@ CLUSTER=${CLOUDSDK_CONTAINER_CLUSTER:?Please provide cluster name: $USAGE}
DATE_SKIP=${DATE_SKIP:-"0"} # Number of dates to skip between each processed date (for sandbox).
TASK_FILE_SKIP=${TASK_FILE_SKIP:-"0"} # Number of files to skip between each processed file (for sandbox).

# Use sandbox in sandbox, staging in staging, measurement-lab in oti.
# Use sandbox in sandbox, measurement-lab in staging & oti.
SOURCE_PROJECT=${PROJECT_ID/mlab-oti/measurement-lab}
# TODO(soltesz): restore or remove.
#SOURCE_PROJECT=${SOURCE_PROJECT/mlab-staging/measurement-lab}
SOURCE_PROJECT=${SOURCE_PROJECT/mlab-staging/measurement-lab}
sed -i \
-e 's/{{ANNOTATION_SOURCE_PROJECT}}/'${SOURCE_PROJECT}'/g' \
config/config.yml
Expand Down
24 changes: 24 additions & 0 deletions config/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,7 @@ monitor:
polling_interval: 1m
sources:
# NOTE: It now matters what order these are in.
## NDT
- bucket: archive-{{ANNOTATION_SOURCE_PROJECT}}
experiment: ndt
datatype: annotation2
Expand Down Expand Up @@ -66,3 +67,26 @@ sources:
raw: raw_ndt
join: ndt
daily_only: true
## WEHE
- bucket: archive-{{ANNOTATION_SOURCE_PROJECT}}
experiment: wehe
datatype: annotation2
target_datasets:
tmp: tmp_wehe
raw: raw_wehe
daily_only: true
- bucket: archive-{{ANNOTATION_SOURCE_PROJECT}}
experiment: wehe
datatype: hopannotation2
target_datasets:
tmp: tmp_wehe
raw: raw_wehe
daily_only: true
- bucket: archive-{{ANNOTATION_SOURCE_PROJECT}}
experiment: wehe
datatype: scamper1
target_datasets:
tmp: tmp_wehe
raw: raw_wehe
join: wehe
daily_only: true

0 comments on commit df863f9

Please sign in to comment.