Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Web Scraper for Svelte Jobs #38

Open
vnphanquang opened this issue Mar 7, 2023 · 0 comments
Open

Web Scraper for Svelte Jobs #38

vnphanquang opened this issue Mar 7, 2023 · 0 comments
Labels
domain:backend type:feature new stuff type:setup bootstrap & preparation
Milestone

Comments

@vnphanquang
Copy link
Member

vnphanquang commented Mar 7, 2023

Context

Our project and community is non-profit. In our job board, we don't have a pay-to-post model but rather use a web scraper to collect jobs from various sites.

Criteria:

  • Svelte related jobs,
  • Location specific, Vietnam centric, expanding to ASEAN,

Site to scrap:

Implementation

We should probably use python for this. A couple of questions to consider & plan:

  • Where do we host the python source code? In another repo? In this repo as a monorepo?

  • Where to store the data? plain json files? cloud-based (preferred) like supabase or upslash? But maybe more lightweight, some alternative cloud service that provides SQLite?

    -> Let's stick to supabase because it's already good with lots of battery included support like real time subscription (maybe we can expand to do cool stuff like realtime toast for new comer from discord, new sponsor, new mail list registration?)

  • Which format (schema) to store this data as?

  • How to run this scraper? As a vercel cron job perhaps?

  • How to handle language specific texts (because our site is bilingual)?

  • How to bypass rate-restricted & blockers?

@vnphanquang vnphanquang self-assigned this Mar 7, 2023
@vnphanquang vnphanquang added type:feature new stuff type:setup bootstrap & preparation domain:backend labels Mar 7, 2023
@vnphanquang vnphanquang added this to the Lạc Việt milestone Mar 7, 2023
@vnphanquang vnphanquang changed the title Web Scraper for Svelte Jobs in Vietnam Web Scraper for Svelte Jobs Aug 17, 2023
@vnphanquang vnphanquang removed their assignment Mar 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
domain:backend type:feature new stuff type:setup bootstrap & preparation
Projects
Status: Backlog
Development

No branches or pull requests

1 participant