ML Trail Library

Work in progress.

Lib to process results, make analysis of Trail running races and eventually build models to predict performances.

Installation

Install Poetry:

Linux, macOS, Windows (WSL)

curl -sSL https://install.python-poetry.org | python3 -

Windows (Powershell)

(Invoke-WebRequest -Uri https://install.python-poetry.org -UseBasicParsing).Content | py -

If you have installed Python through the Microsoft Store, replace py with python in the command above.

(More details, methods or problem solving on Poetry Installation Page.)

Execute the following command:

poetry install

Launch web app

Launch the following command:

streamlit run front/MLTrail.py

In some old macOS systems a special installation may be necessary to have streamlit working.

Run the following only if the above installation runs but the streamlit runcommand fails.

conda create -n mltrail -c conda-forge python=3.9 streamlit -y
conda activate mltrail
pip install pytest python-dotenv html5lib beautifulsoup4 lxml matplotlib numpy pandas pyarrow=="1.15.0"
poetry install --only-root

Download data from LiveTrail locally

Launch the following command:

python src/database/loader_LiveTrail/db_LiveTrail_loader.py

If nothing is modified, default folder is .data/, inside we will find the DB (events.db) and a folder csv/ containing all the downloaded results.

usage: db_LiveTrail_loader.py [-h] [-p PATH] [-d DATA_PATH] [-c] [-u]

Data loader from LiveTrail website into DB.

options:
  -h, --help            show this help message and exit
  -c, --clean           Remove all data from tables before execution.
  -u, --update          Download only events and reces not already present in DB.

If you want to skip races, create a text file parsed_races.txt containing one code and year per line wanting to be ignored, for example:

saintelyon 2018
saintelyon 2017
saintelyon 2016
saintelyon 2015
saintelyon 2014
saintelyon 2013
penyagolosa 2024
penyagolosa 2023
penyagolosa 2022
penyagolosa 2021
penyagolosa 2019
# lut 2016 -> parcours.php is empty
lut 2016
# oxfamtrailwalkerhk 2021 -> Password protected
oxfamtrailwalkerhk 2021

In order to recompute the Results table, the script src/database/loader_LiveTrail/CSV_to_DB_results.py can be used:

CSV_to_DB_results.py -c Will reload the full table from the data folder, emptying it first. CSV_to_DB_results.py -u races.txt -f Will force the computation and loading of the races events and years specified in races.txt from the data folder.

options:
  -h, --help            show this help message and exit
  -c, --clean           Remove all data from table before execution.
  -f, --force-update    Remove all data from specified tables in "years" before execution.
  -s PATH, --skip PATH
                        Filepath to list of events and years to ignore during update. db_LiveTrail_loader.py generates this list as update.txt
  -u JSON/PATH, --update JSON/PATH
                        dict in "years" format containing the list of events and years to update or path for the file containing the list.

NOTE: --update and --skip options cannot be used together.

Same syntax and options apply to To recompute the Timing_points table and script src/database/loader_LiveTrail/CSV_to_DB_timing_points.py can be used.

More details and visual example in the notebook examples/parse_LiveTrail_to_DB.ipynb

⚠️ Warning: Changing paths in scripts through the -p or data-path options is discouraged. Advanced users only.

Collaborating

Don't hesitate to get in contact or open an issue!

TO-DO list

Automatically parse LiveTrail data.
Add tests.
Start a simple Front-End
Add templates for manual (csv, json) results and control points
Improve Front-End
Integrate ML/AI predictors
Before 2014 link is different (e.g. https://livetrail.net/histo/ecotrail2013/ instead of https://livetrail.net/histo/ecotrail_2013/)

Scraping

BUG: for some races, control point code is not unique since it gets revisited in different laps (e.g. event 'tapalpa23', 2023, 'enigma')
Scraped timestamps are different (there is no day added and it gets back to 00:00:00 after 24h in race)
Get the name of the checkpoints from the website.
Set objective directly by time and not by position.
Get a list of available races in LiveTrail.
Rename Scraper to LiveTrail Scraper (others may come later)
Change camelCase style to snake_case style naming

Relational DB

ML/AI

Add inference points from models
Add modelling capabilities from own data, start simple (ensemble methods)
Generate training file with simple variables (dist_total, D_total, d_total, dist_segment, dist_cumul, D_segment, D_cumul, d_segment, d_cumul, time)
Research constrained methods (total_estimation = sum(sections_estimation))
Test unsupervised clustering models to generate a performance index (such as ITRA performance index, UTMB index, Niveau Betrail, etc.)
Choose different models in function of # of training samples

FrontEnd

Make rows in my results pages links to races' results
Make header in my results pages clickable (for sorting)
Add a switch button to tables between cumulative race time and time of the day(s)
Add normalized pace plot and add a switch button between it and regular pace one. --> try with st.container
Integrate printing version of times
Show race profile from distance, D+ and D- data? Maybe too aproximative and need real gps data
Objective graph is only paces, show times / normalised pace?
Bug when races include departure time in timing_points file
Add Warinings about prediction methods not being accurate, and that more data usually shows better results.

BackEnd

CI/CD

Create a CI
Contenarize
Create a CD Pipeline once contenarized
Add installation procedure

Name		Name	Last commit message	Last commit date
Latest commit History 284 Commits
.github/workflows		.github/workflows
data/csv/transgrancanaria		data/csv/transgrancanaria
examples		examples
front		front
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML Trail Library

Installation

Launch web app

Download data from LiveTrail locally

Collaborating

TO-DO list

Scraping

Relational DB

ML/AI

FrontEnd

BackEnd

CI/CD

About

Releases

Packages

Contributors 2

Languages

Victorivus/MLTrail

Folders and files

Latest commit

History

Repository files navigation

ML Trail Library

Installation

Launch web app

Download data from LiveTrail locally

Collaborating

TO-DO list

Scraping

Relational DB

ML/AI

FrontEnd

BackEnd

CI/CD

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages