
LAPD ML project - Predicting GHI from meteo data and webcam images

Welcome to the LAPD GHI prediction repository! As societies worldwide strive to embrace sustainable energy, Switzerland stands out with a remarkable increase in photovoltaic power generation. In 2022, photovoltaic installations alone contributed a substantial 3858 GWh, a significant leap from 299 GWh in 2012 (source: Federal Office of Energy Statistics, 2023). This surge in solar energy production underscores its growing impact on the Swiss power grid.

In response to the dynamic nature of solar power, energy companies face the challenge of predicting and optimizing grid performance. Traditional methods rely on meteorological companies that employ satellite imagery and complex algorithms to forecast Global Horizontal Irradiance (GHI), a key metric tied to solar panel performance. However, existing predictions often lack spatial resolution for small areas and may not be highly accurate within specific timeframes (source: Solcast).

This GitHub repository introduces a machine learning solution to address these challenges. Our goal is to develop an algorithm capable of generating precise local predictions of GHI two hours into the future. Leveraging meteorological data (wind, current GHI, date, etc.) and webcam images from two EPFL campus cameras, we try to enhance the accuracy of highly local short-term GHI predictions. This project was proposed by the Laboratory of Applied Photonics Devices (source: LAPD) and is part of the ML4Science initiative of the Machine Learning course (CS-433) at EPFL.

Feel free to explore the code, datasets, and documentation to gain insights into our machine learning approach for accurate GHI predictions.

GitHub organization


models

Each model has its own folder containing the Python notebook for the model, the saved best model (.pt), and the saved results on the validation sets (.pkl).

  • Model A_a:
    This folder contains the code used to make model A_a. For reproducibility purposes, you can run the code, which loads the saved model (model1_meteo_after_LSTM2.pt). The images included in this folder show the predictions made with this model on the validation sets.

  • Model A_b:
    This folder contains the code used to make model A_b, as well as the saved model (.pt) and the saved predictions (.pkl).

  • Model B:
    This folder contains the code used to make model B, as well as the saved model (.pt) and the saved predictions (.pkl).

  • Model C:
    This folder contains the code used to make model C, as well as the saved model (.pt) and the saved predictions (.pkl). It also contains some extra files, since preprocessing had to be done for this model: the raw data, the preprocessed data, the means and standard deviations, and the median/Q1/Q3 values, along with a helper.py and the corresponding preprocessing notebook.

  • Model D:
    This folder contains the code used to make model D, as well as the saved model (.pt) and the saved predictions (.pkl).

  • Saved models and comparison:
    This folder contains pickle files with the predictions made by each model. It also contains a notebook (result.ipynb) in which the different predictions are plotted together for analysis/comparison purposes (an illustrative sketch of such a comparison follows this list).

  • Schematic:
    This folder contains PowerPoint files with schematics of our network architectures.
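As an illustration of what the comparison in result.ipynb does, the sketch below overlays the saved validation predictions of several models. The pickle file names, their contents (assumed here to be 1-D arrays of predicted GHI values), and the model list are assumptions, not the notebook's actual code:

    import pickle
    import matplotlib.pyplot as plt

    # Hypothetical file names: one pickle of validation predictions per model.
    prediction_files = {'A_a': 'model_A_a.pkl', 'B': 'model_B.pkl', 'C': 'model_C.pkl'}

    plt.figure(figsize=(10, 4))
    for name, path in prediction_files.items():
        with open(path, 'rb') as f:
            preds = pickle.load(f)  # assumed: a 1-D array of predicted GHI values
        plt.plot(preds, label='Model ' + name)
    plt.xlabel('Validation sample')
    plt.ylabel('Predicted GHI (W/m²)')
    plt.legend()
    plt.tight_layout()
    plt.show()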

visualisation

  • Visualisation:
    This folder contains the notebook used to produce the visualisations, along with the saved images.

custom features

  • Cloud detector:
    This folder contains a notebook with various functions defined to detect clouds (an illustrative sketch of one common approach follows).
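The notebook's own functions are not reproduced here. As an illustration of one common approach to cloud detection in sky images, the sketch below thresholds the red-to-blue ratio of each pixel (clouds scatter red and blue roughly equally, while clear sky is much bluer); the function name and threshold value are assumptions:

    import numpy as np
    from PIL import Image

    def cloud_mask(image_path, threshold=0.75):
        """Boolean mask where True marks likely cloud pixels (red/blue heuristic)."""
        rgb = np.asarray(Image.open(image_path).convert('RGB'), dtype=np.float32)
        red, blue = rgb[..., 0], rgb[..., 2]
        ratio = red / np.maximum(blue, 1.0)  # guard against division by zero
        return ratio > threshold

    # Example: estimate cloud coverage as the fraction of cloudy pixels.
    # coverage = cloud_mask('webcam.jpg').mean()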

original code

  • Original_Code_Keras.ipynb:
    This is the original code given to us by the lab as a basis.

Dataset

Due to its size (especially the images), we do not provide the full dataset directly on GitHub. There are two datasets:

  • meteo-only dataset: can be found on this GitHub page, in the Model C folder.
  • meteo + images dataset: provided in a different way, as it contains images and is therefore too heavy for GitHub. The dataset is available at this link.

Running the code

We recommend running this code on Google Colab to avoid having to change the data-import code too much. When running each file, make sure all the data is at the same level and that the path to the data is correct (preferably on Google Drive). This means that to run, e.g., model B, all the dataset files must be in a folder along with model_B.pt for the import to work. The path that you should initialize is 'drive/MyDrive/<your-path-in-your-google-drive>' and can be changed at the beginning of each notebook.
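For reference, a minimal sketch of this setup as it would appear in the first cells of a notebook; the variable name DATA_PATH is an assumption (each notebook defines its own path variable):

    from google.colab import drive

    # Mount your Google Drive so the notebook can read the uploaded data.
    drive.mount('/content/drive')

    # Adjust to wherever you uploaded the repository content in your Drive.
    DATA_PATH = 'drive/MyDrive/<your-path-in-your-google-drive>'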

To run a file:

  1. Open Google Colab and import the targeted notebook.
  2. Open your Drive and import the dataset into a "data" folder. Upload the .pt and the .pkl corresponding to the notebook/model into the same folder (more generally, upload the content of the git folder into the Drive folder).
  3. Open the notebook and update the path to the data.
  4. Make sure you meet all the requirements in requirements.txt.
  5. If you don't want to retrain the model, run all the cells before the training, run the model-loading cells below the training, and run the validation cells to obtain the results (a sketch of this loading step follows).
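For step 5, a minimal sketch of loading a saved model and its stored predictions. The placeholder architecture, DATA_PATH, and the .pkl file name are assumptions; each notebook defines the real network and paths before this point:

    import pickle
    import torch
    import torch.nn as nn

    DATA_PATH = 'drive/MyDrive/data'  # adjust to your Drive layout

    # Placeholder: replace with the architecture the notebook actually builds,
    # so that the saved state dict matches the network.
    model = nn.Sequential(nn.Linear(8, 32), nn.ReLU(), nn.Linear(32, 1))

    # Load the saved weights instead of retraining (map_location keeps it CPU-safe).
    model.load_state_dict(torch.load(DATA_PATH + '/model_B.pt', map_location='cpu'))
    model.eval()  # switch off training-time behaviour such as dropout

    # Load the saved validation predictions for comparison.
    with open(DATA_PATH + '/model_B.pkl', 'rb') as f:
        saved_predictions = pickle.load(f)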
